Sample records for quality genomic dna

  1. Evaluation of whole genome amplified DNA to decrease material expenditure and increase quality.

    PubMed

    Bækvad-Hansen, Marie; Bybjerg-Grauholm, Jonas; Poulsen, Jesper B; Hansen, Christine S; Hougaard, David M; Hollegaard, Mads V

    2017-06-01

    The overall aim of this study is to evaluate whole genome amplification of DNA extracted from dried blood spot samples. We wish to explore ways of optimizing the amplification process, while decreasing the amount of input material and inherently the cost. Our primary focus of optimization is on the amount of input material, the amplification reaction volume, the number of replicates and amplification time and temperature. Increasing the quality of the amplified DNA and the subsequent results of array genotyping is a secondary aim of this project. This study is based on DNA extracted from dried blood spot samples. The extracted DNA was subsequently whole genome amplified using the REPLIg kit and genotyped on the PsychArray BeadChip (assessing > 570,000 SNPs genome wide). We used Genome Studio to evaluate the quality of the genotype data by call rates and log R ratios. The whole genome amplification process is robust and does not vary between replicates. Altering amplification time, temperature or number of replicates did not affect our results. We found that spot size i.e. amount of input material could be reduced without compromising the quality of the array genotyping data. We also showed that whole genome amplification reaction volumes can be reduced by a factor of 4, without compromising the DNA quality. Whole genome amplified DNA samples from dried blood spots is well suited for array genotyping and produces robust and reliable genotype data. However, the amplification process introduces additional noise to the data, making detection of structural variants such as copy number variants difficult. With this study, we explore ways of optimizing the amplification protocol in order to reduce noise and increase data quality. We found, that the amplification process was very robust, and that changes in amplification time or temperature did not alter the genotyping calls or quality of the array data. Adding additional replicates of each sample also lead to

  2. Efficient isolation method for high-quality genomic DNA from cicada exuviae.

    PubMed

    Nguyen, Hoa Quynh; Kim, Ye Inn; Borzée, Amaël; Jang, Yikweon

    2017-10-01

    In recent years, animal ethics issues have led researchers to explore nondestructive methods to access materials for genetic studies. Cicada exuviae are among those materials because they are cast skins that individuals left after molt and are easily collected. In this study, we aim to identify the most efficient extraction method to obtain high quantity and quality of DNA from cicada exuviae. We compared relative DNA yield and purity of six extraction protocols, including both manual protocols and available commercial kits, extracting from four different exoskeleton parts. Furthermore, amplification and sequencing of genomic DNA were evaluated in terms of availability of sequencing sequence at the expected genomic size. Both the choice of protocol and exuvia part significantly affected DNA yield and purity. Only samples that were extracted using the PowerSoil DNA Isolation kit generated gel bands of expected size as well as successful sequencing results. The failed attempts to extract DNA using other protocols could be partially explained by a low DNA yield from cicada exuviae and partly by contamination with humic acids that exist in the soil where cicada nymphs reside before emergence, as shown by spectroscopic measurements. Genomic DNA extracted from cicada exuviae could provide valuable information for species identification, allowing the investigation of genetic diversity across consecutive broods, or spatiotemporal variation among various populations. Consequently, we hope to provide a simple method to acquire pure genomic DNA applicable for multiple research purposes.

  3. Simultaneous isolation of high-quality DNA, RNA, miRNA and proteins from tissues for genomic applications

    PubMed Central

    Peña-Llopis, Samuel; Brugarolas, James

    2014-01-01

    Genomic technologies have revolutionized our understanding of complex Mendelian diseases and cancer. Solid tumors present several challenges for genomic analyses, such as tumor heterogeneity and tumor contamination with surrounding stroma and infiltrating lymphocytes. We developed a protocol to (i) select tissues of high cellular purity on the basis of histological analyses of immediately flanking sections and (ii) simultaneously extract genomic DNA (gDNA), messenger RNA (mRNA), noncoding RNA (ncRNA; enriched in microRNA (miRNA)) and protein from the same tissues. After tissue selection, about 12–16 extractions of DNA/RNA/protein can be obtained per day. Compared with other similar approaches, this fast and reliable methodology allowed us to identify mutations in tumors with remarkable sensitivity and to perform integrative analyses of whole-genome and exome data sets, DNA copy numbers (by single-nucleotide polymorphism (SNP) arrays), gene expression data (by transcriptome profiling and quantitative PCR (qPCR)) and protein levels (by western blotting and immunohistochemical analysis) from the same samples. Although we focused on renal cell carcinoma, this protocol may be adapted with minor changes to any human or animal tissue to obtain high-quality and high-yield nucleic acids and proteins. PMID:24136348

  4. An Efficient Method for Genomic DNA Extraction from Different Molluscs Species

    PubMed Central

    Pereira, Jorge C.; Chaves, Raquel; Bastos, Estela; Leitão, Alexandra; Guedes-Pinto, Henrique

    2011-01-01

    The selection of a DNA extraction method is a critical step when subsequent analysis depends on the DNA quality and quantity. Unlike mammals, for which several capable DNA extraction methods have been developed, for molluscs the availability of optimized genomic DNA extraction protocols is clearly insufficient. Several aspects such as animal physiology, the type (e.g., adductor muscle or gills) or quantity of tissue, can explain the lack of efficiency (quality and yield) in molluscs genomic DNA extraction procedure. In an attempt to overcome these aspects, this work describes an efficient method for molluscs genomic DNA extraction that was tested in several species from different orders: Veneridae, Ostreidae, Anomiidae, Cardiidae (Bivalvia) and Muricidae (Gastropoda), with different weight sample tissues. The isolated DNA was of high molecular weight with high yield and purity, even with reduced quantities of tissue. Moreover, the genomic DNA isolated, demonstrated to be suitable for several downstream molecular techniques, such as PCR sequencing among others. PMID:22174651

  5. DNA Extraction Protocols for Whole-Genome Sequencing in Marine Organisms.

    PubMed

    Panova, Marina; Aronsson, Henrik; Cameron, R Andrew; Dahl, Peter; Godhe, Anna; Lind, Ulrika; Ortega-Martinez, Olga; Pereyra, Ricardo; Tesson, Sylvie V M; Wrange, Anna-Lisa; Blomberg, Anders; Johannesson, Kerstin

    2016-01-01

    The marine environment harbors a large proportion of the total biodiversity on this planet, including the majority of the earths' different phyla and classes. Studying the genomes of marine organisms can bring interesting insights into genome evolution. Today, almost all marine organismal groups are understudied with respect to their genomes. One potential reason is that extraction of high-quality DNA in sufficient amounts is challenging for many marine species. This is due to high polysaccharide content, polyphenols and other secondary metabolites that will inhibit downstream DNA library preparations. Consequently, protocols developed for vertebrates and plants do not always perform well for invertebrates and algae. In addition, many marine species have large population sizes and, as a consequence, highly variable genomes. Thus, to facilitate the sequence read assembly process during genome sequencing, it is desirable to obtain enough DNA from a single individual, which is a challenge in many species of invertebrates and algae. Here, we present DNA extraction protocols for seven marine species (four invertebrates, two algae, and a marine yeast), optimized to provide sufficient DNA quality and yield for de novo genome sequencing projects.

  6. Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing.

    PubMed

    Feehan, Joanna M; Scheibel, Katherine E; Bourras, Salim; Underwood, William; Keller, Beat; Somerville, Shauna C

    2017-03-31

    The powdery mildew fungi are a group of economically important fungal plant pathogens. Relatively little is known about the molecular biology and genetics of these pathogens, in part due to a lack of well-developed genetic and genomic resources. These organisms have large, repetitive genomes, which have made genome sequencing and assembly prohibitively difficult. Here, we describe methods for the collection, extraction, purification and quality control assessment of high molecular weight genomic DNA from one powdery mildew species, Golovinomyces cichoracearum. The protocol described includes mechanical disruption of spores followed by an optimized phenol/chloroform genomic DNA extraction. A typical yield was 7 µg DNA per 150 mg conidia. The genomic DNA that is isolated using this procedure is suitable for long-read sequencing (i.e., > 48.5 kbp). Quality control measures to ensure the size, yield, and purity of the genomic DNA are also described in this method. Sequencing of the genomic DNA of the quality described here will allow for the assembly and comparison of multiple powdery mildew genomes, which in turn will lead to a better understanding and improved control of this agricultural pathogen.

  7. DNA methylation profiling of genomic DNA isolated from urine in diabetic chronic kidney disease: A pilot study

    PubMed Central

    Sexton-Oates, Alexandra; Carmody, Jake; Ekinci, Elif I.; Dwyer, Karen M.; Saffery, Richard

    2018-01-01

    Aim To characterise the genomic DNA (gDNA) yield from urine and quality of derived methylation data generated from the widely used Illuminia Infinium MethylationEPIC (HM850K) platform and compare this with buffy coat samples. Background DNA methylation is the most widely studied epigenetic mark and variations in DNA methylation profile have been implicated in diabetes which affects approximately 415 million people worldwide. Methods QIAamp Viral RNA Mini Kit and QIAamp DNA micro kit were used to extract DNA from frozen and fresh urine samples as well as increasing volumes of fresh urine. Matched buffy coats to the frozen urine were also obtained and DNA was extracted from the buffy coats using the QIAamp DNA Mini Kit. Genomic DNA of greater concentration than 20μg/ml were used for methylation analysis using the HM850K array. Results Irrespective of extraction technique or the use of fresh versus frozen urine samples, limited genomic DNA was obtained using a starting sample volume of 5ml (0–0.86μg/mL). In order to optimize the yield, we increased starting volumes to 50ml fresh urine, which yielded only 0–9.66μg/mL A different kit, QIAamp DNA Micro Kit, was trialled in six fresh urine samples and ten frozen urine samples with inadequate DNA yields from 0–17.7μg/mL and 0–1.6μg/mL respectively. Sufficient genomic DNA was obtained from only 4 of the initial 41 frozen urine samples (10%) for DNA methylation profiling. In comparison, all four buffy coat samples (100%) provided sufficient genomic DNA. Conclusion High quality data can be obtained provided a sufficient yield of genomic DNA is isolated. Despite optimizing various extraction methodologies, the modest amount of genomic DNA derived from urine, may limit the generalisability of this approach for the identification of DNA methylation biomarkers of chronic diabetic kidney disease. PMID:29462136

  8. DNA methylation profiling of genomic DNA isolated from urine in diabetic chronic kidney disease: A pilot study.

    PubMed

    Lecamwasam, Ashani; Sexton-Oates, Alexandra; Carmody, Jake; Ekinci, Elif I; Dwyer, Karen M; Saffery, Richard

    2018-01-01

    To characterise the genomic DNA (gDNA) yield from urine and quality of derived methylation data generated from the widely used Illuminia Infinium MethylationEPIC (HM850K) platform and compare this with buffy coat samples. DNA methylation is the most widely studied epigenetic mark and variations in DNA methylation profile have been implicated in diabetes which affects approximately 415 million people worldwide. QIAamp Viral RNA Mini Kit and QIAamp DNA micro kit were used to extract DNA from frozen and fresh urine samples as well as increasing volumes of fresh urine. Matched buffy coats to the frozen urine were also obtained and DNA was extracted from the buffy coats using the QIAamp DNA Mini Kit. Genomic DNA of greater concentration than 20μg/ml were used for methylation analysis using the HM850K array. Irrespective of extraction technique or the use of fresh versus frozen urine samples, limited genomic DNA was obtained using a starting sample volume of 5ml (0-0.86μg/mL). In order to optimize the yield, we increased starting volumes to 50ml fresh urine, which yielded only 0-9.66μg/mL A different kit, QIAamp DNA Micro Kit, was trialled in six fresh urine samples and ten frozen urine samples with inadequate DNA yields from 0-17.7μg/mL and 0-1.6μg/mL respectively. Sufficient genomic DNA was obtained from only 4 of the initial 41 frozen urine samples (10%) for DNA methylation profiling. In comparison, all four buffy coat samples (100%) provided sufficient genomic DNA. High quality data can be obtained provided a sufficient yield of genomic DNA is isolated. Despite optimizing various extraction methodologies, the modest amount of genomic DNA derived from urine, may limit the generalisability of this approach for the identification of DNA methylation biomarkers of chronic diabetic kidney disease.

  9. An optimized method for high quality DNA extraction from microalga Prototheca wickerhamii for genome sequencing.

    PubMed

    Jagielski, Tomasz; Gawor, Jan; Bakuła, Zofia; Zuchniewicz, Karolina; Żak, Iwona; Gromadka, Robert

    2017-01-01

    The complex cell wall structure of algae often precludes efficient extraction of their genetic material. The purpose of this study was to design a next-generation sequencing-suitable DNA isolation method for unicellular, achlorophyllous, yeast-like microalgae of the genus Prototheca , the only known plant pathogens of both humans and animals. The effectiveness of the newly proposed scheme was compared with five other, previously described methods, commonly used for DNA isolation from plants and/or yeasts, available either as laboratory-developed, in-house assays, based on liquid nitrogen grinding or different enzymatic digestion, or as commercially manufactured kits. All five, previously described, isolation assays yielded DNA concentrations lower than those obtained with the new method, averaging 16.15 ± 25.39 vs 74.2 ± 0.56 ng/µL, respectively. The new method was also superior in terms of DNA purity, as measured by A260/A280 (-0.41 ± 4.26 vs 2.02 ± 0.03), and A260/A230 (1.20 ± 1.12 vs 1.97 ± 0.07) ratios. Only the liquid nitrogen-based method yielded DNA of comparable quantity (60.96 ± 0.16 ng/µL) and quality (A260/A280 = 2.08 ± 0.02; A260/A230 = 2.23 ± 0.26). Still, the new method showed higher integrity, which was best illustrated upon electrophoretic analysis. Genomic DNA of Prototheca wickerhamii POL-1 strain isolated with the protocol herein proposed was successfully sequenced on the Illumina MiSeq platform. A new method for DNA isolation from Prototheca algae is described. The method, whose protocol involves glass beads pulverization and cesium chloride (CsCl) density gradient centrifugation, was demonstrated superior over the other common assays in terms of DNA quantity and quality. The method is also the first to offer the possibility of preparation of DNA template suitable for whole genome sequencing of Prototheca spp.

  10. "Isogaba Maware": quality control of genome DNA by checkpoints.

    PubMed

    Kitazono, A; Matsumoto, T

    1998-05-01

    Checkpoints maintain the interdependency of cell cycle events by permitting the onset of an event only after the completion of the preceding event. The DNA replication checkpoint induces a cell cycle arrest until the completion of the DNA replication. Similarly, the DNA damage checkpoint arrests cell cycle progression if DNA repair is incomplete. A number of genes that play a role in the two checkpoints have been identified through genetic studies in yeasts, and their homologues have been found in fly, mouse, and human. They form signaling cascades activated by a DNA replication block or DNA damage and subsequently generate the negative constraints on cell cycle regulators. The failure of these signaling cascades results in producing offspring that carry mutations or that lack a portion of the genome. In humans, defects in the checkpoints are often associated with cancer-prone diseases. Focusing mainly on the studies in budding and fission yeasts, we summarize the recent progress.

  11. Brief Guide to Genomics: DNA, Genes and Genomes

    MedlinePlus

    ... Sheets A Brief Guide to Genomics About NHGRI Research About the International HapMap Project Biological Pathways Chromosome Abnormalities Chromosomes Cloning Comparative Genomics DNA Microarray Technology DNA Sequencing Deoxyribonucleic Acid ( ...

  12. Chemically synthesized silver nanoparticles as cell lysis agent for bacterial genomic DNA isolation

    NASA Astrophysics Data System (ADS)

    Goswami, Gunajit; Boruah, Himangshu; Gautom, Trishnamoni; Jyoti Hazarika, Dibya; Barooah, Madhumita; Boro, Robin Chandra

    2017-12-01

    Silver nanoparticles (AgNPs) have seen a recent spurt of use in varied fields of science. In this paper, we showed a novel application of AgNP as a promising microbial cell-lysis agent for genomic DNA isolation. We utilized chemically synthesized AgNPs for lysing bacterial cells to isolate their genomic DNA. The AgNPs efficiently lysed bacterial cells to yield good quality DNA that could be subsequently used for several molecular biology works.

  13. Direct detection of methylation in genomic DNA

    PubMed Central

    Bart, A.; van Passel, M. W. J.; van Amsterdam, K.; van der Ende, A.

    2005-01-01

    The identification of methylated sites on bacterial genomic DNA would be a useful tool to study the major roles of DNA methylation in prokaryotes: distinction of self and nonself DNA, direction of post-replicative mismatch repair, control of DNA replication and cell cycle, and regulation of gene expression. Three types of methylated nucleobases are known: N6-methyladenine, 5-methylcytosine and N4-methylcytosine. The aim of this study was to develop a method to detect all three types of DNA methylation in complete genomic DNA. It was previously shown that N6-methyladenine and 5-methylcytosine in plasmid and viral DNA can be detected by intersequence trace comparison of methylated and unmethylated DNA. We extended this method to include N4-methylcytosine detection in both in vitro and in vivo methylated DNA. Furthermore, application of intersequence trace comparison was extended to bacterial genomic DNA. Finally, we present evidence that intrasequence comparison suffices to detect methylated sites in genomic DNA. In conclusion, we present a method to detect all three natural types of DNA methylation in bacterial genomic DNA. This provides the possibility to define the complete methylome of any prokaryote. PMID:16091626

  14. Development of forensic-quality full mtGenome haplotypes: success rates with low template specimens.

    PubMed

    Just, Rebecca S; Scheible, Melissa K; Fast, Spence A; Sturk-Andreaggi, Kimberly; Higginbotham, Jennifer L; Lyons, Elizabeth A; Bush, Jocelyn M; Peck, Michelle A; Ring, Joseph D; Diegoli, Toni M; Röck, Alexander W; Huber, Gabriela E; Nagl, Simone; Strobl, Christina; Zimmermann, Bettina; Parson, Walther; Irwin, Jodi A

    2014-05-01

    Forensic mitochondrial DNA (mtDNA) testing requires appropriate, high quality reference population data for estimating the rarity of questioned haplotypes and, in turn, the strength of the mtDNA evidence. Available reference databases (SWGDAM, EMPOP) currently include information from the mtDNA control region; however, novel methods that quickly and easily recover mtDNA coding region data are becoming increasingly available. Though these assays promise to both facilitate the acquisition of mitochondrial genome (mtGenome) data and maximize the general utility of mtDNA testing in forensics, the appropriate reference data and database tools required for their routine application in forensic casework are lacking. To address this deficiency, we have undertaken an effort to: (1) increase the large-scale availability of high-quality entire mtGenome reference population data, and (2) improve the information technology infrastructure required to access/search mtGenome data and employ them in forensic casework. Here, we describe the application of a data generation and analysis workflow to the development of more than 400 complete, forensic-quality mtGenomes from low DNA quantity blood serum specimens as part of a U.S. National Institute of Justice funded reference population databasing initiative. We discuss the minor modifications made to a published mtGenome Sanger sequencing protocol to maintain a high rate of throughput while minimizing manual reprocessing with these low template samples. The successful use of this semi-automated strategy on forensic-like samples provides practical insight into the feasibility of producing complete mtGenome data in a routine casework environment, and demonstrates that large (>2kb) mtDNA fragments can regularly be recovered from high quality but very low DNA quantity specimens. Further, the detailed empirical data we provide on the amplification success rates across a range of DNA input quantities will be useful moving forward as PCR

  15. Assessment of data processing to improve reliability of microarray experiments using genomic DNA reference.

    PubMed

    Yang, Yunfeng; Zhu, Mengxia; Wu, Liyou; Zhou, Jizhong

    2008-09-16

    Using genomic DNA as common reference in microarray experiments has recently been tested by different laboratories. Conflicting results have been reported with regard to the reliability of microarray results using this method. To explain it, we hypothesize that data processing is a critical element that impacts the data quality. Microarray experiments were performed in a gamma-proteobacterium Shewanella oneidensis. Pair-wise comparison of three experimental conditions was obtained either with two labeled cDNA samples co-hybridized to the same array, or by employing Shewanella genomic DNA as a standard reference. Various data processing techniques were exploited to reduce the amount of inconsistency between both methods and the results were assessed. We discovered that data quality was significantly improved by imposing the constraint of minimal number of replicates, logarithmic transformation and random error analyses. These findings demonstrate that data processing significantly influences data quality, which provides an explanation for the conflicting evaluation in the literature. This work could serve as a guideline for microarray data analysis using genomic DNA as a standard reference.

  16. Survey of protein–DNA interactions in Aspergillus oryzae on a genomic scale

    PubMed Central

    Wang, Chao; Lv, Yangyong; Wang, Bin; Yin, Chao; Lin, Ying; Pan, Li

    2015-01-01

    The genome-scale delineation of in vivo protein–DNA interactions is key to understanding genome function. Only ∼5% of transcription factors (TFs) in the Aspergillus genus have been identified using traditional methods. Although the Aspergillus oryzae genome contains >600 TFs, knowledge of the in vivo genome-wide TF-binding sites (TFBSs) in aspergilli remains limited because of the lack of high-quality antibodies. We investigated the landscape of in vivo protein–DNA interactions across the A. oryzae genome through coupling the DNase I digestion of intact nuclei with massively parallel sequencing and the analysis of cleavage patterns in protein–DNA interactions at single-nucleotide resolution. The resulting map identified overrepresented de novo TF-binding motifs from genomic footprints, and provided the detailed chromatin remodeling patterns and the distribution of digital footprints near transcription start sites. The TFBSs of 19 known Aspergillus TFs were also identified based on DNase I digestion data surrounding potential binding sites in conjunction with TF binding specificity information. We observed that the cleavage patterns of TFBSs were dependent on the orientation of TF motifs and independent of strand orientation, consistent with the DNA shape features of binding motifs with flanking sequences. PMID:25883143

  17. Quality scores for 32,000 genomes

    DOE PAGES

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran; ...

    2014-12-08

    More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes hadmore » quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.« less

  18. Quality scores for 32,000 genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran

    More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes hadmore » quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.« less

  19. DNA cards: determinants of DNA yield and quality in collecting genetic samples for pharmacogenetic studies.

    PubMed

    Mas, Sergi; Crescenti, Anna; Gassó, Patricia; Vidal-Taboada, Jose M; Lafuente, Amalia

    2007-08-01

    As pharmacogenetic studies frequently require establishment of DNA banks containing large cohorts with multi-centric designs, inexpensive methods for collecting and storing high-quality DNA are needed. The aims of this study were two-fold: to compare the amount and quality of DNA obtained from two different DNA cards (IsoCode Cards or FTA Classic Cards, Whatman plc, Brentford, Middlesex, UK); and to evaluate the effects of time and storage temperature, as well as the influence of anticoagulant ethylenediaminetetraacetic acid on the DNA elution procedure. The samples were genotyped by several methods typically used in pharmacogenetic studies: multiplex PCR, PCR-restriction fragment length polymorphism, single nucleotide primer extension, and allelic discrimination assay. In addition, they were amplified by whole genome amplification to increase genomic DNA mass. Time, storage temperature and ethylenediaminetetraacetic acid had no significant effects on either DNA card. This study reveals the importance of drying blood spots prior to isolation to avoid haemoglobin interference. Moreover, our results demonstrate that re-isolation protocols could be applied to increase the amount of DNA recovered. The samples analysed were accurately genotyped with all the methods examined herein. In conclusion, our study shows that both DNA cards, IsoCode Cards and FTA Classic Cards, facilitate genetic and pharmacogenetic testing for routine clinical practice.

  20. Deppdb--DNA electrostatic potential properties database: electrostatic properties of genome DNA.

    PubMed

    Osypov, Alexander A; Krutinin, Gleb G; Kamzolova, Svetlana G

    2010-06-01

    The electrostatic properties of genome DNA influence its interactions with different proteins, in particular, the regulation of transcription by RNA-polymerases. DEPPDB--DNA Electrostatic Potential Properties Database--was developed to hold and provide all available information on the electrostatic properties of genome DNA combined with its sequence and annotation of biological and structural properties of genome elements and whole genomes. Genomes in DEPPDB are organized on a taxonomical basis. Currently, the database contains all the completely sequenced bacterial and viral genomes according to NCBI RefSeq. General properties of the genome DNA electrostatic potential profile and principles of its formation are revealed. This potential correlates with the GC content but does not correspond to it exactly and strongly depends on both the sequence arrangement and its context (flanking regions). Analysis of the promoter regions for bacterial and viral RNA polymerases revealed a correspondence between the scale of these proteins' physical properties and electrostatic profile patterns. We also discovered a direct correlation between the potential value and the binding frequency of RNA polymerase to DNA, supporting the idea of the role of electrostatics in these interactions. This matches a pronounced tendency of the promoter regions to possess higher values of the electrostatic potential.

  1. DEPPDB - DNA electrostatic potential properties database. Electrostatic properties of genome DNA elements.

    PubMed

    Osypov, Alexander A; Krutinin, Gleb G; Krutinina, Eugenia A; Kamzolova, Svetlana G

    2012-04-01

    Electrostatic properties of genome DNA are important to its interactions with different proteins, in particular, related to transcription. DEPPDB - DNA Electrostatic Potential (and other Physical) Properties Database - provides information on the electrostatic and other physical properties of genome DNA combined with its sequence and annotation of biological and structural properties of genomes and their elements. Genomes are organized on taxonomical basis, supporting comparative and evolutionary studies. Currently, DEPPDB contains all completely sequenced bacterial, viral, mitochondrial, and plastids genomes according to the NCBI RefSeq, and some model eukaryotic genomes. Data for promoters, regulation sites, binding proteins, etc., are incorporated from established DBs and literature. The database is complemented by analytical tools. User sequences calculations are available. Case studies discovered electrostatics complementing DNA bending in E.coli plasmid BNT2 promoter functioning, possibly affecting host-environment metabolic switch. Transcription factors binding sites gravitate to high potential regions, confirming the electrostatics universal importance in protein-DNA interactions beyond the classical promoter-RNA polymerase recognition and regulation. Other genome elements, such as terminators, also show electrostatic peculiarities. Most intriguing are gene starts, exhibiting taxonomic correlations. The necessity of the genome electrostatic properties studies is discussed.

  2. Whole genome amplification of DNA extracted from FFPE tissues.

    PubMed

    Bosso, Mira; Al-Mulla, Fahd

    2011-01-01

    Whole genome amplification systems were developed to meet the increasing research demands on DNA resources and to avoid DNA shortage. The technology enables amplification of nanogram amounts of DNA into microgram quantities and is increasingly used in the amplification of DNA from multiple origins such as blood, fresh frozen tissue, formalin-fixed paraffin-embedded tissues, saliva, buccal swabs, bacteria, and plant and animal sources. This chapter focuses on the use of GenomePlex(®) tissue Whole Genome Amplification Kit, to amplify DNA directly from archived tissue. In addition, this chapter documents our unique experience with the utilization of GenomePlex(®) amplified DNA using several molecular techniques including metaphase Comparative Genomic Hybridization, array Comparative Genomic Hybridization, and real-time quantitative polymerase chain reaction assays. GenomePlex(®) is a registered trademark of Rubicon Genomics Incorporation.

  3. Genome Calligrapher: A Web Tool for Refactoring Bacterial Genome Sequences for de Novo DNA Synthesis.

    PubMed

    Christen, Matthias; Deutsch, Samuel; Christen, Beat

    2015-08-21

    Recent advances in synthetic biology have resulted in an increasing demand for the de novo synthesis of large-scale DNA constructs. Any process improvement that enables fast and cost-effective streamlining of digitized genetic information into fabricable DNA sequences holds great promise to study, mine, and engineer genomes. Here, we present Genome Calligrapher, a computer-aided design web tool intended for whole genome refactoring of bacterial chromosomes for de novo DNA synthesis. By applying a neutral recoding algorithm, Genome Calligrapher optimizes GC content and removes obstructive DNA features known to interfere with the synthesis of double-stranded DNA and the higher order assembly into large DNA constructs. Subsequent bioinformatics analysis revealed that synthesis constraints are prevalent among bacterial genomes. However, a low level of codon replacement is sufficient for refactoring bacterial genomes into easy-to-synthesize DNA sequences. To test the algorithm, 168 kb of synthetic DNA comprising approximately 20 percent of the synthetic essential genome of the cell-cycle bacterium Caulobacter crescentus was streamlined and then ordered from a commercial supplier of low-cost de novo DNA synthesis. The successful assembly into eight 20 kb segments indicates that Genome Calligrapher algorithm can be efficiently used to refactor difficult-to-synthesize DNA. Genome Calligrapher is broadly applicable to recode biosynthetic pathways, DNA sequences, and whole bacterial genomes, thus offering new opportunities to use synthetic biology tools to explore the functionality of microbial diversity. The Genome Calligrapher web tool can be accessed at https://christenlab.ethz.ch/GenomeCalligrapher  .

  4. Chromatin Dynamics in Genome Stability: Roles in Suppressing Endogenous DNA Damage and Facilitating DNA Repair

    PubMed Central

    Nair, Nidhi; Shoaib, Muhammad

    2017-01-01

    Genomic DNA is compacted into chromatin through packaging with histone and non-histone proteins. Importantly, DNA accessibility is dynamically regulated to ensure genome stability. This is exemplified in the response to DNA damage where chromatin relaxation near genomic lesions serves to promote access of relevant enzymes to specific DNA regions for signaling and repair. Furthermore, recent data highlight genome maintenance roles of chromatin through the regulation of endogenous DNA-templated processes including transcription and replication. Here, we review research that shows the importance of chromatin structure regulation in maintaining genome integrity by multiple mechanisms including facilitating DNA repair and directly suppressing endogenous DNA damage. PMID:28698521

  5. Genome instabilities arising from ribonucleotides in DNA.

    PubMed

    Klein, Hannah L

    2017-08-01

    Genomic DNA is transiently contaminated with ribonucleotide residues during the process of DNA replication through misincorporation by the replicative DNA polymerases α, δ and ε, and by the normal replication process on the lagging strand, which uses RNA primers. These ribonucleotides are efficiently removed during replication by RNase H enzymes and the lagging strand synthesis machinery. However, when ribonucleotides remain in DNA they can distort the DNA helix, affect machineries for DNA replication, transcription and repair, and can stimulate genomic instabilities which are manifest as increased mutation, recombination and chromosome alterations. The genomic instabilities associated with embedded ribonucleotides are considered here, along with a discussion of the origin of the lesions that stimulate particular classes of instabilities. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. DNA demethylation in the Arabidopsis genome

    PubMed Central

    Penterman, Jon; Zilberman, Daniel; Huh, Jin Hoe; Ballinger, Tracy; Henikoff, Steven; Fischer, Robert L.

    2007-01-01

    Cytosine DNA methylation is considered to be a stable epigenetic mark, but active demethylation has been observed in both plants and animals. In Arabidopsis thaliana, DNA glycosylases of the DEMETER (DME) family remove methylcytosines from DNA. Demethylation by DME is necessary for genomic imprinting, and demethylation by a related protein, REPRESSOR OF SILENCING1, prevents gene silencing in a transgenic background. However, the extent and function of demethylation by DEMETER-LIKE (DML) proteins in WT plants is not known. Using genome-tiling microarrays, we mapped DNA methylation in mutant and WT plants and identified 179 loci actively demethylated by DML enzymes. Mutations in DML genes lead to locus-specific DNA hypermethylation. Reintroducing WT DML genes restores most loci to the normal pattern of methylation, although at some loci, hypermethylated epialleles persist. Of loci demethylated by DML enzymes, >80% are near or overlap genes. Genic demethylation by DML enzymes primarily occurs at the 5′ and 3′ ends, a pattern opposite to the overall distribution of WT DNA methylation. Our results show that demethylation by DML DNA glycosylases edits the patterns of DNA methylation within the Arabidopsis genome to protect genes from potentially deleterious methylation. PMID:17409185

  7. Detection of DNA Methylation by Whole-Genome Bisulfite Sequencing.

    PubMed

    Li, Qing; Hermanson, Peter J; Springer, Nathan M

    2018-01-01

    DNA methylation plays an important role in the regulation of the expression of transposons and genes. Various methods have been developed to assay DNA methylation levels. Bisulfite sequencing is considered to be the "gold standard" for single-base resolution measurement of DNA methylation levels. Coupled with next-generation sequencing, whole-genome bisulfite sequencing (WGBS) allows DNA methylation to be evaluated at a genome-wide scale. Here, we described a protocol for WGBS in plant species with large genomes. This protocol has been successfully applied to assay genome-wide DNA methylation levels in maize and barley. This protocol has also been successfully coupled with sequence capture technology to assay DNA methylation levels in a targeted set of genomic regions.

  8. Extracting DNA from 'jaws': high yield and quality from archived tiger shark (Galeocerdo cuvier) skeletal material.

    PubMed

    Nielsen, E E; Morgan, J A T; Maher, S L; Edson, J; Gauthier, M; Pepperell, J; Holmes, B J; Bennett, M B; Ovenden, J R

    2017-05-01

    Archived specimens are highly valuable sources of DNA for retrospective genetic/genomic analysis. However, often limited effort has been made to evaluate and optimize extraction methods, which may be crucial for downstream applications. Here, we assessed and optimized the usefulness of abundant archived skeletal material from sharks as a source of DNA for temporal genomic studies. Six different methods for DNA extraction, encompassing two different commercial kits and three different protocols, were applied to material, so-called bio-swarf, from contemporary and archived jaws and vertebrae of tiger sharks (Galeocerdo cuvier). Protocols were compared for DNA yield and quality using a qPCR approach. For jaw swarf, all methods provided relatively high DNA yield and quality, while large differences in yield between protocols were observed for vertebrae. Similar results were obtained from samples of white shark (Carcharodon carcharias). Application of the optimized methods to 38 museum and private angler trophy specimens dating back to 1912 yielded sufficient DNA for downstream genomic analysis for 68% of the samples. No clear relationships between age of samples, DNA quality and quantity were observed, likely reflecting different preparation and storage methods for the trophies. Trial sequencing of DNA capture genomic libraries using 20 000 baits revealed that a significant proportion of captured sequences were derived from tiger sharks. This study demonstrates that archived shark jaws and vertebrae are potential high-yield sources of DNA for genomic-scale analysis. It also highlights that even for similar tissue types, a careful evaluation of extraction protocols can vastly improve DNA yield. © 2016 John Wiley & Sons Ltd.

  9. Genomic relations among 31 species of Mammillaria haworth (Cactaceae) using random amplified polymorphic DNA.

    PubMed

    Mattagajasingh, Ilwola; Mukherjee, Arup Kumar; Das, Premananda

    2006-01-01

    Thirty-one species of Mammillaria were selected to study the molecular phylogeny using random amplified polymorphic DNA (RAPD) markers. High amount of mucilage (gelling polysaccharides) present in Mammillaria was a major obstacle in isolating good quality genomic DNA. The CTAB (cetyl trimethyl ammonium bromide) method was modified to obtain good quality genomic DNA. Twenty-two random decamer primers resulted in 621 bands, all of which were polymorphic. The similarity matrix value varied from 0.109 to 0.622 indicating wide variability among the studied species. The dendrogram obtained from the unweighted pair group method using arithmetic averages (UPGMA) analysis revealed that some of the species did not follow the conventional classification. The present work shows the usefulness of RAPD markers for genetic characterization to establish phylogenetic relations among Mammillaria species.

  10. DNA-DNA hybridization values and their relationship to whole-genome sequence similarities.

    PubMed

    Goris, Johan; Konstantinidis, Konstantinos T; Klappenbach, Joel A; Coenye, Tom; Vandamme, Peter; Tiedje, James M

    2007-01-01

    DNA-DNA hybridization (DDH) values have been used by bacterial taxonomists since the 1960s to determine relatedness between strains and are still the most important criterion in the delineation of bacterial species. Since the extent of hybridization between a pair of strains is ultimately governed by their respective genomic sequences, we examined the quantitative relationship between DDH values and genome sequence-derived parameters, such as the average nucleotide identity (ANI) of common genes and the percentage of conserved DNA. A total of 124 DDH values were determined for 28 strains for which genome sequences were available. The strains belong to six important and diverse groups of bacteria for which the intra-group 16S rRNA gene sequence identity was greater than 94 %. The results revealed a close relationship between DDH values and ANI and between DNA-DNA hybridization and the percentage of conserved DNA for each pair of strains. The recommended cut-off point of 70 % DDH for species delineation corresponded to 95 % ANI and 69 % conserved DNA. When the analysis was restricted to the protein-coding portion of the genome, 70 % DDH corresponded to 85 % conserved genes for a pair of strains. These results reveal extensive gene diversity within the current concept of "species". Examination of reciprocal values indicated that the level of experimental error associated with the DDH method is too high to reveal the subtle differences in genome size among the strains sampled. It is concluded that ANI can accurately replace DDH values for strains for which genome sequences are available.

  11. DNA Damage Reduces the Quality, but Not the Quantity of Human Papillomavirus 16 E1 and E2 DNA Replication.

    PubMed

    Bristol, Molly L; Wang, Xu; Smith, Nathan W; Son, Minkyeong P; Evans, Michael R; Morgan, Iain M

    2016-06-22

    Human papillomaviruses (HPVs) are causative agents in almost all cervical carcinomas. HPVs are also causative agents in head and neck cancer, the cases of which are increasing rapidly. Viral replication activates the DNA damage response (DDR) pathway; associated proteins are recruited to replication foci, and this pathway may serve to allow for viral genome amplification. Likewise, HPV genome double-strand breaks (DSBs) could be produced during replication and could lead to linearization and viral integration. Many studies have shown that viral integration into the host genome results in unregulated expression of the viral oncogenes, E6 and E7, promoting HPV-induced carcinogenesis. Previously, we have demonstrated that DNA-damaging agents, such as etoposide, or knocking down viral replication partner proteins, such as topoisomerase II β binding protein I (TopBP1), does not reduce the level of DNA replication. Here, we investigated whether these treatments alter the quality of DNA replication by HPV16 E1 and E2. We confirm that knockdown of TopBP1 or treatment with etoposide does not reduce total levels of E1/E2-mediated DNA replication; however, the quality of replication is significantly reduced. The results demonstrate that E1 and E2 continue to replicate under genomically-stressed conditions and that this replication is mutagenic. This mutagenesis would promote the formation of substrates for integration of the viral genome into that of the host, a hallmark of cervical cancer.

  12. The Neandertal genome and ancient DNA authenticity

    PubMed Central

    Green, Richard E; Briggs, Adrian W; Krause, Johannes; Prüfer, Kay; Burbano, Hernán A; Siebauer, Michael; Lachmann, Michael; Pääbo, Svante

    2009-01-01

    Recent advances in high-thoughput DNA sequencing have made genome-scale analyses of genomes of extinct organisms possible. With these new opportunities come new difficulties in assessing the authenticity of the DNA sequences retrieved. We discuss how these difficulties can be addressed, particularly with regard to analyses of the Neandertal genome. We argue that only direct assays of DNA sequence positions in which Neandertals differ from all contemporary humans can serve as a reliable means to estimate human contamination. Indirect measures, such as the extent of DNA fragmentation, nucleotide misincorporations, or comparison of derived allele frequencies in different fragment size classes, are unreliable. Fortunately, interim approaches based on mtDNA differences between Neandertals and current humans, detection of male contamination through Y chromosomal sequences, and repeated sequencing from the same fossil to detect autosomal contamination allow initial large-scale sequencing of Neandertal genomes. This will result in the discovery of fixed differences in the nuclear genome between Neandertals and current humans that can serve as future direct assays for contamination. For analyses of other fossil hominins, which may become possible in the future, we suggest a similar ‘boot-strap' approach in which interim approaches are applied until sufficient data for more definitive direct assays are acquired. PMID:19661919

  13. GBshape: a genome browser database for DNA shape annotations

    PubMed Central

    Chiu, Tsu-Pei; Yang, Lin; Zhou, Tianyin; Main, Bradley J.; Parker, Stephen C.J.; Nuzhdin, Sergey V.; Tullius, Thomas D.; Rohs, Remo

    2015-01-01

    Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species. PMID:25326329

  14. The contribution of co-transcriptional RNA:DNA hybrid structures to DNA damage and genome instability

    PubMed Central

    Hamperl, Stephan; Cimprich, Karlene A.

    2014-01-01

    Accurate DNA replication and DNA repair are crucial for the maintenance of genome stability, and it is generally accepted that failure of these processes is a major source of DNA damage in cells. Intriguingly, recent evidence suggests that DNA damage is more likely to occur at genomic loci with high transcriptional activity. Furthermore, loss of certain RNA processing factors in eukaryotic cells is associated with increased formation of co-transcriptional RNA:DNA hybrid structures known as R-loops, resulting in double-strand breaks (DSBs) and DNA damage. However, the molecular mechanisms by which R-loop structures ultimately lead to DNA breaks and genome instability is not well understood. In this review, we summarize the current knowledge about the formation, recognition and processing of RNA:DNA hybrids, and discuss possible mechanisms by which these structures contribute to DNA damage and genome instability in the cell. PMID:24746923

  15. GBshape: a genome browser database for DNA shape annotations.

    PubMed

    Chiu, Tsu-Pei; Yang, Lin; Zhou, Tianyin; Main, Bradley J; Parker, Stephen C J; Nuzhdin, Sergey V; Tullius, Thomas D; Rohs, Remo

    2015-01-01

    Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. TECHNICAL BRIEF: Isolation of total DNA from postmortem human eye tissues and quality comparison between iris and retina

    PubMed Central

    Wang, Jay Ching Chieh; Wang, Aikun; Gao, Jiangyuan; Cao, Sijia; Samad, Idris; Zhang, Dean; Ritland, Carol; Cui, Jing Z.

    2012-01-01

    Background Recent genomic technologies have propelled our understanding of the mechanisms underlying complex eye diseases such as age-related macular degeneration (AMD). Genotyping postmortem eye tissues for known single nucleotide polymorphisms (SNPs) associated with AMD may prove valuable, especially when combined with information obtained through other methods such as immunohistochemistry, western blot, enzyme-linked immunosorbent assay (ELISA), and proteomics. Initially intending to genotype postmortem eye tissues for AMD-related SNPs, our group became interested in isolating and comparing the quality of DNA from the iris and retina of postmortem donor eyes. Since there is no previously published protocol in the literature on this topic, we present a protocol suitable for isolating high-quality DNA from postmortem eye tissues for genomic studies. Methods DNA from 33 retinal samples and 35 iris samples was extracted using the phenol-chloroform-isoamyl method from postmortem donor eye tissues. The quantity of DNA was measured with a spectrophotometer while the quality was checked using gel electrophoresis. The DNA samples were then amplified with PCR for the complement factor H (CFH) gene. The purified amplified products were then genotyped for the SNPs in the CFH gene. Results Regarding concentration, the retina yielded 936 ng/μl of DNA, while the iris yielded 78 ng/μl of DNA. Retinal DNA was also purer than iris DNA (260/280=1.78 vs. 1.46, respectively), and produced superior PCR results. Retinal tissue yielded significantly more DNA than the iris tissue per mg of sample (21.7 ng/μl/mg vs. 7.42 ng/μl/mg). Retinal DNA can be readily amplified with PCR, while iris DNA can also be amplified by adding bovine serum albumin. Overall, retinal tissues yielded DNA of superior quality, quantity, and suitability for genotyping and genomic studies. Conclusions The protocol presented here provides a clear and reliable method for isolating total DNA from postmortem eye

  17. Sequencing intractable DNA to close microbial genomes.

    PubMed

    Hurt, Richard A; Brown, Steven D; Podar, Mircea; Palumbo, Anthony V; Elias, Dwayne A

    2012-01-01

    Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  18. Z-DNA-induced super-transport of energy within genomes

    NASA Astrophysics Data System (ADS)

    Kulish, Vladimir V.; Heng, Li; Dröge, Peter

    2007-10-01

    Spontaneous transitions of genomic DNA segments from right-handed B-DNA into the left-handed, high-energy Z conformation are unstable within topologically relaxed DNA molecules, such as mammalian chromosomes. Here we show, from direct application of the principles of statistical physics with a promoter region in the mouse genome as a representative example, that the life span for this alternate DNA conformation may be much smaller than the characteristic time of thermal fluctuations that cause the B-to-Z transition. Surprisingly, such a short existence of Z-DNA is important because it can be responsible for super-transport of energy within a genome. This type of energy transport can be utilized by a cell to communicate information about the state of particular chromatin domains within chromosomes or as a buffer against genome instability.

  19. Myeloperoxidase-induced Genomic DNA-centered Radicals*

    PubMed Central

    Gomez-Mejiba, Sandra E.; Zhai, Zili; Gimenez, Maria S.; Ashby, Michael T.; Chilakapati, Jaya; Kitchin, Kirk; Mason, Ronald P.; Ramirez, Dario C.

    2010-01-01

    Myeloperoxidase (MPO) released by activated neutrophils can initiate and promote carcinogenesis. MPO produces hypochlorous acid (HOCl) that oxidizes the genomic DNA in inflammatory cells as well as in surrounding epithelial cells. DNA-centered radicals are early intermediates formed during DNA oxidation. Once formed, DNA-centered radicals decay by mechanisms that are not completely understood, producing a number of oxidation products that are studied as markers of DNA oxidation. In this study we employed the 5,5-dimethyl-1-pyrroline N-oxide-based immuno-spin trapping technique to investigate the MPO-triggered formation of DNA-centered radicals in inflammatory and epithelial cells and to test whether resveratrol blocks HOCl-induced DNA-centered radical formation in these cells. We found that HOCl added exogenously or generated intracellularly by MPO that has been taken up by the cell or by MPO newly synthesized produces DNA-centered radicals inside cells. We also found that resveratrol passed across cell membranes and scavenged HOCl before it reacted with the genomic DNA, thus blocking DNA-centered radical formation. Taken together our results indicate that the formation of DNA-centered radicals by intracellular MPO may be a useful point of therapeutic intervention in inflammation-induced carcinogenesis. PMID:20406811

  20. Whole genome DNA methylation: beyond genes silencing.

    PubMed

    Tirado-Magallanes, Roberto; Rebbani, Khadija; Lim, Ricky; Pradhan, Sriharsa; Benoukraf, Touati

    2017-01-17

    The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation at near base pair level resolution, far beyond that of the kilobase-long canonical CpG islands that initially revealed the biological relevance of this covalent DNA modification. The latest high-resolution studies have revealed a role for very punctual DNA methylation in chromatin plasticity, gene regulation and splicing. Here, we aim to outline the major biological consequences of DNA methylation recently discovered. We also discuss the necessity of tuning DNA methylation resolution into an adequate scale to ease the integration of the methylome information with other chromatin features and transcription events such as gene expression, nucleosome positioning, transcription factors binding dynamic, gene splicing and genomic imprinting. Finally, our review sheds light on DNA methylation heterogeneity in cell population and the different approaches used for its assessment, including the contribution of single cell DNA analysis technology.

  1. Whole genome DNA methylation: beyond genes silencing

    PubMed Central

    Tirado-Magallanes, Roberto; Rebbani, Khadija; Lim, Ricky; Pradhan, Sriharsa; Benoukraf, Touati

    2017-01-01

    The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation at near base pair level resolution, far beyond that of the kilobase-long canonical CpG islands that initially revealed the biological relevance of this covalent DNA modification. The latest high-resolution studies have revealed a role for very punctual DNA methylation in chromatin plasticity, gene regulation and splicing. Here, we aim to outline the major biological consequences of DNA methylation recently discovered. We also discuss the necessity of tuning DNA methylation resolution into an adequate scale to ease the integration of the methylome information with other chromatin features and transcription events such as gene expression, nucleosome positioning, transcription factors binding dynamic, gene splicing and genomic imprinting. Finally, our review sheds light on DNA methylation heterogeneity in cell population and the different approaches used for its assessment, including the contribution of single cell DNA analysis technology. PMID:27895318

  2. Modified salting-out method: high-yield, high-quality genomic DNA extraction from whole blood using laundry detergent.

    PubMed

    Nasiri, H; Forouzandeh, M; Rasaee, M J; Rahbarizadeh, F

    2005-01-01

    Different approaches have been used to extract DNA from whole blood. In most of these methods enzymes (such as proteinase K and RNAse A) or toxic organic solvents (such as phenol or guanidine isothiocyanate) are used. Since these enzymes are expensive, and most of the materials that are used routinely are toxic, it is desirable to apply an efficient DNA extraction procedure that does not require the use of such materials. In this study, genomic DNA was extracted by the salting-out method, but instead of using an analytical-grade enzyme and chemical detergents, as normally used for DNA isolation, a common laundry powder was used. Different concentrations of the powder were tested, and proteins were precipitated by NaCl-saturated distilled water. Finally, DNA precipitation was performed with the use of 96% ethanol. From the results, we conclude that the optimum concentration of laundry powder for the highest yield and purity of isolated DNA is 30 mg/mL. The procedure was optimized, and a final protocol is suggested. Following the same protocol, DNA was extracted from 100 blood samples, and their amounts were found to be >50 microg/mL of whole blood. The integrity of the DNA fragments was confirmed by agarose gel electrophoresis. Furthermore, the extracted DNA was used as a template for PCR reaction. The results obtained from PCR showed that the final solutions of extracted DNA did not contain any inhibitory material for the enzyme used in the PCR reaction, and indicated that the isolated DNA was of good quality. These results show that this method is simple, fast, safe, and cost-effective, and can be used in medical laboratories and research centers. Copyright 2005 Wiley-Liss, Inc.

  3. Quantification of genomic relationship from DNA pooled samples

    USDA-ARS?s Scientific Manuscript database

    Use of DNA pooling for GWAS has been demonstrated to reduce genotypic costs up to 90% while achieving similar power to individual genotyping. Recent work has focused on use of DNA pooling to inform problems in genomic prediction. This study is designed to demonstrate the efficacy of estimating genom...

  4. Hawkeye and AMOS: visualizing and assessing the quality of genome assemblies

    PubMed Central

    Schatz, Michael C.; Phillippy, Adam M.; Sommer, Daniel D.; Delcher, Arthur L.; Puiu, Daniela; Narzisi, Giuseppe; Salzberg, Steven L.; Pop, Mihai

    2013-01-01

    Since its launch in 2004, the open-source AMOS project has released several innovative DNA sequence analysis applications including: Hawkeye, a visual analytics tool for inspecting the structure of genome assemblies; the Assembly Forensics and FRCurve pipelines for systematically evaluating the quality of a genome assembly; and AMOScmp, the first comparative genome assembler. These applications have been used to assemble and analyze dozens of genomes ranging in complexity from simple microbial species through mammalian genomes. Recent efforts have been focused on enhancing support for new data characteristics brought on by second- and now third-generation sequencing. This review describes the major components of AMOS in light of these challenges, with an emphasis on methods for assessing assembly quality and the visual analytics capabilities of Hawkeye. These interactive graphical aspects are essential for navigating and understanding the complexities of a genome assembly, from the overall genome structure down to individual bases. Hawkeye and AMOS are available open source at http://amos.sourceforge.net. PMID:22199379

  5. Differential DNA Methylation Analysis without a Reference Genome.

    PubMed

    Klughammer, Johanna; Datlinger, Paul; Printz, Dieter; Sheffield, Nathan C; Farlik, Matthias; Hadler, Johanna; Fritsch, Gerhard; Bock, Christoph

    2015-12-22

    Genome-wide DNA methylation mapping uncovers epigenetic changes associated with animal development, environmental adaptation, and species evolution. To address the lack of high-throughput methods for DNA methylation analysis in non-model organisms, we developed an integrated approach for studying DNA methylation differences independent of a reference genome. Experimentally, our method relies on an optimized 96-well protocol for reduced representation bisulfite sequencing (RRBS), which we have validated in nine species (human, mouse, rat, cow, dog, chicken, carp, sea bass, and zebrafish). Bioinformatically, we developed the RefFreeDMA software to deduce ad hoc genomes directly from RRBS reads and to pinpoint differentially methylated regions between samples or groups of individuals (http://RefFreeDMA.computational-epigenetics.org). The identified regions are interpreted using motif enrichment analysis and/or cross-mapping to annotated genomes. We validated our method by reference-free analysis of cell-type-specific DNA methylation in the blood of human, cow, and carp. In summary, we present a cost-effective method for epigenome analysis in ecology and evolution, which enables epigenome-wide association studies in natural populations and species without a reference genome. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  6. Human Genome Research: Decoding DNA

    Science.gov Websites

    instructions for making all the protein molecules for all the different kinds of cells of the human body dropdown arrow Site Map A-Z Index Menu Synopsis Human Genome Research: Decoding DNA Resources with DeLisi played a pivotal role in proposing and initiating the Human Genome Program in 1986. The U.S

  7. Canonical DNA Repair Pathways Influence R-Loop-Driven Genome Instability.

    PubMed

    Stirling, Peter C; Hieter, Philip

    2017-10-27

    DNA repair defects create cancer predisposition in humans by fostering a higher rate of mutations. While DNA repair is quite well characterized, recent studies have identified previously unrecognized relationships between DNA repair and R-loop-mediated genome instability. R-loops are three-stranded nucleic acid structures in which RNA binds to genomic DNA to displace a loop of single-stranded DNA. Mutations in homologous recombination, nucleotide excision repair, crosslink repair, and DNA damage checkpoints have all now been linked to formation and function of transcription-coupled R-loops. This perspective will summarize recent literature linking DNA repair to R-loop-mediated genomic instability and discuss how R-loops may contribute to mutagenesis in DNA-repair-deficient cancers. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. DNA damage in cells exhibiting radiation-induced genomic instability

    DOE PAGES

    Keszenman, Deborah J.; Kolodiuk, Lucia; Baulch, Janet E.

    2015-02-22

    Cells exhibiting radiation induced genomic instability exhibit varied spectra of genetic and chromosomal aberrations. Even so, oxidative stress remains a common theme in the initiation and/or perpetuation of this phenomenon. Isolated oxidatively modified bases, abasic sites, DNA single strand breaks and clustered DNA damage are induced in normal mammalian cultured cells and tissues due to endogenous reactive oxygen species generated during normal cellular metabolism in an aerobic environment. While sparse DNA damage may be easily repaired, clustered DNA damage may lead to persistent cytotoxic or mutagenic events that can lead to genomic instability. In this study, we tested the hypothesismore » that DNA damage signatures characterised by altered levels of endogenous, potentially mutagenic, types of DNA damage and chromosomal breakage are related to radiation-induced genomic instability and persistent oxidative stress phenotypes observed in the chromosomally unstable progeny of irradiated cells. The measurement of oxypurine, oxypyrimidine and abasic site endogenous DNA damage showed differences in non-double-strand breaks (DSB) clusters among the three of the four unstable clones evaluated as compared to genomically stable clones and the parental cell line. These three unstable clones also had increased levels of DSB clusters. The results of this study demonstrate that each unstable cell line has a unique spectrum of persistent damage and lead us to speculate that alterations in DNA damage signaling and repair may be related to the perpetuation of genomic instability.« less

  9. Defining functional DNA elements in the human genome

    PubMed Central

    Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

    2014-01-01

    With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594

  10. [Quality of DNA from archival pathological samples of gallbladder cancer].

    PubMed

    Roa, Iván; de Toro, Gonzalo; Sánchez, Tamara; Slater, Jeannie; Ziegler, Anne Marie; Game, Anakaren; Arellano, Leonardo; Schalper, Kurt; de Aretxabala, Xabier

    2013-12-01

    The quality of the archival samples stored at pathology services could be a limiting factor for molecular biology studies. To determine the quality of DNA extracted from gallbladder cancer samples at different institutions. One hundred ninety four samples coming from five medical centers in Chile, were analyzed. DNA extraction was quantified determining genomic DNA concentration. The integrity of DNA was determined by polymerase chain reaction amplification of different length fragments of a constitutive gene (β-globin products of 110, 268 and 501 base pairs). The mean DNA concentration obtained in 194 gallbladder cancer samples was 48 ± 43.1 ng/µl. In 22% of samples, no amplification was achieved despite obtaining a mean DNA concentration of 58.3 ng/ul. In 81, 67 and 22% of samples, a DNA amplification of at least 110, 268 or 501 base pairs was obtained, respectively. No differences in DNA concentration according to the source of the samples were demonstrated. However, there were marked differences in DNA integrity among participating centers. Samples from public hospitals were of lower quality than those from private clinics. Despite some limitations, in 80% of cases, the integrity of DNA in archival samples from pathology services in our country would allow the use of molecular biology techniques.

  11. SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells.

    PubMed

    Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang

    2018-01-01

    Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. © 2018 Han et al.; Published by Cold Spring Harbor Laboratory Press.

  12. SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells

    PubMed Central

    Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang

    2018-01-01

    Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. PMID:29208629

  13. DNA Repair and Genome Maintenance in Bacillus subtilis

    PubMed Central

    Lenhart, Justin S.; Schroeder, Jeremy W.; Walsh, Brian W.

    2012-01-01

    Summary: From microbes to multicellular eukaryotic organisms, all cells contain pathways responsible for genome maintenance. DNA replication allows for the faithful duplication of the genome, whereas DNA repair pathways preserve DNA integrity in response to damage originating from endogenous and exogenous sources. The basic pathways important for DNA replication and repair are often conserved throughout biology. In bacteria, high-fidelity repair is balanced with low-fidelity repair and mutagenesis. Such a balance is important for maintaining viability while providing an opportunity for the advantageous selection of mutations when faced with a changing environment. Over the last decade, studies of DNA repair pathways in bacteria have demonstrated considerable differences between Gram-positive and Gram-negative organisms. Here we review and discuss the DNA repair, genome maintenance, and DNA damage checkpoint pathways of the Gram-positive bacterium Bacillus subtilis. We present their molecular mechanisms and compare the functions and regulation of several pathways with known information on other organisms. We also discuss DNA repair during different growth phases and the developmental program of sporulation. In summary, we present a review of the function, regulation, and molecular mechanisms of DNA repair and mutagenesis in Gram-positive bacteria, with a strong emphasis on B. subtilis. PMID:22933559

  14. Fabrication of high quality cDNA microarray using a small amount of cDNA.

    PubMed

    Park, Chan Hee; Jeong, Ha Jin; Jung, Jae Jun; Lee, Gui Yeon; Kim, Sang-Chul; Kim, Tae Soo; Yang, Sang Hwa; Chung, Hyun Cheol; Rha, Sun Young

    2004-05-01

    DNA microarray technology has become an essential part of biological research. It enables the genome-scale analysis of gene expression in various types of model systems. Manufacturing high quality cDNA microarrays of microdeposition type depends on some key factors including a printing device, spotting pins, glass slides, spotting solution, and humidity during spotting. UsingEthe Microgrid II TAS model printing device, this study defined the optimal conditions for producing high density, high quality cDNA microarrays with the least amount of cDNA product. It was observed that aminosilane-modified slides were superior to other types of surface modified-slides. A humidity of 30+/-3% in a closed environment and the overnight drying of the spotted slides gave the best conditions for arraying. In addition, the cDNA dissolved in 30% DMSO gave the optimal conditions for spotting compared to the 1X ArrayIt, 3X SSC and 50% DMSO. Lastly, cDNA in the concentration range of 100-300 ng/ micro l was determined to be best for arraying and post-processing. Currently, the printing system in this study yields reproducible 9000 spots with a spot size 150 mm diameter, and a 200 nm spot spacing.

  15. [Genome-scale sequence data processing and epigenetic analysis of DNA methylation].

    PubMed

    Wang, Ting-Zhang; Shan, Gao; Xu, Jian-Hong; Xue, Qing-Zhong

    2013-06-01

    A new approach recently developed for detecting cytosine DNA methylation (mC) and analyzing the genome-scale DNA methylation profiling, is called BS-Seq which is based on bisulfite conversion of genomic DNA combined with next-generation sequencing. The method can not only provide an insight into the difference of genome-scale DNA methylation among different organisms, but also reveal the conservation of DNA methylation in all contexts and nucleotide preference for different genomic regions, including genes, exons, and repetitive DNA sequences. It will be helpful to under-stand the epigenetic impacts of cytosine DNA methylation on the regulation of gene expression and maintaining silence of repetitive sequences, such as transposable elements. In this paper, we introduce the preprocessing steps of DNA methylation data, by which cytosine (C) and guanine (G) in the reference sequence are transferred to thymine (T) and adenine (A), and cytosine in reads is transferred to thymine, respectively. We also comprehensively review the main content of the DNA methylation analysis on the genomic scale: (1) the cytosine methylation under the context of different sequences; (2) the distribution of genomic methylcytosine; (3) DNA methylation context and the preference for the nucleotides; (4) DNA- protein interaction sites of DNA methylation; (5) degree of methylation of cytosine in the different structural elements of genes. DNA methylation analysis technique provides a powerful tool for the epigenome study in human and other species, and genes and environment interaction, and founds the theoretical basis for further development of disease diagnostics and therapeutics in human.

  16. A highly efficient strategy to determine genotypes of genetically-engineered mice using genomic DNA purified from hair roots.

    PubMed

    Otaño-Rivera, Víctor; Boakye, Amma; Grobe, Nadja; Almutairi, Mohammed M; Kursan, Shams; Mattis, Lesan K; Castrop, Hayo; Gurley, Susan B; Elased, Khalid M; Boivin, Gregory P; Di Fulvio, Mauricio

    2017-04-01

    Genotyping of genetically-engineered mice is necessary for the effective design of breeding strategies and identification of mutant mice. This process relies on the identification of DNA markers introduced into genomic sequences of mice, a task usually performed using the polymerase chain reaction (PCR). Clearly, the limiting step in genotyping is isolating pure genomic DNA. Isolation of mouse DNA for genotyping typically involves painful procedures such as tail snip, digit removal, or ear punch. Although the harvesting of hair has previously been proposed as a source of genomic DNA, there has been a perceived complication and reluctance to use this non-painful technique because of low DNA yields and fear of contamination. In this study we developed a simple, economic, and efficient strategy using Chelex® resins to purify genomic DNA from hair roots of mice which are suitable for genotyping. Upon comparison with standard DNA purification methods using a commercially available kit, we demonstrate that Chelex® efficiently and consistently purifies high-quality DNA from hair roots, minimizing pain, shortening time and reducing costs associated with the determination of accurate genotypes. Therefore, the use of hair roots combined with Chelex® is a reliable and more humane alternative for DNA genotyping.

  17. YAP controls retinal stem cell DNA replication timing and genomic stability

    PubMed Central

    Cabochette, Pauline; Vega-Lopez, Guillermo; Bitard, Juliette; Parain, Karine; Chemouny, Romain; Masson, Christel; Borday, Caroline; Hedderich, Marie; Henningfeld, Kristine A; Locker, Morgane; Bronchain, Odile; Perron, Muriel

    2015-01-01

    The adult frog retina retains a reservoir of active neural stem cells that contribute to continuous eye growth throughout life. We found that Yap, a downstream effector of the Hippo pathway, is specifically expressed in these stem cells. Yap knock-down leads to an accelerated S-phase and an abnormal progression of DNA replication, a phenotype likely mediated by upregulation of c-Myc. This is associated with an increased occurrence of DNA damage and eventually p53-p21 pathway-mediated cell death. Finally, we identified PKNOX1, a transcription factor involved in the maintenance of genomic stability, as a functional and physical interactant of YAP. Altogether, we propose that YAP is required in adult retinal stem cells to regulate the temporal firing of replication origins and quality control of replicated DNA. Our data reinforce the view that specific mechanisms dedicated to S-phase control are at work in stem cells to protect them from genomic instability. DOI: http://dx.doi.org/10.7554/eLife.08488.001 PMID:26393999

  18. Migration of mitochondrial DNA in the nuclear genome of colorectal adenocarcinoma.

    PubMed

    Srinivasainagendra, Vinodh; Sandel, Michael W; Singh, Bhupendra; Sundaresan, Aishwarya; Mooga, Ved P; Bajpai, Prachi; Tiwari, Hemant K; Singh, Keshav K

    2017-03-29

    Colorectal adenocarcinomas are characterized by abnormal mitochondrial DNA (mtDNA) copy number and genomic instability, but a molecular interaction between mitochondrial and nuclear genome remains unknown. Here we report the discovery of increased copies of nuclear mtDNA (NUMT) in colorectal adenocarcinomas, which supports link between mtDNA and genomic instability in the nucleus. We name this phenomenon of nuclear occurrence of mitochondrial component as numtogenesis. We provide a description of NUMT abundance and distribution in tumor versus matched blood-derived normal genomes. Whole-genome sequence data were obtained for colon adenocarcinoma and rectum adenocarcinoma patients participating in The Cancer Genome Atlas, via the Cancer Genomics Hub, using the GeneTorrent file acquisition tool. Data were analyzed to determine NUMT proportion and distribution on a genome-wide scale. A NUMT suppressor gene was identified by comparing numtogenesis in other organisms. Our study reveals that colorectal adenocarcinoma genomes, on average, contains up to 4.2-fold more somatic NUMTs than matched normal genomes. Women colorectal tumors contained more NUMT than men. NUMT abundance in tumor predicted parallel abundance in blood. NUMT abundance positively correlated with GC content and gene density. Increased numtogenesis was observed with higher mortality. We identified YME1L1, a human homolog of yeast YME1 (yeast mitochondrial DNA escape 1) to be frequently mutated in colorectal tumors. YME1L1 was also mutated in tumors derived from other tissues. We show that inactivation of YME1L1 results in increased transfer of mtDNA in the nuclear genome. Our study demonstrates increased somatic transfer of mtDNA in colorectal tumors. Our study also reveals sex-based differences in frequency of NUMT occurrence and that NUMT in blood reflects NUMT in tumors, suggesting NUMT may be used as a biomarker for tumorigenesis. We identify YME1L1 as the first NUMT suppressor gene in human and

  19. DNA Precursor Metabolism and Mitochondrial Genome Stability

    DTIC Science & Technology

    2003-04-01

    mitochondrial DNA replication , to learn how the pool sizes are regulated, and to understand how perturbations of normal dNTP metabolism within the...mitochondria raises the possibility, however unlikely, that it is serving a function in addition to its role in DNA replication . The literature on non-DNA...is below since many authors do not follow the 200 word limit 14. SUBJECT TERMS Mitochondria, Genome stability, DNA precursors, Mitochondrial DNA

  20. Extraction of high-quality DNA from ethanol-preserved tropical plant tissues.

    PubMed

    Bressan, Eduardo A; Rossi, Mônica L; Gerald, Lee T S; Figueira, Antonio

    2014-04-24

    Proper conservation of plant samples, especially during remote field collection, is essential to assure quality of extracted DNA. Tropical plant species contain considerable amounts of secondary compounds, such as polysaccharides, phenols, and latex, which affect DNA quality during extraction. The suitability of ethanol (96% v/v) as a preservative solution prior to DNA extraction was evaluated using leaves of Jatropha curcas and other tropical species. Total DNA extracted from leaf samples stored in liquid nitrogen or ethanol from J. curcas and other tropical species (Theobroma cacao, Coffea arabica, Ricinus communis, Saccharum spp., and Solanum lycopersicon) was similar in quality, with high-molecular-weight DNA visualized by gel electrophoresis. DNA quality was confirmed by digestion with EcoRI or HindIII and by amplification of the ribosomal gene internal transcribed spacer region. Leaf tissue of J. curcas was analyzed by light and transmission electron microscopy before and after exposure to ethanol. Our results indicate that leaf samples can be successfully preserved in ethanol for long periods (30 days) as a viable method for fixation and conservation of DNA from leaves. The success of this technique is likely due to reduction or inactivation of secondary metabolites that could contaminate or degrade genomic DNA. Tissue conservation in 96% ethanol represents an attractive low-cost alternative to commonly used methods for preservation of samples for DNA extraction. This technique yields DNA of equivalent quality to that obtained from fresh or frozen tissue.

  1. Extraction of high-quality DNA from ethanol-preserved tropical plant tissues

    PubMed Central

    2014-01-01

    Background Proper conservation of plant samples, especially during remote field collection, is essential to assure quality of extracted DNA. Tropical plant species contain considerable amounts of secondary compounds, such as polysaccharides, phenols, and latex, which affect DNA quality during extraction. The suitability of ethanol (96% v/v) as a preservative solution prior to DNA extraction was evaluated using leaves of Jatropha curcas and other tropical species. Results Total DNA extracted from leaf samples stored in liquid nitrogen or ethanol from J. curcas and other tropical species (Theobroma cacao, Coffea arabica, Ricinus communis, Saccharum spp., and Solanum lycopersicon) was similar in quality, with high-molecular-weight DNA visualized by gel electrophoresis. DNA quality was confirmed by digestion with EcoRI or HindIII and by amplification of the ribosomal gene internal transcribed spacer region. Leaf tissue of J. curcas was analyzed by light and transmission electron microscopy before and after exposure to ethanol. Our results indicate that leaf samples can be successfully preserved in ethanol for long periods (30 days) as a viable method for fixation and conservation of DNA from leaves. The success of this technique is likely due to reduction or inactivation of secondary metabolites that could contaminate or degrade genomic DNA. Conclusions Tissue conservation in 96% ethanol represents an attractive low-cost alternative to commonly used methods for preservation of samples for DNA extraction. This technique yields DNA of equivalent quality to that obtained from fresh or frozen tissue. PMID:24761774

  2. Reversing DNA Methylation: Mechanisms, Genomics, and Biological Functions

    PubMed Central

    Wu, Hao; Zhang, Yi

    2014-01-01

    Methylation of cytosines in the mammalian genome represents a key epigenetic modification and is dynamically regulated during development. Compelling evidence now suggests that dynamic regulation of DNA methylation is mainly achieved through a cyclic enzymatic cascade comprised of cytosine methylation, iterative oxidation of methyl group by TET dioxygenases, and restoration of unmodified cytosines by either replication-dependent dilution or DNA glycosylase-initiated base excision repair. In this review, we discuss the mechanism and function of DNA demethylation in mammalian genomes, focusing particularly on how developmental modulation of the cytosine-modifying pathway is coupled to active reversal of DNA methylation in diverse biological processes. PMID:24439369

  3. Guidelines for whole genome bisulphite sequencing of intact and FFPET DNA on the Illumina HiSeq X Ten.

    PubMed

    Nair, Shalima S; Luu, Phuc-Loi; Qu, Wenjia; Maddugoda, Madhavi; Huschtscha, Lily; Reddel, Roger; Chenevix-Trench, Georgia; Toso, Martina; Kench, James G; Horvath, Lisa G; Hayes, Vanessa M; Stricker, Phillip D; Hughes, Timothy P; White, Deborah L; Rasko, John E J; Wong, Justin J-L; Clark, Susan J

    2018-05-28

    Comprehensive genome-wide DNA methylation profiling is critical to gain insights into epigenetic reprogramming during development and disease processes. Among the different genome-wide DNA methylation technologies, whole genome bisulphite sequencing (WGBS) is considered the gold standard for assaying genome-wide DNA methylation at single base resolution. However, the high sequencing cost to achieve the optimal depth of coverage limits its application in both basic and clinical research. To achieve 15× coverage of the human methylome, using WGBS, requires approximately three lanes of 100-bp-paired-end Illumina HiSeq 2500 sequencing. It is important, therefore, for advances in sequencing technologies to be developed to enable cost-effective high-coverage sequencing. In this study, we provide an optimised WGBS methodology, from library preparation to sequencing and data processing, to enable 16-20× genome-wide coverage per single lane of HiSeq X Ten, HCS 3.3.76. To process and analyse the data, we developed a WGBS pipeline (METH10X) that is fast and can call SNPs. We performed WGBS on both high-quality intact DNA and degraded DNA from formalin-fixed paraffin-embedded tissue. First, we compared different library preparation methods on the HiSeq 2500 platform to identify the best method for sequencing on the HiSeq X Ten. Second, we optimised the PhiX and genome spike-ins to achieve higher quality and coverage of WGBS data on the HiSeq X Ten. Third, we performed integrated whole genome sequencing (WGS) and WGBS of the same DNA sample in a single lane of HiSeq X Ten to improve data output. Finally, we compared methylation data from the HiSeq 2500 and HiSeq X Ten and found high concordance (Pearson r > 0.9×). Together we provide a systematic, efficient and complete approach to perform and analyse WGBS on the HiSeq X Ten. Our protocol allows for large-scale WGBS studies at reasonable processing time and cost on the HiSeq X Ten platform.

  4. Detection of Streptococcus mutans Genomic DNA in Human DNA Samples Extracted from Saliva and Blood

    PubMed Central

    Vieira, Alexandre R.; Deeley, Kathleen B.; Callahan, Nicholas F.; Noel, Jacqueline B.; Anjomshoaa, Ida; Carricato, Wendy M.; Schulhof, Louise P.; DeSensi, Rebecca S.; Gandhi, Pooja; Resick, Judith M.; Brandon, Carla A.; Rozhon, Christopher; Patir, Asli; Yildirim, Mine; Poletta, Fernando A.; Mereb, Juan C.; Letra, Ariadne; Menezes, Renato; Wendell, Steven; Lopez-Camelo, Jorge S.; Castilla, Eduardo E.; Orioli, Iêda M.; Seymen, Figen; Weyant, Robert J.; Crout, Richard; McNeil, Daniel W.; Modesto, Adriana; Marazita, Mary L.

    2011-01-01

    Caries is a multifactorial disease, and studies aiming to unravel the factors modulating its etiology must consider all known predisposing factors. One major factor is bacterial colonization, and Streptococcus mutans is the main microorganism associated with the initiation of the disease. In our studies, we have access to DNA samples extracted from human saliva and blood. In this report, we tested a real-time PCR assay developed to detect copies of genomic DNA from Streptococcus mutans in 1,424 DNA samples from humans. Our results suggest that we can determine the presence of genomic DNA copies of Streptococcus mutans in both DNA samples from caries-free and caries-affected individuals. However, we were not able to detect the presence of genomic DNA copies of Streptococcus mutans in any DNA samples extracted from peripheral blood, which suggests the assay may not be sensitive enough for this goal. Values of the threshold cycle of the real-time PCR reaction correlate with higher levels of caries experience in children, but this correlation could not be detected for adults. PMID:21731912

  5. Droplet digital PCR-based EGFR mutation detection with an internal quality control index to determine the quality of DNA.

    PubMed

    Kim, Sung-Su; Choi, Hyun-Jeung; Kim, Jin Ju; Kim, M Sun; Lee, In-Seon; Byun, Bohyun; Jia, Lina; Oh, Myung Ryurl; Moon, Youngho; Park, Sarah; Choi, Joon-Seok; Chae, Seoung Wan; Nam, Byung-Ho; Kim, Jin-Soo; Kim, Jihun; Min, Byung Soh; Lee, Jae Seok; Won, Jae-Kyung; Cho, Soo Youn; Choi, Yoon-La; Shin, Young Kee

    2018-01-11

    In clinical translational research and molecular in vitro diagnostics, a major challenge in the detection of genetic mutations is overcoming artefactual results caused by the low-quality of formalin-fixed paraffin-embedded tissue (FFPET)-derived DNA (FFPET-DNA). Here, we propose the use of an 'internal quality control (iQC) index' as a criterion for judging the minimum quality of DNA for PCR-based analyses. In a pre-clinical study comparing the results from droplet digital PCR-based EGFR mutation test (ddEGFR test) and qPCR-based EGFR mutation test (cobas EGFR test), iQC index ≥ 0.5 (iQC copies ≥ 500, using 3.3 ng of FFPET-DNA [1,000 genome equivalents]) was established, indicating that more than half of the input DNA was amplifiable. Using this criterion, we conducted a retrospective comparative clinical study of the ddEGFR and cobas EGFR tests for the detection of EGFR mutations in non-small cell lung cancer (NSCLC) FFPET-DNA samples. Compared with the cobas EGFR test, the ddEGFR test exhibited superior analytical performance and equivalent or higher clinical performance. Furthermore, iQC index is a reliable indicator of the quality of FFPET-DNA and could be used to prevent incorrect diagnoses arising from low-quality samples.

  6. Sequencing of the large dsDNA genome of Oryctes rhinoceros nudivirus using multiple displacement amplification of nanogram amounts of virus DNA.

    PubMed

    Wang, Yongjie; Kleespies, Regina G; Ramle, Moslim B; Jehle, Johannes A

    2008-09-01

    The genomic sequence analysis of many large dsDNA viruses is hampered by the lack of enough sample materials. Here, we report a whole genome amplification of the Oryctes rhinoceros nudivirus (OrNV) isolate Ma07 starting from as few as about 10 ng of purified viral DNA by application of phi29 DNA polymerase- and exonuclease-resistant random hexamer-based multiple displacement amplification (MDA) method. About 60 microg of high molecular weight DNA with fragment sizes of up to 25 kbp was amplified. A genomic DNA clone library was generated using the product DNA. After 8-fold sequencing coverage, the 127,615 bp of OrNV whole genome was sequenced successfully. The results demonstrate that the MDA-based whole genome amplification enables rapid access to genomic information from exiguous virus samples.

  7. Selective Gene Delivery for Integrating Exogenous DNA into Plastid and Mitochondrial Genomes Using Peptide-DNA Complexes.

    PubMed

    Yoshizumi, Takeshi; Oikawa, Kazusato; Chuah, Jo-Ann; Kodama, Yutaka; Numata, Keiji

    2018-05-14

    Selective gene delivery into organellar genomes (mitochondrial and plastid genomes) has been limited because of a lack of appropriate platform technology, even though these organelles are essential for metabolite and energy production. Techniques for selective organellar modification are needed to functionally improve organelles and produce transplastomic/transmitochondrial plants. However, no method for mitochondrial genome modification has yet been established for multicellular organisms including plants. Likewise, modification of plastid genomes has been limited to a few plant species and algae. In the present study, we developed ionic complexes of fusion peptides containing organellar targeting signal and plasmid DNA for selective delivery of exogenous DNA into the plastid and mitochondrial genomes of intact plants. This is the first report of exogenous DNA being integrated into the mitochondrial genomes of not only plants, but also multicellular organisms in general. This fusion peptide-mediated gene delivery system is a breakthrough platform for both plant organellar biotechnology and gene therapy for mitochondrial diseases in animals.

  8. Evaluation of the efficacy of constitutional array-based comparative genomic hybridization in the diagnosis of aneuploidy using genomic and amplified DNA.

    PubMed

    Tan, Niap H; Palmer, Rodger; Wang, Rubin

    2010-02-01

    Array-based comparative genomic hybridization (array CGH) is a new molecular technique that has the potential to revolutionize cytogenetics. However, use of high resolution array CGH in the clinical setting is plagued by the problem of widespread copy number variations (CNV) in the human genome. Constitutional microarray, containing only clones that interrogate regions of known constitutional syndromes, may circumvent the dilemma of detecting CNV of unknown clinical significance. The present study investigated the efficacy of constitutional microarray in the diagnosis of trisomy. Test samples included genomic DNA from trisomic cell lines, amplification products of 50 ng of genomic DNA and whole genome amplification products of single cells. DNA amplification was achieved by means of multiple displacement amplification (MDA) over 16 h. The trisomic and sex chromosomes copy number imbalances in the genomic DNA were correctly identified by the constitutional microarrays. However, there was a failure to detect the trisomy in the amplification products of 50 ng of genomic DNA and whole genome amplification products of single cells. Using carefully selected clones, Spectral Genomics constitutional microarray was able to detect the chromosomal copy number imbalances in genomic DNA without the confounding effects of CNV. The diagnostic failure in amplified DNA samples could be attributed to the amplification process. The MDA duration of 16 h generated excessive amount of biases and shortening the duration might minimize the problem.

  9. Quality control and quality assurance in genotypic data for genome-wide association studies

    PubMed Central

    Laurie, Cathy C.; Doheny, Kimberly F.; Mirel, Daniel B.; Pugh, Elizabeth W.; Bierut, Laura J.; Bhangale, Tushar; Boehm, Frederick; Caporaso, Neil E.; Cornelis, Marilyn C.; Edenberg, Howard J.; Gabriel, Stacy B.; Harris, Emily L.; Hu, Frank B.; Jacobs, Kevin; Kraft, Peter; Landi, Maria Teresa; Lumley, Thomas; Manolio, Teri A.; McHugh, Caitlin; Painter, Ian; Paschall, Justin; Rice, John P.; Rice, Kenneth M.; Zheng, Xiuwen; Weir, Bruce S.

    2011-01-01

    Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies. This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium (HWE) test p-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis (PCA) to SNP selection. The methods are illustrated with examples from the ‘Gene Environment Association Studies’ (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of genome-wide association studies. PMID:20718045

  10. Small terminase couples viral DNA-binding to genome-packaging ATPase activity

    PubMed Central

    Roy, Ankoor; Bhardwaj, Anshul; Datta, Pinaki; Lander, Gabriel C.; Cingolani, Gino

    2012-01-01

    SUMMARY Packaging of viral genomes into empty procapsids is powered by a large DNA-packaging motor. In most viruses, this machine is composed of a large (L) and a small (S) terminase subunit complexed with a dodecamer of portal protein. Here, we describe the 1.75 Å crystal structure of the bacteriophage P22 S-terminase in a nonameric conformation. The structure presents a central channel ~23 Å in diameter, sufficiently large to accommodate hydrated B-DNA. The last 23 residues of S-terminase are essential for binding to DNA and assembly to L-terminase. Upon binding to its own DNA, S-terminase functions as a specific activator of L-terminase ATPase activity. The DNA-dependent stimulation of ATPase activity thus rationalizes the exclusive specificity of genome-packaging motors for viral DNA in the crowd of host DNA, ensuring fidelity of packaging and avoiding wasteful ATP hydrolysis. This posits a model for DNA-dependent activation of genome-packaging motors of general interest in virology. PMID:22771211

  11. Development of a Method to Implement Whole-Genome Bisulfite Sequencing of cfDNA from Cancer Patients and a Mouse Tumor Model.

    PubMed

    Maggi, Elaine C; Gravina, Silvia; Cheng, Haiying; Piperdi, Bilal; Yuan, Ziqiang; Dong, Xiao; Libutti, Steven K; Vijg, Jan; Montagna, Cristina

    2018-01-01

    The goal of this study was to develop a method for whole genome cell-free DNA (cfDNA) methylation analysis in humans and mice with the ultimate goal to facilitate the identification of tumor derived DNA methylation changes in the blood. Plasma or serum from patients with pancreatic neuroendocrine tumors or lung cancer, and plasma from a murine model of pancreatic adenocarcinoma was used to develop a protocol for cfDNA isolation, library preparation and whole-genome bisulfite sequencing of ultra low quantities of cfDNA, including tumor-specific DNA. The protocol developed produced high quality libraries consistently generating a conversion rate >98% that will be applicable for the analysis of human and mouse plasma or serum to detect tumor-derived changes in DNA methylation.

  12. Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.

    PubMed

    Schnitzler, P; Darai, G

    1989-09-01

    The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.

  13. No genome barriers to promiscuous DNA

    NASA Astrophysics Data System (ADS)

    Lewin, R.

    1984-06-01

    Farrelly and Butow (1983) used the term 'promiscuous DNA' in their report of the apparent natural transfer of yeast mitochondrial DNA sequences into the nuclear genome. Ellis (1982) applied the same term in an editorial comment. It is pointed out since that time the subject of DNA's promiscuity has exploded with a series of reports. According to a report by Stern (1984), movement of DNA sequences between chloroplasts and mitochondria is not just a rare event but is a rampant process. It was recently concluded that 'the widespread presence of ctDNA sequences in plant mtDNA is best regarded as a dramatic demonstration of the dynamo nature of interactions between the chloroplast and the mitochondrion, similar to the ongoing process of interorganellar DNA transfer already documented between mitochondrion and nucleus and between chloroplast and nucleus'.

  14. Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase.

    PubMed

    Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R; Jha, Rajiv Kumar; Cole, Stewart T; Nagaraja, Valakunja

    2017-05-01

    Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase.

  15. Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase

    PubMed Central

    Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R.; Jha, Rajiv Kumar

    2017-01-01

    Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase. PMID:28463980

  16. Genomic gigantism: DNA loss is slow in mountain grasshoppers.

    PubMed

    Bensasson, D; Petrov, D A; Zhang, D X; Hartl, D L; Hewitt, G M

    2001-02-01

    Several studies have shown DNA loss to be inversely correlated with genome size in animals. These studies include a comparison between Drosophila and the cricket, Laupala, but there has been no assessment of DNA loss in insects with very large genomes. Podisma pedestris, the brown mountain grasshopper, has a genome over 100 times as large as that of Drosophila and 10 times as large as that of Laupala. We used 58 paralogous nuclear pseudogenes of mitochondrial origin to study the characteristics of insertion, deletion, and point substitution in P. pedestris and Italopodisma. In animals, these pseudogenes are "dead on arrival"; they are abundant in many different eukaryotes, and their mitochondrial origin simplifies the identification of point substitutions accumulated in nuclear pseudogene lineages. There appears to be a mononucleotide repeat within the 643-bp pseudogene sequence studied that acts as a strong hot spot for insertions or deletions (indels). Because the data for other insect species did not contain such an unusual region, hot spots were excluded from species comparisons. The rate of DNA loss relative to point substitution appears to be considerably and significantly lower in the grasshoppers studied than in Drosophila or Laupala. This suggests that the inverse correlation between genome size and the rate of DNA loss can be extended to comparisons between insects with large or gigantic genomes (i.e., Laupala and Podisma). The low rate of DNA loss implies that in grasshoppers, the accumulation of point mutations is a more potent force for obscuring ancient pseudogenes than their loss through indel accumulation, whereas the reverse is true for Drosophila. The main factor contributing to the difference in the rates of DNA loss estimated for grasshoppers, crickets, and Drosophila appears to be deletion size. Large deletions are relatively rare in Podisma and Italopodisma.

  17. T-DNA-genome junctions form early after infection and are influenced by the chromatin state of the host genome

    PubMed Central

    Tripathi, Pooja; Muth, Theodore R.

    2017-01-01

    Agrobacterium tumefaciens mediated T-DNA integration is a common tool for plant genome manipulation. However, there is controversy regarding whether T-DNA integration is biased towards genes or randomly distributed throughout the genome. In order to address this question, we performed high-throughput mapping of T-DNA-genome junctions obtained in the absence of selection at several time points after infection. T-DNA-genome junctions were detected as early as 6 hours post-infection. T-DNA distribution was apparently uniform throughout the chromosomes, yet local biases toward AT-rich motifs and T-DNA border sequence micro-homology were detected. Analysis of the epigenetic landscape of previously isolated sites of T-DNA integration in Kanamycin-selected transgenic plants showed an association with extremely low methylation and nucleosome occupancy. Conversely, non-selected junctions from this study showed no correlation with methylation and had chromatin marks, such as high nucleosome occupancy and high H3K27me3, that correspond to three-dimensional-interacting heterochromatin islands embedded within euchromatin. Such structures may play a role in capturing and silencing invading T-DNA. PMID:28742090

  18. Predictive genomics DNA profiling for athletic performance.

    PubMed

    Kambouris, Marios; Ntalouka, Foteini; Ziogas, Georgios; Maffulli, Nicola

    2012-12-01

    Genes control biological processes such as muscle, cartilage and bone formation, muscle energy production and metabolism (mitochondriogenesis, lactic acid removal), blood and tissue oxygenation (erythropoiesis, angiogenesis, vasodilatation), all essential in sport and athletic performance. DNA sequence variations in such genes confer genetic advantages that can be exploited, or genetic 'barriers' that could be overcome to achieve optimal athletic performance. Predictive Genomic DNA Profiling for athletic performance reveals genetic variations that may be associated with better suitability for endurance, strength and speed sports, vulnerability to sports-related injuries and individualized nutritional requirements. Knowledge of genetic 'suitability' in respect to endurance capacity or strength and speed would lead to appropriate sport and athletic activity selection. Knowledge of genetic advantages and barriers would 'direct' an individualized training program, nutritional plan and nutritional supplementation to achieving optimal performance, overcoming 'barriers' that results from intense exercise and pressure under competition with minimum waste of time and energy and avoidance of health risks (hypertension, cardiovascular disease, inflammation, and musculoskeletal injuries) related to exercise, training and competition. Predictive Genomics DNA profiling for Athletics and Sports performance is developing into a tool for athletic activity and sport selection and for the formulation of individualized and personalized training and nutritional programs to optimize health and performance for the athlete. Human DNA sequences are patentable in some countries, while in others DNA testing methodologies [unless proprietary], are non patentable. On the other hand, gene and variant selection, genotype interpretation and the risk and suitability assigning algorithms based on the specific Genomic variants used are amenable to patent protection.

  19. Human genomic DNA quantitation system, H-Quant: development and validation for use in forensic casework.

    PubMed

    Shewale, Jaiprakash G; Schneida, Elaine; Wilson, Jonathan; Walker, Jerilyn A; Batzer, Mark A; Sinha, Sudhir K

    2007-03-01

    The human DNA quantification (H-Quant) system, developed for use in human identification, enables quantitation of human genomic DNA in biological samples. The assay is based on real-time amplification of AluYb8 insertions in hominoid primates. The relatively high copy number of subfamily-specific Alu repeats in the human genome enables quantification of very small amounts of human DNA. The oligonucleotide primers present in H-Quant are specific for human DNA and closely related great apes. During the real-time PCR, the SYBR Green I dye binds to the DNA that is synthesized by the human-specific AluYb8 oligonucleotide primers. The fluorescence of the bound SYBR Green I dye is measured at the end of each PCR cycle. The cycle at which the fluorescence crosses the chosen threshold correlates to the quantity of amplifiable DNA in that sample. The minimal sensitivity of the H-Quant system is 7.6 pg/microL of human DNA. The amplicon generated in the H-Quant assay is 216 bp, which is within the same range of the common amplifiable short tandem repeat (STR) amplicons. This size amplicon enables quantitation of amplifiable DNA as opposed to a quantitation of degraded or nonamplifiable DNA of smaller sizes. Development and validation studies were performed on the 7500 real-time PCR system following the Quality Assurance Standards for Forensic DNA Testing Laboratories.

  20. Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

    PubMed

    Cao, Yinhe; Tung, Wen-Wen; Gao, J B

    2004-01-01

    With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.

  1. Mutagenic repair of double-stranded DNA breaks in vaccinia virus genomes requires cellular DNA ligase IV activity in the cytosol.

    PubMed

    Luteijn, Rutger David; Drexler, Ingo; Smith, Geoffrey L; Lebbink, Robert Jan; Wiertz, Emmanuel J H J

    2018-06-01

    Poxviruses comprise a group of large dsDNA viruses that include members relevant to human and animal health, such as variola virus, monkeypox virus, cowpox virus and vaccinia virus (VACV). Poxviruses are remarkable for their unique replication cycle, which is restricted to the cytoplasm of infected cells. The independence from the host nucleus requires poxviruses to encode most of the enzymes involved in DNA replication, transcription and processing. Here, we use the CRISPR/Cas9 genome engineering system to induce DNA damage to VACV (strain Western Reserve) genomes. We show that targeting CRISPR/Cas9 to essential viral genes limits virus replication efficiently. Although VACV is a strictly cytoplasmic pathogen, we observed extensive viral genome editing at the target site; this is reminiscent of a non-homologous end-joining DNA repair mechanism. This pathway was not dependent on the viral DNA ligase, but critically involved the cellular DNA ligase IV. Our data show that DNA ligase IV can act outside of the nucleus to allow repair of dsDNA breaks in poxvirus genomes. This pathway might contribute to the introduction of mutations within the genome of poxviruses and may thereby promote the evolution of these viruses.

  2. Molecular Analysis and Genomic Organization of Major DNA Satellites in Banana (Musa spp.)

    PubMed Central

    Čížková, Jana; Hřibová, Eva; Humplíková, Lenka; Christelová, Pavla; Suchánková, Pavla; Doležel, Jaroslav

    2013-01-01

    Satellite DNA sequences consist of tandemly arranged repetitive units up to thousands nucleotides long in head-to-tail orientation. The evolutionary processes by which satellites arise and evolve include unequal crossing over, gene conversion, transposition and extra chromosomal circular DNA formation. Large blocks of satellite DNA are often observed in heterochromatic regions of chromosomes and are a typical component of centromeric and telomeric regions. Satellite-rich loci may show specific banding patterns and facilitate chromosome identification and analysis of structural chromosome changes. Unlike many other genomes, nuclear genomes of banana (Musa spp.) are poor in satellite DNA and the information on this class of DNA remains limited. The banana cultivars are seed sterile clones originating mostly from natural intra-specific crosses within M. acuminata (A genome) and inter-specific crosses between M. acuminata and M. balbisiana (B genome). Previous studies revealed the closely related nature of the A and B genomes, including similarities in repetitive DNA. In this study we focused on two main banana DNA satellites, which were previously identified in silico. Their genomic organization and molecular diversity was analyzed in a set of nineteen Musa accessions, including representatives of A, B and S (M. schizocarpa) genomes and their inter-specific hybrids. The two DNA satellites showed a high level of sequence conservation within, and a high homology between Musa species. FISH with probes for the satellite DNA sequences, rRNA genes and a single-copy BAC clone 2G17 resulted in characteristic chromosome banding patterns in M. acuminata and M. balbisiana which may aid in determining genomic constitution in interspecific hybrids. In addition to improving the knowledge on Musa satellite DNA, our study increases the number of cytogenetic markers and the number of individual chromosomes, which can be identified in Musa. PMID:23372772

  3. Molecular analysis and genomic organization of major DNA satellites in banana (Musa spp.).

    PubMed

    Čížková, Jana; Hřibová, Eva; Humplíková, Lenka; Christelová, Pavla; Suchánková, Pavla; Doležel, Jaroslav

    2013-01-01

    Satellite DNA sequences consist of tandemly arranged repetitive units up to thousands nucleotides long in head-to-tail orientation. The evolutionary processes by which satellites arise and evolve include unequal crossing over, gene conversion, transposition and extra chromosomal circular DNA formation. Large blocks of satellite DNA are often observed in heterochromatic regions of chromosomes and are a typical component of centromeric and telomeric regions. Satellite-rich loci may show specific banding patterns and facilitate chromosome identification and analysis of structural chromosome changes. Unlike many other genomes, nuclear genomes of banana (Musa spp.) are poor in satellite DNA and the information on this class of DNA remains limited. The banana cultivars are seed sterile clones originating mostly from natural intra-specific crosses within M. acuminata (A genome) and inter-specific crosses between M. acuminata and M. balbisiana (B genome). Previous studies revealed the closely related nature of the A and B genomes, including similarities in repetitive DNA. In this study we focused on two main banana DNA satellites, which were previously identified in silico. Their genomic organization and molecular diversity was analyzed in a set of nineteen Musa accessions, including representatives of A, B and S (M. schizocarpa) genomes and their inter-specific hybrids. The two DNA satellites showed a high level of sequence conservation within, and a high homology between Musa species. FISH with probes for the satellite DNA sequences, rRNA genes and a single-copy BAC clone 2G17 resulted in characteristic chromosome banding patterns in M. acuminata and M. balbisiana which may aid in determining genomic constitution in interspecific hybrids. In addition to improving the knowledge on Musa satellite DNA, our study increases the number of cytogenetic markers and the number of individual chromosomes, which can be identified in Musa.

  4. GEAR: genomic enrichment analysis of regional DNA copy number changes.

    PubMed

    Kim, Tae-Min; Jung, Yu-Chae; Rhyu, Mun-Gan; Jung, Myeong Ho; Chung, Yeun-Jun

    2008-02-01

    We developed an algorithm named GEAR (genomic enrichment analysis of regional DNA copy number changes) for functional interpretation of genome-wide DNA copy number changes identified by array-based comparative genomic hybridization. GEAR selects two types of chromosomal alterations with potential biological relevance, i.e. recurrent and phenotype-specific alterations. Then it performs functional enrichment analysis using a priori selected functional gene sets to identify primary and clinical genomic signatures. The genomic signatures identified by GEAR represent functionally coordinated genomic changes, which can provide clues on the underlying molecular mechanisms related to the phenotypes of interest. GEAR can help the identification of key molecular functions that are activated or repressed in the tumor genomes leading to the improved understanding on the tumor biology. GEAR software is available with online manual in the website, http://www.systemsbiology.co.kr/GEAR/.

  5. Recurrent DNA inversion rearrangements in the human genome

    PubMed Central

    Flores, Margarita; Morales, Lucía; Gonzaga-Jauregui, Claudia; Domínguez-Vidaña, Rocío; Zepeda, Cinthya; Yañez, Omar; Gutiérrez, María; Lemus, Tzitziki; Valle, David; Avila, Ma. Carmen; Blanco, Daniel; Medina-Ruiz, Sofía; Meza, Karla; Ayala, Erandi; García, Delfino; Bustos, Patricia; González, Víctor; Girard, Lourdes; Tusie-Luna, Teresa; Dávila, Guillermo; Palacios, Rafael

    2007-01-01

    Several lines of evidence suggest that reiterated sequences in the human genome are targets for nonallelic homologous recombination (NAHR), which facilitates genomic rearrangements. We have used a PCR-based approach to identify breakpoint regions of rearranged structures in the human genome. In particular, we have identified intrachromosomal identical repeats that are located in reverse orientation, which may lead to chromosomal inversions. A bioinformatic workflow pathway to select appropriate regions for analysis was developed. Three such regions overlapping with known human genes, located on chromosomes 3, 15, and 19, were analyzed. The relative proportion of wild-type to rearranged structures was determined in DNA samples from blood obtained from different, unrelated individuals. The results obtained indicate that recurrent genomic rearrangements occur at relatively high frequency in somatic cells. Interestingly, the rearrangements studied were significantly more abundant in adults than in newborn individuals, suggesting that such DNA rearrangements might start to appear during embryogenesis or fetal life and continue to accumulate after birth. The relevance of our results in regard to human genomic variation is discussed. PMID:17389356

  6. Genomic Approach to Understand the Association of DNA Repair with Longevity and Healthy Aging Using Genomic Databases of Oldest-Old Population

    PubMed Central

    Kim, Hyun Soo

    2018-01-01

    Aged population is increasing worldwide due to the aging process that is inevitable. Accordingly, longevity and healthy aging have been spotlighted to promote social contribution of aged population. Many studies in the past few decades have reported the process of aging and longevity, emphasizing the importance of maintaining genomic stability in exceptionally long-lived population. Underlying reason of longevity remains unclear due to its complexity involving multiple factors. With advances in sequencing technology and human genome-associated approaches, studies based on population-based genomic studies are increasing. In this review, we summarize recent longevity and healthy aging studies of human population focusing on DNA repair as a major factor in maintaining genome integrity. To keep pace with recent growth in genomic research, aging- and longevity-associated genomic databases are also briefly introduced. To suggest novel approaches to investigate longevity-associated genetic variants related to DNA repair using genomic databases, gene set analysis was conducted, focusing on DNA repair- and longevity-associated genes. Their biological networks were additionally analyzed to grasp major factors containing genetic variants of human longevity and healthy aging in DNA repair mechanisms. In summary, this review emphasizes DNA repair activity in human longevity and suggests approach to conduct DNA repair-associated genomic study on human healthy aging.

  7. An Adenovirus DNA Replication Factor, but Not Incoming Genome Complexes, Targets PML Nuclear Bodies.

    PubMed

    Komatsu, Tetsuro; Nagata, Kyosuke; Wodrich, Harald

    2016-02-01

    Promyelocytic leukemia protein nuclear bodies (PML-NBs) are subnuclear domains implicated in cellular antiviral responses. Despite the antiviral activity, several nuclear replicating DNA viruses use the domains as deposition sites for the incoming viral genomes and/or as sites for viral DNA replication, suggesting that PML-NBs are functionally relevant during early viral infection to establish productive replication. Although PML-NBs and their components have also been implicated in the adenoviral life cycle, it remains unclear whether incoming adenoviral genome complexes target PML-NBs. Here we show using immunofluorescence and live-cell imaging analyses that incoming adenovirus genome complexes neither localize at nor recruit components of PML-NBs during early phases of infection. We further show that the viral DNA binding protein (DBP), an early expressed viral gene and essential DNA replication factor, independently targets PML-NBs. We show that DBP oligomerization is required to selectively recruit the PML-NB components Sp100 and USP7. Depletion experiments suggest that the absence of one PML-NB component might not affect the recruitment of other components toward DBP oligomers. Thus, our findings suggest a model in which an adenoviral DNA replication factor, but not incoming viral genome complexes, targets and modulates PML-NBs to support a conducive state for viral DNA replication and argue against a generalized concept that PML-NBs target incoming viral genomes. The immediate fate upon nuclear delivery of genomes of incoming DNA viruses is largely unclear. Early reports suggested that incoming genomes of herpesviruses are targeted and repressed by PML-NBs immediately upon nuclear import. Genome localization and/or viral DNA replication has also been observed at PML-NBs for other DNA viruses. Thus, it was suggested that PML-NBs may immediately sense and target nuclear viral genomes and hence serve as sites for deposition of incoming viral genomes and

  8. The protective function of noncoding DNA in genome defense of eukaryotic male germ cells.

    PubMed

    Qiu, Guo-Hua; Huang, Cuiqin; Zheng, Xintian; Yang, Xiaoyan

    2018-04-01

    Peripheral and abundant noncoding DNA has been hypothesized to protect the genome and the central protein-coding sequences against DNA damage in somatic genome. In the cytosol, invading exogenous nucleic acids may first be deactivated by small RNAs encoded by noncoding DNA via mechanisms similar to the prokaryotic CRISPR-Cas system. In the nucleus, the radicals generated by radiation in the cytosol, radiation energy and invading exogenous nucleic acids are absorbed, blocked and/or reduced by peripheral heterochromatin, and damaged DNA in heterochromatin is removed and excluded from the nucleus to the cytoplasm through nuclear pore complexes. To further strengthen the hypothesis, this review summarizes the experimental evidence supporting the protective function of noncoding DNA in the genome of male germ cells. Based on these data, this review provides evidence supporting the protective role of noncoding DNA in the genome defense of sperm genome through similar mechanisms to those of the somatic genome.

  9. A genomic landscape of mitochondrial DNA insertions in the pig nuclear genome provides evolutionary signatures of interspecies admixture.

    PubMed

    Schiavo, Giuseppina; Hoffmann, Orsolya Ivett; Ribani, Anisa; Utzeri, Valerio Joe; Ghionda, Marco Ciro; Bertolini, Francesca; Geraci, Claudia; Bovo, Samuele; Fontanesi, Luca

    2017-10-01

    Nuclear DNA sequences of mitochondrial origin (numts) are derived by insertion of mitochondrial DNA (mtDNA), into the nuclear genome. In this study, we provide, for the first time, a genome picture of numts inserted in the pig nuclear genome. The Sus scrofa reference nuclear genome (Sscrofa10.2) was aligned with circularized and consensus mtDNA sequences using LAST software. A total of 430 numt sequences that may represent 246 different numt integration events (57 numt regions determined by at least two numt sequences and 189 singletons) were identified, covering about 0.0078% of the nuclear genome. Numt integration events were correlated (0.99) to the chromosome length. The longest numt sequence (about 11 kbp) was located on SSC2. Six numts were sequenced and PCR amplified in pigs of European commercial and local pig breeds, of the Chinese Meishan breed and in European wild boars. Three of them were polymorphic for the presence or absence of the insertion. Surprisingly, the estimated age of insertion of two of the three polymorphic numts was more ancient than that of the speciation time of the Sus scrofa, supporting that these polymorphic sites were originated from interspecies admixture that contributed to shape the pig genome. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  10. Extraction of genomic DNA from yeasts for PCR-based applications.

    PubMed

    Lõoke, Marko; Kristjuhan, Kersti; Kristjuhan, Arnold

    2011-05-01

    We have developed a quick and low-cost genomic DNA extraction protocol from yeast cells for PCR-based applications. This method does not require any enzymes, hazardous chemicals, or extreme temperatures, and is especially powerful for simultaneous analysis of a large number of samples. DNA can be efficiently extracted from different yeast species (Kluyveromyces lactis, Hansenula polymorpha, Schizosaccharomyces pombe, Candida albicans, Pichia pastoris, and Saccharomyces cerevisiae). The protocol involves lysis of yeast colonies or cells from liquid culture in a lithium acetate (LiOAc)-SDS solution and subsequent precipitation of DNA with ethanol. Approximately 100 nanograms of total genomic DNA can be extracted from 1 × 10(7) cells. DNA extracted by this method is suitable for a variety of PCR-based applications (including colony PCR, real-time qPCR, and DNA sequencing) for amplification of DNA fragments of ≤ 3500 bp.

  11. A high-throughput Sanger strategy for human mitochondrial genome sequencing

    PubMed Central

    2013-01-01

    Background A population reference database of complete human mitochondrial genome (mtGenome) sequences is needed to enable the use of mitochondrial DNA (mtDNA) coding region data in forensic casework applications. However, the development of entire mtGenome haplotypes to forensic data quality standards is difficult and laborious. A Sanger-based amplification and sequencing strategy that is designed for automated processing, yet routinely produces high quality sequences, is needed to facilitate high-volume production of these mtGenome data sets. Results We developed a robust 8-amplicon Sanger sequencing strategy that regularly produces complete, forensic-quality mtGenome haplotypes in the first pass of data generation. The protocol works equally well on samples representing diverse mtDNA haplogroups and DNA input quantities ranging from 50 pg to 1 ng, and can be applied to specimens of varying DNA quality. The complete workflow was specifically designed for implementation on robotic instrumentation, which increases throughput and reduces both the opportunities for error inherent to manual processing and the cost of generating full mtGenome sequences. Conclusions The described strategy will assist efforts to generate complete mtGenome haplotypes which meet the highest data quality expectations for forensic genetic and other applications. Additionally, high-quality data produced using this protocol can be used to assess mtDNA data developed using newer technologies and chemistries. Further, the amplification strategy can be used to enrich for mtDNA as a first step in sample preparation for targeted next-generation sequencing. PMID:24341507

  12. Quantitative analysis of genomic DNA degradation in whole blood under various storage conditions for molecular diagnostic testing.

    PubMed

    Permenter, Jessalyn; Ishwar, Arjun; Rounsavall, Angie; Smith, Maddie; Faske, Jennifer; Sailey, Charles J; Alfaro, Maria P

    2015-12-01

    Proper storage of whole blood is crucial for isolating nucleic acids from leukocytes and to ensure adequate performance of downstream assays in the molecular diagnostic laboratory. Short-term and long-term storage recommendations are lacking for successful isolation of genomic DNA (gDNA). Container type (EDTA or heparin), temperature (4 °C and room temperature) and time (1-130 days) were assessed as criterion for sample acceptance policies. The percentage of integrated area (%Ti) between 150 and 10,000 bp from the 2200 TapeStation electropherogram was calculated to measure gDNA degradation. Refrigerated EDTA samples yielded gDNA with low %Ti (high quality). Heparinized samples stored at room temperature yielded gDNA of worst quality. Downstream analysis demonstrated that the quality of the gDNA correlated with the quality of the data; samples with high %Ti generated significantly lower levels of high molecular weight amplicons. Recommendations from these analyses include storing blood samples intended for nucleic acid isolation in EDTA tubes at 4 °C for long term storage (>10 days). gDNA should be extracted within 3 days when blood is stored at room temperature regardless of the container. Finally, refrigerated heparinized samples should not be stored longer than 9 days if expecting high quality gDNA isolates. Laboratories should consider many factors, in addition to the results obtained herein, to update their policies for sample acceptance for gDNA extraction intended for molecular genetic testing. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. Partial DNA-guided Cas9 enables genome editing with reduced off-target activity

    PubMed Central

    Yin, Hao; Song, Chun-Qing; Suresh, Sneha; Kwan, Suet-Yan; Wu, Qiongqiong; Walsh, Stephen; Ding, Junmei; Bogorad, Roman L; Zhu, Lihua Julie; Wolfe, Scot A; Koteliansky, Victor; Xue, Wen; Langer, Robert; Anderson, Daniel G

    2018-01-01

    CRISPR–Cas9 is a versatile RNA-guided genome editing tool. Here we demonstrate that partial replacement of RNA nucleotides with DNA nucleotides in CRISPR RNA (crRNA) enables efficient gene editing in human cells. This strategy of partial DNA replacement retains on-target activity when used with both crRNA and sgRNA, as well as with multiple guide sequences. Partial DNA replacement also works for crRNA of Cpf1, another CRISPR system. We find that partial DNA replacement in the guide sequence significantly reduces off-target genome editing through focused analysis of off-target cleavage, measurement of mismatch tolerance and genome-wide profiling of off-target sites. Using the structure of the Cas9–sgRNA complex as a guide, the majority of the 3′ end of crRNA can be replaced with DNA nucleotide, and the 5 - and 3′-DNA-replaced crRNA enables efficient genome editing. Cas9 guided by a DNA–RNA chimera may provide a generalized strategy to reduce both the cost and the off-target genome editing in human cells. PMID:29377001

  14. Genomic profiling of plastid DNA variation in the Mediterranean olive tree

    PubMed Central

    2011-01-01

    Background Characterisation of plastid genome (or cpDNA) polymorphisms is commonly used for phylogeographic, population genetic and forensic analyses in plants, but detecting cpDNA variation is sometimes challenging, limiting the applications of such an approach. In the present study, we screened cpDNA polymorphism in the olive tree (Olea europaea L.) by sequencing the complete plastid genome of trees with a distinct cpDNA lineage. Our objective was to develop new markers for a rapid genomic profiling (by Multiplex PCRs) of cpDNA haplotypes in the Mediterranean olive tree. Results Eight complete cpDNA genomes of Olea were sequenced de novo. The nucleotide divergence between olive cpDNA lineages was low and not exceeding 0.07%. Based on these sequences, markers were developed for studying two single nucleotide substitutions and length polymorphism of 62 regions (with variable microsatellite motifs or other indels). They were then used to genotype the cpDNA variation in cultivated and wild Mediterranean olive trees (315 individuals). Forty polymorphic loci were detected on this sample, allowing the distinction of 22 haplotypes belonging to the three Mediterranean cpDNA lineages known as E1, E2 and E3. The discriminating power of cpDNA variation was particularly low for the cultivated olive tree with one predominating haplotype, but more diversity was detected in wild populations. Conclusions We propose a method for a rapid characterisation of the Mediterranean olive germplasm. The low variation in the cultivated olive tree indicated that the utility of cpDNA variation for forensic analyses is limited to rare haplotypes. In contrast, the high cpDNA variation in wild populations demonstrated that our markers may be useful for phylogeographic and populations genetic studies in O. europaea. PMID:21569271

  15. Evaluation of different sources of DNA for use in genome wide studies and forensic application.

    PubMed

    Al Safar, Habiba S; Abidi, Fatima H; Khazanehdari, Kamal A; Dadour, Ian R; Tay, Guan K

    2011-02-01

    In the field of epidemiology, Genome-Wide Association Studies (GWAS) are commonly used to identify genetic predispositions of many human diseases. Large repositories housing biological specimens for clinical and genetic investigations have been established to store material and data for these studies. The logistics of specimen collection and sample storage can be onerous, and new strategies have to be explored. This study examines three different DNA sources (namely, degraded genomic DNA, amplified degraded genomic DNA and amplified extracted DNA from FTA card) for GWAS using the Illumina platform. No significant difference in call rate was detected between amplified degraded genomic DNA extracted from whole blood and amplified DNA retrieved from FTA™ cards. However, using unamplified-degraded genomic DNA reduced the call rate to a mean of 42.6% compared to amplified DNA extracted from FTA card (mean of 96.6%). This study establishes the utility of FTA™ cards as a viable storage matrix for cells from which DNA can be extracted to perform GWAS analysis.

  16. A Genome-Wide Map of Mitochondrial DNA Recombination in Yeast

    PubMed Central

    Fritsch, Emilie S.; Chabbert, Christophe D.; Klaus, Bernd; Steinmetz, Lars M.

    2014-01-01

    In eukaryotic cells, the production of cellular energy requires close interplay between nuclear and mitochondrial genomes. The mitochondrial genome is essential in that it encodes several genes involved in oxidative phosphorylation. Each cell contains several mitochondrial genome copies and mitochondrial DNA recombination is a widespread process occurring in plants, fungi, protists, and invertebrates. Saccharomyces cerevisiae has proved to be an excellent model to dissect mitochondrial biology. Several studies have focused on DNA recombination in this organelle, yet mostly relied on reporter genes or artificial systems. However, no complete mitochondrial recombination map has been released for any eukaryote so far. In the present work, we sequenced pools of diploids originating from a cross between two different S. cerevisiae strains to detect recombination events. This strategy allowed us to generate the first genome-wide map of recombination for yeast mitochondrial DNA. We demonstrated that recombination events are enriched in specific hotspots preferentially localized in non-protein-coding regions. Additionally, comparison of the recombination profiles of two different crosses showed that the genetic background affects hotspot localization and recombination rates. Finally, to gain insights into the mechanisms involved in mitochondrial recombination, we assessed the impact of individual depletion of four genes previously associated with this process. Deletion of NTG1 and MGT1 did not substantially influence the recombination landscape, alluding to the potential presence of additional regulatory factors. Our findings also revealed the loss of large mitochondrial DNA regions in the absence of MHR1, suggesting a pivotal role for Mhr1 in mitochondrial genome maintenance during mating. This study provides a comprehensive overview of mitochondrial DNA recombination in yeast and thus paves the way for future mechanistic studies of mitochondrial recombination and genome

  17. A genome-wide map of mitochondrial DNA recombination in yeast.

    PubMed

    Fritsch, Emilie S; Chabbert, Christophe D; Klaus, Bernd; Steinmetz, Lars M

    2014-10-01

    In eukaryotic cells, the production of cellular energy requires close interplay between nuclear and mitochondrial genomes. The mitochondrial genome is essential in that it encodes several genes involved in oxidative phosphorylation. Each cell contains several mitochondrial genome copies and mitochondrial DNA recombination is a widespread process occurring in plants, fungi, protists, and invertebrates. Saccharomyces cerevisiae has proved to be an excellent model to dissect mitochondrial biology. Several studies have focused on DNA recombination in this organelle, yet mostly relied on reporter genes or artificial systems. However, no complete mitochondrial recombination map has been released for any eukaryote so far. In the present work, we sequenced pools of diploids originating from a cross between two different S. cerevisiae strains to detect recombination events. This strategy allowed us to generate the first genome-wide map of recombination for yeast mitochondrial DNA. We demonstrated that recombination events are enriched in specific hotspots preferentially localized in non-protein-coding regions. Additionally, comparison of the recombination profiles of two different crosses showed that the genetic background affects hotspot localization and recombination rates. Finally, to gain insights into the mechanisms involved in mitochondrial recombination, we assessed the impact of individual depletion of four genes previously associated with this process. Deletion of NTG1 and MGT1 did not substantially influence the recombination landscape, alluding to the potential presence of additional regulatory factors. Our findings also revealed the loss of large mitochondrial DNA regions in the absence of MHR1, suggesting a pivotal role for Mhr1 in mitochondrial genome maintenance during mating. This study provides a comprehensive overview of mitochondrial DNA recombination in yeast and thus paves the way for future mechanistic studies of mitochondrial recombination and genome

  18. Automated genomic DNA purification options in agricultural applications using MagneSil paramagnetic particles

    NASA Astrophysics Data System (ADS)

    Bitner, Rex M.; Koller, Susan C.

    2002-06-01

    The automated high throughput purification of genomic DNA form plant materials can be performed using MagneSil paramagnetic particles on the Beckman-Coulter FX, BioMek 2000, and the Tecan Genesis robot. Similar automated methods are available for DNA purifications from animal blood. These methods eliminate organic extractions, lengthy incubations and cumbersome filter plates. The DNA is suitable for applications such as PCR and RAPD analysis. Methods are described for processing traditionally difficult samples such as those containing large amounts of polyphenolics or oils, while still maintaining a high level of DNA purity. The robotic protocols have ben optimized for agricultural applications such as marker assisted breeding, seed-quality testing, and SNP discovery and scoring. In addition to high yield purification of DNA from plant samples or animal blood, the use of Promega's DNA-IQ purification system is also described. This method allows for the purification of a narrow range of DNA regardless of the amount of additional DNA that is present in the initial sample. This simultaneous Isolation and Quantification of DNA allows the DNA to be used directly in applications such as PCR, SNP analysis, and RAPD, without the need for separate quantitation of the DNA.

  19. Comparative Analyses of DNA Methylation and Sequence Evolution Using Nasonia Genomes

    PubMed Central

    Park, Jungsun; Peng, Zuogang; Zeng, Jia; Elango, Navin; Park, Taesung; Wheeler, Dave; Werren, John H.; Yi, Soojin V.

    2011-01-01

    The functional and evolutionary significance of DNA methylation in insect genomes remains to be resolved. Nasonia is well situated for comparative analyses of DNA methylation and genome evolution, since the genomes of a moderately distant outgroup species as well as closely related sibling species are available. Using direct sequencing of bisulfite-converted DNA, we uncovered a substantial level of DNA methylation in 17 of 18 Nasonia vitripennis genes and a strong correlation between methylation level and CpG depletion. Notably, in the sex-determining locus transformer, the exon that is alternatively spliced between the sexes is heavily methylated in both males and females, whereas other exons are only sparsely methylated. Orthologous genes of the honeybee and Nasonia show highly similar relative levels of CpG depletion, despite ∼190 My divergence. Densely and sparsely methylated genes in these species also exhibit similar functional enrichments. We found that the degree of CpG depletion is negatively correlated with substitution rates between closely related Nasonia species for synonymous, nonsynonymous, and intron sites. This suggests that mutation rates increase with decreasing levels of germ line methylation. Thus, DNA methylation is prevalent in the Nasonia genome, may participate in regulatory processes such as sex determination and alternative splicing, and is correlated with several aspects of genome and sequence evolution. PMID:21693438

  20. Genome-wide survey of DNA-binding proteins in Arabidopsis thaliana: analysis of distribution and functions.

    PubMed

    Malhotra, Sony; Sowdhamini, Ramanathan

    2013-08-01

    The interaction of proteins with their respective DNA targets is known to control many high-fidelity cellular processes. Performing a comprehensive survey of the sequenced genomes for DNA-binding proteins (DBPs) will help in understanding their distribution and the associated functions in a particular genome. Availability of fully sequenced genome of Arabidopsis thaliana enables the review of distribution of DBPs in this model plant genome. We used profiles of both structure and sequence-based DNA-binding families, derived from PDB and PFam databases, to perform the survey. This resulted in 4471 proteins, identified as DNA-binding in Arabidopsis genome, which are distributed across 300 different PFam families. Apart from several plant-specific DNA-binding families, certain RING fingers and leucine zippers also had high representation. Our search protocol helped to assign DNA-binding property to several proteins that were previously marked as unknown, putative or hypothetical in function. The distribution of Arabidopsis genes having a role in plant DNA repair were particularly studied and noted for their functional mapping. The functions observed to be overrepresented in the plant genome harbour DNA-3-methyladenine glycosylase activity, alkylbase DNA N-glycosylase activity and DNA-(apurinic or apyrimidinic site) lyase activity, suggesting their role in specialized functions such as gene regulation and DNA repair.

  1. Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size.

    PubMed

    Kelly, Laura J; Renny-Byfield, Simon; Pellicer, Jaume; Macas, Jiří; Novák, Petr; Neumann, Pavel; Lysak, Martin A; Day, Peter D; Berger, Madeleine; Fay, Michael F; Nichols, Richard A; Leitch, Andrew R; Leitch, Ilia J

    2015-10-01

    Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  2. Functional interrogation of non-coding DNA through CRISPR genome editing.

    PubMed

    Canver, Matthew C; Bauer, Daniel E; Orkin, Stuart H

    2017-05-15

    Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. Copyright © 2017 Elsevier Inc. All rights reserved.

  3. Functional interrogation of non-coding DNA through CRISPR genome editing

    PubMed Central

    Canver, Matthew C.; Bauer, Daniel E.; Orkin, Stuart H.

    2017-01-01

    Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. PMID:28288828

  4. High-throughput sequencing of three Lemnoideae (duckweeds) chloroplast genomes from total DNA.

    PubMed

    Wang, Wenqin; Messing, Joachim

    2011-01-01

    Chloroplast genomes provide a wealth of information for evolutionary and population genetic studies. Chloroplasts play a particularly important role in the adaption for aquatic plants because they float on water and their major surface is exposed continuously to sunlight. The subfamily of Lemnoideae represents such a collection of aquatic species that because of photosynthesis represents one of the fastest growing plant species on earth. We sequenced the chloroplast genomes from three different genera of Lemnoideae, Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana by high-throughput DNA sequencing of genomic DNA using the SOLiD platform. Unfractionated total DNA contains high copies of plastid DNA so that sequences from the nucleus and mitochondria can easily be filtered computationally. Remaining sequence reads were assembled into contiguous sequences (contigs) using SOLiD software tools. Contigs were mapped to a reference genome of Lemna minor and gaps, selected by PCR, were sequenced on the ABI3730xl platform. This combinatorial approach yielded whole genomic contiguous sequences in a cost-effective manner. Over 1,000-time coverage of chloroplast from total DNA were reached by the SOLiD platform in a single spot on a quadrant slide without purification. Comparative analysis indicated that the chloroplast genome was conserved in gene number and organization with respect to the reference genome of L. minor. However, higher nucleotide substitution, abundant deletions and insertions occurred in non-coding regions of these genomes, indicating a greater genomic dynamics than expected from the comparison of other related species in the Pooideae. Noticeably, there was no transition bias over transversion in Lemnoideae. The data should have immediate applications in evolutionary biology and plant taxonomy with increased resolution and statistical power.

  5. Nuclear import of viral DNA genomes.

    PubMed

    Greber, Urs F; Fassati, Ariberto

    2003-03-01

    The genomes of many viruses traffic into the nucleus, where they are either integrated into host chromosomes or maintained as episomal DNA and then transcriptionally activated or silenced. Here, we discuss the existing evidence on how the lentiviruses, adenoviruses, herpesviruses, hepadnaviruses and autonomous parvoviruses enter the nucleus. Depending on the size of the capsid enclosing the genome, three principles of viral nucleic acids import are discussed. The first principle is that the capsid disassembles in the cytosol or in a docked state at the nuclear pore complex and a subviral genomic complex is trafficked through the pore. Second, the genome is injected from a capsid that is docked to the pore complex, and third, import factors are recruited to cytosolic capsids to increase capsid affinity to the pore complex, mediate translocation and allow disassembly in the nucleoplasm.

  6. Frequent somatic transfer of mitochondrial DNA into the nuclear genome of human cancer cells.

    PubMed

    Ju, Young Seok; Tubio, Jose M C; Mifsud, William; Fu, Beiyuan; Davies, Helen R; Ramakrishna, Manasa; Li, Yilong; Yates, Lucy; Gundem, Gunes; Tarpey, Patrick S; Behjati, Sam; Papaemmanuil, Elli; Martin, Sancha; Fullam, Anthony; Gerstung, Moritz; Nangalia, Jyoti; Green, Anthony R; Caldas, Carlos; Borg, Åke; Tutt, Andrew; Lee, Ming Ta Michael; van't Veer, Laura J; Tan, Benita K T; Aparicio, Samuel; Span, Paul N; Martens, John W M; Knappskog, Stian; Vincent-Salomon, Anne; Børresen-Dale, Anne-Lise; Eyfjörd, Jórunn Erla; Myklebost, Ola; Flanagan, Adrienne M; Foster, Christopher; Neal, David E; Cooper, Colin; Eeles, Rosalind; Bova, Steven G; Lakhani, Sunil R; Desmedt, Christine; Thomas, Gilles; Richardson, Andrea L; Purdie, Colin A; Thompson, Alastair M; McDermott, Ultan; Yang, Fengtang; Nik-Zainal, Serena; Campbell, Peter J; Stratton, Michael R

    2015-06-01

    Mitochondrial genomes are separated from the nuclear genome for most of the cell cycle by the nuclear double membrane, intervening cytoplasm, and the mitochondrial double membrane. Despite these physical barriers, we show that somatically acquired mitochondrial-nuclear genome fusion sequences are present in cancer cells. Most occur in conjunction with intranuclear genomic rearrangements, and the features of the fusion fragments indicate that nonhomologous end joining and/or replication-dependent DNA double-strand break repair are the dominant mechanisms involved. Remarkably, mitochondrial-nuclear genome fusions occur at a similar rate per base pair of DNA as interchromosomal nuclear rearrangements, indicating the presence of a high frequency of contact between mitochondrial and nuclear DNA in some somatic cells. Transmission of mitochondrial DNA to the nuclear genome occurs in neoplastically transformed cells, but we do not exclude the possibility that some mitochondrial-nuclear DNA fusions observed in cancer occurred years earlier in normal somatic cells. © 2015 Ju et al.; Published by Cold Spring Harbor Laboratory Press.

  7. Frequent somatic transfer of mitochondrial DNA into the nuclear genome of human cancer cells

    PubMed Central

    Ju, Young Seok; Tubio, Jose M.C.; Mifsud, William; Fu, Beiyuan; Davies, Helen R.; Ramakrishna, Manasa; Li, Yilong; Yates, Lucy; Gundem, Gunes; Tarpey, Patrick S.; Behjati, Sam; Papaemmanuil, Elli; Martin, Sancha; Fullam, Anthony; Gerstung, Moritz; Nangalia, Jyoti; Green, Anthony R.; Caldas, Carlos; Borg, Åke; Tutt, Andrew; Lee, Ming Ta Michael; van't Veer, Laura J.; Tan, Benita K.T.; Aparicio, Samuel; Span, Paul N.; Martens, John W.M.; Knappskog, Stian; Vincent-Salomon, Anne; Børresen-Dale, Anne-Lise; Eyfjörd, Jórunn Erla; Flanagan, Adrienne M.; Foster, Christopher; Neal, David E.; Cooper, Colin; Eeles, Rosalind; Lakhani, Sunil R.; Desmedt, Christine; Thomas, Gilles; Richardson, Andrea L.; Purdie, Colin A.; Thompson, Alastair M.; McDermott, Ultan; Yang, Fengtang; Nik-Zainal, Serena; Campbell, Peter J.; Stratton, Michael R.

    2015-01-01

    Mitochondrial genomes are separated from the nuclear genome for most of the cell cycle by the nuclear double membrane, intervening cytoplasm, and the mitochondrial double membrane. Despite these physical barriers, we show that somatically acquired mitochondrial-nuclear genome fusion sequences are present in cancer cells. Most occur in conjunction with intranuclear genomic rearrangements, and the features of the fusion fragments indicate that nonhomologous end joining and/or replication-dependent DNA double-strand break repair are the dominant mechanisms involved. Remarkably, mitochondrial-nuclear genome fusions occur at a similar rate per base pair of DNA as interchromosomal nuclear rearrangements, indicating the presence of a high frequency of contact between mitochondrial and nuclear DNA in some somatic cells. Transmission of mitochondrial DNA to the nuclear genome occurs in neoplastically transformed cells, but we do not exclude the possibility that some mitochondrial-nuclear DNA fusions observed in cancer occurred years earlier in normal somatic cells. PMID:25963125

  8. Detection of Alicyclobacillus species in fruit juice using a random genomic DNA microarray chip.

    PubMed

    Jang, Jun Hyeong; Kim, Sun-Joong; Yoon, Bo Hyun; Ryu, Jee-Hoon; Gu, Man Bock; Chang, Hyo-Ihl

    2011-06-01

    This study describes a method using a DNA microarray chip to rapidly and simultaneously detect Alicyclobacillus species in orange juice based on the hybridization of genomic DNA with random probes. Three food spoilage bacteria were used in this study: Alicyclobacillus acidocaldarius, Alicyclobacillus acidoterrestris, and Alicyclobacillus cycloheptanicus. The three Alicyclobacillus species were adjusted to 2 × 10(3) CFU/ml and inoculated into pasteurized 100% pure orange juice. Cy5-dCTP labeling was used for reference signals, and Cy3-dCTP was labeled for target genomic DNA. The molar ratio of 1:1 of Cy3-dCTP and Cy5-dCTP was used. DNA microarray chips were fabricated using randomly fragmented DNA of Alicyclobacillus spp. and were hybridized with genomic DNA extracted from Bacillus spp. Genomic DNA extracted from Alicyclobacillus spp. showed a significantly higher hybridization rate compared with DNA of Bacillus spp., thereby distinguishing Alicyclobacillus spp. from Bacillus spp. The results showed that the microarray DNA chip containing randomly fragmented genomic DNA was specific and clearly identified specific food spoilage bacteria. This microarray system is a good tool for rapid and specific detection of thermophilic spoilage bacteria, mainly Alicyclobacillus spp., and is useful and applicable to the fruit juice industry.

  9. Isolation and amplification of genomic DNA from recalcitrant dried berries of black pepper (Piper nigrum L.)--a medicinal spice.

    PubMed

    Dhanya, K; Kizhakkayil, Jaleel; Syamkumar, S; Sasikumar, B

    2007-10-01

    Black pepper is an important medicinal spice traded internationally. The extraction of high quality genomic DNA for PCR amplification from dried black pepper is challenging because of the presence of the exceptionally large amount of oxidized polyphenolic compounds, polysaccharides and other secondary metabolites. Here we report a modified hexadecyl trimethyl ammonium bromide (CTAB) protocol by incorporating potassium acetate and a final PEG precipitation step to isolate PCR amplifiable genomic DNA from dried and powdered berries of black pepper. The protocol has trade implication as it will help in the PCR characterization of traded black peppers from different countries.

  10. Exploring the read-write genome: mobile DNA and mammalian adaptation.

    PubMed

    Shapiro, James A

    2017-02-01

    The read-write genome idea predicts that mobile DNA elements will act in evolution to generate adaptive changes in organismal DNA. This prediction was examined in the context of mammalian adaptations involving regulatory non-coding RNAs, viviparous reproduction, early embryonic and stem cell development, the nervous system, and innate immunity. The evidence shows that mobile elements have played specific and sometimes major roles in mammalian adaptive evolution by generating regulatory sites in the DNA and providing interaction motifs in non-coding RNA. Endogenous retroviruses and retrotransposons have been the predominant mobile elements in mammalian adaptive evolution, with the notable exception of bats, where DNA transposons are the major agents of RW genome inscriptions. A few examples of independent but convergent exaptation of mobile DNA elements for similar regulatory rewiring functions are noted.

  11. A streamlined method for analysing genome-wide DNA methylation patterns from low amounts of FFPE DNA.

    PubMed

    Ludgate, Jackie L; Wright, James; Stockwell, Peter A; Morison, Ian M; Eccles, Michael R; Chatterjee, Aniruddha

    2017-08-31

    Formalin fixed paraffin embedded (FFPE) tumor samples are a major source of DNA from patients in cancer research. However, FFPE is a challenging material to work with due to macromolecular fragmentation and nucleic acid crosslinking. FFPE tissue particularly possesses challenges for methylation analysis and for preparing sequencing-based libraries relying on bisulfite conversion. Successful bisulfite conversion is a key requirement for sequencing-based methylation analysis. Here we describe a complete and streamlined workflow for preparing next generation sequencing libraries for methylation analysis from FFPE tissues. This includes, counting cells from FFPE blocks and extracting DNA from FFPE slides, testing bisulfite conversion efficiency with a polymerase chain reaction (PCR) based test, preparing reduced representation bisulfite sequencing libraries and massively parallel sequencing. The main features and advantages of this protocol are: An optimized method for extracting good quality DNA from FFPE tissues. An efficient bisulfite conversion and next generation sequencing library preparation protocol that uses 50 ng DNA from FFPE tissue. Incorporation of a PCR-based test to assess bisulfite conversion efficiency prior to sequencing. We provide a complete workflow and an integrated protocol for performing DNA methylation analysis at the genome-scale and we believe this will facilitate clinical epigenetic research that involves the use of FFPE tissue.

  12. Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes

    PubMed Central

    Huang, Yongjie; Mrázek, Jan

    2014-01-01

    Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877

  13. DNA Breaks and End Resection Measured Genome-wide by End Sequencing.

    PubMed

    Canela, Andres; Sridharan, Sriram; Sciascia, Nicholas; Tubbs, Anthony; Meltzer, Paul; Sleckman, Barry P; Nussenzweig, André

    2016-09-01

    DNA double-strand breaks (DSBs) arise during physiological transcription, DNA replication, and antigen receptor diversification. Mistargeting or misprocessing of DSBs can result in pathological structural variation and mutation. Here we describe a sensitive method (END-seq) to monitor DNA end resection and DSBs genome-wide at base-pair resolution in vivo. We utilized END-seq to determine the frequency and spectrum of restriction-enzyme-, zinc-finger-nuclease-, and RAG-induced DSBs. Beyond sequence preference, chromatin features dictate the repertoire of these genome-modifying enzymes. END-seq can detect at least one DSB per cell among 10,000 cells not harboring DSBs, and we estimate that up to one out of 60 cells contains off-target RAG cleavage. In addition to site-specific cleavage, we detect DSBs distributed over extended regions during immunoglobulin class-switch recombination. Thus, END-seq provides a snapshot of DNA ends genome-wide, which can be utilized for understanding genome-editing specificities and the influence of chromatin on DSB pathway choice. Published by Elsevier Inc.

  14. High-Throughput Sequencing of Three Lemnoideae (Duckweeds) Chloroplast Genomes from Total DNA

    PubMed Central

    Wang, Wenqin; Messing, Joachim

    2011-01-01

    Background Chloroplast genomes provide a wealth of information for evolutionary and population genetic studies. Chloroplasts play a particularly important role in the adaption for aquatic plants because they float on water and their major surface is exposed continuously to sunlight. The subfamily of Lemnoideae represents such a collection of aquatic species that because of photosynthesis represents one of the fastest growing plant species on earth. Methods We sequenced the chloroplast genomes from three different genera of Lemnoideae, Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana by high-throughput DNA sequencing of genomic DNA using the SOLiD platform. Unfractionated total DNA contains high copies of plastid DNA so that sequences from the nucleus and mitochondria can easily be filtered computationally. Remaining sequence reads were assembled into contiguous sequences (contigs) using SOLiD software tools. Contigs were mapped to a reference genome of Lemna minor and gaps, selected by PCR, were sequenced on the ABI3730xl platform. Conclusions This combinatorial approach yielded whole genomic contiguous sequences in a cost-effective manner. Over 1,000-time coverage of chloroplast from total DNA were reached by the SOLiD platform in a single spot on a quadrant slide without purification. Comparative analysis indicated that the chloroplast genome was conserved in gene number and organization with respect to the reference genome of L. minor. However, higher nucleotide substitution, abundant deletions and insertions occurred in non-coding regions of these genomes, indicating a greater genomic dynamics than expected from the comparison of other related species in the Pooideae. Noticeably, there was no transition bias over transversion in Lemnoideae. The data should have immediate applications in evolutionary biology and plant taxonomy with increased resolution and statistical power. PMID:21931804

  15. DNA bending-induced phase transition of encapsidated genome in phage λ

    PubMed Central

    Lander, Gabriel C.; Johnson, John E.; Rau, Donald C.; Potter, Clinton S.; Carragher, Bridget; Evilevitch, Alex

    2013-01-01

    The DNA structure in phage capsids is determined by DNA–DNA interactions and bending energy. The effects of repulsive interactions on DNA interaxial distance were previously investigated, but not the effect of DNA bending on its structure in viral capsids. By varying packaged DNA length and through addition of spermine ions, we transform the interaction energy from net repulsive to net attractive. This allowed us to isolate the effect of bending on the resulting DNA structure. We used single particle cryo-electron microscopy reconstruction analysis to determine the interstrand spacing of double-stranded DNA encapsidated in phage λ capsids. The data reveal that stress and packing defects, both resulting from DNA bending in the capsid, are able to induce a long-range phase transition in the encapsidated DNA genome from a hexagonal to a cholesteric packing structure. This structural observation suggests significant changes in genome fluidity as a result of a phase transition affecting the rates of viral DNA ejection and packaging. PMID:23449219

  16. FA-SAT Is an Old Satellite DNA Frozen in Several Bilateria Genomes

    PubMed Central

    Chaves, Raquel; Ferreira, Daniela; Mendes-da-Silva, Ana; Meles, Susana; Adega, Filomena

    2017-01-01

    Abstract In recent years, a growing body of evidence has recognized the tandem repeat sequences, and specifically satellite DNA, as a functional class of sequences in the genomic “dark matter.” Using an original, complementary, and thus an eclectic experimental design, we show that the cat archetypal satellite DNA sequence, FA-SAT, is “frozen” conservatively in several Bilateria genomes. We found different genomic FA-SAT architectures, and the interspersion pattern was conserved. In Carnivora genomes, the FA-SAT-related sequences are also amplified, with the predominance of a specific FA-SAT variant, at the heterochromatic regions. We inspected the cat genome project to locate FA-SAT array flanking regions and revealed an intensive intermingling with transposable elements. Our results also show that FA-SAT-related sequences are transcribed and that the most abundant FA-SAT variant is not always the most transcribed. We thus conclude that the DNA sequences of FA-SAT and their transcripts are “frozen” in these genomes. Future work is needed to disclose any putative function that these sequences may play in these genomes. PMID:29608678

  17. A simple, fast, and inexpensive CTAB-PVP-silica based method for genomic DNA isolation from single, small insect larvae and pupae.

    PubMed

    Huanca-Mamani, W; Rivera-Cabello, D; Maita-Maita, J

    2015-07-17

    In this study, we report a modified CTAB-PVP method combined with silicon dioxide (silica) treatment for the extraction of high quality genomic DNA from a single larva or pupa. This method efficiently obtains DNA from small specimens, which is difficult and challenging because of the small amount of starting tissue. Maceration with liquid nitrogen, phenol treatment, and the ethanol precipitation step are eliminated using this methodology. The A260/A280 absorbance ratios of the isolated DNA were approximately 1.8, suggesting that the DNA is pure and can be used for further molecular analysis. The quality of the isolated DNA permits molecular applications and represents a fast, cheap, and effective alternative method for laboratories with low budgets.

  18. Genome-wide colonization of gene regulatory elements by G4 DNA motifs

    PubMed Central

    Du, Zhuo; Zhao, Yiqiang; Li, Ning

    2009-01-01

    G-quadruplex (or G4 DNA), a stable four-stranded structure found in guanine-rich regions, is implicated in the transcriptional regulation of genes involved in growth and development. Previous studies on the role of G4 DNA in gene regulation mostly focused on genomic regions proximal to transcription start sites (TSSs). To gain a more comprehensive understanding of the regulatory role of G4 DNA, we examined the landscape of potential G4 DNA (PG4Ms) motifs in the human genome and found that G4 motifs, not restricted to those found in the TSS-proximal regions, are bias toward gene-associated regions. Significantly, analyses of G4 motifs in seven types of well-known gene regulatory elements revealed a constitutive enrichment pattern and the clusters of G4 motifs tend to be colocalized with regulatory elements. Considering our analysis from a genome evolutionary perspective, we found evidence that the occurrence and accumulation of certain progenitors and canonical G4 DNA motifs within regulatory regions were progressively favored by natural selection. Our results suggest that G4 DNA motifs are ‘colonized’ in regulatory regions, supporting a likely genome-wide role of G4 DNA in gene regulation. We hypothesize that G4 DNA is a regulatory apparatus situated in regulatory elements, acting as a molecular switch that can modulate the role of the host functional regions, by transition in DNA structure. PMID:19759215

  19. Evaluation of FTA ® paper for storage of oral meta-genomic DNA.

    PubMed

    Foitzik, Magdalena; Stumpp, Sascha N; Grischke, Jasmin; Eberhard, Jörg; Stiesch, Meike

    2014-10-01

    The purpose of the present study was to evaluate the short-term storage of meta-genomic DNA from native oral biofilms on FTA(®) paper. Thirteen volunteers of both sexes received an acrylic splint for intraoral biofilm formation over a period of 48 hours. The biofilms were collected, resuspended in phosphate-buffered saline, and either stored on FTA(®) paper or directly processed by standard laboratory DNA extraction. The nucleic acid extraction efficiencies were evaluated by 16S rDNA targeted SSCP fingerprinting. The acquired banding pattern of FTA-derived meta-genomic DNA was compared to a standard DNA preparation protocol. Sensitivity and positive predictive values were calculated. The volunteers showed inter-individual differences in their bacterial species composition. A total of 200 bands were found for both methods and 85% of the banding patterns were equal, representing a sensitivity of 0.941 and a false-negative predictive value of 0.059. Meta-genomic DNA sampling, extraction, and adhesion using FTA(®) paper is a reliable method for storage of microbial DNA for a short period of time.

  20. DNA and RNA editing of retrotransposons accelerate mammalian genome evolution.

    PubMed

    Knisbacher, Binyamin A; Levanon, Erez Y

    2015-04-01

    Genome evolution is commonly viewed as a gradual process that is driven by random mutations that accumulate over time. However, DNA- and RNA-editing enzymes have been identified that can accelerate evolution by actively modifying the genomically encoded information. The apolipoprotein B mRNA editing enzymes, catalytic polypeptide-like (APOBECs) are potent restriction factors that can inhibit retroelements by cytosine-to-uridine editing of retroelement DNA after reverse transcription. In some cases, a retroelement may successfully integrate into the genome despite being hypermutated. Such events introduce unique sequences into the genome and are thus a source of genomic innovation. adenosine deaminases that act on RNA (ADARs) catalyze adenosine-to-inosine editing in double-stranded RNA, commonly formed by oppositely oriented retroelements. The RNA editing confers plasticity to the transcriptome by generating many transcript variants from a single genomic locus. If the editing produces a beneficial variant, the genome may maintain the locus that produces the RNA-edited transcript for its novel function. Here, we discuss how these two powerful editing mechanisms, which both target inserted retroelements, facilitate expedited genome evolution. © 2015 New York Academy of Sciences.

  1. Genome-wide association between DNA methylation and alternative splicing in an invertebrate

    PubMed Central

    2012-01-01

    Background Gene bodies are the most evolutionarily conserved targets of DNA methylation in eukaryotes. However, the regulatory functions of gene body DNA methylation remain largely unknown. DNA methylation in insects appears to be primarily confined to exons. Two recent studies in Apis mellifera (honeybee) and Nasonia vitripennis (jewel wasp) analyzed transcription and DNA methylation data for one gene in each species to demonstrate that exon-specific DNA methylation may be associated with alternative splicing events. In this study we investigated the relationship between DNA methylation, alternative splicing, and cross-species gene conservation on a genome-wide scale using genome-wide transcription and DNA methylation data. Results We generated RNA deep sequencing data (RNA-seq) to measure genome-wide mRNA expression at the exon- and gene-level. We produced a de novo transcriptome from this RNA-seq data and computationally predicted splice variants for the honeybee genome. We found that exons that are included in transcription are higher methylated than exons that are skipped during transcription. We detected enrichment for alternative splicing among methylated genes compared to unmethylated genes using fisher’s exact test. We performed a statistical analysis to reveal that the presence of DNA methylation or alternative splicing are both factors associated with a longer gene length and a greater number of exons in genes. In concordance with this observation, a conservation analysis using BLAST revealed that each of these factors is also associated with higher cross-species gene conservation. Conclusions This study constitutes the first genome-wide analysis exhibiting a positive relationship between exon-level DNA methylation and mRNA expression in the honeybee. Our finding that methylated genes are enriched for alternative splicing suggests that, in invertebrates, exon-level DNA methylation may play a role in the construction of splice variants by positively

  2. [Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].

    PubMed

    Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y

    2017-08-01

    To analyze and detect the whole genome sequence of human mitochondrial DNA (mtDNA) by Ion Torrent PGM™ platform and to study the differences of mtDNA sequence in different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costicartilage, nail, skeletal muscle and oral epithelium. Amplification of whole genome sequence of mtDNA was performed by 4 pairs of primer. Libraries were constructed with Ion Shear™ Plus Reagents kit and Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions on HVⅠ region. The whole genome sequence of mtDNA from all samples were amplified successfully. Six unrelated individuals belonged to 6 different haplotypes. Different tissues in one individual had heteroplasmy difference. The heteroplasmy positions and the mutation positions on HVⅠ region were verified by Sanger sequencing. After a consistency check by the Kappa method, it was found that the results of mtDNA sequence had a high consistency in different tissues. The testing method used in present study for sequencing the whole genome sequence of human mtDNA can detect the heteroplasmy difference in different tissues, which have good consistency. The results provide guidance for the further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine

  3. Genome-Wide Negative Feedback Drives Transgenerational DNA Methylation Dynamics in Arabidopsis

    PubMed Central

    Kassam, Mohamed; Duvernois-Berthet, Evelyne; Cortijo, Sandra; Takashima, Kazuya; Saze, Hidetoshi; Toyoda, Atsushi; Fujiyama, Asao; Colot, Vincent; Kakutani, Tetsuji

    2015-01-01

    Epigenetic variations of phenotypes, especially those associated with DNA methylation, are often inherited over multiple generations in plants. The active and inactive chromatin states are heritable and can be maintained or even be amplified by positive feedback in a transgenerational manner. However, mechanisms controlling the transgenerational DNA methylation dynamics are largely unknown. As an approach to understand the transgenerational dynamics, we examined long-term effect of impaired DNA methylation in Arabidopsis mutants of the chromatin remodeler gene DDM1 (Decrease in DNA Methylation 1) through whole genome DNA methylation sequencing. The ddm1 mutation induces a drastic decrease in DNA methylation of transposable elements (TEs) and repeats in the initial generation, while also inducing ectopic DNA methylation at hundreds of loci. Unexpectedly, this ectopic methylation can only be seen after repeated self-pollination. The ectopic cytosine methylation is found primarily in the non-CG context and starts from 3’ regions within transcription units and spreads upstream. Remarkably, when chromosomes with reduced DNA methylation were introduced from a ddm1 mutant into a DDM1 wild-type background, the ddm1-derived chromosomes also induced analogous de novo accumulation of DNA methylation in trans. These results lead us to propose a model to explain the transgenerational DNA methylation redistribution by genome-wide negative feedback. The global negative feedback, together with local positive feedback, would ensure robust and balanced differentiation of chromatin states within the genome. PMID:25902052

  4. DNA Quantity and Quality in Remnants of Traffic-Killed Specimens of an Endangered Longhorn Beetle: A Comparison of Different Methods.

    PubMed

    Rusterholz, Hans-Peter; Ursenbacher, Sylvain; Coray, Armin; Weibel, Urs; Baur, Bruno

    2015-01-01

    The sampling of living insects should be avoided in highly endangered species when the sampling would further increase the risk of population extinction. Nonlethal sampling (wing clips or leg removals) can be an alternative to obtain DNA of individuals for population genetic studies. However, nonlethal sampling may not be possible for all insect species. We examined whether remnants of traffic-killed specimens of the endangered and protected flightless longhorn beetle Iberodorcadion fuliginator (L., 1758) can be used as a resource for population genetic analyses. Using insect fragments of traffic-killed specimens collected over 15 yr, we determined the most efficient DNA extraction method in relation to the state of the specimens (crushed, fragment, or intact), preservation (dried, airtight, or in ethanol), storage duration, and weight of the sample by assessing the quantity and quality of genomic DNA. A modified cetyltrimethyl ammonium bromide method provided the highest recovery rate of genomic DNA and the largest yield and highest quality of DNA. We further used traffic-killed specimens to evaluate two DNA amplification techniques (quantitative polymerase chain reaction [qPCR] and microsatellites). Both qPCR and microsatellites revealed successful DNA amplification in all degraded specimens or beetle fragments examined. However, relative qPCR concentration and peak height of microsatellites were affected by the state of specimen and storage duration but not by specimen weight. Our investigation demonstrates that degraded remnants of traffic-killed beetle specimens can serve as a source of high-quality genomic DNA, which allows to address conservation genetic issues. © The Author 2015. Published by Oxford University Press on behalf of the Entomological Society of America.

  5. DNA-PKcs, ATM, and ATR Interplay Maintains Genome Integrity during Neurogenesis.

    PubMed

    Enriquez-Rios, Vanessa; Dumitrache, Lavinia C; Downing, Susanna M; Li, Yang; Brown, Eric J; Russell, Helen R; McKinnon, Peter J

    2017-01-25

    The DNA damage response (DDR) orchestrates a network of cellular processes that integrates cell-cycle control and DNA repair or apoptosis, which serves to maintain genome stability. DNA-PKcs (the catalytic subunit of the DNA-dependent kinase, encoded by PRKDC), ATM (ataxia telangiectasia, mutated), and ATR (ATM and Rad3-related) are related PI3K-like protein kinases and central regulators of the DDR. Defects in these kinases have been linked to neurodegenerative or neurodevelopmental syndromes. In all cases, the key neuroprotective function of these kinases is uncertain. It also remains unclear how interactions between the three DNA damage-responsive kinases coordinate genome stability, particularly in a physiological context. Here, we used a genetic approach to identify the neural function of DNA-PKcs and the interplay between ATM and ATR during neurogenesis. We found that DNA-PKcs loss in the mouse sensitized neuronal progenitors to apoptosis after ionizing radiation because of excessive DNA damage. DNA-PKcs was also required to prevent endogenous DNA damage accumulation throughout the adult brain. In contrast, ATR coordinated the DDR during neurogenesis to direct apoptosis in cycling neural progenitors, whereas ATM regulated apoptosis in both proliferative and noncycling cells. We also found that ATR controls a DNA damage-induced G 2 /M checkpoint in cortical progenitors, independent of ATM and DNA-PKcs. These nonoverlapping roles were further confirmed via sustained murine embryonic or cortical development after all three kinases were simultaneously inactivated. Thus, our results illustrate how DNA-PKcs, ATM, and ATR have unique and essential roles during the DDR, collectively ensuring comprehensive genome maintenance in the nervous system. The DNA damage response (DDR) is essential for prevention of a broad spectrum of different human neurologic diseases. However, a detailed understanding of the DDR at a physiological level is lacking. In contrast to many in

  6. Hippo pathway and protection of genome stability in response to DNA damage.

    PubMed

    Pefani, Dafni E; O'Neill, Eric

    2016-04-01

    The integrity of DNA is constantly challenged by exposure to the damaging effects of chemical and physical agents. Elucidating the cellular mechanisms that maintain genomic integrity via DNA repair and cell growth control is vital because errors in these processes lead to genomic damage and the development of cancer. By gaining a deep molecular understanding of the signaling pathways regulating genome integrity it is hoped to uncover new therapeutics and treatment designs to combat cancer. Components of the Hippo pathway, a tumor-suppressor cascade, have recently been defined to limit cancer transformation in response to DNA damage. In this review, we briefly introduce the Hippo signaling cascade in mammals and discuss in detail how the Hippo pathway has been established as part of the DNA damage response, activated by apical signaling kinases that recognize breaks in DNA. We also highlight the significance of the Hippo pathway activator RASSF1A tumor suppressor, a direct target of ataxia telangiectasia mutated and ataxia telangiectasia and Rad3 related ATR. Furthermore we discuss how Hippo pathway in response DNA lesions can induce cell death via Yes-associated protein (YAP) (the canonical Hippo pathway effector) or promote maintenance of genome integrity in a YAP-independent manner. © 2015 FEBS.

  7. A High Quality Draft Consensus Sequence of the Genome of a Heterozygous Grapevine Variety

    PubMed Central

    Cartwright, Dustin A.; Cestaro, Alessandro; Pruss, Dmitry; Pindo, Massimo; FitzGerald, Lisa M.; Vezzulli, Silvia; Reid, Julia; Malacarne, Giulia; Iliev, Diana; Coppola, Giuseppina; Wardell, Bryan; Micheletti, Diego; Macalma, Teresita; Facci, Marco; Mitchell, Jeff T.; Perazzolli, Michele; Eldredge, Glenn; Gatto, Pamela; Oyzerski, Rozan; Moretto, Marco; Gutin, Natalia; Stefanini, Marco; Chen, Yang; Segala, Cinzia; Davenport, Christine; Demattè, Lorenzo; Mraz, Amy; Battilana, Juri; Stormo, Keith; Costa, Fabrizio; Tao, Quanzhou; Si-Ammour, Azeddine; Harkins, Tim; Lackey, Angie; Perbost, Clotilde; Taillon, Bruce; Stella, Alessandra; Solovyev, Victor; Fawcett, Jeffrey A.; Sterck, Lieven; Vandepoele, Klaas; Grando, Stella M.; Toppo, Stefano; Moser, Claudio; Lanchbury, Jerry; Bogden, Robert; Skolnick, Mark; Sgaramella, Vittorio; Bhatnagar, Satish K.; Fontana, Paolo; Gutin, Alexander; Van de Peer, Yves; Salamini, Francesco; Viola, Roberto

    2007-01-01

    Background Worldwide, grapes and their derived products have a large market. The cultivated grape species Vitis vinifera has potential to become a model for fruit trees genetics. Like many plant species, it is highly heterozygous, which is an additional challenge to modern whole genome shotgun sequencing. In this paper a high quality draft genome sequence of a cultivated clone of V. vinifera Pinot Noir is presented. Principal Findings We estimate the genome size of V. vinifera to be 504.6 Mb. Genomic sequences corresponding to 477.1 Mb were assembled in 2,093 metacontigs and 435.1 Mb were anchored to the 19 linkage groups (LGs). The number of predicted genes is 29,585, of which 96.1% were assigned to LGs. This assembly of the grape genome provides candidate genes implicated in traits relevant to grapevine cultivation, such as those influencing wine quality, via secondary metabolites, and those connected with the extreme susceptibility of grape to pathogens. Single nucleotide polymorphism (SNP) distribution was consistent with a diffuse haplotype structure across the genome. Of around 2,000,000 SNPs, 1,751,176 were mapped to chromosomes and one or more of them were identified in 86.7% of anchored genes. The relative age of grape duplicated genes was estimated and this made possible to reveal a relatively recent Vitis-specific large scale duplication event concerning at least 10 chromosomes (duplication not reported before). Conclusions Sanger shotgun sequencing and highly efficient sequencing by synthesis (SBS), together with dedicated assembly programs, resolved a complex heterozygous genome. A consensus sequence of the genome and a set of mapped marker loci were generated. Homologous chromosomes of Pinot Noir differ by 11.2% of their DNA (hemizygous DNA plus chromosomal gaps). SNP markers are offered as a tool with the potential of introducing a new era in the molecular breeding of grape. PMID:18094749

  8. DNA methylome of the 20-gigabase Norway spruce genome

    PubMed Central

    Ausin, Israel; Feng, Suhua; Yu, Chaowei; Liu, Wanlu; Kuo, Hsuan Yu; Jacobsen, Elise L.; Zhai, Jixian; Gallego-Bartolome, Javier; Wang, Lin; Egertsdotter, Ulrika; Street, Nathaniel R.; Jacobsen, Steven E.; Wang, Haifeng

    2016-01-01

    DNA methylation plays important roles in many biological processes, such as silencing of transposable elements, imprinting, and regulating gene expression. Many studies of DNA methylation have shown its essential roles in angiosperms (flowering plants). However, few studies have examined the roles and patterns of DNA methylation in gymnosperms. Here, we present genome-wide high coverage single-base resolution methylation maps of Norway spruce (Picea abies) from both needles and somatic embryogenesis culture cells via whole genome bisulfite sequencing. On average, DNA methylation levels of CG and CHG of Norway spruce were higher than most other plants studied. CHH methylation was found at a relatively low level; however, at least one copy of most of the RNA-directed DNA methylation pathway genes was found in Norway spruce, and CHH methylation was correlated with levels of siRNAs. In comparison with needles, somatic embryogenesis culture cells that are used for clonally propagating spruce trees showed lower levels of CG and CHG methylation but higher level of CHH methylation, suggesting that like in other species, these culture cells show abnormal methylation patterns. PMID:27911846

  9. DNA transposons have colonized the genome of the giant virus Pandoravirus salinus.

    PubMed

    Sun, Cheng; Feschotte, Cédric; Wu, Zhiqiang; Mueller, Rachel Lockridge

    2015-06-12

    Transposable elements are mobile DNA sequences that are widely distributed in prokaryotic and eukaryotic genomes, where they represent a major force in genome evolution. However, transposable elements have rarely been documented in viruses, and their contribution to viral genome evolution remains largely unexplored. Pandoraviruses are recently described DNA viruses with genome sizes that exceed those of some prokaryotes, rivaling parasitic eukaryotes. These large genomes appear to include substantial noncoding intergenic spaces, which provide potential locations for transposable element insertions. However, no mobile genetic elements have yet been reported in pandoravirus genomes. Here, we report a family of miniature inverted-repeat transposable elements (MITEs) in the Pandoravirus salinus genome, representing the first description of a virus populated with a canonical transposable element family that proliferated by transposition within the viral genome. The MITE family, which we name Submariner, includes 30 copies with all the hallmarks of MITEs: short length, terminal inverted repeats, TA target site duplication, and no coding capacity. Submariner elements show signs of transposition and are undetectable in the genome of Pandoravirus dulcis, the closest known relative Pandoravirus salinus. We identified a DNA transposon related to Submariner in the genome of Acanthamoeba castellanii, a species thought to host pandoraviruses, which contains remnants of coding sequence for a Tc1/mariner transposase. These observations suggest that the Submariner MITEs of P. salinus belong to the widespread Tc1/mariner superfamily and may have been mobilized by an amoebozoan host. Ten of the 30 MITEs in the P. salinus genome are located within coding regions of predicted genes, while others are close to genes, suggesting that these transposons may have contributed to viral genetic novelty. Our discovery highlights the remarkable ability of DNA transposons to colonize and shape

  10. The DNA-encoded nucleosome organization of a eukaryotic genome.

    PubMed

    Kaplan, Noam; Moore, Irene K; Fondufe-Mittendorf, Yvonne; Gossett, Andrea J; Tillo, Desiree; Field, Yair; LeProust, Emily M; Hughes, Timothy R; Lieb, Jason D; Widom, Jonathan; Segal, Eran

    2009-03-19

    Nucleosome organization is critical for gene regulation. In living cells this organization is determined by multiple factors, including the action of chromatin remodellers, competition with site-specific DNA-binding proteins, and the DNA sequence preferences of the nucleosomes themselves. However, it has been difficult to estimate the relative importance of each of these mechanisms in vivo, because in vivo nucleosome maps reflect the combined action of all influencing factors. Here we determine the importance of nucleosome DNA sequence preferences experimentally by measuring the genome-wide occupancy of nucleosomes assembled on purified yeast genomic DNA. The resulting map, in which nucleosome occupancy is governed only by the intrinsic sequence preferences of nucleosomes, is similar to in vivo nucleosome maps generated in three different growth conditions. In vitro, nucleosome depletion is evident at many transcription factor binding sites and around gene start and end sites, indicating that nucleosome depletion at these sites in vivo is partly encoded in the genome. We confirm these results with a micrococcal nuclease-independent experiment that measures the relative affinity of nucleosomes for approximately 40,000 double-stranded 150-base-pair oligonucleotides. Using our in vitro data, we devise a computational model of nucleosome sequence preferences that is significantly correlated with in vivo nucleosome occupancy in Caenorhabditis elegans. Our results indicate that the intrinsic DNA sequence preferences of nucleosomes have a central role in determining the organization of nucleosomes in vivo.

  11. Inferring genome-wide interplay landscape between DNA methylation and transcriptional regulation.

    PubMed

    Tang, Binhua; Wang, Xin

    2015-01-01

    DNA methylation and transcriptional regulation play important roles in cancer cell development and differentiation processes. Based on the currently available cell line profiling information from the ENCODE Consortium, we propose a Bayesian inference model to infer and construct genome-wide interaction landscape between DNA methylation and transcriptional regulation, which sheds light on the underlying complex functional mechanisms important within the human cancer and disease context. For the first time, we select all the currently available cell lines (>=20) and transcription factors (>=80) profiling information from the ENCODE Consortium portal. Through the integration of those genome-wide profiling sources, our genome-wide analysis detects multiple functional loci of interest, and indicates that DNA methylation is cell- and region-specific, due to the interplay mechanisms with transcription regulatory activities. We validate our analysis results with the corresponding RNA-sequencing technique for those detected genomic loci. Our results provide novel and meaningful insights for the interplay mechanisms of transcriptional regulation and gene expression for the human cancer and disease studies.

  12. Resurrection of DNA Function In Vivo from an Extinct Genome

    PubMed Central

    Pask, Andrew J.; Behringer, Richard R.; Renfree, Marilyn B.

    2008-01-01

    There is a burgeoning repository of information available from ancient DNA that can be used to understand how genomes have evolved and to determine the genetic features that defined a particular species. To assess the functional consequences of changes to a genome, a variety of methods are needed to examine extinct DNA function. We isolated a transcriptional enhancer element from the genome of an extinct marsupial, the Tasmanian tiger (Thylacinus cynocephalus or thylacine), obtained from 100 year-old ethanol-fixed tissues from museum collections. We then examined the function of the enhancer in vivo. Using a transgenic approach, it was possible to resurrect DNA function in transgenic mice. The results demonstrate that the thylacine Col2A1 enhancer directed chondrocyte-specific expression in this extinct mammalian species in the same way as its orthologue does in mice. While other studies have examined extinct coding DNA function in vitro, this is the first example of the restoration of extinct non-coding DNA and examination of its function in vivo. Our method using transgenesis can be used to explore the function of regulatory and protein-coding sequences obtained from any extinct species in an in vivo model system, providing important insights into gene evolution and diversity. PMID:18493600

  13. Discovery of cyanophage genomes which contain mitochondrial DNA polymerase.

    PubMed

    Chan, Yi-Wah; Mohr, Remus; Millard, Andrew D; Holmes, Antony B; Larkum, Anthony W; Whitworth, Anna L; Mann, Nicholas H; Scanlan, David J; Hess, Wolfgang R; Clokie, Martha R J

    2011-08-01

    DNA polymerase γ is a family A DNA polymerase responsible for the replication of mitochondrial DNA in eukaryotes. The origins of DNA polymerase γ have remained elusive because it is not present in any known bacterium, though it has been hypothesized that mitochondria may have inherited the enzyme by phage-mediated nonorthologous displacement. Here, we present an analysis of two full-length homologues of this gene, which were found in the genomes of two bacteriophages, which infect the chlorophyll-d containing cyanobacterium Acaryochloris marina. Phylogenetic analyses of these phage DNA polymerase γ proteins show that they branch deeply within the DNA polymerase γ clade and therefore share a common origin with their eukaryotic homologues. We also found homologues of these phage polymerases in the environmental Community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis (CAMERA) database, which fell in the same clade. An analysis of the CAMERA assemblies containing the environmental homologues together with the filter fraction metadata indicated some of these assemblies may be of bacterial origin. We also show that the phage-encoded DNA polymerase γ is highly transcribed as the phage genomes are replicated. These findings provide data that may assist in reconstructing the evolution of mitochondria.

  14. An optimized high quality male DNA extraction from spermatophores in open thelycum shrimp species.

    PubMed

    Planella, Laia; Heras, Sandra; Vera, Manuel; García-Marín, José-Luis; Roldán, María Inés

    2017-09-01

    The crucial step of most of the current genetic studies is the extraction of DNA of sufficient quantity and quality. Several genomic DNA isolation methods have been described to successfully obtain male DNA from shrimp species. However, all current protocols require invasive handling methods with males for DNA isolation. Using Aristeus antennatus as a model we tested a reliable non-invasive differential DNA extraction method to male DNA isolation from spermatophores attached to female thelycum. The present protocol provides high quality and quantity DNA for polymerase chain reaction amplification and male genotyping. This new approach could be useful to experimental shrimp culture to select sires with relevant genetic patterns for selective breeding programs. More importantly, it can be applied to identify the mating pairs and male structure in wild populations of species as A. antennatus, where males are often difficult to capture. Our method could be also valuable for biological studies on other spermatophore-using species, such as myriapods, arachnids and insects. © 2016 International Society of Zoological Sciences, Institute of Zoology/Chinese Academy of Sciences and John Wiley & Sons Australia, Ltd.

  15. Isolation from genomic DNA of sequences binding specific regulatory proteins by the acceleration of protein electrophoretic mobility upon DNA binding.

    PubMed

    Subrahmanyam, S; Cronan, J E

    1999-01-21

    We report an efficient and flexible in vitro method for the isolation of genomic DNA sequences that are the binding targets of a given DNA binding protein. This method takes advantage of the fact that binding of a protein to a DNA molecule generally increases the rate of migration of the protein in nondenaturing gel electrophoresis. By the use of a radioactively labeled DNA-binding protein and nonradioactive DNA coupled with PCR amplification from gel slices, we show that specific binding sites can be isolated from Escherichia coli genomic DNA. We have applied this method to isolate a binding site for FadR, a global regulator of fatty acid metabolism in E. coli. We have also isolated a second binding site for BirA, the biotin operon repressor/biotin ligase, from the E. coli genome that has a very low binding efficiency compared with the bio operator region.

  16. Suicidal function of DNA methylation in age-related genome disintegration.

    PubMed

    Mazin, Alexander L

    2009-10-01

    This article is dedicated to the 60th anniversary of 5-methylcytosine discovery in DNA. Cytosine methylation can affect genetic and epigenetic processes, works as a part of the genome-defense system and has mutagenic activity; however, the biological functions of this enzymatic modification are not well understood. This review will put forward the hypothesis that the host-defense role of DNA methylation in silencing and mutational destroying of retroviruses and other intragenomic parasites was extended during evolution to most host genes that have to be inactivated in differentiated somatic cells, where it acquired a new function in age-related self-destruction of the genome. The proposed model considers DNA methylation as the generator of 5mC>T transitions that induce 40-70% of all spontaneous somatic mutations of the multiple classes at CpG and CpNpG sites and flanking nucleotides in the p53, FIX, hprt, gpt human genes and some transgenes. The accumulation of 5mC-dependent mutations explains: global changes in the structure of the vertebrate genome throughout evolution; the loss of most 5mC from the DNA of various species over their lifespan and the Hayflick limit of normal cells; the polymorphism of methylation sites, including asymmetric mCpNpN sites; cyclical changes of methylation and demethylation in genes. The suicidal function of methylation may be a special genetic mechanism for increasing DNA damage and the programmed genome disintegration responsible for cell apoptosis and organism aging and death.

  17. Sequencing and comparative genomic analysis of 1227 Felis catus cDNA sequences enriched for developmental, clinical and nutritional phenotypes

    PubMed Central

    2012-01-01

    Background The feline genome is valuable to the veterinary and model organism genomics communities because the cat is an obligate carnivore and a model for endangered felids. The initial public release of the Felis catus genome assembly provided a framework for investigating the genomic basis of feline biology. However, the entire set of protein coding genes has not been elucidated. Results We identified and characterized 1227 protein coding feline sequences, of which 913 map to public sequences and 314 are novel. These sequences have been deposited into NCBI's genbank database and complement public genomic resources by providing additional protein coding sequences that fill in some of the gaps in the feline genome assembly. Through functional and comparative genomic analyses, we gained an understanding of the role of these sequences in feline development, nutrition and health. Specifically, we identified 104 orthologs of human genes associated with Mendelian disorders. We detected negative selection within sequences with gene ontology annotations associated with intracellular trafficking, cytoskeleton and muscle functions. We detected relatively less negative selection on protein sequences encoding extracellular networks, apoptotic pathways and mitochondrial gene ontology annotations. Additionally, we characterized feline cDNA sequences that have mouse orthologs associated with clinical, nutritional and developmental phenotypes. Together, this analysis provides an overview of the value of our cDNA sequences and enhances our understanding of how the feline genome is similar to, and different from other mammalian genomes. Conclusions The cDNA sequences reported here expand existing feline genomic resources by providing high-quality sequences annotated with comparative genomic information providing functional, clinical, nutritional and orthologous gene information. PMID:22257742

  18. Optimization of cDNA-AFLP experiments using genomic sequence data.

    PubMed

    Kivioja, Teemu; Arvas, Mikko; Saloheimo, Markku; Penttilä, Merja; Ukkonen, Esko

    2005-06-01

    cDNA amplified fragment length polymorphism (cDNA-AFLP) is one of the few genome-wide level expression profiling methods capable of finding genes that have not yet been cloned or even predicted from sequence but have interesting expression patterns under the studied conditions. In cDNA-AFLP, a complex cDNA mixture is divided into small subsets using restriction enzymes and selective PCR. A large cDNA-AFLP experiment can require a substantial amount of resources, such as hundreds of PCR amplifications and gel electrophoresis runs, followed by manual cutting of a large number of bands from the gels. Our aim was to test whether this workload can be reduced by rational design of the experiment. We used the available genomic sequence information to optimize cDNA-AFLP experiments beforehand so that as many transcripts as possible could be profiled with a given amount of resources. Optimization of the selection of both restriction enzymes and selective primers for cDNA-AFLP experiments has not been performed previously. The in silico tests performed suggest that substantial amounts of resources can be saved by the optimization of cDNA-AFLP experiments.

  19. Genome wide approaches to identify protein-DNA interactions.

    PubMed

    Ma, Tao; Ye, Zhenqing; Wang, Liguo

    2018-05-29

    Transcription factors are DNA-binding proteins that play key roles in many fundamental biological processes. Unraveling their interactions with DNA is essential to identify their target genes and understand the regulatory network. Genome-wide identification of their binding sites became feasible thanks to recent progress in experimental and computational approaches. ChIP-chip, ChIP-seq, and ChIP-exo are three widely used techniques to demarcate genome-wide transcription factor binding sites. This review aims to provide an overview of these three techniques including their experiment procedures, computational approaches, and popular analytic tools. ChIP-chip, ChIP-seq, and ChIP-exo have been the major techniques to study genome-wide in vivo protein-DNA interaction. Due to the rapid development of next-generation sequencing technology, array-based ChIP-chip is deprecated and ChIP-seq has become the most widely used technique to identify transcription factor binding sites in genome-wide. The newly developed ChIP-exo further improves the spatial resolution to single nucleotide. Numerous tools have been developed to analyze ChIP-chip, ChIP-seq and ChIP-exo data. However, different programs may employ different mechanisms or underlying algorithms thus each will inherently include its own set of statistical assumption and bias. So choosing the most appropriate analytic program for a given experiment needs careful considerations. Moreover, most programs only have command line interface so their installation and usage will require basic computation expertise in Unix/Linux. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  20. Genome-Wide Motif Statistics are Shaped by DNA Binding Proteins over Evolutionary Time Scales

    NASA Astrophysics Data System (ADS)

    Qian, Long; Kussell, Edo

    The composition of genomes with respect to short DNA motifs impacts the ability of DNA binding proteins to locate and bind their target sites. Since nonfunctional DNA binding can be detrimental to cellular functions and ultimately to organismal fitness, organisms could benefit from reducing the number of nonfunctional binding sites genome wide. Using in vitro measurements of binding affinities for a large collection of DNA binding proteins, in multiple species, we detect a significant global avoidance of weak binding sites in genomes. The underlying evolutionary process leaves a distinct genomic hallmark in that similar words have correlated frequencies, which we detect in all species across domains of life. We hypothesize that natural selection against weak binding sites contributes to this process, and using an evolutionary model we show that the strength of selection needed to maintain global word compositions is on the order of point mutation rates. Alternative contributions may come from interference of protein-DNA binding with replication and mutational repair processes, which operates with similar rates. We conclude that genome-wide word compositions have been molded by DNA binding proteins through tiny evolutionary steps over timescales spanning millions of generations.

  1. An alternative method for cDNA cloning from surrogate eukaryotic cells transfected with the corresponding genomic DNA.

    PubMed

    Hu, Lin-Yong; Cui, Chen-Chen; Song, Yu-Jie; Wang, Xiang-Guo; Jin, Ya-Ping; Wang, Ai-Hua; Zhang, Yong

    2012-07-01

    cDNA is widely used in gene function elucidation and/or transgenics research but often suitable tissues or cells from which to isolate mRNA for reverse transcription are unavailable. Here, an alternative method for cDNA cloning is described and tested by cloning the cDNA of human LALBA (human alpha-lactalbumin) from genomic DNA. First, genomic DNA containing all of the coding exons was cloned from human peripheral blood and inserted into a eukaryotic expression vector. Next, by delivering the plasmids into either 293T or fibroblast cells, surrogate cells were constructed. Finally, the total RNA was extracted from the surrogate cells and cDNA was obtained by RT-PCR. The human LALBA cDNA that was obtained was compared with the corresponding mRNA published in GenBank. The comparison showed that the two sequences were identical. The novel method for cDNA cloning from surrogate eukaryotic cells described here uses well-established techniques that are feasible and simple to use. We anticipate that this alternative method will have widespread applications.

  2. DNA damage checkpoint kinase ATM regulates germination and maintains genome stability in seeds

    PubMed Central

    Waterworth, Wanda M.; Footitt, Steven; Bray, Clifford M.; Finch-Savage, William E.; West, Christopher E.

    2016-01-01

    Genome integrity is crucial for cellular survival and the faithful transmission of genetic information. The eukaryotic cellular response to DNA damage is orchestrated by the DNA damage checkpoint kinases ATAXIA TELANGIECTASIA MUTATED (ATM) and ATM AND RAD3-RELATED (ATR). Here we identify important physiological roles for these sensor kinases in control of seed germination. We demonstrate that double-strand breaks (DSBs) are rate-limiting for germination. We identify that desiccation tolerant seeds exhibit a striking transcriptional DSB damage response during germination, indicative of high levels of genotoxic stress, which is induced following maturation drying and quiescence. Mutant atr and atm seeds are highly resistant to aging, establishing ATM and ATR as determinants of seed viability. In response to aging, ATM delays germination, whereas atm mutant seeds germinate with extensive chromosomal abnormalities. This identifies ATM as a major factor that controls germination in aged seeds, integrating progression through germination with surveillance of genome integrity. Mechanistically, ATM functions through control of DNA replication in imbibing seeds. ATM signaling is mediated by transcriptional control of the cell cycle inhibitor SIAMESE-RELATED 5, an essential factor required for the aging-induced delay to germination. In the soil seed bank, seeds exhibit increased transcript levels of ATM and ATR, with changes in dormancy and germination potential modulated by environmental signals, including temperature and soil moisture. Collectively, our findings reveal physiological functions for these sensor kinases in linking genome integrity to germination, thereby influencing seed quality, crucial for plant survival in the natural environment and sustainable crop production. PMID:27503884

  3. DNA damage checkpoint kinase ATM regulates germination and maintains genome stability in seeds.

    PubMed

    Waterworth, Wanda M; Footitt, Steven; Bray, Clifford M; Finch-Savage, William E; West, Christopher E

    2016-08-23

    Genome integrity is crucial for cellular survival and the faithful transmission of genetic information. The eukaryotic cellular response to DNA damage is orchestrated by the DNA damage checkpoint kinases ATAXIA TELANGIECTASIA MUTATED (ATM) and ATM AND RAD3-RELATED (ATR). Here we identify important physiological roles for these sensor kinases in control of seed germination. We demonstrate that double-strand breaks (DSBs) are rate-limiting for germination. We identify that desiccation tolerant seeds exhibit a striking transcriptional DSB damage response during germination, indicative of high levels of genotoxic stress, which is induced following maturation drying and quiescence. Mutant atr and atm seeds are highly resistant to aging, establishing ATM and ATR as determinants of seed viability. In response to aging, ATM delays germination, whereas atm mutant seeds germinate with extensive chromosomal abnormalities. This identifies ATM as a major factor that controls germination in aged seeds, integrating progression through germination with surveillance of genome integrity. Mechanistically, ATM functions through control of DNA replication in imbibing seeds. ATM signaling is mediated by transcriptional control of the cell cycle inhibitor SIAMESE-RELATED 5, an essential factor required for the aging-induced delay to germination. In the soil seed bank, seeds exhibit increased transcript levels of ATM and ATR, with changes in dormancy and germination potential modulated by environmental signals, including temperature and soil moisture. Collectively, our findings reveal physiological functions for these sensor kinases in linking genome integrity to germination, thereby influencing seed quality, crucial for plant survival in the natural environment and sustainable crop production.

  4. High quality methylome-wide investigations through next-generation sequencing of DNA from a single archived dry blood spot

    PubMed Central

    Aberg, Karolina A.; Xie, Lin Y.; Nerella, Srilaxmi; Copeland, William E.; Costello, E. Jane; van den Oord, Edwin J.C.G.

    2013-01-01

    The potential importance of DNA methylation in the etiology of complex diseases has led to interest in the development of methylome-wide association studies (MWAS) aimed at interrogating all methylation sites in the human genome. When using blood as biomaterial for a MWAS the DNA is typically extracted directly from fresh or frozen whole blood that was collected via venous puncture. However, DNA extracted from dry blood spots may also be an alternative starting material. In the present study, we apply a methyl-CpG binding domain (MBD) protein enrichment-based technique in combination with next generation sequencing (MBD-seq) to assess the methylation status of the ~27 million CpGs in the human autosomal reference genome. We investigate eight methylomes using DNA from blood spots. This data are compared with 1,500 methylomes previously assayed with the same MBD-seq approach using DNA from whole blood. When investigating the sequence quality and the enrichment profile across biological features, we find that DNA extracted from blood spots gives comparable results with DNA extracted from whole blood. Only if the amount of starting material is ≤ 0.5µg DNA we observe a slight decrease in the assay performance. In conclusion, we show that high quality methylome-wide investigations using MBD-seq can be conducted in DNA extracted from archived dry blood spots without sacrificing quality and without bias in enrichment profile as long as the amount of starting material is sufficient. In general, the amount of DNA extracted from a single blood spot is sufficient for methylome-wide investigations with the MBD-seq approach. PMID:23644822

  5. High quality methylome-wide investigations through next-generation sequencing of DNA from a single archived dry blood spot.

    PubMed

    Aberg, Karolina A; Xie, Lin Y; Nerella, Srilaxmi; Copeland, William E; Costello, E Jane; van den Oord, Edwin J C G

    2013-05-01

    The potential importance of DNA methylation in the etiology of complex diseases has led to interest in the development of methylome-wide association studies (MWAS) aimed at interrogating all methylation sites in the human genome. When using blood as biomaterial for a MWAS the DNA is typically extracted directly from fresh or frozen whole blood that was collected via venous puncture. However, DNA extracted from dry blood spots may also be an alternative starting material. In the present study, we apply a methyl-CpG binding domain (MBD) protein enrichment-based technique in combination with next generation sequencing (MBD-seq) to assess the methylation status of the ~27 million CpGs in the human autosomal reference genome. We investigate eight methylomes using DNA from blood spots. This data are compared with 1,500 methylomes previously assayed with the same MBD-seq approach using DNA from whole blood. When investigating the sequence quality and the enrichment profile across biological features, we find that DNA extracted from blood spots gives comparable results with DNA extracted from whole blood. Only if the amount of starting material is ≤ 0.5µg DNA we observe a slight decrease in the assay performance. In conclusion, we show that high quality methylome-wide investigations using MBD-seq can be conducted in DNA extracted from archived dry blood spots without sacrificing quality and without bias in enrichment profile as long as the amount of starting material is sufficient. In general, the amount of DNA extracted from a single blood spot is sufficient for methylome-wide investigations with the MBD-seq approach.

  6. Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

    PubMed

    Duyk, G M; Kim, S W; Myers, R M; Cox, D R

    1990-11-01

    Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons.

  7. Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

    PubMed Central

    Duyk, G M; Kim, S W; Myers, R M; Cox, D R

    1990-01-01

    Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons. PMID:2247475

  8. The logic of DNA replication in double-stranded DNA viruses: insights from global analysis of viral genomes

    PubMed Central

    Kazlauskas, Darius; Krupovic, Mart; Venclovas, Česlovas

    2016-01-01

    Abstract Genomic DNA replication is a complex process that involves multiple proteins. Cellular DNA replication systems are broadly classified into only two types, bacterial and archaeo-eukaryotic. In contrast, double-stranded (ds) DNA viruses feature a much broader diversity of DNA replication machineries. Viruses differ greatly in both completeness and composition of their sets of DNA replication proteins. In this study, we explored whether there are common patterns underlying this extreme diversity. We identified and analyzed all major functional groups of DNA replication proteins in all available proteomes of dsDNA viruses. Our results show that some proteins are common to viruses infecting all domains of life and likely represent components of the ancestral core set. These include B-family polymerases, SF3 helicases, archaeo-eukaryotic primases, clamps and clamp loaders of the archaeo-eukaryotic type, RNase H and ATP-dependent DNA ligases. We also discovered a clear correlation between genome size and self-sufficiency of viral DNA replication, the unanticipated dominance of replicative helicases and pervasive functional associations among certain groups of DNA replication proteins. Altogether, our results provide a comprehensive view on the diversity and evolution of replication systems in the DNA virome and uncover fundamental principles underlying the orchestration of viral DNA replication. PMID:27112572

  9. Attomole-level Genomics with Single-molecule Direct DNA, cDNA and RNA Sequencing Technologies.

    PubMed

    Ozsolak, Fatih

    2016-01-01

    With the introduction of next-generation sequencing (NGS) technologies in 2005, the domination of microarrays in genomics quickly came to an end due to NGS's superior technical performance and cost advantages. By enabling genetic analysis capabilities that were not possible previously, NGS technologies have started to play an integral role in all areas of biomedical research. This chapter outlines the low-quantity DNA and cDNA sequencing capabilities and applications developed with the Helicos single molecule DNA sequencing technology.

  10. [Correlation of genomic DNA methylation level with unexplained early spontaneous abortion].

    PubMed

    Chao, Yuan; Weng, Lidong; Zeng, Rong

    2014-10-01

    To investigate the correlation of genomic DNA methylation level with unexplained early spontaneous abortion and analyze the role of DNMT1, DNMT3A and DNMT3B. Forty-five villus samples from spontaneous abortion cases (with 33 maternal peripheral blood samples) and 44 villus samples from induced abortion (with 34 maternal peripheral blood samples) were examined with high-pressure liquid chromatography (HPLC) to measure the overall methylation level of the genomic DNA. The expressions of DNMT mRNAs were detected using fluorescence quantitative-PCR in the villus samples from 33 induced abortion cases and 30 spontaneous abortion cases. Genomic DNA methylation level was significantly lower in the villus in spontaneous abortion group than in induced abortion group (P<0.01), but similar in the maternal blood samples between the two groups (P>0.05). The mean mRNA expression levels of DNMT1 and DNMT3A in the villus were significantly lower in spontaneous abortion group than in induced abortion group (P<0.05), but DNMT3B expression showed no significant difference between them (P>0.05). Insufficient genomic DNA methylation in the villus does exist in human early spontaneous abortion, and this insufficiency is probably associated with down-regulated expressions of DNMT1 and DNMT3A.

  11. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications

    PubMed Central

    Del Medico, Luca; Christen, Heinz; Christen, Beat

    2017-01-01

    Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner. PMID:28531174

  12. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications.

    PubMed

    Christen, Matthias; Del Medico, Luca; Christen, Heinz; Christen, Beat

    2017-01-01

    Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.

  13. Genome-wide alterations of the DNA replication program during tumor progression

    NASA Astrophysics Data System (ADS)

    Arneodo, A.; Goldar, A.; Argoul, F.; Hyrien, O.; Audit, B.

    2016-08-01

    Oncogenic stress is a major driving force in the early stages of cancer development. Recent experimental findings reveal that, in precancerous lesions and cancers, activated oncogenes may induce stalling and dissociation of DNA replication forks resulting in DNA damage. Replication timing is emerging as an important epigenetic feature that recapitulates several genomic, epigenetic and functional specificities of even closely related cell types. There is increasing evidence that chromosome rearrangements, the hallmark of many cancer genomes, are intimately associated with the DNA replication program and that epigenetic replication timing changes often precede chromosomic rearrangements. The recent development of a novel methodology to map replication fork polarity using deep sequencing of Okazaki fragments has provided new and complementary genome-wide replication profiling data. We review the results of a wavelet-based multi-scale analysis of genomic and epigenetic data including replication profiles along human chromosomes. These results provide new insight into the spatio-temporal replication program and its dynamics during differentiation. Here our goal is to bring to cancer research, the experimental protocols and computational methodologies for replication program profiling, and also the modeling of the spatio-temporal replication program. To illustrate our purpose, we report very preliminary results obtained for the chronic myelogeneous leukemia, the archetype model of cancer. Finally, we discuss promising perspectives on using genome-wide DNA replication profiling as a novel efficient tool for cancer diagnosis, prognosis and personalized treatment.

  14. Insights on genome size evolution from a miniature inverted repeat transposon driving a satellite DNA.

    PubMed

    Scalvenzi, Thibault; Pollet, Nicolas

    2014-12-01

    The genome size in eukaryotes does not correlate well with the number of genes they contain. We can observe this so-called C-value paradox in amphibian species. By analyzing an amphibian genome we asked how repetitive DNA can impact genome size and architecture. We describe here our discovery of a Tc1/mariner miniature inverted-repeat transposon family present in Xenopus frogs. These transposons named miDNA4 are unique since they contain a satellite DNA motif. We found that miDNA4 measured 331 bp, contained 25 bp long inverted terminal repeat sequences and a sequence motif of 119 bp present as a unique copy or as an array of 2-47 copies. We characterized the structure, dynamics, impact and evolution of the miDNA4 family and its satellite DNA in Xenopus frog genomes. This led us to propose a model for the evolution of these two repeated sequences and how they can synergize to increase genome size. Copyright © 2014 Elsevier Inc. All rights reserved.

  15. Toxicological effects of benzo[a]pyrene on DNA methylation of whole genome in ICR mice.

    PubMed

    Zhao, L; Zhang, S; An, X; Tan, W; Pang, D; Ouyang, H

    2015-10-30

    It has been well known that alterations in DNA methylation - an important regulator of gene transcription - lead to cancer. Therefore a change in the level of DNA methylation of whole genome has been considered as a biomarker of carcinogenesis. Previously, a large number of experimental results in genetic toxicology have showed that benzo[a]pyrene could cause DNA mutation and fragmentation. However, there was little to no studies on alterations in DNA methylation of genome directly result from exposure to benzo[a]pyrene. In this paper, possible mechanisms of alterations in whole genomic DNA methylation by benzo[a]pyrene were investigated using ICR mice after benzo[a]pyrene exposure. The blood, liver, pancreas, skin, lung and bladder of ICR mice were removed and checked after a fixed time interval (6 hours) of benzo[a]pyrene exposure, and whole genomic DNA methylation level was determined by high performance liquid chromatography (HPLC). The results exhibited tissue specificity, that is, the level of whole genomic DNA methylation decreases significantly in blood and liver, rather than pancreas, lung, skin and bladder of ICR mice. This study investigated the direct relationship between aberrant DNA methylation level and benzo[a]pyrene exposure, which might be helpful to clarify the toxicological mechanism of benzo[a]pyrene in epigenetic perspectives.

  16. Genome-Wide Motif Statistics are Shaped by DNA Binding Proteins over Evolutionary Time Scales

    NASA Astrophysics Data System (ADS)

    Qian, Long; Kussell, Edo

    2016-10-01

    The composition of a genome with respect to all possible short DNA motifs impacts the ability of DNA binding proteins to locate and bind their target sites. Since nonfunctional DNA binding can be detrimental to cellular functions and ultimately to organismal fitness, organisms could benefit from reducing the number of nonfunctional DNA binding sites genome wide. Using in vitro measurements of binding affinities for a large collection of DNA binding proteins, in multiple species, we detect a significant global avoidance of weak binding sites in genomes. We demonstrate that the underlying evolutionary process leaves a distinct genomic hallmark in that similar words have correlated frequencies, a signal that we detect in all species across domains of life. We consider the possibility that natural selection against weak binding sites contributes to this process, and using an evolutionary model we show that the strength of selection needed to maintain global word compositions is on the order of point mutation rates. Likewise, we show that evolutionary mechanisms based on interference of protein-DNA binding with replication and mutational repair processes could yield similar results and operate with similar rates. On the basis of these modeling and bioinformatic results, we conclude that genome-wide word compositions have been molded by DNA binding proteins acting through tiny evolutionary steps over time scales spanning millions of generations.

  17. RNA-dependent DNA endonuclease Cas9 of the CRISPR system: Holy Grail of genome editing?

    PubMed

    Gasiunas, Giedrius; Siksnys, Virginijus

    2013-11-01

    Tailor-made nucleases for precise genome modification, such as zinc finger or TALE nucleases, currently represent the state-of-the-art for genome editing. These nucleases combine a programmable protein module which guides the enzyme to the target site with a nuclease domain which cuts DNA at the addressed site. Reprogramming of these nucleases to cut genomes at specific locations requires major protein engineering efforts. RNA-guided DNA endonuclease Cas9 of the type II (clustered regularly interspaced short palindromic repeat) CRISPR-Cas system uses CRISPR RNA (crRNA) as a guide to locate the DNA target and the Cas9 protein to cut DNA. Easy programmability of the Cas9 endonuclease using customizable RNAs brings unprecedented flexibility and versatility for targeted genome modification. We highlight the potential of the Cas9 RNA-guided DNA endonuclease as a novel tool for genome surgery, and discuss possible constraints and future prospects. Copyright © 2013 Elsevier Ltd. All rights reserved.

  18. Rapid discrimination of sequences flanking and within T-DNA insertions in the Arabidopsis genome.

    PubMed

    Ponce, M R; Quesada, V; Micol, J L

    1998-05-01

    An improvement to previous methods for recovering Arabidopsis thaliana genomic DNA flanking T-DNA insertions is presented that allows for the avoidance of some of the cloning difficulties caused by the concatameric nature of T-DNA inserts. The principle of the procedure is to categorize by size restriction fragments of mutant DNA, produced in separate digestions with NdeI and Bst1107I. Given that the sites for these two enzymes are contiguous within the pGV3850:1003 T-DNA construct, the restriction fragments obtained fall into two categories: those showing identical size in both digestions, which correspond to sequences internal to T-DNA concatamers; and those of different sizes, that contain the junctions between plant DNA and the T-DNA insert. Such a criterion makes it possible to easily distinguish the digestion products corresponding to internal T-DNA parts, which do not deserve further attention, and those which presumably include a segment of the locus of interest. Discrimination between restriction fragments of genomic mutant DNA can be made on rescued plasmids, inverse PCR amplification products or bands in a genomic blot.

  19. Pairagon: a highly accurate, HMM-based cDNA-to-genome aligner.

    PubMed

    Lu, David V; Brown, Randall H; Arumugam, Manimozhiyan; Brent, Michael R

    2009-07-01

    The most accurate way to determine the intron-exon structures in a genome is to align spliced cDNA sequences to the genome. Thus, cDNA-to-genome alignment programs are a key component of most annotation pipelines. The scoring system used to choose the best alignment is a primary determinant of alignment accuracy, while heuristics that prevent consideration of certain alignments are a primary determinant of runtime and memory usage. Both accuracy and speed are important considerations in choosing an alignment algorithm, but scoring systems have received much less attention than heuristics. We present Pairagon, a pair hidden Markov model based cDNA-to-genome alignment program, as the most accurate aligner for sequences with high- and low-identity levels. We conducted a series of experiments testing alignment accuracy with varying sequence identity. We first created 'perfect' simulated cDNA sequences by splicing the sequences of exons in the reference genome sequences of fly and human. The complete reference genome sequences were then mutated to various degrees using a realistic mutation simulator and the perfect cDNAs were aligned to them using Pairagon and 12 other aligners. To validate these results with natural sequences, we performed cross-species alignment using orthologous transcripts from human, mouse and rat. We found that aligner accuracy is heavily dependent on sequence identity. For sequences with 100% identity, Pairagon achieved accuracy levels of >99.6%, with one quarter of the errors of any other aligner. Furthermore, for human/mouse alignments, which are only 85% identical, Pairagon achieved 87% accuracy, higher than any other aligner. Pairagon source and executables are freely available at http://mblab.wustl.edu/software/pairagon/

  20. Generation of Leishmania Hybrids by Whole Genomic DNA Transformation

    PubMed Central

    Coelho, Adriano C.; Leprohon, Philippe; Ouellette, Marc

    2012-01-01

    Genetic exchange is a powerful tool to study gene function in microorganisms. Here, we tested the feasibility of generating Leishmania hybrids by electroporating genomic DNA of donor cells into recipient Leishmania parasites. The donor DNA was marked with a drug resistance marker facilitating the selection of DNA transfer into the recipient cells. The transferred DNA was integrated exclusively at homologous locus and was as large as 45 kb. The independent generation of L. infantum hybrids with L. major sequences was possible for several chromosomal regions. Interfering with the mismatch repair machinery by inactivating the MSH2 gene enabled an increased efficiency of recombination between divergent sequences, hence favouring the selection of hybrids between species. Hybrids were shown to acquire the phenotype derived from the donor cells, as demonstrated for the transfer of drug resistance genes from L. major into L. infantum. The described method is a first step allowing the generation of in vitro hybrids for testing gene functions in a natural genomic context in the parasite Leishmania. PMID:23029579

  1. [Evaluation of 3 methods of DNA extraction from paraffin-embedded material for the amplification of genomic DNA using PCR].

    PubMed

    Mesquita, R A; Anzai, E K; Oliveira, R N; Nunes, F D

    2001-01-01

    There are several protocols reported in the literature for the extraction of genomic DNA from formalin-fixed paraffin-embedded samples. Genomic DNA is utilized in molecular analyses, including PCR. This study compares three different methods for the extraction of genomic DNA from formalin-fixed paraffin-embedded (inflammatory fibrous hyperplasia) and non-formalin-fixed (normal oral mucosa) samples: phenol with enzymatic digestion, and silica with and without enzymatic digestion. The amplification of DNA by means of the PCR technique was carried out with primers for the exon 7 of human keratin type 14. Amplicons were analyzed by means of electrophoresis in an 8% polyacrylamide gel with 5% glycerol, followed by silver-staining visualization. The phenol/enzymatic digestion and the silica/enzymatic digestion methods provided amplicons from both tissue samples. The method described is a potential aid in the establishment of the histopathologic diagnosis and in retrospective studies with archival paraffin-embedded samples.

  2. RICD: a rice indica cDNA database resource for rice functional genomics.

    PubMed

    Lu, Tingting; Huang, Xuehui; Zhu, Chuanrang; Huang, Tao; Zhao, Qiang; Xie, Kabing; Xiong, Lizhong; Zhang, Qifa; Han, Bin

    2008-11-26

    The Oryza sativa L. indica subspecies is the most widely cultivated rice. During the last few years, we have collected over 20,000 putative full-length cDNAs and over 40,000 ESTs isolated from various cDNA libraries of two indica varieties Guangluai 4 and Minghui 63. A database of the rice indica cDNAs was therefore built to provide a comprehensive web data source for searching and retrieving the indica cDNA clones. Rice Indica cDNA Database (RICD) is an online MySQL-PHP driven database with a user-friendly web interface. It allows investigators to query the cDNA clones by keyword, genome position, nucleotide or protein sequence, and putative function. It also provides a series of information, including sequences, protein domain annotations, similarity search results, SNPs and InDels information, and hyperlinks to gene annotation in both The Rice Annotation Project Database (RAP-DB) and The TIGR Rice Genome Annotation Resource, expression atlas in RiceGE and variation report in Gramene of each cDNA. The online rice indica cDNA database provides cDNA resource with comprehensive information to researchers for functional analysis of indica subspecies and for comparative genomics. The RICD database is available through our website http://www.ncgr.ac.cn/ricd.

  3. Genomics DNA Profiling in Elite Professional Soccer Players: A Pilot Study

    PubMed Central

    Kambouris, M; Del Buono, A; Maffulli, N

    2014-01-01

    Functional variants in exonic regions have been associated with development of cardiovascular disease, diabetes and cancer. Athletic performance can be considered a multi-factorial complex phenotype. Genomic DNA was extracted from buccal swabs of seven soccer players from the Fulham football team. Single nucleotide polymorphism (SNPs) genotyping was undertaken. To achieve optimal athletic performance, predictive genomics DNA profiling for sports performance can be used to aid in sport selection and elaboration of personalized training and nutrition programs. Predictive DNA profiling may be able to detect athletes with potential or frank injuries, or screening and selection of future athletes, and can help them to maximize utilization of their potential and improve performance in sports. The aim of this study is to provide a wide scenario of specific genomic variants that an athlete carries, to implement which measures should be taken to maximize the athlete’s potential. PMID:24809029

  4. WordCluster: detecting clusters of DNA words and genomic elements

    PubMed Central

    2011-01-01

    Background Many k-mers (or DNA words) and genomic elements are known to be spatially clustered in the genome. Well established examples are the genes, TFBSs, CpG dinucleotides, microRNA genes and ultra-conserved non-coding regions. Currently, no algorithm exists to find these clusters in a statistically comprehensible way. The detection of clustering often relies on densities and sliding-window approaches or arbitrarily chosen distance thresholds. Results We introduce here an algorithm to detect clusters of DNA words (k-mers), or any other genomic element, based on the distance between consecutive copies and an assigned statistical significance. We implemented the method into a web server connected to a MySQL backend, which also determines the co-localization with gene annotations. We demonstrate the usefulness of this approach by detecting the clusters of CAG/CTG (cytosine contexts that can be methylated in undifferentiated cells), showing that the degree of methylation vary drastically between inside and outside of the clusters. As another example, we used WordCluster to search for statistically significant clusters of olfactory receptor (OR) genes in the human genome. Conclusions WordCluster seems to predict biological meaningful clusters of DNA words (k-mers) and genomic entities. The implementation of the method into a web server is available at http://bioinfo2.ugr.es/wordCluster/wordCluster.php including additional features like the detection of co-localization with gene regions or the annotation enrichment tool for functional analysis of overlapped genes. PMID:21261981

  5. The logic of DNA replication in double-stranded DNA viruses: insights from global analysis of viral genomes.

    PubMed

    Kazlauskas, Darius; Krupovic, Mart; Venclovas, Česlovas

    2016-06-02

    Genomic DNA replication is a complex process that involves multiple proteins. Cellular DNA replication systems are broadly classified into only two types, bacterial and archaeo-eukaryotic. In contrast, double-stranded (ds) DNA viruses feature a much broader diversity of DNA replication machineries. Viruses differ greatly in both completeness and composition of their sets of DNA replication proteins. In this study, we explored whether there are common patterns underlying this extreme diversity. We identified and analyzed all major functional groups of DNA replication proteins in all available proteomes of dsDNA viruses. Our results show that some proteins are common to viruses infecting all domains of life and likely represent components of the ancestral core set. These include B-family polymerases, SF3 helicases, archaeo-eukaryotic primases, clamps and clamp loaders of the archaeo-eukaryotic type, RNase H and ATP-dependent DNA ligases. We also discovered a clear correlation between genome size and self-sufficiency of viral DNA replication, the unanticipated dominance of replicative helicases and pervasive functional associations among certain groups of DNA replication proteins. Altogether, our results provide a comprehensive view on the diversity and evolution of replication systems in the DNA virome and uncover fundamental principles underlying the orchestration of viral DNA replication. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Peculiarities of RFLP of highly repetitive DNA in crow genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chelomina, G.N.; Kryukov, A.P.; Ivanov, S.V.

    1995-02-01

    We present a study of the structural organization of highly repetitive DNA in genomes of hooded crow Corvus cornix L., carrion crow C. corone L., and jungle crow C. macrorhynchos Wagl. RFLP and blot-hybridization with {sup 32}P-labeled Msp I fragment from hooded crow nDNA suggest the interspecific structural conservatism of the most repetitive DNA. The family of repeats we studied had tandem organization and the same (210 bp) period of reiteration for a set of restriction enzymes. However, in parallel to the general similarity of restriction patterns, there are species-specific peculiarities. The repetitive family revealed (Alu I, BsuR I, andmore » Msp I fragments) has quantitative RFLP of nDNA and interspecific differences in the extent of the multimer {open_quotes}ladder{close_quotes} pattern of Msp I fragments. The latter is more pronounced in nDNA of carrion crow than in that of phylogenetically distant jungle crow and closely related hooded crow. This suggests a recent amplification event for highly organized homological repeats in crow genomes. 10 refs., 2 figs.« less

  7. Origins of DNA Replication and Amplification in the Breast Cancer Genome

    DTIC Science & Technology

    2011-09-01

    AD_________________ Award Number: W81XWH-10-1-0463 TITLE: Origins of DNA Replication and...hypothesis we need to map origins of DNA replication in the genome and ask which of these coincide with sites of DNA amplification and with ER...Spring Harbor DNA Replication meetings this summer/earlyfall. Figures from the posters and also the abstracts are attached. The samples have been

  8. Company profile: Complete Genomics Inc.

    PubMed

    Reid, Clifford

    2011-02-01

    Complete Genomics Inc. is a life sciences company that focuses on complete human genome sequencing. It is taking a completely different approach to DNA sequencing than other companies in the industry. Rather than building a general-purpose platform for sequencing all organisms and all applications, it has focused on a single application - complete human genome sequencing. The company's Complete Genomics Analysis Platform (CGA™ Platform) comprises an integrated package of biochemistry, instrumentation and software that sequences human genomes at the highest quality, lowest cost and largest scale available. Complete Genomics offers a turnkey service that enables customers to outsource their human genome sequencing to the company's genome sequencing center in Mountain View, CA, USA. Customers send in their DNA samples, the company does all the library preparation, DNA sequencing, assembly and variant analysis, and customers receive research-ready data that they can use for biological discovery.

  9. BuD, a helix–loop–helix DNA-binding domain for genome modification

    PubMed Central

    Stella, Stefano; Molina, Rafael; López-Méndez, Blanca; Juillerat, Alexandre; Bertonati, Claudia; Daboussi, Fayza; Campos-Olivas, Ramon; Duchateau, Phillippe; Montoya, Guillermo

    2014-01-01

    DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-binding domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing. PMID:25004980

  10. DNA Editing of LTR Retrotransposons Reveals the Impact of APOBECs on Vertebrate Genomes

    PubMed Central

    Knisbacher, Binyamin A.; Levanon, Erez Y.

    2016-01-01

    Long terminal repeat retrotransposons (LTR) are widespread in vertebrates and their dynamism facilitates genome evolution. However, these endogenous retroviruses (ERVs) must be restricted to maintain genomic stability. The APOBECs, a protein family that can edit C-to-U in DNA, do so by interfering with reverse transcription and hypermutating retrotransposon DNA. In some cases, a retrotransposon may integrate into the genome despite being hypermutated. Such an event introduces a unique sequence into the genome, increasing retrotransposon diversity and the probability of developing new function at the locus of insertion. The prevalence of this phenomenon and its effects on vertebrate genomes are still unclear. In this study, we screened ERV sequences in the genomes of 123 diverse species and identified hundreds of thousands of edited sites in multiple vertebrate lineages, including placental mammals, marsupials, and birds. Numerous edited ERVs carry high mutation loads, some with greater than 350 edited sites, profoundly damaging their open-reading frames. For many of the species studied, this is the first evidence that APOBECs are active players in their innate immune system. Unexpectedly, some birds and especially zebra finch and medium ground-finch (one of Darwin’s finches) are exceptionally enriched in DNA editing. We demonstrate that edited retrotransposons may be preferentially retained in active genomic regions, as reflected from their enrichment in genes, exons, promoters, and transcription start sites, thereby raising the probability of their exaptation for novel function. In conclusion, DNA editing of retrotransposons by APOBECs has a substantial role in vertebrate innate immunity and may boost genome evolution. PMID:26541172

  11. Measuring the Levels of Ribonucleotides Embedded in Genomic DNA.

    PubMed

    Meroni, Alice; Nava, Giulia M; Sertic, Sarah; Plevani, Paolo; Muzi-Falconi, Marco; Lazzaro, Federico

    2018-01-01

    Ribonucleotides (rNTPs) are incorporated into genomic DNA at a relatively high frequency during replication. They have beneficial effects but, if not removed from the chromosomes, increase genomic instability. Here, we describe a fast method to easily estimate the amounts of embedded ribonucleotides into the genome. The protocol described is performed in Saccharomyces cerevisiae and allows us to quantify altered levels of rNMPs due to different mutations in the replicative polymerase ε. However, this protocol can be easily applied to cells derived from any organism.

  12. Plasma DNA aberrations in systemic lupus erythematosus revealed by genomic and methylomic sequencing

    PubMed Central

    Chan, Rebecca W. Y.; Jiang, Peiyong; Peng, Xianlu; Tam, Lai-Shan; Liao, Gary J. W.; Li, Edmund K. M.; Wong, Priscilla C. H.; Sun, Hao; Chan, K. C. Allen; Chiu, Rossa W. K.; Lo, Y. M. Dennis

    2014-01-01

    We performed a high-resolution analysis of the biological characteristics of plasma DNA in systemic lupus erythematosus (SLE) patients using massively parallel genomic and methylomic sequencing. A number of plasma DNA abnormalities were found. First, aberrations in measured genomic representations (MGRs) were identified in the plasma DNA of SLE patients. The extent of the aberrations in MGRs correlated with anti-double–stranded DNA (anti-dsDNA) antibody level. Second, the plasma DNA of active SLE patients exhibited skewed molecular size-distribution profiles with a significantly increased proportion of short DNA fragments. The extent of plasma DNA shortening in SLE patients correlated with the SLE disease activity index (SLEDAI) and anti-dsDNA antibody level. Third, the plasma DNA of active SLE patients showed decreased methylation densities. The extent of hypomethylation correlated with SLEDAI and anti-dsDNA antibody level. To explore the impact of anti-dsDNA antibody on plasma DNA in SLE, a column-based protein G capture approach was used to fractionate the IgG-bound and non–IgG-bound DNA in plasma. Compared with healthy individuals, SLE patients had higher concentrations of IgG-bound DNA in plasma. More IgG binding occurs at genomic locations showing increased MGRs. Furthermore, the IgG-bound plasma DNA was shorter in size and more hypomethylated than the non–IgG-bound plasma DNA. These observations have enhanced our understanding of the spectrum of plasma DNA aberrations in SLE and may provide new molecular markers for SLE. Our results also suggest that caution should be exercised when interpreting plasma DNA-based noninvasive prenatal testing and cancer testing conducted for SLE patients. PMID:25427797

  13. Detecting DNA double-stranded breaks in mammalian genomes by linear amplification-mediated high-throughput genome-wide translocation sequencing.

    PubMed

    Hu, Jiazhi; Meyers, Robin M; Dong, Junchao; Panchakshari, Rohit A; Alt, Frederick W; Frock, Richard L

    2016-05-01

    Unbiased, high-throughput assays for detecting and quantifying DNA double-stranded breaks (DSBs) across the genome in mammalian cells will facilitate basic studies of the mechanisms that generate and repair endogenous DSBs. They will also enable more applied studies, such as those to evaluate the on- and off-target activities of engineered nucleases. Here we describe a linear amplification-mediated high-throughput genome-wide sequencing (LAM-HTGTS) method for the detection of genome-wide 'prey' DSBs via their translocation in cultured mammalian cells to a fixed 'bait' DSB. Bait-prey junctions are cloned directly from isolated genomic DNA using LAM-PCR and unidirectionally ligated to bridge adapters; subsequent PCR steps amplify the single-stranded DNA junction library in preparation for Illumina Miseq paired-end sequencing. A custom bioinformatics pipeline identifies prey sequences that contribute to junctions and maps them across the genome. LAM-HTGTS differs from related approaches because it detects a wide range of broken end structures with nucleotide-level resolution. Familiarity with nucleic acid methods and next-generation sequencing analysis is necessary for library generation and data interpretation. LAM-HTGTS assays are sensitive, reproducible, relatively inexpensive, scalable and straightforward to implement with a turnaround time of <1 week.

  14. Substitutions of short heterologous DNA segments of intragenomic or extragenomic origins produce clustered genomic polymorphisms

    PubMed Central

    Harms, Klaus; Lunnan, Asbjørn; Hülter, Nils; Mourier, Tobias; Vinner, Lasse; Andam, Cheryl P.; Marttinen, Pekka; Fridholm, Helena; Hansen, Anders Johannes; Hanage, William P.; Nielsen, Kaare Magne; Willerslev, Eske; Johnsen, Pål Jarle

    2016-01-01

    In a screen for unexplained mutation events we identified a previously unrecognized mechanism generating clustered DNA polymorphisms such as microindels and cumulative SNPs. The mechanism, short-patch double illegitimate recombination (SPDIR), facilitates short single-stranded DNA molecules to invade and replace genomic DNA through two joint illegitimate recombination events. SPDIR is controlled by key components of the cellular genome maintenance machinery in the gram-negative bacterium Acinetobacter baylyi. The source DNA is primarily intragenomic but can also be acquired through horizontal gene transfer. The DNA replacements are nonreciprocal and locus independent. Bioinformatic approaches reveal occurrence of SPDIR events in the gram-positive human pathogen Streptococcus pneumoniae and in the human genome. PMID:27956618

  15. A direct detection of Escherichia coli genomic DNA using gold nanoprobes

    PubMed Central

    2012-01-01

    Background In situation like diagnosis of clinical and forensic samples there exists a need for highly sensitive, rapid and specific DNA detection methods. Though conventional DNA amplification using PCR can provide fast results, it is not widely practised in diagnostic laboratories partially because it requires skilled personnel and expensive equipment. To overcome these limitations nanoparticles have been explored as signalling probes for ultrasensitive DNA detection that can be used in field applications. Among the nanomaterials, gold nanoparticles (AuNPs) have been extensively used mainly because of its optical property and ability to get functionalized with a variety of biomolecules. Results We report a protocol for the use of gold nanoparticles functionalized with single stranded oligonucleotide (AuNP- oligo probe) as visual detection probes for rapid and specific detection of Escherichia coli. The AuNP- oligo probe on hybridization with target DNA containing complementary sequences remains red whereas test samples without complementary DNA sequences to the probe turns purple due to acid induced aggregation of AuNP- oligo probes. The color change of the solution is observed visually by naked eye demonstrating direct and rapid detection of the pathogenic Escherichia coli from its genomic DNA without the need for PCR amplification. The limit of detection was ~54 ng for unamplified genomic DNA. The method requires less than 30 minutes to complete after genomic DNA extraction. However, by using unamplified enzymatic digested genomic DNA, the detection limit of 11.4 ng was attained. Results of UV-Vis spectroscopic measurement and AFM imaging further support the hypothesis of aggregation based visual discrimination. To elucidate its utility in medical diagnostic, the assay was validated on clinical strains of pathogenic Escherichia coli obtained from local hospitals and spiked urine samples. It was found to be 100% sensitive and proves to be highly specific without

  16. Divergent genome evolution caused by regional variation in DNA gain and loss between human and mouse

    PubMed Central

    Kortschak, R. Daniel

    2018-01-01

    The forces driving the accumulation and removal of non-coding DNA and ultimately the evolution of genome size in complex organisms are intimately linked to genome structure and organisation. Our analysis provides a novel method for capturing the regional variation of lineage-specific DNA gain and loss events in their respective genomic contexts. To further understand this connection we used comparative genomics to identify genome-wide individual DNA gain and loss events in the human and mouse genomes. Focusing on the distribution of DNA gains and losses, relationships to important structural features and potential impact on biological processes, we found that in autosomes, DNA gains and losses both followed separate lineage-specific accumulation patterns. However, in both species chromosome X was particularly enriched for DNA gain, consistent with its high L1 retrotransposon content required for X inactivation. We found that DNA loss was associated with gene-rich open chromatin regions and DNA gain events with gene-poor closed chromatin regions. Additionally, we found that DNA loss events tended to be smaller than DNA gain events suggesting that they were able to accumulate in gene-rich open chromatin regions due to their reduced capacity to interrupt gene regulatory architecture. GO term enrichment showed that mouse loss hotspots were strongly enriched for terms related to developmental processes. However, these genes were also located in regions with a high density of conserved elements, suggesting that despite high levels of DNA loss, gene regulatory architecture remained conserved. This is consistent with a model in which DNA gain and loss results in turnover or “churning” in regulatory element dense regions of open chromatin, where interruption of regulatory elements is selected against. PMID:29677183

  17. Links between DNA methylation and nucleosome occupancy in the human genome.

    PubMed

    Collings, Clayton K; Anderson, John N

    2017-01-01

    DNA methylation is an epigenetic modification that is enriched in heterochromatin but depleted at active promoters and enhancers. However, the debate on whether or not DNA methylation is a reliable indicator of high nucleosome occupancy has not been settled. For example, the methylation levels of DNA flanking CTCF sites are higher in linker DNA than in nucleosomal DNA, while other studies have shown that the nucleosome core is the preferred site of methylation. In this study, we make progress toward understanding these conflicting phenomena by implementing a bioinformatics approach that combines MNase-seq and NOMe-seq data and by comprehensively profiling DNA methylation and nucleosome occupancy throughout the human genome. The results demonstrated that increasing methylated CpG density is correlated with nucleosome occupancy in the total genome and within nearly all subgenomic regions. Features with elevated methylated CpG density such as exons, SINE-Alu sequences, H3K36-trimethylated peaks, and methylated CpG islands are among the highest nucleosome occupied elements in the genome, while some of the lowest occupancies are displayed by unmethylated CpG islands and unmethylated transcription factor binding sites. Additionally, outside of CpG islands, the density of CpGs within nucleosomes was shown to be important for the nucleosomal location of DNA methylation with low CpG frequencies favoring linker methylation and high CpG frequencies favoring core particle methylation. Prominent exceptions to the correlations between methylated CpG density and nucleosome occupancy include CpG islands marked by H3K27me3 and CpG-poor heterochromatin marked by H3K9me3, and these modifications, along with DNA methylation, distinguish the major silencing mechanisms of the human epigenome. Thus, the relationship between DNA methylation and nucleosome occupancy is influenced by the density of methylated CpG dinucleotides and by other epigenomic components in chromatin.

  18. DNA forms of arboviral RNA genomes are generated following infection in mosquito cell cultures.

    PubMed

    Nag, Dilip K; Brecher, Matthew; Kramer, Laura D

    2016-11-01

    Although infections of vertebrate hosts by arthropod-borne viruses may lead to pathogenic outcomes, infections of vector mosquitoes result in persistent infections, where the virus replicates in the host without causing apparent pathological effects. It is unclear how persistent infections are established and maintained in mosquitoes. Several reports revealed the presence of flavivirus-like DNA sequences in the mosquito genome, and recent studies have shown that DNA forms of RNA viruses restrict virus replication in Drosophila, suggesting that DNA forms may have a role in developing persistent infections. Here, we sought to investigate whether arboviruses generate DNA forms following infection in mosquitoes. Our results with West Nile, Dengue, and La Crosse viruses demonstrate that DNA forms of the viral RNA genome are generated in mosquito cells; however, not the entire viral genome, but patches of viral RNA in DNA forms can be detected 24h post infection. Copyright © 2016 Elsevier Inc. All rights reserved.

  19. A Portrait of Ribosomal DNA Contacts with Hi-C Reveals 5S and 45S rDNA Anchoring Points in the Folded Human Genome.

    PubMed

    Yu, Shoukai; Lemos, Bernardo

    2016-12-31

    Ribosomal RNAs (rRNAs) account for >60% of all RNAs in eukaryotic cells and are encoded in the ribosomal DNA (rDNA) arrays. The rRNAs are produced from two sets of loci: the 5S rDNA array resides exclusively on human chromosome 1, whereas the 45S rDNA array resides on the short arm of five human acrocentric chromosomes. The 45S rDNA gives origin to the nucleolus, the nuclear organelle that is the site of ribosome biogenesis. Intriguingly, 5S and 45S rDNA arrays exhibit correlated copy number variation in lymphoblastoid cells (LCLs). Here we examined the genomic architecture and repeat content of the 5S and 45S rDNA arrays in multiple human genome assemblies (including PacBio MHAP assembly) and ascertained contacts between the rDNA arrays and the rest of the genome using Hi-C datasets from two human cell lines (erythroleukemia K562 and lymphoblastoid cells). Our analyses revealed that 5S and 45S arrays each have thousands of contacts in the folded genome, with rDNA-associated regions and genes dispersed across all chromosomes. The rDNA contact map displayed conserved and disparate features between two cell lines, and pointed to specific chromosomes, genomic regions, and genes with evidence of spatial proximity to the rDNA arrays; the data also showed a lack of direct physical interaction between the 5S and 45S rDNA arrays. Finally, the analysis identified an intriguing organization in the 5S array with Alu and 5S elements adjacent to one another and organized in opposite orientation along the array. Portraits of genome folding centered on the ribosomal DNA array could help understand the emergence of concerted variation, the control of 5S and 45S expression, as well as provide insights into an organelle that contributes to the spatial localization of human chromosomes during interphase. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. A Portrait of Ribosomal DNA Contacts with Hi-C Reveals 5S and 45S rDNA Anchoring Points in the Folded Human Genome

    PubMed Central

    Yu, Shoukai; Lemos, Bernardo

    2016-01-01

    Ribosomal RNAs (rRNAs) account for >60% of all RNAs in eukaryotic cells and are encoded in the ribosomal DNA (rDNA) arrays. The rRNAs are produced from two sets of loci: the 5S rDNA array resides exclusively on human chromosome 1, whereas the 45S rDNA array resides on the short arm of five human acrocentric chromosomes. The 45S rDNA gives origin to the nucleolus, the nuclear organelle that is the site of ribosome biogenesis. Intriguingly, 5S and 45S rDNA arrays exhibit correlated copy number variation in lymphoblastoid cells (LCLs). Here we examined the genomic architecture and repeat content of the 5S and 45S rDNA arrays in multiple human genome assemblies (including PacBio MHAP assembly) and ascertained contacts between the rDNA arrays and the rest of the genome using Hi-C datasets from two human cell lines (erythroleukemia K562 and lymphoblastoid cells). Our analyses revealed that 5S and 45S arrays each have thousands of contacts in the folded genome, with rDNA-associated regions and genes dispersed across all chromosomes. The rDNA contact map displayed conserved and disparate features between two cell lines, and pointed to specific chromosomes, genomic regions, and genes with evidence of spatial proximity to the rDNA arrays; the data also showed a lack of direct physical interaction between the 5S and 45S rDNA arrays. Finally, the analysis identified an intriguing organization in the 5S array with Alu and 5S elements adjacent to one another and organized in opposite orientation along the array. Portraits of genome folding centered on the ribosomal DNA array could help understand the emergence of concerted variation, the control of 5S and 45S expression, as well as provide insights into an organelle that contributes to the spatial localization of human chromosomes during interphase. PMID:27797956

  1. Comprehensive analysis of DNA polymerase III α subunits and their homologs in bacterial genomes

    PubMed Central

    Timinskas, Kęstutis; Balvočiūtė, Monika; Timinskas, Albertas; Venclovas, Česlovas

    2014-01-01

    The analysis of ∼2000 bacterial genomes revealed that they all, without a single exception, encode one or more DNA polymerase III α-subunit (PolIIIα) homologs. Classified into C-family of DNA polymerases they come in two major forms, PolC and DnaE, related by ancient duplication. While PolC represents an evolutionary compact group, DnaE can be further subdivided into at least three groups (DnaE1-3). We performed an extensive analysis of various sequence, structure and surface properties of all four polymerase groups. Our analysis suggests a specific evolutionary pathway leading to PolC and DnaE from the last common ancestor and reveals important differences between extant polymerase groups. Among them, DnaE1 and PolC show the highest conservation of the analyzed properties. DnaE3 polymerases apparently represent an ‘impaired’ version of DnaE1. Nonessential DnaE2 polymerases, typical for oxygen-using bacteria with large GC-rich genomes, have a number of features in common with DnaE3 polymerases. The analysis of polymerase distribution in genomes revealed three major combinations: DnaE1 either alone or accompanied by one or more DnaE2s, PolC + DnaE3 and PolC + DnaE1. The first two combinations are present in Escherichia coli and Bacillus subtilis, respectively. The third one (PolC + DnaE1), found in Clostridia, represents a novel, so far experimentally uncharacterized, set. PMID:24106089

  2. Repair-mediated duplication by capture of proximal chromosomal DNA has shaped vertebrate genome evolution.

    PubMed

    Pace, John K; Sen, Shurjo K; Batzer, Mark A; Feschotte, Cédric

    2009-05-01

    DNA double-strand breaks (DSBs) are a common form of cellular damage that can lead to cell death if not repaired promptly. Experimental systems have shown that DSB repair in eukaryotic cells is often imperfect and may result in the insertion of extra chromosomal DNA or the duplication of existing DNA at the breakpoint. These events are thought to be a source of genomic instability and human diseases, but it is unclear whether they have contributed significantly to genome evolution. Here we developed an innovative computational pipeline that takes advantage of the repetitive structure of genomes to detect repair-mediated duplication events (RDs) that occurred in the germline and created insertions of at least 50 bp of genomic DNA. Using this pipeline we identified over 1,000 probable RDs in the human genome. Of these, 824 were intra-chromosomal, closely linked duplications of up to 619 bp bearing the hallmarks of the synthesis-dependent strand-annealing repair pathway. This mechanism has duplicated hundreds of sequences predicted to be functional in the human genome, including exons, UTRs, intron splice sites and transcription factor binding sites. Dating of the duplication events using comparative genomics and experimental validation revealed that the mechanism has operated continuously but with decreasing intensity throughout primate evolution. The mechanism has produced species-specific duplications in all primate species surveyed and is contributing to genomic variation among humans. Finally, we show that RDs have also occurred, albeit at a lower frequency, in non-primate mammals and other vertebrates, indicating that this mechanism has been an important force shaping vertebrate genome evolution.

  3. Specific detection of Mycobacterium sp. genomic DNA using dual labeled gold nanoparticle based electrochemical biosensor.

    PubMed

    Thiruppathiraja, Chinnasamy; Kamatchiammal, Senthilkumar; Adaikkappan, Periyakaruppan; Santhosh, Devakirubakaran Jayakar; Alagar, Muthukaruppan

    2011-10-01

    The present study was aimed at the development and evaluation of a DNA electrochemical biosensor for Mycobacterium sp. genomic DNA detection in a clinical specimen using a signal amplifier as dual-labeled AuNPs. The DNA electrochemical biosensors were fabricated using a sandwich detection strategy involving two kinds of DNA probes specific to Mycobacterium sp. genomic DNA. The probes of enzyme ALP and the detector probe both conjugated on the AuNPs and subsequently hybridized with target DNA immobilized in a SAM/ITO electrode followed by characterization with CV, EIS, and DPV analysis using the electroactive species para-nitrophenol generated by ALP through hydrolysis of para-nitrophenol phosphate. The effect of enhanced sensitivity was obtained due to the AuNPs carrying numerous ALPs per hybridization and a detection limit of 1.25 ng/ml genomic DNA was determined under optimized conditions. The dual-labeled AuNP-facilitated electrochemical sensor was also evaluated by clinical sputum samples, showing a higher sensitivity and specificity and the outcome was in agreement with the PCR analysis. In conclusion, the developed electrochemical sensor demonstrated unique sensitivity and specificity for both genomic DNA and sputum samples and can be employed as a regular diagnostics tool for Mycobacterium sp. monitoring in clinical samples. Copyright © 2011 Elsevier Inc. All rights reserved.

  4. The Second Subunit of DNA Polymerase Delta Is Required for Genomic Stability and Epigenetic Regulation1[OPEN

    PubMed Central

    Cheng, Jinkui; Lai, Jinsheng; Gong, Zhizhong

    2016-01-01

    DNA polymerase δ plays crucial roles in DNA repair and replication as well as maintaining genomic stability. However, the function of POLD2, the second small subunit of DNA polymerase δ, has not been characterized yet in Arabidopsis (Arabidopsis thaliana). During a genetic screen for release of transcriptional gene silencing, we identified a mutation in POLD2. Whole-genome bisulfite sequencing indicated that POLD2 is not involved in the regulation of DNA methylation. POLD2 genetically interacts with Ataxia Telangiectasia-mutated and Rad3-related and DNA polymerase α. The pold2-1 mutant exhibits genomic instability with a high frequency of homologous recombination. It also exhibits hypersensitivity to DNA-damaging reagents and short telomere length. Whole-genome chromatin immunoprecipitation sequencing and RNA sequencing analyses suggest that pold2-1 changes H3K27me3 and H3K4me3 modifications, and these changes are correlated with the gene expression levels. Our study suggests that POLD2 is required for maintaining genome integrity and properly establishing the epigenetic markers during DNA replication to modulate gene expression. PMID:27208288

  5. Draft versus finished sequence data for DNA and protein diagnostic signature development

    PubMed Central

    Gardner, Shea N.; Lam, Marisa W.; Smith, Jason R.; Torres, Clinton L.; Slezak, Tom R.

    2005-01-01

    Sequencing pathogen genomes is costly, demanding careful allocation of limited sequencing resources. We built a computational Sequencing Analysis Pipeline (SAP) to guide decisions regarding the amount of genomic sequencing necessary to develop high-quality diagnostic DNA and protein signatures. SAP uses simulations to estimate the number of target genomes and close phylogenetic relatives (near neighbors or NNs) to sequence. We use SAP to assess whether draft data are sufficient or finished sequencing is required using Marburg and variola virus sequences. Simulations indicate that intermediate to high-quality draft with error rates of 10−3–10−5 (∼8× coverage) of target organisms is suitable for DNA signature prediction. Low-quality draft with error rates of ∼1% (3× to 6× coverage) of target isolates is inadequate for DNA signature prediction, although low-quality draft of NNs is sufficient, as long as the target genomes are of high quality. For protein signature prediction, sequencing errors in target genomes substantially reduce the detection of amino acid sequence conservation, even if the draft is of high quality. In summary, high-quality draft of target and low-quality draft of NNs appears to be a cost-effective investment for DNA signature prediction, but may lead to underestimation of predicted protein signatures. PMID:16243783

  6. Isolation of genomic DNA from defatted oil seed residue of rapeseed (Brassica napus).

    PubMed

    Sadia, M; Rabbani, M A; Hameed, S; Pearce, S R; Malik, S A

    2011-02-08

    A simple protocol for obtaining pure, restrictable and amplifiable megabase genomic DNA from oil-free seed residue of Brassica napus, an important oil seed plant, has been developed. Oil from the dry seeds was completely recovered in an organic solvent and quantified gravimetrically followed by processing of the residual biomass (defatted seed residue) for genomic DNA isolation. The isolated DNA can be cut by a range of restriction enzymes. The method enables simultaneous isolation and recovery of lipids and genomic DNA from the same test sample, thus allowing two independent analyses from a single sample. Multiple micro-scale oil extraction from the commercial seeds gave approximately 39% oil, which is close to the usual oil recovery from standard oil seed. Most of the amplified fragments were scored in the range of 2.5 to 0.5 kb, best suited for scoring as molecular diagnostics.

  7. Evaluating droplet digital PCR for the quantification of human genomic DNA: converting copies per nanoliter to nanograms nuclear DNA per microliter.

    PubMed

    Duewer, David L; Kline, Margaret C; Romsos, Erica L; Toman, Blaza

    2018-05-01

    The highly multiplexed polymerase chain reaction (PCR) assays used for forensic human identification perform best when used with an accurately determined quantity of input DNA. To help ensure the reliable performance of these assays, we are developing a certified reference material (CRM) for calibrating human genomic DNA working standards. To enable sharing information over time and place, CRMs must provide accurate and stable values that are metrologically traceable to a common reference. We have shown that droplet digital PCR (ddPCR) limiting dilution end-point measurements of the concentration of DNA copies per volume of sample can be traceably linked to the International System of Units (SI). Unlike values assigned using conventional relationships between ultraviolet absorbance and DNA mass concentration, entity-based ddPCR measurements are expected to be stable over time. However, the forensic community expects DNA quantity to be stated in terms of mass concentration rather than entity concentration. The transformation can be accomplished given SI-traceable values and uncertainties for the number of nucleotide bases per human haploid genome equivalent (HHGE) and the average molar mass of a nucleotide monomer in the DNA polymer. This report presents the considerations required to establish the metrological traceability of ddPCR-based mass concentration estimates of human nuclear DNA. Graphical abstract The roots of metrological traceability for human nuclear DNA mass concentration results. Values for the factors in blue must be established experimentally. Values for the factors in red have been established from authoritative source materials. HHGE stands for "haploid human genome equivalent"; there are two HHGE per diploid human genome.

  8. Adenovirus Core Protein VII Downregulates the DNA Damage Response on the Host Genome

    PubMed Central

    Avgousti, Daphne C.; Della Fera, Ashley N.; Otter, Clayton J.; Herrmann, Christin; Pancholi, Neha J.

    2017-01-01

    ABSTRACT Viral manipulation of cellular proteins allows viruses to suppress host defenses and generate infectious progeny. Due to the linear double-stranded DNA nature of the adenovirus genome, the cellular DNA damage response (DDR) is considered a barrier to successful infection. The adenovirus genome is packaged with protein VII, a virally encoded histone-like core protein that is suggested to protect incoming viral genomes from detection by the cellular DNA damage machinery. We showed that protein VII localizes to host chromatin during infection, leading us to hypothesize that protein VII may affect DNA damage responses on the cellular genome. Here we show that protein VII at cellular chromatin results in a significant decrease in accumulation of phosphorylated H2AX (γH2AX) following irradiation, indicating that protein VII inhibits DDR signaling. The oncoprotein SET was recently suggested to modulate the DDR by affecting access of repair proteins to chromatin. Since protein VII binds SET, we investigated a role for SET in DDR inhibition by protein VII. We show that knockdown of SET partially rescues the protein VII-induced decrease in γH2AX accumulation on the host genome, suggesting that SET is required for inhibition. Finally, we show that knockdown of SET also allows ATM to localize to incoming viral genomes bound by protein VII during infection with a mutant lacking early region E4. Together, our data suggest that the protein VII-SET interaction contributes to DDR evasion by adenovirus. Our results provide an additional example of a strategy used by adenovirus to abrogate the host DDR and show how viruses can modify cellular processes through manipulation of host chromatin. IMPORTANCE The DNA damage response (DDR) is a cellular network that is crucial for maintaining genome integrity. DNA viruses replicating in the nucleus challenge the resident genome and must overcome cellular responses, including the DDR. Adenoviruses are prevalent human pathogens that

  9. Comparative Genomics of DNA Recombination and Repair in Cyanobacteria: Biotechnological Implications

    PubMed Central

    Cassier-Chauvat, Corinne; Veaudor, Théo; Chauvat, Franck

    2016-01-01

    Cyanobacteria are fascinating photosynthetic prokaryotes that are regarded as the ancestors of the plant chloroplast; the purveyors of oxygen and biomass for the food chain; and promising cell factories for an environmentally friendly production of chemicals. In colonizing most waters and soils of our planet, cyanobacteria are inevitably challenged by environmental stresses that generate DNA damages. Furthermore, many strains engineered for biotechnological purposes can use DNA recombination to stop synthesizing the biotechnological product. Hence, it is important to study DNA recombination and repair in cyanobacteria for both basic and applied research. This review reports what is known in a few widely studied model cyanobacteria and what can be inferred by mining the sequenced genomes of morphologically and physiologically diverse strains. We show that cyanobacteria possess many E. coli-like DNA recombination and repair genes, and possibly other genes not yet identified. E. coli-homolog genes are unevenly distributed in cyanobacteria, in agreement with their wide genome diversity. Many genes are extremely well conserved in cyanobacteria (mutMS, radA, recA, recFO, recG, recN, ruvABC, ssb, and uvrABCD), even in small genomes, suggesting that they encode the core DNA repair process. In addition to these core genes, the marine Prochlorococcus and Synechococcus strains harbor recBCD (DNA recombination), umuCD (mutational DNA replication), as well as the key SOS genes lexA (regulation of the SOS system) and sulA (postponing of cell division until completion of DNA reparation). Hence, these strains could possess an E. coli-type SOS system. In contrast, several cyanobacteria endowed with larger genomes lack typical SOS genes. For examples, the two studied Gloeobacter strains lack alkB, lexA, and sulA; and Synechococcus PCC7942 has neither lexA nor recCD. Furthermore, the Synechocystis PCC6803 lexA product does not regulate DNA repair genes. Collectively, these findings

  10. Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences

    PubMed Central

    Sheinman, Michael; Ramisch, Anna; Massip, Florian; Arndt, Peter F.

    2016-01-01

    Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes. PMID:27488939

  11. In trans paired nicking triggers seamless genome editing without double-stranded DNA cutting.

    PubMed

    Chen, Xiaoyu; Janssen, Josephine M; Liu, Jin; Maggio, Ignazio; 't Jong, Anke E J; Mikkers, Harald M M; Gonçalves, Manuel A F V

    2017-09-22

    Precise genome editing involves homologous recombination between donor DNA and chromosomal sequences subjected to double-stranded DNA breaks made by programmable nucleases. Ideally, genome editing should be efficient, specific, and accurate. However, besides constituting potential translocation-initiating lesions, double-stranded DNA breaks (targeted or otherwise) are mostly repaired through unpredictable and mutagenic non-homologous recombination processes. Here, we report that the coordinated formation of paired single-stranded DNA breaks, or nicks, at donor plasmids and chromosomal target sites by RNA-guided nucleases based on CRISPR-Cas9 components, triggers seamless homology-directed gene targeting of large genetic payloads in human cells, including pluripotent stem cells. Importantly, in addition to significantly reducing the mutagenicity of the genome modification procedure, this in trans paired nicking strategy achieves multiplexed, single-step, gene targeting, and yields higher frequencies of accurately edited cells when compared to the standard double-stranded DNA break-dependent approach.CRISPR-Cas9-based gene editing involves double-strand breaks at target sequences, which are often repaired by mutagenic non-homologous end-joining. Here the authors use Cas9 nickases to generate coordinated single-strand breaks in donor and target DNA for precise homology-directed gene editing.

  12. Direct extraction of genomic DNA from maize with aqueous ionic liquid buffer systems for applications in genetically modified organisms analysis.

    PubMed

    Gonzalez García, Eric; Ressmann, Anna K; Gaertner, Peter; Zirbs, Ronald; Mach, Robert L; Krska, Rudolf; Bica, Katharina; Brunner, Kurt

    2014-12-01

    To date, the extraction of genomic DNA is considered a bottleneck in the process of genetically modified organisms (GMOs) detection. Conventional DNA isolation methods are associated with long extraction times and multiple pipetting and centrifugation steps, which makes the entire procedure not only tedious and complicated but also prone to sample cross-contamination. In recent times, ionic liquids have emerged as innovative solvents for biomass processing, due to their outstanding properties for dissolution of biomass and biopolymers. In this study, a novel, easily applicable, and time-efficient method for the direct extraction of genomic DNA from biomass based on aqueous-ionic liquid solutions was developed. The straightforward protocol relies on extraction of maize in a 10 % solution of ionic liquids in aqueous phosphate buffer for 5 min at room temperature, followed by a denaturation step at 95 °C for 10 min and a simple filtration to remove residual biopolymers. A set of 22 ionic liquids was tested in a buffer system and 1-ethyl-3-methylimidazolium dimethylphosphate, as well as the environmentally benign choline formate, were identified as ideal candidates. With this strategy, the quality of the genomic DNA extracted was significantly improved and the extraction protocol was notably simplified compared with a well-established method.

  13. DNA methylation at hepatitis B viral integrants is associated with methylation at flanking human genomic sequences

    PubMed Central

    Watanabe, Yoshiyuki; Yamamoto, Hiroyuki; Oikawa, Ritsuko; Toyota, Minoru; Yamamoto, Masakazu; Kokudo, Norihiro; Tanaka, Shinji; Arii, Shigeki; Yotsuyanagi, Hiroshi; Koike, Kazuhiko; Itoh, Fumio

    2015-01-01

    Integration of DNA viruses into the human genome plays an important role in various types of tumors, including hepatitis B virus (HBV)–related hepatocellular carcinoma. However, the molecular details and clinical impact of HBV integration on either human or HBV epigenomes are unknown. Here, we show that methylation of the integrated HBV DNA is related to the methylation status of the flanking human genome. We developed a next-generation sequencing-based method for structural methylation analysis of integrated viral genomes (denoted G-NaVI). This method is a novel approach that enables enrichment of viral fragments for sequencing using unique baits based on the sequence of the HBV genome. We detected integrated HBV sequences in the genome of the PLC/PRF/5 cell line and found variable levels of methylation within the integrated HBV genomes. Allele-specific methylation analysis revealed that the HBV genome often became significantly methylated when integrated into highly methylated host sites. After integration into unmethylated human genome regions such as promoters, however, the HBV DNA remains unmethylated and may eventually play an important role in tumorigenesis. The observed dynamic changes in DNA methylation of the host and viral genomes may functionally affect the biological behavior of HBV. These findings may impact public health given that millions of people worldwide are carriers of HBV. We also believe our assay will be a powerful tool to increase our understanding of the various types of DNA virus-associated tumorigenesis. PMID:25653310

  14. Extraction of High Quality DNA from Seized Moroccan Cannabis Resin (Hashish)

    PubMed Central

    El Alaoui, Moulay Abdelaziz; Melloul, Marouane; Alaoui Amine, Sanaâ; Stambouli, Hamid; El Bouri, Aziz; Soulaymani, Abdelmajid; El Fahime, Elmostafa

    2013-01-01

    The extraction and purification of nucleic acids is the first step in most molecular biology analysis techniques. The objective of this work is to obtain highly purified nucleic acids derived from Cannabis sativa resin seizure in order to conduct a DNA typing method for the individualization of cannabis resin samples. To obtain highly purified nucleic acids from cannabis resin (Hashish) free from contaminants that cause inhibition of PCR reaction, we have tested two protocols: the CTAB protocol of Wagner and a CTAB protocol described by Somma (2004) adapted for difficult matrix. We obtained high quality genomic DNA from 8 cannabis resin seizures using the adapted protocol. DNA extracted by the Wagner CTAB protocol failed to give polymerase chain reaction (PCR) amplification of tetrahydrocannabinolic acid (THCA) synthase coding gene. However, the extracted DNA by the second protocol permits amplification of THCA synthase coding gene using different sets of primers as assessed by PCR. We describe here for the first time the possibility of DNA extraction from (Hashish) resin derived from Cannabis sativa. This allows the use of DNA molecular tests under special forensic circumstances. PMID:24124454

  15. Genomically Encoded Analog Memory with Precise In vivo DNA Writing in Living Cell Populations

    PubMed Central

    Farzadfard, Fahim; Lu, Timothy K.

    2014-01-01

    Cellular memory is crucial to many natural biological processes and for sophisticated synthetic-biology applications. Existing cellular memories rely on epigenetic switches or recombinases, which are limited in scalability and recording capacity. Here, we use the DNA of living cell populations as genomic ‘tape recorders’ for the analog and distributed recording of long-term event histories. We describe a platform for generating single-stranded DNA (ssDNA) in vivo in response to arbitrary transcriptional signals. When co-expressed with a recombinase, these intracellularly expressed ssDNAs target specific genomic DNA addresses, resulting in precise mutations that accumulate in cell populations as a function of the magnitude and duration of the inputs. This platform could enable long-term cellular recorders for environmental and biomedical applications, biological state machines, and enhanced genome engineering strategies. PMID:25395541

  16. Quantifying quality in DNA self-assembly

    PubMed Central

    Wagenbauer, Klaus F.; Wachauf, Christian H.; Dietz, Hendrik

    2014-01-01

    Molecular self-assembly with DNA is an attractive route for building nanoscale devices. The development of sophisticated and precise objects with this technique requires detailed experimental feedback on the structure and composition of assembled objects. Here we report a sensitive assay for the quality of assembly. The method relies on measuring the content of unpaired DNA bases in self-assembled DNA objects using a fluorescent de-Bruijn probe for three-base ‘codons’, which enables a comparison with the designed content of unpaired DNA. We use the assay to measure the quality of assembly of several multilayer DNA origami objects and illustrate the use of the assay for the rational refinement of assembly protocols. Our data suggests that large and complex objects like multilayer DNA origami can be made with high strand integration quality up to 99%. Beyond DNA nanotechnology, we speculate that the ability to discriminate unpaired from paired nucleic acids in the same macromolecule may also be useful for analysing cellular nucleic acids. PMID:24751596

  17. Concerted copy number variation balances ribosomal DNA dosage in human and mouse genomes

    PubMed Central

    Gibbons, John G.; Branco, Alan T.; Godinho, Susana A.; Yu, Shoukai; Lemos, Bernardo

    2015-01-01

    Tandemly repeated ribosomal DNA (rDNA) arrays are among the most evolutionary dynamic loci of eukaryotic genomes. The loci code for essential cellular components, yet exhibit extensive copy number (CN) variation within and between species. CN might be partly determined by the requirement of dosage balance between the 5S and 45S rDNA arrays. The arrays are nonhomologous, physically unlinked in mammals, and encode functionally interdependent RNA components of the ribosome. Here we show that the 5S and 45S rDNA arrays exhibit concerted CN variation (cCNV). Despite 5S and 45S rDNA elements residing on different chromosomes and lacking sequence similarity, cCNV between these loci is strong, evolutionarily conserved in humans and mice, and manifested across individual genotypes in natural populations and pedigrees. Finally, we observe that bisphenol A induces rapid and parallel modulation of 5S and 45S rDNA CN. Our observations reveal a novel mode of genome variation, indicate that natural selection contributed to the evolution and conservation of cCNV, and support the hypothesis that 5S CN is partly determined by the requirement of dosage balance with the 45S rDNA array. We suggest that human disease variation might be traced to disrupted rDNA dosage balance in the genome. PMID:25583482

  18. Comprehensive evaluation of genome-wide 5-hydroxymethylcytosine profiling approaches in human DNA.

    PubMed

    Skvortsova, Ksenia; Zotenko, Elena; Luu, Phuc-Loi; Gould, Cathryn M; Nair, Shalima S; Clark, Susan J; Stirzaker, Clare

    2017-01-01

    The discovery that 5-methylcytosine (5mC) can be oxidized to 5-hydroxymethylcytosine (5hmC) by the ten-eleven translocation (TET) proteins has prompted wide interest in the potential role of 5hmC in reshaping the mammalian DNA methylation landscape. The gold-standard bisulphite conversion technologies to study DNA methylation do not distinguish between 5mC and 5hmC. However, new approaches to mapping 5hmC genome-wide have advanced rapidly, although it is unclear how the different methods compare in accurately calling 5hmC. In this study, we provide a comparative analysis on brain DNA using three 5hmC genome-wide approaches, namely whole-genome bisulphite/oxidative bisulphite sequencing (WG Bis/OxBis-seq), Infinium HumanMethylation450 BeadChip arrays coupled with oxidative bisulphite (HM450K Bis/OxBis) and antibody-based immunoprecipitation and sequencing of hydroxymethylated DNA (hMeDIP-seq). We also perform loci-specific TET-assisted bisulphite sequencing (TAB-seq) for validation of candidate regions. We show that whole-genome single-base resolution approaches are advantaged in providing precise 5hmC values but require high sequencing depth to accurately measure 5hmC, as this modification is commonly in low abundance in mammalian cells. HM450K arrays coupled with oxidative bisulphite provide a cost-effective representation of 5hmC distribution, at CpG sites with 5hmC levels >~10%. However, 5hmC analysis is restricted to the genomic location of the probes, which is an important consideration as 5hmC modification is commonly enriched at enhancer elements. Finally, we show that the widely used hMeDIP-seq method provides an efficient genome-wide profile of 5hmC and shows high correlation with WG Bis/OxBis-seq 5hmC distribution in brain DNA. However, in cell line DNA with low levels of 5hmC, hMeDIP-seq-enriched regions are not detected by WG Bis/OxBis or HM450K, either suggesting misinterpretation of 5hmC calls by hMeDIP or lack of sensitivity of the latter methods. We

  19. Identification of DNA Methyltransferase Genes in Human Pathogenic Bacteria by Comparative Genomics.

    PubMed

    Brambila-Tapia, Aniel Jessica Leticia; Poot-Hernández, Augusto Cesar; Perez-Rueda, Ernesto; Rodríguez-Vázquez, Katya

    2016-06-01

    DNA methylation plays an important role in gene expression and virulence in some pathogenic bacteria. In this report, we describe DNA methyltransferases (MTases) present in human pathogenic bacteria and compared them with related species, which are not pathogenic or less pathogenic, based in comparative genomics. We performed a search in the KEGG database of the KEGG database orthology groups associated with adenine and cytosine DNA MTase activities (EC: 2.1.1.37, EC: 2.1.1.113 and EC: 2.1.1.72) in 37 human pathogenic species and 18 non/less pathogenic relatives and performed comparisons of the number of these MTases sequences according to their genome size, the DNA MTase type and with their non-less pathogenic relatives. We observed that Helicobacter pylori and Neisseria spp. presented the highest number of MTases while ten different species did not present a predicted DNA MTase. We also detected a significant increase of adenine MTases over cytosine MTases (2.19 vs. 1.06, respectively, p < 0.001). Adenine MTases were the only MTases associated with restriction modification systems and DNA MTases associated with type I restriction modification systems were more numerous than those associated with type III restriction modification systems (0.84 vs. 0.17, p < 0.001); additionally, there was no correlation with the genome size and the total number of DNA MTases, indicating that the number of DNA MTases is related to the particular evolution and lifestyle of specific species, regulating the expression of virulence genes in some pathogenic bacteria.

  20. Evidence for horizontal transfer of mitochondrial DNA to the plastid genome in a bamboo genus.

    PubMed

    Ma, Peng-Fei; Zhang, Yu-Xiao; Guo, Zhen-Hua; Li, De-Zhu

    2015-06-23

    In flowering plants, three genomes (nuclear, mitochondrial, and plastid) coexist and intracellular horizontal transfer of DNA is prevalent, especially from the plastid to the mitochondrion genome. However, the plastid genomes are generally conserved in evolution and have long been considered immune to foreign DNA. Recently, the opposite direction of DNA transfer from the mitochondrial to the plastid genome has been reported in two eudicot lineages. Here we sequenced 6 plastid genomes of bamboos, three of which are neotropical woody species and three are herbaceous ones. Several unusual features were found, including the duplication of trnT-GGU and loss of one copy of rps19 due to contraction of inverted repeats (IRs). The most intriguing was the ~2.7 kb insertion in the plastid IR regions in the three herbaceous bamboos. Furthermore, the insertion was documented to be horizontally transferred from the mitochondrial to the plastid genome. Our study provided evidence of the mitochondrial-to-plastid DNA transfer in the monocots, demonstrating again that this rare event does occur in other angiosperm lineages. However, the mechanism underlying the transfer remains obscure, and more studies in other plants may elucidate it in the future.

  1. Translocation and deletion breakpoints in cancer genomes are associated with potential non-B DNA-forming sequences.

    PubMed

    Bacolla, Albino; Tainer, John A; Vasquez, Karen M; Cooper, David N

    2016-07-08

    Gross chromosomal rearrangements (including translocations, deletions, insertions and duplications) are a hallmark of cancer genomes and often create oncogenic fusion genes. An obligate step in the generation of such gross rearrangements is the formation of DNA double-strand breaks (DSBs). Since the genomic distribution of rearrangement breakpoints is non-random, intrinsic cellular factors may predispose certain genomic regions to breakage. Notably, certain DNA sequences with the potential to fold into secondary structures [potential non-B DNA structures (PONDS); e.g. triplexes, quadruplexes, hairpin/cruciforms, Z-DNA and single-stranded looped-out structures with implications in DNA replication and transcription] can stimulate the formation of DNA DSBs. Here, we tested the postulate that these DNA sequences might be found at, or in close proximity to, rearrangement breakpoints. By analyzing the distribution of PONDS-forming sequences within ±500 bases of 19 947 translocation and 46 365 sequence-characterized deletion breakpoints in cancer genomes, we find significant association between PONDS-forming repeats and cancer breakpoints. Specifically, (AT)n, (GAA)n and (GAAA)n constitute the most frequent repeats at translocation breakpoints, whereas A-tracts occur preferentially at deletion breakpoints. Translocation breakpoints near PONDS-forming repeats also recur in different individuals and patient tumor samples. Hence, PONDS-forming sequences represent an intrinsic risk factor for genomic rearrangements in cancer genomes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health

    PubMed Central

    Martin, William F.

    2017-01-01

    Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. PMID:28444372

  3. Pulling out the 1%: Whole-Genome Capture for the Targeted Enrichment of Ancient DNA Sequencing Libraries

    PubMed Central

    Carpenter, Meredith L.; Buenrostro, Jason D.; Valdiosera, Cristina; Schroeder, Hannes; Allentoft, Morten E.; Sikora, Martin; Rasmussen, Morten; Gravel, Simon; Guillén, Sonia; Nekhrizov, Georgi; Leshtakov, Krasimir; Dimitrova, Diana; Theodossiev, Nikola; Pettener, Davide; Luiselli, Donata; Sandoval, Karla; Moreno-Estrada, Andrés; Li, Yingrui; Wang, Jun; Gilbert, M. Thomas P.; Willerslev, Eske; Greenleaf, William J.; Bustamante, Carlos D.

    2013-01-01

    Most ancient specimens contain very low levels of endogenous DNA, precluding the shotgun sequencing of many interesting samples because of cost. Ancient DNA (aDNA) libraries often contain <1% endogenous DNA, with the majority of sequencing capacity taken up by environmental DNA. Here we present a capture-based method for enriching the endogenous component of aDNA sequencing libraries. By using biotinylated RNA baits transcribed from genomic DNA libraries, we are able to capture DNA fragments from across the human genome. We demonstrate this method on libraries created from four Iron Age and Bronze Age human teeth from Bulgaria, as well as bone samples from seven Peruvian mummies and a Bronze Age hair sample from Denmark. Prior to capture, shotgun sequencing of these libraries yielded an average of 1.2% of reads mapping to the human genome (including duplicates). After capture, this fraction increased substantially, with up to 59% of reads mapped to human and enrichment ranging from 6- to 159-fold. Furthermore, we maintained coverage of the majority of regions sequenced in the precapture library. Intersection with the 1000 Genomes Project reference panel yielded an average of 50,723 SNPs (range 3,062–147,243) for the postcapture libraries sequenced with 1 million reads, compared with 13,280 SNPs (range 217–73,266) for the precapture libraries, increasing resolution in population genetic analyses. Our whole-genome capture approach makes it less costly to sequence aDNA from specimens containing very low levels of endogenous DNA, enabling the analysis of larger numbers of samples. PMID:24568772

  4. Methods to Monitor DNA Repair Defects and Genomic Instability in the Context of a Disrupted Nuclear Lamina.

    PubMed

    Gonzalo, Susana; Kreienkamp, Ray

    2016-01-01

    The organization of the genome within the nuclear space is viewed as an additional level of regulation of genome function, as well as a means to ensure genome integrity. Structural proteins associated with the nuclear envelope, in particular lamins (A- and B-type) and lamin-associated proteins, play an important role in genome organization. Interestingly, there is a whole body of evidence that links disruptions of the nuclear lamina with DNA repair defects and genomic instability. Here, we describe a few standard techniques that have been successfully utilized to identify mechanisms behind DNA repair defects and genomic instability in cells with an altered nuclear lamina. In particular, we describe protocols to monitor changes in the expression of DNA repair factors (Western blot) and their recruitment to sites of DNA damage (immunofluorescence); kinetics of DNA double-strand break repair after ionizing radiation (neutral comet assays); frequency of chromosomal aberrations (FISH, fluorescence in situ hybridization); and alterations in telomere homeostasis (Quantitative-FISH). These techniques have allowed us to shed some light onto molecular mechanisms by which alterations in A-type lamins induce genomic instability, which could contribute to the pathophysiology of aging and aging-related diseases.

  5. Evaluation of plasmid and genomic DNA calibrants used for the quantification of genetically modified organisms.

    PubMed

    Caprioara-Buda, M; Meyer, W; Jeynov, B; Corbisier, P; Trapmann, S; Emons, H

    2012-07-01

    The reliable quantification of genetically modified organisms (GMOs) by real-time PCR requires, besides thoroughly validated quantitative detection methods, sustainable calibration systems. The latter establishes the anchor points for the measured value and the measurement unit, respectively. In this paper, the suitability of two types of DNA calibrants, i.e. plasmid DNA and genomic DNA extracted from plant leaves, for the certification of the GMO content in reference materials as copy number ratio between two targeted DNA sequences was investigated. The PCR efficiencies and coefficients of determination of the calibration curves as well as the measured copy number ratios for three powder certified reference materials (CRMs), namely ERM-BF415e (NK603 maize), ERM-BF425c (356043 soya), and ERM-BF427c (98140 maize), originally certified for their mass fraction of GMO, were compared for both types of calibrants. In all three systems investigated, the PCR efficiencies of plasmid DNA were slightly closer to the PCR efficiencies observed for the genomic DNA extracted from seed powders rather than those of the genomic DNA extracted from leaves. Although the mean DNA copy number ratios for each CRM overlapped within their uncertainties, the DNA copy number ratios were significantly different using the two types of calibrants. Based on these observations, both plasmid and leaf genomic DNA calibrants would be technically suitable as anchor points for the calibration of the real-time PCR methods applied in this study. However, the most suitable approach to establish a sustainable traceability chain is to fix a reference system based on plasmid DNA.

  6. DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation.

    PubMed

    Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H; Proukakis, Christos

    2017-01-01

    Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array "waves", and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance.

  7. The effect of input DNA copy number on genotype call and characterising SNP markers in the humpback whale genome using a nanofluidic array.

    PubMed

    Bhat, Somanath; Polanowski, Andrea M; Double, Mike C; Jarman, Simon N; Emslie, Kerry R

    2012-01-01

    Recent advances in nanofluidic technologies have enabled the use of Integrated Fluidic Circuits (IFCs) for high-throughput Single Nucleotide Polymorphism (SNP) genotyping (GT). In this study, we implemented and validated a relatively low cost nanofluidic system for SNP-GT with and without Specific Target Amplification (STA). As proof of principle, we first validated the effect of input DNA copy number on genotype call rate using well characterised, digital PCR (dPCR) quantified human genomic DNA samples and then implemented the validated method to genotype 45 SNPs in the humpback whale, Megaptera novaeangliae, nuclear genome. When STA was not incorporated, for a homozygous human DNA sample, reaction chambers containing, on average 9 to 97 copies, showed 100% call rate and accuracy. Below 9 copies, the call rate decreased, and at one copy it was 40%. For a heterozygous human DNA sample, the call rate decreased from 100% to 21% when predicted copies per reaction chamber decreased from 38 copies to one copy. The tightness of genotype clusters on a scatter plot also decreased. In contrast, when the same samples were subjected to STA prior to genotyping a call rate and a call accuracy of 100% were achieved. Our results demonstrate that low input DNA copy number affects the quality of data generated, in particular for a heterozygous sample. Similar to human genomic DNA, a call rate and a call accuracy of 100% was achieved with whale genomic DNA samples following multiplex STA using either 15 or 45 SNP-GT assays. These calls were 100% concordant with their true genotypes determined by an independent method, suggesting that the nanofluidic system is a reliable platform for executing call rates with high accuracy and concordance in genomic sequences derived from biological tissue.

  8. Genome measures used for quality control are dependent on gene function and ancestry.

    PubMed

    Wang, Jing; Raskin, Leon; Samuels, David C; Shyr, Yu; Guo, Yan

    2015-02-01

    The transition/transversion (Ti/Tv) ratio and heterozygous/nonreference-homozygous (het/nonref-hom) ratio have been commonly computed in genetic studies as a quality control (QC) measurement. Additionally, these two ratios are helpful in our understanding of the patterns of DNA sequence evolution. To thoroughly understand these two genomic measures, we performed a study using 1000 Genomes Project (1000G) released genotype data (N=1092). An additional two datasets (N=581 and N=6) were used to validate our findings from the 1000G dataset. We compared the two ratios among continental ancestry, genome regions and gene functionality. We found that the Ti/Tv ratio can be used as a quality indicator for single nucleotide polymorphisms inferred from high-throughput sequencing data. The Ti/Tv ratio varies greatly by genome region and functionality, but not by ancestry. The het/nonref-hom ratio varies greatly by ancestry, but not by genome regions and functionality. Furthermore, extreme guanine + cytosine content (either high or low) is negatively associated with the Ti/Tv ratio magnitude. Thus, when performing QC assessment using these two measures, care must be taken to apply the correct thresholds based on ancestry and genome region. Failure to take these considerations into account at the QC stage will bias any following analysis. yan.guo@vanderbilt.edu Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  9. Ribosomal DNA sequence heterogeneity reflects intraspecies phylogenies and predicts genome structure in two contrasting yeast species.

    PubMed

    West, Claire; James, Stephen A; Davey, Robert P; Dicks, Jo; Roberts, Ian N

    2014-07-01

    The ribosomal RNA encapsulates a wealth of evolutionary information, including genetic variation that can be used to discriminate between organisms at a wide range of taxonomic levels. For example, the prokaryotic 16S rDNA sequence is very widely used both in phylogenetic studies and as a marker in metagenomic surveys and the internal transcribed spacer region, frequently used in plant phylogenetics, is now recognized as a fungal DNA barcode. However, this widespread use does not escape criticism, principally due to issues such as difficulties in classification of paralogous versus orthologous rDNA units and intragenomic variation, both of which may be significant barriers to accurate phylogenetic inference. We recently analyzed data sets from the Saccharomyces Genome Resequencing Project, characterizing rDNA sequence variation within multiple strains of the baker's yeast Saccharomyces cerevisiae and its nearest wild relative Saccharomyces paradoxus in unprecedented detail. Notably, both species possess single locus rDNA systems. Here, we use these new variation datasets to assess whether a more detailed characterization of the rDNA locus can alleviate the second of these phylogenetic issues, sequence heterogeneity, while controlling for the first. We demonstrate that a strong phylogenetic signal exists within both datasets and illustrate how they can be used, with existing methodology, to estimate intraspecies phylogenies of yeast strains consistent with those derived from whole-genome approaches. We also describe the use of partial Single Nucleotide Polymorphisms, a type of sequence variation found only in repetitive genomic regions, in identifying key evolutionary features such as genome hybridization events and show their consistency with whole-genome Structure analyses. We conclude that our approach can transform rDNA sequence heterogeneity from a problem to a useful source of evolutionary information, enabling the estimation of highly accurate phylogenies of

  10. A ranking index for quality assessment of forensic DNA profiles forensic DNA profiles

    PubMed Central

    2010-01-01

    Background Assessment of DNA profile quality is vital in forensic DNA analysis, both in order to determine the evidentiary value of DNA results and to compare the performance of different DNA analysis protocols. Generally the quality assessment is performed through manual examination of the DNA profiles based on empirical knowledge, or by comparing the intensities (allelic peak heights) of the capillary electrophoresis electropherograms. Results We recently developed a ranking index for unbiased and quantitative quality assessment of forensic DNA profiles, the forensic DNA profile index (FI) (Hedman et al. Improved forensic DNA analysis through the use of alternative DNA polymerases and statistical modeling of DNA profiles, Biotechniques 47 (2009) 951-958). FI uses electropherogram data to combine the intensities of the allelic peaks with the balances within and between loci, using Principal Components Analysis. Here we present the construction of FI. We explain the mathematical and statistical methodologies used and present details about the applied data reduction method. Thereby we show how to adapt the ranking index for any Short Tandem Repeat-based forensic DNA typing system through validation against a manual grading scale and calibration against a specific set of DNA profiles. Conclusions The developed tool provides unbiased quality assessment of forensic DNA profiles. It can be applied for any DNA profiling system based on Short Tandem Repeat markers. Apart from crime related DNA analysis, FI can therefore be used as a quality tool in paternal or familial testing as well as in disaster victim identification. PMID:21062433

  11. Intrastrand triplex DNA repeats in bacteria: a source of genomic instability

    PubMed Central

    Holder, Isabelle T.; Wagner, Stefanie; Xiong, Peiwen; Sinn, Malte; Frickey, Tancred; Meyer, Axel; Hartig, Jörg S.

    2015-01-01

    Repetitive nucleic acid sequences are often prone to form secondary structures distinct from B-DNA. Prominent examples of such structures are DNA triplexes. We observed that certain intrastrand triplex motifs are highly conserved and abundant in prokaryotic genomes. A systematic search of 5246 different prokaryotic plasmids and genomes for intrastrand triplex motifs was conducted and the results summarized in the ITxF database available online at http://bioinformatics.uni-konstanz.de/utils/ITxF/. Next we investigated biophysical and biochemical properties of a particular G/C-rich triplex motif (TM) that occurs in many copies in more than 260 bacterial genomes by CD and nuclear magnetic resonance spectroscopy as well as in vivo footprinting techniques. A characterization of putative properties and functions of these unusually frequent nucleic acid motifs demonstrated that the occurrence of the TM is associated with a high degree of genomic instability. TM-containing genomic loci are significantly more rearranged among closely related Escherichia coli strains compared to control sites. In addition, we found very high frequencies of TM motifs in certain Enterobacteria and Cyanobacteria that were previously described as genetically highly diverse. In conclusion we link intrastrand triplex motifs with the induction of genomic instability. We speculate that the observed instability might be an adaptive feature of these genomes that creates variation for natural selection to act upon. PMID:26450966

  12. Nanopore DNA Sequencing and Genome Assembly on the International Space Station.

    PubMed

    Castro-Wallace, Sarah L; Chiu, Charles Y; John, Kristen K; Stahl, Sarah E; Rubins, Kathleen H; McIntyre, Alexa B R; Dworkin, Jason P; Lupisella, Mark L; Smith, David J; Botkin, Douglas J; Stephenson, Timothy A; Juul, Sissel; Turner, Daniel J; Izquierdo, Fernando; Federman, Scot; Stryke, Doug; Somasekar, Sneha; Alexander, Noah; Yu, Guixia; Mason, Christopher E; Burton, Aaron S

    2017-12-21

    We evaluated the performance of the MinION DNA sequencer in-flight on the International Space Station (ISS), and benchmarked its performance off-Earth against the MinION, Illumina MiSeq, and PacBio RS II sequencing platforms in terrestrial laboratories. Samples contained equimolar mixtures of genomic DNA from lambda bacteriophage, Escherichia coli (strain K12, MG1655) and Mus musculus (female BALB/c mouse). Nine sequencing runs were performed aboard the ISS over a 6-month period, yielding a total of 276,882 reads with no apparent decrease in performance over time. From sequence data collected aboard the ISS, we constructed directed assemblies of the ~4.6 Mb E. coli genome, ~48.5 kb lambda genome, and a representative M. musculus sequence (the ~16.3 kb mitochondrial genome), at 100%, 100%, and 96.7% consensus pairwise identity, respectively; de novo assembly of the E. coli genome from raw reads yielded a single contig comprising 99.9% of the genome at 98.6% consensus pairwise identity. Simulated real-time analyses of in-flight sequence data using an automated bioinformatic pipeline and laptop-based genomic assembly demonstrated the feasibility of sequencing analysis and microbial identification aboard the ISS. These findings illustrate the potential for sequencing applications including disease diagnosis, environmental monitoring, and elucidating the molecular basis for how organisms respond to spaceflight.

  13. Non-Enzymatic Detection of Bacterial Genomic DNA Using the Bio-Barcode Assay

    PubMed Central

    Hill, Haley D.; Vega, Rafael A.; Mirkin, Chad A.

    2011-01-01

    The detection of bacterial genomic DNA through a non-enzymatic nanomaterials based amplification method, the bio-barcode assay, is reported. The assay utilizes oligonucleotide functionalized magnetic microparticles to capture the target of interest from the sample. A critical step in the new assay involves the use of blocking oligonucleotides during heat denaturation of the double stranded DNA. These blockers bind to specific regions of the target DNA upon cooling, and prevent the duplex DNA from re-hybridizing, which allows the particle probes to bind. Following target isolation using the magnetic particles, oligonucleotide functionalized gold nanoparticles act as target recognition agents. The oligonucleotides on the nanoparticle (barcodes) act as amplification surrogates. The barcodes are then detected using the Scanometric method. The limit of detection for this assay was determined to be 2.5 femtomolar, and this is the first demonstration of a barcode type assay for the detection of double stranded, genomic DNA. PMID:17927207

  14. Enhancing Targeted Genomic DNA Editing in Chicken Cells Using the CRISPR/Cas9 System

    PubMed Central

    Wang, Ling; Yang, Likai; Guo, Yijie; Du, Weili; Yin, Yajun; Zhang, Tao; Lu, Hongzhao

    2017-01-01

    The CRISPR/Cas9 system has enabled highly efficient genome targeted editing for various organisms. However, few studies have focused on CRISPR/Cas9 nuclease-mediated chicken genome editing compared with mammalian genomes. The current study combined CRISPR with yeast Rad52 (yRad52) to enhance targeted genomic DNA editing in chicken DF-1 cells. The efficiency of CRISPR/Cas9 nuclease-induced targeted mutations in the chicken genome was increased to 41.9% via the enrichment of the dual-reporter surrogate system. In addition, the combined effect of CRISPR nuclease and yRad52 dramatically increased the efficiency of the targeted substitution in the myostatin gene using 50-mer oligodeoxynucleotides (ssODN) as the donor DNA, resulting in a 36.7% editing efficiency after puromycin selection. Furthermore, based on the effect of yRad52, the frequency of exogenous gene integration in the chicken genome was more than 3-fold higher than that without yRad52. Collectively, these results suggest that ssODN is an ideal donor DNA for targeted substitution and that CRISPR/Cas9 combined with yRad52 significantly enhances chicken genome editing. These findings could be extensively applied in other organisms. PMID:28068387

  15. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

    PubMed

    Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J; Leitch, Ilia J

    2015-01-01

    The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.

  16. Porcine parvovirus: DNA sequence and genome organization.

    PubMed

    Ranz, A I; Manclús, J J; Díaz-Aroca, E; Casal, J I

    1989-10-01

    We have determined the nucleotide sequence of an almost full-length clone of porcine parvovirus (PPV). The sequence is 4973 nucleotides (nt) long. The 3' end of virion DNA shows a Y-shaped configuration homologous to rodent parvoviruses. The 5' end of virion DNA shows a repetition of 127 nt at the carboxy terminus of the capsid proteins. The overall organization of the PPV genome is similar to those of other autonomous parvoviruses. There are two large open reading frames (ORFs) that almost entirely cover the genome, both located in the same frame of the complementary strand. The left ORF encodes the non-structural protein NS1 and the right ORF encodes the capsid proteins (VP1, VP2 and VP3). Promoter analysis, location of splicing sites and putative amino acid sequences for the viral proteins show a high homology of PPV with feline panleukopenia virus and canine parvoviruses (FPV and CPV) and rodent parvovirus. Therefore we conclude that PPV is related to the Kilham rat virus (KRV) group of autonomous parvoviruses formed by KRV, minute virus of mice, Lu III, H-1, FPV and CPV.

  17. DNA repair efficiency in germ cells and early mouse embryos and consequences for radiation-induced transgenerational genomic damage

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Marchetti, Francesco; Wyrobek, Andrew J.

    Exposure to ionizing radiation and other environmental agents can affect the genomic integrity of germ cells and induce adverse health effects in the progeny. Efficient DNA repair during gametogenesis and the early embryonic cycles after fertilization is critical for preventing transmission of DNA damage to the progeny and relies on maternal factors stored in the egg before fertilization. The ability of the maternal repair machinery to repair DNA damage in both parental genomes in the fertilizing egg is especially crucial for the fertilizing male genome that has not experienced a DNA repair-competent cellular environment for several weeks prior to fertilization.more » During the DNA repair-deficient period of spermatogenesis, DNA lesions may accumulate in sperm and be carried into the egg where, if not properly repaired, could result in the formation of heritable chromosomal aberrations or mutations and associated birth defects. Studies with female mice deficient in specific DNA repair genes have shown that: (i) cell cycle checkpoints are activated in the fertilized egg by DNA damage carried by the sperm; and (ii) the maternal genotype plays a major role in determining the efficiency of repairing genomic lesions in the fertilizing sperm and directly affect the risk for abnormal reproductive outcomes. There is also growing evidence that implicates DNA damage carried by the fertilizing gamete as a mediator of postfertilization processes that contribute to genomic instability in subsequent generations. Transgenerational genomic instability most likely involves epigenetic mechanisms or error-prone DNA repair processes in the early embryo. Maternal and embryonic DNA repair processes during the early phases of mammalian embryonic development can have far reaching consequences for the genomic integrity and health of subsequent generations.« less

  18. Genome-wide and caste-specific DNA methylomes of the ants Camponotus floridanus and Harpegnathos saltator

    PubMed Central

    Bonasio, Roberto; Li, Qiye; Lian, Jinmin; Mutti, Navdeep S.; Jin, Lijun; Zhao, Hongmei; Zhang, Pei; Wen, Ping; Xiang, Hui; Ding, Yun; Jin, Zonghui; Shen, Steven S.; Wang, Zongji; Wang, Wen; Wang, Jun; Berger, Shelley L.; Liebig, Jürgen; Zhang, Guojie; Reinberg, Danny

    2012-01-01

    SUMMARY Background Ant societies comprise individuals belonging to different castes characterized by specialized morphologies and behaviors. Because ant embryos can follow different developmental trajectories, epigenetic mechanisms must play a role in caste determination. Ants have a full set of DNA methyltransferase and their genomes contain methylcytosine. To determine the relationship between DNA methylation and phenotypic plasticity in ants, we obtained and compared the genome-wide methylomes of different castes and developmental stages of Camponotus floridanus and Harpegnathos saltator. Results In the ant genomes, methylcytosines are found both in CpG and non-CpG contexts and are strongly enriched at exons of active genes. Changes in exonic DNA methylation correlate with alternative splicing events such as exon skipping and alternative splice site selection. Several genes exhibit caste-specific and developmental changes in DNA methylation that are conserved between the two species, including genes involved in reproduction, telomere maintenance, and noncoding RNA metabolism. Several loci are methylated and expressed monoallelically, and in some cases the choice of methylated allele depends on the caste. Conclusions These first ant methylomes and their intra- and inter-species comparison reveal an exonic methylation pattern that points to a connection between DNA methylation and splicing. The presence of monoallelic DNA methylation and the methylation of non-CpG sites in all samples suggest roles in genome regulation in these social insects, including the intriguing possibility of parental or caste-specific genomic imprinting. PMID:22885060

  19. Neonatal exposure to diethylstilbestrol alters expression of DNA methyltransferases and methylation of genomic DNA in the mouse uterus.

    PubMed

    Sato, Koji; Fukata, Hideki; Kogo, Yasushi; Ohgane, Jun; Shiota, Kunio; Mori, Chisato

    2009-01-01

    Perinatal exposure to diethylstilbestrol (DES) can have numerous adverse effects on the reproductive organs later in life, such as vaginal clear-cell adenocarcinoma. Epigenetic processes including DNA methylation may be involved in the mechanisms. We subcutaneously injected DES to neonatal C57BL/6 mice. At days 5, 14, and 30, expressions of DNA methyltransferases (Dnmts) Dnmt1, Dnmt3a, and Dnmt3b, and transcription factors Sp1 and Sp3 were examined. We also performed restriction landmark genomic scanning (RLGS) to detect aberrant DNA methylation. Real-time RT-PCR revealed that expressions of Dnmt1, Dnmt3b, and Sp3 were decreased at day 5 in DES-treated mice, and that those of Dnmt1, Dnmt3a, and Sp1 were also decreased at day 14. RLGS analysis revealed that 5 genomic loci were demethylated, and 5 other loci were methylated by DES treatment. Two loci were cloned, and differential DNA methylation was quantified. Our results indicated that DES altered the expression levels of Dnmts and DNA methylation.

  20. Evaluation and validation of de novo and hybrid assembly techniques to derive high quality genome sequences

    DOE PAGES

    Utturkar, Sagar M.; Klingeman, Dawn Marie; Land, Miriam L.; ...

    2014-06-14

    Our motivation with this work was to assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Our results show Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four different bacteria, which were assessed for quality using summary statistics (e.g. number of contigs, N50) and in silico evaluation tools. Differences in predictions of multiple copies of rDNA operons for each respective bacterium were evaluated by PCR and Sanger sequencing, and then the validated results were applied as anmore » additional criterion to rank assemblies. In general, assemblies using longer PacBio reads were better able to resolve repetitive regions. In this study, the combination of Illumina and PacBio sequence data assembled through the ALLPATHS-LG algorithm gave the best summary statistics and most accurate rDNA operon number predictions. This study will aid others looking to improve existing draft genome assemblies. As to availability and implementation–all assembly tools except CLC Genomics Workbench are freely available under GNU General Public License.« less

  1. Noncoding RNAs in DNA Repair and Genome Integrity

    PubMed Central

    Wan, Guohui; Liu, Yunhua; Han, Cecil; Zhang, Xinna

    2014-01-01

    Abstract Significance: The well-studied sequences in the human genome are those of protein-coding genes, which account for only 1%–2% of the total genome. However, with the advent of high-throughput transcriptome sequencing technology, we now know that about 90% of our genome is extensively transcribed and that the vast majority of them are transcribed into noncoding RNAs (ncRNAs). It is of great interest and importance to decipher the functions of these ncRNAs in humans. Recent Advances: In the last decade, it has become apparent that ncRNAs play a crucial role in regulating gene expression in normal development, in stress responses to internal and environmental stimuli, and in human diseases. Critical Issues: In addition to those constitutively expressed structural RNA, such as ribosomal and transfer RNAs, regulatory ncRNAs can be classified as microRNAs (miRNAs), Piwi-interacting RNAs (piRNAs), small interfering RNAs (siRNAs), small nucleolar RNAs (snoRNAs), and long noncoding RNAs (lncRNAs). However, little is known about the biological features and functional roles of these ncRNAs in DNA repair and genome instability, although a number of miRNAs and lncRNAs are regulated in the DNA damage response. Future Directions: A major goal of modern biology is to identify and characterize the full profile of ncRNAs with regard to normal physiological functions and roles in human disorders. Clinically relevant ncRNAs will also be evaluated and targeted in therapeutic applications. Antioxid. Redox Signal. 20, 655–677. PMID:23879367

  2. Origins of DNA Replication and Amplification in the Breast Cancer Genome

    DTIC Science & Technology

    2012-09-01

    W81XWH-10-1-0463 TITLE: Origins of DNA Replication and Amplification in the...2. REPORT TYPE Final 3. DATES COVERED 1 Sep 2010 – 31 Aug 2012 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Origins of DNA Replication and...described in the DOD funded parent grant, to test our hypothesis we need to map origins of DNA replication in the genome and ask which of these

  3. The genome-wide DNA sequence specificity of the anti-tumour drug bleomycin in human cells.

    PubMed

    Murray, Vincent; Chen, Jon K; Tanaka, Mark M

    2016-07-01

    The cancer chemotherapeutic agent, bleomycin, cleaves DNA at specific sites. For the first time, the genome-wide DNA sequence specificity of bleomycin breakage was determined in human cells. Utilising Illumina next-generation DNA sequencing techniques, over 200 million bleomycin cleavage sites were examined to elucidate the bleomycin genome-wide DNA selectivity. The genome-wide bleomycin cleavage data were analysed by four different methods to determine the cellular DNA sequence specificity of bleomycin strand breakage. For the most highly cleaved DNA sequences, the preferred site of bleomycin breakage was at 5'-GT* dinucleotide sequences (where the asterisk indicates the bleomycin cleavage site), with lesser cleavage at 5'-GC* dinucleotides. This investigation also determined longer bleomycin cleavage sequences, with preferred cleavage at 5'-GT*A and 5'- TGT* trinucleotide sequences, and 5'-TGT*A tetranucleotides. For cellular DNA, the hexanucleotide DNA sequence 5'-RTGT*AY (where R is a purine and Y is a pyrimidine) was the most highly cleaved DNA sequence. It was striking that alternating purine-pyrimidine sequences were highly cleaved by bleomycin. The highest intensity cleavage sites in cellular and purified DNA were very similar although there were some minor differences. Statistical nucleotide frequency analysis indicated a G nucleotide was present at the -3 position (relative to the cleavage site) in cellular DNA but was absent in purified DNA.

  4. Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparison with Other Methods

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wu, Liyou; Yi, T. Y.; Van Nostrand, Joy

    Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site [Hanford Reach of the Columbia River (HRCR), 11 strains], Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the averagemore » nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.« less

  5. Genome-wide mapping of nuclear mitochondrial DNA sequences links DNA replication origins to chromosomal double-strand break formation in Schizosaccharomyces pombe

    PubMed Central

    Lenglez, Sandrine; Hermand, Damien; Decottignies, Anabelle

    2010-01-01

    Chromosomal double-strand breaks (DSBs) threaten genome integrity and repair of these lesions is often mutagenic. How and where DSBs are formed is a major question conveniently addressed in simple model organisms like yeast. NUMTs, nuclear DNA sequences of mitochondrial origin, are present in most eukaryotic genomes and probably result from the capture of mitochondrial DNA (mtDNA) fragments into chromosomal breaks. NUMT formation is ongoing and was reported to cause de novo human genetic diseases. Study of NUMTs is likely to contribute to the understanding of naturally occurring chromosomal breaks. We show that Schizosaccharomyces pombe NUMTs are exclusively located in noncoding regions with no preference for gene promoters and, when located into promoters, do not affect gene transcription level. Strikingly, most noncoding regions comprising NUMTs are also associated with a DNA replication origin (ORI). Chromatin immunoprecipitation experiments revealed that chromosomal NUMTs are probably not acting as ORI on their own but that mtDNA insertions occurred directly next to ORIs, suggesting that these loci may be prone to DSB formation. Accordingly, induction of excessive DNA replication origin firing, a phenomenon often associated with human tumor formation, resulted in frequent nucleotide deletion events within ORI3001 subtelomeric chromosomal locus, illustrating a novel aspect of DNA replication-driven genomic instability. How mtDNA is fragmented is another important issue that we addressed by sequencing experimentally induced NUMTs. This highlighted regions of S. pombe mtDNA prone to breaking. Together with an analysis of human NUMTs, we propose that these fragile sites in mtDNA may correspond to replication pause sites. PMID:20688779

  6. Genomic signal processing methods for computation of alignment-free distances from DNA sequences.

    PubMed

    Borrayo, Ernesto; Mendizabal-Ruiz, E Gerardo; Vélez-Pérez, Hugo; Romo-Vázquez, Rebeca; Mendizabal, Adriana P; Morales, J Alejandro

    2014-01-01

    Genomic signal processing (GSP) refers to the use of digital signal processing (DSP) tools for analyzing genomic data such as DNA sequences. A possible application of GSP that has not been fully explored is the computation of the distance between a pair of sequences. In this work we present GAFD, a novel GSP alignment-free distance computation method. We introduce a DNA sequence-to-signal mapping function based on the employment of doublet values, which increases the number of possible amplitude values for the generated signal. Additionally, we explore the use of three DSP distance metrics as descriptors for categorizing DNA signal fragments. Our results indicate the feasibility of employing GAFD for computing sequence distances and the use of descriptors for characterizing DNA fragments.

  7. Genomic Signal Processing Methods for Computation of Alignment-Free Distances from DNA Sequences

    PubMed Central

    Borrayo, Ernesto; Mendizabal-Ruiz, E. Gerardo; Vélez-Pérez, Hugo; Romo-Vázquez, Rebeca; Mendizabal, Adriana P.; Morales, J. Alejandro

    2014-01-01

    Genomic signal processing (GSP) refers to the use of digital signal processing (DSP) tools for analyzing genomic data such as DNA sequences. A possible application of GSP that has not been fully explored is the computation of the distance between a pair of sequences. In this work we present GAFD, a novel GSP alignment-free distance computation method. We introduce a DNA sequence-to-signal mapping function based on the employment of doublet values, which increases the number of possible amplitude values for the generated signal. Additionally, we explore the use of three DSP distance metrics as descriptors for categorizing DNA signal fragments. Our results indicate the feasibility of employing GAFD for computing sequence distances and the use of descriptors for characterizing DNA fragments. PMID:25393409

  8. AID to overcome the limitations of genomic information by introducing somatic DNA alterations.

    PubMed

    Honjo, Tasuku; Muramatsu, Masamichi; Nagaoka, Hitoshi; Kinoshita, Kazuo; Shinkura, Reiko

    2006-05-01

    The immune system has adopted somatic DNA alterations to overcome the limitations of the genomic information. Activation induced cytidine deaminase (AID) is an essential enzyme to regulate class switch recombination (CSR), somatic hypermutation (SHM) and gene conversion (GC) of the immunoglobulin gene. AID is known to be required for DNA cleavage of S regions in CSR and V regions in SHM. However, its molecular mechanism is a focus of extensive debate. RNA editing hypothesis postulates that AID edits yet unknown mRNA, to generate specific endonucleases for CSR and SHM. By contrast, DNA deamination hypothesis assumes that AID deaminates cytosine in DNA, followed by DNA cleavage by base excision repair enzymes. We summarize the basic knowledge for molecular mechanisms for CSR and SHM and then discuss the importance of AID not only in the immune regulation but also in the genome instability.

  9. DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation

    PubMed Central

    Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M.; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H.

    2017-01-01

    Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array “waves”, and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance. PMID:28683077

  10. Genome-wide DNA polymorphisms in two cultivars of mei (Prunus mume sieb. et zucc.).

    PubMed

    Sun, Lidan; Zhang, Qixiang; Xu, Zongda; Yang, Weiru; Guo, Yu; Lu, Jiuxing; Pan, Huitang; Cheng, Tangren; Cai, Ming

    2013-10-06

    Mei (Prunus mume Sieb. et Zucc.) is a famous ornamental plant and fruit crop grown in East Asian countries. Limited genetic resources, especially molecular markers, have hindered the progress of mei breeding projects. Here, we performed low-depth whole-genome sequencing of Prunus mume 'Fenban' and Prunus mume 'Kouzi Yudie' to identify high-quality polymorphic markers between the two cultivars on a large scale. A total of 1464.1 Mb and 1422.1 Mb of 'Fenban' and 'Kouzi Yudie' sequencing data were uniquely mapped to the mei reference genome with about 6-fold coverage, respectively. We detected a large number of putative polymorphic markers from the 196.9 Mb of sequencing data shared by the two cultivars, which together contained 200,627 SNPs, 4,900 InDels, and 7,063 SSRs. Among these markers, 38,773 SNPs, 174 InDels, and 418 SSRs were distributed in the 22.4 Mb CDS region, and 63.0% of these marker-containing CDS sequences were assigned to GO terms. Subsequently, 670 selected SNPs were validated using an Agilent's SureSelect solution phase hybridization assay. A subset of 599 SNPs was used to assess the genetic similarity of a panel of mei germplasm samples and a plum (P. salicina) cultivar, producing a set of informative diversity data. We also analyzed the frequency and distribution of detected InDels and SSRs in mei genome and validated their usefulness as DNA markers. These markers were successfully amplified in the cultivars and in their segregating progeny. A large set of high-quality polymorphic SNPs, InDels, and SSRs were identified in parallel between 'Fenban' and 'Kouzi Yudie' using low-depth whole-genome sequencing. The study presents extensive data on these polymorphic markers, which can be useful for constructing high-resolution genetic maps, performing genome-wide association studies, and designing genomic selection strategies in mei.

  11. Enzymatic Removal of Ribonucleotides from DNA Is Essential for Mammalian Genome Integrity and Development

    PubMed Central

    Reijns, Martin A.M.; Rabe, Björn; Rigby, Rachel E.; Mill, Pleasantine; Astell, Katy R.; Lettice, Laura A.; Boyle, Shelagh; Leitch, Andrea; Keighren, Margaret; Kilanowski, Fiona; Devenney, Paul S.; Sexton, David; Grimes, Graeme; Holt, Ian J.; Hill, Robert E.; Taylor, Martin S.; Lawson, Kirstie A.; Dorin, Julia R.; Jackson, Andrew P.

    2012-01-01

    Summary The presence of ribonucleotides in genomic DNA is undesirable given their increased susceptibility to hydrolysis. Ribonuclease (RNase) H enzymes that recognize and process such embedded ribonucleotides are present in all domains of life. However, in unicellular organisms such as budding yeast, they are not required for viability or even efficient cellular proliferation, while in humans, RNase H2 hypomorphic mutations cause the neuroinflammatory disorder Aicardi-Goutières syndrome. Here, we report that RNase H2 is an essential enzyme in mice, required for embryonic growth from gastrulation onward. RNase H2 null embryos accumulate large numbers of single (or di-) ribonucleotides embedded in their genomic DNA (>1,000,000 per cell), resulting in genome instability and a p53-dependent DNA-damage response. Our findings establish RNase H2 as a key mammalian genome surveillance enzyme required for ribonucleotide removal and demonstrate that ribonucleotides are the most commonly occurring endogenous nucleotide base lesion in replicating cells. PMID:22579044

  12. First Complete Squash leaf curl China virus Genomic Segment DNA-A Sequence from East Timor

    PubMed Central

    Maina, Solomon; Edwards, Owain R.; de Almeida, Luis; Ximenes, Abel

    2017-01-01

    ABSTRACT We present here the first complete Squash leaf curl China virus (SLCCV) genomic segment DNA-A sequence from East Timor. It was isolated from a pumpkin plant. When compared with 15 complete SLCCV DNA-A genome sequences from other world regions, it most resembled the Malaysian isolate MC1 sequence. PMID:28619789

  13. P-Hint-Hunt: a deep parallelized whole genome DNA methylation detection tool.

    PubMed

    Peng, Shaoliang; Yang, Shunyun; Gao, Ming; Liao, Xiangke; Liu, Jie; Yang, Canqun; Wu, Chengkun; Yu, Wenqiang

    2017-03-14

    The increasing studies have been conducted using whole genome DNA methylation detection as one of the most important part of epigenetics research to find the significant relationships among DNA methylation and several typical diseases, such as cancers and diabetes. In many of those studies, mapping the bisulfite treated sequence to the whole genome has been the main method to study DNA cytosine methylation. However, today's relative tools almost suffer from inaccuracies and time-consuming problems. In our study, we designed a new DNA methylation prediction tool ("Hint-Hunt") to solve the problem. By having an optimal complex alignment computation and Smith-Waterman matrix dynamic programming, Hint-Hunt could analyze and predict the DNA methylation status. But when Hint-Hunt tried to predict DNA methylation status with large-scale dataset, there are still slow speed and low temporal-spatial efficiency problems. In order to solve the problems of Smith-Waterman dynamic programming and low temporal-spatial efficiency, we further design a deep parallelized whole genome DNA methylation detection tool ("P-Hint-Hunt") on Tianhe-2 (TH-2) supercomputer. To the best of our knowledge, P-Hint-Hunt is the first parallel DNA methylation detection tool with a high speed-up to process large-scale dataset, and could run both on CPU and Intel Xeon Phi coprocessors. Moreover, we deploy and evaluate Hint-Hunt and P-Hint-Hunt on TH-2 supercomputer in different scales. The experimental results illuminate our tools eliminate the deviation caused by bisulfite treatment in mapping procedure and the multi-level parallel program yields a 48 times speed-up with 64 threads. P-Hint-Hunt gain a deep acceleration on CPU and Intel Xeon Phi heterogeneous platform, which gives full play of the advantages of multi-cores (CPU) and many-cores (Phi).

  14. Genome-Wide Analysis of Transposon and Retroviral Insertions Reveals Preferential Integrations in Regions of DNA Flexibility.

    PubMed

    Vrljicak, Pavle; Tao, Shijie; Varshney, Gaurav K; Quach, Helen Ngoc Bao; Joshi, Adita; LaFave, Matthew C; Burgess, Shawn M; Sampath, Karuna

    2016-04-07

    DNA transposons and retroviruses are important transgenic tools for genome engineering. An important consideration affecting the choice of transgenic vector is their insertion site preferences. Previous large-scale analyses of Ds transposon integration sites in plants were done on the basis of reporter gene expression or germ-line transmission, making it difficult to discern vertebrate integration preferences. Here, we compare over 1300 Ds transposon integration sites in zebrafish with Tol2 transposon and retroviral integration sites. Genome-wide analysis shows that Ds integration sites in the presence or absence of marker selection are remarkably similar and distributed throughout the genome. No strict motif was found, but a preference for structural features in the target DNA associated with DNA flexibility (Twist, Tilt, Rise, Roll, Shift, and Slide) was observed. Remarkably, this feature is also found in transposon and retroviral integrations in maize and mouse cells. Our findings show that structural features influence the integration of heterologous DNA in genomes, and have implications for targeted genome engineering. Copyright © 2016 Vrljicak et al.

  15. Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.

    PubMed

    Hazkani-Covo, Einat; Martin, William F

    2017-05-01

    Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  16. Short-Sequence DNA Repeats in Prokaryotic Genomes

    PubMed Central

    van Belkum, Alex; Scherer, Stewart; van Alphen, Loek; Verbrugh, Henri

    1998-01-01

    Short-sequence DNA repeat (SSR) loci can be identified in all eukaryotic and many prokaryotic genomes. These loci harbor short or long stretches of repeated nucleotide sequence motifs. DNA sequence motifs in a single locus can be identical and/or heterogeneous. SSRs are encountered in many different branches of the prokaryote kingdom. They are found in genes encoding products as diverse as microbial surface components recognizing adhesive matrix molecules and specific bacterial virulence factors such as lipopolysaccharide-modifying enzymes or adhesins. SSRs enable genetic and consequently phenotypic flexibility. SSRs function at various levels of gene expression regulation. Variations in the number of repeat units per locus or changes in the nature of the individual repeat sequences may result from recombination processes or polymerase inadequacy such as slipped-strand mispairing (SSM), either alone or in combination with DNA repair deficiencies. These rather complex phenomena can occur with relative ease, with SSM approaching a frequency of 10−4 per bacterial cell division and allowing high-frequency genetic switching. Bacteria use this random strategy to adapt their genetic repertoire in response to selective environmental pressure. SSR-mediated variation has important implications for bacterial pathogenesis and evolutionary fitness. Molecular analysis of changes in SSRs allows epidemiological studies on the spread of pathogenic bacteria. The occurrence, evolution and function of SSRs, and the molecular methods used to analyze them are discussed in the context of responsiveness to environmental factors, bacterial pathogenicity, epidemiology, and the availability of full-genome sequences for increasing numbers of microorganisms, especially those that are medically relevant. PMID:9618442

  17. Second generation noninvasive fetal genome analysis reveals de novo mutations, single-base parental inheritance, and preferred DNA ends

    PubMed Central

    Chan, K. C. Allen; Jiang, Peiyong; Sun, Kun; Cheng, Yvonne K. Y.; Tong, Yu K.; Cheng, Suk Hang; Wong, Ada I. C.; Hudecova, Irena; Leung, Tak Y.; Chiu, Rossa W. K.; Lo, Yuk Ming Dennis

    2016-01-01

    Plasma DNA obtained from a pregnant woman was sequenced to a depth of 270× haploid genome coverage. Comparing the maternal plasma DNA sequencing data with the parental genomic DNA data and using a series of bioinformatics filters, fetal de novo mutations were detected at a sensitivity of 85% and a positive predictive value of 74%. These results represent a 169-fold improvement in the positive predictive value over previous attempts. Improvements in the interpretation of the sequence information of every base position in the genome allowed us to interrogate the maternal inheritance of the fetus for 618,271 of 656,676 (94.2%) heterozygous SNPs within the maternal genome. The fetal genotype at each of these sites was deduced individually, unlike previously, where the inheritance was determined for a collection of sites within a haplotype. These results represent a 90-fold enhancement in the resolution in determining the fetus’s maternal inheritance. Selected genomic locations were more likely to be found at the ends of plasma DNA molecules. We found that a subset of such preferred ends exhibited selectivity for fetal- or maternal-derived DNA in maternal plasma. The ratio of the number of maternal plasma DNA molecules with fetal preferred ends to those with maternal preferred ends showed a correlation with the fetal DNA fraction. Finally, this second generation approach for noninvasive fetal whole-genome analysis was validated in a pregnancy diagnosed with cardiofaciocutaneous syndrome with maternal plasma DNA sequenced to 195× coverage. The causative de novo BRAF mutation was successfully detected through the maternal plasma DNA analysis. PMID:27799561

  18. Isolation of PCR quality microbial community DNA from heavily contaminated environments.

    PubMed

    Gunawardana, Manjula; Chang, Simon; Jimenez, Abraham; Holland-Moritz, Daniel; Holland-Moritz, Hannah; La Val, Taylor P; Lund, Craig; Mullen, Madeline; Olsen, John; Sztain, Terra A; Yoo, Jennifer; Moss, John A; Baum, Marc M

    2014-07-01

    Asphalts, biochemically degraded oil, contain persistent, water-soluble compounds that pose a significant challenge to the isolation of PCR quality DNA. The adaptation of existing DNA purification protocols and commercial kits proved unsuccessful at overcoming this hurdle. Treatment of aqueous asphalt extracts with a polyamide resin afforded genomic microbial DNA templates that could readily be amplified by PCR. Physicochemically distinct asphalt samples from five natural oil seeps successfully generated the expected 291 bp amplicons targeting a region of the 16S rRNA gene, illustrating the robustness of the method. DNA recovery yields were in the 50-80% range depending on how the asphalt sample was seeded with exogenous DNA. The scope of the new method was expanded to include soil with high humic acid content. DNA from soil samples spiked with a range of humic acid concentrations was extracted with a commercial kit followed by treatment with the polyamide resin. The additional step significantly improved the purity of the DNA templates, especially at high humic acid concentrations, based on qPCR analysis of the bacterial 16S rRNA genes. The new method has the advantages of being inexpensive, simple, and rapid and should provide a valuable addition to protocols in the field of petroleum and soil microbiology. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. Characterization of noncoding regulatory DNA in the human genome.

    PubMed

    Elkon, Ran; Agami, Reuven

    2017-08-08

    Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.

  20. Use of Multiple Displacement Amplification as Pre-polymerase Chain Reaction (Pre-PCR) to amplify genomic DNA of siphonapterids preserved for long periods in scientific collections.

    PubMed

    Avelar, Daniel M; Linardi, Pedro M

    2010-09-15

    The recently developed Multiple Displacement Amplification technique (MDA) allows for the production of a large quantity of high quality genomic DNA from low amounts of the original DNA. The goal of this study was to evaluate the performance of the MDA technique to amplify genomic DNA of siphonapterids that have been stored for long periods in 70% ethanol at room temperature. We subjected each DNA sample to two different methodologies: (1) amplification of mitochondrial 16S sequences without MDA; (2) amplification of 16S after MDA. All the samples obtained from these procedures were then sequenced. Only 4 samples (15.4%) subjected to method 1 showed amplification. In contrast, the application of MDA (method 2) improved the performance substantially, with 24 samples (92.3%) showing amplification, with significant difference. Interestingly, one of the samples successfully amplified with this method was originally collected in 1909. All of the sequenced samples displayed satisfactory results in quality evaluations (Phred ≥ 20) and good similarities, as identified with the BLASTn tool. Our results demonstrate that the use of MDA may be an effective tool in molecular studies involving specimens of fleas that have traditionally been considered inadequately preserved for such purposes.

  1. Use of Multiple Displacement Amplification as Pre-polymerase Chain Reaction (Pre-PCR) to amplify genomic DNA of siphonapterids preserved for long periods in scientific collections

    PubMed Central

    2010-01-01

    The recently developed Multiple Displacement Amplification technique (MDA) allows for the production of a large quantity of high quality genomic DNA from low amounts of the original DNA. The goal of this study was to evaluate the performance of the MDA technique to amplify genomic DNA of siphonapterids that have been stored for long periods in 70% ethanol at room temperature. We subjected each DNA sample to two different methodologies: (1) amplification of mitochondrial 16S sequences without MDA; (2) amplification of 16S after MDA. All the samples obtained from these procedures were then sequenced. Only 4 samples (15.4%) subjected to method 1 showed amplification. In contrast, the application of MDA (method 2) improved the performance substantially, with 24 samples (92.3%) showing amplification, with significant difference. Interestingly, one of the samples successfully amplified with this method was originally collected in 1909. All of the sequenced samples displayed satisfactory results in quality evaluations (Phred ≥ 20) and good similarities, as identified with the BLASTn tool. Our results demonstrate that the use of MDA may be an effective tool in molecular studies involving specimens of fleas that have traditionally been considered inadequately preserved for such purposes. PMID:20840790

  2. Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.

    PubMed

    Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene

    2017-02-01

    Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. Adenovirus Core Protein VII Protects the Viral Genome from a DNA Damage Response at Early Times after Infection▿

    PubMed Central

    Karen, Kasey A.; Hearing, Patrick

    2011-01-01

    Adenovirus has a linear, double-stranded DNA genome that is perceived by the cellular Mre11-Rad50-Nbs1 (MRN) DNA repair complex as a double-strand break. If unabated, MRN elicits a double-strand break repair response that blocks viral DNA replication and ligates the viral genomes into concatemers. There are two sets of early viral proteins that inhibit the MRN complex. The E1B-55K/E4-ORF6 complex recruits an E3 ubiquitin ligase and targets MRN proteins for proteasome-dependent degradation. The E4-ORF3 protein inhibits MRN through sequestration. The mechanism that prevents MRN recognition of the viral genome prior to the expression of these early proteins was previously unknown. Here we show a temporal correlation between the loss of viral core protein VII from the adenovirus genome and a gain of checkpoint signaling due to the double-strand break repair response. While checkpoint signaling corresponds to the recognition of the viral genome, core protein VII binding to and checkpoint signaling at viral genomes are largely mutually exclusive. Transcription is known to release protein VII from the genome, and the inhibition of transcription shows a decrease in checkpoint signaling. Finally, we show that the nuclease activity of Mre11 is dispensable for the inhibition of viral DNA replication during a DNA damage response. These results support a model involving the protection of the incoming viral genome from checkpoint signaling by core protein VII and suggest that the induction of an MRN-dependent DNA damage response may inhibit adenovirus replication by physically masking the origins of DNA replication rather than altering their integrity. PMID:21345950

  4. DNA Damage and Genomic Instability Induced by Inappropriate DNA Re-replication

    DTIC Science & Technology

    2007-04-01

    Conway, A., Lockhart, D. J., Davis, R. W., Brewer , B. J., and Fangman, W. L. (2001). Replication dynamics of the yeast genome. Science 294, 115–121... Brewer , B. J. (2001). An origin-deficient yeast artificial chromosome triggers a cell cycle checkpoint. Mol. Cell 7, 705–713. Vas, A., Mok, W., and...replication in yeast cells. We have demonstrated that re-replication induces a rapid and significant decrease in cell viability and a cellular DNA damage

  5. Identification and nucleotide sequence analysis of the repetitive DNA element in the genome of fish lymphocystis disease virus.

    PubMed

    Schnitzler, P; Delius, H; Scholz, J; Touray, M; Orth, E; Darai, G

    1987-12-01

    The genome of the fish lymphocystis disease virus (FLDV) was screened for the existence of repetitive DNA sequences using a defined and complete gene library of the viral genome (98 kbp) by DNA-DNA hybridization, heteroduplex analysis, and restriction fine mapping. A repetitive DNA sequence was detected at the coordinates 0.034 to 0.057 and 0.718 to 0.736 map units (m.u.) of the FLDV genome. The first region (0.034 to 0.057 m.u.) corresponds to the 5' terminus of the EcoRI FLDV DNA fragment B (0.034 to 0.165 m.u.) and the second region (0.718 to 0.736 m.u.) is identical to the EcoRI DNA fragment M of the viral genome. The DNA nucleotide sequence of the EcoRI FLDV DNA fragment M was determined. This analysis revealed the presence of many short direct and inverted repetitions, e.g., a 18-mer direct repetition (TTTAAAATTTAATTAA) that started at nucleotide positions 812 and 942 and a 14-mer inverted repeat (TTAAATTTAAATTT) at nucleotide positions 820 and 959. Only short open reading frames were detected within this region. The DNA repetitions are discussed as sequences that play a possible regulatory role for virus replication. Furthermore, hybridization experiments revealed that the repetitive DNA sequences are conserved in the genome of different strains of fish lymphocystis disease virus isolated from two species of Pleuronectidae (flounder and dab).

  6. Local chromatin structure of heterochromatin regulates repeated DNA stability, nucleolus structure, and genome integrity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peng, Jamy C.

    Heterochromatin constitutes a significant portion of the genome in higher eukaryotes; approximately 30% in Drosophila and human. Heterochromatin contains a high repeat DNA content and a low density of protein-encoding genes. In contrast, euchromatin is composed mostly of unique sequences and contains the majority of single-copy genes. Genetic and cytological studies demonstrated that heterochromatin exhibits regulatory roles in chromosome organization, centromere function and telomere protection. As an epigenetically regulated structure, heterochromatin formation is not defined by any DNA sequence consensus. Heterochromatin is characterized by its association with nucleosomes containing methylated-lysine 9 of histone H3 (H3K9me), heterochromatin protein 1 (HP1) thatmore » binds H3K9me, and Su(var)3-9, which methylates H3K9 and binds HP1. Heterochromatin formation and functions are influenced by HP1, Su(var)3-9, and the RNA interference (RNAi) pathway. My thesis project investigates how heterochromatin formation and function impact nuclear architecture, repeated DNA organization, and genome stability in Drosophila melanogaster. H3K9me-based chromatin reduces extrachromosomal DNA formation; most likely by restricting the access of repair machineries to repeated DNAs. Reducing extrachromosomal ribosomal DNA stabilizes rDNA repeats and the nucleolus structure. H3K9me-based chromatin also inhibits DNA damage in heterochromatin. Cells with compromised heterochromatin structure, due to Su(var)3-9 or dcr-2 (a component of the RNAi pathway) mutations, display severe DNA damage in heterochromatin compared to wild type. In these mutant cells, accumulated DNA damage leads to chromosomal defects such as translocations, defective DNA repair response, and activation of the G2-M DNA repair and mitotic checkpoints that ensure cellular and animal viability. My thesis research suggests that DNA replication, repair, and recombination mechanisms in heterochromatin differ from those

  7. Analysis of Genome Plasticity in Pathogenic and Commensal Escherichia coli Isolates by Use of DNA Arrays

    PubMed Central

    Dobrindt, Ulrich; Agerer, Franziska; Michaelis, Kai; Janka, Andreas; Buchrieser, Carmen; Samuelson, Martin; Svanborg, Catharina; Gottschalk, Gerhard; Karch, Helge; Hacker, Jörg

    2003-01-01

    Genomes of prokaryotes differ significantly in size and DNA composition. Escherichia coli is considered a model organism to analyze the processes involved in bacterial genome evolution, as the species comprises numerous pathogenic and commensal variants. Pathogenic and nonpathogenic E. coli strains differ in the presence and absence of additional DNA elements contributing to specific virulence traits and also in the presence and absence of additional genetic information. To analyze the genetic diversity of pathogenic and commensal E. coli isolates, a whole-genome approach was applied. Using DNA arrays, the presence of all translatable open reading frames (ORFs) of nonpathogenic E. coli K-12 strain MG1655 was investigated in 26 E. coli isolates, including various extraintestinal and intestinal pathogenic E. coli isolates, 3 pathogenicity island deletion mutants, and commensal and laboratory strains. Additionally, the presence of virulence-associated genes of E. coli was determined using a DNA “pathoarray” developed in our laboratory. The frequency and distributional pattern of genomic variations vary widely in different E. coli strains. Up to 10% of the E. coli K-12-specific ORFs were not detectable in the genomes of the different strains. DNA sequences described for extraintestinal or intestinal pathogenic E. coli are more frequently detectable in isolates of the same origin than in other pathotypes. Several genes coding for virulence or fitness factors are also present in commensal E. coli isolates. Based on these results, the conserved E. coli core genome is estimated to consist of at least 3,100 translatable ORFs. The absence of K-12-specific ORFs was detectable in all chromosomal regions. These data demonstrate the great genome heterogeneity and genetic diversity among E. coli strains and underline the fact that both the acquisition and deletion of DNA elements are important processes involved in the evolution of prokaryotes. PMID:12618447

  8. Genome-Wide Profiling of DNA Double-Strand Breaks by the BLESS and BLISS Methods.

    PubMed

    Mirzazadeh, Reza; Kallas, Tomasz; Bienko, Magda; Crosetto, Nicola

    2018-01-01

    DNA double-strand breaks (DSBs) are major DNA lesions that are constantly formed during physiological processes such as DNA replication, transcription, and recombination, or as a result of exogenous agents such as ionizing radiation, radiomimetic drugs, and genome editing nucleases. Unrepaired DSBs threaten genomic stability by leading to the formation of potentially oncogenic rearrangements such as translocations. In past few years, several methods based on next-generation sequencing (NGS) have been developed to study the genome-wide distribution of DSBs or their conversion to translocation events. We developed Breaks Labeling, Enrichment on Streptavidin, and Sequencing (BLESS), which was the first method for direct labeling of DSBs in situ followed by their genome-wide mapping at nucleotide resolution (Crosetto et al., Nat Methods 10:361-365, 2013). Recently, we have further expanded the quantitative nature, applicability, and scalability of BLESS by developing Breaks Labeling In Situ and Sequencing (BLISS) (Yan et al., Nat Commun 8:15058, 2017). Here, we first present an overview of existing methods for genome-wide localization of DSBs, and then focus on the BLESS and BLISS methods, discussing different assay design options depending on the sample type and application.

  9. Undermethylated DNA as a source of microsatellites from a conifer genome.

    PubMed

    Zhou, Y; Bui, T; Auckland, L D; Williams, C G

    2002-02-01

    Developing microsatellites from the large, highly duplicated conifer genome requires special tools. To improve the efficiency of developing Pinus taeda L. microsatellites, undermethylated (UM) DNA fragments were used to construct a microsatellite-enriched copy library. A methylation-sensitive restriction enzyme, McrBC, was used to enrich for UM DNA before library construction. Digested DNA fragments larger than 9 kb were then excised and digested with RsaI and used to construct nine dinucleotide and trinucleotide libraries. A total of 1016 microsatellite-positive clones were detected among 11 904 clones and 620 of these were unique. Of 245 primer sets that produced a PCR product, 113 could be developed as UM microsatellite markers and 70 were polymorphic. Inheritance and marker informativeness were tested for a random sample of 36 polymorphic markers using a three-generation outbred pedigree. Thirty-one microsatellites (86%) had single-locus inheritance despite the highly duplicated nature of the P. taeda genome. Nineteen UM microsatellites had highly informative intercross mating type configurations. Allele number and frequency were estimated for eleven UM microsatellites using a population survey. Allele numbers for these UM microsatellites ranged from 3 to 12 with an average of 5.7 alleles/locus. Frequencies for the 63 alleles were mostly in the low-common range; only 14 of the 63 were in the rare allele (q < 0.05) class. Enriching for UM DNA was an efficient method for developing polymorphic microsatellites from a large plant genome.

  10. Archaeal Genome Guardians Give Insights into Eukaryotic DNA Replication and Damage Response Proteins

    PubMed Central

    Shin, David S.; Pratt, Ashley J.; Tainer, John A.

    2014-01-01

    As the third domain of life, archaea, like the eukarya and bacteria, must have robust DNA replication and repair complexes to ensure genome fidelity. Archaea moreover display a breadth of unique habitats and characteristics, and structural biologists increasingly appreciate these features. As archaea include extremophiles that can withstand diverse environmental stresses, they provide fundamental systems for understanding enzymes and pathways critical to genome integrity and stress responses. Such archaeal extremophiles provide critical data on the periodic table for life as well as on the biochemical, geochemical, and physical limitations to adaptive strategies allowing organisms to thrive under environmental stress relevant to determining the boundaries for life as we know it. Specifically, archaeal enzyme structures have informed the architecture and mechanisms of key DNA repair proteins and complexes. With added abilities to temperature-trap flexible complexes and reveal core domains of transient and dynamic complexes, these structures provide insights into mechanisms of maintaining genome integrity despite extreme environmental stress. The DNA damage response protein structures noted in this review therefore inform the basis for genome integrity in the face of environmental stress, with implications for all domains of life as well as for biomanufacturing, astrobiology, and medicine. PMID:24701133

  11. Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA.

    PubMed

    Kane, Nolan; Sveinsson, Saemundur; Dempewolf, Hannes; Yang, Ji Yong; Zhang, Dapeng; Engels, Johannes M M; Cronk, Quentin

    2012-02-01

    To reliably identify lineages below the species level such as subspecies or varieties, we propose an extension to DNA-barcoding using next-generation sequencing to produce whole organellar genomes and substantial nuclear ribosomal sequence. Because this method uses much longer versions of the traditional DNA-barcoding loci in the plastid and ribosomal DNA, we call our approach ultra-barcoding (UBC). We used high-throughput next-generation sequencing to scan the genome and generate reliable sequence of high copy number regions. Using this method, we examined whole plastid genomes as well as nearly 6000 bases of nuclear ribosomal DNA sequences for nine genotypes of Theobroma cacao and an individual of the related species T. grandiflorum, as well as an additional publicly available whole plastid genome of T. cacao. All individuals of T. cacao examined were uniquely distinguished, and evidence of reticulation and gene flow was observed. Sequence variation was observed in some of the canonical barcoding regions between species, but other regions of the chloroplast were more variable both within species and between species, as were ribosomal spacers. Furthermore, no single region provides the level of data available using the complete plastid genome and rDNA. Our data demonstrate that UBC is a viable, increasingly cost-effective approach for reliably distinguishing varieties and even individual genotypes of T. cacao. This approach shows great promise for applications where very closely related or interbreeding taxa must be distinguished.

  12. Novel genomic island modifies DNA with 7-deazaguanine derivatives

    PubMed Central

    Thiaville, Jennifer J.; Kellner, Stefanie M.; Yuan, Yifeng; Hutinet, Geoffrey; Thiaville, Patrick C.; Jumpathong, Watthanachai; Mohapatra, Susovan; Brochier-Armanet, Celine; Letarov, Andrey V.; Hillebrand, Roman; Malik, Chanchal K.; Rizzo, Carmelo J.; Dedon, Peter C.; de Crécy-Lagard, Valérie

    2016-01-01

    The discovery of ∼20-kb gene clusters containing a family of paralogs of tRNA guanosine transglycosylase genes, called tgtA5, alongside 7-cyano-7-deazaguanine (preQ0) synthesis and DNA metabolism genes, led to the hypothesis that 7-deazaguanine derivatives are inserted in DNA. This was established by detecting 2’-deoxy-preQ0 and 2’-deoxy-7-amido-7-deazaguanosine in enzymatic hydrolysates of DNA extracted from the pathogenic, Gram-negative bacteria Salmonella enterica serovar Montevideo. These modifications were absent in the closely related S. enterica serovar Typhimurium LT2 and from a mutant of S. Montevideo, each lacking the gene cluster. This led us to rename the genes of the S. Montevideo cluster as dpdA-K for 7-deazapurine in DNA. Similar gene clusters were analyzed in ∼150 phylogenetically diverse bacteria, and the modifications were detected in DNA from other organisms containing these clusters, including Kineococcus radiotolerans, Comamonas testosteroni, and Sphingopyxis alaskensis. Comparative genomic analysis shows that, in Enterobacteriaceae, the cluster is a genomic island integrated at the leuX locus, and the phylogenetic analysis of the TgtA5 family is consistent with widespread horizontal gene transfer. Comparison of transformation efficiencies of modified or unmodified plasmids into isogenic S. Montevideo strains containing or lacking the cluster strongly suggests a restriction–modification role for the cluster in Enterobacteriaceae. Another preQ0 derivative, 2’-deoxy-7-formamidino-7-deazaguanosine, was found in the Escherichia coli bacteriophage 9g, as predicted from the presence of homologs of genes involved in the synthesis of the archaeosine tRNA modification. These results illustrate a deep and unexpected evolutionary connection between DNA and tRNA metabolism. PMID:26929322

  13. The Conjugative Relaxase TrwC Promotes Integration of Foreign DNA in the Human Genome

    PubMed Central

    González-Prieto, Coral; Gabriel, Richard; Dehio, Christoph; Schmidt, Manfred

    2017-01-01

    ABSTRACT Bacterial conjugation is a mechanism of horizontal DNA transfer. The relaxase TrwC of the conjugative plasmid R388 cleaves one strand of the transferred DNA at the oriT gene, covalently attaches to it, and leads the single-stranded DNA (ssDNA) into the recipient cell. In addition, TrwC catalyzes site-specific integration of the transferred DNA into its target sequence present in the genome of the recipient bacterium. Here, we report the analysis of the efficiency and specificity of the integrase activity of TrwC in human cells, using the type IV secretion system of the human pathogen Bartonella henselae to introduce relaxase-DNA complexes. Compared to Mob relaxase from plasmid pBGR1, we found that TrwC mediated a 10-fold increase in the rate of plasmid DNA transfer to human cells and a 100-fold increase in the rate of chromosomal integration of the transferred DNA. We used linear amplification-mediated PCR and plasmid rescue to characterize the integration pattern in the human genome. DNA sequence analysis revealed mostly reconstituted oriT sequences, indicating that TrwC is active and recircularizes transferred DNA in human cells. One TrwC-mediated site-specific integration event was detected, proving that TrwC is capable of mediating site-specific integration in the human genome, albeit with very low efficiency compared to the rate of random integration. Our results suggest that TrwC may stabilize the plasmid DNA molecules in the nucleus of the human cell, probably by recircularization of the transferred DNA strand. This stabilization would increase the opportunities for integration of the DNA by the host machinery. IMPORTANCE Different biotechnological applications, including gene therapy strategies, require permanent modification of target cells. Long-term expression is achieved either by extrachromosomal persistence or by integration of the introduced DNA. Here, we studied the utility of conjugative relaxase TrwC, a bacterial protein with site

  14. Sampling the genomic pool of protein tyrosine kinase genes using the polymerase chain reaction with genomic DNA.

    PubMed

    Oates, A C; Wollberg, P; Achen, M G; Wilks, A F

    1998-08-28

    The polymerase chain reaction (PCR), with cDNA as template, has been widely used to identify members of protein families from many species. A major limitation of using cDNA in PCR is that detection of a family member is dependent on temporal and spatial patterns of gene expression. To circumvent this restriction, and in order to develop a technique that is broadly applicable we have tested the use of genomic DNA as PCR template to identify members of protein families in an expression-independent manner. This test involved amplification of DNA encoding protein tyrosine kinase (PTK) genes from the genomes of three animal species that are well known development models; namely, the mouse Mus musculus, the fruit fly Drosophila melanogaster, and the nematode worm Caenorhabditis elegans. Ten PTK genes were identified from the mouse, 13 from the fruit fly, and 13 from the nematode worm. Among these kinases were 13 members of the PTK family that had not been reported previously. Selected PTKs from this screen were shown to be expressed during development, demonstrating that the amplified fragments did not arise from pseudogenes. This approach will be useful for the identification of many novel members of gene families in organisms of agricultural, medical, developmental and evolutionary significance and for analysis of gene families from any species, or biological sample whose habitat precludes the isolation of mRNA. Furthermore, as a tool to hasten the discovery of members of gene families that are of particular interest, this method offers an opportunity to sample the genome for new members irrespective of their expression pattern.

  15. Comparison of microbial DNA enrichment tools for metagenomic whole genome sequencing.

    PubMed

    Thoendel, Matthew; Jeraldo, Patricio R; Greenwood-Quaintance, Kerryl E; Yao, Janet Z; Chia, Nicholas; Hanssen, Arlen D; Abdel, Matthew P; Patel, Robin

    2016-08-01

    Metagenomic whole genome sequencing for detection of pathogens in clinical samples is an exciting new area for discovery and clinical testing. A major barrier to this approach is the overwhelming ratio of human to pathogen DNA in samples with low pathogen abundance, which is typical of most clinical specimens. Microbial DNA enrichment methods offer the potential to relieve this limitation by improving this ratio. Two commercially available enrichment kits, the NEBNext Microbiome DNA Enrichment Kit and the Molzym MolYsis Basic kit, were tested for their ability to enrich for microbial DNA from resected arthroplasty component sonicate fluids from prosthetic joint infections or uninfected sonicate fluids spiked with Staphylococcus aureus. Using spiked uninfected sonicate fluid there was a 6-fold enrichment of bacterial DNA with the NEBNext kit and 76-fold enrichment with the MolYsis kit. Metagenomic whole genome sequencing of sonicate fluid revealed 13- to 85-fold enrichment of bacterial DNA using the NEBNext enrichment kit. The MolYsis approach achieved 481- to 9580-fold enrichment, resulting in 7 to 59% of sequencing reads being from the pathogens known to be present in the samples. These results demonstrate the usefulness of these tools when testing clinical samples with low microbial burden using next generation sequencing. Copyright © 2016 Elsevier B.V. All rights reserved.

  16. Effect of storage and processing on plasmid, yeast and plant genomic DNA stability in juice from genetically modified oranges.

    PubMed

    Weiss, Julia; Ros-Chumillas, Maria; Peña, Leandro; Egea-Cortines, Marcos

    2007-01-30

    Recombinant DNA technology is an important tool in the development of plant varieties with new favourable features. There is strong opposition towards this technology due to the potential risk of horizontal gene transfer between genetically modified plant material and food-associated bacteria, especially if genes for antibiotic resistance are involved. Since horizontal transfer efficiency depends on size and length of homologous sequences, we investigated the effect of conditions required for orange juice processing on the stability of DNA from three different origins: plasmid DNA, yeast genomic DNA and endogenous genomic DNA from transgenic sweet orange (C. sinensis L. Osb.). Acidic orange juice matrix had a strong degrading effect on plasmid DNA which becomes apparent in a conformation change from supercoiled structure to nicked, linear structure within 5h of storage at 4 degrees C. Genomic yeast DNA was degraded during exposure to acidic orange juice matrix within 4 days, and also the genomic DNA of C. sinensis suffered degradation within 2 days of storage as indicated by amplification results from transgene markers. Standard pasteurization procedures affected DNA integrity depending on the method and time used. Our data show that the current standard industrial procedures to pasteurize orange juice as well as its acidic nature causes a strong degradation of both yeast and endogenous genomic DNA below sizes reported to be suitable for horizontal gene transfer.

  17. DNA Scrunching in the Packaging of Viral Genomes.

    PubMed

    Waters, James T; Kim, Harold D; Gumbart, James C; Lu, Xiang-Jun; Harvey, Stephen C

    2016-07-07

    The motors that drive double-stranded DNA (dsDNA) genomes into viral capsids are among the strongest of all biological motors for which forces have been measured, but it is not known how they generate force. We previously proposed that the DNA is not a passive substrate but that it plays an active role in force generation. This "scrunchworm hypothesis" holds that the motor proteins repeatedly dehydrate and rehydrate the DNA, which then undergoes cyclic shortening and lengthening motions. These are captured by a coupled protein-DNA grip-and-release cycle to rectify the motion and translocate the DNA into the capsid. In this study, we examined the interactions of dsDNA with the dodecameric connector protein of bacteriophage ϕ29, using molecular dynamics simulations on four different DNA sequences, starting from two different conformations (A-DNA and B-DNA). In all four simulations starting with the protein equilibrated with A-DNA in the channel, we observed transitions to a common, metastable, highly scrunched conformation, designated A*. This conformation is very similar to one recently reported by Kumar and Grubmüller in much longer MD simulations on B-DNA docked into the ϕ29 connector. These results are significant for four reasons. First, the scrunched conformations occur spontaneously, without requiring lever-like protein motions often believed to be necessary for DNA translocation. Second, the transition takes place within the connector, providing the location of the putative "dehydrator". Third, the protein has more contacts with one strand of the DNA than with the other; the former was identified in single-molecule laser tweezer experiments as the "load-bearing strand". Finally, the spontaneity of the DNA-protein interaction suggests that it may play a role in the initial docking of DNA in motors like that of T4 that can load and package any sequence.

  18. Identification of Poxvirus Genome Uncoating and DNA Replication Factors with Mutually Redundant Roles.

    PubMed

    Liu, Baoming; Panda, Debasis; Mendez-Rios, Jorge D; Ganesan, Sundar; Wyatt, Linda S; Moss, Bernard

    2018-04-01

    Genome uncoating is essential for replication of most viruses. For poxviruses, the process is divided into two stages: removal of the envelope, allowing early gene expression, and breaching of the core wall, allowing DNA release, replication, and late gene expression. Subsequent studies showed that the host proteasome and the viral D5 protein, which has an essential role in DNA replication, are required for vaccinia virus (VACV) genome uncoating. In a search for additional VACV uncoating proteins, we noted a report that described a defect in DNA replication and late expression when the gene encoding a 68-kDa ankyrin repeat/F-box protein (68k-ank), associated with the cellular SCF (Skp1, cullin1, F-box-containing complex) ubiquitin ligase complex, was deleted from the attenuated modified vaccinia virus Ankara (MVA). Here we showed that the 68k-ank deletion mutant exhibited diminished genome uncoating, formation of DNA prereplication sites, and degradation of viral cores as well as an additional, independent defect in DNA synthesis. Deletion of the 68k-ank homolog of VACV strain WR, however, was without effect, suggesting the existence of compensating genes. By inserting VACV genes into an MVA 68k-ank deletion mutant, we discovered that M2, a member of the poxvirus immune evasion (PIE) domain superfamily and a regulator of NF-κB, and C5, a member of the BTB/Kelch superfamily associated with cullin-3-based ligase complexes, independently rescued the 68k-ank deletion phenotype. Thus, poxvirus uncoating and DNA replication are intertwined processes involving at least three viral proteins with mutually redundant functions in addition to D5. IMPORTANCE Poxviruses comprise a family of large DNA viruses that infect vertebrates and invertebrates and cause diseases of medical and zoological importance. Poxviruses, unlike most other DNA viruses, replicate in the cytoplasm, and their large genomes usually encode 200 or more proteins with diverse functions. About 90 genes may

  19. The pathological consequences of impaired genome integrity in humans; disorders of the DNA replication machinery.

    PubMed

    O'Driscoll, Mark

    2017-01-01

    Accurate and efficient replication of the human genome occurs in the context of an array of constitutional barriers, including regional topological constraints imposed by chromatin architecture and processes such as transcription, catenation of the helical polymer and spontaneously generated DNA lesions, including base modifications and strand breaks. DNA replication is fundamentally important for tissue development and homeostasis; differentiation programmes are intimately linked with stem cell division. Unsurprisingly, impairments of the DNA replication machinery can have catastrophic consequences for genome stability and cell division. Functional impacts on DNA replication and genome stability have long been known to play roles in malignant transformation through a variety of complex mechanisms, and significant further insights have been gained from studying model organisms in this context. Congenital hypomorphic defects in components of the DNA replication machinery have been and continue to be identified in humans. These disorders present with a wide range of clinical features. Indeed, in some instances, different mutations in the same gene underlie different clinical presentations. Understanding the origin and molecular basis of these features opens a window onto the range of developmental impacts of suboptimal DNA replication and genome instability in humans. Here, I will briefly overview the basic steps involved in DNA replication and the key concepts that have emerged from this area of research, before switching emphasis to the pathological consequences of defects within the DNA replication network; the human disorders. Copyright © 2016 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd. Copyright © 2016 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.

  20. Comprehensive analysis of genome-wide DNA methylation across human polycystic ovary syndrome ovary granulosa cell.

    PubMed

    Xu, Jiawei; Bao, Xiao; Peng, Zhaofeng; Wang, Linlin; Du, Linqing; Niu, Wenbin; Sun, Yingpu

    2016-05-10

    Polycystic ovary syndrome (PCOS) affects approximately 7% of the reproductive-age women. A growing body of evidence indicated that epigenetic mechanisms contributed to the development of PCOS. The role of DNA modification in human PCOS ovary granulosa cell is still unknown in PCOS progression. Global DNA methylation and hydroxymethylation were detected between PCOS' and controls' granulosa cell. Genome-wide DNA methylation was profiled to investigate the putative function of DNA methylaiton. Selected genes expressions were analyzed between PCOS' and controls' granulosa cell. Our results showed that the granulosa cell global DNA methylation of PCOS patients was significant higher than the controls'. The global DNA hydroxymethylation showed low level and no statistical difference between PCOS and control. 6936 differentially methylated CpG sites were identified between control and PCOS-obesity. 12245 differential methylated CpG sites were detected between control and PCOS-nonobesity group. 5202 methylated CpG sites were significantly differential between PCOS-obesity and PCOS-nonobesity group. Our results showed that DNA methylation not hydroxymethylation altered genome-wide in PCOS granulosa cell. The different methylation genes were enriched in development protein, transcription factor activity, alternative splicing, sequence-specific DNA binding and embryonic morphogenesis. YWHAQ, NCF2, DHRS9 and SCNA were up-regulation in PCOS-obesity patients with no significance different between control and PCOS-nonobesity patients, which may be activated by lower DNA methylaiton. Global and genome-wide DNA methylation alteration may contribute to different genes expression and PCOS clinical pathology.

  1. Digital droplet multiple displacement amplification (ddMDA) for whole genome sequencing of limited DNA samples

    DOE PAGES

    Rhee, Minsoung; Light, Yooli K.; Meagher, Robert J.; ...

    2016-05-04

    Here, multiple displacement amplification (MDA) is a widely used technique for amplification of DNA from samples containing limited amounts of DNA (e.g., uncultivable microbes or clinical samples) before whole genome sequencing. Despite its advantages of high yield and fidelity, it suffers from high amplification bias and non-specific amplification when amplifying sub-nanogram of template DNA. Here, we present a microfluidic digital droplet MDA (ddMDA) technique where partitioning of the template DNA into thousands of sub-nanoliter droplets, each containing a small number of DNA fragments, greatly reduces the competition among DNA fragments for primers and polymerase thereby greatly reducing amplification bias. Consequently,more » the ddMDA approach enabled a more uniform coverage of amplification over the entire length of the genome, with significantly lower bias and non-specific amplification than conventional MDA. For a sample containing 0.1 pg/μL of E. coli DNA (equivalent of ~3/1000 of an E. coli genome per droplet), ddMDA achieves a 65-fold increase in coverage in de novo assembly, and more than 20-fold increase in specificity (percentage of reads mapping to E. coli) compared to the conventional tube MDA. ddMDA offers a powerful method useful for many applications including medical diagnostics, forensics, and environmental microbiology.« less

  2. Digital droplet multiple displacement amplification (ddMDA) for whole genome sequencing of limited DNA samples

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rhee, Minsoung; Light, Yooli K.; Meagher, Robert J.

    Here, multiple displacement amplification (MDA) is a widely used technique for amplification of DNA from samples containing limited amounts of DNA (e.g., uncultivable microbes or clinical samples) before whole genome sequencing. Despite its advantages of high yield and fidelity, it suffers from high amplification bias and non-specific amplification when amplifying sub-nanogram of template DNA. Here, we present a microfluidic digital droplet MDA (ddMDA) technique where partitioning of the template DNA into thousands of sub-nanoliter droplets, each containing a small number of DNA fragments, greatly reduces the competition among DNA fragments for primers and polymerase thereby greatly reducing amplification bias. Consequently,more » the ddMDA approach enabled a more uniform coverage of amplification over the entire length of the genome, with significantly lower bias and non-specific amplification than conventional MDA. For a sample containing 0.1 pg/μL of E. coli DNA (equivalent of ~3/1000 of an E. coli genome per droplet), ddMDA achieves a 65-fold increase in coverage in de novo assembly, and more than 20-fold increase in specificity (percentage of reads mapping to E. coli) compared to the conventional tube MDA. ddMDA offers a powerful method useful for many applications including medical diagnostics, forensics, and environmental microbiology.« less

  3. A simple and rapid method for isolation of high quality genomic DNA from fruit trees and conifers using PVP.

    PubMed

    Kim, C S; Lee, C H; Shin, J S; Chung, Y S; Hyung, N I

    1997-03-01

    Because DNA degradation is mediated by secondary plant products such as phenolic terpenoids, the isolation of high quality DNA from plants containing a high content of polyphenolics has been a difficult problem. We demonstrate an easy extraction process by modifying several existing ones. Using this process we have found it possible to isolate DNAs from four fruit trees, grape (Vitis spp.), apple (Malus spp.), pear (Pyrus spp.) and persimmon (Diospyros spp.) and four species of conifer, Pinus densiflora, Pinus koraiensis,Taxus cuspidata and Juniperus chinensis within a few hours. Compared with the existing method, we have isolated high quality intact DNAs (260/280 = 1.8-2.0) routinely yielding 250-500 ng/microl (total 7.5-15 microg DNA from four to five tissue discs).

  4. A simple and rapid method for isolation of high quality genomic DNA from fruit trees and conifers using PVP.

    PubMed Central

    Kim, C S; Lee, C H; Shin, J S; Chung, Y S; Hyung, N I

    1997-01-01

    Because DNA degradation is mediated by secondary plant products such as phenolic terpenoids, the isolation of high quality DNA from plants containing a high content of polyphenolics has been a difficult problem. We demonstrate an easy extraction process by modifying several existing ones. Using this process we have found it possible to isolate DNAs from four fruit trees, grape (Vitis spp.), apple (Malus spp.), pear (Pyrus spp.) and persimmon (Diospyros spp.) and four species of conifer, Pinus densiflora, Pinus koraiensis,Taxus cuspidata and Juniperus chinensis within a few hours. Compared with the existing method, we have isolated high quality intact DNAs (260/280 = 1.8-2.0) routinely yielding 250-500 ng/microl (total 7.5-15 microg DNA from four to five tissue discs). PMID:9023124

  5. Shotgun Bisulfite Sequencing of the Betula platyphylla Genome Reveals the Tree’s DNA Methylation Patterning

    PubMed Central

    Su, Chang; Wang, Chao; He, Lin; Yang, Chuanping; Wang, Yucheng

    2014-01-01

    DNA methylation plays a critical role in the regulation of gene expression. Most studies of DNA methylation have been performed in herbaceous plants, and little is known about the methylation patterns in tree genomes. In the present study, we generated a map of methylated cytosines at single base pair resolution for Betula platyphylla (white birch) by bisulfite sequencing combined with transcriptomics to analyze DNA methylation and its effects on gene expression. We obtained a detailed view of the function of DNA methylation sequence composition and distribution in the genome of B. platyphylla. There are 34,460 genes in the whole genome of birch, and 31,297 genes are methylated. Conservatively, we estimated that 14.29% of genomic cytosines are methylcytosines in birch. Among the methylation sites, the CHH context accounts for 48.86%, and is the largest proportion. Combined transcriptome and methylation analysis showed that the genes with moderate methylation levels had higher expression levels than genes with high and low methylation. In addition, methylated genes are highly enriched for the GO subcategories of binding activities, catalytic activities, cellular processes, response to stimulus and cell death, suggesting that methylation mediates these pathways in birch trees. PMID:25514241

  6. The value of new genome references.

    PubMed

    Worley, Kim C; Richards, Stephen; Rogers, Jeffrey

    2017-09-15

    Genomic information has become a ubiquitous and almost essential aspect of biological research. Over the last 10-15 years, the cost of generating sequence data from DNA or RNA samples has dramatically declined and our ability to interpret those data increased just as remarkably. Although it is still possible for biologists to conduct interesting and valuable research on species for which genomic data are not available, the impact of having access to a high quality whole genome reference assembly for a given species is nothing short of transformational. Research on a species for which we have no DNA or RNA sequence data is restricted in fundamental ways. In contrast, even access to an initial draft quality genome (see below for definitions) opens a wide range of opportunities that are simply not available without that reference genome assembly. Although a complete discussion of the impact of genome sequencing and assembly is beyond the scope of this short paper, the goal of this review is to summarize the most common and highest impact contributions that whole genome sequencing and assembly has had on comparative and evolutionary biology. Copyright © 2016. Published by Elsevier Inc.

  7. DNA capture and next-generation sequencing can recover whole mitochondrial genomes from highly degraded samples for human identification

    PubMed Central

    2013-01-01

    Background Mitochondrial DNA (mtDNA) typing can be a useful aid for identifying people from compromised samples when nuclear DNA is too damaged, degraded or below detection thresholds for routine short tandem repeat (STR)-based analysis. Standard mtDNA typing, focused on PCR amplicon sequencing of the control region (HVS I and HVS II), is limited by the resolving power of this short sequence, which misses up to 70% of the variation present in the mtDNA genome. Methods We used in-solution hybridisation-based DNA capture (using DNA capture probes prepared from modern human mtDNA) to recover mtDNA from post-mortem human remains in which the majority of DNA is both highly fragmented (<100 base pairs in length) and chemically damaged. The method ‘immortalises’ the finite quantities of DNA in valuable extracts as DNA libraries, which is followed by the targeted enrichment of endogenous mtDNA sequences and characterisation by next-generation sequencing (NGS). Results We sequenced whole mitochondrial genomes for human identification from samples where standard nuclear STR typing produced only partial profiles or demonstrably failed and/or where standard mtDNA hypervariable region sequences lacked resolving power. Multiple rounds of enrichment can substantially improve coverage and sequencing depth of mtDNA genomes from highly degraded samples. The application of this method has led to the reliable mitochondrial sequencing of human skeletal remains from unidentified World War Two (WWII) casualties approximately 70 years old and from archaeological remains (up to 2,500 years old). Conclusions This approach has potential applications in forensic science, historical human identification cases, archived medical samples, kinship analysis and population studies. In particular the methodology can be applied to any case, involving human or non-human species, where whole mitochondrial genome sequences are required to provide the highest level of maternal lineage discrimination

  8. Genomic variation in parthenogenetic lizard Darevskia armeniaca: evidence from DNA fingerprinting data.

    PubMed

    Malysheva, D N; Tokarskaya, Olga N; Petrosyan, Varos G; Danielyan, Felix D; Darevsky, Iliya S; Ryskov, Alexei P

    2007-01-01

    Microsatellites, or short tandem repeats, are abundant across genomes of most organisms. It is evident that the most straightforward and conclusive way of studying mutations in microsatellite-containing loci is to use clonally transmitted genomes or DNA sequences inherited in multigeneration pedigrees. At present, little is known about the origin of genetic variation in species that lack effective genetic recombination. DNA fingerprinting in 43 families of the parthenogenetic lizard species Darevskia armeniaca (131 siblings), using (GACA)(4), (GGCA)(4), (GATA)(4), and (CAC)(5) probes, revealed mutant fingerprints in siblings that differed from their mothers in several restriction DNA fragments. In some cases, the mutant fingerprints detected in siblings were also found in population samples. The mutation rate for new restriction fragment length estimated by using multilocus probes varied from 0.8 x 10(-2) to 4.9 x 10(-2) per band/per sibling. Probably, the most variations detected as restriction fragment length polymorphism have germ-line origin, but somatic changes of (CAC)(n) fingerprints in adult lizards were also observed. These results provide new evidence of existing unstable regions in genomes of parthenogenetic vertebrate animals, which provide genetic variation in unisexual populations.

  9. Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

    Treesearch

    Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

    2011-01-01

    Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...

  10. A one-dimensional statistical mechanics model for nucleosome positioning on genomic DNA.

    PubMed

    Tesoro, S; Ali, I; Morozov, A N; Sulaiman, N; Marenduzzo, D

    2016-02-12

    The first level of folding of DNA in eukaryotes is provided by the so-called '10 nm chromatin fibre', where DNA wraps around histone proteins (∼10 nm in size) to form nucleosomes, which go on to create a zig-zagging bead-on-a-string structure. In this work we present a one-dimensional statistical mechanics model to study nucleosome positioning within one such 10 nm fibre. We focus on the case of genomic sheep DNA, and we start from effective potentials valid at infinite dilution and determined from high-resolution in vitro salt dialysis experiments. We study positioning within a polynucleosome chain, and compare the results for genomic DNA to that obtained in the simplest case of homogeneous DNA, where the problem can be mapped to a Tonks gas. First, we consider the simple, analytically solvable, case where nucleosomes are assumed to be point-like. Then, we perform numerical simulations to gauge the effect of their finite size on the nucleosomal distribution probabilities. Finally we compare nucleosome distributions and simulated nuclease digestion patterns for the two cases (homogeneous and sheep DNA), thereby providing testable predictions of the effect of sequence on experimentally observable quantities in experiments on polynucleosome chromatin fibres reconstituted in vitro.

  11. Coevolution between Nuclear-Encoded DNA Replication, Recombination, and Repair Genes and Plastid Genome Complexity

    PubMed Central

    Zhang, Jin; Ruhlman, Tracey A.; Sabir, Jamal S. M.; Blazier, John Chris; Weng, Mao-Lun; Park, Seongjun; Jansen, Robert K.

    2016-01-01

    Disruption of DNA replication, recombination, and repair (DNA-RRR) systems has been hypothesized to cause highly elevated nucleotide substitution rates and genome rearrangements in the plastids of angiosperms, but this theory remains untested. To investigate nuclear–plastid genome (plastome) coevolution in Geraniaceae, four different measures of plastome complexity (rearrangements, repeats, nucleotide insertions/deletions, and substitution rates) were evaluated along with substitution rates of 12 nuclear-encoded, plastid-targeted DNA-RRR genes from 27 Geraniales species. Significant correlations were detected for nonsynonymous (dN) but not synonymous (dS) substitution rates for three DNA-RRR genes (uvrB/C, why1, and gyrA) supporting a role for these genes in accelerated plastid genome evolution in Geraniaceae. Furthermore, correlation between dN of uvrB/C and plastome complexity suggests the presence of nucleotide excision repair system in plastids. Significant correlations were also detected between plastome complexity and 13 of the 90 nuclear-encoded organelle-targeted genes investigated. Comparisons revealed significant acceleration of dN in plastid-targeted genes of Geraniales relative to Brassicales suggesting this correlation may be an artifact of elevated rates in this gene set in Geraniaceae. Correlation between dN of plastid-targeted DNA-RRR genes and plastome complexity supports the hypothesis that the aberrant patterns in angiosperm plastome evolution could be caused by dysfunction in DNA-RRR systems. PMID:26893456

  12. Comparative Genomics of Amphibian-like Ranaviruses, Nucleocytoplasmic Large DNA Viruses of Poikilotherms

    PubMed Central

    Price, Stephen J.

    2015-01-01

    Recent research on genome evolution of large DNA viruses has highlighted a number of incredibly dynamic processes that can facilitate rapid adaptation. The genomes of amphibian-like ranaviruses – double-stranded DNA viruses infecting amphibians, reptiles, and fish (family Iridoviridae) – were examined to assess variation in genome content and evolutionary processes. The viruses studied were closely related, but their genome content varied considerably, with 29 genes identified that were not present in all of the major clades. Twenty-one genes had evidence of recombination, while a virus isolated from a captive reptile appeared to be a mosaic of two divergent parents. Positive selection was also found to be acting on more than a quarter of Ranavirus genes and was found most frequently in the Spanish common midwife toad virus, which has had a severe impact on amphibian host communities. Efforts to resolve the root of this group by inclusion of an outgroup were inconclusive, but a set of core genes were identified, which recovered a well-supported species tree. PMID:27812275

  13. Fourteen-Genome Comparison Identifies DNA Markers for Severe-Disease-Associated Strains of Clostridium difficile▿†

    PubMed Central

    Forgetta, Vincenzo; Oughton, Matthew T.; Marquis, Pascale; Brukner, Ivan; Blanchette, Ruth; Haub, Kevin; Magrini, Vince; Mardis, Elaine R.; Gerding, Dale N.; Loo, Vivian G.; Miller, Mark A.; Mulvey, Michael R.; Rupnik, Maja; Dascal, Andre; Dewar, Ken

    2011-01-01

    Clostridium difficile is a common cause of infectious diarrhea in hospitalized patients. A severe and increased incidence of C. difficile infection (CDI) is associated predominantly with the NAP1 strain; however, the existence of other severe-disease-associated (SDA) strains and the extensive genetic diversity across C. difficile complicate reliable detection and diagnosis. Comparative genome analysis of 14 sequenced genomes, including those of a subset of NAP1 isolates, allowed the assessment of genetic diversity within and between strain types to identify DNA markers that are associated with severe disease. Comparative genome analysis of 14 isolates, including five publicly available strains, revealed that C. difficile has a core genome of 3.4 Mb, comprising ∼3,000 genes. Analysis of the core genome identified candidate DNA markers that were subsequently evaluated using a multistrain panel of 177 isolates, representing more than 50 pulsovars and 8 toxinotypes. A subset of 117 isolates from the panel had associated patient data that allowed assessment of an association between the DNA markers and severe CDI. We identified 20 candidate DNA markers for species-wide detection and 10,683 single nucleotide polymorphisms (SNPs) associated with the predominant SDA strain (NAP1). A species-wide detection candidate marker, the sspA gene, was found to be the same across 177 sequenced isolates and lacked significant similarity to those of other species. Candidate SNPs in genes CD1269 and CD1265 were found to associate more closely with disease severity than currently used diagnostic markers, as they were also present in the toxin A-negative and B-positive (A-B+) strain types. The genetic markers identified illustrate the potential of comparative genomics for the discovery of diagnostic DNA-based targets that are species specific or associated with multiple SDA strains. PMID:21508155

  14. PCR-fingerprint profiles of mitochondrial and genomic DNA extracted from Fetus cervi using different extraction methods.

    PubMed

    Ai, Jinxia; Wang, Xuesong; Gao, Lijun; Xia, Wei; Li, Mingcheng; Yuan, Guangxin; Niu, Jiamu; Zhang, Lihua

    2017-11-01

    The use of Fetus cervi, which is derived from the embryo and placenta of Cervus Nippon Temminck or Cervs elaphus Linnaeus, has been documented for a long time in China. There are abundant species of deer worldwide. Those recorded by China Pharmacopeia (2010 edition) from all the species were either authentic or adulterants/counterfeits. Identification of their origins or authenticity became a key in the preparation of the authentic products. The traditional SDS alkaline lysis and salt-outing methods were modified to extract mt DNA and genomic DNA from fresh and dry Fetus cervi in addition to Fetus from false animals, respectively. A set of primers were designed by bioinformatics to target the intra-and inter-variation. The mt DNA and genomic DNA extracted from Fetus cervi using the two methods meet the requirement for authenticity. Extraction of mt DNA by SDS alkaline lysis is more practical and accurate than extraction of genomic DNA by salt-outing method. There were differences in length and number of segments amplified by PCR between mt DNA from authentic Fetus cervi and false animals Fetus. The distinctive PCR-fingerprint patterns can distinguish the Fetus cervi from adulterants and counterfeit animal Fetus.

  15. Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes.

    PubMed

    Cer, Regina Z; Bruce, Kevin H; Mudunuri, Uma S; Yi, Ming; Volfovsky, Natalia; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M

    2011-01-01

    Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov.

  16. Evidence of pervasive biologically functional secondary structures within the genomes of eukaryotic single-stranded DNA viruses.

    PubMed

    Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y F; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie; Martin, Darren Patrick

    2014-02-01

    Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here.

  17. Evidence of Pervasive Biologically Functional Secondary Structures within the Genomes of Eukaryotic Single-Stranded DNA Viruses

    PubMed Central

    Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y. F.; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie

    2014-01-01

    Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here. PMID:24284329

  18. A genome-specific repetitive DNA sequence from Oryza eichingeri: characterization, localization, and introgression to O. sativa.

    PubMed

    Yan, H. H.; Liu, G. Q.; Cheng, Z. K.; Li, X. B.; Liu, G. Z.; Min, S. K.; Zhu, L.H.

    2002-02-01

    In the course of transferring the brown planthopper resistance from a diploid, CC-genome wild rice species, Oryza eichingeri (IRGC acc. 105159 and 105163), to the cultivated rice variety 02428, we have isolated many alien addition and introgression lines. The O. eichingeri chromatin in some of these lines has previously been identified using genomic in situ hybridization and molecular-marker analysis. Here we cloned a tandemly repetitive DNA sequence from O. eichingeri IRGC acc105163, and detected it in 25 introgression lines. This repetitive DNA sequence showed high specificity to the rice CC genome, but was absent from all the four tetraploid species with BBCC or CCDD genomes. The monomer in this repetitive DNA sequence is 325-366-bp long, with a copy number of about 5,000 per 1 C of the O. eichingerigenome, showing 88% homology to a repetitive DNA sequence isolated from Oryza officinalis(2n=2 x=24, CC). Fluorescent in situ hybridization revealed 11 signals distributed over eight O. eichingeri chromosomes, mostly in terminal or subterminal regions.

  19. Aprataxin resolves adenylated RNA–DNA junctions to maintain genome integrity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tumbale, Percy; Williams, Jessica S.; Schellenberg, Matthew J.

    2013-12-22

    Faithful maintenance and propagation of eukaryotic genomes is ensured by three-step DNA ligation reactions used by ATP-dependent DNA ligases. Paradoxically, when DNA ligases encounter nicked DNA structures with abnormal DNA termini, DNA ligase catalytic activity can generate and/or exacerbate DNA damage through abortive ligation that produces chemically adducted, toxic 5'-adenylated (5'-AMP) DNA lesions. Aprataxin (APTX) reverses DNA adenylation but the context for deadenylation repair is unclear. Here we examine the importance of APTX to RNase-H2-dependent excision repair (RER) of a lesion that is very frequently introduced into DNA, a ribonucleotide. We show that ligases generate adenylated 5' ends containing amore » ribose characteristic of RNase H2 incision. APTX efficiently repairs adenylated RNA–DNA, and acting in an RNA–DNA damage response (RDDR), promotes cellular survival and prevents S-phase checkpoint activation in budding yeast undergoing RER. Structure–function studies of human APTX–RNA–DNA–AMP–Zn complexes define a mechanism for detecting and reversing adenylation at RNA–DNA junctions. This involves A-form RNA binding, proper protein folding and conformational changes, all of which are affected by heritable APTX mutations in ataxia with oculomotor apraxia 1. Together, these results indicate that accumulation of adenylated RNA–DNA may contribute to neurological disease.« less

  20. Synthetic biology. Genomically encoded analog memory with precise in vivo DNA writing in living cell populations.

    PubMed

    Farzadfard, Fahim; Lu, Timothy K

    2014-11-14

    Cellular memory is crucial to many natural biological processes and sophisticated synthetic biology applications. Existing cellular memories rely on epigenetic switches or recombinases, which are limited in scalability and recording capacity. In this work, we use the DNA of living cell populations as genomic "tape recorders" for the analog and distributed recording of long-term event histories. We describe a platform for generating single-stranded DNA (ssDNA) in vivo in response to arbitrary transcriptional signals. When coexpressed with a recombinase, these intracellularly expressed ssDNAs target specific genomic DNA addresses, resulting in precise mutations that accumulate in cell populations as a function of the magnitude and duration of the inputs. This platform could enable long-term cellular recorders for environmental and biomedical applications, biological state machines, and enhanced genome engineering strategies. Copyright © 2014, American Association for the Advancement of Science.

  1. The Conjugative Relaxase TrwC Promotes Integration of Foreign DNA in the Human Genome.

    PubMed

    González-Prieto, Coral; Gabriel, Richard; Dehio, Christoph; Schmidt, Manfred; Llosa, Matxalen

    2017-06-15

    Bacterial conjugation is a mechanism of horizontal DNA transfer. The relaxase TrwC of the conjugative plasmid R388 cleaves one strand of the transferred DNA at the oriT gene, covalently attaches to it, and leads the single-stranded DNA (ssDNA) into the recipient cell. In addition, TrwC catalyzes site-specific integration of the transferred DNA into its target sequence present in the genome of the recipient bacterium. Here, we report the analysis of the efficiency and specificity of the integrase activity of TrwC in human cells, using the type IV secretion system of the human pathogen Bartonella henselae to introduce relaxase-DNA complexes. Compared to Mob relaxase from plasmid pBGR1, we found that TrwC mediated a 10-fold increase in the rate of plasmid DNA transfer to human cells and a 100-fold increase in the rate of chromosomal integration of the transferred DNA. We used linear amplification-mediated PCR and plasmid rescue to characterize the integration pattern in the human genome. DNA sequence analysis revealed mostly reconstituted oriT sequences, indicating that TrwC is active and recircularizes transferred DNA in human cells. One TrwC-mediated site-specific integration event was detected, proving that TrwC is capable of mediating site-specific integration in the human genome, albeit with very low efficiency compared to the rate of random integration. Our results suggest that TrwC may stabilize the plasmid DNA molecules in the nucleus of the human cell, probably by recircularization of the transferred DNA strand. This stabilization would increase the opportunities for integration of the DNA by the host machinery. IMPORTANCE Different biotechnological applications, including gene therapy strategies, require permanent modification of target cells. Long-term expression is achieved either by extrachromosomal persistence or by integration of the introduced DNA. Here, we studied the utility of conjugative relaxase TrwC, a bacterial protein with site

  2. High Quality Genomic Copy Number Data from Archival Formalin-Fixed Paraffin-Embedded Leiomyosarcoma: Optimisation of Universal Linkage System Labelling

    PubMed Central

    Salawu, Abdulazeez; Ul-Hassan, Aliya; Hammond, David; Fernando, Malee; Reed, Malcolm; Sisley, Karen

    2012-01-01

    Most soft tissue sarcomas are characterized by genetic instability and frequent genomic copy number aberrations that are not subtype-specific. Oligonucleotide microarray-based Comparative Genomic Hybridisation (array CGH) is an important technique used to map genome-wide copy number aberrations, but the traditional requirement for high-quality DNA typically obtained from fresh tissue has limited its use in sarcomas. Although large archives of Formalin-fixed Paraffin-embedded (FFPE) tumour samples are available for research, the degradative effects of formalin on DNA from these tissues has made labelling and analysis by array CGH technically challenging. The Universal Linkage System (ULS) may be used for a one-step chemical labelling of such degraded DNA. We have optimised the ULS labelling protocol to perform aCGH on archived FFPE leiomyosarcoma tissues using the 180k Agilent platform. Preservation age of samples ranged from a few months to seventeen years and the DNA showed a wide range of degradation (when visualised on agarose gels). Consistently high DNA labelling efficiency and low microarray probe-to-probe variation (as measured by the derivative log ratio spread) was seen. Comparison of paired fresh and FFPE samples from identical tumours showed good correlation of CNAs detected. Furthermore, the ability to macro-dissect FFPE samples permitted the detection of CNAs that were masked in fresh tissue. Aberrations were visually confirmed using Fluorescence in situ Hybridisation. These results suggest that archival FFPE tissue, with its relative abundance and attendant clinical data may be used for effective mapping for genomic copy number aberrations in such rare tumours as leiomyosarcoma and potentially unravel clues to tumour origins, progression and ultimately, targeted treatment. PMID:23209738

  3. The Genomic Impact of DNA CpG Methylation on Gene Expression; Relationships in Prostate Cancer.

    PubMed

    Long, Mark D; Smiraglia, Dominic J; Campbell, Moray J

    2017-02-14

    The process of DNA CpG methylation has been extensively investigated for over 50 years and revealed associations between changing methylation status of CpG islands and gene expression. As a result, DNA CpG methylation is implicated in the control of gene expression in developmental and homeostasis processes, as well as being a cancer-driver mechanism. The development of genome-wide technologies and sophisticated statistical analytical approaches has ushered in an era of widespread analyses, for example in the cancer arena, of the relationships between altered DNA CpG methylation, gene expression, and tumor status. The remarkable increase in the volume of such genomic data, for example, through investigators from the Cancer Genome Atlas (TCGA), has allowed dissection of the relationships between DNA CpG methylation density and distribution, gene expression, and tumor outcome. In this manner, it is now possible to test that the genome-wide correlations are measurable between changes in DNA CpG methylation and gene expression. Perhaps surprisingly is that these associations can only be detected for hundreds, but not thousands, of genes, and the direction of the correlations are both positive and negative. This, perhaps, suggests that CpG methylation events in cancer systems can act as disease drivers but the effects are possibly more restricted than suspected. Additionally, the positive and negative correlations suggest direct and indirect events and an incomplete understanding. Within the prostate cancer TCGA cohort, we examined the relationships between expression of genes that control DNA methylation, known targets of DNA methylation and tumor status. This revealed that genes that control the synthesis of S -adenosyl-l-methionine (SAM) associate with altered expression of DNA methylation targets in a subset of aggressive tumors.

  4. Comparative chloroplast genomes of eleven Schima (Theaceae) species: Insights into DNA barcoding and phylogeny.

    PubMed

    Yu, Xiang-Qin; Drew, Bryan T; Yang, Jun-Bo; Gao, Lian-Ming; Li, De-Zhu

    2017-01-01

    Schima is an ecologically and economically important woody genus in tea family (Theaceae). Unresolved species delimitations and phylogenetic relationships within Schima limit our understanding of the genus and hinder utilization of the genus for economic purposes. In the present study, we conducted comparative analysis among the complete chloroplast (cp) genomes of 11 Schima species. Our results indicate that Schima cp genomes possess a typical quadripartite structure, with conserved genomic structure and gene order. The size of the Schima cp genome is about 157 kilo base pairs (kb). They consistently encode 114 unique genes, including 80 protein-coding genes, 30 tRNAs, and 4 rRNAs, with 17 duplicated in the inverted repeat (IR). These cp genomes are highly conserved and do not show obvious expansion or contraction of the IR region. The percent variability of the 68 coding and 93 noncoding (>150 bp) fragments is consistently less than 3%. The seven most widely touted DNA barcode regions as well as one promising barcode candidate showed low sequence divergence. Eight mutational hotspots were identified from the 11 cp genomes. These hotspots may potentially be useful as specific DNA barcodes for species identification of Schima. The 58 cpSSR loci reported here are complementary to the microsatellite markers identified from the nuclear genome, and will be leveraged for further population-level studies. Phylogenetic relationships among the 11 Schima species were resolved with strong support based on the cp genome data set, which corresponds well with the species distribution pattern. The data presented here will serve as a foundation to facilitate species identification, DNA barcoding and phylogenetic reconstructions for future exploration of Schima.

  5. Genome-wide alteration in DNA hydroxymethylation in the sperm from bisphenol A-exposed men

    PubMed Central

    Li, De-kun; Yang, Fen; Pan, Hongjie; Li, Tianqi; Miao, Maohua; Li, Runsheng; Yuan, Wei

    2017-01-01

    Environmental BPA exposure has been shown to impact human sperm concentration and motility, as well as rodent spermatogenesis. However, it is unclear whether BPA exposure is associated with alteration in DNA hydroxymethylation, a marker for epigenetic modification, in human sperm. A genome-wide DNA hydroxymethylation study was performed using sperm samples of men who were occupationally exposed to BPA. Compared with controls who had no occupational BPA exposure, the total levels of 5-hydroxymethylcytosine (5hmc) increased significantly (19.37% increase) in BPA-exposed men, with 72.69% of genome regions harboring 5hmc. A total of 9,610 differential 5hmc regions (DhMRs) were revealed in BPA-exposed men relative to controls, which were mainly located in intergenic and intron regions. These DhMRs were composed of 8,670 hyper-hMRs and 940 hypo-hMRs, affecting 2,008 genes and the repetitive elements. The hyper-hMRs affected genes were enriched in pathways associated with nervous system, development, cardiovascular diseases and signal transduction. Additionally, enrichment of 5hmc was observed in the promoters of eight maternally expressed imprinted genes in BPA-exposed sperm. Some of the BPA-affected genes, for example, MLH1, CHD2, SPATA12 and SPATA20 might participate in the response to DNA damage in germ cells caused by BPA. Our analysis showed that enrichment of 5hmc both in promoters and gene bodies is higher in the genes whose expression has been detected in human sperm than those whose expression is absent. Importantly, we observed that BPA exposure affected the 5hmc level in 11.4% of these genes expressed in sperm, and in 6.85% of the sperm genome. Finally, we also observed that BPA exposure tends to change the 5hmc enrichment in the genes which was previously reported to be distributed with the trimethylated Histone 3 (H3K27me3, H3K4me2 or H3K4me3) in sperm. Thus, these results suggest that BPA exposure likely interferes with gene expression via affecting DNA

  6. Genome-wide alteration in DNA hydroxymethylation in the sperm from bisphenol A-exposed men.

    PubMed

    Zheng, Huajun; Zhou, Xiaoyu; Li, De-Kun; Yang, Fen; Pan, Hongjie; Li, Tianqi; Miao, Maohua; Li, Runsheng; Yuan, Wei

    2017-01-01

    Environmental BPA exposure has been shown to impact human sperm concentration and motility, as well as rodent spermatogenesis. However, it is unclear whether BPA exposure is associated with alteration in DNA hydroxymethylation, a marker for epigenetic modification, in human sperm. A genome-wide DNA hydroxymethylation study was performed using sperm samples of men who were occupationally exposed to BPA. Compared with controls who had no occupational BPA exposure, the total levels of 5-hydroxymethylcytosine (5hmc) increased significantly (19.37% increase) in BPA-exposed men, with 72.69% of genome regions harboring 5hmc. A total of 9,610 differential 5hmc regions (DhMRs) were revealed in BPA-exposed men relative to controls, which were mainly located in intergenic and intron regions. These DhMRs were composed of 8,670 hyper-hMRs and 940 hypo-hMRs, affecting 2,008 genes and the repetitive elements. The hyper-hMRs affected genes were enriched in pathways associated with nervous system, development, cardiovascular diseases and signal transduction. Additionally, enrichment of 5hmc was observed in the promoters of eight maternally expressed imprinted genes in BPA-exposed sperm. Some of the BPA-affected genes, for example, MLH1, CHD2, SPATA12 and SPATA20 might participate in the response to DNA damage in germ cells caused by BPA. Our analysis showed that enrichment of 5hmc both in promoters and gene bodies is higher in the genes whose expression has been detected in human sperm than those whose expression is absent. Importantly, we observed that BPA exposure affected the 5hmc level in 11.4% of these genes expressed in sperm, and in 6.85% of the sperm genome. Finally, we also observed that BPA exposure tends to change the 5hmc enrichment in the genes which was previously reported to be distributed with the trimethylated Histone 3 (H3K27me3, H3K4me2 or H3K4me3) in sperm. Thus, these results suggest that BPA exposure likely interferes with gene expression via affecting DNA

  7. The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome.

    PubMed

    Albayrak, Levent; Khanipov, Kamil; Pimenova, Maria; Golovko, George; Rojas, Mark; Pavlidis, Ioannis; Chumakov, Sergei; Aguilar, Gerardo; Chávez, Arturo; Widger, William R; Fofanov, Yuriy

    2016-12-12

    Low-abundance mutations in mitochondrial populations (mutations with minor allele frequency ≤ 1%), are associated with cancer, aging, and neurodegenerative disorders. While recent progress in high-throughput sequencing technology has significantly improved the heteroplasmy identification process, the ability of this technology to detect low-abundance mutations can be affected by the presence of similar sequences originating from nuclear DNA (nDNA). To determine to what extent nDNA can cause false positive low-abundance heteroplasmy calls, we have identified mitochondrial locations of all subsequences that are common or similar (one mismatch allowed) between nDNA and mitochondrial DNA (mtDNA). Performed analysis revealed up to a 25-fold variation in the lengths of longest common and longest similar (one mismatch allowed) subsequences across the mitochondrial genome. The size of the longest subsequences shared between nDNA and mtDNA in several regions of the mitochondrial genome were found to be as low as 11 bases, which not only allows using these regions to design new, very specific PCR primers, but also supports the hypothesis of the non-random introduction of mtDNA into the human nuclear DNA. Analysis of the mitochondrial locations of the subsequences shared between nDNA and mtDNA suggested that even very short (36 bases) single-end sequencing reads can be used to identify low-abundance variation in 20.4% of the mitochondrial genome. For longer (76 and 150 bases) reads, the proportion of the mitochondrial genome where nDNA presence will not interfere found to be 44.5 and 67.9%, when low-abundance mutations at 100% of locations can be identified using 417 bases long single reads. This observation suggests that the analysis of low-abundance variations in mitochondria population can be extended to a variety of large data collections such as NCBI Sequence Read Archive, European Nucleotide Archive, The Cancer Genome Atlas, and International Cancer Genome

  8. DNA aptamers against FokI nuclease domain for genome editing applications.

    PubMed

    Nishio, Maui; Matsumoto, Daisuke; Kato, Yoshio; Abe, Koichi; Lee, Jinhee; Tsukakoshi, Kaori; Yamagishi, Ayana; Nakamura, Chikashi; Ikebukuro, Kazunori

    2017-07-15

    Genome editing with site-specific nucleases (SSNs) can modify only the target gene and may be effective for gene therapy. The main limitation of genome editing for clinical use is off-target effects; excess SSNs in the cells and their longevity can contribute to off-target effects. Therefore, a controlled delivery system for SSNs is necessary. FokI nuclease domain (FokI) is a common DNA cleavage domain in zinc finger nuclease (ZFN) and transcription activator-like effector nuclease. Previously, we reported a zinc finger protein delivery system that combined aptamer-fused, double-strand oligonucleotides and nanoneedles. Here, we report the development of DNA aptamers that bind to the target molecules, with high affinity and specificity to the FokI. DNA aptamers were selected in six rounds of systematic evolution of ligands by exponential enrichment. Aptamers F6#8 and #71, which showed high binding affinity to FokI (K d =82nM, 74nM each), showed resistance to nuclease activity itself and did not inhibit nuclease activity. We immobilized the ZFN-fused GFP to nanoneedles through these aptamers and inserted the nanoneedles into HEK293 cells. We observed the release of ZFN-fused GFP from the nanoneedles in the presence of cells. Therefore, these aptamers are useful for genome editing applications such as controlled delivery of SSNs. Copyright © 2016 Elsevier B.V. All rights reserved.

  9. Non-Homologous End Joining and Homology Directed DNA Repair Frequency of Double-Stranded Breaks Introduced by Genome Editing Reagents.

    PubMed

    Zaboikin, Michail; Zaboikina, Tatiana; Freter, Carl; Srinivasakumar, Narasimhachar

    2017-01-01

    Genome editing using transcription-activator like effector nucleases or RNA guided nucleases allows one to precisely engineer desired changes within a given target sequence. The genome editing reagents introduce double stranded breaks (DSBs) at the target site which can then undergo DNA repair by non-homologous end joining (NHEJ) or homology directed recombination (HDR) when a template DNA molecule is available. NHEJ repair results in indel mutations at the target site. As PCR amplified products from mutant target regions are likely to exhibit different melting profiles than PCR products amplified from wild type target region, we designed a high resolution melting analysis (HRMA) for rapid identification of efficient genome editing reagents. We also designed TaqMan assays using probes situated across the cut site to discriminate wild type from mutant sequences present after genome editing. The experiments revealed that the sensitivity of the assays to detect NHEJ-mediated DNA repair could be enhanced by selection of transfected cells to reduce the contribution of unmodified genomic DNA from untransfected cells to the DNA melting profile. The presence of donor template DNA lacking the target sequence at the time of genome editing further enhanced the sensitivity of the assays for detection of mutant DNA molecules by excluding the wild-type sequences modified by HDR. A second TaqMan probe that bound to an adjacent site, outside of the primary target cut site, was used to directly determine the contribution of HDR to DNA repair in the presence of the donor template sequence. The TaqMan qPCR assay, designed to measure the contribution of NHEJ and HDR in DNA repair, corroborated the results from HRMA. The data indicated that genome editing reagents can produce DSBs at high efficiency in HEK293T cells but a significant proportion of these are likely masked by reversion to wild type as a result of HDR. Supplying a donor plasmid to provide a template for HDR (that

  10. Isolation of "Caenorhabditis elegans" Genomic DNA and Detection of Deletions in the "unc-93" Gene Using PCR

    ERIC Educational Resources Information Center

    Lissemore, James L.; Lackner, Laura L.; Fedoriw, George D.; De Stasio, Elizabeth A.

    2005-01-01

    PCR, genomic DNA isolation, and agarose gel electrophoresis are common molecular biology techniques with a wide range of applications. Therefore, we have developed a series of exercises employing these techniques for an intermediate level undergraduate molecular biology laboratory course. In these exercises, students isolate genomic DNA from the…

  11. DHX9 helicase is involved in preventing genomic instability induced by alternatively structured DNA in human cells

    PubMed Central

    Jain, Aklank; Bacolla, Albino; del Mundo, Imee M.; Zhao, Junhua; Wang, Guliang; Vasquez, Karen M.

    2013-01-01

    Sequences that have the capacity to adopt alternative (i.e. non-B) DNA structures in the human genome have been implicated in stimulating genomic instability. Previously, we found that a naturally occurring intra-molecular triplex (H-DNA) caused genetic instability in mammals largely in the form of DNA double-strand breaks. Thus, it is of interest to determine the mechanism(s) involved in processing H-DNA. Recently, we demonstrated that human DHX9 helicase preferentially unwinds inter-molecular triplex DNA in vitro. Herein, we used a mutation-reporter system containing H-DNA to examine the relevance of DHX9 activity on naturally occurring H-DNA structures in human cells. We found that H-DNA significantly increased mutagenesis in small-interfering siRNA-treated, DHX9-depleted cells, affecting mostly deletions. Moreover, DHX9 associated with H-DNA in the context of supercoiled plasmids. To further investigate the role of DHX9 in the recognition/processing of H-DNA, we performed binding assays in vitro and chromatin immunoprecipitation assays in U2OS cells. DHX9 recognized H-DNA, as evidenced by its binding to the H-DNA structure and enrichment at the H-DNA region compared with a control region in human cells. These composite data implicate DHX9 in processing H-DNA structures in vivo and support its role in the overall maintenance of genomic stability at sites of alternatively structured DNA. PMID:24049074

  12. DHX9 helicase is involved in preventing genomic instability induced by alternatively structured DNA in human cells.

    PubMed

    Jain, Aklank; Bacolla, Albino; Del Mundo, Imee M; Zhao, Junhua; Wang, Guliang; Vasquez, Karen M

    2013-12-01

    Sequences that have the capacity to adopt alternative (i.e. non-B) DNA structures in the human genome have been implicated in stimulating genomic instability. Previously, we found that a naturally occurring intra-molecular triplex (H-DNA) caused genetic instability in mammals largely in the form of DNA double-strand breaks. Thus, it is of interest to determine the mechanism(s) involved in processing H-DNA. Recently, we demonstrated that human DHX9 helicase preferentially unwinds inter-molecular triplex DNA in vitro. Herein, we used a mutation-reporter system containing H-DNA to examine the relevance of DHX9 activity on naturally occurring H-DNA structures in human cells. We found that H-DNA significantly increased mutagenesis in small-interfering siRNA-treated, DHX9-depleted cells, affecting mostly deletions. Moreover, DHX9 associated with H-DNA in the context of supercoiled plasmids. To further investigate the role of DHX9 in the recognition/processing of H-DNA, we performed binding assays in vitro and chromatin immunoprecipitation assays in U2OS cells. DHX9 recognized H-DNA, as evidenced by its binding to the H-DNA structure and enrichment at the H-DNA region compared with a control region in human cells. These composite data implicate DHX9 in processing H-DNA structures in vivo and support its role in the overall maintenance of genomic stability at sites of alternatively structured DNA.

  13. Automated Processing of 2-D Gel Electrophoretograms of Genomic DNA for Hunting Pathogenic DNA Molecular Changes.

    PubMed

    Takahashi; Nakazawa; Watanabe; Konagaya

    1999-01-01

    We have developed the automated processing algorithms for 2-dimensional (2-D) electrophoretograms of genomic DNA based on RLGS (Restriction Landmark Genomic Scanning) method, which scans the restriction enzyme recognition sites as the landmark and maps them onto a 2-D electrophoresis gel. Our powerful processing algorithms realize the automated spot recognition from RLGS electrophoretograms and the automated comparison of a huge number of such images. In the final stage of the automated processing, a master spot pattern, on which all the spots in the RLGS images are mapped at once, can be obtained. The spot pattern variations which seemed to be specific to the pathogenic DNA molecular changes can be easily detected by simply looking over the master spot pattern. When we applied our algorithms to the analysis of 33 RLGS images derived from human colon tissues, we successfully detected several colon tumor specific spot pattern changes.

  14. Qualitative and quantitative assessment of DNA quality of frozen beef based on DNA yield, gel electrophoresis and PCR amplification and their correlations to beef quality.

    PubMed

    Zhao, Jing; Zhang, Ting; Liu, Yongfeng; Wang, Xingyu; Zhang, Lan; Ku, Ting; Quek, Siew Young

    2018-09-15

    Freezing is a practical method for meat preservation but the quality of frozen meat can deteriorate with storage time. This research investigated the effect of frozen storage time (up to 66 months) on changes in DNA yield, purity and integrity in beef, and further analyzed the correlation between beef quality (moisture content, protein content, TVB-N value and pH value) and DNA quality in an attempt to establish a reliable, high-throughput method for meat quality control. Results showed that frozen storage time influenced the yield and integrity of DNA significantly (p < 0.05). The DNA yield decreased as frozen storage time increased due to DNA degradation. The half-life (t 1/2  = ln2/0.015) was calculated as 46 months. The DNA quality degraded dramatically with the increased storage time based on gel electrophoresis results. Polymerase chain reaction (PCR) products from both mitochondrial DNA (mtDNA) and nuclear DNA (nDNA) were observed in all frozen beef samples. Using real-time PCR for quantitative assessment of DNA and meat quality revealed that correlations could be established successfully with mathematical models to evaluate frozen beef quality. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. Few mitochondrial DNA sequences are inserted into the turkey (Meleagris gallopavo) nuclear genome: evolutionary analyses and informativity in the domestic lineage.

    PubMed

    Schiavo, G; Strillacci, M G; Ribani, A; Bovo, S; Roman-Ponce, S I; Cerolini, S; Bertolini, F; Bagnato, A; Fontanesi, L

    2018-06-01

    Mitochondrial DNA (mtDNA) insertions have been detected in the nuclear genome of many eukaryotes. These sequences are pseudogenes originated by horizontal transfer of mtDNA fragments into the nuclear genome, producing nuclear DNA sequences of mitochondrial origin (numt). In this study we determined the frequency and distribution of mtDNA-originated pseudogenes in the turkey (Meleagris gallopavo) nuclear genome. The turkey reference genome (Turkey_2.01) was aligned with the reference linearized mtDNA sequence using last. A total of 32 numt sequences (corresponding to 18 numt regions derived by unique insertional events) were identified in the turkey nuclear genome (size ranging from 66 to 1415 bp; identity against the modern turkey mtDNA corresponding region ranging from 62% to 100%). Numts were distributed in nine chromosomes and in one scaffold. They derived from parts of 10 mtDNA protein-coding genes, ribosomal genes, the control region and 10 tRNA genes. Seven numt regions reported in the turkey genome were identified in orthologues positions in the Gallus gallus genome and therefore were present in the ancestral genome that in the Cretaceous originated the lineages of the modern crown Galliformes. Five recently integrated turkey numts were validated by PCR in 168 turkeys of six different domestic populations. None of the analysed numts were polymorphic (i.e. absence of the inserted sequence, as reported in numts of recent integration in other species), suggesting that the reticulate speciation model is not useful for explaining the origin of the domesticated turkey lineage. © 2018 Stichting International Foundation for Animal Genetics.

  16. Impacts of Chromatin States and Long-Range Genomic Segments on Aging and DNA Methylation

    PubMed Central

    Sun, Dan; Yi, Soojin V.

    2015-01-01

    Understanding the fundamental dynamics of epigenome variation during normal aging is critical for elucidating key epigenetic alterations that affect development, cell differentiation and diseases. Advances in the field of aging and DNA methylation strongly support the aging epigenetic drift model. Although this model aligns with previous studies, the role of other epigenetic marks, such as histone modification, as well as the impact of sampling specific CpGs, must be evaluated. Ultimately, it is crucial to investigate how all CpGs in the human genome change their methylation with aging in their specific genomic and epigenomic contexts. Here, we analyze whole genome bisulfite sequencing DNA methylation maps of brain frontal cortex from individuals of diverse ages. Comparisons with blood data reveal tissue-specific patterns of epigenetic drift. By integrating chromatin state information, divergent degrees and directions of aging-associated methylation in different genomic regions are revealed. Whole genome bisulfite sequencing data also open a new door to investigate whether adjacent CpG sites exhibit coordinated DNA methylation changes with aging. We identified significant ‘aging-segments’, which are clusters of nearby CpGs that respond to aging by similar DNA methylation changes. These segments not only capture previously identified aging-CpGs but also include specific functional categories of genes with implications on epigenetic regulation of aging. For example, genes associated with development are highly enriched in positive aging segments, which are gradually hyper-methylated with aging. On the other hand, regions that are gradually hypo-methylated with aging (‘negative aging segments’) in the brain harbor genes involved in metabolism and protein ubiquitination. Given the importance of protein ubiquitination in proteome homeostasis of aging brains and neurodegenerative disorders, our finding suggests the significance of epigenetic regulation of this

  17. A periodic pattern of SNPs in the human genome

    PubMed Central

    Madsen, Bo Eskerod; Villesen, Palle; Wiuf, Carsten

    2007-01-01

    By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing or alignment errors, for example, transposable elements (SINE, LINE, and LTR), tandem repeats, and large duplicated regions. However, we found that the pattern is almost entirely confined to what we define as “periodic DNA.” Periodic DNA is a genomic region with a high degree of periodicity in nucleotide usage. It turned out that periodic DNA is mainly small regions (average length 16.9 bp), widely distributed in the genome. Furthermore, periodic DNA has a 1.8 times higher SNP density than the rest of the genome and SNPs inside periodic DNA have a significantly higher genotyping error rate than SNPs outside periodic DNA. Our results suggest that not all SNPs in the human genome are created by independent single nucleotide mutations, and that care should be taken in analysis of SNPs from periodic DNA. The latter may have important consequences for SNP and association studies. PMID:17673700

  18. Myeloperoxidase-produced Genomic DNA-centered Radicals and Protection by Resveratrol

    EPA Science Inventory

    Myeloperoxidase (MPO) released by activated neutrophils, production of hypochlorous acid (HOCI) and oxidation of the genomic DNA in epithelial cells is thought to initiate and promote carcinogenesis. In this study we applied the 5,5-dimethyl-l-pyrroline N-oxide (DMPO)-based i;nmu...

  19. Rhipicephalus microplus dataset of nonredundant raw sequence reads from 454 GS FLX sequencing of Cot-selected (Cot = 660) genomic DNA

    USDA-ARS?s Scientific Manuscript database

    A reassociation kinetics-based approach was used to reduce the complexity of genomic DNA from the Deutsch laboratory strain of the cattle tick, Rhipicephalus microplus, to facilitate genome sequencing. Selected genomic DNA (Cot value = 660) was sequenced using 454 GS FLX technology, resulting in 356...

  20. Coevolution between Nuclear-Encoded DNA Replication, Recombination, and Repair Genes and Plastid Genome Complexity.

    PubMed

    Zhang, Jin; Ruhlman, Tracey A; Sabir, Jamal S M; Blazier, John Chris; Weng, Mao-Lun; Park, Seongjun; Jansen, Robert K

    2016-02-17

    Disruption of DNA replication, recombination, and repair (DNA-RRR) systems has been hypothesized to cause highly elevated nucleotide substitution rates and genome rearrangements in the plastids of angiosperms, but this theory remains untested. To investigate nuclear-plastid genome (plastome) coevolution in Geraniaceae, four different measures of plastome complexity (rearrangements, repeats, nucleotide insertions/deletions, and substitution rates) were evaluated along with substitution rates of 12 nuclear-encoded, plastid-targeted DNA-RRR genes from 27 Geraniales species. Significant correlations were detected for nonsynonymous (dN) but not synonymous (dS) substitution rates for three DNA-RRR genes (uvrB/C, why1, and gyrA) supporting a role for these genes in accelerated plastid genome evolution in Geraniaceae. Furthermore, correlation between dN of uvrB/C and plastome complexity suggests the presence of nucleotide excision repair system in plastids. Significant correlations were also detected between plastome complexity and 13 of the 90 nuclear-encoded organelle-targeted genes investigated. Comparisons revealed significant acceleration of dN in plastid-targeted genes of Geraniales relative to Brassicales suggesting this correlation may be an artifact of elevated rates in this gene set in Geraniaceae. Correlation between dN of plastid-targeted DNA-RRR genes and plastome complexity supports the hypothesis that the aberrant patterns in angiosperm plastome evolution could be caused by dysfunction in DNA-RRR systems. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  1. Deep Investigation of Arabidopsis thaliana Junk DNA Reveals a Continuum between Repetitive Elements and Genomic Dark Matter

    PubMed Central

    Maumus, Florian; Quesneville, Hadi

    2014-01-01

    Eukaryotic genomes contain highly variable amounts of DNA with no apparent function. This so-called junk DNA is composed of two components: repeated and repeat-derived sequences (together referred to as the repeatome), and non-annotated sequences also known as genomic dark matter. Because of their high duplication rates as compared to other genomic features, transposable elements are predominant contributors to the repeatome and the products of their decay is thought to be a major source of genomic dark matter. Determining the origin and composition of junk DNA is thus important to help understanding genome evolution as well as host biology. In this study, we have used a combination of tools enabling to show that the repeatome from the small and reducing A. thaliana genome is significantly larger than previously thought. Furthermore, we present the concepts and results from a series of innovative approaches suggesting that a significant amount of the A. thaliana dark matter is of repetitive origin. As a tentative standard for the community, we propose a deep compendium annotation of the A. thaliana repeatome that may help addressing farther genome evolution as well as transcriptional and epigenetic regulation in this model plant. PMID:24709859

  2. Repair of DNA double-strand breaks by templated nucleotide sequence insertions derived from distant regions of the genome.

    PubMed

    Onozawa, Masahiro; Zhang, Zhenhua; Kim, Yoo Jung; Goldberg, Liat; Varga, Tamas; Bergsagel, P Leif; Kuehl, W Michael; Aplan, Peter D

    2014-05-27

    We used the I-SceI endonuclease to produce DNA double-strand breaks (DSBs) and observed that a fraction of these DSBs were repaired by insertion of sequences, which we termed "templated sequence insertions" (TSIs), derived from distant regions of the genome. These TSIs were derived from genic, retrotransposon, or telomere sequences and were not deleted from the donor site in the genome, leading to the hypothesis that they were derived from reverse-transcribed RNA. Cotransfection of RNA and an I-SceI expression vector demonstrated insertion of RNA-derived sequences at the DNA-DSB site, and TSIs were suppressed by reverse-transcriptase inhibitors. Both observations support the hypothesis that TSIs were derived from RNA templates. In addition, similar insertions were detected at sites of DNA DSBs induced by transcription activator-like effector nuclease proteins. Whole-genome sequencing of myeloma cell lines revealed additional TSIs, demonstrating that repair of DNA DSBs via insertion was not restricted to experimentally produced DNA DSBs. Analysis of publicly available databases revealed that many of these TSIs are polymorphic in the human genome. Taken together, these results indicate that insertional events should be considered as alternatives to gross chromosomal rearrangements in the interpretation of whole-genome sequence data and that this mutagenic form of DNA repair may play a role in genetic disease, exon shuffling, and mammalian evolution.

  3. Toxic reagents and expensive equipment: are they really necessary for the extraction of good quality fungal DNA?

    PubMed

    Rodrigues, P; Venâncio, A; Lima, N

    2018-01-01

    The aim of this work was to evaluate a fungal DNA extraction procedure with the lowest inputs in terms of time as well as of expensive and toxic chemicals, but able to consistently produce genomic DNA of good quality for PCR purposes. Two types of fungal biological material were tested - mycelium and conidia - combined with two protocols for DNA extraction using Sodium Dodecyl Sulphate (SDS) and Cetyl Trimethyl Ammonium Bromide as extraction buffers and glass beads for mechanical disruption of cell walls. Our results showed that conidia and SDS buffer was the combination that lead to the best DNA quality and yield, with the lowest variation between samples. This study clearly demonstrates that it is possible to obtain high yield and pure DNA from pigmented conidia without the use of strong cell disrupting procedures and of toxic reagents. There are numerous methods for DNA extraction from fungi. Some rely on expensive commercial kits and/or equipments, unavailable for many laboratories, or make use of toxic chemicals such as chloroform, phenol and mercaptoethanol. This study clearly demonstrates that it is possible to obtain high yields of pure DNA from pigmented conidia without the use of strong and expensive cell disrupting procedures and of toxic reagents. The method herein described is simultaneously inexpensive and adequate to DNA extraction from several different types of fungi. © 2017 The Society for Applied Microbiology.

  4. Reconstitution of wild type viral DNA in simian cells transfected with early and late SV40 defective genomes.

    PubMed

    O'Neill, F J; Gao, Y; Xu, X

    1993-11-01

    The DNAs of polyomaviruses ordinarily exist as a single circular molecule of approximately 5000 base pairs. Variants of SV40, BKV and JCV have been described which contain two complementing defective DNA molecules. These defectives, which form a bipartite genome structure, contain either the viral early region or the late region. The defectives have the unique property of being able to tolerate variable sized reiterations of regulatory and terminus region sequences, and portions of the coding region. They can also exchange coding region sequences with other polyomaviruses. It has been suggested that the bipartite genome structure might be a stage in the evolution of polyomaviruses which can uniquely sustain genome and sequence diversity. However, it is not known if the regulatory and terminus region sequences are highly mutable. Also, it is not known if the bipartite genome structure is reversible and what the conditions might be which would favor restoration of the monomolecular genome structure. We addressed the first question by sequencing the reiterated regulatory and terminus regions of E- and L-SV40 DNAs. This revealed a large number of mutations in the regulatory regions of the defective genomes, including deletions, insertions, rearrangements and base substitutions. We also detected insertions and base substitutions in the T-antigen gene. We addressed the second question by introducing into permissive simian cells, E- and L-SV40 genomes which had been engineered to contain only a single regulatory region. Analysis of viral DNA from transfected cells demonstrated recombined genomes containing a wild type monomolecular DNA structure. However, the complete defectives, containing reiterated regulatory regions, could often compete away the wild type genomes. The recombinant monomolecular genomes were isolated, cloned and found to be infectious. All of the DNA alterations identified in one of the regulatory regions of E-SV40 DNA were present in the recombinant

  5. GeneCount: genome-wide calculation of absolute tumor DNA copy numbers from array comparative genomic hybridization data

    PubMed Central

    Lyng, Heidi; Lando, Malin; Brøvig, Runar S; Svendsrud, Debbie H; Johansen, Morten; Galteland, Eivind; Brustugun, Odd T; Meza-Zepeda, Leonardo A; Myklebost, Ola; Kristensen, Gunnar B; Hovig, Eivind; Stokke, Trond

    2008-01-01

    Absolute tumor DNA copy numbers can currently be achieved only on a single gene basis by using fluorescence in situ hybridization (FISH). We present GeneCount, a method for genome-wide calculation of absolute copy numbers from clinical array comparative genomic hybridization data. The tumor cell fraction is reliably estimated in the model. Data consistent with FISH results are achieved. We demonstrate significant improvements over existing methods for exploring gene dosages and intratumor copy number heterogeneity in cancers. PMID:18500990

  6. Development and validation of an rDNA operon based primer walking strategy applicable to de novo bacterial genome finishing

    PubMed Central

    Eastman, Alexander W.; Yuan, Ze-Chun

    2015-01-01

    Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms has necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provided a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing projects. PMID

  7. EG-13GENOME-WIDE METHYLATION ANALYSIS IDENTIFIES GENOMIC DNA DEMETHYLATION DURING MALIGNANT PROGRESSION OF GLIOMAS

    PubMed Central

    Saito, Kuniaki; Mukasa, Akitake; Nagae, Genta; Aihara, Koki; Otani, Ryohei; Takayanagi, Shunsaku; Omata, Mayu; Tanaka, Shota; Shibahara, Junji; Takahashi, Miwako; Momose, Toshimitsu; Shimamura, Teppei; Miyano, Satoru; Narita, Yoshitaka; Ueki, Keisuke; Nishikawa, Ryo; Nagane, Motoo; Aburatani, Hiroyuki; Saito, Nobuhito

    2014-01-01

    Low-grade gliomas often undergo malignant progression, and these transformations are a leading cause of death in patients with low-grade gliomas. However, the molecular mechanisms underlying malignant tumor progression are still not well understood. Recent evidence indicates that epigenetic deregulation is an important cause of gliomagenesis; therefore, we examined the impact of epigenetic changes during malignant progression of low-grade gliomas. Specifically, we used the Illumina Infinium Human Methylation 450K BeadChip to perform genome-wide DNA methylation analysis of 120 gliomas and four normal brains. This study sample included 25 matched-pairs of initial low-grade gliomas and recurrent tumors (temporal heterogeneity) and 20 of the 25 recurring tumors recurred as malignant progressions, and one matched-pair of newly emerging malignant lesions and pre-existing lesions (spatial heterogeneity). Analyses of methylation profiles demonstrated that most low-grade gliomas in our sample (43/51; 84%) had a CpG island methylator phenotype (G-CIMP). Remarkably, approximately 50% of secondary glioblastomas that had progressed from low-grade tumors with the G-CIMP status exhibited a characteristic partial demethylation of genomic DNA during malignant progression, but other recurrent gliomas showed no apparent change in DNA methylation pattern. Interestingly, we found that most loci that were demethylated during malignant progression were located outside of CpG islands. The information of histone modifications patterns in normal human astrocytes and embryonal stem cells also showed that the ratio of active marks at the site corresponding to DNA demethylated loci in G-CIMP-demethylated tumors was significantly lower; this finding indicated that most demethylated loci in G-CIMP-demethylated tumors were likely transcriptionally inactive. A small number of the genes that were upregulated and had demethylated CpG islands were associated with cell cycle-related pathway. In

  8. QUAST: quality assessment tool for genome assemblies.

    PubMed

    Gurevich, Alexey; Saveliev, Vladislav; Vyahhi, Nikolay; Tesler, Glenn

    2013-04-15

    Limitations of genome sequencing techniques have led to dozens of assembly algorithms, none of which is perfect. A number of methods for comparing assemblers have been developed, but none is yet a recognized benchmark. Further, most existing methods for comparing assemblies are only applicable to new assemblies of finished genomes; the problem of evaluating assemblies of previously unsequenced species has not been adequately considered. Here, we present QUAST-a quality assessment tool for evaluating and comparing genome assemblies. This tool improves on leading assembly comparison software with new ideas and quality metrics. QUAST can evaluate assemblies both with a reference genome, as well as without a reference. QUAST produces many reports, summary tables and plots to help scientists in their research and in their publications. In this study, we used QUAST to compare several genome assemblers on three datasets. QUAST tables and plots for all of them are available in the Supplementary Material, and interactive versions of these reports are on the QUAST website. http://bioinf.spbau.ru/quast . Supplementary data are available at Bioinformatics online.

  9. Profiling the genome-wide DNA methylation pattern of porcine ovaries using reduced representation bisulfite sequencing.

    PubMed

    Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi

    2016-02-25

    Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5' and 3' flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed obviously from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome.

  10. On the roles of repetitive DNA elements in the context of a unified genomic-epigenetic system.

    PubMed

    von Sternberg, Richard

    2002-12-01

    Repetitive DNA sequences comprise a substantial portion of most eukaryotic and some prokaryotic chromosomes. Despite nearly forty years of research, the functions of various sequence families as a whole and their monomer units remain largely unknown. The inability to map specific functional roles onto many repetitive DNA elements (REs), coupled with the taxon-specificity of sequence families, have led many to speculate that these genomic components are "selfish" replicators generating genomic "junk." The purpose of this paper is to critically examine the selfishness, evolutionary effects, and functionality of REs. First, a brief overview of the range of ideas pertaining to RE function is presented. Second, the argument is presented that the selfish DNA "hypothesis" is actually a narrative scheme, that it serves to protect neo-Darwinian assumptions from criticism, and that this story is untestable and therefore not a hypothesis. Third, attempts to synthesize the selfish DNA concept with complex systems models of the genome and RE functionality are critiqued. Fourth, the supposed connection between RE-induced mutations and macroevolutionary events are stated to be at variance with empirical evidence and theoretical considerations. Hypotheses that base phylogenetic transitions in repetitive sequence changes thus remain speculative. Fifth and finally, the case is made for viewing REs as integrally functional components of chromosomes, genomes, and cells. It is argued throughout that a new conceptual framework is needed for understanding the roles of repetitive DNA in genomic/epigenetic systems, and that neo-Darwinian "narratives" have been the primary obstacle to elucidating the effects of these enigmatic components of chromosomes.

  11. Evaluation of next generation mtGenome sequencing using the Ion Torrent Personal Genome Machine (PGM)☆

    PubMed Central

    Parson, Walther; Strobl, Christina; Huber, Gabriela; Zimmermann, Bettina; Gomes, Sibylle M.; Souto, Luis; Fendt, Liane; Delport, Rhena; Langit, Reina; Wootton, Sharon; Lagacé, Robert; Irwin, Jodi

    2013-01-01

    Insights into the human mitochondrial phylogeny have been primarily achieved by sequencing full mitochondrial genomes (mtGenomes). In forensic genetics (partial) mtGenome information can be used to assign haplotypes to their phylogenetic backgrounds, which may, in turn, have characteristic geographic distributions that would offer useful information in a forensic case. In addition and perhaps even more relevant in the forensic context, haplogroup-specific patterns of mutations form the basis for quality control of mtDNA sequences. The current method for establishing (partial) mtDNA haplotypes is Sanger-type sequencing (STS), which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing (NGS) technologies, the body of available mtDNA data can potentially be extended much more quickly and cost-efficiently. Customized chemistries, laboratory workflows and data analysis packages could support the community and increase the utility of mtDNA analysis in forensics. We have evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing. A total of 64 mtGenomes (>1 million bases) were established that yielded high concordance with the corresponding STS haplotypes (<0.02% differences). About two-thirds of the differences were observed in or around homopolymeric sequence stretches. In addition, the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of alignment software would be desirable to facilitate the application of NGS in mtDNA forensic genetics. PMID:23948325

  12. Structural rearrangements in the mitochondrial genome of Drosophila melanogaster induced by elevated levels of the replicative DNA helicase

    PubMed Central

    Ciesielski, Grzegorz L; Nadalutti, Cristina A; Oliveira, Marcos T; Griffith, Jack D; Kaguni, Laurie S

    2018-01-01

    Abstract Pathological conditions impairing functions of mitochondria often lead to compensatory upregulation of the mitochondrial DNA (mtDNA) replisome machinery, and the replicative DNA helicase appears to be a key factor in regulating mtDNA copy number. Moreover, mtDNA helicase mutations have been associated with structural rearrangements of the mitochondrial genome. To evaluate the effects of elevated levels of the mtDNA helicase on the integrity and replication of the mitochondrial genome, we overexpressed the helicase in Drosophila melanogaster Schneider cells and analyzed the mtDNA by two-dimensional neutral agarose gel electrophoresis and electron microscopy. We found that elevation of mtDNA helicase levels increases the quantity of replication intermediates and alleviates pausing at the replication slow zones. Though we did not observe a concomitant alteration in mtDNA copy number, we observed deletions specific to the segment of repeated elements in the immediate vicinity of the origin of replication, and an accumulation of species characteristic of replication fork stalling. We also found elevated levels of RNA that are retained in the replication intermediates. Together, our results suggest that upregulation of mtDNA helicase promotes the process of mtDNA replication but also results in genome destabilization. PMID:29432582

  13. A new approach for cloning hLIF cDNA from genomic DNA isolated from the oral mucous membrane.

    PubMed

    Cui, Y H; Zhu, G Q; Chen, Q J; Wang, Y F; Yang, M M; Song, Y X; Wang, J G; Cao, B Y

    2011-11-25

    Complementary DNA (cDNA) is valuable for investigating protein structure and function in the study of life science, but it is difficult to obtain by traditional reverse transcription. We employed a novel strategy to clone human leukemia inhibitory factor (hLIF) gene cDNA from genomic DNA, which was directly isolated from the mucous membrane of mouth. The hLIF sequence, which is 609 bp long and is composed of three exons, can be acquired within a few hours by amplifying each exon and splicing all of them using overlap-PCR. This new approach developed is simple, time- and cost-effective, without RNA preparation or cDNA synthesis, and is not limited to the specific tissues for a particular gene and the expression level of the gene.

  14. Using circulating cell-free DNA to monitor personalized cancer therapy.

    PubMed

    Oellerich, Michael; Schütz, Ekkehard; Beck, Julia; Kanzow, Philipp; Plowman, Piers N; Weiss, Glen J; Walson, Philip D

    2017-05-01

    High-quality genomic analysis is critical for personalized pharmacotherapy in patients with cancer. Tumor-specific genomic alterations can be identified in cell-free DNA (cfDNA) from patient blood samples and can complement biopsies for real-time molecular monitoring of treatment, detection of recurrence, and tracking resistance. cfDNA can be especially useful when tumor tissue is unavailable or insufficient for testing. For blood-based genomic profiling, next-generation sequencing (NGS) and droplet digital PCR (ddPCR) have been successfully applied. The US Food and Drug Administration (FDA) recently approved the first such "liquid biopsy" test for EGFR mutations in patients with non-small cell lung cancer (NSCLC). Such non-invasive methods allow for the identification of specific resistance mutations selected by treatment, such as EGFR T790M, in patients with NSCLC treated with gefitinib. Chromosomal aberration pattern analysis by low coverage whole genome sequencing is a more universal approach based on genomic instability. Gains and losses of chromosomal regions have been detected in plasma tumor-specific cfDNA as copy number aberrations and can be used to compute a genomic copy number instability (CNI) score of cfDNA. A specific CNI index obtained by massive parallel sequencing discriminated those patients with prostate cancer from both healthy controls and men with benign prostatic disease. Furthermore, androgen receptor gene aberrations in cfDNA were associated with therapeutic resistance in metastatic castration resistant prostate cancer. Change in CNI score has been shown to serve as an early predictor of response to standard chemotherapy for various other cancer types (e.g. NSCLC, colorectal cancer, pancreatic ductal adenocarcinomas). CNI scores have also been shown to predict therapeutic responses to immunotherapy. Serial genomic profiling can detect resistance mutations up to 16 weeks before radiographic progression. There is a potential for cost savings

  15. High quality draft genome sequences of Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonas cremoricolorata DSM 17059 T type strains

    DOE PAGES

    Peña, Arantxa; Busquets, Antonio; Gomila, Margarita; ...

    2016-09-01

    Pseudomonas has the highest number of species out of any genus of Gram-negative bacteria and is phylogenetically divided into several groups. The Pseudomonas putida phylogenetic branch includes at least 13 species of environmental and industrial interest, plant-associated bacteria, insect pathogens, and even some members that have been found in clinical specimens. In the context of the Genomic Encyclopedia of Bacteria and Archaea project, we present the permanent, high-quality draft genomes of the type strains of 3 taxonomically and ecologically closely related species in the Pseudomonas putida phylogenetic branch: Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonasmore » cremoricolorata DSM 17059T. All three genomes are comparable in size (4.6-4.9Mb), with 4,119-4,459 protein-coding genes. Average nucleotide identity based on BLAST comparisons and digital genome-to-genome distance calculations are in good agreement with experimental DNA-DNA hybridization results. The genome sequences presented here will be very helpful in elucidating the taxonomy, phylogeny and evolution of the Pseudomonas putida species complex.« less

  16. High quality draft genome sequences of Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonas cremoricolorata DSM 17059 T type strains

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peña, Arantxa; Busquets, Antonio; Gomila, Margarita

    Pseudomonas has the highest number of species out of any genus of Gram-negative bacteria and is phylogenetically divided into several groups. The Pseudomonas putida phylogenetic branch includes at least 13 species of environmental and industrial interest, plant-associated bacteria, insect pathogens, and even some members that have been found in clinical specimens. In the context of the Genomic Encyclopedia of Bacteria and Archaea project, we present the permanent, high-quality draft genomes of the type strains of 3 taxonomically and ecologically closely related species in the Pseudomonas putida phylogenetic branch: Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonasmore » cremoricolorata DSM 17059T. All three genomes are comparable in size (4.6-4.9Mb), with 4,119-4,459 protein-coding genes. Average nucleotide identity based on BLAST comparisons and digital genome-to-genome distance calculations are in good agreement with experimental DNA-DNA hybridization results. The genome sequences presented here will be very helpful in elucidating the taxonomy, phylogeny and evolution of the Pseudomonas putida species complex.« less

  17. A genome-wide BAC-end sequence survey provides first insights into sweetpotato (Ipomoea batatas (L.) Lam.) genome composition.

    PubMed

    Si, Zengzhi; Du, Bing; Huo, Jinxi; He, Shaozhen; Liu, Qingchang; Zhai, Hong

    2016-11-21

    Sweetpotato, Ipomoea batatas (L.) Lam., is an important food crop widely grown in the world. However, little is known about the genome of this species because it is a highly heterozygous hexaploid. Gaining a more in-depth knowledge of sweetpotato genome is therefore necessary and imperative. In this study, the first bacterial artificial chromosome (BAC) library of sweetpotato was constructed. Clones from the BAC library were end-sequenced and analyzed to provide genome-wide information about this species. The BAC library contained 240,384 clones with an average insert size of 101 kb and had a 7.93-10.82 × coverage of the genome, and the probability of isolating any single-copy DNA sequence from the library was more than 99%. Both ends of 8310 BAC clones randomly selected from the library were sequenced to generate 11,542 high-quality BAC-end sequences (BESs), with an accumulative length of 7,595,261 bp and an average length of 658 bp. Analysis of the BESs revealed that 12.17% of the sweetpotato genome were known repetitive DNA, including 7.37% long terminal repeat (LTR) retrotransposons, 1.15% Non-LTR retrotransposons and 1.42% Class II DNA transposons etc., 18.31% of the genome were identified as sweetpotato-unique repetitive DNA and 10.00% of the genome were predicted to be coding regions. In total, 3,846 simple sequences repeats (SSRs) were identified, with a density of one SSR per 1.93 kb, from which 288 SSRs primers were designed and tested for length polymorphism using 20 sweetpotato accessions, 173 (60.07%) of them produced polymorphic bands. Sweetpotato BESs had significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum than those of Vitis vinifera, Theobroma cacao and Arabidopsis thaliana. The first BAC library for sweetpotato has been successfully constructed. The high quality BESs provide first insights into sweetpotato genome composition, and have significant hits to the genome

  18. Development of DNA-Free Sediment for Ecological Assays with Genomic Endpoints

    EPA Science Inventory

    Recent advances in genomics are currently being exploited to discern ecological changes that have conventionally been measured using laborious counting techniques. For example, next generation sequencing technologies can be used to create DNA libraries from benthic community ass...

  19. 'Your DNA, Your Say': global survey gathering attitudes toward genomics: design, delivery and methods.

    PubMed

    Middleton, Anna; Niemiec, Emilia; Prainsack, Barbara; Bobe, Jason; Farley, Lauren; Steed, Claire; Smith, James; Bevan, Paul; Bonhomme, Natasha; Kleiderman, Erika; Thorogood, Adrian; Schickhardt, Christoph; Garattini, Chiara; Vears, Danya; Littler, Katherine; Banner, Natalie; Scott, Erick; Kovalevskaya, Nadezda V; Levin, Elissa; Morley, Katherine I; Howard, Heidi C

    2018-06-01

    Our international study, 'Your DNA, Your Say', uses film and an online cross-sectional survey to gather public attitudes toward the donation, access and sharing of DNA information. We describe the methodological approach used to create an engaging and bespoke survey, suitable for translation into many different languages. We address some of the particular challenges in designing a survey on the subject of genomics. In order to understand the significance of a genomic result, researchers and clinicians alike use external databases containing DNA and medical information from thousands of people. We ask how publics would like their 'anonymous' data to be used (or not to be used) and whether they are concerned by the potential risks of reidentification; the results will be used to inform policy.

  20. A Surrogate Approach to Study the Evolution of Noncoding DNA Elements That Organize Eukaryotic Genomes

    PubMed Central

    Vermaak, Danielle; Bayes, Joshua J.

    2009-01-01

    Comparative genomics provides a facile way to address issues of evolutionary constraint acting on different elements of the genome. However, several important DNA elements have not reaped the benefits of this new approach. Some have proved intractable to current day sequencing technology. These include centromeric and heterochromatic DNA, which are essential for chromosome segregation as well as gene regulation, but the highly repetitive nature of the DNA sequences in these regions make them difficult to assemble into longer contigs. Other sequences, like dosage compensation X chromosomal sites, origins of DNA replication, or heterochromatic sequences that encode piwi-associated RNAs, have proved difficult to study because they do not have recognizable DNA features that allow them to be described functionally or computationally. We have employed an alternate approach to the direct study of these DNA elements. By using proteins that specifically bind these noncoding DNAs as surrogates, we can indirectly assay the evolutionary constraints acting on these important DNA elements. We review the impact that such “surrogate strategies” have had on our understanding of the evolutionary constraints shaping centromeres, origins of DNA replication, and dosage compensation X chromosomal sites. These have begun to reveal that in contrast to the view that such structural DNA elements are either highly constrained (under purifying selection) or free to drift (under neutral evolution), some of them may instead be shaped by adaptive evolution and genetic conflicts (these are not mutually exclusive). These insights also help to explain why the same elements (e.g., centromeres and replication origins), which are so complex in some eukaryotic genomes, can be simple and well defined in other where similar conflicts do not exist. PMID:19635763

  1. Toward allotetraploid cotton genome assembly: integration of a high-density molecular genetic linkage map with DNA sequence information

    PubMed Central

    2012-01-01

    Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. Conclusion This study will serve as a valuable genomic resource

  2. Co-opting the Fanconi Anemia Genomic Stability Pathway Enables Herpesvirus DNA Synthesis and Productive Growth

    PubMed Central

    Karttunen, Heidi; Savas, Jeffrey N.; McKinney, Caleb; Chen, Yu-Hung; Yates, John R.; Hukkanen, Veijo; Huang, Tony T.; Mohr, Ian

    2015-01-01

    SUMMARY DNA damage associated with viral DNA synthesis can result in double strand breaks that threaten genome integrity and must be repaired. Here, we establish that the cellular Fanconi Anemia (FA) genomic stability pathway is exploited by HSV1 to promote viral DNA synthesis and enable its productive growth. Potent FA pathway activation in HSV1-infected cells resulted in monoubiquitination of FA effector proteins, FANCI and FANCD2 (FANCI-D2) and required the viral DNA polymerase. FANCD2 relocalized to viral replication compartments and FANCI-D2 interacted with a multi-subunit complex containing the virus-encoded single-stranded DNA-binding protein ICP8. Significantly, while HSV1 productive growth was impaired in monoubiquitination-defective FA patient cells, this restriction was partially surmounted by antagonizing the DNA-dependent protein kinase (DNA-PK), a critical enzyme required for non-homologous end-joining (NHEJ). This identifies the FA-pathway as a new cellular factor required for herpesvirus productive growth and suggests that FA-mediated suppression of NHEJ is a fundamental step in the viral lifecycle. PMID:24954902

  3. Variola virus topoisomerase: DNA cleavage specificity and distribution of sites in Poxvirus genomes.

    PubMed

    Minkah, Nana; Hwang, Young; Perry, Kay; Van Duyne, Gregory D; Hendrickson, Robert; Lefkowitz, Elliot J; Hannenhalli, Sridhar; Bushman, Frederic D

    2007-08-15

    Topoisomerase enzymes regulate superhelical tension in DNA resulting from transcription, replication, repair, and other molecular transactions. Poxviruses encode an unusual type IB topoisomerase that acts only at conserved DNA sequences containing the core pentanucleotide 5'-(T/C)CCTT-3'. In X-ray structures of the variola virus topoisomerase bound to DNA, protein-DNA contacts were found to extend beyond the core pentanucleotide, indicating that the full recognition site has not yet been fully defined in functional studies. Here we report quantitation of DNA cleavage rates for an optimized 13 bp site and for all possible single base substitutions (40 total sites), with the goals of understanding the molecular mechanism of recognition and mapping topoisomerase sites in poxvirus genome sequences. The data allow a precise definition of enzyme-DNA interactions and the energetic contributions of each. We then used the resulting "action matrix" to show that favorable topoisomerase sites are distributed all along the length of poxvirus DNA sequences, consistent with a requirement for local release of superhelical tension in constrained topological domains. In orthopox genomes, an additional central cluster of sites was also evident. A negative correlation of predicted topoisomerase sites was seen relative to early terminators, but no correlation was seen with early or late promoters. These data define the full variola virus topoisomerase recognition site and provide a new window on topoisomerase function in vivo.

  4. Comparative Genomics as a Foundation for Evo-Devo Studies in Birds.

    PubMed

    Grayson, Phil; Sin, Simon Y W; Sackton, Timothy B; Edwards, Scott V

    2017-01-01

    Developmental genomics is a rapidly growing field, and high-quality genomes are a useful foundation for comparative developmental studies. A high-quality genome forms an essential reference onto which the data from numerous assays and experiments, including ChIP-seq, ATAC-seq, and RNA-seq, can be mapped. A genome also streamlines and simplifies the development of primers used to amplify putative regulatory regions for enhancer screens, cDNA probes for in situ hybridization, microRNAs (miRNAs) or short hairpin RNAs (shRNA) for RNA interference (RNAi) knockdowns, mRNAs for misexpression studies, and even guide RNAs (gRNAs) for CRISPR knockouts. Finally, much can be gleaned from comparative genomics alone, including the identification of highly conserved putative regulatory regions. This chapter provides an overview of laboratory and bioinformatics protocols for DNA extraction, library preparation, library quantification, and genome assembly, from fresh or frozen tissue to a draft avian genome. Generating a high-quality draft genome can provide a developmental research group with excellent resources for their study organism, opening the doors to many additional assays and experiments.

  5. Genomic DNA sequence and cytosine methylation changes of adult rice leaves after seeds space flight

    NASA Astrophysics Data System (ADS)

    Shi, Jinming

    In this study, cytosine methylation on CCGG site and genomic DNA sequence changes of adult leaves of rice after seeds space flight were detected by methylation-sensitive amplification polymorphism (MSAP) and Amplified fragment length polymorphism (AFLP) technique respectively. Rice seeds were planted in the trial field after 4 days space flight on the shenzhou-6 Spaceship of China. Adult leaves of space-treated rice including 8 plants chosen randomly and 2 plants with phenotypic mutation were used for AFLP and MSAP analysis. Polymorphism of both DNA sequence and cytosine methylation were detected. For MSAP analysis, the average polymorphic frequency of the on-ground controls, space-treated plants and mutants are 1.3%, 3.1% and 11% respectively. For AFLP analysis, the average polymorphic frequencies are 1.4%, 2.9%and 8%respectively. Total 27 and 22 polymorphic fragments were cloned sequenced from MSAP and AFLP analysis respectively. Nine of the 27 fragments from MSAP analysis show homology to coding sequence. For the 22 polymorphic fragments from AFLP analysis, no one shows homology to mRNA sequence and eight fragments show homology to repeat region or retrotransposon sequence. These results suggest that although both genomic DNA sequence and cytosine methylation status can be effected by space flight, the genomic region homology to the fragments from genome DNA and cytosine methylation analysis were different.

  6. Incidence of genome structure, DNA asymmetry, and cell physiology on T-DNA integration in chromosomes of the phytopathogenic fungus Leptosphaeria maculans.

    PubMed

    Bourras, Salim; Meyer, Michel; Grandaubert, Jonathan; Lapalu, Nicolas; Fudal, Isabelle; Linglin, Juliette; Ollivier, Benedicte; Blaise, Françoise; Balesdent, Marie-Hélène; Rouxel, Thierry

    2012-08-01

    The ever-increasing generation of sequence data is accompanied by unsatisfactory functional annotation, and complex genomes, such as those of plants and filamentous fungi, show a large number of genes with no predicted or known function. For functional annotation of unknown or hypothetical genes, the production of collections of mutants using Agrobacterium tumefaciens-mediated transformation (ATMT) associated with genotyping and phenotyping has gained wide acceptance. ATMT is also widely used to identify pathogenicity determinants in pathogenic fungi. A systematic analysis of T-DNA borders was performed in an ATMT-mutagenized collection of the phytopathogenic fungus Leptosphaeria maculans to evaluate the features of T-DNA integration in its particular transposable element-rich compartmentalized genome. A total of 318 T-DNA tags were recovered and analyzed for biases in chromosome and genic compartments, existence of CG/AT skews at the insertion site, and occurrence of microhomologies between the T-DNA left border (LB) and the target sequence. Functional annotation of targeted genes was done using the Gene Ontology annotation. The T-DNA integration mainly targeted gene-rich, transcriptionally active regions, and it favored biological processes consistent with the physiological status of a germinating spore. T-DNA integration was strongly biased toward regulatory regions, and mainly promoters. Consistent with the T-DNA intranuclear-targeting model, the density of T-DNA insertion correlated with CG skew near the transcription initiation site. The existence of microhomologies between promoter sequences and the T-DNA LB flanking sequence was also consistent with T-DNA integration to host DNA mediated by homologous recombination based on the microhomology-mediated end-joining pathway.

  7. Criminal genomic pragmatism: prisoners' representations of DNA technology and biosecurity.

    PubMed

    Machado, Helena; Silva, Susana

    2012-01-01

    Within the context of the use of DNA technology in crime investigation, biosecurity is perceived by different stakeholders according to their particular rationalities and interests. Very little is known about prisoners' perceptions and assessments of the uses of DNA technology in solving crime. To propose a conceptual model that serves to analyse and interpret prisoners' representations of DNA technology and biosecurity. A qualitative study using an interpretative approach based on 31 semi-structured tape-recorded interviews was carried out between May and September 2009, involving male inmates in three prisons located in the north of Portugal. The content analysis focused on the following topics: the meanings attributed to DNA and assessments of the risks and benefits of the uses of DNA technology and databasing in forensic applications. DNA was described as a record of identity, an exceptional material, and a powerful biometric identifier. The interviewees believed that DNA can be planted to incriminate suspects. Convicted offenders argued for the need to extend the criteria for the inclusion of DNA profiles in forensic databases and to restrict the removal of profiles. The conceptual model entitled criminal genomic pragmatism allows for an understanding of the views of prison inmates regarding DNA technology and biosecurity.

  8. Effects of antioxidants on the quality and genomic stability of induced pluripotent stem cells

    PubMed Central

    Luo, Lan; Kawakatsu, Miho; Guo, Chao-Wan; Urata, Yoshishige; Huang, Wen-Jing; Ali, Haytham; Doi, Hanako; Kitajima, Yuriko; Tanaka, Takayuki; Goto, Shinji; Ono, Yusuke; Xin, Hong-Bo; Hamano, Kimikazu; Li, Tao-Sheng

    2014-01-01

    Effects of antioxidants on the quality and genomic stability of induced pluripotent stem (iPS) cells were investigated with two human iPS cell lines (201B7 and 253G1). Cells used in this study were expanded from a single colony of each cell line with the addition of proprietary antioxidant supplement or homemade antioxidant cocktail in medium, and maintained in parallel for 2 months. The cells grew well in all culture conditions and kept “stemness”. Although antioxidants modestly decreased the levels of intracellular reactive oxygen species, there were no differences in the expression of 53BP1 and pATM, two critical molecules related with DNA damage and repair, under various culture conditions. CGH analysis showed that the events of genetic aberrations were decreased only in the 253G1 iPS cells with the addition of homemade antioxidant cocktail. Long-term culture will be necessary to confirm whether low dose antioxidants improve the quality and genomic stability of iPS cells. PMID:24445363

  9. Nanoliter reactors improve multiple displacement amplification of genomes from single cells.

    PubMed

    Marcy, Yann; Ishoey, Thomas; Lasken, Roger S; Stockwell, Timothy B; Walenz, Brian P; Halpern, Aaron L; Beeson, Karen Y; Goldberg, Susanne M D; Quake, Stephen R

    2007-09-01

    Since only a small fraction of environmental bacteria are amenable to laboratory culture, there is great interest in genomic sequencing directly from single cells. Sufficient DNA for sequencing can be obtained from one cell by the Multiple Displacement Amplification (MDA) method, thereby eliminating the need to develop culture methods. Here we used a microfluidic device to isolate individual Escherichia coli and amplify genomic DNA by MDA in 60-nl reactions. Our results confirm a report that reduced MDA reaction volume lowers nonspecific synthesis that can result from contaminant DNA templates and unfavourable interaction between primers. The quality of the genome amplification was assessed by qPCR and compared favourably to single-cell amplifications performed in standard 50-microl volumes. Amplification bias was greatly reduced in nanoliter volumes, thereby providing a more even representation of all sequences. Single-cell amplicons from both microliter and nanoliter volumes provided high-quality sequence data by high-throughput pyrosequencing, thereby demonstrating a straightforward route to sequencing genomes from single cells.

  10. PTEN in the maintenance of genome integrity: From DNA replication to chromosome segregation.

    PubMed

    Hou, Sheng-Qi; Ouyang, Meng; Brandmaier, Andrew; Hao, Hongbo; Shen, Wen H

    2017-10-01

    Faithful DNA replication and accurate chromosome segregation are the key machineries of genetic transmission. Disruption of these processes represents a hallmark of cancer and often results from loss of tumor suppressors. PTEN is an important tumor suppressor that is frequently mutated or deleted in human cancer. Loss of PTEN has been associated with aneuploidy and poor prognosis in cancer patients. In mice, Pten deletion or mutation drives genomic instability and tumor development. PTEN deficiency induces DNA replication stress, confers stress tolerance, and disrupts mitotic spindle architecture, leading to accumulation of structural and numerical chromosome instability. Therefore, PTEN guards the genome by controlling multiple processes of chromosome inheritance. Here, we summarize current understanding of the PTEN function in promoting high-fidelity transmission of genetic information. We also discuss the PTEN pathways of genome maintenance and highlight potential targets for cancer treatment. © 2017 WILEY Periodicals, Inc.

  11. Micro-Scale Genomic DNA Copy Number Aberrations as Another Means of Mutagenesis in Breast Cancer

    PubMed Central

    Chao, Hann-Hsiang; He, Xiaping; Parker, Joel S.; Zhao, Wei; Perou, Charles M.

    2012-01-01

    Introduction In breast cancer, the basal-like subtype has high levels of genomic instability relative to other breast cancer subtypes with many basal-like-specific regions of aberration. There is evidence that this genomic instability extends to smaller scale genomic aberrations, as shown by a previously described micro-deletion event in the PTEN gene in the Basal-like SUM149 breast cancer cell line. Methods We sought to identify if small regions of genomic DNA copy number changes exist by using a high density, gene-centric Comparative Genomic Hybridizations (CGH) array on cell lines and primary tumors. A custom tiling array for CGH (244,000 probes, 200 bp tiling resolution) was created to identify small regions of genomic change, which was focused on previously identified basal-like-specific, and general cancer genes. Tumor genomic DNA from 94 patients and 2 breast cancer cell lines was labeled and hybridized to these arrays. Aberrations were called using SWITCHdna and the smallest 25% of SWITCHdna-defined genomic segments were called micro-aberrations (<64 contiguous probes, ∼ 15 kb). Results Our data showed that primary tumor breast cancer genomes frequently contained many small-scale copy number gains and losses, termed micro-aberrations, most of which are undetectable using typical-density genome-wide aCGH arrays. The basal-like subtype exhibited the highest incidence of these events. These micro-aberrations sometimes altered expression of the involved gene. We confirmed the presence of the PTEN micro-amplification in SUM149 and by mRNA-seq showed that this resulted in loss of expression of all exons downstream of this event. Micro-aberrations disproportionately affected the 5′ regions of the affected genes, including the promoter region, and high frequency of micro-aberrations was associated with poor survival. Conclusion Using a high-probe-density, gene-centric aCGH microarray, we present evidence of small-scale genomic aberrations that can contribute to

  12. Genome-wide DNA methylation patterns of bovine blastocysts derived from in vivo embryos subjected to in vitro culture before, during or after embryonic genome activation.

    PubMed

    Salilew-Wondim, Dessie; Saeed-Zidane, Mohammed; Hoelker, Michael; Gebremedhn, Samuel; Poirier, Mikhaël; Pandey, Hari Om; Tholen, Ernst; Neuhoff, Christiane; Held, Eva; Besenfelder, Urban; Havlicek, Vita; Rings, Franca; Fournier, Eric; Gagné, Dominic; Sirard, Marc-André; Robert, Claude; Gad, Ahmed; Schellander, Karl; Tesfaye, Dawit

    2018-06-01

    Aberrant DNA methylation patterns of genes required for development are common in in vitro produced embryos. In this regard, we previously identified altered DNA methylation patterns of in vivo developed blastocysts from embryos which spent different stages of development in vitro, indicating carryover effects of suboptimal culture conditions on epigenetic signatures of preimplantation embryos. However, epigenetic responses of in vivo originated embryos to suboptimal culture conditions are not fully understood. Therefore, here we investigated DNA methylation patterns of in vivo derived bovine embryos subjected to in vitro culture condition before, during or after major embryonic genome activation (EGA). For this, in vivo produced 2-, 8- and 16-cell stage embryos were cultured in vitro until the blastocyst stage and blastocysts were used for genome-wide DNA methylation analysis. The 2- and 8-cell flushed embryo groups showed lower blastocyst rates compared to the 16-cell flush group. This was further accompanied by increased numbers of differentially methylated genomic regions (DMRs) in blastocysts of the 2- and 8-cell flush groups compared to the complete in vivo control ones. Moreover, 1623 genomic loci including imprinted genes were hypermethylated in blastocyst of 2-, 8- and 16-cell flushed groups, indicating the presence of genomic regions which are sensitive to the in vitro culture at any stage of embryonic development. Furthermore, hypermethylated genomic loci outnumbered hypomethylated ones in blastocysts of 2- and 16-cell flushed embryo groups, but the opposite occurred in the 8-cell group. Moreover, DMRs which were unique to blastocysts of the 2-cell flushed group and inversely correlated with corresponding mRNA expression levels were involved in plasma membrane lactate transport, amino acid transport and phosphorus metabolic processes, whereas DMRs which were specific to the 8-cell group and inversely correlated with corresponding mRNA expression levels

  13. Analysis of bacterial populations in the environment using two-dimensional gel electrophoresis of genomic DNA and complementary DNA.

    PubMed

    Liu, Guo-Hua; Nakamura, Tatsuo; Amemiya, Takashi; Rajendran, Narasimmalu; Itoh, Kiminori

    2011-01-01

    Two-dimensional gel electrophoresis (2-DGE) mapping of genomic DNA and complementary DNA (cDNA) amplicons was attempted to analyze total and active bacterial populations within soil and activated sludge samples. Distinct differences in the number and species of bacterial populations and those that were metabolically active at the time of sampling were visually observed especially for the soil community. Statistical analyses and sequencing based on the 2-DGE data further revealed the relationships between total and active bacterial populations within each community. This high-resolution technique would be useful for obtaining a better understanding of bacterial population structures in the environment.

  14. TopBP1/Dpb11 binds DNA anaphase bridges to prevent genome instability

    PubMed Central

    Germann, Susanne M.; Schramke, Vera; Pedersen, Rune Troelsgaard; Gallina, Irene; Eckert-Boulet, Nadine; Oestergaard, Vibe H.

    2014-01-01

    DNA anaphase bridges are a potential source of genome instability that may lead to chromosome breakage or nondisjunction during mitosis. Two classes of anaphase bridges can be distinguished: DAPI-positive chromatin bridges and DAPI-negative ultrafine DNA bridges (UFBs). Here, we establish budding yeast Saccharomyces cerevisiae and the avian DT40 cell line as model systems for studying DNA anaphase bridges and show that TopBP1/Dpb11 plays an evolutionarily conserved role in their metabolism. Together with the single-stranded DNA binding protein RPA, TopBP1/Dpb11 binds to UFBs, and depletion of TopBP1/Dpb11 led to an accumulation of chromatin bridges. Importantly, the NoCut checkpoint that delays progression from anaphase to abscission in yeast was activated by both UFBs and chromatin bridges independently of Dpb11, and disruption of the NoCut checkpoint in Dpb11-depleted cells led to genome instability. In conclusion, we propose that TopBP1/Dpb11 prevents accumulation of anaphase bridges via stimulation of the Mec1/ATR kinase and suppression of homologous recombination. PMID:24379413

  15. TopBP1/Dpb11 binds DNA anaphase bridges to prevent genome instability.

    PubMed

    Germann, Susanne M; Schramke, Vera; Pedersen, Rune Troelsgaard; Gallina, Irene; Eckert-Boulet, Nadine; Oestergaard, Vibe H; Lisby, Michael

    2014-01-06

    DNA anaphase bridges are a potential source of genome instability that may lead to chromosome breakage or nondisjunction during mitosis. Two classes of anaphase bridges can be distinguished: DAPI-positive chromatin bridges and DAPI-negative ultrafine DNA bridges (UFBs). Here, we establish budding yeast Saccharomyces cerevisiae and the avian DT40 cell line as model systems for studying DNA anaphase bridges and show that TopBP1/Dpb11 plays an evolutionarily conserved role in their metabolism. Together with the single-stranded DNA binding protein RPA, TopBP1/Dpb11 binds to UFBs, and depletion of TopBP1/Dpb11 led to an accumulation of chromatin bridges. Importantly, the NoCut checkpoint that delays progression from anaphase to abscission in yeast was activated by both UFBs and chromatin bridges independently of Dpb11, and disruption of the NoCut checkpoint in Dpb11-depleted cells led to genome instability. In conclusion, we propose that TopBP1/Dpb11 prevents accumulation of anaphase bridges via stimulation of the Mec1/ATR kinase and suppression of homologous recombination.

  16. qPCR-based mitochondrial DNA quantification: Influence of template DNA fragmentation on accuracy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jackson, Christopher B., E-mail: Christopher.jackson@insel.ch; Gallati, Sabina, E-mail: sabina.gallati@insel.ch; Schaller, Andre, E-mail: andre.schaller@insel.ch

    2012-07-06

    Highlights: Black-Right-Pointing-Pointer Serial qPCR accurately determines fragmentation state of any given DNA sample. Black-Right-Pointing-Pointer Serial qPCR demonstrates different preservation of the nuclear and mitochondrial genome. Black-Right-Pointing-Pointer Serial qPCR provides a diagnostic tool to validate the integrity of bioptic material. Black-Right-Pointing-Pointer Serial qPCR excludes degradation-induced erroneous quantification. -- Abstract: Real-time PCR (qPCR) is the method of choice for quantification of mitochondrial DNA (mtDNA) by relative comparison of a nuclear to a mitochondrial locus. Quantitative abnormal mtDNA content is indicative of mitochondrial disorders and mostly confines in a tissue-specific manner. Thus handling of degradation-prone bioptic material is inevitable. We established a serialmore » qPCR assay based on increasing amplicon size to measure degradation status of any DNA sample. Using this approach we can exclude erroneous mtDNA quantification due to degraded samples (e.g. long post-exicision time, autolytic processus, freeze-thaw cycles) and ensure abnormal DNA content measurements (e.g. depletion) in non-degraded patient material. By preparation of degraded DNA under controlled conditions using sonification and DNaseI digestion we show that erroneous quantification is due to the different preservation qualities of the nuclear and the mitochondrial genome. This disparate degradation of the two genomes results in over- or underestimation of mtDNA copy number in degraded samples. Moreover, as analysis of defined archival tissue would allow to precise the molecular pathomechanism of mitochondrial disorders presenting with abnormal mtDNA content, we compared fresh frozen (FF) with formalin-fixed paraffin-embedded (FFPE) skeletal muscle tissue of the same sample. By extrapolation of measured decay constants for nuclear DNA ({lambda}{sub nDNA}) and mtDNA ({lambda}{sub mtDNA}) we present an approach to possibly correct

  17. Complete mitochondrial DNA genome of bonnethead shark, Sphyrna tiburo, and phylogenetic relationships among main superorders of modern elasmobranchs

    PubMed Central

    Díaz-Jaimes, Píndaro; Bayona-Vásquez, Natalia J.; Adams, Douglas H.; Uribe-Alcocer, Manuel

    2015-01-01

    Elasmobranchs are one of the most diverse groups in the marine realm represented by 18 orders, 55 families and about 1200 species reported, but also one of the most vulnerable to exploitation and to climate change. Phylogenetic relationships among main orders have been controversial since the emergence of the Hypnosqualean hypothesis by Shirai (1992) that considered batoids as a sister group of sharks. The use of the complete mitochondrial DNA (mtDNA) may shed light to further validate this hypothesis by increasing the number of informative characters. We report the mtDNA genome of the bonnethead shark Sphyrna tiburo, and compare it with mitogenomes of other 48 species to assess phylogenetic relationships. The mtDNA genome of S. tiburo, is quite similar in size to that of congeneric species but also similar to the reported mtDNA genome of other Carcharhinidae species. Like most vertebrate mitochondrial genomes, it contained 13 protein coding genes, two rRNA genes and 22 tRNA genes and the control region of 1086 bp (D-loop). The Bayesian analysis of the 49 mitogenomes supported the view that sharks and batoids are separate groups. PMID:27014583

  18. Single-Molecule Denaturation Mapping of Genomic DNA in Nanofluidic Channels

    NASA Astrophysics Data System (ADS)

    Reisner, Walter; Larsen, Niels; Kristensen, Anders; Tegenfeldt, Jonas O.; Flyvbjerg, Henrik

    2009-03-01

    We have developed a new DNA barcoding technique based on the partial denaturation of extended fluorescently labeled DNA molecules. We partially melt DNA extended in nanofluidic channels via a combination of local heating and added chemical denaturants. The melted molecules, imaged via a standard fluorescence videomicroscopy setup, exhibit a nonuniform fluorescence profile corresponding to a series of local dips and peaks in the intensity trace along the stretched molecule. We show that this barcode is consistent with the presence of locally melted regions and can be explained by calculations of sequence-dependent melting probability. We believe this melting mapping technology is the first optically based single molecule technique sensitive to genome wide sequence variation that does not require an additional enzymatic labeling or restriction scheme.

  19. DNApod: DNA polymorphism annotation database from next-generation sequence read archives.

    PubMed

    Mochizuki, Takako; Tanizawa, Yasuhiro; Fujisawa, Takatomo; Ohta, Tazro; Nikoh, Naruo; Shimizu, Tokurou; Toyoda, Atsushi; Fujiyama, Asao; Kurata, Nori; Nagasaki, Hideki; Kaminuma, Eli; Nakamura, Yasukazu

    2017-01-01

    With the rapid advances in next-generation sequencing (NGS), datasets for DNA polymorphisms among various species and strains have been produced, stored, and distributed. However, reliability varies among these datasets because the experimental and analytical conditions used differ among assays. Furthermore, such datasets have been frequently distributed from the websites of individual sequencing projects. It is desirable to integrate DNA polymorphism data into one database featuring uniform quality control that is distributed from a single platform at a single place. DNA polymorphism annotation database (DNApod; http://tga.nig.ac.jp/dnapod/) is an integrated database that stores genome-wide DNA polymorphism datasets acquired under uniform analytical conditions, and this includes uniformity in the quality of the raw data, the reference genome version, and evaluation algorithms. DNApod genotypic data are re-analyzed whole-genome shotgun datasets extracted from sequence read archives, and DNApod distributes genome-wide DNA polymorphism datasets and known-gene annotations for each DNA polymorphism. This new database was developed for storing genome-wide DNA polymorphism datasets of plants, with crops being the first priority. Here, we describe our analyzed data for 679, 404, and 66 strains of rice, maize, and sorghum, respectively. The analytical methods are available as a DNApod workflow in an NGS annotation system of the DNA Data Bank of Japan and a virtual machine image. Furthermore, DNApod provides tables of links of identifiers between DNApod genotypic data and public phenotypic data. To advance the sharing of organism knowledge, DNApod offers basic and ubiquitous functions for multiple alignment and phylogenetic tree construction by using orthologous gene information.

  20. DNApod: DNA polymorphism annotation database from next-generation sequence read archives

    PubMed Central

    Mochizuki, Takako; Tanizawa, Yasuhiro; Fujisawa, Takatomo; Ohta, Tazro; Nikoh, Naruo; Shimizu, Tokurou; Toyoda, Atsushi; Fujiyama, Asao; Kurata, Nori; Nagasaki, Hideki; Kaminuma, Eli; Nakamura, Yasukazu

    2017-01-01

    With the rapid advances in next-generation sequencing (NGS), datasets for DNA polymorphisms among various species and strains have been produced, stored, and distributed. However, reliability varies among these datasets because the experimental and analytical conditions used differ among assays. Furthermore, such datasets have been frequently distributed from the websites of individual sequencing projects. It is desirable to integrate DNA polymorphism data into one database featuring uniform quality control that is distributed from a single platform at a single place. DNA polymorphism annotation database (DNApod; http://tga.nig.ac.jp/dnapod/) is an integrated database that stores genome-wide DNA polymorphism datasets acquired under uniform analytical conditions, and this includes uniformity in the quality of the raw data, the reference genome version, and evaluation algorithms. DNApod genotypic data are re-analyzed whole-genome shotgun datasets extracted from sequence read archives, and DNApod distributes genome-wide DNA polymorphism datasets and known-gene annotations for each DNA polymorphism. This new database was developed for storing genome-wide DNA polymorphism datasets of plants, with crops being the first priority. Here, we describe our analyzed data for 679, 404, and 66 strains of rice, maize, and sorghum, respectively. The analytical methods are available as a DNApod workflow in an NGS annotation system of the DNA Data Bank of Japan and a virtual machine image. Furthermore, DNApod provides tables of links of identifiers between DNApod genotypic data and public phenotypic data. To advance the sharing of organism knowledge, DNApod offers basic and ubiquitous functions for multiple alignment and phylogenetic tree construction by using orthologous gene information. PMID:28234924

  1. Genome-wide Mapping Reveals Conservation of Promoter DNA Methylation Following Chicken Domestication

    PubMed Central

    Li, Qinghe; Wang, Yuanyuan; Hu, Xiaoxiang; Zhao, Yaofeng; Li, Ning

    2015-01-01

    It is well-known that environment influences DNA methylation, however, the extent of heritable DNA methylation variation following animal domestication remains largely unknown. Using meDIP-chip we mapped the promoter methylomes for 23,316 genes in muscle tissues of ancestral and domestic chickens. We systematically examined the variation of promoter DNA methylation in terms of different breeds, differentially expressed genes, SNPs and genes undergo genetic selection sweeps. While considerable changes in DNA sequence and gene expression programs were prevalent, we found that the inter-strain DNA methylation patterns were highly conserved in promoter region between the wild and domestic chicken breeds. Our data suggests a global preservation of DNA methylation between the wild and domestic chicken breeds in either a genome-wide or locus-specific scale in chick muscle tissues. PMID:25735894

  2. QUAST: quality assessment tool for genome assemblies

    PubMed Central

    Gurevich, Alexey; Saveliev, Vladislav; Vyahhi, Nikolay; Tesler, Glenn

    2013-01-01

    Summary: Limitations of genome sequencing techniques have led to dozens of assembly algorithms, none of which is perfect. A number of methods for comparing assemblers have been developed, but none is yet a recognized benchmark. Further, most existing methods for comparing assemblies are only applicable to new assemblies of finished genomes; the problem of evaluating assemblies of previously unsequenced species has not been adequately considered. Here, we present QUAST—a quality assessment tool for evaluating and comparing genome assemblies. This tool improves on leading assembly comparison software with new ideas and quality metrics. QUAST can evaluate assemblies both with a reference genome, as well as without a reference. QUAST produces many reports, summary tables and plots to help scientists in their research and in their publications. In this study, we used QUAST to compare several genome assemblers on three datasets. QUAST tables and plots for all of them are available in the Supplementary Material, and interactive versions of these reports are on the QUAST website. Availability: http://bioinf.spbau.ru/quast Contact: gurevich@bioinf.spbau.ru Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23422339

  3. Assessment of genome origins and genetic diversity in the genus Eleusine with DNA markers.

    PubMed

    Salimath, S S; de Oliveira, A C; Godwin, I D; Bennetzen, J L

    1995-08-01

    Finger millet (Eleusine coracana), an allotetraploid cereal, is widely cultivated in the arid and semiarid regions of the world. Three DNA marker techniques, restriction fragment length polymorphism (RFLP), randomly amplified polymorphic DNA (RAPD), and inter simple sequence repeat amplification (ISSR), were employed to analyze 22 accessions belonging to 5 species of Eleusine. An 8 probe--3 enzyme RFLP combination, 18 RAPD primers, and 6 ISSR primers, respectively, revealed 14, 10, and 26% polymorphism in 17 accessions of E. coracana from Africa and Asia. These results indicated a very low level of DNA sequence variability in the finger millets but did allow each line to be distinguished. The different Eleusine species could be easily identified by DNA marker technology and the 16% intraspecific polymorphism exhibited by the two analyzed accessions of E. floccifolia suggested a much higher level of diversity in this species than in E. coracana. Between species, E. coracana and E. indica shared the most markers, while E. indica and E. tristachya shared a considerable number of markers, indicating that these three species form a close genetic assemblage within the Eleusine. Eleusine floccifolia and E. compressa were found to be the most divergent among the species examined. Comparison of RFLP, RAPD, and ISSR technologies, in terms of the quantity and quality of data output, indicated that ISSRs are particularly promising for the analysis of plant genome diversity.

  4. Inter-Fork Strand Annealing causes genomic deletions during the termination of DNA replication.

    PubMed

    Morrow, Carl A; Nguyen, Michael O; Fower, Andrew; Wong, Io Nam; Osman, Fekret; Bryer, Claire; Whitby, Matthew C

    2017-06-06

    Problems that arise during DNA replication can drive genomic alterations that are instrumental in the development of cancers and many human genetic disorders. Replication fork barriers are a commonly encountered problem, which can cause fork collapse and act as hotspots for replication termination. Collapsed forks can be rescued by homologous recombination, which restarts replication. However, replication restart is relatively slow and, therefore, replication termination may frequently occur by an active fork converging on a collapsed fork. We find that this type of non-canonical fork convergence in fission yeast is prone to trigger deletions between repetitive DNA sequences via a mechanism we call Inter-Fork Strand Annealing (IFSA) that depends on the recombination proteins Rad52, Exo1 and Mus81, and is countered by the FANCM-related DNA helicase Fml1. Based on our findings, we propose that IFSA is a potential threat to genomic stability in eukaryotes.

  5. Genome-wide DNA methylation drives human embryonic stem cell erythropoiesis by remodeling gene expression dynamics.

    PubMed

    Liu, Zhijing; Feng, Qiang; Sun, Pengpeng; Lu, Yan; Yang, Minlan; Zhang, Xiaowei; Jin, Xiangshu; Li, Yulin; Lu, Shi-Jiang; Quan, Chengshi

    2017-12-01

    To investigate the role of DNA methylation during erythrocyte production by human embryonic stem cells (hESCs). We employed an erythroid differentiation model from hESCs, and then tracked the genome-wide DNA methylation maps and gene expression patterns through an Infinium HumanMethylation450K BeadChip and an Ilumina Human HT-12 v4 Expression Beadchip, respectively. A negative correlation between DNA methylation and gene expression was substantially enriched during the later differentiation stage and was present in both the promoter and the gene body. Moreover, erythropoietic genes with differentially methylated CpG sites that were primarily enriched in nonisland regions were upregulated, and demethylation of their gene bodies was associated with the presence of enhancers and DNase I hypersensitive sites. Finally, the components of JAK-STAT-NF-κB signaling were DNA hypomethylated and upregulated, which targets the key genes for erythropoiesis. Erythroid lineage commitment by hESCs requires genome-wide DNA methylation modifications to remodel gene expression dynamics.

  6. Circulating nucleic acids damage DNA of healthy cells by integrating into their genomes

    PubMed Central

    Mittra, Indraneel; Khare, Naveen Kumar; Raghuram, Gorantla Venkata; Chaubal, Rohan; Khambatti, Fatema; Gupta, Deepika; Gaikwad, Ashwini; Prasannan, Preeti; Singh, Akshita; Iyer, Aishwarya; Singh, Ankita; Upadhyay, Pawan; Nair, Naveen Kumar; Mishra, Pradyumna Kumar; Dutt, Amit

    2018-01-01

    Whether nucleic acids that circulate in blood have any patho-physiological functions in the host have not been explored. We report here that far from being inert molecules, circulating nucleic acids have significant biological activities of their own that are deleterious to healthy cells of the body. Fragmented DNA and chromatin (DNAfs and Cfs) isolated from blood of cancer patients and healthy volunteers are readily taken up by a variety of cells in culture to be localized in their nuclei within a few minutes. The intra-nuclear DNAfs and Cfs associate themselves with host cell chromosomes to evoke a cellular DNA-damage-repair-response (DDR) followed by their incorporation into the host cell genomes. Whole genome sequencing detected the presence of tens of thousands of human sequence reads in the recipient mouse cells. Genomic incorporation of DNAfs and Cfs leads to dsDNA breaks and activation of apoptotic pathways in the treated cells. When injected intravenously into Balb/C mice, DNAfs and Cfs undergo genomic integration into cells of their vital organs resulting in activation of DDR and apoptotic proteins in the recipient cells. Cfs have significantly greater activity than DNAfs with respect to all parameters examined, while both DNAfs and Cfs isolated from cancer patients are more active than those from normal volunteers. All the above pathological actions of DNAfs and Cfs described above can be abrogated by concurrent treatment with DNase I and/or anti-histone antibody complexed nanoparticles both in vitro and in vivo. Taken together, our results that circulating DNAfs and Cfs are physiological, continuously arising, endogenous DNA damaging agents with implications to ageing and a multitude of human pathologies including initiation of cancer. PMID:25740145

  7. G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences.

    PubMed

    Lenis, Vasileios Panagiotis E; Swain, Martin; Larkin, Denis M

    2018-05-01

    Cross-species whole-genome sequence alignment is a critical first step for genome comparative analyses, ranging from the detection of sequence variants to studies of chromosome evolution. Animal genomes are large and complex, and whole-genome alignment is a computationally intense process, requiring expensive high-performance computing systems due to the need to explore extensive local alignments. With hundreds of sequenced animal genomes available from multiple projects, there is an increasing demand for genome comparative analyses. Here, we introduce G-Anchor, a new, fast, and efficient pipeline that uses a strictly limited but highly effective set of local sequence alignments to anchor (or map) an animal genome to another species' reference genome. G-Anchor makes novel use of a databank of highly conserved DNA sequence elements. We demonstrate how these elements may be aligned to a pair of genomes, creating anchors. These anchors enable the rapid mapping of scaffolds from a de novo assembled genome to chromosome assemblies of a reference species. Our results demonstrate that G-Anchor can successfully anchor a vertebrate genome onto a phylogenetically related reference species genome using a desktop or laptop computer within a few hours and with comparable accuracy to that achieved by a highly accurate whole-genome alignment tool such as LASTZ. G-Anchor thus makes whole-genome comparisons accessible to researchers with limited computational resources. G-Anchor is a ready-to-use tool for anchoring a pair of vertebrate genomes. It may be used with large genomes that contain a significant fraction of evolutionally conserved DNA sequences and that are not highly repetitive, polypoid, or excessively fragmented. G-Anchor is not a substitute for whole-genome aligning software but can be used for fast and accurate initial genome comparisons. G-Anchor is freely available and a ready-to-use tool for the pairwise comparison of two genomes.

  8. To peep into Pif1 helicase: multifaceted all the way from genome stability to repair-associated DNA synthesis.

    PubMed

    Chung, Woo-Hyun

    2014-02-01

    Pif1 DNA helicase is the prototypical member of a 5' to 3' helicase superfamily conserved from bacteria to humans. In Saccharomyces cerevisiae, Pif1 and its homologue Rrm3, localize in both mitochondria and nucleus playing multiple roles in the maintenance of genomic homeostasis. They display relatively weak processivities in vitro, but have largely non-overlapping functions on common genomic loci such as mitochondrial DNA, telomeric ends, and many replication forks especially at hard-to-replicate regions including ribosomal DNA and G-quadruplex structures. Recently, emerging evidence shows that Pif1, but not Rrm3, has a significant new role in repair-associated DNA synthesis with Polδ during homologous recombination stimulating D-loop migration for conservative DNA replication. Comparative genetic and biochemical studies on the structure and function of Pif1 family helicases across different biological systems are further needed to elucidate both diversity and specificity of their mechanisms of action that contribute to genome stability.

  9. DNA methylation and genome rearrangement characteristics of phase change in cultured shoots of Sequoia sempervirens.

    PubMed

    Huang, Li-Chun; Hsiao, Lin-June; Pu, Szu-Yuan; Kuo, Ching-I; Huang, Bau-Lian; Tseng, Tsung-Che; Huang, Hao-Jen; Chen, Yu-Ting

    2012-06-01

    Epigenetic machinery regulates the expression of individual genes and plays a crucial role in globally shaping and maintaining developmental patterning. We studied the extent of DNA methylation in the nucleus, mitochondrion and chloroplast in cultured Sequoia sempervirens (coast redwood) adult, juvenile and rejuvenated shoots by measuring the ratio of methylcytosine to total cytosine using high-performance liquid chromatography (HPLC). We also analyzed nuclear DNA (nuDNA) polymorphisms of different shoot types by methylation-sensitive amplified fragment length polymorphism (MSAP) and Southern blot analysis. The extent of nuDNA methylation was greater in the adult vegetative than juvenile and rejuvenated shoots (8% vs 6.5-7.5%). In contrast, the proportion of methylcytosine was higher in mitochondrial DNA (mDNA) of juvenile and rejuvenated shoots than adult shoots (6.6% vs 7.8-8.2%). MSAP and Southern blot analyses identified three MSAP fragments which could be applied as phase-specific molecular markers. We also found nuclear genome and mtDNA rearrangement may be as important as DNA methylation status during the phase change. Our findings strongly suggest that DNA methylation and genome rearrangement may affect the dynamic tissue- and cell type-specific changes that determine the developmental phase of S. sempervirens shoots. Copyright © Physiologia Plantarum 2012.

  10. A versatile genome-scale PCR-based pipeline for high-definition DNA FISH.

    PubMed

    Bienko, Magda; Crosetto, Nicola; Teytelman, Leonid; Klemm, Sandy; Itzkovitz, Shalev; van Oudenaarden, Alexander

    2013-02-01

    We developed a cost-effective genome-scale PCR-based method for high-definition DNA FISH (HD-FISH). We visualized gene loci with diffraction-limited resolution, chromosomes as spot clusters and single genes together with transcripts by combining HD-FISH with single-molecule RNA FISH. We provide a database of over 4.3 million primer pairs targeting the human and mouse genomes that is readily usable for rapid and flexible generation of probes.

  11. Inactivating UBE2M impacts the DNA damage response and genome integrity involving multiple cullin ligases.

    PubMed

    Cukras, Scott; Morffy, Nicholas; Ohn, Takbum; Kee, Younghoon

    2014-01-01

    Protein neddylation is involved in a wide variety of cellular processes. Here we show that the DNA damage response is perturbed in cells inactivated with an E2 Nedd8 conjugating enzyme UBE2M, measured by RAD51 foci formation kinetics and cell based DNA repair assays. UBE2M knockdown increases DNA breakages and cellular sensitivity to DNA damaging agents, further suggesting heightened genomic instability and defective DNA repair activity. Investigating the downstream Cullin targets of UBE2M revealed that silencing of Cullin 1, 2, and 4 ligases incurred significant DNA damage. In particular, UBE2M knockdown, or defective neddylation of Cullin 2, leads to a blockade in the G1 to S progression and is associated with delayed S-phase dependent DNA damage response. Cullin 4 inactivation leads to an aberrantly high DNA damage response that is associated with increased DNA breakages and sensitivity of cells to DNA damaging agents, suggesting a DNA repair defect is associated. siRNA interrogation of key Cullin substrates show that CDT1, p21, and Claspin are involved in elevated DNA damage in the UBE2M knockdown cells. Therefore, UBE2M is required to maintain genome integrity by activating multiple Cullin ligases throughout the cell cycle.

  12. Large-scale chromosome folding versus genomic DNA sequences: A discrete double Fourier transform technique.

    PubMed

    Chechetkin, V R; Lobzin, V V

    2017-08-07

    Using state-of-the-art techniques combining imaging methods and high-throughput genomic mapping tools leaded to the significant progress in detailing chromosome architecture of various organisms. However, a gap still remains between the rapidly growing structural data on the chromosome folding and the large-scale genome organization. Could a part of information on the chromosome folding be obtained directly from underlying genomic DNA sequences abundantly stored in the databanks? To answer this question, we developed an original discrete double Fourier transform (DDFT). DDFT serves for the detection of large-scale genome regularities associated with domains/units at the different levels of hierarchical chromosome folding. The method is versatile and can be applied to both genomic DNA sequences and corresponding physico-chemical parameters such as base-pairing free energy. The latter characteristic is closely related to the replication and transcription and can also be used for the assessment of temperature or supercoiling effects on the chromosome folding. We tested the method on the genome of E. coli K-12 and found good correspondence with the annotated domains/units established experimentally. As a brief illustration of further abilities of DDFT, the study of large-scale genome organization for bacteriophage PHIX174 and bacterium Caulobacter crescentus was also added. The combined experimental, modeling, and bioinformatic DDFT analysis should yield more complete knowledge on the chromosome architecture and genome organization. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development.

    PubMed

    Daccord, Nicolas; Celton, Jean-Marc; Linsmith, Gareth; Becker, Claude; Choisne, Nathalie; Schijlen, Elio; van de Geest, Henri; Bianco, Luca; Micheletti, Diego; Velasco, Riccardo; Di Pierro, Erica Adele; Gouzy, Jérôme; Rees, D Jasper G; Guérif, Philippe; Muranty, Hélène; Durel, Charles-Eric; Laurens, François; Lespinasse, Yves; Gaillard, Sylvain; Aubourg, Sébastien; Quesneville, Hadi; Weigel, Detlef; van de Weg, Eric; Troggio, Michela; Bucher, Etienne

    2017-07-01

    Using the latest sequencing and optical mapping technologies, we have produced a high-quality de novo assembly of the apple (Malus domestica Borkh.) genome. Repeat sequences, which represented over half of the assembly, provided an unprecedented opportunity to investigate the uncharacterized regions of a tree genome; we identified a new hyper-repetitive retrotransposon sequence that was over-represented in heterochromatic regions and estimated that a major burst of different transposable elements (TEs) occurred 21 million years ago. Notably, the timing of this TE burst coincided with the uplift of the Tian Shan mountains, which is thought to be the center of the location where the apple originated, suggesting that TEs and associated processes may have contributed to the diversification of the apple ancestor and possibly to its divergence from pear. Finally, genome-wide DNA methylation data suggest that epigenetic marks may contribute to agronomically relevant aspects, such as apple fruit development.

  14. Mobile small RNAs regulate genome-wide DNA methylation.

    PubMed

    Lewsey, Mathew G; Hardcastle, Thomas J; Melnyk, Charles W; Molnar, Attila; Valli, Adrián; Urich, Mark A; Nery, Joseph R; Baulcombe, David C; Ecker, Joseph R

    2016-02-09

    RNA silencing at the transcriptional and posttranscriptional levels regulates endogenous gene expression, controls invading transposable elements (TEs), and protects the cell against viruses. Key components of the mechanism are small RNAs (sRNAs) of 21-24 nt that guide the silencing machinery to their nucleic acid targets in a nucleotide sequence-specific manner. Transcriptional gene silencing is associated with 24-nt sRNAs and RNA-directed DNA methylation (RdDM) at cytosine residues in three DNA sequence contexts (CG, CHG, and CHH). We previously demonstrated that 24-nt sRNAs are mobile from shoot to root in Arabidopsis thaliana and confirmed that they mediate DNA methylation at three sites in recipient cells. In this study, we extend this finding by demonstrating that RdDM of thousands of loci in root tissues is dependent upon mobile sRNAs from the shoot and that mobile sRNA-dependent DNA methylation occurs predominantly in non-CG contexts. Mobile sRNA-dependent non-CG methylation is largely dependent on the DOMAINS REARRANGED METHYLTRANSFERASES 1/2 (DRM1/DRM2) RdDM pathway but is independent of the CHROMOMETHYLASE (CMT)2/3 DNA methyltransferases. Specific superfamilies of TEs, including those typically found in gene-rich euchromatic regions, lose DNA methylation in a mutant lacking 22- to 24-nt sRNAs (dicer-like 2, 3, 4 triple mutant). Transcriptome analyses identified a small number of genes whose expression in roots is associated with mobile sRNAs and connected to DNA methylation directly or indirectly. Finally, we demonstrate that sRNAs from shoots of one accession move across a graft union and target DNA methylation de novo at normally unmethylated sites in the genomes of root cells from a different accession.

  15. A DNA-based pattern classifier with in vitro learning and associative recall for genomic characterization and biosensing without explicit sequence knowledge.

    PubMed

    Lee, Ju Seok; Chen, Junghuei; Deaton, Russell; Kim, Jin-Woo

    2014-01-01

    Genetic material extracted from in situ microbial communities has high promise as an indicator of biological system status. However, the challenge is to access genomic information from all organisms at the population or community scale to monitor the biosystem's state. Hence, there is a need for a better diagnostic tool that provides a holistic view of a biosystem's genomic status. Here, we introduce an in vitro methodology for genomic pattern classification of biological samples that taps large amounts of genetic information from all genes present and uses that information to detect changes in genomic patterns and classify them. We developed a biosensing protocol, termed Biological Memory, that has in vitro computational capabilities to "learn" and "store" genomic sequence information directly from genomic samples without knowledge of their explicit sequences, and that discovers differences in vitro between previously unknown inputs and learned memory molecules. The Memory protocol was designed and optimized based upon (1) common in vitro recombinant DNA operations using 20-base random probes, including polymerization, nuclease digestion, and magnetic bead separation, to capture a snapshot of the genomic state of a biological sample as a DNA memory and (2) the thermal stability of DNA duplexes between new input and the memory to detect similarities and differences. For efficient read out, a microarray was used as an output method. When the microarray-based Memory protocol was implemented to test its capability and sensitivity using genomic DNA from two model bacterial strains, i.e., Escherichia coli K12 and Bacillus subtilis, results indicate that the Memory protocol can "learn" input DNA, "recall" similar DNA, differentiate between dissimilar DNA, and detect relatively small concentration differences in samples. This study demonstrated not only the in vitro information processing capabilities of DNA, but also its promise as a genomic pattern classifier that could

  16. Aberrant topoisomerase-1 DNA lesions are pathogenic in neurodegenerative genome instability syndromes.

    PubMed

    Katyal, Sachin; Lee, Youngsoo; Nitiss, Karin C; Downing, Susanna M; Li, Yang; Shimada, Mikio; Zhao, Jingfeng; Russell, Helen R; Petrini, John H J; Nitiss, John L; McKinnon, Peter J

    2014-06-01

    DNA damage is considered to be a prime factor in several spinocerebellar neurodegenerative diseases; however, the DNA lesions underpinning disease etiology are unknown. We observed the endogenous accumulation of pathogenic topoisomerase-1 (Top1)-DNA cleavage complexes (Top1ccs) in murine models of ataxia telangiectasia and spinocerebellar ataxia with axonal neuropathy 1. We found that the defective DNA damage response factors in these two diseases cooperatively modulated Top1cc turnover in a non-epistatic and ATM kinase-independent manner. Furthermore, coincident neural inactivation of ATM and DNA single-strand break repair factors, including tyrosyl-DNA phosphodiesterase-1 or XRCC1, resulted in increased Top1cc formation and excessive DNA damage and neurodevelopmental defects. Notably, direct Top1 poisoning to elevate Top1cc levels phenocopied the neuropathology of the mouse models described above. Our results identify a critical endogenous pathogenic lesion associated with neurodegenerative syndromes arising from DNA repair deficiency, indicating that genome integrity is important for preventing disease in the nervous system.

  17. Nuclear genomes distinguish cryptic species suggested by their DNA barcodes and ecology

    PubMed Central

    Janzen, Daniel H.; Burns, John M.; Cong, Qian; Hallwachs, Winnie; Dapkey, Tanya; Manjunath, Ramya; Hajibabaei, Mehrdad; Hebert, Paul D. N.; Grishin, Nick V.

    2017-01-01

    DNA sequencing brings another dimension to exploration of biodiversity, and large-scale mitochondrial DNA cytochrome oxidase I barcoding has exposed many potential new cryptic species. Here, we add complete nuclear genome sequencing to DNA barcoding, ecological distribution, natural history, and subtleties of adult color pattern and size to show that a widespread neotropical skipper butterfly known as Udranomia kikkawai (Weeks) comprises three different species in Costa Rica. Full-length barcodes obtained from all three century-old Venezuelan syntypes of U. kikkawai show that it is a rainforest species occurring from Costa Rica to Brazil. The two new species are Udranomia sallydaleyae Burns, a dry forest denizen occurring from Costa Rica to Mexico, and Udranomia tomdaleyi Burns, which occupies the junction between the rainforest and dry forest and currently is known only from Costa Rica. Whereas the three species are cryptic, differing but slightly in appearance, their complete nuclear genomes totaling 15 million aligned positions reveal significant differences consistent with their 0.00065-Mbp (million base pair) mitochondrial barcodes and their ecological diversification. DNA barcoding of tropical insects reared by a massive inventory suggests that the presence of cryptic species is a widespread phenomenon and that further studies will substantially increase current estimates of insect species richness. PMID:28716927

  18. The Paramecium germline genome provides a niche for intragenic parasitic DNA: evolutionary dynamics of internal eliminated sequences.

    PubMed

    Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

    2012-01-01

    Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of -45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a -10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the

  19. The Paramecium Germline Genome Provides a Niche for Intragenic Parasitic DNA: Evolutionary Dynamics of Internal Eliminated Sequences

    PubMed Central

    Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E.; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

    2012-01-01

    Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the

  20. Mitochondrial genome rearrangements in glomus species triggered by homologous recombination between distinct mtDNA haplotypes.

    PubMed

    Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed

    2013-01-01

    Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms.AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence,were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigated podiversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity.We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants.

  1. Mitochondrial Genome Rearrangements in Glomus Species Triggered by Homologous Recombination between Distinct mtDNA Haplotypes

    PubMed Central

    Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed

    2013-01-01

    Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms. AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence, were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigate dpo diversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity. We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants. PMID:23925788

  2. Effects of storage temperature on the quantity and integrity of genomic DNA extracted from mice tissues: A comparison of recovery methods

    PubMed Central

    Al-Griw, Huda H.; Zraba, Zena A.; Al-Muntaser, Salsabiel K.; Draid, Marwan M.; Zaidi, Aisha M.; Tabagh, Refaat M.; Al-Griw, Mohamed A.

    2017-01-01

    Efficient extraction of genomic DNA (gDNA) from biological materials found in harsh environments is the first step for successful forensic DNA profiling. This study aimed to evaluate two methods for DNA recovery from animal tissues (livers, muscles), focusing on the best storage temperature for DNA yield in term of quality, quantity, and integrity for use in several downstream molecular techniques. Six male Swiss albino mice were sacrificed, liver and muscle tissues (n=32) were then harvested and stored for one week in different temperatures, -20°C, 4°C, 25°C and 40°C. The conditioned animal tissues were used for DNA extraction by Chelex-100 method or NucleoSpinC Blood and Tissue kit. The extracted gDNA was visualized on 1.5% agarose gel electrophoresis to determine the quality of gDNA and analysed spectrophotometrically to determine the DNA concentration and the purity. Both methods, Chelex-100 and NucleoSpin Blood and Tissue kit found to be appropriate for yielding high quantity of gDNA, with the Chelex 100 method yielding a greater quantity (P < 0.045) than the kit. At -20°C, 4°C, and 25°C temperatures, the concentration of DNA yield was numerically lower than at 40°C. The NucleoSpinC Blood and Tissue kit produced a higher (P=0.031) purity product than the Chelex-100 method, particularly for muscle tissues. The Chelex-100 method is cheap, fast, effective, and is a crucial tool for yielding DNA from animal tissues (livers, muscles) exposed to harsh environment with little limitations. PMID:28884076

  3. Human urinary bladder epithelial cells lacking wild-type p53 function are deficient in the repair of 4-aminobiphenyl-DNA adducts in genomic DNA.

    PubMed

    Swaminathan, Santhanam; Torino, Jennifer L; Burger, Melissa S

    2002-01-29

    The effect of the tumor suppressor gene TP53 on repair of genomic DNA damage was examined in human urinary bladder transitional cell carcinoma (TCC) cell lines. Utilizing TCC10 containing wild-type p53 (wt-p53) as the parental line, an isogenic set of cell lines was derived by retroviral infection that expressed a transdominant mutant p53 (Arg --> His at codon 273, TDM273-TCC10), or the human papilloma virus 16-E6 oncoprotein (E6-TCC10). 32P-postlabeling analyses were performed on DNA from TCC cultures obtained after treatment with N-hydroxy-4-aminobiphenyl (N-OH-ABP), N-hydroxy-4-acetylaminobiphenyl (N-OH-AABP) and N-acetoxy-4-acetylaminobiphenyl (N-OAc-AABP). The major adduct was identified as N-(deoxyguanosin-8-yl)-4-aminobiphenyl (dG-C8-ABP) with all three chemicals. The amount of adducts in urothelial DNA ranged between 0.1 and 20 per 10(6) nucleotides, N-OAc-AABP yielding the highest levels, followed by N-OH-ABP and N-OH-AABP. To determine, if the functional status of p53 affects the rate of repair of dG-C8-ABP in genomic DNA, TCC10 and the TDM273-TCC10 and E6-TCC10 isotypes were exposed to N-OH-AABP for 12h and the DNA damage was allowed to repair up to 24h. The adduct levels were quantified and compared between the TCC10 isotypes. The amounts of dG-C8-ABP that remained in genomic DNA from E6-TCC10 and TDM273-TCC10 were approximately two-fold higher, as compared to the parental TCC10. At the dose used for DNA repair studies, N-OH-AABP or N-OAc-AABP did not induce apoptosis in TCC10. However, N-OAc-AABP at high doses (>5 microM) induced apoptosis, as evidenced by DNA fragmentation analyses. Furthermore, N-OAc-AABP-mediated apoptosis was independent of the functional status of wt-p53, since both E6-TCC10 and the parental TCC10 exhibited DNA fragmentation following treatment. These results suggest that p53 might modulate the repair of DNA adducts generated from the human bladder carcinogen ABP in its target human uroepithelial cells. This implies that in p53

  4. Automated sample-preparation technologies in genome sequencing projects.

    PubMed

    Hilbert, H; Lauber, J; Lubenow, H; Düsterhöft, A

    2000-01-01

    A robotic workstation system (BioRobot 96OO, QIAGEN) and a 96-well UV spectrophotometer (Spectramax 250, Molecular Devices) were integrated in to the process of high-throughput automated sequencing of double-stranded plasmid DNA templates. An automated 96-well miniprep kit protocol (QIAprep Turbo, QIAGEN) provided high-quality plasmid DNA from shotgun clones. The DNA prepared by this procedure was used to generate more than two mega bases of final sequence data for two genomic projects (Arabidopsis thaliana and Schizosaccharomyces pombe), three thousand expressed sequence tags (ESTs) plus half a mega base of human full-length cDNA clones, and approximately 53,000 single reads for a whole genome shotgun project (Pseudomonas putida).

  5. Complete cpDNA genome sequence of Smilax china and phylogenetic placement of Liliales--influences of gene partitions and taxon sampling.

    PubMed

    Liu, Juan; Qi, Zhe-Chen; Zhao, Yun-Peng; Fu, Cheng-Xin; Jenny Xiang, Qiu-Yun

    2012-09-01

    The complete nucleotide sequence of the chloroplast genome (cpDNA) of Smilax china L. (Smilacaceae) is reported. It is the first complete cp genome sequence in Liliales. Genomic analyses were conducted to examine the rate and pattern of cpDNA genome evolution in Smilax relative to other major lineages of monocots. The cpDNA genomic sequences were combined with those available for Lilium to evaluate the phylogenetic position of Liliales and to investigate the influence of taxon sampling, gene sampling, gene function, natural selection, and substitution rate on phylogenetic inference in monocots. Phylogenetic analyses using sequence data of gene groups partitioned according to gene function, selection force, and total substitution rate demonstrated evident impacts of these factors on phylogenetic inference of monocots and the placement of Liliales, suggesting potential evolutionary convergence or adaptation of some cpDNA genes in monocots. Our study also demonstrated that reduced taxon sampling reduced the bootstrap support for the placement of Liliales in the cpDNA phylogenomic analysis. Analyses of sequences of 77 protein genes with some missing data and sequences of 81 genes (all protein genes plus the rRNA genes) support a sister relationship of Liliales to the commelinids-Asparagales clade, consistent with the APG III system. Analyses of 63 cpDNA protein genes for 32 taxa with few missing data, however, support a sister relationship of Liliales (represented by Smilax and Lilium) to Dioscoreales-Pandanales. Topology tests indicated that these two alignments do not significantly differ given any of these three cpDNA genomic sequence data sets. Furthermore, we found no saturation effect of the data, suggesting that the cpDNA genomic sequence data used in the study are appropriate for monocot phylogenetic study and long-branch attraction is unlikely to be the cause to explain the result of two well-supported, conflict placements of Liliales. Further analyses using

  6. A novel method of genomic DNA extraction for Cactaceae1

    PubMed Central

    Fehlberg, Shannon D.; Allen, Jessica M.; Church, Kathleen

    2013-01-01

    • Premise of the study: Genetic studies of Cactaceae can at times be impeded by difficult sampling logistics and/or high mucilage content in tissues. Simplifying sampling and DNA isolation through the use of cactus spines has not previously been investigated. • Methods and Results: Several protocols for extracting DNA from spines were tested and modified to maximize yield, amplification, and sequencing. Sampling of and extraction from spines resulted in a simplified protocol overall and complete avoidance of mucilage as compared to typical tissue extractions. Sequences from one nuclear and three plastid regions were obtained across eight genera and 20 species of cacti using DNA extracted from spines. • Conclusions: Genomic DNA useful for amplification and sequencing can be obtained from cactus spines. The protocols described here are valuable for any cactus species, but are particularly useful for investigators interested in sampling living collections, extensive field sampling, and/or conservation genetic studies. PMID:25202521

  7. Comprehensive definition of genome features in Spirodela polyrhiza by high-depth physical mapping and short-read DNA sequencing strategies.

    PubMed

    Michael, Todd P; Bryant, Douglas; Gutierrez, Ryan; Borisjuk, Nikolai; Chu, Philomena; Zhang, Hanzhong; Xia, Jing; Zhou, Junfei; Peng, Hai; El Baidouri, Moaine; Ten Hallers, Boudewijn; Hastie, Alex R; Liang, Tiffany; Acosta, Kenneth; Gilbert, Sarah; McEntee, Connor; Jackson, Scott A; Mockler, Todd C; Zhang, Weixiong; Lam, Eric

    2017-02-01

    Spirodela polyrhiza is a fast-growing aquatic monocot with highly reduced morphology, genome size and number of protein-coding genes. Considering these biological features of Spirodela and its basal position in the monocot lineage, understanding its genome architecture could shed light on plant adaptation and genome evolution. Like many draft genomes, however, the 158-Mb Spirodela genome sequence has not been resolved to chromosomes, and important genome characteristics have not been defined. Here we deployed rapid genome-wide physical maps combined with high-coverage short-read sequencing to resolve the 20 chromosomes of Spirodela and to empirically delineate its genome features. Our data revealed a dramatic reduction in the number of the rDNA repeat units in Spirodela to fewer than 100, which is even fewer than that reported for yeast. Consistent with its unique phylogenetic position, small RNA sequencing revealed 29 Spirodela-specific microRNA, with only two being shared with Elaeis guineensis (oil palm) and Musa balbisiana (banana). Combining DNA methylation data and small RNA sequencing enabled the accurate prediction of 20.5% long terminal repeats (LTRs) that doubled the previous estimate, and revealed a high Solo:Intact LTR ratio of 8.2. Interestingly, we found that Spirodela has the lowest global DNA methylation levels (9%) of any plant species tested. Taken together our results reveal a genome that has undergone reduction, likely through eliminating non-essential protein coding genes, rDNA and LTRs. In addition to delineating the genome features of this unique plant, the methodologies described and large-scale genome resources from this work will enable future evolutionary and functional studies of this basal monocot family. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  8. Qualitative and quantitative evaluation of the genomic DNA extracted from GMO and non-GMO foodstuffs with four different extraction methods.

    PubMed

    Peano, Clelia; Samson, Maria Cristina; Palmieri, Luisa; Gulli, Mariolina; Marmiroli, Nelson

    2004-11-17

    The presence of DNA in foodstuffs derived from or containing genetically modified organisms (GMO) is the basic requirement for labeling of GMO foods in Council Directive 2001/18/CE (Off. J. Eur. Communities 2001, L1 06/2). In this work, four different methods for DNA extraction were evaluated and compared. To rank the different methods, the quality and quantity of DNA extracted from standards, containing known percentages of GMO material and from different food products, were considered. The food products analyzed derived from both soybean and maize and were chosen on the basis of the mechanical, technological, and chemical treatment they had been subjected to during processing. Degree of DNA degradation at various stages of food production was evaluated through the amplification of different DNA fragments belonging to the endogenous genes of both maize and soybean. Genomic DNA was extracted from Roundup Ready soybean and maize MON810 standard flours, according to four different methods, and quantified by real-time Polymerase Chain Reaction (PCR), with the aim of determining the influence of the extraction methods on the DNA quantification through real-time PCR.

  9. Extensive sequence-influenced DNA methylation polymorphism in the human genome

    PubMed Central

    2010-01-01

    Background Epigenetic polymorphisms are a potential source of human diversity, but their frequency and relationship to genetic polymorphisms are unclear. DNA methylation, an epigenetic mark that is a covalent modification of the DNA itself, plays an important role in the regulation of gene expression. Most studies of DNA methylation in mammalian cells have focused on CpG methylation present in CpG islands (areas of concentrated CpGs often found near promoters), but there are also interesting patterns of CpG methylation found outside of CpG islands. Results We compared DNA methylation patterns on both alleles between many pairs (and larger groups) of related and unrelated individuals. Direct observation and simulation experiments revealed that around 10% of common single nucleotide polymorphisms (SNPs) reside in regions with differences in the propensity for local DNA methylation between the two alleles. We further showed that for the most common form of SNP, a polymorphism at a CpG dinucleotide, the presence of the CpG at the SNP positively affected local DNA methylation in cis. Conclusions Taken together with the known effect of DNA methylation on mutation rate, our results suggest an interesting interdependence between genetics and epigenetics underlying diversity in the human genome. PMID:20497546

  10. The mitochondrial genome of the pathogenic yeast Candida subhashii: GC-rich linear DNA with a protein covalently attached to the 5′ termini

    PubMed Central

    Fricova, Dominika; Valach, Matus; Farkas, Zoltan; Pfeiffer, Ilona; Kucsera, Judit; Tomaska, Lubomir; Nosek, Jozef

    2010-01-01

    As a part of our initiative aimed at a large-scale comparative analysis of fungal mitochondrial genomes, we determined the complete DNA sequence of the mitochondrial genome of the yeast Candida subhashii and found that it exhibits a number of peculiar features. First, the mitochondrial genome is represented by linear dsDNA molecules of uniform length (29 795 bp), with an unusually high content of guanine and cytosine residues (52.7 %). Second, the coding sequences lack introns; thus, the genome has a relatively compact organization. Third, the termini of the linear molecules consist of long inverted repeats and seem to contain a protein covalently bound to terminal nucleotides at the 5′ ends. This architecture resembles the telomeres in a number of linear viral and plasmid DNA genomes classified as invertrons, in which the terminal proteins serve as specific primers for the initiation of DNA synthesis. Finally, although the mitochondrial genome of C. subhashii contains essentially the same set of genes as other closely related pathogenic Candida species, we identified additional ORFs encoding two homologues of the family B protein-priming DNA polymerases and an unknown protein. The terminal structures and the genes for DNA polymerases are reminiscent of linear mitochondrial plasmids, indicating that this genome architecture might have emerged from fortuitous recombination between an ancestral, presumably circular, mitochondrial genome and an invertron-like element. PMID:20395267

  11. Double-strand breaks in genome-sized DNA caused by mechanical stress under mixing: Quantitative evaluation through single-molecule observation

    NASA Astrophysics Data System (ADS)

    Kikuchi, Hayato; Nose, Keiji; Yoshikawa, Yuko; Yoshikawa, Kenichi

    2018-06-01

    It is becoming increasingly apparent that changes in the higher-order structure of genome-sized DNA molecules of more than several tens kbp play important roles in the self-control of genome activity in living cells. Unfortunately, it has been rather difficult to prepare genome-sized DNA molecules without damage or fragmentation. Here, we evaluated the degree of double-strand breaks (DSBs) caused by mechanical mixing by single-molecule observation with fluorescence microscopy. The results show that DNA breaks are most significant for the first second after the initiation of mechanical agitation. Based on such observation, we propose a novel mixing procedure to significantly decrease DSBs.

  12. Rescue of a Porcine Anellovirus (Torque Teno Sus Virus 2) from Cloned Genomic DNA in Pigs

    PubMed Central

    Huang, Yao-Wei; Patterson, Abby R.; Opriessnig, Tanja; Dryman, Barbara A.; Gallei, Andreas; Harrall, Kylie K.; Vaughn, Eric M.; Roof, Michael B.

    2012-01-01

    Anelloviruses are a group of single-stranded circular DNA viruses infecting humans and other animal species. Animal models combined with reverse genetic systems of anellovirus have not been developed. We report here the construction and initial characterization of full-length DNA clones of a porcine anellovirus, torque teno sus virus 2 (TTSuV2), in vitro and in vivo. We first demonstrated that five cell lines, including PK-15 cells, are free of TTSuV1 or TTSuV2 contamination, as determined by a real-time PCR and an immunofluorescence assay (IFA) using anti-TTSuV antibodies. Recombinant plasmids harboring monomeric or tandem-dimerized genomic DNA of TTSuV2 from the United States and Germany were constructed. Circular TTSuV2 genomic DNA with or without introduced genetic markers and tandem-dimerized TTSuV2 plasmids were transfected into PK-15 cells, respectively. Splicing of viral mRNAs was identified in transfected cells. Expression of TTSuV2-specific open reading frame 1 (ORF1) in cell nuclei, especially in nucleoli, was detected by IFA. However, evidence of productive TTSuV2 infection was not observed in 12 different cell lines transfected with the TTSuV2 DNA clones. Transfection with circular DNA from a TTSuV2 deletion mutant did not produce ORF1 protein, suggesting that the observed ORF1 expression is driven by TTSuV2 DNA replication in cells. Pigs inoculated with either the tandem-dimerized clones or circular genomic DNA of U.S. TTSuV2 developed viremia, and the introduced genetic markers were retained in viral DNA recovered from the sera of infected pigs. The availability of an infectious DNA clone of TTSuV2 will facilitate future study of porcine anellovirus pathogenesis and biology. PMID:22491450

  13. Determination of the melon chloroplast and mitochondrial genome sequences reveals that the largest reported mitochondrial genome in plants contains a significant amount of DNA having a nuclear origin

    PubMed Central

    2011-01-01

    Background The melon belongs to the Cucurbitaceae family, whose economic importance among vegetable crops is second only to Solanaceae. The melon has a small genome size (454 Mb), which makes it suitable for molecular and genetic studies. Despite similar nuclear and chloroplast genome sizes, cucurbits show great variation when their mitochondrial genomes are compared. The melon possesses the largest plant mitochondrial genome, as much as eight times larger than that of other cucurbits. Results The nucleotide sequences of the melon chloroplast and mitochondrial genomes were determined. The chloroplast genome (156,017 bp) included 132 genes, with 98 single-copy genes dispersed between the small (SSC) and large (LSC) single-copy regions and 17 duplicated genes in the inverted repeat regions (IRa and IRb). A comparison of the cucumber and melon chloroplast genomes showed differences in only approximately 5% of nucleotides, mainly due to short indels and SNPs. Additionally, 2.74 Mb of mitochondrial sequence, accounting for 95% of the estimated mitochondrial genome size, were assembled into five scaffolds and four additional unscaffolded contigs. An 84% of the mitochondrial genome is contained in a single scaffold. The gene-coding region accounted for 1.7% (45,926 bp) of the total sequence, including 51 protein-coding genes, 4 conserved ORFs, 3 rRNA genes and 24 tRNA genes. Despite the differences observed in the mitochondrial genome sizes of cucurbit species, Citrullus lanatus (379 kb), Cucurbita pepo (983 kb) and Cucumis melo (2,740 kb) share 120 kb of sequence, including the predicted protein-coding regions. Nevertheless, melon contained a high number of repetitive sequences and a high content of DNA of nuclear origin, which represented 42% and 47% of the total sequence, respectively. Conclusions Whereas the size and gene organisation of chloroplast genomes are similar among the cucurbit species, mitochondrial genomes show a wide variety of sizes, with a non

  14. iMETHYL: an integrative database of human DNA methylation, gene expression, and genomic variation.

    PubMed

    Komaki, Shohei; Shiwa, Yuh; Furukawa, Ryohei; Hachiya, Tsuyoshi; Ohmomo, Hideki; Otomo, Ryo; Satoh, Mamoru; Hitomi, Jiro; Sobue, Kenji; Sasaki, Makoto; Shimizu, Atsushi

    2018-01-01

    We launched an integrative multi-omics database, iMETHYL (http://imethyl.iwate-megabank.org). iMETHYL provides whole-DNA methylation (~24 million autosomal CpG sites), whole-genome (~9 million single-nucleotide variants), and whole-transcriptome (>14 000 genes) data for CD4 + T-lymphocytes, monocytes, and neutrophils collected from approximately 100 subjects. These data were obtained from whole-genome bisulfite sequencing, whole-genome sequencing, and whole-transcriptome sequencing, making iMETHYL a comprehensive database.

  15. Modeling the relaxation of internal DNA segments during genome mapping in nanochannels.

    PubMed

    Jain, Aashish; Sheats, Julian; Reifenberger, Jeffrey G; Cao, Han; Dorfman, Kevin D

    2016-09-01

    We have developed a multi-scale model describing the dynamics of internal segments of DNA in nanochannels used for genome mapping. In addition to the channel geometry, the model takes as its inputs the DNA properties in free solution (persistence length, effective width, molecular weight, and segmental hydrodynamic radius) and buffer properties (temperature and viscosity). Using pruned-enriched Rosenbluth simulations of a discrete wormlike chain model with circa 10 base pair resolution and a numerical solution for the hydrodynamic interactions in confinement, we convert these experimentally available inputs into the necessary parameters for a one-dimensional, Rouse-like model of the confined chain. The resulting coarse-grained model resolves the DNA at a length scale of approximately 6 kilobase pairs in the absence of any global hairpin folds, and is readily studied using a normal-mode analysis or Brownian dynamics simulations. The Rouse-like model successfully reproduces both the trends and order of magnitude of the relaxation time of the distance between labeled segments of DNA obtained in experiments. The model also provides insights that are not readily accessible from experiments, such as the role of the molecular weight of the DNA and location of the labeled segments that impact the statistical models used to construct genome maps from data acquired in nanochannels. The multi-scale approach used here, while focused towards a technologically relevant scenario, is readily adapted to other channel sizes and polymers.

  16. The 'dark matter' in the plant genomes: non-coding and unannotated DNA sequences associated with open chromatin.

    PubMed

    Jiang, Jiming

    2015-04-01

    Sequencing of complete plant genomes has become increasingly more routine since the advent of the next-generation sequencing technology. Identification and annotation of large amounts of noncoding but functional DNA sequences, including cis-regulatory DNA elements (CREs), have become a new frontier in plant genome research. Genomic regions containing active CREs bound to regulatory proteins are hypersensitive to DNase I digestion and are called DNase I hypersensitive sites (DHSs). Several recent DHS studies in plants illustrate that DHS datasets produced by DNase I digestion followed by next-generation sequencing (DNase-seq) are highly valuable for the identification and characterization of CREs associated with plant development and responses to environmental cues. DHS-based genomic profiling has opened a door to identify and annotate the 'dark matter' in sequenced plant genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. Sequencing of whole plastid genomes and nuclear ribosomal DNA of Diospyros species (Ebenaceae) endemic to New Caledonia: many species, little divergence

    PubMed Central

    Turner, Barbara; Paun, Ovidiu; Munzinger, Jérôme; Chase, Mark W.; Samuel, Rosabelle

    2016-01-01

    Background and Aims Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely related Diospyros species. Methods Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices. Key Results The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species. Conclusions In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction. Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared with

  18. Evolutionary Analyses of Entire Genomes Do Not Support the Association of mtDNA Mutations with Ras/MAPK Pathway Syndromes

    PubMed Central

    Cerezo, María; Balboa, Emilia; Heredia, Claudia; Castro-Feijóo, Lidia; Rica, Itxaso; Barreiro, Jesús; Eirís, Jesús; Cabanas, Paloma; Martínez-Soto, Isabel; Fernández-Toral, Joaquín; Castro-Gago, Manuel; Pombo, Manuel; Carracedo, Ángel; Barros, Francisco

    2011-01-01

    Background There are several known autosomal genes responsible for Ras/MAPK pathway syndromes, including Noonan syndrome (NS) and related disorders (such as LEOPARD, neurofibromatosis type 1), although mutations of these genes do not explain all cases. Due to the important role played by the mitochondrion in the energetic metabolism of cardiac muscle, it was recently proposed that variation in the mitochondrial DNA (mtDNA) genome could be a risk factor in the Noonan phenotype and in hypertrophic cardiomyopathy (HCM), which is a common clinical feature in Ras/MAPK pathway syndromes. In order to test these hypotheses, we sequenced entire mtDNA genomes in the largest series of patients suffering from Ras/MAPK pathway syndromes analyzed to date (n = 45), most of them classified as NS patients (n = 42). Methods/Principal Findings The results indicate that the observed mtDNA lineages were mostly of European ancestry, reproducing in a nutshell the expected haplogroup (hg) patterns of a typical Iberian dataset (including hgs H, T, J, and U). Three new branches of the mtDNA phylogeny (H1j1, U5b1e, and L2a5) are described for the first time, but none of these are likely to be related to NS or Ras/MAPK pathway syndromes when observed under an evolutionary perspective. Patterns of variation in tRNA and protein genes, as well as redundant, private and heteroplasmic variants, in the mtDNA genomes of patients were as expected when compared with the patterns inferred from a worldwide mtDNA phylogeny based on more than 8700 entire genomes. Moreover, most of the mtDNA variants found in patients had already been reported in healthy individuals and constitute common polymorphisms in human population groups. Conclusions/Significance As a whole, the observed mtDNA genome variation in the NS patients was difficult to reconcile with previous findings that indicated a pathogenic role of mtDNA variants in NS. PMID:21526175

  19. Optimisation of DNA extraction from the crustacean Daphnia

    PubMed Central

    Athanasio, Camila Gonçalves; Chipman, James K.; Viant, Mark R.

    2016-01-01

    Daphnia are key model organisms for mechanistic studies of phenotypic plasticity, adaptation and microevolution, which have led to an increasing demand for genomics resources. A key step in any genomics analysis, such as high-throughput sequencing, is the availability of sufficient and high quality DNA. Although commercial kits exist to extract genomic DNA from several species, preparation of high quality DNA from Daphnia spp. and other chitinous species can be challenging. Here, we optimise methods for tissue homogenisation, DNA extraction and quantification customised for different downstream analyses (e.g., LC-MS/MS, Hiseq, mate pair sequencing or Nanopore). We demonstrate that if Daphnia magna are homogenised as whole animals (including the carapace), absorbance-based DNA quantification methods significantly over-estimate the amount of DNA, resulting in using insufficient starting material for experiments, such as preparation of sequencing libraries. This is attributed to the high refractive index of chitin in Daphnia’s carapace at 260 nm. Therefore, unless the carapace is removed by overnight proteinase digestion, the extracted DNA should be quantified with fluorescence-based methods. However, overnight proteinase digestion will result in partial fragmentation of DNA therefore the prepared DNA is not suitable for downstream methods that require high molecular weight DNA, such as PacBio, mate pair sequencing and Nanopore. In conclusion, we found that the MasterPure DNA purification kit, coupled with grinding of frozen tissue, is the best method for extraction of high molecular weight DNA as long as the extracted DNA is quantified with fluorescence-based methods. This method generated high yield and high molecular weight DNA (3.10 ± 0.63 ng/µg dry mass, fragments >60 kb), free of organic contaminants (phenol, chloroform) and is suitable for large number of downstream analyses. PMID:27190714

  20. Is “Junk” DNA Mostly Intron DNA?

    PubMed Central

    Wong, Gane Ka-Shu; Passey, Douglas A.; Huang, Ying-zong; Yang, Zhiyong; Yu, Jun

    2000-01-01

    Among higher eukaryotes, very little of the genome codes for protein. What is in the rest of the genome, or the “junk” DNA, that, in Homo sapiens, is estimated to be almost 97% of the genome? Is it possible that much of this “junk” is intron DNA? This is not a question that can be answered just by looking at the published data, even from the finished genomes. One cannot assume that there are no genes in a sequenced region, just because no genes were annotated. We introduce another approach to this problem, based on an analysis of the cDNA-to-genomic alignments, in all of the complete or nearly-complete genomes from the multicellular organisms. Our conclusion is that, in animals but not in plants, most of the “junk” is intron DNA. PMID:11076852

  1. Fixing Formalin: A Method to Recover Genomic-Scale DNA Sequence Data from Formalin-Fixed Museum Specimens Using High-Throughput Sequencing

    PubMed Central

    Hykin, Sarah M.; Bi, Ke; McGuire, Jimmy A.

    2015-01-01

    For 150 years or more, specimens were routinely collected and deposited in natural history collections without preserving fresh tissue samples for genetic analysis. In the case of most herpetological specimens (i.e. amphibians and reptiles), attempts to extract and sequence DNA from formalin-fixed, ethanol-preserved specimens—particularly for use in phylogenetic analyses—has been laborious and largely ineffective due to the highly fragmented nature of the DNA. As a result, tens of thousands of specimens in herpetological collections have not been available for sequence-based phylogenetic studies. Massively parallel High-Throughput Sequencing methods and the associated bioinformatics, however, are particularly suited to recovering meaningful genetic markers from severely degraded/fragmented DNA sequences such as DNA damaged by formalin-fixation. In this study, we compared previously published DNA extraction methods on three tissue types subsampled from formalin-fixed specimens of Anolis carolinensis, followed by sequencing. Sufficient quality DNA was recovered from liver tissue, making this technique minimally destructive to museum specimens. Sequencing was only successful for the more recently collected specimen (collected ~30 ybp). We suspect this could be due either to the conditions of preservation and/or the amount of tissue used for extraction purposes. For the successfully sequenced sample, we found a high rate of base misincorporation. After rigorous trimming, we successfully mapped 27.93% of the cleaned reads to the reference genome, were able to reconstruct the complete mitochondrial genome, and recovered an accurate phylogenetic placement for our specimen. We conclude that the amount of DNA available, which can vary depending on specimen age and preservation conditions, will determine if sequencing will be successful. The technique described here will greatly improve the value of museum collections by making many formalin-fixed specimens available for

  2. Fixing Formalin: A Method to Recover Genomic-Scale DNA Sequence Data from Formalin-Fixed Museum Specimens Using High-Throughput Sequencing.

    PubMed

    Hykin, Sarah M; Bi, Ke; McGuire, Jimmy A

    2015-01-01

    For 150 years or more, specimens were routinely collected and deposited in natural history collections without preserving fresh tissue samples for genetic analysis. In the case of most herpetological specimens (i.e. amphibians and reptiles), attempts to extract and sequence DNA from formalin-fixed, ethanol-preserved specimens-particularly for use in phylogenetic analyses-has been laborious and largely ineffective due to the highly fragmented nature of the DNA. As a result, tens of thousands of specimens in herpetological collections have not been available for sequence-based phylogenetic studies. Massively parallel High-Throughput Sequencing methods and the associated bioinformatics, however, are particularly suited to recovering meaningful genetic markers from severely degraded/fragmented DNA sequences such as DNA damaged by formalin-fixation. In this study, we compared previously published DNA extraction methods on three tissue types subsampled from formalin-fixed specimens of Anolis carolinensis, followed by sequencing. Sufficient quality DNA was recovered from liver tissue, making this technique minimally destructive to museum specimens. Sequencing was only successful for the more recently collected specimen (collected ~30 ybp). We suspect this could be due either to the conditions of preservation and/or the amount of tissue used for extraction purposes. For the successfully sequenced sample, we found a high rate of base misincorporation. After rigorous trimming, we successfully mapped 27.93% of the cleaned reads to the reference genome, were able to reconstruct the complete mitochondrial genome, and recovered an accurate phylogenetic placement for our specimen. We conclude that the amount of DNA available, which can vary depending on specimen age and preservation conditions, will determine if sequencing will be successful. The technique described here will greatly improve the value of museum collections by making many formalin-fixed specimens available for

  3. Comparison of randomly cloned and whole genomic DNA probes for the detection of Porphyromonas gingivalis and Bacteroides forsythus

    PubMed Central

    Wong, M.; DiRienzo, J.M.; Lai, C.-H.; Listgarten, M. A.

    2012-01-01

    Whole genomic and randomly-cloned DNA probes for two fastidious periodontal pathogens, Porphyromonas gingivalis and Bacteroides forsythus were labeled with digoxigenin and detected by a colorimetric method. The specificity and sensitivity of the whole genomic and cloned probes were compared. The cloned probes were highly specific compared to the whole genomic probes. A significant degree of cross-reactivity with Bacteroides species. Capnocytophaga sp. and Prevotella sp. was observed with the whole genomic probes. The cloned probes were less sensitive than the whole genomic probes and required at least 106 target cells or a minimum of 10 ng of target DNA to be detected during hybridization. Although a ten-fold increase in sensitivity was obtained with the whole genomic probes, cross-hybridization to closely related species limits their reliability in identifying target bacteria in subgingival plaque samples. PMID:8636873

  4. Fine organization of genomic regions tagged to the 5S rDNA locus of the bread wheat 5B chromosome.

    PubMed

    Sergeeva, Ekaterina M; Shcherban, Andrey B; Adonina, Irina G; Nesterov, Michail A; Beletsky, Alexey V; Rakitin, Andrey L; Mardanov, Andrey V; Ravin, Nikolai V; Salina, Elena A

    2017-11-14

    The multigene family encoding the 5S rRNA, one of the most important structurally-functional part of the large ribosomal subunit, is an obligate component of all eukaryotic genomes. 5S rDNA has long been a favored target for cytological and phylogenetic studies due to the inherent peculiarities of its structural organization, such as the tandem arrays of repetitive units and their high interspecific divergence. The complex polyploid nature of the genome of bread wheat, Triticum aestivum, and the technically difficult task of sequencing clusters of tandem repeats mean that the detailed organization of extended genomic regions containing 5S rRNA genes remains unclear. This is despite the recent progress made in wheat genomic sequencing. Using pyrosequencing of BAC clones, in this work we studied the organization of two distinct 5S rDNA-tagged regions of the 5BS chromosome of bread wheat. Three BAC-clones containing 5S rDNA were identified in the 5BS chromosome-specific BAC-library of Triticum aestivum. Using the results of pyrosequencing and assembling, we obtained six 5S rDNA- containing contigs with a total length of 140,417 bp, and two sets (pools) of individual 5S rDNA sequences belonging to separate, but closely located genomic regions on the 5BS chromosome. Both regions are characterized by the presence of approximately 70-80 copies of 5S rDNA, however, they are completely different in their structural organization. The first region contained highly diverged short-type 5S rDNA units that were disrupted by multiple insertions of transposable elements. The second region contained the more conserved long-type 5S rDNA, organized as a single tandem array. FISH using probes specific to both 5S rDNA unit types showed differences in the distribution and intensity of signals on the chromosomes of polyploid wheat species and their diploid progenitors. A detailed structural organization of two closely located 5S rDNA-tagged genomic regions on the 5BS chromosome of bread

  5. Mitochondrial comparative genomics and phylogenetic signal assessment of mtDNA among arbuscular mycorrhizal fungi.

    PubMed

    Nadimi, Maryam; Daubois, Laurence; Hijri, Mohamed

    2016-05-01

    Mitochondrial (mt) genes, such as cytochrome C oxidase genes (cox), have been widely used for barcoding in many groups of organisms, although this approach has been less powerful in the fungal kingdom due to the rapid evolution of their mt genomes. The use of mt genes in phylogenetic studies of Dikarya has been met with success, while early diverging fungal lineages remain less studied, particularly the arbuscular mycorrhizal fungi (AMF). Advances in next-generation sequencing have substantially increased the number of publically available mtDNA sequences for the Glomeromycota. As a result, comparison of mtDNA across key AMF taxa can now be applied to assess the phylogenetic signal of individual mt coding genes, as well as concatenated subsets of coding genes. Here we show comparative analyses of publically available mt genomes of Glomeromycota, augmented with two mtDNA genomes that were newly sequenced for this study (Rhizophagus irregularis DAOM240159 and Glomus aggregatum DAOM240163), resulting in 16 complete mtDNA datasets. R. irregularis isolate DAOM240159 and G. aggregatum isolate DAOM240163 showed mt genomes measuring 72,293bp and 69,505bp with G+C contents of 37.1% and 37.3%, respectively. We assessed the phylogenies inferred from single mt genes and complete sets of coding genes, which are referred to as "supergenes" (16 concatenated coding genes), using Shimodaira-Hasegawa tests, in order to identify genes that best described AMF phylogeny. We found that rnl, nad5, cox1, and nad2 genes, as well as concatenated subset of these genes, provided phylogenies that were similar to the supergene set. This mitochondrial genomic analysis was also combined with principal coordinate and partitioning analyses, which helped to unravel certain evolutionary relationships in the Rhizophagus genus and for G. aggregatum within the Glomeromycota. We showed evidence to support the position of G. aggregatum within the R. irregularis 'species complex'. Copyright © 2016

  6. DNA quality and quantity from up to 16 years old post-mortem blood stored on FTA cards.

    PubMed

    Rahikainen, Anna-Liina; Palo, Jukka U; de Leeuw, Wiljo; Budowle, Bruce; Sajantila, Antti

    2016-04-01

    Blood samples preserved on FTA cards offer unique opportunities for genetic research. DNA recovered from these cards should be stable for long periods of time. However, it is not well established as how well the DNA stored on FTA card for substantial time periods meets the demands of forensic or genomic DNA analyses and especially so for from post-mortem (PM) samples in which the quality can vary upon initial collection. The aim of this study was to evaluate the time-dependent degradation on DNA quality and quantity extracted from up to 16 years old post-mortem bloodstained FTA cards. Four random FTA samples from eight time points spanning 1998 to 2013 (n=32) were collected and extracted in triplicate. The quantity and quality of the extracted DNA samples were determined with Quantifiler(®) Human Plus (HP) Quantification kit. Internal sample and sample-to-sample variation were evaluated by comparing recovered DNA yields. The DNA from the triplicate samplings were subsequently combined and normalized for further analysis. The practical effect of degradation on DNA quality was evaluated from normalized samples both with forensic and pharmacogenetic target markers. Our results suggest that (1) a PM change, e.g. blood clotting prior to sampling, affects the recovered DNA yield, creating both internal and sample-to-sample variation; (2) a negative correlation between the FTA card storage time and DNA quantity (r=-0.836 at the 0.01 level) was observed; (3) a positive correlation (r=0.738 at the level 0.01) was found between FTA card storage time and degradation levels. However, no inhibition was observed with the method used. The effect of degradation was manifested clearly with functional applications. Although complete STR-profiles were obtained for all samples, there was evidence of degradation manifested as decreased peak heights in the larger-sized amplicons. Lower amplification success was notable with the large 5.1 kb CYP2D6 gene fragment which strongly supports

  7. Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase.

    PubMed

    Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V

    2006-10-15

    The 16,937-nuceotide sequence of the linear mitochondrial DNA (mt-DNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scypozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities; transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.

  8. Ultra-barcoding in cacao (Theobroma spp.; malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA

    USDA-ARS?s Scientific Manuscript database

    High-throughput next-generation sequencing was used to scan the genome and generate reliable sequence of high copy number regions. Using this method, we examined whole plastid genomes as well as nearly 6000 bases of nuclear ribosomal DNA sequences for nine genotypes of Theobroma cacao and an indivi...

  9. Animal Mitochondrial DNA as We Do Not Know It: mt-Genome Organization and Evolution in Nonbilaterian Lineages

    PubMed Central

    Pett, Walker

    2016-01-01

    Abstract Animal mitochondrial DNA (mtDNA) is commonly described as a small, circular molecule that is conserved in size, gene content, and organization. Data collected in the last decade have challenged this view by revealing considerable diversity in animal mitochondrial genome organization. Much of this diversity has been found in nonbilaterian animals (phyla Cnidaria, Ctenophora, Placozoa, and Porifera), which, from a phylogenetic perspective, form the main branches of the animal tree along with Bilateria. Within these groups, mt-genomes are characterized by varying numbers of both linear and circular chromosomes, extra genes (e.g. atp9, polB, tatC), large variation in the number of encoded mitochondrial transfer RNAs (tRNAs) (0–25), at least seven different genetic codes, presence/absence of introns, tRNA and mRNA editing, fragmented ribosomal RNA genes, translational frameshifting, highly variable substitution rates, and a large range of genome sizes. This newly discovered diversity allows a better understanding of the evolutionary plasticity and conservation of animal mtDNA and provides insights into the molecular and evolutionary mechanisms shaping mitochondrial genomes. PMID:27557826

  10. Transmission of human mtDNA heteroplasmy in the Genome of the Netherlands families: support for a variable-size bottleneck

    PubMed Central

    Li, Mingkun; Rothwell, Rebecca; Vermaat, Martijn; Wachsmuth, Manja; Schröder, Roland; Laros, Jeroen F.J.; van Oven, Mannis; de Bakker, Paul I.W.; Bovenberg, Jasper A.; van Duijn, Cornelia M.; van Ommen, Gert-Jan B.; Slagboom, P. Eline; Swertz, Morris A.; Wijmenga, Cisca; Kayser, Manfred; Boomsma, Dorret I.; Zöllner, Sebastian; de Knijff, Peter; Stoneking, Mark

    2016-01-01

    Although previous studies have documented a bottleneck in the transmission of mtDNA genomes from mothers to offspring, several aspects remain unclear, including the size and nature of the bottleneck. Here, we analyze the dynamics of mtDNA heteroplasmy transmission in the Genomes of the Netherlands (GoNL) data, which consists of complete mtDNA genome sequences from 228 trios, eight dizygotic (DZ) twin quartets, and 10 monozygotic (MZ) twin quartets. Using a minor allele frequency (MAF) threshold of 2%, we identified 189 heteroplasmies in the trio mothers, of which 59% were transmitted to offspring, and 159 heteroplasmies in the trio offspring, of which 70% were inherited from the mothers. MZ twin pairs exhibited greater similarity in MAF at heteroplasmic sites than DZ twin pairs, suggesting that the heteroplasmy MAF in the oocyte is the major determinant of the heteroplasmy MAF in the offspring. We used a likelihood method to estimate the effective number of mtDNA genomes transmitted to offspring under different bottleneck models; a variable bottleneck size model provided the best fit to the data, with an estimated mean of nine individual mtDNA genomes transmitted. We also found evidence for negative selection during transmission against novel heteroplasmies (in which the minor allele has never been observed in polymorphism data). These novel heteroplasmies are enhanced for tRNA and rRNA genes, and mutations associated with mtDNA diseases frequently occur in these genes. Our results thus suggest that the female germ line is able to recognize and select against deleterious heteroplasmies. PMID:26916109

  11. Epigenetic Variation in Monozygotic Twins: A Genome-Wide Analysis of DNA Methylation in Buccal Cells

    PubMed Central

    van Dongen, Jenny; Ehli, Erik A.; Slieker, Roderick C.; Bartels, Meike; Weber, Zachary M.; Davies, Gareth E.; Slagboom, P. Eline; Heijmans, Bastiaan T.; Boomsma, Dorret I.

    2014-01-01

    DNA methylation is one of the most extensively studied epigenetic marks in humans. Yet, it is largely unknown what causes variation in DNA methylation between individuals. The comparison of DNA methylation profiles of monozygotic (MZ) twins offers a unique experimental design to examine the extent to which such variation is related to individual-specific environmental influences and stochastic events or to familial factors (DNA sequence and shared environment). We measured genome-wide DNA methylation in buccal samples from ten MZ pairs (age 8–19) using the Illumina 450k array and examined twin correlations for methylation level at 420,921 CpGs after QC. After selecting CpGs showing the most variation in the methylation level between subjects, the mean genome-wide correlation (rho) was 0.54. The correlation was higher, on average, for CpGs within CpG islands (CGIs), compared to CGI shores, shelves and non-CGI regions, particularly at hypomethylated CpGs. This finding suggests that individual-specific environmental and stochastic influences account for more variation in DNA methylation in CpG-poor regions. Our findings also indicate that it is worthwhile to examine heritable and shared environmental influences on buccal DNA methylation in larger studies that also include dizygotic twins. PMID:24802513

  12. Construction of an infectious genomic clone of porcine parvovirus: effect of the 5'-end on DNA replication.

    PubMed

    Casal, J I; Diaz-Aroca, E; Ranz, A I; Manclus, J J

    1990-08-01

    The linear single-stranded DNA genome of the porcine parvovirus, an autonomous parvovirus, was cloned in duplex form into the bacterial plasmid pUC18 using a simple and reliable method. These clones were stable during propagation in Escherichia coli JM109. The recombinant clones of porcine parvovirus were infectious when transfected into monolayers of swine testes cells as identified by the development of cytopathic effect, indirect immunofluorescence with specific antiserum, and hemagglutination assays. DNA isolated from progeny virus arising from transfected infectious clones was found to be indistinguishable from wild-type DNA by restriction enzyme analysis. Defective genomes could also be detected in the progeny DNA even though the infection was initiated with homogeneous, cloned DNA. The presence of the turn of the 5'-end loop seems to be necessary to get stable infectious clones.

  13. Genome-wide comparative analysis of DNA methylation between soybean cytoplasmic male-sterile line NJCMS5A and its maintainer NJCMS5B.

    PubMed

    Li, Yanwei; Ding, Xianlong; Wang, Xuan; He, Tingting; Zhang, Hao; Yang, Longshu; Wang, Tanliu; Chen, Linfeng; Gai, Junyi; Yang, Shouping

    2017-08-10

    DNA methylation is an important epigenetic modification. It can regulate the expression of many key genes without changing the primary structure of the genomic DNA, and plays a vital role in the growth and development of the organism. The genome-wide DNA methylation profile of the cytoplasmic male sterile (CMS) line in soybean has not been reported so far. In this study, genome-wide comparative analysis of DNA methylation between soybean CMS line NJCMS5A and its maintainer NJCMS5B was conducted by whole-genome bisulfite sequencing. The results showed 3527 differentially methylated regions (DMRs) and 485 differentially methylated genes (DMGs), including 353 high-credible methylated genes, 56 methylated genes coding unknown protein and 76 novel methylated genes with no known function were identified. Among them, 25 DMRs were further validated that the genome-wide DNA methylation data were reliable through bisulfite treatment, and 9 DMRs were confirmed the relationship between DNA methylation and gene expression by qRT-PCR. Finally, 8 key DMGs possibly associated with soybean CMS were identified. Genome-wide DNA methylation profile of the soybean CMS line NJCMS5A and its maintainer NJCMS5B was obtained for the first time. Several specific DMGs which participated in pollen and flower development were further identified to be probably associated with soybean CMS. This study will contribute to further understanding of the molecular mechanism behind soybean CMS.

  14. An Integrated Encyclopedia of DNA Elements in the Human Genome

    PubMed Central

    2012-01-01

    Summary The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure, and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall the project provides new insights into the organization and regulation of our genes and genome, and an expansive resource of functional annotations for biomedical research. PMID:22955616

  15. Comparative analysis of protocols for DNA extraction from soybean caterpillars.

    PubMed

    Palma, J; Valmorbida, I; da Costa, I F D; Guedes, J V C

    2016-04-07

    Genomic DNA extraction is crucial for molecular research, including diagnostic and genome characterization of different organisms. The aim of this study was to comparatively analyze protocols of DNA extraction based on cell lysis by sarcosyl, cetyltrimethylammonium bromide, and sodium dodecyl sulfate, and to determine the most efficient method applicable to soybean caterpillars. DNA was extracted from specimens of Chrysodeixis includens and Spodoptera eridania using the aforementioned three methods. DNA quantification was performed using spectrophotometry and high molecular weight DNA ladders. The purity of the extracted DNA was determined by calculating the A260/A280 ratio. Cost and time for each DNA extraction method were estimated and analyzed statistically. The amount of DNA extracted by these three methods was sufficient for PCR amplification. The sarcosyl method yielded DNA of higher purity, because it generated a clearer pellet without viscosity, and yielded high quality amplification products of the COI gene I. The sarcosyl method showed lower cost per extraction and did not differ from the other methods with respect to preparation times. Cell lysis by sarcosyl represents the best method for DNA extraction in terms of yield, quality, and cost effectiveness.

  16. Recent advances in ChIP-seq analysis: from quality management to whole-genome annotation.

    PubMed

    Nakato, Ryuichiro; Shirahige, Katsuhiko

    2017-03-01

    Chromatin immunoprecipitation followed by sequencing (ChIP-seq) analysis can detect protein/DNA-binding and histone-modification sites across an entire genome. Recent advances in sequencing technologies and analyses enable us to compare hundreds of samples simultaneously; such large-scale analysis has potential to reveal the high-dimensional interrelationship level for regulatory elements and annotate novel functional genomic regions de novo. Because many experimental considerations are relevant to the choice of a method in a ChIP-seq analysis, the overall design and quality management of the experiment are of critical importance. This review offers guiding principles of computation and sample preparation for ChIP-seq analyses, highlighting the validity and limitations of the state-of-the-art procedures at each step. We also discuss the latest challenges of single-cell analysis that will encourage a new era in this field. © The Author 2016. Published by Oxford University Press.

  17. Single Molecule Analysis of Replicated DNA Reveals the Usage of Multiple KSHV Genome Regions for Latent Replication

    PubMed Central

    Verma, Subhash C.; Lu, Jie; Cai, Qiliang; Kosiyatrakul, Settapong; McDowell, Maria E.; Schildkraut, Carl L.; Robertson, Erle S.

    2011-01-01

    Kaposi's sarcoma associated herpesvirus (KSHV), an etiologic agent of Kaposi's sarcoma, Body Cavity Based Lymphoma and Multicentric Castleman's Disease, establishes lifelong latency in infected cells. The KSHV genome tethers to the host chromosome with the help of a latency associated nuclear antigen (LANA). Additionally, LANA supports replication of the latent origins within the terminal repeats by recruiting cellular factors. Our previous studies identified and characterized another latent origin, which supported the replication of plasmids ex-vivo without LANA expression in trans. Therefore identification of an additional origin site prompted us to analyze the entire KSHV genome for replication initiation sites using single molecule analysis of replicated DNA (SMARD). Our results showed that replication of DNA can initiate throughout the KSHV genome and the usage of these regions is not conserved in two different KSHV strains investigated. SMARD also showed that the utilization of multiple replication initiation sites occurs across large regions of the genome rather than a specified sequence. The replication origin of the terminal repeats showed only a slight preference for their usage indicating that LANA dependent origin at the terminal repeats (TR) plays only a limited role in genome duplication. Furthermore, we performed chromatin immunoprecipitation for ORC2 and MCM3, which are part of the pre-replication initiation complex to determine the genomic sites where these proteins accumulate, to provide further characterization of potential replication initiation sites on the KSHV genome. The ChIP data confirmed accumulation of these pre-RC proteins at multiple genomic sites in a cell cycle dependent manner. Our data also show that both the frequency and the sites of replication initiation vary within the two KSHV genomes studied here, suggesting that initiation of replication is likely to be affected by the genomic context rather than the DNA sequences. PMID

  18. FISH Oracle: a web server for flexible visualization of DNA copy number data in a genomic context.

    PubMed

    Mader, Malte; Simon, Ronald; Steinbiss, Sascha; Kurtz, Stefan

    2011-07-28

    The rapidly growing amount of array CGH data requires improved visualization software supporting the process of identifying candidate cancer genes. Optimally, such software should work across multiple microarray platforms, should be able to cope with data from different sources and should be easy to operate. We have developed a web-based software FISH Oracle to visualize data from multiple array CGH experiments in a genomic context. Its fast visualization engine and advanced web and database technology supports highly interactive use. FISH Oracle comes with a convenient data import mechanism, powerful search options for genomic elements (e.g. gene names or karyobands), quick navigation and zooming into interesting regions, and mechanisms to export the visualization into different high quality formats. These features make the software especially suitable for the needs of life scientists. FISH Oracle offers a fast and easy to use visualization tool for array CGH and SNP array data. It allows for the identification of genomic regions representing minimal common changes based on data from one or more experiments. FISH Oracle will be instrumental to identify candidate onco and tumor suppressor genes based on the frequency and genomic position of DNA copy number changes. The FISH Oracle application and an installed demo web server are available at http://www.zbh.uni-hamburg.de/fishoracle.

  19. DNA-based identification of spices: DNA isolation, whole genome amplification, and polymerase chain reaction.

    PubMed

    Focke, Felix; Haase, Ilka; Fischer, Markus

    2011-01-26

    Usually spices are identified morphologically using simple methods like magnifying glasses or microscopic instruments. On the other hand, molecular biological methods like the polymerase chain reaction (PCR) enable an accurate and specific detection also in complex matrices. Generally, the origins of spices are plants with diverse genetic backgrounds and relationships. The processing methods used for the production of spices are complex and individual. Consequently, the development of a reliable DNA-based method for spice analysis is a challenging intention. However, once established, this method will be easily adapted to less difficult food matrices. In the current study, several alternative methods for the isolation of DNA from spices have been developed and evaluated in detail with regard to (i) its purity (photometric), (ii) yield (fluorimetric methods), and (iii) its amplifiability (PCR). Whole genome amplification methods were used to preamplify isolates to improve the ratio between amplifiable DNA and inhibiting substances. Specific primer sets were designed, and the PCR conditions were optimized to detect 18 spices selectively. Assays of self-made spice mixtures were performed to proof the applicability of the developed methods.

  20. Reference-quality genome sequence of Aegilops tauschii, the source of wheat D genome, shows that recombination shapes genome structure and evolution

    USDA-ARS?s Scientific Manuscript database

    Aegilops tauschii is the diploid progenitor of the D genome of hexaploid wheat and an important genetic resource for wheat. A reference-quality sequence for the Ae. tauschii genome was produced with a combination of ordered-clone sequencing, whole-genome shotgun sequencing, and BioNano optical geno...

  1. DNA is structured as a linear "jigsaw puzzle" in the genomes of Arabidopsis, rice, and budding yeast.

    PubMed

    Liu, Yun-Hua; Zhang, Meiping; Wu, Chengcang; Huang, James J; Zhang, Hong-Bin

    2014-01-01

    Knowledge of how a genome is structured and organized from its constituent elements is crucial to understanding its biology and evolution. Here, we report the genome structuring and organization pattern as revealed by systems analysis of the sequences of three model species, Arabidopsis, rice and yeast, at the whole-genome and chromosome levels. We found that all fundamental function elements (FFE) constituting the genomes, including genes (GEN), DNA transposable elements (DTE), retrotransposable elements (RTE), simple sequence repeats (SSR), and (or) low complexity repeats (LCR), are structured in a nonrandom and correlative manner, thus leading to a hypothesis that the DNA of the species is structured as a linear "jigsaw puzzle". Furthermore, we showed that different FFE differ in their importance in the formation and evolution of the DNA jigsaw puzzle structure between species. DTE and RTE play more important roles than GEN, LCR, and SSR in Arabidopsis, whereas GEN and RTE play more important roles than LCR, SSR, and DTE in rice. The genes having multiple recognized functions play more important roles than those having single functions. These results provide useful knowledge necessary for better understanding genome biology and evolution of the species and for effective molecular breeding of rice.

  2. Telling plant species apart with DNA: from barcodes to genomes

    PubMed Central

    Li, De-Zhu; van der Bank, Michelle

    2016-01-01

    Land plants underpin a multitude of ecosystem functions, support human livelihoods and represent a critically important component of terrestrial biodiversity—yet many tens of thousands of species await discovery, and plant identification remains a substantial challenge, especially where material is juvenile, fragmented or processed. In this opinion article, we tackle two main topics. Firstly, we provide a short summary of the strengths and limitations of plant DNA barcoding for addressing these issues. Secondly, we discuss options for enhancing current plant barcodes, focusing on increasing discriminatory power via either gene capture of nuclear markers or genome skimming. The former has the advantage of establishing a defined set of target loci maximizing efficiency of sequencing effort, data storage and analysis. The challenge is developing a probe set for large numbers of nuclear markers that works over sufficient phylogenetic breadth. Genome skimming has the advantage of using existing protocols and being backward compatible with existing barcodes; and the depth of sequence coverage can be increased as sequencing costs fall. Its non-targeted nature does, however, present a major informatics challenge for upscaling to large sample sets. This article is part of the themed issue ‘From DNA barcodes to biomes’. PMID:27481790

  3. Anthocyanin inhibits propidium iodide DNA fluorescence in Euphorbia pulcherrima: implications for genome size variation and flow cytometry.

    PubMed

    Bennett, Michael D; Price, H James; Johnston, J Spencer

    2008-04-01

    Measuring genome size by flow cytometry assumes direct proportionality between nuclear DNA staining and DNA amount. By 1997 it was recognized that secondary metabolites may affect DNA staining, thereby causing inaccuracy. Here experiments are reported with poinsettia (Euphorbia pulcherrima) with green leaves and red bracts rich in phenolics. DNA content was estimated as fluorescence of propidium iodide (PI)-stained nuclei of poinsettia and/or pea (Pisum sativum) using flow cytometry. Tissue was chopped, or two tissues co-chopped, in Galbraith buffer alone or with six concentrations of cyanidin-3-rutinoside (a cyanidin-3-rhamnoglucoside contributing to red coloration in poinsettia). There were large differences in PI staining (35-70 %) between 2C nuclei from green leaf and red bract tissue in poinsettia. These largely disappeared when pea leaflets were co-chopped with poinsettia tissue as an internal standard. However, smaller (2.8-6.9 %) differences remained, and red bracts gave significantly lower 1C genome size estimates (1.69-1.76 pg) than green leaves (1.81 pg). Chopping pea or poinsettia tissue in buffer with 0-200 microm cyanidin-3-rutinoside showed that the effects of natural inhibitors in red bracts of poinsettia on PI staining were largely reproduced in a dose-dependent way by this anthocyanin. Given their near-ubiquitous distribution, many suspected roles and known affects on DNA staining, anthocyanins are a potent, potential cause of significant error variation in genome size estimations for many plant tissues and taxa. This has important implications of wide practical and theoretical significance. When choosing genome size calibration standards it seems prudent to select materials producing little or no anthocyanin. Reviewing the literature identifies clear examples in which claims of intraspecific variation in genome size are probably artefacts caused by natural variation in anthocyanin levels or correlated with environmental factors known to induce

  4. C. elegans whole-genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency

    PubMed Central

    Meier, Bettina; Cooke, Susanna L.; Weiss, Joerg; Bailly, Aymeric P.; Alexandrov, Ludmil B.; Marshall, John; Raine, Keiran; Maddison, Mark; Anderson, Elizabeth; Stratton, Michael R.; Campbell, Peter J.

    2014-01-01

    Mutation is associated with developmental and hereditary disorders, aging, and cancer. While we understand some mutational processes operative in human disease, most remain mysterious. We used Caenorhabditis elegans whole-genome sequencing to model mutational signatures, analyzing 183 worm populations across 17 DNA repair-deficient backgrounds propagated for 20 generations or exposed to carcinogens. The baseline mutation rate in C. elegans was approximately one per genome per generation, not overtly altered across several DNA repair deficiencies over 20 generations. Telomere erosion led to complex chromosomal rearrangements initiated by breakage–fusion–bridge cycles and completed by simultaneously acquired, localized clusters of breakpoints. Aflatoxin B1 induced substitutions of guanines in a GpC context, as observed in aflatoxin-induced liver cancers. Mutational burden increased with impaired nucleotide excision repair. Cisplatin and mechlorethamine, DNA crosslinking agents, caused dose- and genotype-dependent signatures among indels, substitutions, and rearrangements. Strikingly, both agents induced clustered rearrangements resembling “chromoanasynthesis,” a replication-based mutational signature seen in constitutional genomic disorders, suggesting that interstrand crosslinks may play a pathogenic role in such events. Cisplatin mutagenicity was most pronounced in xpf-1 mutants, suggesting that this gene critically protects cells against platinum chemotherapy. Thus, experimental model systems combined with genome sequencing can recapture and mechanistically explain mutational signatures associated with human disease. PMID:25030888

  5. Predicting DNA Methylation State of CpG Dinucleotide Using Genome Topological Features and Deep Networks

    NASA Astrophysics Data System (ADS)

    Wang, Yiheng; Liu, Tong; Xu, Dong; Shi, Huidong; Zhang, Chaoyang; Mo, Yin-Yuan; Wang, Zheng

    2016-01-01

    The hypo- or hyper-methylation of the human genome is one of the epigenetic features of leukemia. However, experimental approaches have only determined the methylation state of a small portion of the human genome. We developed deep learning based (stacked denoising autoencoders, or SdAs) software named “DeepMethyl” to predict the methylation state of DNA CpG dinucleotides using features inferred from three-dimensional genome topology (based on Hi-C) and DNA sequence patterns. We used the experimental data from immortalised myelogenous leukemia (K562) and healthy lymphoblastoid (GM12878) cell lines to train the learning models and assess prediction performance. We have tested various SdA architectures with different configurations of hidden layer(s) and amount of pre-training data and compared the performance of deep networks relative to support vector machines (SVMs). Using the methylation states of sequentially neighboring regions as one of the learning features, an SdA achieved a blind test accuracy of 89.7% for GM12878 and 88.6% for K562. When the methylation states of sequentially neighboring regions are unknown, the accuracies are 84.82% for GM12878 and 72.01% for K562. We also analyzed the contribution of genome topological features inferred from Hi-C. DeepMethyl can be accessed at http://dna.cs.usm.edu/deepmethyl/.

  6. C. elegans whole-genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency.

    PubMed

    Meier, Bettina; Cooke, Susanna L; Weiss, Joerg; Bailly, Aymeric P; Alexandrov, Ludmil B; Marshall, John; Raine, Keiran; Maddison, Mark; Anderson, Elizabeth; Stratton, Michael R; Gartner, Anton; Campbell, Peter J

    2014-10-01

    Mutation is associated with developmental and hereditary disorders, aging, and cancer. While we understand some mutational processes operative in human disease, most remain mysterious. We used Caenorhabditis elegans whole-genome sequencing to model mutational signatures, analyzing 183 worm populations across 17 DNA repair-deficient backgrounds propagated for 20 generations or exposed to carcinogens. The baseline mutation rate in C. elegans was approximately one per genome per generation, not overtly altered across several DNA repair deficiencies over 20 generations. Telomere erosion led to complex chromosomal rearrangements initiated by breakage-fusion-bridge cycles and completed by simultaneously acquired, localized clusters of breakpoints. Aflatoxin B1 induced substitutions of guanines in a GpC context, as observed in aflatoxin-induced liver cancers. Mutational burden increased with impaired nucleotide excision repair. Cisplatin and mechlorethamine, DNA crosslinking agents, caused dose- and genotype-dependent signatures among indels, substitutions, and rearrangements. Strikingly, both agents induced clustered rearrangements resembling "chromoanasynthesis," a replication-based mutational signature seen in constitutional genomic disorders, suggesting that interstrand crosslinks may play a pathogenic role in such events. Cisplatin mutagenicity was most pronounced in xpf-1 mutants, suggesting that this gene critically protects cells against platinum chemotherapy. Thus, experimental model systems combined with genome sequencing can recapture and mechanistically explain mutational signatures associated with human disease. © 2014 Meier et al.; Published by Cold Spring Harbor Laboratory Press.

  7. Predicting DNA Methylation State of CpG Dinucleotide Using Genome Topological Features and Deep Networks.

    PubMed

    Wang, Yiheng; Liu, Tong; Xu, Dong; Shi, Huidong; Zhang, Chaoyang; Mo, Yin-Yuan; Wang, Zheng

    2016-01-22

    The hypo- or hyper-methylation of the human genome is one of the epigenetic features of leukemia. However, experimental approaches have only determined the methylation state of a small portion of the human genome. We developed deep learning based (stacked denoising autoencoders, or SdAs) software named "DeepMethyl" to predict the methylation state of DNA CpG dinucleotides using features inferred from three-dimensional genome topology (based on Hi-C) and DNA sequence patterns. We used the experimental data from immortalised myelogenous leukemia (K562) and healthy lymphoblastoid (GM12878) cell lines to train the learning models and assess prediction performance. We have tested various SdA architectures with different configurations of hidden layer(s) and amount of pre-training data and compared the performance of deep networks relative to support vector machines (SVMs). Using the methylation states of sequentially neighboring regions as one of the learning features, an SdA achieved a blind test accuracy of 89.7% for GM12878 and 88.6% for K562. When the methylation states of sequentially neighboring regions are unknown, the accuracies are 84.82% for GM12878 and 72.01% for K562. We also analyzed the contribution of genome topological features inferred from Hi-C. DeepMethyl can be accessed at http://dna.cs.usm.edu/deepmethyl/.

  8. Nuclear routing networks span between nuclear pore complexes and genomic DNA to guide nucleoplasmic trafficking of biomolecules

    PubMed Central

    Malecki, Marek; Malecki, Bianca

    2012-01-01

    In health and disease, biomolecules, which are involved in gene expression, recombination, or reprogramming have to traffic through the nucleoplasm, between nuclear pore complexes (NPCs) and genomic DNA (gDNA). This trafficking is guided by the recently revealed nuclear routing networks (NRNs). In this study, we aimed to investigate, if the NRNs have established associations with the genomic DNA in situ and if the NRNs have capabilities to bind the DNA de novo. Moreover, we aimed to study further, if nucleoplasmic trafficking of the histones, rRNA, and transgenes’ vectors, between the NPCs and gDNA, is guided by the NRNs. We used Xenopus laevis oocytes as the model system. We engineered the transgenes’ DNA vectors equipped with the SV40 LTA nuclear localization signals (NLS) and/or HIV Rev nuclear export signals (NES). We purified histones, 5S rRNA, and gDNA. We rendered all these molecules superparamagnetic and fluorescent for detection with nuclear magnetic resonance (NMR), total reflection x-ray fluorescence (TXRF), energy dispersive x-ray spectroscopy (EDXS), and electron energy loss spectroscopy (EELS). The NRNs span between the NPCs and genomic DNA. They form firm bonds with the gDNA in situ. After complete digestion of the nucleic acids with the RNases and DNases, the newly added DNA - modified with the dNTP analogs, bonds firmly to the NRNs. Moreover, the NRNs guide the trafficking of the DNA transgenes’ vectors - modified with the SV40 LTA NLS, following their import into the nuclei through the NPCs. The pathway is identical to that of histones. The NRNs also guide the trafficking of the DNA transgenes’ vectors, modified with the HIV Rev NES, to the NPCs, followed by their export out of the nuclei. Ribosomal RNAs follow the same pathway. To summarize, the NRNs are the structures connecting the NPCs and the gDNA. They guide the trafficking of the biomolecules between the NPCs and the gDNA. PMID:23275893

  9. Genome-scale analysis of aberrant DNA methylation in colorectal cancer

    PubMed Central

    Hinoue, Toshinori; Weisenberger, Daniel J.; Lange, Christopher P.E.; Shen, Hui; Byun, Hyang-Min; Van Den Berg, David; Malik, Simeen; Pan, Fei; Noushmehr, Houtan; van Dijk, Cornelis M.; Tollenaar, Rob A.E.M.; Laird, Peter W.

    2012-01-01

    Colorectal cancer (CRC) is a heterogeneous disease in which unique subtypes are characterized by distinct genetic and epigenetic alterations. Here we performed comprehensive genome-scale DNA methylation profiling of 125 colorectal tumors and 29 adjacent normal tissues. We identified four DNA methylation–based subgroups of CRC using model-based cluster analyses. Each subtype shows characteristic genetic and clinical features, indicating that they represent biologically distinct subgroups. A CIMP-high (CIMP-H) subgroup, which exhibits an exceptionally high frequency of cancer-specific DNA hypermethylation, is strongly associated with MLH1 DNA hypermethylation and the BRAFV600E mutation. A CIMP-low (CIMP-L) subgroup is enriched for KRAS mutations and characterized by DNA hypermethylation of a subset of CIMP-H-associated markers rather than a unique group of CpG islands. Non-CIMP tumors are separated into two distinct clusters. One non-CIMP subgroup is distinguished by a significantly higher frequency of TP53 mutations and frequent occurrence in the distal colon, while the tumors that belong to the fourth group exhibit a low frequency of both cancer-specific DNA hypermethylation and gene mutations and are significantly enriched for rectal tumors. Furthermore, we identified 112 genes that were down-regulated more than twofold in CIMP-H tumors together with promoter DNA hypermethylation. These represent ∼7% of genes that acquired promoter DNA methylation in CIMP-H tumors. Intriguingly, 48/112 genes were also transcriptionally down-regulated in non-CIMP subgroups, but this was not attributable to promoter DNA hypermethylation. Together, we identified four distinct DNA methylation subgroups of CRC and provided novel insight regarding the role of CIMP-specific DNA hypermethylation in gene silencing. PMID:21659424

  10. Assessment of the quality of DNA from various formalin-fixed paraffin-embedded (FFPE) tissues and the use of this DNA for next-generation sequencing (NGS) with no artifactual mutation

    PubMed Central

    Einaga, Naoki; Yoshida, Akio; Noda, Hiroko; Suemitsu, Masaaki; Nakayama, Yuki; Sakurada, Akihisa; Kawaji, Yoshiko; Yamaguchi, Hiromi; Sasaki, Yasushi; Tokino, Takashi; Esumi, Mariko

    2017-01-01

    Formalin-fixed, paraffin-embedded (FFPE) tissues used for pathological diagnosis are valuable for studying cancer genomics. In particular, laser-capture microdissection of target cells determined by histopathology combined with FFPE tissue section immunohistochemistry (IHC) enables precise analysis by next-generation sequencing (NGS) of the genetic events occurring in cancer. The result is a new strategy for a pathological tool for cancer diagnosis: ‘microgenomics’. To more conveniently and precisely perform microgenomics, we revealed by systematic analysis the following three details regarding FFPE DNA compared with paired frozen tissue DNA. 1) The best quality of FFPE DNA is obtained by tissue fixation with 10% neutral buffered formalin for 1 day and heat treatment of tissue lysates at 95°C for 30 minutes. 2) IHC staining of FFPE tissues decreases the quantity and quality of FFPE DNA to one-fourth, and antigen retrieval (at 120°C for 15 minutes, pH 6.0) is the major reason for this decrease. 3) FFPE DNA prepared as described herein is sufficient for NGS. For non-mutated tissue specimens, no artifactual mutation occurs during FFPE preparation, as shown by precise comparison of NGS of FFPE DNA and paired frozen tissue DNA followed by validation. These results demonstrate that even FFPE tissues used for routine clinical diagnosis can be utilized to obtain reliable NGS data if appropriate conditions of fixation and validation are applied. PMID:28498833

  11. DNA microarrays of baculovirus genomes: differential expression of viral genes in two susceptible insect cell lines.

    PubMed

    Yamagishi, J; Isobe, R; Takebuchi, T; Bando, H

    2003-03-01

    We describe, for the first time, the generation of a viral DNA chip for simultaneous expression measurements of nearly all known open reading frames (ORFs) in the best-studied members of the family Baculoviridae, Autographa californica multiple nucleopolyhedrovirus (AcMNPV) and Bombyx mori nucleopolyhedrovirus (BmNPV). In this study, a viral DNA chip (Ac-BmNPV chip) was fabricated and used to characterize the viral gene expression profile for AcMNPV in different cell types. The viral chip is composed of microarrays of viral DNA prepared by robotic deposition of PCR-amplified viral DNA fragments on glass for ORFs in the NPV genome. Viral gene expression was monitored by hybridization to the DNA fragment microarrays with fluorescently labeled cDNAs prepared from infected Spodoptera frugiperda, Sf9 cells and Trichoplusia ni, TnHigh-Five cells, the latter a major producer of baculovirus and recombinant proteins. A comparison of expression profiles of known ORFs in AcMNPV elucidated six genes (ORF150, p10, pk2, and three late gene expression factor genes lef-3, p35 and lef- 6) the expression of each of which was regulated differently in the two cell lines. Most of these genes are known to be closely involved in the viral life cycle such as in DNA replication, late gene expression and the release of polyhedra from infected cells. These results imply that the differential expression of these viral genes accounts for the differences in viral replication between these two cell lines. Thus, these fabricated microarrays of NPV DNA which allow a rapid analysis of gene expression at the viral genome level should greatly speed the functional analysis of large genomes of NPV.

  12. Comparative clinical utility of tumor genomic testing and cell-free DNA in metastatic breast cancer.

    PubMed

    Maxwell, Kara N; Soucier-Ernst, Danielle; Tahirovic, Emin; Troxel, Andrea B; Clark, Candace; Feldman, Michael; Colameco, Christopher; Kakrecha, Bijal; Langer, Melissa; Lieberman, David; Morrissette, Jennifer J D; Paul, Matt R; Pan, Tien-Chi; Yee, Stephanie; Shih, Natalie; Carpenter, Erica; Chodosh, Lewis A; DeMichele, Angela

    2017-08-01

    Breast cancer metastases differ biologically from primary disease; therefore, metastatic biopsies may assist in treatment decision making. Commercial genomic testing of both tumor and circulating tumor DNA have become available clinically, but utility of these tests in breast cancer management remains unclear. Patients undergoing a clinically indicated metastatic tumor biopsy were consented to the ongoing METAMORPH registry. Tumor and blood were collected at the time of disease progression before subsequent therapy, and patients were followed for response on subsequent treatment. Tumor testing (n = 53) and concurrent cell-free DNA (n = 32) in a subset of patients was performed using CLIA-approved assays. The proportion of patients with a genomic alteration was lower in tumor than in blood (69 vs. 91%; p = 0.06). After restricting analysis to alterations covered on both platforms, 83% of tumor alterations were detected in blood, while 90% of blood alterations were detected in tumor. Mutational load specific for the panel genes was calculated for both tumor and blood. Time to progression on subsequent treatment was significantly shorter for patients whose tumors had high panel-specific mutational load (HR 0.31, 95% CI 0.12-0.78) or a TP53 mutation (HR 0.35, 95% CI 0.20-0.79), after adjusting for stage at presentation, hormone receptor status, prior treatment type, and number of lines of metastatic treatment. Treating oncologists must distinguish platform differences from true biological heterogeneity when comparing tumor and cfDNA genomic testing results. Tumor and concurrent cfDNA contribute unique genomic information in metastatic breast cancer patients, providing potentially useful biomarkers for aggressive metastatic disease.

  13. DNA Data Visualization (DDV): Software for Generating Web-Based Interfaces Supporting Navigation and Analysis of DNA Sequence Data of Entire Genomes.

    PubMed

    Neugebauer, Tomasz; Bordeleau, Eric; Burrus, Vincent; Brzezinski, Ryszard

    2015-01-01

    Data visualization methods are necessary during the exploration and analysis activities of an increasingly data-intensive scientific process. There are few existing visualization methods for raw nucleotide sequences of a whole genome or chromosome. Software for data visualization should allow the researchers to create accessible data visualization interfaces that can be exported and shared with others on the web. Herein, novel software developed for generating DNA data visualization interfaces is described. The software converts DNA data sets into images that are further processed as multi-scale images to be accessed through a web-based interface that supports zooming, panning and sequence fragment selection. Nucleotide composition frequencies and GC skew of a selected sequence segment can be obtained through the interface. The software was used to generate DNA data visualization of human and bacterial chromosomes. Examples of visually detectable features such as short and long direct repeats, long terminal repeats, mobile genetic elements, heterochromatic segments in microbial and human chromosomes, are presented. The software and its source code are available for download and further development. The visualization interfaces generated with the software allow for the immediate identification and observation of several types of sequence patterns in genomes of various sizes and origins. The visualization interfaces generated with the software are readily accessible through a web browser. This software is a useful research and teaching tool for genetics and structural genomics.

  14. Construction of a genomic DNA library with a TA vector and its application in cloning of the phytoene synthase gene from the cyanobacterium Spirulina platensis M-135

    NASA Astrophysics Data System (ADS)

    Yoshikazu, Kawata; Shin-Ichi, Yano; Hiroyuki, Kojima

    1998-03-01

    An efficient and simple method for constructing a genomic DNA library using a TA cloning vector is presented. It is based on the sonicative cleavage of genomic DNA and modification of fragment ends with Taq DNA polymerase, followed by ligation using a TA vector. This method was applied for cloning of the phytoene synthase gene crt B from Spirulina platensis. This method is useful when genomic DNA cannot be efficiently digested with restriction enzymes, a problem often encountered during the construction of a genomic DNA library of cyanobacteria.

  15. Genome-Wide Association Mapping and Genomic Selection for Alfalfa (Medicago sativa) Forage Quality Traits

    PubMed Central

    Pecetti, Luciano; Brummer, E. Charles; Palmonari, Alberto; Tava, Aldo

    2017-01-01

    Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the most-grown forage legume worldwide. This study aimed at exploring opportunities for marker-assisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3–0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits

  16. Genome-Wide Association Mapping and Genomic Selection for Alfalfa (Medicago sativa) Forage Quality Traits.

    PubMed

    Biazzi, Elisa; Nazzicari, Nelson; Pecetti, Luciano; Brummer, E Charles; Palmonari, Alberto; Tava, Aldo; Annicchiarico, Paolo

    2017-01-01

    Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the most-grown forage legume worldwide. This study aimed at exploring opportunities for marker-assisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3-0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits

  17. A common mutation in the 5,10-methylenetetrahydrofolate reductase gene affects genomic DNA methylation through an interaction with folate status

    PubMed Central

    Friso, Simonetta; Choi, Sang-Woon; Girelli, Domenico; Mason, Joel B.; Dolnikowski, Gregory G.; Bagley, Pamela J.; Olivieri, Oliviero; Jacques, Paul F.; Rosenberg, Irwin H.; Corrocher, Roberto; Selhub, Jacob

    2002-01-01

    DNA methylation, an essential epigenetic feature of DNA that modulates gene expression and genomic integrity, is catalyzed by methyltransferases that use the universal methyl donor S-adenosyl-l-methionine. Methylenetetrahydrofolate reductase (MTHFR) catalyzes the synthesis of 5-methyltetrahydrofolate (5-methylTHF), the methyl donor for synthesis of methionine from homocysteine and precursor of S-adenosyl-l-methionine. In the present study we sought to determine the effect of folate status on genomic DNA methylation with an emphasis on the interaction with the common C677T mutation in the MTHFR gene. A liquid chromatography/MS method for the analysis of nucleotide bases was used to assess genomic DNA methylation in peripheral blood mononuclear cell DNA from 105 subjects homozygous for this mutation (T/T) and 187 homozygous for the wild-type (C/C) MTHFR genotype. The results show that genomic DNA methylation directly correlates with folate status and inversely with plasma homocysteine (tHcy) levels (P < 0.01). T/T genotypes had a diminished level of DNA methylation compared with those with the C/C wild-type (32.23 vs.62.24 ng 5-methylcytosine/μg DNA, P < 0.0001). When analyzed according to folate status, however, only the T/T subjects with low levels of folate accounted for the diminished DNA methylation (P < 0.0001). Moreover, in T/T subjects DNA methylation status correlated with the methylated proportion of red blood cell folate and was inversely related to the formylated proportion of red blood cell folates (P < 0.03) that is known to be solely represented in those individuals. These results indicate that the MTHFR C677T polymorphism influences DNA methylation status through an interaction with folate status. PMID:11929966

  18. Primers for polymerase chain reaction to detect genomic DNA of Toxocara canis and T. cati.

    PubMed

    Wu, Z; Nagano, I; Xu, D; Takahashi, Y

    1997-03-01

    Primers for polymerase chain reaction to amplify genomic DNA of both Toxocara canis and T. cati were constructed by adapting cloning and sequencing random amplified polymorphic DNA. The primers are expected to detect eggs and/or larvae of T. canis and T. cati, both of which are known to cause toxocariasis in humans.

  19. Genome-wide measures of DNA methylation in peripheral blood and the risk of urothelial cell carcinoma: a prospective nested case-control study.

    PubMed

    Dugué, Pierre-Antoine; Brinkman, Maree T; Milne, Roger L; Wong, Ee Ming; FitzGerald, Liesel M; Bassett, Julie K; Joo, Jihoon E; Jung, Chol-Hee; Makalic, Enes; Schmidt, Daniel F; Park, Daniel J; Chung, Jessica; Ta, Anthony D; Bolton, Damien M; Lonie, Andrew; Longano, Anthony; Hopper, John L; Severi, Gianluca; Saffery, Richard; English, Dallas R; Southey, Melissa C; Giles, Graham G

    2016-09-06

    Global DNA methylation has been reported to be associated with urothelial cell carcinoma (UCC) by studies using blood samples collected at diagnosis. Using the Illumina HumanMethylation450 assay, we derived genome-wide measures of blood DNA methylation and assessed them for their prospective association with UCC risk. We used 439 case-control pairs from the Melbourne Collaborative Cohort Study matched on age, sex, country of birth, DNA sample type, and collection period. Conditional logistic regression was used to compute odds ratios (OR) of UCC risk per s.d. of each genome-wide measure of DNA methylation and 95% confidence intervals (CIs), adjusted for potential confounders. We also investigated associations by disease subtype, sex, smoking, and time since blood collection. The risk of superficial UCC was decreased for individuals with higher levels of our genome-wide DNA methylation measure (OR=0.71, 95% CI: 0.54-0.94; P=0.02). This association was particularly strong for current smokers at sample collection (OR=0.47, 95% CI: 0.27-0.83). Intermediate levels of our genome-wide measure were associated with decreased risk of invasive UCC. Some variation was observed between UCC subtypes and the location and regulatory function of the CpGs included in the genome-wide measures of methylation. Higher levels of our genome-wide DNA methylation measure were associated with decreased risk of superficial UCC and intermediate levels were associated with reduced risk of invasive disease. These findings require replication by other prospective studies.

  20. Genome-wide DNA methylation patterns in wild samples of two morphotypes of threespine stickleback (Gasterosteus aculeatus).

    PubMed

    Smith, Gilbert; Smith, Carl; Kenny, John G; Chaudhuri, Roy R; Ritchie, Michael G

    2015-04-01

    Epigenetic marks such as DNA methylation play important biological roles in gene expression regulation and cellular differentiation during development. To examine whether DNA methylation patterns are potentially associated with naturally occurring phenotypic differences, we examined genome-wide DNA methylation within Gasterosteus aculeatus, using reduced representation bisulfite sequencing. First, we identified highly methylated regions of the stickleback genome, finding such regions to be located predominantly within genes, and associated with genes functioning in metabolism and biosynthetic processes, cell adhesion, signaling pathways, and blood vessel development. Next, we identified putative differentially methylated regions (DMRs) of the genome between complete and low lateral plate morphs of G. aculeatus. We detected 77 DMRs that were mainly located in intergenic regions. Annotations of genes associated with these DMRs revealed potential functions in a number of known divergent adaptive phenotypes between G. aculeatus ecotypes, including cardiovascular development, growth, and neuromuscular development. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  1. Genome-wide DNA methylation measurements in prostate tissues uncovers novel prostate cancer diagnostic biomarkers and transcription factor binding patterns.

    PubMed

    Kirby, Marie K; Ramaker, Ryne C; Roberts, Brian S; Lasseigne, Brittany N; Gunther, David S; Burwell, Todd C; Davis, Nicholas S; Gulzar, Zulfiqar G; Absher, Devin M; Cooper, Sara J; Brooks, James D; Myers, Richard M

    2017-04-17

    Current diagnostic tools for prostate cancer lack specificity and sensitivity for detecting very early lesions. DNA methylation is a stable genomic modification that is detectable in peripheral patient fluids such as urine and blood plasma that could serve as a non-invasive diagnostic biomarker for prostate cancer. We measured genome-wide DNA methylation patterns in 73 clinically annotated fresh-frozen prostate cancers and 63 benign-adjacent prostate tissues using the Illumina Infinium HumanMethylation450 BeadChip array. We overlaid the most significantly differentially methylated sites in the genome with transcription factor binding sites measured by the Encyclopedia of DNA Elements consortium. We used logistic regression and receiver operating characteristic curves to assess the performance of candidate diagnostic models. We identified methylation patterns that have a high predictive power for distinguishing malignant prostate tissue from benign-adjacent prostate tissue, and these methylation signatures were validated using data from The Cancer Genome Atlas Project. Furthermore, by overlaying ENCODE transcription factor binding data, we observed an enrichment of enhancer of zeste homolog 2 binding in gene regulatory regions with higher DNA methylation in malignant prostate tissues. DNA methylation patterns are greatly altered in prostate cancer tissue in comparison to benign-adjacent tissue. We have discovered patterns of DNA methylation marks that can distinguish prostate cancers with high specificity and sensitivity in multiple patient tissue cohorts, and we have identified transcription factors binding in these differentially methylated regions that may play important roles in prostate cancer development.

  2. The TTSMI database: a catalog of triplex target DNA sites associated with genes and regulatory elements in the human genome.

    PubMed

    Jenjaroenpun, Piroon; Chew, Chee Siang; Yong, Tai Pang; Choowongkomon, Kiattawee; Thammasorn, Wimada; Kuznetsov, Vladimir A

    2015-01-01

    A triplex target DNA site (TTS), a stretch of DNA that is composed of polypurines, is able to form a triple-helix (triplex) structure with triplex-forming oligonucleotides (TFOs) and is able to influence the site-specific modulation of gene expression and/or the modification of genomic DNA. The co-localization of a genomic TTS with gene regulatory signals and functional genome structures suggests that TFOs could potentially be exploited in antigene strategies for the therapy of cancers and other genetic diseases. Here, we present the TTS Mapping and Integration (TTSMI; http://ttsmi.bii.a-star.edu.sg) database, which provides a catalog of unique TTS locations in the human genome and tools for analyzing the co-localization of TTSs with genomic regulatory sequences and signals that were identified using next-generation sequencing techniques and/or predicted by computational models. TTSMI was designed as a user-friendly tool that facilitates (i) fast searching/filtering of TTSs using several search terms and criteria associated with sequence stability and specificity, (ii) interactive filtering of TTSs that co-localize with gene regulatory signals and non-B DNA structures, (iii) exploration of dynamic combinations of the biological signals of specific TTSs and (iv) visualization of a TTS simultaneously with diverse annotation tracks via the UCSC genome browser. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Relations between Shannon entropy and genome order index in segmenting DNA sequences.

    PubMed

    Zhang, Yi

    2009-04-01

    Shannon entropy H and genome order index S are used in segmenting DNA sequences. Zhang [Phys. Rev. E 72, 041917 (2005)] found that the two schemes are equivalent when a DNA sequence is converted to a binary sequence of S (strong H bond) and W (weak H bond). They left the mathematical proof to mathematicians who are interested in this issue. In this paper, a possible mathematical explanation is given. Moreover, we find that Chargaff parity rule 2 is the necessary condition of the equivalence, and the equivalence disappears when a DNA sequence is regarded as a four-symbol sequence. At last, we propose that S-2(-H) may be related to species evolution.

  4. Low-level laser irradiation alters mRNA expression from genes involved in DNA repair and genomic stabilization in myoblasts

    NASA Astrophysics Data System (ADS)

    Trajano, L. A. S. N.; Sergio, L. P. S.; Silva, C. L.; Carvalho, L.; Mencalha, A. L.; Stumbo, A. C.; Fonseca, A. S.

    2016-07-01

    Low-level lasers are used for the treatment of diseases in soft and bone tissues, but few data are available regarding their effects on genomic stability. In this study, we investigated mRNA expression from genes involved in DNA repair and genomic stabilization in myoblasts exposed to low-level infrared laser. C2C12 myoblast cultures in different fetal bovine serum concentrations were exposed to low-level infrared laser (10, 35 and 70 J cm-2), and collected for the evaluation of DNA repair gene expression. Laser exposure increased gene expression related to base excision repair (8-oxoguanine DNA glycosylase and apurinic/apyrimidinic endonuclease 1), nucleotide excision repair (excision repair cross-complementation group 1 and xeroderma pigmentosum C protein) and genomic stabilization (ATM serine/threonine kinase and tumor protein p53) in normal and low fetal bovine serum concentrations. Results suggest that genomic stability could be part of a biostimulation effect of low-level laser therapy in injured muscles.

  5. Development of DNA-Free Sediment for Ecological Assays with Genomic Endpoints (NAC SETAC)

    EPA Science Inventory

    Recent advances in genomics are currently being exploited to discern ecological changes that have conventionally been measured using laborious counting techniques. For example, next generation sequencing technologies can be used to create DNA libraries from benthic community ass...

  6. Subpicosecond surface dynamics in genomic DNA from in vitro-grown plant species: a SERS assessment.

    PubMed

    Muntean, Cristina M; Bratu, Ioan; Leopold, Nicolae; Morari, Cristian; Buimaga-Iarinca, Luiza; Purcaru, Monica A P

    2015-09-07

    In this work the surface-enhanced Raman total half band widths of seven genomic DNAs from leaves of chrysanthemum (Dendranthema grandiflora Ramat.), common sundew (Drosera rotundifolia L.), edelweiss (Leontopodium alpinum Cass), Epilobium hirsutum L., Hypericum richeri ssp. transsilvanicum (Čelak) Ciocârlan, rose (Rosa x hybrida L.) and redwood (Sequoia sempervirens D. Don. Endl.) have been measured. We have shown that surface-enhanced Raman spectroscopy (SERS) can be used to study the fast subpicosecond dynamics of DNA in the proximity of a metallic surface. The dependencies of the total half band widths and the global relaxation times, on the DNA molecular subgroup structure and on the type of genomic DNA, are reported. In our study, the full widths at half-maximum (FWHMs) for the SERS bands of genomic DNAs from different leaf tissues are typically in the wavenumber range from 15 to 55 cm(-1). Besides, it can be observed that molecular relaxation processes studied in this work have a global relaxation time smaller than 0.71 ps and larger than 0.19 ps. A comparison between different ranges of FT-Raman and SERS band parameters, respectively, corresponding to DNA extracted from leaf tissues is given. It is shown that the interaction between DNA and a metallic surface has the potential to lead to a shortening of the global relaxation times, as compared with molecular dynamics in solution. We have found that the surface dynamics of molecular subgroups in plant DNA is, in some cases, about two times faster than the solution dynamics of nucleic acids. This can be rationalized in a qualitative manner by invoking the complex landscape of the interaction energy between the molecule and the silver surface.

  7. Genome-Wide Requirements for Resistance to Functionally Distinct DNA-Damaging Agents

    PubMed Central

    Proctor, Michael; Flaherty, Patrick; Jordan, Michael I; Arkin, Adam P; Davis, Ronald W; Nislow, Corey; Giaever, Guri

    2005-01-01

    The mechanistic and therapeutic differences in the cellular response to DNA-damaging compounds are not completely understood, despite intense study. To expand our knowledge of DNA damage, we assayed the effects of 12 closely related DNA-damaging agents on the complete pool of ~4,700 barcoded homozygous deletion strains of Saccharomyces cerevisiae. In our protocol, deletion strains are pooled together and grown competitively in the presence of compound. Relative strain sensitivity is determined by hybridization of PCR-amplified barcodes to an oligonucleotide array carrying the barcode complements. These screens identified genes in well-characterized DNA-damage-response pathways as well as genes whose role in the DNA-damage response had not been previously established. High-throughput individual growth analysis was used to independently confirm microarray results. Each compound produced a unique genome-wide profile. Analysis of these data allowed us to determine the relative importance of DNA-repair modules for resistance to each of the 12 profiled compounds. Clustering the data for 12 distinct compounds uncovered both known and novel functional interactions that comprise the DNA-damage response and allowed us to define the genetic determinants required for repair of interstrand cross-links. Further genetic analysis allowed determination of epistasis for one of these functional groups. PMID:16121259

  8. The genomics revolution and its effect on water quality

    EPA Science Inventory

    Genomic-based molecular tools are emerging as powerful laboratory methods for assessing water quality characteristics and improving our ability to assess the human health risks posed by microbial contaminants in drinking water. To a great extent, this revolution in genomics-rese...

  9. Genome-wide identification and characterisation of human DNA replication origins by initiation site sequencing (ini-seq)

    PubMed Central

    Langley, Alexander R.; Gräf, Stefan; Smith, James C.; Krude, Torsten

    2016-01-01

    Next-generation sequencing has enabled the genome-wide identification of human DNA replication origins. However, different approaches to mapping replication origins, namely (i) sequencing isolated small nascent DNA strands (SNS-seq); (ii) sequencing replication bubbles (bubble-seq) and (iii) sequencing Okazaki fragments (OK-seq), show only limited concordance. To address this controversy, we describe here an independent high-resolution origin mapping technique that we call initiation site sequencing (ini-seq). In this approach, newly replicated DNA is directly labelled with digoxigenin-dUTP near the sites of its initiation in a cell-free system. The labelled DNA is then immunoprecipitated and genomic locations are determined by DNA sequencing. Using this technique we identify >25,000 discrete origin sites at sub-kilobase resolution on the human genome, with high concordance between biological replicates. Most activated origins identified by ini-seq are found at transcriptional start sites and contain G-quadruplex (G4) motifs. They tend to cluster in early-replicating domains, providing a correlation between early replication timing and local density of activated origins. Origins identified by ini-seq show highest concordance with sites identified by SNS-seq, followed by OK-seq and bubble-seq. Furthermore, germline origins identified by positive nucleotide distribution skew jumps overlap with origins identified by ini-seq and OK-seq more frequently and more specifically than do sites identified by either SNS-seq or bubble-seq. PMID:27587586

  10. Genome-wide identification and characterisation of human DNA replication origins by initiation site sequencing (ini-seq).

    PubMed

    Langley, Alexander R; Gräf, Stefan; Smith, James C; Krude, Torsten

    2016-12-01

    Next-generation sequencing has enabled the genome-wide identification of human DNA replication origins. However, different approaches to mapping replication origins, namely (i) sequencing isolated small nascent DNA strands (SNS-seq); (ii) sequencing replication bubbles (bubble-seq) and (iii) sequencing Okazaki fragments (OK-seq), show only limited concordance. To address this controversy, we describe here an independent high-resolution origin mapping technique that we call initiation site sequencing (ini-seq). In this approach, newly replicated DNA is directly labelled with digoxigenin-dUTP near the sites of its initiation in a cell-free system. The labelled DNA is then immunoprecipitated and genomic locations are determined by DNA sequencing. Using this technique we identify >25,000 discrete origin sites at sub-kilobase resolution on the human genome, with high concordance between biological replicates. Most activated origins identified by ini-seq are found at transcriptional start sites and contain G-quadruplex (G4) motifs. They tend to cluster in early-replicating domains, providing a correlation between early replication timing and local density of activated origins. Origins identified by ini-seq show highest concordance with sites identified by SNS-seq, followed by OK-seq and bubble-seq. Furthermore, germline origins identified by positive nucleotide distribution skew jumps overlap with origins identified by ini-seq and OK-seq more frequently and more specifically than do sites identified by either SNS-seq or bubble-seq. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. The study of genomic DNA adsorption and subsequent interactions using total internal reflection ellipsometry.

    PubMed

    Nabok, Alexei; Tsargorodskaya, Anna; Davis, Frank; Higson, Séamus P J

    2007-10-31

    The adsorption of genomic DNA and subsequent interactions between adsorbed and solvated DNA was studied using a novel sensitive optical method of total internal reflection ellipsometry (TIRE), which combines spectroscopic ellipsometry with surface plasmon resonance (SPR). Single strands of DNA of two species of fish (herring and salmon) were electrostatically adsorbed on top of polyethylenimine films deposited upon gold coated glass slides. The ellipsometric spectra were recorded and data fitting utilized to extract optical parameters (thickness and refractive index) of adsorbed DNA layers. The further adsorption of single stranded DNA from an identical source, i.e. herring ss-DNA on herring ss-DNA or salmon ss-DNA on salmon ss-DNA, on the surface was observed to give rise to substantial film thickness increases at the surface of about 20-21 nm. Conversely adsorption of DNA from alternate species, i.e. salmon ss-DNA on herring ss-DNA or herring ss-DNA on salmon ss-DNA, yielded much smaller changes in thickness of 3-5 nm. AFM studies of the surface roughness of adsorbed layers were in line with the TIRE data.

  12. Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.

    PubMed

    Jin, Sheng Chih; Benitez, Bruno A; Deming, Yuetiva; Cruchaga, Carlos

    2016-01-01

    Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with a relatively small effect size that only explain a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1-5 %) and rare-variants that are not contained in the commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on risk for developing human diseases or disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, it is necessary to perform next-generation sequencing (NGS) studies in a large population (>4,000 samples) to detect a significant rare-variant association. Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (i.e. most sequencing kits only provide 96 distinct index). Additionally, the sequencing library preparation for 4,000 samples remains expensive and thus conducting NGS studies with the aforementioned methods are not feasible for most research laboratories.The need for low-cost large scale rare-variant detection makes pooled-DNA sequencing an ideally efficient and cost-effective technique to identify rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions in multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res

  13. DNA Data Bank of Japan (DDBJ) for genome scale research in life science

    PubMed Central

    Tateno, Y.; Imanishi, T.; Miyazaki, S.; Fukami-Kobayashi, K.; Saitou, N.; Sugawara, H.; Gojobori, T.

    2002-01-01

    The DNA Data Bank of Japan (DDBJ, http://www.ddbj.nig.ac.jp) has made an effort to collect as much data as possible mainly from Japanese researchers. The increase rates of the data we collected, annotated and released to the public in the past year are 43% for the number of entries and 52% for the number of bases. The increase rates are accelerated even after the human genome was sequenced, because sequencing technology has been remarkably advanced and simplified, and research in life science has been shifted from the gene scale to the genome scale. In addition, we have developed the Genome Information Broker (GIB, http://gib.genes.nig.ac.jp) that now includes more than 50 complete microbial genome and Arabidopsis genome data. We have also developed a database of the human genome, the Human Genomics Studio (HGS, http://studio.nig.ac.jp). HGS provides one with a set of sequences being as continuous as possible in any one of the 24 chromosomes. Both GIB and HGS have been updated incorporating newly available data and retrieval tools. PMID:11752245

  14. LncRNA/DNA binding analysis reveals losses and gains and lineage specificity of genomic imprinting in mammals.

    PubMed

    Liu, Haihua; Shang, Xiaoxiao; Zhu, Hao

    2017-05-15

    Genomic imprinting is regulated by lncRNAs and is important for embryogenesis, physiology and behaviour in mammals. Aberrant imprinting causes diseases and disorders. Experimental studies have examined genomic imprinting primarily in humans and mice, thus leaving some fundamental issues poorly addressed. The cost of experimentally examining imprinted genes in many tissues in diverse species makes computational analysis of lncRNAs' DNA binding sites valuable. We performed lncRNA/DNA binding analysis in imprinting clusters from multiple mammalian clades and discovered the following: (i) lncRNAs and imprinting sites show significant losses and gains and distinct lineage-specificity; (ii) binding of lncRNAs to promoters of imprinted genes may occur widely throughout the genome; (iii) a considerable number of imprinting sites occur in only evolutionarily more derived species; and (iv) multiple lncRNAs may bind to the same imprinting sites, and some lncRNAs have multiple DNA binding motifs. These results suggest that the occurrence of abundant lncRNAs in mammalian genomes makes genomic imprinting a mechanism of adaptive evolution at the epigenome level. The data and program are available at the database LongMan at lncRNA.smu.edu.cn. zhuhao@smu.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  15. Comparison of strategies for the isolation of PCR-compatible, genomic DNA from a municipal biogas plants.

    PubMed

    Weiss, Agnes; Jérôme, Valérie; Freitag, Ruth

    2007-06-15

    The goal of the project was the extraction of PCR-compatible genomic DNA representative of the entire microbial community from municipal biogas plant samples (mash, bioreactor content, process water, liquid fertilizer). For the initial isolation of representative DNA from the respective lysates, methods were used that employed adsorption, extraction, or precipitation to specifically enrich the DNA. Since no dedicated method for biogas plant samples was available, preference was given to kits/methods suited to samples that resembled either the bioreactor feed, e.g. foodstuffs, or those intended for environmental samples including wastewater. None of the methods succeeded in preparing DNA that was directly PCR-compatible. Instead the DNA was found to still contain considerable amounts of difficult-to-remove enzyme inhibitors (presumably humic acids) that hindered the PCR reaction. Based on the isolation method that gave the highest yield/purity for all sample types, subsequent purification was attempted by agarose gel electrophoresis followed by electroelution, spermine precipitation, or dialysis through nitrocellulose membrane. A combination of phenol/chloroform extraction followed by purification via dialysis constituted the most efficient sample treatment. When such DNA preparations were diluted 1:100 they did no longer inhibit PCR reactions, while they still contained sufficient genomic DNA to allow specific amplification of specific target sequences.

  16. Current and Emerging Technologies for the Analysis of the Genome-Wide and Locus-Specific DNA Methylation Patterns.

    PubMed

    Tost, Jörg

    2016-01-01

    DNA methylation is the most studied epigenetic modification, and altered DNA methylation patterns have been identified in cancer and more recently also in many other complex diseases. Furthermore, DNA methylation is influenced by a variety of environmental factors, and the analysis of DNA methylation patterns might allow deciphering previous exposure. Although a large number of techniques to study DNA methylation either genome-wide or at specific loci have been devised, they all are based on a limited number of principles for differentiating the methylation state, viz., methylation-specific/methylation-dependent restriction enzymes, antibodies or methyl-binding proteins, chemical-based enrichment, or bisulfite conversion. Second-generation sequencing has largely replaced microarrays as readout platform and is also becoming more popular for locus-specific DNA methylation analysis. In this chapter, the currently used methods for both genome-wide and locus-specific analysis of 5-methylcytosine and as its oxidative derivatives, such as 5-hydroxymethylcytosine, are reviewed in detail, and the advantages and limitations of each approach are discussed. Furthermore, emerging technologies avoiding PCR amplification and allowing a direct readout of DNA methylation are summarized, together with novel applications, such as the detection of DNA methylation in single cells or in circulating cell-free DNA.

  17. Novel genomes and genome constitutions identified by GISH and 5S rDNA and knotted1 genomic sequences in the genus Setaria.

    PubMed

    Zhao, Meicheng; Zhi, Hui; Doust, Andrew N; Li, Wei; Wang, Yongfang; Li, Haiquan; Jia, Guanqing; Wang, Yongqiang; Zhang, Ning; Diao, Xianmin

    2013-04-11

    The Setaria genus is increasingly of interest to researchers, as its two species, S. viridis and S. italica, are being developed as models for understanding C4 photosynthesis and plant functional genomics. The genome constitution of Setaria species has been studied in the diploid species S. viridis, S. adhaerans and S. grisebachii, where three genomes A, B and C were identified respectively. Two allotetraploid species, S. verticillata and S. faberi, were found to have AABB genomes, and one autotetraploid species, S. queenslandica, with an AAAA genome, has also been identified. The genomes and genome constitutions of most other species remain unknown, even though it was thought there are approximately 125 species in the genus distributed world-wide. GISH was performed to detect the genome constitutions of Eurasia species of S. glauca, S. plicata, and S. arenaria, with the known A, B and C genomes as probes. No or very poor hybridization signal was detected indicating that their genomes are different from those already described. GISH was also performed reciprocally between S. glauca, S. plicata, and S. arenaria genomes, but no hybridization signals between each other were found. The two sets of chromosomes of S. lachnea both hybridized strong signals with only the known C genome of S. grisebachii. Chromosomes of Qing 9, an accession formerly considered as S. viridis, hybridized strong signal only to B genome of S. adherans. Phylogenetic trees constructed with 5S rDNA and knotted1 markers, clearly classify the samples in this study into six clusters, matching the GISH results, and suggesting that the F genome of S. arenaria is basal in the genus. Three novel genomes in the Setaria genus were identified and designated as genome D (S. glauca), E (S. plicata) and F (S. arenaria) respectively. The genome constitution of tetraploid S. lachnea is putatively CCC'C'. Qing 9 is a B genome species indigenous to China and is hypothesized to be a newly identified species. The

  18. Novel genomes and genome constitutions identified by GISH and 5S rDNA and knotted1 genomic sequences in the genus Setaria

    PubMed Central

    2013-01-01

    Background The Setaria genus is increasingly of interest to researchers, as its two species, S. viridis and S. italica, are being developed as models for understanding C4 photosynthesis and plant functional genomics. The genome constitution of Setaria species has been studied in the diploid species S. viridis, S. adhaerans and S. grisebachii, where three genomes A, B and C were identified respectively. Two allotetraploid species, S. verticillata and S. faberi, were found to have AABB genomes, and one autotetraploid species, S. queenslandica, with an AAAA genome, has also been identified. The genomes and genome constitutions of most other species remain unknown, even though it was thought there are approximately 125 species in the genus distributed world-wide. Results GISH was performed to detect the genome constitutions of Eurasia species of S. glauca, S. plicata, and S. arenaria, with the known A, B and C genomes as probes. No or very poor hybridization signal was detected indicating that their genomes are different from those already described. GISH was also performed reciprocally between S. glauca, S. plicata, and S. arenaria genomes, but no hybridization signals between each other were found. The two sets of chromosomes of S. lachnea both hybridized strong signals with only the known C genome of S. grisebachii. Chromosomes of Qing 9, an accession formerly considered as S. viridis, hybridized strong signal only to B genome of S. adherans. Phylogenetic trees constructed with 5S rDNA and knotted1 markers, clearly classify the samples in this study into six clusters, matching the GISH results, and suggesting that the F genome of S. arenaria is basal in the genus. Conclusions Three novel genomes in the Setaria genus were identified and designated as genome D (S. glauca), E (S. plicata) and F (S. arenaria) respectively. The genome constitution of tetraploid S. lachnea is putatively CCC’C’. Qing 9 is a B genome species indigenous to China and is hypothesized to be

  19. Genome Evolution and Meiotic Maps by Massively Parallel DNA Sequencing: Spotted Gar, an Outgroup for the Teleost Genome Duplication

    PubMed Central

    Amores, Angel; Catchen, Julian; Ferrara, Allyse; Fontenot, Quenton; Postlethwait, John H.

    2011-01-01

    Genomic resources for hundreds of species of evolutionary, agricultural, economic, and medical importance are unavailable due to the expense of well-assembled genome sequences and difficulties with multigenerational studies. Teleost fish provide many models for human disease but possess anciently duplicated genomes that sometimes obfuscate connectivity. Genomic information representing a fish lineage that diverged before the teleost genome duplication (TGD) would provide an outgroup for exploring the mechanisms of evolution after whole-genome duplication. We exploited massively parallel DNA sequencing to develop meiotic maps with thrift and speed by genotyping F1 offspring of a single female and a single male spotted gar (Lepisosteus oculatus) collected directly from nature utilizing only polymorphisms existing in these two wild individuals. Using Stacks, software that automates the calling of genotypes from polymorphisms assayed by Illumina sequencing, we constructed a map containing 8406 markers. RNA-seq on two map-cross larvae provided a reference transcriptome that identified nearly 1000 mapped protein-coding markers and allowed genome-wide analysis of conserved synteny. Results showed that the gar lineage diverged from teleosts before the TGD and its genome is organized more similarly to that of humans than teleosts. Thus, spotted gar provides a critical link between medical models in teleost fish, to which gar is biologically similar, and humans, to which gar is genomically similar. Application of our F1 dense mapping strategy to species with no prior genome information promises to facilitate comparative genomics and provide a scaffold for ordering the numerous contigs arising from next generation genome sequencing. PMID:21828280

  20. Rapidly expanding genetic diversity and host range of the Circoviridae viral family and other Rep encoding small circular ssDNA genomes.

    PubMed

    Delwart, Eric; Li, Linlin

    2012-03-01

    The genomes of numerous circoviruses and distantly related circular ssDNA viruses encoding a rolling circle replication initiator protein (Rep) have been characterized from the tissues of mammals, fish, insects, plants (geminivirus and nanovirus), in human and animal feces, in an algae cell, and in diverse environmental samples. We review the genome organization, phylogenetic relationships and initial prevalence studies of cycloviruses, a proposed new genus in the Circoviridae family. Viral fossil rep sequences were also recently identified integrated on the chromosomes of mammals, frogs, lancelets, crustaceans, mites, gastropods, roundworms, placozoans, hydrozoans, protozoans, land plants, fungi, algae, and phytoplasma bacterias and their plasmids, reflecting the very wide past host range of rep bearing viruses. An ancient origin for viruses with Rep-encoding small circular ssDNA genomes, predating the diversification of eukaryotes, is discussed. The cellular hosts and pathogenicity of many recently described rep-containing circular ssDNA genomes remain to be determined. Future studies of the virome of single cell and multi-cellular eukaryotes are likely to further extend the known diversity and host-range of small rep-containing circular ssDNA viral genomes. Copyright © 2011 Elsevier B.V. All rights reserved.

  1. Sequence evaluation of four specific cDNA libraries for developmental genomics of sunflower.

    PubMed

    Tamborindeguy, C; Ben, C; Liboz, T; Gentzbittel, L

    2004-04-01

    Four different cDNA libraries were constructed from sunflower protoplasts growing under embryogenic and non-embryogenic conditions: one standard library from each condition and two subtractive libraries in opposite sense. A total of 22,876 cDNA clones were obtained and 4800 ESTs were sequenced, giving rise to 2479 high quality ESTs representing an unigene set of 1502 sequences. This set was compared with ESTs represented in public databases using the programs BLASTN and BLASTX, and its members were classified according to putative function using the catalog in the Kyoto Encyclopedia of Genes and Genomes (KEGG). Some 33% of sequences failed to align with existing plant ESTs and therefore represent putative novel genes. The libraries show a low level of redundancy and, on average, 50% of the present ESTs have not been previously reported for sunflower. Several potentially interesting genes were identified, based on their homology with genes involved in animal zygotic division or plant embryogenesis. We also identified two ESTs that show significantly different levels of expression under embryogenic and non-embryogenic conditions. The libraries described here represent an original and valuable resource for the discovery of yet unknown genes putatively involved in dicot embryogenesis and improving our knowledge of the mechanisms involved in polarity acquisition by plant embryos.

  2. Comparison of whole-genome bisulfite sequencing library preparation strategies identifies sources of biases affecting DNA methylation data.

    PubMed

    Olova, Nelly; Krueger, Felix; Andrews, Simon; Oxley, David; Berrens, Rebecca V; Branco, Miguel R; Reik, Wolf

    2018-03-15

    Whole-genome bisulfite sequencing (WGBS) is becoming an increasingly accessible technique, used widely for both fundamental and disease-oriented research. Library preparation methods benefit from a variety of available kits, polymerases and bisulfite conversion protocols. Although some steps in the procedure, such as PCR amplification, are known to introduce biases, a systematic evaluation of biases in WGBS strategies is missing. We perform a comparative analysis of several commonly used pre- and post-bisulfite WGBS library preparation protocols for their performance and quality of sequencing outputs. Our results show that bisulfite conversion per se is the main trigger of pronounced sequencing biases, and PCR amplification builds on these underlying artefacts. The majority of standard library preparation methods yield a significantly biased sequence output and overestimate global methylation. Importantly, both absolute and relative methylation levels at specific genomic regions vary substantially between methods, with clear implications for DNA methylation studies. We show that amplification-free library preparation is the least biased approach for WGBS. In protocols with amplification, the choice of bisulfite conversion protocol or polymerase can significantly minimize artefacts. To aid with the quality assessment of existing WGBS datasets, we have integrated a bias diagnostic tool in the Bismark package and offer several approaches for consideration during the preparation and analysis of WGBS datasets.

  3. Genome-wide analysis of DNA methylation in five tissues of sika deer (Cervus nippon).

    PubMed

    Yang, Chun; Zhang, Yan; Liu, Wenyuan; Lu, Xiao; Li, Chunyi

    2018-03-01

    DNA methylation plays an important role in regulating gene expression during tissue development and differentiation in eukaryotes. In contrast to domestic animals, epigenetic studies have been seldom conducted in wild animals. In the present study, we conducted the genome-wide profiling of DNA methylation for five tissues of sika deer using the fluorescence-labeled methylation-sensitive amplified polymorphism (F-MSAP) technique. Overall, a total of 104,131 fragments were amplified including 41,951 methylated fragments using 32 pairs of selected primers. The average incidence of DNA methylation was approximately 38.18% in muscle, 40.32% in heart, 41.86% in liver, 41.20% in lung, and 41.68% in kidney, respectively. Also, the significant differences of the DNA methylation levels were found between the different tissue types (P<0.05), which indicates that the differences of genome-wide DNA methylation levels may be related to gene expression during tissue development and differentiation. In addition, 37 tissue-specific differentially methylated regions (T-DMRs) were identified and recovered by MSAP in five tissues, and were further confirmed by Southern blot analysis. Our study presents the first look at the T-DMRs in sika deer and represents an initial step towards understanding of epigenetic regulatory mechanism underlying tissue development and differentiation in sika deer. Copyright © 2017. Published by Elsevier B.V.

  4. USE OF COMPETITIVE DNA HYBRIDIZATION TO IDENTIFY DIFFERENCES IN THE GENOMES OF TWO CLOSELY RELATED FECAL INDICATOR BACTERIA

    EPA Science Inventory

    Although recent technological advances in DNA sequencing and computational biology now allow scientists to compare entire microbial genomes, comparisons of closely related bacterial species and individual isolates by whole-genome sequencing approaches remains prohibitively expens...

  5. FISH Oracle: a web server for flexible visualization of DNA copy number data in a genomic context

    PubMed Central

    2011-01-01

    Background The rapidly growing amount of array CGH data requires improved visualization software supporting the process of identifying candidate cancer genes. Optimally, such software should work across multiple microarray platforms, should be able to cope with data from different sources and should be easy to operate. Results We have developed a web-based software FISH Oracle to visualize data from multiple array CGH experiments in a genomic context. Its fast visualization engine and advanced web and database technology supports highly interactive use. FISH Oracle comes with a convenient data import mechanism, powerful search options for genomic elements (e.g. gene names or karyobands), quick navigation and zooming into interesting regions, and mechanisms to export the visualization into different high quality formats. These features make the software especially suitable for the needs of life scientists. Conclusions FISH Oracle offers a fast and easy to use visualization tool for array CGH and SNP array data. It allows for the identification of genomic regions representing minimal common changes based on data from one or more experiments. FISH Oracle will be instrumental to identify candidate onco and tumor suppressor genes based on the frequency and genomic position of DNA copy number changes. The FISH Oracle application and an installed demo web server are available at http://www.zbh.uni-hamburg.de/fishoracle. PMID:21884636

  6. Optimized DNA extraction from neonatal dried blood spots: application in methylome profiling

    PubMed Central

    2014-01-01

    Background Neonatal dried blood spots (DBS) represent an inexpensive method for long-term biobanking worldwide and are considered gold mines for research for several human diseases, including those of metabolic, infectious, genetic and epigenetic origin. However, the utility of DBS is restricted by the limited amount and quality of extractable biomolecules (including DNA), especially for genome wide profiling. Degradation of DNA in DBS often occurs during storage and extraction. Moreover, amplifying small quantities of DNA often leads to a bias in subsequent data, particularly in methylome profiles. Thus it is important to develop methodologies that maximize both the yield and quality of DNA from DBS for downstream analyses. Results Using combinations of in-house-derived and modified commercial extraction kits, we developed a robust and efficient protocol, compatible with methylome studies, many of which require stringent bisulfite conversion steps. Several parameters were tested in a step-wise manner, including blood extraction, cell lysis, protein digestion, and DNA precipitation, purification and elution. DNA quality was assessed based on spectrophotometric measurements, DNA detectability by PCR, and DNA integrity by gel electrophoresis and bioanalyzer analyses. Genome scale Infinium HumanMethylation450 and locus-specific pyrosequencing data generated using the refined DBS extraction protocol were of high quality, reproducible and consistent. Conclusions This study may prove useful to meet the increased demand for research on prenatal, particularly epigenetic, origins of human diseases and for newborn screening programs, all of which are often based on DNA extracted from DBS. PMID:24980254

  7. Phenotypic diversification by enhanced genome restructuring after induction of multiple DNA double-strand breaks.

    PubMed

    Muramoto, Nobuhiko; Oda, Arisa; Tanaka, Hidenori; Nakamura, Takahiro; Kugou, Kazuto; Suda, Kazuki; Kobayashi, Aki; Yoneda, Shiori; Ikeuchi, Akinori; Sugimoto, Hiroki; Kondo, Satoshi; Ohto, Chikara; Shibata, Takehiko; Mitsukawa, Norihiro; Ohta, Kunihiro

    2018-05-18

    DNA double-strand break (DSB)-mediated genome rearrangements are assumed to provide diverse raw genetic materials enabling accelerated adaptive evolution; however, it remains unclear about the consequences of massive simultaneous DSB formation in cells and their resulting phenotypic impact. Here, we establish an artificial genome-restructuring technology by conditionally introducing multiple genomic DSBs in vivo using a temperature-dependent endonuclease TaqI. Application in yeast and Arabidopsis thaliana generates strains with phenotypes, including improved ethanol production from xylose at higher temperature and increased plant biomass, that are stably inherited to offspring after multiple passages. High-throughput genome resequencing revealed that these strains harbor diverse rearrangements, including copy number variations, translocations in retrotransposons, and direct end-joinings at TaqI-cleavage sites. Furthermore, large-scale rearrangements occur frequently in diploid yeasts (28.1%) and tetraploid plants (46.3%), whereas haploid yeasts and diploid plants undergo minimal rearrangement. This genome-restructuring system (TAQing system) will enable rapid genome breeding and aid genome-evolution studies.

  8. Genome-wide measures of DNA methylation in peripheral blood and the risk of urothelial cell carcinoma: a prospective nested case–control study

    PubMed Central

    Dugué, Pierre-Antoine; Brinkman, Maree T; Milne, Roger L; Wong, Ee Ming; FitzGerald, Liesel M; Bassett, Julie K; Joo, Jihoon E; Jung, Chol-Hee; Makalic, Enes; Schmidt, Daniel F; Park, Daniel J; Chung, Jessica; Ta, Anthony D; Bolton, Damien M; Lonie, Andrew; Longano, Anthony; Hopper, John L; Severi, Gianluca; Saffery, Richard; English, Dallas R; Southey, Melissa C; Giles, Graham G

    2016-01-01

    Background: Global DNA methylation has been reported to be associated with urothelial cell carcinoma (UCC) by studies using blood samples collected at diagnosis. Using the Illumina HumanMethylation450 assay, we derived genome-wide measures of blood DNA methylation and assessed them for their prospective association with UCC risk. Methods: We used 439 case–control pairs from the Melbourne Collaborative Cohort Study matched on age, sex, country of birth, DNA sample type, and collection period. Conditional logistic regression was used to compute odds ratios (OR) of UCC risk per s.d. of each genome-wide measure of DNA methylation and 95% confidence intervals (CIs), adjusted for potential confounders. We also investigated associations by disease subtype, sex, smoking, and time since blood collection. Results: The risk of superficial UCC was decreased for individuals with higher levels of our genome-wide DNA methylation measure (OR=0.71, 95% CI: 0.54–0.94; P=0.02). This association was particularly strong for current smokers at sample collection (OR=0.47, 95% CI: 0.27–0.83). Intermediate levels of our genome-wide measure were associated with decreased risk of invasive UCC. Some variation was observed between UCC subtypes and the location and regulatory function of the CpGs included in the genome-wide measures of methylation. Conclusions: Higher levels of our genome-wide DNA methylation measure were associated with decreased risk of superficial UCC and intermediate levels were associated with reduced risk of invasive disease. These findings require replication by other prospective studies. PMID:27490804

  9. The complete mitochondrial genome of the enigmatic bigheadedturtle (Platysternon): description of unusual genomic features and thereconciliation of phylogenetic hypotheses based on mitochondrial andnuclear DNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Parham, James F.; Feldman, Chris R.; Boore, Jeffrey L.

    2005-12-28

    The big-headed turtle (Platysternon megacephalum) from east Asia is the sole living representative of a poorly-studied turtle lineage (Platysternidae). It has no close living relatives, and its phylogenetic position within turtles is one of the outstanding controversies in turtle systematics. Platysternon was traditionally considered to be close to snapping turtles (Chelydridae) based on some studies of its morphology and mitochondrial (mt) DNA, however, other studies of morphology and nuclear (nu) DNA do not support that hypothesis. We sequenced the complete mt genome of Platysternon and the nearly complete mt genomes of two other relevant turtles and compared them to turtlemore » mt genomes from the literature to form the largest molecular dataset used to date to address this issue. The resulting phylogeny robustly rejects the placement of Platysternon with Chelydridae, but instead shows that it is a member of the Testudinoidea, a diverse, nearly globally-distributed group that includes pond turtles and tortoises. We also discovered that Platysternon mtDNA has large-scale gene rearrangements and possesses two, nearly identical, control regions, features that distinguish it from all other studied turtles. Our study robustly determines the phylogenetic placement of Platysternon and provides a well-resolved outline of major turtle lineages, while demonstrating the significantly greater resolving power of comparing large amounts of mt sequence over that of short fragments. Earlier phylogenies placing Platysternon with chelydrids required a temporal gap in the fossil record that is now unnecessary. The duplicated control regions and gene rearrangements of the Platysternon mt DNA probably resulted from the duplication of part of the genome and then the subsequent loss of redundant genes. Although it is possible that having two control regions may provide some advantage, explaining why the control regions would be maintained while some of the duplicated genes were

  10. Surface-enhanced Raman spectroscopy of genomic DNA from in vitro grown tomato (Lycopersicon esculentum Mill.) cultivars before and after plant cryopreservation

    NASA Astrophysics Data System (ADS)

    Muntean, Cristina M.; Leopold, Nicolae; Tripon, Carmen; Coste, Ana; Halmagyi, Adela

    2015-06-01

    In this work the surface-enhanced Raman scattering (SERS) spectra of five genomic DNAs from non-cryopreserved control tomato plants (Lycopersicon esculentum Mill. cultivars Siriana, Darsirius, Kristin, Pontica and Capriciu) respectively, have been analyzed in the wavenumber range 400-1800 cm-1. Structural changes induced in genomic DNAs upon cryopreservation were discussed in detail for four of the above mentioned tomato cultivars. The surface-enhanced Raman vibrational modes for each of these cases, spectroscopic band assignments and structural interpretations of genomic DNAs are reported. We have found, that DNA isolated from Siriana cultivar leaf tissues suffers the weakest structural changes upon cryogenic storage of tomato shoot apices. On the contrary, genomic DNA extracted from Pontica cultivar is the most responsive system to cryopreservation process. Particularly, both C2‧-endo-anti and C3'-endo-anti conformations have been detected. As a general observation, the wavenumber range 1511-1652 cm-1, being due to dA, dG and dT residues seems to be influenced by cryopreservation process. These changes could reflect unstacking of DNA bases. However, not significant structural changes of genomic DNAs from Siriana, Darsirius and Kristin have been found upon cryopreservation process of tomato cultivars. Based on this work, specific plant DNA-ligand interactions or accurate local structure of DNA in the proximity of a metallic surface, might be further investigated using surface-enhanced Raman spectroscopy.

  11. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

    PubMed

    Chin, Chen-Shan; Alexander, David H; Marks, Patrick; Klammer, Aaron A; Drake, James; Heiner, Cheryl; Clum, Alicia; Copeland, Alex; Huddleston, John; Eichler, Evan E; Turner, Stephen W; Korlach, Jonas

    2013-06-01

    We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.

  12. Genome-wide profiling of DNA-binding proteins using barcode-based multiplex Solexa sequencing.

    PubMed

    Raghav, Sunil Kumar; Deplancke, Bart

    2012-01-01

    Chromatin immunoprecipitation (ChIP) is a commonly used technique to detect the in vivo binding of proteins to DNA. ChIP is now routinely paired to microarray analysis (ChIP-chip) or next-generation sequencing (ChIP-Seq) to profile the DNA occupancy of proteins of interest on a genome-wide level. Because ChIP-chip introduces several biases, most notably due to the use of a fixed number of probes, ChIP-Seq has quickly become the method of choice as, depending on the sequencing depth, it is more sensitive, quantitative, and provides a greater binding site location resolution. With the ever increasing number of reads that can be generated per sequencing run, it has now become possible to analyze several samples simultaneously while maintaining sufficient sequence coverage, thus significantly reducing the cost per ChIP-Seq experiment. In this chapter, we provide a step-by-step guide on how to perform multiplexed ChIP-Seq analyses. As a proof-of-concept, we focus on the genome-wide profiling of RNA Polymerase II as measuring its DNA occupancy at different stages of any biological process can provide insights into the gene regulatory mechanisms involved. However, the protocol can also be used to perform multiplexed ChIP-Seq analyses of other DNA-binding proteins such as chromatin modifiers and transcription factors.

  13. Genome-Wide DNA Methylation Indicates Silencing of Tumor Suppressor Genes in Uterine Leiomyoma

    PubMed Central

    Navarro, Antonia; Yin, Ping; Monsivais, Diana; Lin, Simon M.; Du, Pan; Wei, Jian-Jun; Bulun, Serdar E.

    2012-01-01

    Background Uterine leiomyomas, or fibroids, represent the most common benign tumor of the female reproductive tract. Fibroids become symptomatic in 30% of all women and up to 70% of African American women of reproductive age. Epigenetic dysregulation of individual genes has been demonstrated in leiomyoma cells; however, the in vivo genome-wide distribution of such epigenetic abnormalities remains unknown. Principal Findings We characterized and compared genome-wide DNA methylation and mRNA expression profiles in uterine leiomyoma and matched adjacent normal myometrial tissues from 18 African American women. We found 55 genes with differential promoter methylation and concominant differences in mRNA expression in uterine leiomyoma versus normal myometrium. Eighty percent of the identified genes showed an inverse relationship between DNA methylation status and mRNA expression in uterine leiomyoma tissues, and the majority of genes (62%) displayed hypermethylation associated with gene silencing. We selected three genes, the known tumor suppressors KLF11, DLEC1, and KRT19 and verified promoter hypermethylation, mRNA repression and protein expression using bisulfite sequencing, real-time PCR and western blot. Incubation of primary leiomyoma smooth muscle cells with a DNA methyltransferase inhibitor restored KLF11, DLEC1 and KRT19 mRNA levels. Conclusions These results suggest a possible functional role of promoter DNA methylation-mediated gene silencing in the pathogenesis of uterine leiomyoma in African American women. PMID:22428009

  14. Distinct Mechanisms of Nuclease-Directed DNA-Structure-Induced Genetic Instability in Cancer Genomes.

    PubMed

    Zhao, Junhua; Wang, Guliang; Del Mundo, Imee M; McKinney, Jennifer A; Lu, Xiuli; Bacolla, Albino; Boulware, Stephen B; Zhang, Changsheng; Zhang, Haihua; Ren, Pengyu; Freudenreich, Catherine H; Vasquez, Karen M

    2018-01-30

    Sequences with the capacity to adopt alternative DNA structures have been implicated in cancer etiology; however, the mechanisms are unclear. For example, H-DNA-forming sequences within oncogenes have been shown to stimulate genetic instability in mammals. Here, we report that H-DNA-forming sequences are enriched at translocation breakpoints in human cancer genomes, further implicating them in cancer etiology. H-DNA-induced mutations were suppressed in human cells deficient in the nucleotide excision repair nucleases, ERCC1-XPF and XPG, but were stimulated in cells deficient in FEN1, a replication-related endonuclease. Further, we found that these nucleases cleaved H-DNA conformations, and the interactions of modeled H-DNA with ERCC1-XPF, XPG, and FEN1 proteins were explored at the sub-molecular level. The results suggest mechanisms of genetic instability triggered by H-DNA through distinct structure-specific, cleavage-based replication-independent and replication-dependent pathways, providing critical evidence for a role of the DNA structure itself in the etiology of cancer and other human diseases. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Near-quantitative extraction of genomic DNA from various food-borne eubacteria

    USDA-ARS?s Scientific Manuscript database

    In this work we have tested a dozen commercial bacterial genomic DNA extraction methodologies on an average of 7.70E6 (± 9.05%), 4.77E8 (± 31.0%), and 5.93E8 (± 4.69%) colony forming units (CFU) associated with 3 cultures (n = 3) each of Brochothrix thermosphacta (Bt), Shigella sonnei (Ss), and Esch...

  16. TRAIP promotes DNA damage response during genome replication and is mutated in primordial dwarfism.

    PubMed

    Harley, Margaret E; Murina, Olga; Leitch, Andrea; Higgs, Martin R; Bicknell, Louise S; Yigit, Gökhan; Blackford, Andrew N; Zlatanou, Anastasia; Mackenzie, Karen J; Reddy, Kaalak; Halachev, Mihail; McGlasson, Sarah; Reijns, Martin A M; Fluteau, Adeline; Martin, Carol-Anne; Sabbioneda, Simone; Elcioglu, Nursel H; Altmüller, Janine; Thiele, Holger; Greenhalgh, Lynn; Chessa, Luciana; Maghnie, Mohamad; Salim, Mahmoud; Bober, Michael B; Nürnberg, Peter; Jackson, Stephen P; Hurles, Matthew E; Wollnik, Bernd; Stewart, Grant S; Jackson, Andrew P

    2016-01-01

    DNA lesions encountered by replicative polymerases threaten genome stability and cell cycle progression. Here we report the identification of mutations in TRAIP, encoding an E3 RING ubiquitin ligase, in patients with microcephalic primordial dwarfism. We establish that TRAIP relocalizes to sites of DNA damage, where it is required for optimal phosphorylation of H2AX and RPA2 during S-phase in response to ultraviolet (UV) irradiation, as well as fork progression through UV-induced DNA lesions. TRAIP is necessary for efficient cell cycle progression and mutations in TRAIP therefore limit cellular proliferation, providing a potential mechanism for microcephaly and dwarfism phenotypes. Human genetics thus identifies TRAIP as a component of the DNA damage response to replication-blocking DNA lesions.

  17. TRAIP promotes DNA damage response during genome replication and is mutated in primordial dwarfism

    PubMed Central

    Leitch, Andrea; Higgs, Martin R.; Bicknell, Louise S.; Yigit, Gökhan; Blackford, Andrew N.; Zlatanou, Anastasia; Mackenzie, Karen J.; Reddy, Kaalak; Halachev, Mihail; McGlasson, Sarah; Reijns, Martin A. M.; Fluteau, Adeline; Martin, Carol-Anne; Sabbioneda, Simone; Elcioglu, Nursel H.; Altmüller, Janine; Thiele, Holger; Greenhalgh, Lynn; Chessa, Luciana; Maghnie, Mohamad; Salim, Mahmoud; Bober, Michael B.; Nürnberg, Peter; Jackson, Stephen P.; Hurles, Matthew E.; Wollnik, Bernd; Stewart, Grant S.; Jackson, Andrew P.

    2015-01-01

    DNA lesions encountered by replicative polymerases threaten genome stability and cell cycle progression. Here we report the identification of mutations in TRAIP, encoding an E3 RING ubiquitin ligase, in patients with microcephalic primordial dwarfism/Seckel syndrome. We establish that TRAIP relocalizes to sites of DNA damage where it is required for optimal phosphorylation of H2AX and RPA2 during S-phase in response to UV irradiation, as well as fork progression through UV-induced DNA lesions. TRAIP is necessary for efficient cell cycle progression and mutations in TRAIP therefore limit cellular proliferation, providing a potential mechanism for microcephaly and dwarfism phenotypes. Human genetics thus identifies TRAIP as a novel component of the DNA damage response to replication-blocking DNA lesions. PMID:26595769

  18. Cmr1/WDR76 defines a nuclear genotoxic stress body linking genome integrity and protein quality control.

    PubMed

    Gallina, Irene; Colding, Camilla; Henriksen, Peter; Beli, Petra; Nakamura, Kyosuke; Offman, Judith; Mathiasen, David P; Silva, Sonia; Hoffmann, Eva; Groth, Anja; Choudhary, Chunaram; Lisby, Michael

    2015-03-30

    DNA replication stress is a source of genomic instability. Here we identify changed mutation rate 1 (Cmr1) as a factor involved in the response to DNA replication stress in Saccharomyces cerevisiae and show that Cmr1--together with Mrc1/Claspin, Pph3, the chaperonin containing TCP1 (CCT) and 25 other proteins--define a novel intranuclear quality control compartment (INQ) that sequesters misfolded, ubiquitylated and sumoylated proteins in response to genotoxic stress. The diversity of proteins that localize to INQ indicates that other biological processes such as cell cycle progression, chromatin and mitotic spindle organization may also be regulated through INQ. Similar to Cmr1, its human orthologue WDR76 responds to proteasome inhibition and DNA damage by relocalizing to nuclear foci and physically associating with CCT, suggesting an evolutionarily conserved biological function. We propose that Cmr1/WDR76 plays a role in the recovery from genotoxic stress through regulation of the turnover of sumoylated and phosphorylated proteins.

  19. Cmr1/WDR76 defines a nuclear genotoxic stress body linking genome integrity and protein quality control

    PubMed Central

    Gallina, Irene; Colding, Camilla; Henriksen, Peter; Beli, Petra; Nakamura, Kyosuke; Offman, Judith; Mathiasen, David P.; Silva, Sonia; Hoffmann, Eva; Groth, Anja; Choudhary, Chunaram; Lisby, Michael

    2015-01-01

    DNA replication stress is a source of genomic instability. Here we identify changed mutation rate 1 (Cmr1) as a factor involved in the response to DNA replication stress in Saccharomyces cerevisiae and show that Cmr1—together with Mrc1/Claspin, Pph3, the chaperonin containing TCP1 (CCT) and 25 other proteins—define a novel intranuclear quality control compartment (INQ) that sequesters misfolded, ubiquitylated and sumoylated proteins in response to genotoxic stress. The diversity of proteins that localize to INQ indicates that other biological processes such as cell cycle progression, chromatin and mitotic spindle organization may also be regulated through INQ. Similar to Cmr1, its human orthologue WDR76 responds to proteasome inhibition and DNA damage by relocalizing to nuclear foci and physically associating with CCT, suggesting an evolutionarily conserved biological function. We propose that Cmr1/WDR76 plays a role in the recovery from genotoxic stress through regulation of the turnover of sumoylated and phosphorylated proteins. PMID:25817432

  20. Maintenance of Genome Stability and Breast Cancer: Molecular Analysis of DNA Damage-Activated Kinases

    DTIC Science & Technology

    2008-03-01

    Breast Cancer: Molecular Analysis of DNA Damage-Activated Kinases PRINCIPAL INVESTIGATOR: Daniel Mordes...Maintenance of Genome Stability and Breast Cancer: Molecular Analysis of DNA Damage-Activated Kinases 5b. GRANT NUMBER W81XWH-06-1-0352 5c...shown that this domain of Dpb11 stimulates the kinase activity of wild-type Mec1-Ddc2 yet did not simulate Mec1-ddc2-top. Thus, we have demonstrated

  1. Identification of the "A" genome of finger millet using chloroplast DNA.

    PubMed

    Hilu, K W

    1988-01-01

    Finger millet (Eleusine corocana subsp. coracana), an important cereal in East Africa and India, is a tetraploid species with unknown genomic components. A recent cytogenetic study confirmed the direct origin of this millet from the tetraploid E. coracana subsp. africana but questioned Eleusine indica as a genomic donor. Chloroplast (ct) DNA sequence analysis using restriction fragment pattern was used to examine the phylogenetic relationships between E. coracana subsp. coracana (domesticated finger millet), E. coracana subspecies africana (wild finger millet), and E. indica. Eleusine tristachya was included since it is the only other annual diploid species in the genus with a basic chromosome number of x = 9 like finger millet. Eight of the ten restriction endonucleases used had 16 to over 30 restriction sites per genome and were informative. E. coracana subsp. coracana and subsp. africana and E. indica were identical in all the restriction sites surveyed, while the ct genome of E, tristachya differed consistently by at least one mutational event for each restriction enzyme surveyed. This random survey of the ct genomes of these species points out E. indica as one of the genome donors (maternal genome donor) of domesticated finger millet contrary to a previous cytogenetic study. The data also substantiate E. coracana subsp. africana as the progenitor of domesticated finger millet. The disparity between the cytogenetic and the molecular approaches is discussed in light of the problems associated with chromosome pairing and polyploidy.

  2. Identification of the ``a'' Genome of Finger Millet Using Chloroplast DNA

    PubMed Central

    Hilu, K. W.

    1988-01-01

    Finger millet (Eleusine corocana subsp. coracana), an important cereal in East Africa and India, is a tetraploid species with unknown genomic components. A recent cytogenetic study confirmed the direct origin of this millet from the tetraploid E. coracana subsp. africana but questioned Eleusine indica as a genomic donor. Chloroplast (ct) DNA sequence analysis using restriction fragment pattern was used to examine the phylogenetic relationships between E. coracana subsp. coracana (domesticated finger millet), E. coracana subspecies africana (wild finger millet), and E. indica. Eleusine tristachya was included since it is the only other annual diploid species in the genus with a basic chromosome number of x = 9 like finger millet. Eight of the ten restriction endonucleases used had 16 to over 30 restriction sites per genome and were informative. E. coracana subsp. coracana and subsp. africana and E. indica were identical in all the restriction sites surveyed, while the ct genome of E. tristachya differed consistently by at least one mutational event for each restriction enzyme surveyed. This random survey of the ct genomes of these species points out E. indica as one of the genome donors (maternal genome donor) of domesticated finger millet contrary to a previous cytogenetic study. The data also substantiate E. coracana subsp. africana as the progenitor of domesticated finger millet. The disparity between the cytogenetic and the molecular approaches is discussed in light of the problems associated with chromosome pairing and polyploidy. PMID:8608927

  3. ATM Deficiency Generating Genomic Instability Sensitizes Pancreatic Ductal Adenocarcinoma Cells to Therapy-Induced DNA Damage.

    PubMed

    Perkhofer, Lukas; Schmitt, Anna; Romero Carrasco, Maria Carolina; Ihle, Michaela; Hampp, Stephanie; Ruess, Dietrich Alexander; Hessmann, Elisabeth; Russell, Ronan; Lechel, André; Azoitei, Ninel; Lin, Qiong; Liebau, Stefan; Hohwieler, Meike; Bohnenberger, Hanibal; Lesina, Marina; Algül, Hana; Gieldon, Laura; Schröck, Evelin; Gaedcke, Jochen; Wagner, Martin; Wiesmüller, Lisa; Sipos, Bence; Seufferlein, Thomas; Reinhardt, Hans Christian; Frappart, Pierre-Olivier; Kleger, Alexander

    2017-10-15

    Pancreatic ductal adenocarcinomas (PDAC) harbor recurrent functional mutations of the master DNA damage response kinase ATM, which has been shown to accelerate tumorigenesis and epithelial-mesenchymal transition. To study how ATM deficiency affects genome integrity in this setting, we evaluated the molecular and functional effects of conditional Atm deletion in a mouse model of PDAC. ATM deficiency was associated with increased mitotic defects, recurrent genomic rearrangements, and deregulated DNA integrity checkpoints, reminiscent of human PDAC. We hypothesized that altered genome integrity might allow synthetic lethality-based options for targeted therapeutic intervention. Supporting this possibility, we found that the PARP inhibitor olaparib or ATR inhibitors reduced the viability of PDAC cells in vitro and in vivo associated with a genotype-selective increase in apoptosis. Overall, our results offered a preclinical mechanistic rationale for the use of PARP and ATR inhibitors to improve treatment of ATM-mutant PDAC. Cancer Res; 77(20); 5576-90. ©2017 AACR . ©2017 American Association for Cancer Research.

  4. A Sequence-Independent Strategy for Detection and Cloning of Circular DNA Virus Genomes by Using Multiply Primed Rolling-Circle Amplification

    PubMed Central

    Rector, Annabel; Tachezy, Ruth; Van Ranst, Marc

    2004-01-01

    The discovery of novel viruses has often been accomplished by using hybridization-based methods that necessitate the availability of a previously characterized virus genome probe or knowledge of the viral nucleotide sequence to construct consensus or degenerate PCR primers. In their natural replication cycle, certain viruses employ a rolling-circle mechanism to propagate their circular genomes, and multiply primed rolling-circle amplification (RCA) with φ29 DNA polymerase has recently been applied in the amplification of circular plasmid vectors used in cloning. We employed an isothermal RCA protocol that uses random hexamer primers to amplify the complete genomes of papillomaviruses without the need for prior knowledge of their DNA sequences. We optimized this RCA technique with extracted human papillomavirus type 16 (HPV-16) DNA from W12 cells, using a real-time quantitative PCR assay to determine amplification efficiency, and obtained a 2.4 × 104-fold increase in HPV-16 DNA concentration. We were able to clone the complete HPV-16 genome from this multiply primed RCA product. The optimized protocol was subsequently applied to a bovine fibropapillomatous wart tissue sample. Whereas no papillomavirus DNA could be detected by restriction enzyme digestion of the original sample, multiply primed RCA enabled us to obtain a sufficient amount of papillomavirus DNA for restriction enzyme analysis, cloning, and subsequent sequencing of a novel variant of bovine papillomavirus type 1. The multiply primed RCA method allows the discovery of previously unknown papillomaviruses, and possibly also other circular DNA viruses, without a priori sequence information. PMID:15113879

  5. Genome-wide Analysis Reveals Extensive Functional Interaction between DNA Replication Initiation and Transcription in the Genome of Trypanosoma brucei

    PubMed Central

    Tiengwe, Calvin; Marcello, Lucio; Farr, Helen; Dickens, Nicholas; Kelly, Steven; Swiderski, Michal; Vaughan, Diane; Gull, Keith; Barry, J. David; Bell, Stephen D.; McCulloch, Richard

    2012-01-01

    Summary Identification of replication initiation sites, termed origins, is a crucial step in understanding genome transmission in any organism. Transcription of the Trypanosoma brucei genome is highly unusual, with each chromosome comprising a few discrete transcription units. To understand how DNA replication occurs in the context of such organization, we have performed genome-wide mapping of the binding sites of the replication initiator ORC1/CDC6 and have identified replication origins, revealing that both localize to the boundaries of the transcription units. A remarkably small number of active origins is seen, whose spacing is greater than in any other eukaryote. We show that replication and transcription in T. brucei have a profound functional overlap, as reducing ORC1/CDC6 levels leads to genome-wide increases in mRNA levels arising from the boundaries of the transcription units. In addition, ORC1/CDC6 loss causes derepression of silent Variant Surface Glycoprotein genes, which are critical for host immune evasion. PMID:22840408

  6. Extreme-Scale De Novo Genome Assembly

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Georganas, Evangelos; Hofmeyr, Steven; Egan, Rob

    De novo whole genome assembly reconstructs genomic sequence from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in modern genomics. This work presents HipMER, a high-quality end-to-end de novo assembler designed for extreme scale analysis, via efficient parallelization of the Meraculous code. Genome assembly software has many components, each of which stresses different components of a computer system. This chapter explains the computational challenges involved in each step of the HipMer pipeline, the key distributed data structures, and communication costs in detail. We present performance results of assembling the human genome and themore » large hexaploid wheat genome on large supercomputers up to tens of thousands of cores.« less

  7. Genome-wide and locus-specific DNA hypomethylation in G9a deficient mouse embryonic stem cells.

    PubMed

    Ikegami, Kohta; Iwatani, Misa; Suzuki, Masako; Tachibana, Makoto; Shinkai, Yoichi; Tanaka, Satoshi; Greally, John M; Yagi, Shintaro; Hattori, Naka; Shiota, Kunio

    2007-01-01

    In the mammalian genome, numerous CpG-rich loci define tissue-dependent and differentially methylated regions (T-DMRs). Euchromatin from different cell types differs in terms of its tissue-specific DNA methylation profile as defined by these T-DMRs. G9a is a euchromatin-localized histone methyltransferase (HMT) and catalyzes methylation of histone H3 at lysines 9 and 27 (H3-K9 and -K27). To test whether HMT activity influences euchromatic cytosine methylation, we analyzed the DNA methylation status of approximately 2000 CpG-rich loci, which are predicted in silico, in G9a(-/-) embryonic stem cells by restriction landmark genomic scanning (RLGS). While the RLGS profile of wild-type cells contained about 1300 spots, 32 new spots indicating DNA demethylation were seen in the profile of G9a(-/-) cells. Virtual-image RLGS (Vi-RLGS) allowed us to identify the genomic source of ten of these spots. These were confirmed to be cytosine demethylated, not just at the Not I site detected by the RLGS but extending over several kilobase pairs in cis. Chromatin immunoprecipitation (ChIP) confirmed these loci to be targets of G9a, with decreased H3-K9 and/or -K27 dimethylation in the G9a(-/-) cells. These data indicate that G9a site-selectively contributes to DNA methylation.

  8. Isolation and characterization of 5S rDNA sequences in catfishes genome (Heptapteridae and Pseudopimelodidae): perspectives for rDNA studies in fish by C0t method.

    PubMed

    Gouveia, Juceli Gonzalez; Wolf, Ivan Rodrigo; de Moraes-Manécolo, Vivian Patrícia Oliveira; Bardella, Vanessa Belline; Ferracin, Lara Munique; Giuliano-Caetano, Lucia; da Rosa, Renata; Dias, Ana Lúcia

    2016-12-01

    Sequences of 5S ribosomal RNA (rRNA) are extensively used in fish cytogenomic studies, once they have a flexible organization at the chromosomal level, showing inter- and intra-specific variation in number and position in karyotypes. Sequences from the genome of Imparfinis schubarti (Heptapteridae) were isolated, aiming to understand the organization of 5S rDNA families in the fish genome. The isolation of 5S rDNA from the genome of I. schubarti was carried out by reassociation kinetics (C 0 t) and PCR amplification. The obtained sequences were cloned for the construction of a micro-library. The obtained clones were sequenced and hybridized in I. schubarti and Microglanis cottoides (Pseudopimelodidae) for chromosome mapping. An analysis of the sequence alignments with other fish groups was accomplished. Both methods were effective when using 5S rDNA for hybridization in I. schubarti genome. However, the C 0 t method enabled the use of a complete 5S rRNA gene, which was also successful in the hybridization of M. cottoides. Nevertheless, this gene was obtained only partially by PCR. The hybridization results and sequence analyses showed that intact 5S regions are more appropriate for the probe operation, due to conserved structure and motifs. This study contributes to a better understanding of the organization of multigene families in catfish's genomes.

  9. Development of NIST standard reference material 2373: Genomic DNA standards for HER2 measurements.

    PubMed

    He, Hua-Jun; Almeida, Jamie L; Lund, Steve P; Steffen, Carolyn R; Choquette, Steve; Cole, Kenneth D

    2016-06-01

    NIST standard reference material (SRM) 2373 was developed to improve the measurements of the HER2 gene amplification in DNA samples. SRM 2373 consists of genomic DNA extracted from five breast cancer cell lines with different amounts of amplification of the HER2 gene. The five components are derived from the human cell lines SK-BR-3, MDA-MB-231, MDA-MB-361, MDA-MB-453, and BT-474. The certified values are the ratios of the HER2 gene copy numbers to the copy numbers of selected reference genes DCK, EIF5B, RPS27A, and PMM1. The ratios were measured using quantitative polymerase chain reaction and digital PCR, methods that gave similar ratios. The five components of SRM 2373 have certified HER2 amplification ratios that range from 1.3 to 17.7. The stability and homogeneity of the reference materials were shown by repeated measurements over a period of several years. SRM 2373 is a well characterized genomic DNA reference material that can be used to improve the confidence of the measurements of HER2 gene copy number.

  10. Within-Genome Evolution of REPINs: a New Family of Miniature Mobile DNA in Bacteria

    PubMed Central

    Bertels, Frederic; Rainey, Paul B.

    2011-01-01

    Repetitive sequences are a conserved feature of many bacterial genomes. While first reported almost thirty years ago, and frequently exploited for genotyping purposes, little is known about their origin, maintenance, or processes affecting the dynamics of within-genome evolution. Here, beginning with analysis of the diversity and abundance of short oligonucleotide sequences in the genome of Pseudomonas fluorescens SBW25, we show that over-represented short sequences define three distinct groups (GI, GII, and GIII) of repetitive extragenic palindromic (REP) sequences. Patterns of REP distribution suggest that closely linked REP sequences form a functional replicative unit: REP doublets are over-represented, randomly distributed in extragenic space, and more highly conserved than singlets. In addition, doublets are organized as inverted repeats, which together with intervening spacer sequences are predicted to form hairpin structures in ssDNA or mRNA. We refer to these newly defined entities as REPINs (REP doublets forming hairpins) and identify short reads from population sequencing that reveal putative transposition intermediates. The proximal relationship between GI, GII, and GIII REPINs and specific REP-associated tyrosine transposases (RAYTs), combined with features of the putative transposition intermediate, suggests a mechanism for within-genome dissemination. Analysis of the distribution of REPs in a range of RAYT–containing bacterial genomes, including Escherichia coli K-12 and Nostoc punctiforme, show that REPINs are a widely distributed, but hitherto unrecognized, family of miniature non-autonomous mobile DNA. PMID:21698139

  11. Within-genome evolution of REPINs: a new family of miniature mobile DNA in bacteria.

    PubMed

    Bertels, Frederic; Rainey, Paul B

    2011-06-01

    Repetitive sequences are a conserved feature of many bacterial genomes. While first reported almost thirty years ago, and frequently exploited for genotyping purposes, little is known about their origin, maintenance, or processes affecting the dynamics of within-genome evolution. Here, beginning with analysis of the diversity and abundance of short oligonucleotide sequences in the genome of Pseudomonas fluorescens SBW25, we show that over-represented short sequences define three distinct groups (GI, GII, and GIII) of repetitive extragenic palindromic (REP) sequences. Patterns of REP distribution suggest that closely linked REP sequences form a functional replicative unit: REP doublets are over-represented, randomly distributed in extragenic space, and more highly conserved than singlets. In addition, doublets are organized as inverted repeats, which together with intervening spacer sequences are predicted to form hairpin structures in ssDNA or mRNA. We refer to these newly defined entities as REPINs (REP doublets forming hairpins) and identify short reads from population sequencing that reveal putative transposition intermediates. The proximal relationship between GI, GII, and GIII REPINs and specific REP-associated tyrosine transposases (RAYTs), combined with features of the putative transposition intermediate, suggests a mechanism for within-genome dissemination. Analysis of the distribution of REPs in a range of RAYT-containing bacterial genomes, including Escherichia coli K-12 and Nostoc punctiforme, show that REPINs are a widely distributed, but hitherto unrecognized, family of miniature non-autonomous mobile DNA.

  12. Design of a DNA panel for genomic studies in Russian cattle breeds

    USDA-ARS?s Scientific Manuscript database

    A panel of 96 DNA samples (Russian Cattle Genomic Diversity Panel 1.0 or RCGDP 1.0) characterizing the breadth of genetic diversity in popular Russian cattle breeds was designed. The panel contains from four to eight animals from each of 11 dairy and six dairy-meat and meat breeds. The main criterio...

  13. An Evaluation Framework for Lossy Compression of Genome Sequencing Quality Values.

    PubMed

    Alberti, Claudio; Daniels, Noah; Hernaez, Mikel; Voges, Jan; Goldfeder, Rachel L; Hernandez-Lopez, Ana A; Mattavelli, Marco; Berger, Bonnie

    2016-01-01

    This paper provides the specification and an initial validation of an evaluation framework for the comparison of lossy compressors of genome sequencing quality values. The goal is to define reference data, test sets, tools and metrics that shall be used to evaluate the impact of lossy compression of quality values on human genome variant calling. The functionality of the framework is validated referring to two state-of-the-art genomic compressors. This work has been spurred by the current activity within the ISO/IEC SC29/WG11 technical committee (a.k.a. MPEG), which is investigating the possibility of starting a standardization activity for genomic information representation.

  14. The Organization of Repetitive DNA in the Genomes of Amazonian Lizard Species in the Family Teiidae.

    PubMed

    Carvalho, Natalia D M; Pinheiro, Vanessa S S; Carmo, Edson J; Goll, Leonardo G; Schneider, Carlos H; Gross, Maria C

    2015-01-01

    Repetitive DNA is the largest fraction of the eukaryote genome and comprises tandem and dispersed sequences. It presents variations in relation to its composition, number of copies, distribution, dynamics, and genome organization, and participates in the evolutionary diversification of different vertebrate species. Repetitive sequences are usually located in the heterochromatin of centromeric and telomeric regions of chromosomes, contributing to chromosomal structures. Therefore, the aim of this study was to physically map repetitive DNA sequences (5S rDNA, telomeric sequences, tropomyosin gene 1, and retroelements Rex1 and SINE) of mitotic chromosomes of Amazonian species of teiids (Ameiva ameiva, Cnemidophorus sp. 1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin) to understand their genome organization and karyotype evolution. The mapping of repetitive sequences revealed a distinct pattern in Cnemidophorus sp. 1, whereas the other species showed all sequences interspersed in the heterochromatic region. Physical mapping of the tropomyosin 1 gene was performed for the first time in lizards and showed that in addition to being functional, this gene has a structural function similar to the mapped repetitive elements as it is located preferentially in centromeric regions and termini of chromosomes. © 2016 S. Karger AG, Basel.

  15. Complementation between polymerase- and exonuclease-deficient mitochondrial DNA polymerase mutants in genomically engineered flies

    PubMed Central

    Bratic, Ana; Kauppila, Timo E. S.; Macao, Bertil; Grönke, Sebastian; Siibak, Triinu; Stewart, James B.; Baggio, Francesca; Dols, Jacqueline; Partridge, Linda; Falkenberg, Maria; Wredenberg, Anna; Larsson, Nils-Göran

    2015-01-01

    Replication errors are the main cause of mitochondrial DNA (mtDNA) mutations and a compelling approach to decrease mutation levels would therefore be to increase the fidelity of the catalytic subunit (POLγA) of the mtDNA polymerase. Here we genomically engineer the tamas locus, encoding fly POLγA, and introduce alleles expressing exonuclease- (exo−) and polymerase-deficient (pol−) POLγA versions. The exo− mutant leads to accumulation of point mutations and linear deletions of mtDNA, whereas pol− mutants cause mtDNA depletion. The mutant tamas alleles are developmentally lethal but can complement each other in trans resulting in viable flies with clonally expanded mtDNA mutations. Reconstitution of human mtDNA replication in vitro confirms that replication is a highly dynamic process where POLγA goes on and off the template to allow complementation during proofreading and elongation. The created fly models are valuable tools to study germ line transmission of mtDNA and the pathophysiology of POLγA mutation disease. PMID:26554610

  16. Genomic mapping of single-stranded DNA in hydroxyurea-challenged yeasts identifies origins of replication.

    PubMed

    Feng, Wenyi; Collingwood, David; Boeck, Max E; Fox, Lindsay A; Alvino, Gina M; Fangman, Walton L; Raghuraman, Mosur K; Brewer, Bonita J

    2006-02-01

    During DNA replication one or both strands transiently become single stranded: first at the sites where initiation of DNA synthesis occurs (known as origins of replication) and subsequently on the lagging strands of replication forks as discontinuous Okazaki fragments are generated. We report a genome-wide analysis of single-stranded DNA (ssDNA) formation in the presence of hydroxyurea during DNA replication in wild-type and checkpoint-deficient rad53 Saccharomyces cerevisiae cells. In wild-type cells, ssDNA was first observed at a subset of replication origins and later 'migrated' bi-directionally, suggesting that ssDNA formation is associated with continuously moving replication forks. In rad53 cells, ssDNA was observed at virtually every known origin, but remained there over time, suggesting that replication forks stall. Telomeric regions seemed to be particularly sensitive to the loss of Rad53 checkpoint function. Replication origins in Schizosaccharomyces pombe were also mapped using our method.

  17. Rapid and sensitive detection of foodborne pathogenic bacteria (Staphylococcus aureus) using an electrochemical DNA genomic biosensor and its application in fresh beef.

    PubMed

    Abdalhai, Mandour H; Fernandes, António Maximiano; Bashari, Mohand; Ji, Jian; He, Qian; Sun, Xiulan

    2014-12-31

    Rapid early detection of food contamination is the main key in food safety and quality control. Biosensors are emerging as a vibrant area of research, and the use of DNA biosensor recognition detectors is relatively new. In this study a genomic DNA biosensor system with a fixing and capture probe was modified by a sulfhydryl and amino group, respectively, as complementary with target DNA. After immobilization and hybridization, the following sandwich structure fixing DNA-target DNA-capture DNA-PbS NPs was formed to detect pathogenic bacteria (Staphylococuus aureus EF529607.1) by using GCE modified with (multiwalled carbon nanotubes-chitosan-bismuth) to increase the sensitivity of the electrode. The modification procedure was characterized by cyclic voltammetry and electrochemical impedance spectroscopy. The sandwich structure was dissolved in 1 M nitric acid to become accessible to the electrode, and the PbS NPs was measured in solution by differential pulse voltammetry (DPV). The results showed that the detection limit of the DNA sensor was 3.17 × 10(-14) M S. aureus using PbS NPs, whereas the result for beef samples was 1.23 ng/mL. Thus, according to the experimental results presented, the DNA biosensor exhibited high sensitivity and rapid response, and it will be useful for the food matrix.

  18. Horizontal transfer of DNA from the mitochondrial to the plastid genome and its subsequent evolution in milkweeds (Apocynaceae)

    Treesearch

    Shannon C.K. Straub; Richard C. Cronn; Christopher Edwards; Mark Fishbein; Aaron Liston

    2013-01-01

    Horizontal gene transfer (HGT) of DNA from the plastid to the nuclear and mitochondrial genomes of higher plants is a common phenomenon; however, plastid genomes (plastomes) are highly conserved and have generally been regarded as impervious to HGT. We sequenced the 158 kb plastome and the 690 kb mitochondrial genome of common milkweed (Asclepias syriaca [Apocynaceae...

  19. A Domain of Herpes Simplex Virus pUL33 Required To Release Monomeric Viral Genomes from Cleaved Concatemeric DNA.

    PubMed

    Yang, Kui; Dang, Xiaoqun; Baines, Joel D

    2017-10-15

    Monomeric herpesvirus DNA is cleaved from concatemers and inserted into preformed capsids through the actions of the viral terminase. The terminase of herpes simplex virus (HSV) is composed of three subunits encoded by U L 15, U L 28, and U L 33. The U L 33-encoded protein (pU L 33) interacts with pU L 28, but its precise role in the DNA cleavage and packaging reaction is unclear. To investigate the function of pU L 33, we generated a panel of recombinant viruses with either deletions or substitutions in the most conserved regions of U L 33 using a bacterial artificial chromosome system. Deletion of 11 amino acids (residues 50 to 60 or residues 110 to 120) precluded viral replication, whereas the truncation of the last 10 amino acids from the pU L 33 C terminus did not affect viral replication or the interaction of pU L 33 with pU L 28. Mutations that replaced the lysine at codon 110 and the arginine at codon 111 with alanine codons failed to replicate, and the pU L 33 mutant interacted with pU L 28 less efficiently. Interestingly, genomic termini of the large (L) and small (S) components were detected readily in cells infected with these mutants, indicating that concatemeric DNA was cleaved efficiently. However, the release of monomeric genomes as assessed by pulsed-field gel electrophoresis was greatly diminished, and DNA-containing capsids were not observed. These results suggest that pU L 33 is necessary for one of the two viral DNA cleavage events required to release individual genomes from concatemeric viral DNA. IMPORTANCE This paper shows a role for pU L 33 in one of the two DNA cleavage events required to release monomeric genomes from concatemeric viral DNA. This is the first time that such a phenotype has been observed and is the first identification of a function of this protein relevant to DNA packaging other than its interaction with other terminase components. Copyright © 2017 Yang et al.

  20. Genome-wide specificity of DNA binding, gene regulation, and chromatin remodeling by TALE- and CRISPR/Cas9-based transcriptional activators

    PubMed Central

    Polstein, Lauren R.; Perez-Pinera, Pablo; Kocak, D. Dewran; Vockley, Christopher M.; Bledsoe, Peggy; Song, Lingyun; Safi, Alexias; Crawford, Gregory E.; Reddy, Timothy E.; Gersbach, Charles A.

    2015-01-01

    Genome engineering technologies based on the CRISPR/Cas9 and TALE systems are enabling new approaches in science and biotechnology. However, the specificity of these tools in complex genomes and the role of chromatin structure in determining DNA binding are not well understood. We analyzed the genome-wide effects of TALE- and CRISPR-based transcriptional activators in human cells using ChIP-seq to assess DNA-binding specificity and RNA-seq to measure the specificity of perturbing the transcriptome. Additionally, DNase-seq was used to assess genome-wide chromatin remodeling that occurs as a result of their action. Our results show that these transcription factors are highly specific in both DNA binding and gene regulation and are able to open targeted regions of closed chromatin independent of gene activation. Collectively, these results underscore the potential for these technologies to make precise changes to gene expression for gene and cell therapies or fundamental studies of gene function. PMID:26025803

  1. Characterization of 107 Genomic DNA Reference Materials for CYP2D6, CYP2C19, CYP2C9, VKORC1, and UGT1A1

    PubMed Central

    Pratt, Victoria M.; Zehnbauer, Barbara; Wilson, Jean Amos; Baak, Ruth; Babic, Nikolina; Bettinotti, Maria; Buller, Arlene; Butz, Ken; Campbell, Matthew; Civalier, Chris; El-Badry, Abdalla; Farkas, Daniel H.; Lyon, Elaine; Mandal, Saptarshi; McKinney, Jason; Muralidharan, Kasinathan; Noll, LeAnne; Sander, Tara; Shabbeer, Junaid; Smith, Chingying; Telatar, Milhan; Toji, Lorraine; Vairavan, Anand; Vance, Carlos; Weck, Karen E.; Wu, Alan H.B.; Yeo, Kiang-Teck J.; Zeller, Markus; Kalman, Lisa

    2010-01-01

    Pharmacogenetic testing is becoming more common; however, very few quality control and other reference materials that cover alleles commonly included in such assays are currently available. To address these needs, the Centers for Disease Control and Prevention's Genetic Testing Reference Material Coordination Program, in collaboration with members of the pharmacogenetic testing community and the Coriell Cell Repositories, have characterized a panel of 107 genomic DNA reference materials for five loci (CYP2D6, CYP2C19, CYP2C9, VKORC1, and UGT1A1) that are commonly included in pharmacogenetic testing panels and proficiency testing surveys. Genomic DNA from publicly available cell lines was sent to volunteer laboratories for genotyping. Each sample was tested in three to six laboratories using a variety of commercially available or laboratory-developed platforms. The results were consistent among laboratories, with differences in allele assignments largely related to the manufacturer's assay design and variable nomenclature, especially for CYP2D6. The alleles included in the assay platforms varied, but most were identified in the set of 107 DNA samples. Nine additional pharmacogenetic loci (CYP4F2, EPHX1, ABCB1, HLAB, KIF6, CYP3A4, CYP3A5, TPMT, and DPD) were also tested. These samples are publicly available from Coriell and will be useful for quality assurance, proficiency testing, test development, and research. PMID:20889555

  2. Acinetobacter phage genome is similar to Sphinx 2.36, the circular DNA copurified with TSE infected particles.

    PubMed

    Longkumer, Toshisangba; Kamireddy, Swetha; Muthyala, Venkateswar Reddy; Akbarpasha, Shaikh; Pitchika, Gopi Krishna; Kodetham, Gopinath; Ayaluru, Murali; Siddavattam, Dayananda

    2013-01-01

    While analyzing plasmids of Acinetobacter sp. DS002 we have detected a circular DNA molecule pTS236, which upon further investigation is identified as the genome of a phage. The phage genome has shown sequence similarity to the recently discovered Sphinx 2.36 DNA sequence co-purified with the Transmissible Spongiform Encephalopathy (TSE) particles isolated from infected brain samples collected from diverse geographical regions. As in Sphinx 2.36, the phage genome also codes for three proteins. One of them codes for RepA and is shown to be involved in replication of pTS236 through rolling circle (RC) mode. The other two translationally coupled ORFs, orf106 and orf96, code for coat proteins of the phage. Although an orf96 homologue was not previously reported in Sphinx 2.36, a closer examination of DNA sequence of Sphinx 2.36 revealed its presence downstream of orf106 homologue. TEM images and infection assays revealed existence of phage AbDs1 in Acinetobacter sp. DS002.

  3. Acinetobacter phage genome is similar to Sphinx 2.36, the circular DNA copurified with TSE infected particles

    PubMed Central

    Longkumer, Toshisangba; Kamireddy, Swetha; Muthyala, Venkateswar Reddy; Akbarpasha, Shaikh; Pitchika, Gopi Krishna; Kodetham, Gopinath; Ayaluru, Murali; Siddavattam, Dayananda

    2013-01-01

    While analyzing plasmids of Acinetobacter sp. DS002 we have detected a circular DNA molecule pTS236, which upon further investigation is identified as the genome of a phage. The phage genome has shown sequence similarity to the recently discovered Sphinx 2.36 DNA sequence co-purified with the Transmissible Spongiform Encephalopathy (TSE) particles isolated from infected brain samples collected from diverse geographical regions. As in Sphinx 2.36, the phage genome also codes for three proteins. One of them codes for RepA and is shown to be involved in replication of pTS236 through rolling circle (RC) mode. The other two translationally coupled ORFs, orf106 and orf96, code for coat proteins of the phage. Although an orf96 homologue was not previously reported in Sphinx 2.36, a closer examination of DNA sequence of Sphinx 2.36 revealed its presence downstream of orf106 homologue. TEM images and infection assays revealed existence of phage AbDs1 in Acinetobacter sp. DS002. PMID:23867905

  4. Whole-exome/genome sequencing and genomics.

    PubMed

    Grody, Wayne W; Thompson, Barry H; Hudgins, Louanne

    2013-12-01

    As medical genetics has progressed from a descriptive entity to one focused on the functional relationship between genes and clinical disorders, emphasis has been placed on genomics. Genomics, a subelement of genetics, is the study of the genome, the sum total of all the genes of an organism. The human genome, which is contained in the 23 pairs of nuclear chromosomes and in the mitochondrial DNA of each cell, comprises >6 billion nucleotides of genetic code. There are some 23,000 protein-coding genes, a surprisingly small fraction of the total genetic material, with the remainder composed of noncoding DNA, regulatory sequences, and introns. The Human Genome Project, launched in 1990, produced a draft of the genome in 2001 and then a finished sequence in 2003, on the 50th anniversary of the initial publication of Watson and Crick's paper on the double-helical structure of DNA. Since then, this mass of genetic information has been translated at an ever-increasing pace into useable knowledge applicable to clinical medicine. The recent advent of massively parallel DNA sequencing (also known as shotgun, high-throughput, and next-generation sequencing) has brought whole-genome analysis into the clinic for the first time, and most of the current applications are directed at children with congenital conditions that are undiagnosable by using standard genetic tests for single-gene disorders. Thus, pediatricians must become familiar with this technology, what it can and cannot offer, and its technical and ethical challenges. Here, we address the concepts of human genomic analysis and its clinical applicability for primary care providers.

  5. Real-time imaging of specific genomic loci in eukaryotic cells using the ANCHOR DNA labelling system.

    PubMed

    Germier, Thomas; Sylvain, Audibert; Silvia, Kocanova; David, Lane; Kerstin, Bystricky

    2018-06-01

    Spatio-temporal organization of the cell nucleus adapts to and regulates genomic processes. Microscopy approaches that enable direct monitoring of specific chromatin sites in single cells and in real time are needed to better understand the dynamics involved. In this chapter, we describe the principle and development of ANCHOR, a novel tool for DNA labelling in eukaryotic cells. Protocols for use of ANCHOR to visualize a single genomic locus in eukaryotic cells are presented. We describe an approach for live cell imaging of a DNA locus during the entire cell cycle in human breast cancer cells. Copyright © 2018 Elsevier Inc. All rights reserved.

  6. A Glance at Microsatellite Motifs from 454 Sequencing Reads of Watermelon Genomic DNA

    USDA-ARS?s Scientific Manuscript database

    A single 454 (Life Sciences Sequencing Technology) run of Charleston Gray watermelon (Citrullus lanatus var. lanatus) genomic DNA was performed and sequence data were assembled. A large scale identification of simple sequence repeat (SSR) was performed and SSR sequence data were used for the develo...

  7. MethylMix 2.0: an R package for identifying DNA methylation genes. | Office of Cancer Genomics

    Cancer.gov

    DNA methylation is an important mechanism regulating gene transcription, and its role in carcinogenesis has been extensively studied. Hyper and hypomethylation of genes is a major mechanism of gene expression deregulation in a wide range of diseases. At the same time, high-throughput DNA methylation assays have been developed generating vast amounts of genome wide DNA methylation measurements. We developed MethylMix, an algorithm implemented in R to identify disease specific hyper and hypomethylated genes.

  8. Effects of As2O3 on DNA methylation, genomic instability, and LTR retrotransposon polymorphism in Zea mays.

    PubMed

    Erturk, Filiz Aygun; Aydin, Murat; Sigmaz, Burcu; Taspinar, M Sinan; Arslan, Esra; Agar, Guleray; Yagci, Semra

    2015-12-01

    Arsenic is a well-known toxic substance on the living organisms. However, limited efforts have been made to study its DNA methylation, genomic instability, and long terminal repeat (LTR) retrotransposon polymorphism causing properties in different crops. In the present study, effects of As2O3 (arsenic trioxide) on LTR retrotransposon polymorphism and DNA methylation as well as DNA damage in Zea mays seedlings were investigated. The results showed that all of arsenic doses caused a decreasing genomic template stability (GTS) and an increasing Random Amplified Polymorphic DNAs (RAPDs) profile changes (DNA damage). In addition, increasing DNA methylation and LTR retrotransposon polymorphism characterized a model to explain the epigenetically changes in the gene expression were also found. The results of this experiment have clearly shown that arsenic has epigenetic effect as well as its genotoxic effect. Especially, the increasing of polymorphism of some LTR retrotransposon under arsenic stress may be a part of the defense system against the stress.

  9. Using Partial Genomic Fosmid Libraries for Sequencing CompleteOrganellar Genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McNeal, Joel R.; Leebens-Mack, James H.; Arumuganathan, K.

    2005-08-26

    Organellar genome sequences provide numerous phylogenetic markers and yield insight into organellar function and molecular evolution. These genomes are much smaller in size than their nuclear counterparts; thus, their complete sequencing is much less expensive than total nuclear genome sequencing, making broader phylogenetic sampling feasible. However, for some organisms it is challenging to isolate plastid DNA for sequencing using standard methods. To overcome these difficulties, we constructed partial genomic libraries from total DNA preparations of two heterotrophic and two autotrophic angiosperm species using fosmid vectors. We then used macroarray screening to isolate clones containing large fragments of plastid DNA. Amore » minimum tiling path of clones comprising the entire genome sequence of each plastid was selected, and these clones were shotgun-sequenced and assembled into complete genomes. Although this method worked well for both heterotrophic and autotrophic plants, nuclear genome size had a dramatic effect on the proportion of screened clones containing plastid DNA and, consequently, the overall number of clones that must be screened to ensure full plastid genome coverage. This technique makes it possible to determine complete plastid genome sequences for organisms that defy other available organellar genome sequencing methods, especially those for which limited amounts of tissue are available.« less

  10. Role of transposon-derived small RNAs in the interplay between genomes and parasitic DNA in rice.

    PubMed

    Nosaka, Misuzu; Itoh, Jun-Ichi; Nagato, Yasuo; Ono, Akemi; Ishiwata, Aiko; Sato, Yutaka

    2012-09-01

    RNA silencing is a defense system against "genomic parasites" such as transposable elements (TE), which are potentially harmful to host genomes. In plants, transcripts from TEs induce production of double-stranded RNAs (dsRNAs) and are processed into small RNAs (small interfering RNAs, siRNAs) that suppress TEs by RNA-directed DNA methylation. Thus, the majority of TEs are epigenetically silenced. On the other hand, most of the eukaryotic genome is composed of TEs and their remnants, suggesting that TEs have evolved countermeasures against host-mediated silencing. Under some circumstances, TEs can become active and increase in copy number. Knowledge is accumulating on the mechanisms of TE silencing by the host; however, the mechanisms by which TEs counteract silencing are poorly understood. Here, we show that a class of TEs in rice produces a microRNA (miRNA) to suppress host silencing. Members of the microRNA820 (miR820) gene family are located within CACTA DNA transposons in rice and target a de novo DNA methyltransferase gene, OsDRM2, one of the components of epigenetic silencing. We confirmed that miR820 negatively regulates the expression of OsDRM2. In addition, we found that expression levels of various TEs are increased quite sensitively in response to decreased OsDRM2 expression and DNA methylation at TE loci. Furthermore, we found that the nucleotide sequence of miR820 and its recognition site within the target gene in some Oryza species have co-evolved to maintain their base-pairing ability. The co-evolution of these sequences provides evidence for the functionality of this regulation. Our results demonstrate how parasitic elements in the genome escape the host's defense machinery. Furthermore, our analysis of the regulation of OsDRM2 by miR820 sheds light on the action of transposon-derived small RNAs, not only as a defense mechanism for host genomes but also as a regulator of interactions between hosts and their parasitic elements.

  11. Surface-enhanced Raman spectroscopy of genomic DNA from in vitro grown tomato (Lycopersicon esculentum Mill.) cultivars before and after plant cryopreservation.

    PubMed

    Muntean, Cristina M; Leopold, Nicolae; Tripon, Carmen; Coste, Ana; Halmagyi, Adela

    2015-06-05

    In this work the surface-enhanced Raman scattering (SERS) spectra of five genomic DNAs from non-cryopreserved control tomato plants (Lycopersicon esculentum Mill. cultivars Siriana, Darsirius, Kristin, Pontica and Capriciu) respectively, have been analyzed in the wavenumber range 400-1800 cm(-1). Structural changes induced in genomic DNAs upon cryopreservation were discussed in detail for four of the above mentioned tomato cultivars. The surface-enhanced Raman vibrational modes for each of these cases, spectroscopic band assignments and structural interpretations of genomic DNAs are reported. We have found, that DNA isolated from Siriana cultivar leaf tissues suffers the weakest structural changes upon cryogenic storage of tomato shoot apices. On the contrary, genomic DNA extracted from Pontica cultivar is the most responsive system to cryopreservation process. Particularly, both C2'-endo-anti and C3'-endo-anti conformations have been detected. As a general observation, the wavenumber range 1511-1652 cm(-1), being due to dA, dG and dT residues seems to be influenced by cryopreservation process. These changes could reflect unstacking of DNA bases. However, not significant structural changes of genomic DNAs from Siriana, Darsirius and Kristin have been found upon cryopreservation process of tomato cultivars. Based on this work, specific plant DNA-ligand interactions or accurate local structure of DNA in the proximity of a metallic surface, might be further investigated using surface-enhanced Raman spectroscopy. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. Genome-wide inference of transcription factor-DNA binding specificity in cell regeneration using a combination strategy.

    PubMed

    Wang, Xiaofeng; Zhang, Aiqun; Ren, Weizheng; Chen, Caiyu; Dong, Jiahong

    2012-11-01

    The cell growth, development, and regeneration of tissue and organ are associated with a large number of gene regulation events, which are mediated in part by transcription factors (TFs) binding to cis-regulatory elements involved in the genome. Predicting the binding affinity and inferring the binding specificity of TF-DNA interactions at the genomic level would be fundamentally helpful for our understanding of the molecular mechanism and biological implication underlying sequence-specific TF-DNA recognition. In this study, we report the development of a combination method to characterize the interaction behavior of a 11-mer oligonucleotide segment and its mutations with the Gcn4p protein, a homodimeric, basic leucine zipper TF, and to predict the binding affinity and specificity of potential Gcn4p binders in the genome-wide scale. In this procedure, a position-mutated energy matrix is created based on molecular modeling analysis of native and mutated Gcn4p-DNA complex structures to describe the position-independent interaction energy profile of Gcn4p with different nucleotide types at each position of the oligonucleotide, and the energy terms extracted from the matrix and their interactives are then correlated with experimentally measured affinities of 19268 distinct oligonucleotides using statistical modeling methodology. Subsequently, the best one of built regression models is successfully applied to screen those of potential high-affinity Gcn4p binders from the complete genome. The findings arising from this study are briefly listed below: (i) The 11 positions of oligonucleotides are highly interactive and non-additive in contribution to Gcn4p-DNA binding affinity; (ii) Indirect conformational effects upon nucleotide mutations as well as associated subtle changes in interfacial atomic contacts, but not the direct nonbonded interactions, are primarily responsible for the sequence-specific recognition; (iii) The intrinsic synergistic effects among the sequence

  13. Genome-wide DNA methylation modified by soy phytoestrogens: role for epigenetic therapeutics in prostate cancer?

    PubMed

    Karsli-Ceppioglu, Seher; Ngollo, Marjolaine; Adjakly, Mawussi; Dagdemir, Aslihan; Judes, Gaëlle; Lebert, André; Boiteux, Jean-Paul; Penault-LLorca, Frédérique; Bignon, Yves-Jean; Guy, Laurent; Bernard-Gallon, Dominique

    2015-04-01

    In prostate cancer, DNA methylation is significantly associated with tumor initiation, progression, and metastasis. Previous studies have suggested that soy phytoestrogens might regulate DNA methylation at individual candidate gene loci and that they play a crucial role as potential therapeutic agents for prostate cancer. The purpose of our study was to examine the modulation effects of phytoestrogens on a genome-wide scale in regards to DNA methylation in prostate cancer. Prostate cancer cell lines DU-145 and LNCaP were treated with 40 μM of genistein and 110 μM of daidzein. DNMT inhibitor 5-azacytidine (2 μM) and the methylating agent budesonide (2 μM) were used to compare their demethylation/methylation effects with phytoestrogens. The regulatory effects of phytoestrogens on DNA methylation were analyzed by using a methyl-DNA immunoprecipitation method coupled with Human DNA Methylation Microarrays (MeDIP-chip). We observed that the methylation profiles of 58 genes were altered by genistein and daidzein treatments in DU-145 and LNCaP prostate cancer cells. In addition, the methylation frequencies of the MAD1L1, TRAF7, KDM4B, and hTERT genes were remarkably modified by genistein treatment. Our results suggest that the modulation effects of phytoestrogens on DNA methylation essentially lead to inhibition of cell growth and induction of apoptosis. Genome-wide methylation profiling reported here suggests that epigenetic regulation mechanisms and, by extension, epigenetics-driven novel therapeutic candidates warrant further consideration in future "omics" studies of prostate cancer.

  14. Evolutionarily diverse determinants of meiotic DNA break and recombination landscapes across the genome

    PubMed Central

    Fowler, Kyle R.; Sasaki, Mariko; Milman, Neta

    2014-01-01

    Fission yeast Rec12 (Spo11 homolog) initiates meiotic recombination by forming developmentally programmed DNA double-strand breaks (DSBs). DSB distributions influence patterns of heredity and genome evolution, but the basis of the highly nonrandom choice of Rec12 cleavage sites is poorly understood, largely because available maps are of relatively low resolution and sensitivity. Here, we determined DSBs genome-wide at near-nucleotide resolution by sequencing the oligonucleotides attached to Rec12 following DNA cleavage. The single oligonucleotide size class allowed us to deeply sample all break events. We find strong evidence across the genome for differential DSB repair accounting for crossover invariance (constant cM/kb in spite of DSB hotspots). Surprisingly, about half of all crossovers occur in regions where DSBs occur at low frequency and are widely dispersed in location from cell to cell. These previously undetected, low-level DSBs thus play an outsized and crucial role in meiosis. We further find that the influence of underlying nucleotide sequence and chromosomal architecture differs in multiple ways from that in budding yeast. DSBs are not strongly restricted to nucleosome-depleted regions, as they are in budding yeast, but are nevertheless spatially influenced by chromatin structure. Our analyses demonstrate that evolutionarily fluid factors contribute to crossover initiation and regulation. PMID:25024163

  15. Mapping of Variable DNA Methylation Across Multiple Cell Types Defines a Dynamic Regulatory Landscape of the Human Genome.

    PubMed

    Gu, Junchen; Stevens, Michael; Xing, Xiaoyun; Li, Daofeng; Zhang, Bo; Payton, Jacqueline E; Oltz, Eugene M; Jarvis, James N; Jiang, Kaiyu; Cicero, Theodore; Costello, Joseph F; Wang, Ting

    2016-04-07

    DNA methylation is an important epigenetic modification involved in many biological processes and diseases. Many studies have mapped DNA methylation changes associated with embryogenesis, cell differentiation, and cancer at a genome-wide scale. Our understanding of genome-wide DNA methylation changes in a developmental or disease-related context has been steadily growing. However, the investigation of which CpGs are variably methylated in different normal cell or tissue types is still limited. Here, we present an in-depth analysis of 54 single-CpG-resolution DNA methylomes of normal human cell types by integrating high-throughput sequencing-based methylation data. We found that the ratio of methylated to unmethylated CpGs is relatively constant regardless of cell type. However, which CpGs made up the unmethylated complement was cell-type specific. We categorized the 26,000,000 human autosomal CpGs based on their methylation levels across multiple cell types to identify variably methylated CpGs and found that 22.6% exhibited variable DNA methylation. These variably methylated CpGs formed 660,000 variably methylated regions (VMRs), encompassing 11% of the genome. By integrating a multitude of genomic data, we found that VMRs enrich for histone modifications indicative of enhancers, suggesting their role as regulatory elements marking cell type specificity. VMRs enriched for transcription factor binding sites in a tissue-dependent manner. Importantly, they enriched for GWAS variants, suggesting that VMRs could potentially be implicated in disease and complex traits. Taken together, our results highlight the link between CpG methylation variation, genetic variation, and disease risk for many human cell types. Copyright © 2016 Gu et al.

  16. MASQOT: a method for cDNA microarray spot quality control

    PubMed Central

    Bylesjö, Max; Eriksson, Daniel; Sjödin, Andreas; Sjöström, Michael; Jansson, Stefan; Antti, Henrik; Trygg, Johan

    2005-01-01

    Background cDNA microarray technology has emerged as a major player in the parallel detection of biomolecules, but still suffers from fundamental technical problems. Identifying and removing unreliable data is crucial to prevent the risk of receiving illusive analysis results. Visual assessment of spot quality is still a common procedure, despite the time-consuming work of manually inspecting spots in the range of hundreds of thousands or more. Results A novel methodology for cDNA microarray spot quality control is outlined. Multivariate discriminant analysis was used to assess spot quality based on existing and novel descriptors. The presented methodology displays high reproducibility and was found superior in identifying unreliable data compared to other evaluated methodologies. Conclusion The proposed methodology for cDNA microarray spot quality control generates non-discrete values of spot quality which can be utilized as weights in subsequent analysis procedures as well as to discard spots of undesired quality using the suggested threshold values. The MASQOT approach provides a consistent assessment of spot quality and can be considered an alternative to the labor-intensive manual quality assessment process. PMID:16223442

  17. Correction of the lack of commutability between plasmid DNA and genomic DNA for quantification of genetically modified organisms using pBSTopas as a model.

    PubMed

    Zhang, Li; Wu, Yuhua; Wu, Gang; Cao, Yinglong; Lu, Changming

    2014-10-01

    Plasmid calibrators are increasingly applied for polymerase chain reaction (PCR) analysis of genetically modified organisms (GMOs). To evaluate the commutability between plasmid DNA (pDNA) and genomic DNA (gDNA) as calibrators, a plasmid molecule, pBSTopas, was constructed, harboring a Topas 19/2 event-specific sequence and a partial sequence of the rapeseed reference gene CruA. Assays of the pDNA showed similar limits of detection (five copies for Topas 19/2 and CruA) and quantification (40 copies for Topas 19/2 and 20 for CruA) as those for the gDNA. Comparisons of plasmid and genomic standard curves indicated that the slopes, intercepts, and PCR efficiency for pBSTopas were significantly different from CRM Topas 19/2 gDNA for quantitative analysis of GMOs. Three correction methods were used to calibrate the quantitative analysis of control samples using pDNA as calibrators: model a, or coefficient value a (Cva); model b, or coefficient value b (Cvb); and the novel model c or coefficient formula (Cf). Cva and Cvb gave similar estimated values for the control samples, and the quantitative bias of the low concentration sample exceeded the acceptable range within ±25% in two of the four repeats. Using Cfs to normalize the Ct values of test samples, the estimated values were very close to the reference values (bias -13.27 to 13.05%). In the validation of control samples, model c was more appropriate than Cva or Cvb. The application of Cf allowed pBSTopas to substitute for Topas 19/2 gDNA as a calibrator to accurately quantify the GMO.

  18. Dynamic changes of genome-wide DNA methylation during soybean seed development

    USDA-ARS?s Scientific Manuscript database

    Seed development is programmed by expression of many genes in plants. Seed maturation is an important developmental process to soybean seed quality and yield. DNA methylation is a major epigenetic modification regulating gene expression. However, little is known about the dynamic nature of DNA me...

  19. Excited-state solvation and proton transfer dynamics of DAPI in biomimetics and genomic DNA.

    PubMed

    Banerjee, Debapriya; Pal, Samir Kumar

    2008-08-14

    The fluorescent probe DAPI (4',6-diamidino-2-phenylindole) is an efficient DNA binder. Studies on the DAPI-DNA complexes show that the probe exhibits a wide variety of interactions of different strengths and specificities with DNA. Recently the probe has been used to report the environmental dynamics of a DNA minor groove. However, the use of the probe as a solvation reporter in restricted environments is not straightforward. This is due to the presence of two competing relaxation processes (intramolecular proton transfer and solvation stabilization) in the excited state, which can lead to erroneous interpretation of the observed excited-state dynamics. In this study, the possibility of using DAPI to unambiguously report the environmental dynamics in restricted environments including DNA is explored. The dynamics of the probe is studied in bulk solvents, biomimetics like micelles and reverse micelles, and genomic DNA using steady-state and picosecond-resolved fluorescence spectroscopies.

  20. Partial structure of the phylloxin gene from the giant monkey frog, Phyllomedusa bicolor: parallel cloning of precursor cDNA and genomic DNA from lyophilized skin secretion.

    PubMed

    Chen, Tianbao; Gagliardo, Ron; Walker, Brian; Zhou, Mei; Shaw, Chris

    2005-12-01

    Phylloxin is a novel prototype antimicrobial peptide from the skin of Phyllomedusa bicolor. Here, we describe parallel identification and sequencing of phylloxin precursor transcript (mRNA) and partial gene structure (genomic DNA) from the same sample of lyophilized skin secretion using our recently-described cloning technique. The open-reading frame of the phylloxin precursor was identical in nucleotide sequence to that previously reported and alignment with the nucleotide sequence derived from genomic DNA indicated the presence of a 175 bp intron located in a near identical position to that found in the dermaseptins. The highly-conserved structural organization of skin secretion peptide genes in P. bicolor can thus be extended to include that encoding phylloxin (plx). These data further reinforce our assertion that application of the described methodology can provide robust genomic/transcriptomic/peptidomic data without the need for specimen sacrifice.