Comparison and quantitative verification of mapping algorithms for whole genome bisulfite sequencing
USDA-ARS?s Scientific Manuscript database
Coupling bisulfite conversion with next-generation sequencing (Bisulfite-seq) enables genome-wide measurement of DNA methylation, but poses unique challenges for mapping. However, despite a proliferation of Bisulfite-seq mapping tools, no systematic comparison of their genomic coverage and quantitat...
GPU-BSM: A GPU-Based Tool to Map Bisulfite-Treated Reads
Manconi, Andrea; Orro, Alessandro; Manca, Emanuele; Armano, Giuliano; Milanesi, Luciano
2014-01-01
Cytosine DNA methylation is an epigenetic mark implicated in several biological processes. Bisulfite treatment of DNA is acknowledged as the gold standard technique to study methylation. This technique introduces changes in the genomic DNA by converting cytosines to uracils while 5-methylcytosines remain nonreactive. During PCR amplification 5-methylcytosines are amplified as cytosine, whereas uracils and thymines as thymine. To detect the methylation levels, reads treated with the bisulfite must be aligned against a reference genome. Mapping these reads to a reference genome represents a significant computational challenge mainly due to the increased search space and the loss of information introduced by the treatment. To deal with this computational challenge we devised GPU-BSM, a tool based on modern Graphics Processing Units. Graphics Processing Units are hardware accelerators that are increasingly being used successfully to accelerate general-purpose scientific applications. GPU-BSM is a tool able to map bisulfite-treated reads from whole genome bisulfite sequencing and reduced representation bisulfite sequencing, and to estimate methylation levels, with the goal of detecting methylation. Due to the massive parallelization obtained by exploiting graphics cards, GPU-BSM aligns bisulfite-treated reads faster than other cutting-edge solutions, while outperforming most of them in terms of unique mapped reads. PMID:24842718
MethPrimer: designing primers for methylation PCRs.
Li, Long-Cheng; Dahiya, Rajvir
2002-11-01
DNA methylation is an epigenetic mechanism of gene regulation. Bisulfite- conversion-based PCR methods, such as bisulfite sequencing PCR (BSP) and methylation specific PCR (MSP), remain the most commonly used techniques for methylation mapping. Existing primer design programs developed for standard PCR cannot handle primer design for bisulfite-conversion-based PCRs due to changes in DNA sequence context caused by bisulfite treatment and many special constraints both on the primers and the region to be amplified for such experiments. Therefore, the present study was designed to develop a program for such applications. MethPrimer, based on Primer 3, is a program for designing PCR primers for methylation mapping. It first takes a DNA sequence as its input and searches the sequence for potential CpG islands. Primers are then picked around the predicted CpG islands or around regions specified by users. MethPrimer can design primers for BSP and MSP. Results of primer selection are delivered through a web browser in text and in graphic view.
Pardo, Carolina E; Carr, Ian M; Hoffman, Christopher J; Darst, Russell P; Markham, Alexander F; Bonthron, David T; Kladde, Michael P
2011-01-01
Bisulfite sequencing is a widely-used technique for examining cytosine DNA methylation at nucleotide resolution along single DNA strands. Probing with cytosine DNA methyltransferases followed by bisulfite sequencing (MAPit) is an effective technique for mapping protein-DNA interactions. Here, MAPit methylation footprinting with M.CviPI, a GC methyltransferase we previously cloned and characterized, was used to probe hMLH1 chromatin in HCT116 and RKO colorectal cancer cells. Because M.CviPI-probed samples contain both CG and GC methylation, we developed a versatile, visually-intuitive program, called MethylViewer, for evaluating the bisulfite sequencing results. Uniquely, MethylViewer can simultaneously query cytosine methylation status in bisulfite-converted sequences at as many as four different user-defined motifs, e.g. CG, GC, etc., including motifs with degenerate bases. Data can also be exported for statistical analysis and as publication-quality images. Analysis of hMLH1 MAPit data with MethylViewer showed that endogenous CG methylation and accessible GC sites were both mapped on single molecules at high resolution. Disruption of positioned nucleosomes on single molecules of the PHO5 promoter was detected in budding yeast using M.CviPII, increasing the number of enzymes available for probing protein-DNA interactions. MethylViewer provides an integrated solution for primer design and rapid, accurate and detailed analysis of bisulfite sequencing or MAPit datasets from virtually any biological or biochemical system.
Technical Considerations for Reduced Representation Bisulfite Sequencing with Multiplexed Libraries
Chatterjee, Aniruddha; Rodger, Euan J.; Stockwell, Peter A.; Weeks, Robert J.; Morison, Ian M.
2012-01-01
Reduced representation bisulfite sequencing (RRBS), which couples bisulfite conversion and next generation sequencing, is an innovative method that specifically enriches genomic regions with a high density of potential methylation sites and enables investigation of DNA methylation at single-nucleotide resolution. Recent advances in the Illumina DNA sample preparation protocol and sequencing technology have vastly improved sequencing throughput capacity. Although the new Illumina technology is now widely used, the unique challenges associated with multiplexed RRBS libraries on this platform have not been previously described. We have made modifications to the RRBS library preparation protocol to sequence multiplexed libraries on a single flow cell lane of the Illumina HiSeq 2000. Furthermore, our analysis incorporates a bioinformatics pipeline specifically designed to process bisulfite-converted sequencing reads and evaluate the output and quality of the sequencing data generated from the multiplexed libraries. We obtained an average of 42 million paired-end reads per sample for each flow-cell lane, with a high unique mapping efficiency to the reference human genome. Here we provide a roadmap of modifications, strategies, and trouble shooting approaches we implemented to optimize sequencing of multiplexed libraries on an a RRBS background. PMID:23193365
Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers
Pabinger, Stephan; Ernst, Karina; Pulverer, Walter; Kallmeyer, Rainer; Valdes, Ana M.; Metrustry, Sarah; Katic, Denis; Nuzzo, Angelo; Kriegner, Albert; Vierlinger, Klemens; Weinhaeusel, Andreas
2016-01-01
Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM). Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage. TABSAT is freely available under a GNU General Public License version 3.0 (GPLv3) at https://github.com/tadkeys/tabsat/ and http://demo.platomics.com/. PMID:27467908
Clark, Stephen J; Smallwood, Sébastien A; Lee, Heather J; Krueger, Felix; Reik, Wolf; Kelsey, Gavin
2017-03-01
DNA methylation (DNAme) is an important epigenetic mark in diverse species. Our current understanding of DNAme is based on measurements from bulk cell samples, which obscures intercellular differences and prevents analyses of rare cell types. Thus, the ability to measure DNAme in single cells has the potential to make important contributions to the understanding of several key biological processes, such as embryonic development, disease progression and aging. We have recently reported a method for generating genome-wide DNAme maps from single cells, using single-cell bisulfite sequencing (scBS-seq), allowing the quantitative measurement of DNAme at up to 50% of CpG dinucleotides throughout the mouse genome. Here we present a detailed protocol for scBS-seq that includes our most recent developments to optimize recovery of CpGs, mapping efficiency and success rate; reduce hands-on time; and increase sample throughput with the option of using an automated liquid handler. We provide step-by-step instructions for each stage of the method, comprising cell lysis and bisulfite (BS) conversion, preamplification and adaptor tagging, library amplification, sequencing and, lastly, alignment and methylation calling. An individual with relevant molecular biology expertise can complete library preparation within 3 d. Subsequent computational steps require 1-3 d for someone with bioinformatics expertise.
BiQ Analyzer HT: locus-specific analysis of DNA methylation by high-throughput bisulfite sequencing
Lutsik, Pavlo; Feuerbach, Lars; Arand, Julia; Lengauer, Thomas; Walter, Jörn; Bock, Christoph
2011-01-01
Bisulfite sequencing is a widely used method for measuring DNA methylation in eukaryotic genomes. The assay provides single-base pair resolution and, given sufficient sequencing depth, its quantitative accuracy is excellent. High-throughput sequencing of bisulfite-converted DNA can be applied either genome wide or targeted to a defined set of genomic loci (e.g. using locus-specific PCR primers or DNA capture probes). Here, we describe BiQ Analyzer HT (http://biq-analyzer-ht.bioinf.mpi-inf.mpg.de/), a user-friendly software tool that supports locus-specific analysis and visualization of high-throughput bisulfite sequencing data. The software facilitates the shift from time-consuming clonal bisulfite sequencing to the more quantitative and cost-efficient use of high-throughput sequencing for studying locus-specific DNA methylation patterns. In addition, it is useful for locus-specific visualization of genome-wide bisulfite sequencing data. PMID:21565797
Owa, Chie; Poulin, Matthew; Yan, Liying; Shioda, Toshi
2018-01-01
The existence of cytosine methylation in mammalian mitochondrial DNA (mtDNA) is a controversial subject. Because detection of DNA methylation depends on resistance of 5'-modified cytosines to bisulfite-catalyzed conversion to uracil, examined parameters that affect technical adequacy of mtDNA methylation analysis. Negative control amplicons (NCAs) devoid of cytosine methylation were amplified to cover the entire human or mouse mtDNA by long-range PCR. When the pyrosequencing template amplicons were gel-purified after bisulfite conversion, bisulfite pyrosequencing of NCAs did not detect significant levels of bisulfite-resistant cytosines (brCs) at ND1 (7 CpG sites) or CYTB (8 CpG sites) genes (CI95 = 0%-0.94%); without gel-purification, significant false-positive brCs were detected from NCAs (CI95 = 4.2%-6.8%). Bisulfite pyrosequencing of highly purified, linearized mtDNA isolated from human iPS cells or mouse liver detected significant brCs (~30%) in human ND1 gene when the sequencing primer was not selective in bisulfite-converted and unconverted templates. However, repeated experiments using a sequencing primer selective in bisulfite-converted templates almost completely (< 0.8%) suppressed brC detection, supporting the false-positive nature of brCs detected using the non-selective primer. Bisulfite-seq deep sequencing of linearized, gel-purified human mtDNA detected 9.4%-14.8% brCs for 9 CpG sites in ND1 gene. However, because all these brCs were associated with adjacent non-CpG brCs showing the same degrees of bisulfite resistance, DNA methylation in this mtDNA-encoded gene was not confirmed. Without linearization, data generated by bisulfite pyrosequencing or deep sequencing of purified mtDNA templates did not pass the quality control criteria. Shotgun bisulfite sequencing of human mtDNA detected extremely low levels of CpG methylation (<0.65%) over non-CpG methylation (<0.55%). Taken together, our study demonstrates that adequacy of mtDNA methylation analysis using methods dependent on bisulfite conversion needs to be established for each experiment, taking effects of incomplete bisulfite conversion and template impurity or topology into consideration.
Guo, Shicheng; Diep, Dinh; Plongthongkum, Nongluk; Fung, Ho-Lim; Zhang, Kang; Zhang, Kun
2017-04-01
Adjacent CpG sites in mammalian genomes can be co-methylated owing to the processivity of methyltransferases or demethylases, yet discordant methylation patterns have also been observed, which are related to stochastic or uncoordinated molecular processes. We focused on a systematic search and investigation of regions in the full human genome that show highly coordinated methylation. We defined 147,888 blocks of tightly coupled CpG sites, called methylation haplotype blocks, after analysis of 61 whole-genome bisulfite sequencing data sets and validation with 101 reduced-representation bisulfite sequencing data sets and 637 methylation array data sets. Using a metric called methylation haplotype load, we performed tissue-specific methylation analysis at the block level. Subsets of informative blocks were further identified for deconvolution of heterogeneous samples. Finally, using methylation haplotypes we demonstrated quantitative estimation of tumor load and tissue-of-origin mapping in the circulating cell-free DNA of 59 patients with lung or colorectal cancer.
Mapping the zebrafish brain methylome using reduced representation bisulfite sequencing
Chatterjee, Aniruddha; Ozaki, Yuichi; Stockwell, Peter A; Horsfield, Julia A; Morison, Ian M; Nakagawa, Shinichi
2013-01-01
Reduced representation bisulfite sequencing (RRBS) has been used to profile DNA methylation patterns in mammalian genomes such as human, mouse and rat. The methylome of the zebrafish, an important animal model, has not yet been characterized at base-pair resolution using RRBS. Therefore, we evaluated the technique of RRBS in this model organism by generating four single-nucleotide resolution DNA methylomes of adult zebrafish brain. We performed several simulations to show the distribution of fragments and enrichment of CpGs in different in silico reduced representation genomes of zebrafish. Four RRBS brain libraries generated 98 million sequenced reads and had higher frequencies of multiple mapping than equivalent human RRBS libraries. The zebrafish methylome indicates there is higher global DNA methylation in the zebrafish genome compared with its equivalent human methylome. This observation was confirmed by RRBS of zebrafish liver. High coverage CpG dinucleotides are enriched in CpG island shores more than in the CpG island core. We found that 45% of the mapped CpGs reside in gene bodies, and 7% in gene promoters. This analysis provides a roadmap for generating reproducible base-pair level methylomes for zebrafish using RRBS and our results provide the first evidence that RRBS is a suitable technique for global methylation analysis in zebrafish. PMID:23975027
Stroma Based Prognosticators Incorporating Differences between African and European Americans
2017-10-01
amenable to bisulfite sequencing of more than a few genes. Exploiting the recent three-fold reduction in the cost of sequencing per read , we developed oligo...cards. The ability of the HiSeq 4000 to obtain about three times as many reads as the HiSeq2500, at the same price, means we can stay on track, though...capture, and sequencing (Table 2). We obtain tens of millions of mapped deduplicated reads per sample, while using only 5% of a sequencing lane per sample
Ludgate, Jackie L; Wright, James; Stockwell, Peter A; Morison, Ian M; Eccles, Michael R; Chatterjee, Aniruddha
2017-08-31
Formalin fixed paraffin embedded (FFPE) tumor samples are a major source of DNA from patients in cancer research. However, FFPE is a challenging material to work with due to macromolecular fragmentation and nucleic acid crosslinking. FFPE tissue particularly possesses challenges for methylation analysis and for preparing sequencing-based libraries relying on bisulfite conversion. Successful bisulfite conversion is a key requirement for sequencing-based methylation analysis. Here we describe a complete and streamlined workflow for preparing next generation sequencing libraries for methylation analysis from FFPE tissues. This includes, counting cells from FFPE blocks and extracting DNA from FFPE slides, testing bisulfite conversion efficiency with a polymerase chain reaction (PCR) based test, preparing reduced representation bisulfite sequencing libraries and massively parallel sequencing. The main features and advantages of this protocol are: An optimized method for extracting good quality DNA from FFPE tissues. An efficient bisulfite conversion and next generation sequencing library preparation protocol that uses 50 ng DNA from FFPE tissue. Incorporation of a PCR-based test to assess bisulfite conversion efficiency prior to sequencing. We provide a complete workflow and an integrated protocol for performing DNA methylation analysis at the genome-scale and we believe this will facilitate clinical epigenetic research that involves the use of FFPE tissue.
A pooling-based approach to mapping genetic variants associated with DNA methylation
Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.; McEwen, Lisa M.; Kobor, Michael S.; Fraser, Hunter B.
2015-01-01
DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a truly genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. We found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data. PMID:25910490
A pooling-based approach to mapping genetic variants associated with DNA methylation
Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.; ...
2015-04-24
DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a trulymore » genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. Here we found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data.« less
Bicycle: a bioinformatics pipeline to analyze bisulfite sequencing data.
Graña, Osvaldo; López-Fernández, Hugo; Fdez-Riverola, Florentino; González Pisano, David; Glez-Peña, Daniel
2018-04-15
High-throughput sequencing of bisulfite-converted DNA is a technique used to measure DNA methylation levels. Although a considerable number of computational pipelines have been developed to analyze such data, none of them tackles all the peculiarities of the analysis together, revealing limitations that can force the user to manually perform additional steps needed for a complete processing of the data. This article presents bicycle, an integrated, flexible analysis pipeline for bisulfite sequencing data. Bicycle analyzes whole genome bisulfite sequencing data, targeted bisulfite sequencing data and hydroxymethylation data. To show how bicycle overtakes other available pipelines, we compared them on a defined number of features that are summarized in a table. We also tested bicycle with both simulated and real datasets, to show its level of performance, and compared it to different state-of-the-art methylation analysis pipelines. Bicycle is publicly available under GNU LGPL v3.0 license at http://www.sing-group.org/bicycle. Users can also download a customized Ubuntu LiveCD including bicycle and other bisulfite sequencing data pipelines compared here. In addition, a docker image with bicycle and its dependencies, which allows a straightforward use of bicycle in any platform (e.g. Linux, OS X or Windows), is also available. ograna@cnio.es or dgpena@uvigo.es. Supplementary data are available at Bioinformatics online.
Olova, Nelly; Krueger, Felix; Andrews, Simon; Oxley, David; Berrens, Rebecca V; Branco, Miguel R; Reik, Wolf
2018-03-15
Whole-genome bisulfite sequencing (WGBS) is becoming an increasingly accessible technique, used widely for both fundamental and disease-oriented research. Library preparation methods benefit from a variety of available kits, polymerases and bisulfite conversion protocols. Although some steps in the procedure, such as PCR amplification, are known to introduce biases, a systematic evaluation of biases in WGBS strategies is missing. We perform a comparative analysis of several commonly used pre- and post-bisulfite WGBS library preparation protocols for their performance and quality of sequencing outputs. Our results show that bisulfite conversion per se is the main trigger of pronounced sequencing biases, and PCR amplification builds on these underlying artefacts. The majority of standard library preparation methods yield a significantly biased sequence output and overestimate global methylation. Importantly, both absolute and relative methylation levels at specific genomic regions vary substantially between methods, with clear implications for DNA methylation studies. We show that amplification-free library preparation is the least biased approach for WGBS. In protocols with amplification, the choice of bisulfite conversion protocol or polymerase can significantly minimize artefacts. To aid with the quality assessment of existing WGBS datasets, we have integrated a bias diagnostic tool in the Bismark package and offer several approaches for consideration during the preparation and analysis of WGBS datasets.
USDA-ARS?s Scientific Manuscript database
Analysis of DNA methylation patterns relies increasingly on sequencing-based profiling methods. The four most frequently used sequencing-based technologies are the bisulfite-based methods MethylC-seq and reduced representation bisulfite sequencing (RRBS), and the enrichment-based techniques methylat...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.
DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a trulymore » genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. Here we found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data.« less
Leontiou, Chrysanthia A.; Hadjidaniel, Michael D.; Mina, Petros; Antoniou, Pavlos; Ioannides, Marios; Patsalis, Philippos C.
2015-01-01
Introduction Epigenetic alterations, including DNA methylation, play an important role in the regulation of gene expression. Several methods exist for evaluating DNA methylation, but bisulfite sequencing remains the gold standard by which base-pair resolution of CpG methylation is achieved. The challenge of the method is that the desired outcome (conversion of unmethylated cytosines) positively correlates with the undesired side effects (DNA degradation and inappropriate conversion), thus several commercial kits try to adjust a balance between the two. The aim of this study was to compare the performance of four bisulfite conversion kits [Premium Bisulfite kit (Diagenode), EpiTect Bisulfite kit (Qiagen), MethylEdge Bisulfite Conversion System (Promega) and BisulFlash DNA Modification kit (Epigentek)] regarding conversion efficiency, DNA degradation and conversion specificity. Methods Performance was tested by combining fully methylated and fully unmethylated λ-DNA controls in a series of spikes by means of Sanger sequencing (0%, 25%, 50% and 100% methylated spikes) and Next-Generation Sequencing (0%, 3%, 5%, 7%, 10%, 25%, 50% and 100% methylated spikes). We also studied the methylation status of two of our previously published differentially methylated regions (DMRs) at base resolution by using spikes of chorionic villus sample in whole blood. Results The kits studied showed different but comparable results regarding DNA degradation, conversion efficiency and conversion specificity. However, the best performance was observed with the MethylEdge Bisulfite Conversion System (Promega) followed by the Premium Bisulfite kit (Diagenode). The DMRs, EP6 and EP10, were confirmed to be hypermethylated in the CVS and hypomethylated in whole blood. Conclusion Our findings indicate that the MethylEdge Bisulfite Conversion System (Promega) was shown to have the best performance among the kits. In addition, the methylation level of two of our DMRs, EP6 and EP10, was confirmed. Finally, we showed that bisulfite amplicon sequencing is a suitable approach for methylation analysis of targeted regions. PMID:26247357
Leontiou, Chrysanthia A; Hadjidaniel, Michael D; Mina, Petros; Antoniou, Pavlos; Ioannides, Marios; Patsalis, Philippos C
2015-01-01
Epigenetic alterations, including DNA methylation, play an important role in the regulation of gene expression. Several methods exist for evaluating DNA methylation, but bisulfite sequencing remains the gold standard by which base-pair resolution of CpG methylation is achieved. The challenge of the method is that the desired outcome (conversion of unmethylated cytosines) positively correlates with the undesired side effects (DNA degradation and inappropriate conversion), thus several commercial kits try to adjust a balance between the two. The aim of this study was to compare the performance of four bisulfite conversion kits [Premium Bisulfite kit (Diagenode), EpiTect Bisulfite kit (Qiagen), MethylEdge Bisulfite Conversion System (Promega) and BisulFlash DNA Modification kit (Epigentek)] regarding conversion efficiency, DNA degradation and conversion specificity. Performance was tested by combining fully methylated and fully unmethylated λ-DNA controls in a series of spikes by means of Sanger sequencing (0%, 25%, 50% and 100% methylated spikes) and Next-Generation Sequencing (0%, 3%, 5%, 7%, 10%, 25%, 50% and 100% methylated spikes). We also studied the methylation status of two of our previously published differentially methylated regions (DMRs) at base resolution by using spikes of chorionic villus sample in whole blood. The kits studied showed different but comparable results regarding DNA degradation, conversion efficiency and conversion specificity. However, the best performance was observed with the MethylEdge Bisulfite Conversion System (Promega) followed by the Premium Bisulfite kit (Diagenode). The DMRs, EP6 and EP10, were confirmed to be hypermethylated in the CVS and hypomethylated in whole blood. Our findings indicate that the MethylEdge Bisulfite Conversion System (Promega) was shown to have the best performance among the kits. In addition, the methylation level of two of our DMRs, EP6 and EP10, was confirmed. Finally, we showed that bisulfite amplicon sequencing is a suitable approach for methylation analysis of targeted regions.
BS-virus-finder: virus integration calling using bisulfite sequencing data.
Gao, Shengjie; Hu, Xuesong; Xu, Fengping; Gao, Changduo; Xiong, Kai; Zhao, Xiao; Chen, Haixiao; Zhao, Shancen; Wang, Mengyao; Fu, Dongke; Zhao, Xiaohui; Bai, Jie; Mao, Likai; Li, Bo; Wu, Song; Wang, Jian; Li, Shengbin; Yang, Huangming; Bolund, Lars; Pedersen, Christian N S
2018-01-01
DNA methylation plays a key role in the regulation of gene expression and carcinogenesis. Bisulfite sequencing studies mainly focus on calling single nucleotide polymorphism, different methylation region, and find allele-specific DNA methylation. Until now, only a few software tools have focused on virus integration using bisulfite sequencing data. We have developed a new and easy-to-use software tool, named BS-virus-finder (BSVF, RRID:SCR_015727), to detect viral integration breakpoints in whole human genomes. The tool is hosted at https://github.com/BGI-SZ/BSVF. BS-virus-finder demonstrates high sensitivity and specificity. It is useful in epigenetic studies and to reveal the relationship between viral integration and DNA methylation. BS-virus-finder is the first software tool to detect virus integration loci by using bisulfite sequencing data. © The Authors 2017. Published by Oxford University Press.
Detection of DNA Methylation by Whole-Genome Bisulfite Sequencing.
Li, Qing; Hermanson, Peter J; Springer, Nathan M
2018-01-01
DNA methylation plays an important role in the regulation of the expression of transposons and genes. Various methods have been developed to assay DNA methylation levels. Bisulfite sequencing is considered to be the "gold standard" for single-base resolution measurement of DNA methylation levels. Coupled with next-generation sequencing, whole-genome bisulfite sequencing (WGBS) allows DNA methylation to be evaluated at a genome-wide scale. Here, we described a protocol for WGBS in plant species with large genomes. This protocol has been successfully applied to assay genome-wide DNA methylation levels in maize and barley. This protocol has also been successfully coupled with sequence capture technology to assay DNA methylation levels in a targeted set of genomic regions.
Leakey, Tatiana I; Zielinski, Jerzy; Siegfried, Rachel N; Siegel, Eric R; Fan, Chun-Yang; Cooney, Craig A
2008-06-01
DNA methylation at cytosines is a widely studied epigenetic modification. Methylation is commonly detected using bisulfite modification of DNA followed by PCR and additional techniques such as restriction digestion or sequencing. These additional techniques are either laborious, require specialized equipment, or are not quantitative. Here we describe a simple algorithm that yields quantitative results from analysis of conventional four-dye-trace sequencing. We call this method Mquant and we compare it with the established laboratory method of combined bisulfite restriction assay (COBRA). This analysis of sequencing electropherograms provides a simple, easily applied method to quantify DNA methylation at specific CpG sites.
Identification of differentially methylated sites with weak methylation effect
USDA-ARS?s Scientific Manuscript database
DNA methylation is an epigenetic alteration crucial for regulating stress responses. Identifying large-scale DNA methylation at single nucleotide resolution is made possible by whole genome bisulfite sequencing. An essential task following the generation of bisulfite sequencing data is to detect dif...
Giehr, Pascal; Walter, Jörn
2018-01-01
The accurate and quantitative detection of 5-methylcytosine is of great importance in the field of epigenetics. The method of choice is usually bisulfite sequencing because of the high resolution and the possibility to combine it with next generation sequencing. Nevertheless, also this method has its limitations. Following the bisulfite treatment DNA strands are no longer complementary such that in a subsequent PCR amplification the DNA methylation patterns information of only one of the two DNA strand is preserved. Several years ago Hairpin Bisulfite sequencing was developed as a method to obtain the pattern information on complementary DNA strands. The method requires fragmentation (usually by enzymatic cleavage) of genomic DNA followed by a covalent linking of both DNA strands through ligation of a short DNA hairpin oligonucleotide to both strands. The ligated covalently linked dsDNA products are then subjected to a conventional bisulfite treatment during which all unmodified cytosines are converted to uracils. During the treatment the DNA is denatured forming noncomplementary ssDNA circles. These circles serve as a template for a locus specific PCR to amplify chromosomal patterns of the region of interest. As a result one ends up with a linearized product, which contains the methylation information of both complementary DNA strands.
Su, Chang; Wang, Chao; He, Lin; Yang, Chuanping; Wang, Yucheng
2014-01-01
DNA methylation plays a critical role in the regulation of gene expression. Most studies of DNA methylation have been performed in herbaceous plants, and little is known about the methylation patterns in tree genomes. In the present study, we generated a map of methylated cytosines at single base pair resolution for Betula platyphylla (white birch) by bisulfite sequencing combined with transcriptomics to analyze DNA methylation and its effects on gene expression. We obtained a detailed view of the function of DNA methylation sequence composition and distribution in the genome of B. platyphylla. There are 34,460 genes in the whole genome of birch, and 31,297 genes are methylated. Conservatively, we estimated that 14.29% of genomic cytosines are methylcytosines in birch. Among the methylation sites, the CHH context accounts for 48.86%, and is the largest proportion. Combined transcriptome and methylation analysis showed that the genes with moderate methylation levels had higher expression levels than genes with high and low methylation. In addition, methylated genes are highly enriched for the GO subcategories of binding activities, catalytic activities, cellular processes, response to stimulus and cell death, suggesting that methylation mediates these pathways in birch trees. PMID:25514241
Smith, Rick W A; Monroe, Cara; Bolnick, Deborah A
2015-01-01
While cytosine methylation has been widely studied in extant populations, relatively few studies have analyzed methylation in ancient DNA. Most existing studies of epigenetic marks in ancient DNA have inferred patterns of methylation in highly degraded samples using post-mortem damage to cytosines as a proxy for cytosine methylation levels. However, this approach limits the inference of methylation compared with direct bisulfite sequencing, the current gold standard for analyzing cytosine methylation at single nucleotide resolution. In this study, we used direct bisulfite sequencing to assess cytosine methylation in ancient DNA from the skeletal remains of 30 Native Americans ranging in age from approximately 230 to 4500 years before present. Unmethylated cytosines were converted to uracils by treatment with sodium bisulfite, bisulfite products of a CpG-rich retrotransposon were pyrosequenced, and C-to-T ratios were quantified for a single CpG position. We found that cytosine methylation is readily recoverable from most samples, given adequate preservation of endogenous nuclear DNA. In addition, our results indicate that the precision of cytosine methylation estimates is inversely correlated with aDNA preservation, such that samples of low DNA concentration show higher variability in measures of percent methylation than samples of high DNA concentration. In particular, samples in this study with a DNA concentration above 0.015 ng/μL generated the most consistent measures of cytosine methylation. This study presents evidence of cytosine methylation in a large collection of ancient human remains, and indicates that it is possible to analyze epigenetic patterns in ancient populations using direct bisulfite sequencing approaches.
Performances of Different Fragment Sizes for Reduced Representation Bisulfite Sequencing in Pigs.
Yuan, Xiao-Long; Zhang, Zhe; Pan, Rong-Yang; Gao, Ning; Deng, Xi; Li, Bin; Zhang, Hao; Sangild, Per Torp; Li, Jia-Qi
2017-01-01
Reduced representation bisulfite sequencing (RRBS) has been widely used to profile genome-scale DNA methylation in mammalian genomes. However, the applications and technical performances of RRBS with different fragment sizes have not been systematically reported in pigs, which serve as one of the important biomedical models for humans. The aims of this study were to evaluate capacities of RRBS libraries with different fragment sizes to characterize the porcine genome. We found that the Msp I-digested segments between 40 and 220 bp harbored a high distribution peak at 74 bp, which were highly overlapped with the repetitive elements and might reduce the unique mapping alignment. The RRBS library of 110-220 bp fragment size had the highest unique mapping alignment and the lowest multiple alignment. The cost-effectiveness of the 40-110 bp, 110-220 bp and 40-220 bp fragment sizes might decrease when the dataset size was more than 70, 50 and 110 million reads for these three fragment sizes, respectively. Given a 50-million dataset size, the average sequencing depth of the detected CpG sites in the 110-220 bp fragment size appeared to be deeper than in the 40-110 bp and 40-220 bp fragment sizes, and these detected CpG sties differently located in gene- and CpG island-related regions. In this study, our results demonstrated that selections of fragment sizes could affect the numbers and sequencing depth of detected CpG sites as well as the cost-efficiency. No single solution of RRBS is optimal in all circumstances for investigating genome-scale DNA methylation. This work provides the useful knowledge on designing and executing RRBS for investigating the genome-wide DNA methylation in tissues from pigs.
Lu, Jennifer; Ru, Kelin; Candiloro, Ida; Dobrovic, Alexander; Korbie, Darren; Trau, Matt
2017-03-22
Multiplex bisulfite-PCR sequencing is a convenient and scalable method for the quantitative determination of the methylation state of target DNA regions. A challenge of this application is the presence of CpGs in the same region where primers are being placed. A common solution to the presence of CpGs within a primer-binding region is to substitute a base degeneracy at the cytosine position. However, the efficacy of different substitutions and the extent to which bias towards methylated or unmethylated templates may occur has never been evaluated in bisulfite multiplex sequencing applications. In response, we examined the performance of four different primer substitutions at the cytosine position of CpG's contained within the PCR primers. In this study, deoxyinosine-, 5-nitroindole-, mixed-base primers and primers with an abasic site were evaluated across a series of methylated controls. Primers that contained mixed- or deoxyinosine- base modifications performed most robustly. Mixed-base primers were further selected to determine the conditions that induce bias towards methylated templates. This identified an optimized set of conditions where the methylated state of bisulfite DNA templates can be accurately assessed using mixed-base primers, and expands the scope of bisulfite resequencing assays when working with challenging templates.
Guo, Hongshan; Zhu, Ping; Guo, Fan; Li, Xianlong; Wu, Xinglong; Fan, Xiaoying; Wen, Lu; Tang, Fuchou
2015-05-01
The heterogeneity of DNA methylation within a population of cells necessitates DNA methylome profiling at single-cell resolution. Recently, we developed a single-cell reduced-representation bisulfite sequencing (scRRBS) technique in which we modified the original RRBS method by integrating all the experimental steps before PCR amplification into a single-tube reaction. These modifications enable scRRBS to provide digitized methylation information on ∼1 million CpG sites within an individual diploid mouse or human cell at single-base resolution. Compared with the single-cell bisulfite sequencing (scBS) technique, scRRBS covers fewer CpG sites, but it provides better coverage for CpG islands (CGIs), which are likely to be the most informative elements for DNA methylation. The entire procedure takes ∼3 weeks, and it requires strong molecular biology skills.
Schmidt, Martin; Van Bel, Michiel; Woloszynska, Magdalena; Slabbinck, Bram; Martens, Cindy; De Block, Marc; Coppens, Frederik; Van Lijsebettens, Mieke
2017-07-06
Cytosine methylation in plant genomes is important for the regulation of gene transcription and transposon activity. Genome-wide methylomes are studied upon mutation of the DNA methyltransferases, adaptation to environmental stresses or during development. However, from basic biology to breeding programs, there is a need to monitor multiple samples to determine transgenerational methylation inheritance or differential cytosine methylation. Methylome data obtained by sodium hydrogen sulfite (bisulfite)-conversion and next-generation sequencing (NGS) provide genome-wide information on cytosine methylation. However, a profiling method that detects cytosine methylation state dispersed over the genome would allow high-throughput analysis of multiple plant samples with distinct epigenetic signatures. We use specific restriction endonucleases to enrich for cytosine coverage in a bisulfite and NGS-based profiling method, which was compared to whole-genome bisulfite sequencing of the same plant material. We established an effective methylome profiling method in plants, termed plant-reduced representation bisulfite sequencing (plant-RRBS), using optimized double restriction endonuclease digestion, fragment end repair, adapter ligation, followed by bisulfite conversion, PCR amplification and NGS. We report a performant laboratory protocol and a straightforward bioinformatics data analysis pipeline for plant-RRBS, applicable for any reference-sequenced plant species. As a proof of concept, methylome profiling was performed using an Oryza sativa ssp. indica pure breeding line and a derived epigenetically altered line (epiline). Plant-RRBS detects methylation levels at tens of millions of cytosine positions deduced from bisulfite conversion in multiple samples. To evaluate the method, the coverage of cytosine positions, the intra-line similarity and the differential cytosine methylation levels between the pure breeding line and the epiline were determined. Plant-RRBS reproducibly covers commonly up to one fourth of the cytosine positions in the rice genome when using MspI-DpnII within a group of five biological replicates of a line. The method predominantly detects cytosine methylation in putative promoter regions and not-annotated regions in rice. Plant-RRBS offers high-throughput and broad, genome-dispersed methylation detection by effective read number generation obtained from reproducibly covered genome fractions using optimized endonuclease combinations, facilitating comparative analyses of multi-sample studies for cytosine methylation and transgenerational stability in experimental material and plant breeding populations.
Sun, Zhifu; Cunningham, Julie; Slager, Susan; Kocher, Jean-Pierre
2015-01-01
Bisulfite treatment-based methylation microarray (mainly Illumina 450K Infinium array) and next-generation sequencing (reduced representation bisulfite sequencing, Agilent SureSelect Human Methyl-Seq, NimbleGen SeqCap Epi CpGiant or whole-genome bisulfite sequencing) are commonly used for base resolution DNA methylome research. Although multiple tools and methods have been developed and used for the data preprocessing and analysis, confusions remains for these platforms including how and whether the 450k array should be normalized; which platform should be used to better fit researchers’ needs; and which statistical models would be more appropriate for differential methylation analysis. This review presents the commonly used platforms and compares the pros and cons of each in methylome profiling. We then discuss approaches to study design, data normalization, bias correction and model selection for differentially methylated individual CpGs and regions. PMID:26366945
Crampton, Mollee; Sripathi, Venkateswara R; Hossain, Khwaja; Kalavacharla, Venu
2016-01-01
Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays similar methylation patterns as other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation, however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were highest in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar ("Sierra") using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation.
Crampton, Mollee; Sripathi, Venkateswara R.; Hossain, Khwaja; Kalavacharla, Venu
2016-01-01
Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays similar methylation patterns as other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation, however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were highest in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar (“Sierra”) using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation. PMID:27199997
Dikow, Nicola; Nygren, Anders Oh; Schouten, Jan P; Hartmann, Carolin; Krämer, Nikola; Janssen, Bart; Zschocke, Johannes
2007-06-01
Standard methods used for genomic methylation analysis allow the detection of complete absence of either methylated or non-methylated alleles but are usually unable to detect changes in the proportion of methylated and unmethylated alleles. We compare two methods for quantitative methylation analysis, using the chromosome 15q11-q13 imprinted region as model. Absence of the non-methylated paternal allele in this region leads to Prader-Willi syndrome (PWS) whilst absence of the methylated maternal allele results in Angelman syndrome (AS). A proportion of AS is caused by mosaic imprinting defects which may be missed with standard methods and require quantitative analysis for their detection. Sequence-based quantitative methylation analysis (SeQMA) involves quantitative comparison of peaks generated through sequencing reactions after bisulfite treatment. It is simple, cost-effective and can be easily established for a large number of genes. However, our results support previous suggestions that methods based on bisulfite treatment may be problematic for exact quantification of methylation status. Methylation-specific multiplex ligation-dependent probe amplification (MS-MLPA) avoids bisulfite treatment. It detects changes in both CpG methylation as well as copy number of up to 40 chromosomal sequences in one simple reaction. Once established in a laboratory setting, the method is more accurate, reliable and less time consuming.
Xu, Yi-Hua; Manoharan, Herbert T; Pitot, Henry C
2007-09-01
The bisulfite genomic sequencing technique is one of the most widely used techniques to study sequence-specific DNA methylation because of its unambiguous ability to reveal DNA methylation status to the order of a single nucleotide. One characteristic feature of the bisulfite genomic sequencing technique is that a number of sample sequence files will be produced from a single DNA sample. The PCR products of bisulfite-treated DNA samples cannot be sequenced directly because they are heterogeneous in nature; therefore they should be cloned into suitable plasmids and then sequenced. This procedure generates an enormous number of sample DNA sequence files as well as adding extra bases belonging to the plasmids to the sequence, which will cause problems in the final sequence comparison. Finding the methylation status for each CpG in each sample sequence is not an easy job. As a result CpG PatternFinder was developed for this purpose. The main functions of the CpG PatternFinder are: (i) to analyze the reference sequence to obtain CpG and non-CpG-C residue position information. (ii) To tailor sample sequence files (delete insertions and mark deletions from the sample sequence files) based on a configuration of ClustalW multiple alignment. (iii) To align sample sequence files with a reference file to obtain bisulfite conversion efficiency and CpG methylation status. And, (iv) to produce graphics, highlighted aligned sequence text and a summary report which can be easily exported to Microsoft Office suite. CpG PatternFinder is designed to operate cooperatively with BioEdit, a freeware on the internet. It can handle up to 100 files of sample DNA sequences simultaneously, and the total CpG pattern analysis process can be finished in minutes. CpG PatternFinder is an ideal software tool for DNA methylation studies to determine the differential methylation pattern in a large number of individuals in a population. Previously we developed the CpG Analyzer program; CpG PatternFinder is our further effort to create software tools for DNA methylation studies.
Comprehensive Analysis of DNA Methylation Data with RnBeads
Walter, Jörn; Lengauer, Thomas; Bock, Christoph
2014-01-01
RnBeads is a software tool for large-scale analysis and interpretation of DNA methylation data, providing a user-friendly analysis workflow that yields detailed hypertext reports (http://rnbeads.mpi-inf.mpg.de). Supported assays include whole genome bisulfite sequencing, reduced representation bisulfite sequencing, Infinium microarrays, and any other protocol that produces high-resolution DNA methylation data. Important applications of RnBeads include the analysis of epigenome-wide association studies and epigenetic biomarker discovery in cancer cohorts. PMID:25262207
Condon, David E; Tran, Phu V; Lien, Yu-Chin; Schug, Jonathan; Georgieff, Michael K; Simmons, Rebecca A; Won, Kyoung-Jae
2018-02-05
Identification of differentially methylated regions (DMRs) is the initial step towards the study of DNA methylation-mediated gene regulation. Previous approaches to call DMRs suffer from false prediction, use extreme resources, and/or require library installation and input conversion. We developed a new approach called Defiant to identify DMRs. Employing Weighted Welch Expansion (WWE), Defiant showed superior performance to other predictors in the series of benchmarking tests on artificial and real data. Defiant was subsequently used to investigate DNA methylation changes in iron-deficient rat hippocampus. Defiant identified DMRs close to genes associated with neuronal development and plasticity, which were not identified by its competitor. Importantly, Defiant runs between 5 to 479 times faster than currently available software packages. Also, Defiant accepts 10 different input formats widely used for DNA methylation data. Defiant effectively identifies DMRs for whole-genome bisulfite sequencing (WGBS), reduced-representation bisulfite sequencing (RRBS), Tet-assisted bisulfite sequencing (TAB-seq), and HpaII tiny fragment enrichment by ligation-mediated PCR-tag (HELP) assays.
PrimerSuite: A High-Throughput Web-Based Primer Design Program for Multiplex Bisulfite PCR.
Lu, Jennifer; Johnston, Andrew; Berichon, Philippe; Ru, Ke-Lin; Korbie, Darren; Trau, Matt
2017-01-24
The analysis of DNA methylation at CpG dinucleotides has become a major research focus due to its regulatory role in numerous biological processes, but the requisite need for assays which amplify bisulfite-converted DNA represents a major bottleneck due to the unique design constraints imposed on bisulfite-PCR primers. Moreover, a review of the literature indicated no available software solutions which accommodated both high-throughput primer design, support for multiplex amplification assays, and primer-dimer prediction. In response, the tri-modular software package PrimerSuite was developed to support bisulfite multiplex PCR applications. This software was constructed to (i) design bisulfite primers against multiple regions simultaneously (PrimerSuite), (ii) screen for primer-primer dimerizing artefacts (PrimerDimer), and (iii) support multiplex PCR assays (PrimerPlex). Moreover, a major focus in the development of this software package was the emphasis on extensive empirical validation, and over 1300 unique primer pairs have been successfully designed and screened, with over 94% of them producing amplicons of the expected size, and an average mapping efficiency of 93% when screened using bisulfite multiplex resequencing. The potential use of the software in other bisulfite-based applications such as methylation-specific PCR is under consideration for future updates. This resource is freely available for use at PrimerSuite website (www.primer-suite.com).
NASA Astrophysics Data System (ADS)
Alvarez, Jose; Massey, Steven; Kalitsov, Alan; Velev, Julian
Nanopore sequencing via transverse current has emerged as a competitive candidate for mapping DNA methylation without needed bisulfite-treatment, fluorescent tag, or PCR amplification. By eliminating the error producing amplification step, long read lengths become feasible, which greatly simplifies the assembly process and reduces the time and the cost inherent in current technologies. However, due to the large error rates of nanopore sequencing, single base resolution has not been reached. A very important source of noise is the intrinsic structural noise in the electric signature of the nucleotide arising from the influence of neighboring nucleotides. In this work we perform calculations of the tunneling current through DNA molecules in nanopores using the non-equilibrium electron transport method within an effective multi-orbital tight-binding model derived from first-principles calculations. We develop a base-calling algorithm accounting for the correlations of the current through neighboring bases, which in principle can reduce the error rate below any desired precision. Using this method we show that we can clearly distinguish DNA methylation and other base modifications based on the reading of the tunneling current.
Zackay, Arie; Steinhoff, Christine
2010-12-15
Exploration of DNA methylation and its impact on various regulatory mechanisms has become a very active field of research. Simultaneously there is an arising need for tools to process and analyse the data together with statistical investigation and visualisation. MethVisual is a new application that enables exploratory analysis and intuitive visualization of DNA methylation data as is typically generated by bisulfite sequencing. The package allows the import of DNA methylation sequences, aligns them and performs quality control comparison. It comprises basic analysis steps as lollipop visualization, co-occurrence display of methylation of neighbouring and distant CpG sites, summary statistics on methylation status, clustering and correspondence analysis. The package has been developed for methylation data but can be also used for other data types for which binary coding can be inferred. The application of the package, as well as a comparison to existing DNA methylation analysis tools and its workflow based on two datasets is presented in this paper. The R package MethVisual offers various analysis procedures for data that can be binarized, in particular for bisulfite sequenced methylation data. R/Bioconductor has become one of the most important environments for statistical analysis of various types of biological and medical data. Therefore, any data analysis within R that allows the integration of various data types as provided from different technological platforms is convenient. It is the first and so far the only specific package for DNA methylation analysis, in particular for bisulfite sequenced data available in R/Bioconductor enviroment. The package is available for free at http://methvisual.molgen.mpg.de/ and from the Bioconductor Consortium http://www.bioconductor.org.
2010-01-01
Background Exploration of DNA methylation and its impact on various regulatory mechanisms has become a very active field of research. Simultaneously there is an arising need for tools to process and analyse the data together with statistical investigation and visualisation. Findings MethVisual is a new application that enables exploratory analysis and intuitive visualization of DNA methylation data as is typically generated by bisulfite sequencing. The package allows the import of DNA methylation sequences, aligns them and performs quality control comparison. It comprises basic analysis steps as lollipop visualization, co-occurrence display of methylation of neighbouring and distant CpG sites, summary statistics on methylation status, clustering and correspondence analysis. The package has been developed for methylation data but can be also used for other data types for which binary coding can be inferred. The application of the package, as well as a comparison to existing DNA methylation analysis tools and its workflow based on two datasets is presented in this paper. Conclusions The R package MethVisual offers various analysis procedures for data that can be binarized, in particular for bisulfite sequenced methylation data. R/Bioconductor has become one of the most important environments for statistical analysis of various types of biological and medical data. Therefore, any data analysis within R that allows the integration of various data types as provided from different technological platforms is convenient. It is the first and so far the only specific package for DNA methylation analysis, in particular for bisulfite sequenced data available in R/Bioconductor enviroment. The package is available for free at http://methvisual.molgen.mpg.de/ and from the Bioconductor Consortium http://www.bioconductor.org. PMID:21159174
Maximizing ecological and evolutionary insight in bisulfite sequencing data sets
Lea, Amanda J.; Vilgalys, Tauras P.; Durst, Paul A.P.; Tung, Jenny
2017-01-01
Preface Genome-scale bisulfite sequencing approaches have opened the door to ecological and evolutionary studies of DNA methylation in many organisms. These approaches can be powerful. However, they introduce new methodological and statistical considerations, some of which are particularly relevant to non-model systems. Here, we highlight how these considerations influence a study’s power to link methylation variation with a predictor variable of interest. Relative to current practice, we argue that sample sizes will need to increase to provide robust insights. We also provide recommendations for overcoming common challenges and an R Shiny app to aid in study design. PMID:29046582
Partial bisulfite conversion for unique template sequencing
Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael
2018-01-01
Abstract We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. PMID:29161423
Herrmann, Alexander; Haake, Andrea; Ammerpohl, Ole; Martin-Guerrero, Idoia; Szafranski, Karol; Stemshorn, Kathryn; Nothnagel, Michael; Kotsopoulos, Steve K; Richter, Julia; Warner, Jason; Olson, Jeff; Link, Darren R; Schreiber, Stefan; Krawczak, Michael; Platzer, Matthias; Nürnberg, Peter; Siebert, Reiner; Hampe, Jochen
2011-01-01
Cytosine methylation provides an epigenetic level of cellular plasticity that is important for development, differentiation and cancerogenesis. We adopted microdroplet PCR to bisulfite treated target DNA in combination with second generation sequencing to simultaneously assess DNA sequence and methylation. We show measurement of methylation status in a wide range of target sequences (total 34 kb) with an average coverage of 95% (median 100%) and good correlation to the opposite strand (rho = 0.96) and to pyrosequencing (rho = 0.87). Data from lymphoma and colorectal cancer samples for SNRPN (imprinted gene), FGF6 (demethylated in the cancer samples) and HS3ST2 (methylated in the cancer samples) serve as a proof of principle showing the integration of SNP data and phased DNA-methylation information into "hepitypes" and thus the analysis of DNA methylation phylogeny in the somatic evolution of cancer.
Scala, Giovanni; Affinito, Ornella; Palumbo, Domenico; Florio, Ermanno; Monticelli, Antonella; Miele, Gennaro; Chiariotti, Lorenzo; Cocozza, Sergio
2016-11-25
CpG sites in an individual molecule may exist in a binary state (methylated or unmethylated) and each individual DNA molecule, containing a certain number of CpGs, is a combination of these states defining an epihaplotype. Classic quantification based approaches to study DNA methylation are intrinsically unable to fully represent the complexity of the underlying methylation substrate. Epihaplotype based approaches, on the other hand, allow methylation profiles of cell populations to be studied at the single molecule level. For such investigations, next-generation sequencing techniques can be used, both for quantitative and for epihaplotype analysis. Currently available tools for methylation analysis lack output formats that explicitly report CpG methylation profiles at the single molecule level and that have suited statistical tools for their interpretation. Here we present ampliMethProfiler, a python-based pipeline for the extraction and statistical epihaplotype analysis of amplicons from targeted deep bisulfite sequencing of multiple DNA regions. ampliMethProfiler tool provides an easy and user friendly way to extract and analyze the epihaplotype composition of reads from targeted bisulfite sequencing experiments. ampliMethProfiler is written in python language and requires a local installation of BLAST and (optionally) QIIME tools. It can be run on Linux and OS X platforms. The software is open source and freely available at http://amplimethprofiler.sourceforge.net .
Epigenome-wide inheritance of cytosine methylation variants in a recombinant inbred population
Schmitz, Robert J.; He, Yupeng; Valdés-López, Oswaldo; Khan, Saad M.; Joshi, Trupti; Urich, Mark A.; Nery, Joseph R.; Diers, Brian; Xu, Dong; Stacey, Gary; Ecker, Joseph R.
2013-01-01
Cytosine DNA methylation is one avenue for passing information through cell divisions. Here, we present epigenomic analyses of soybean recombinant inbred lines (RILs) and their parents. Identification of differentially methylated regions (DMRs) revealed that DMRs mostly cosegregated with the genotype from which they were derived, but examples of the uncoupling of genotype and epigenotype were identified. Linkage mapping of methylation states assessed from whole-genome bisulfite sequencing of 83 RILs uncovered widespread evidence for local methylQTL. This epigenomics approach provides a comprehensive study of the patterns and heritability of methylation variants in a complex genetic population over multiple generations, paving the way for understanding how methylation variants contribute to phenotypic variation. PMID:23739894
Epigenome-wide inheritance of cytosine methylation variants in a recombinant inbred population.
Schmitz, Robert J; He, Yupeng; Valdés-López, Oswaldo; Khan, Saad M; Joshi, Trupti; Urich, Mark A; Nery, Joseph R; Diers, Brian; Xu, Dong; Stacey, Gary; Ecker, Joseph R
2013-10-01
Cytosine DNA methylation is one avenue for passing information through cell divisions. Here, we present epigenomic analyses of soybean recombinant inbred lines (RILs) and their parents. Identification of differentially methylated regions (DMRs) revealed that DMRs mostly cosegregated with the genotype from which they were derived, but examples of the uncoupling of genotype and epigenotype were identified. Linkage mapping of methylation states assessed from whole-genome bisulfite sequencing of 83 RILs uncovered widespread evidence for local methylQTL. This epigenomics approach provides a comprehensive study of the patterns and heritability of methylation variants in a complex genetic population over multiple generations, paving the way for understanding how methylation variants contribute to phenotypic variation.
CloudAligner: A fast and full-featured MapReduce based tool for sequence mapping.
Nguyen, Tung; Shi, Weisong; Ruden, Douglas
2011-06-06
Research in genetics has developed rapidly recently due to the aid of next generation sequencing (NGS). However, massively-parallel NGS produces enormous amounts of data, which leads to storage, compatibility, scalability, and performance issues. The Cloud Computing and MapReduce framework, which utilizes hundreds or thousands of shared computers to map sequencing reads quickly and efficiently to reference genome sequences, appears to be a very promising solution for these issues. Consequently, it has been adopted by many organizations recently, and the initial results are very promising. However, since these are only initial steps toward this trend, the developed software does not provide adequate primary functions like bisulfite, pair-end mapping, etc., in on-site software such as RMAP or BS Seeker. In addition, existing MapReduce-based applications were not designed to process the long reads produced by the most recent second-generation and third-generation NGS instruments and, therefore, are inefficient. Last, it is difficult for a majority of biologists untrained in programming skills to use these tools because most were developed on Linux with a command line interface. To urge the trend of using Cloud technologies in genomics and prepare for advances in second- and third-generation DNA sequencing, we have built a Hadoop MapReduce-based application, CloudAligner, which achieves higher performance, covers most primary features, is more accurate, and has a user-friendly interface. It was also designed to be able to deal with long sequences. The performance gain of CloudAligner over Cloud-based counterparts (35 to 80%) mainly comes from the omission of the reduce phase. In comparison to local-based approaches, the performance gain of CloudAligner is from the partition and parallel processing of the huge reference genome as well as the reads. The source code of CloudAligner is available at http://cloudaligner.sourceforge.net/ and its web version is at http://mine.cs.wayne.edu:8080/CloudAligner/. Our results show that CloudAligner is faster than CloudBurst, provides more accurate results than RMAP, and supports various input as well as output formats. In addition, with the web-based interface, it is easier to use than its counterparts.
Chatterjee, Aniruddha; Stockwell, Peter A; Ahn, Antonio; Rodger, Euan J; Leichter, Anna L; Eccles, Michael R
2017-01-01
Epigenetic alterations are increasingly implicated in metastasis, whereas very few genetic mutations have been identified as authentic drivers of cancer metastasis. Yet, to date, few studies have identified metastasis-related epigenetic drivers, in part because a framework for identifying driver epigenetic changes in metastasis has not been established. Using reduced representation bisulfite sequencing (RRBS), we mapped genome-wide DNA methylation patterns in three cutaneous primary and metastatic melanoma cell line pairs to identify metastasis-related epigenetic drivers. Globally, metastatic melanoma cell lines were hypomethylated compared to the matched primary melanoma cell lines. Using whole genome RRBS we identified 75 shared (10 hyper- and 65 hypomethylated) differentially methylated fragments (DMFs), which were associated with 68 genes showing significant methylation differences. One gene, Early B Cell Factor 3 (EBF3), exhibited promoter hypermethylation in metastatic cell lines, and was validated with bisulfite sequencing and in two publicly available independent melanoma cohorts (n = 40 and 458 melanomas, respectively). We found that hypermethylation of the EBF3 promoter was associated with increased EBF3 mRNA levels in metastatic melanomas and subsequent inhibition of DNA methylation reduced EBF3 expression. RNAi-mediated knockdown of EBF3 mRNA levels decreased proliferation, migration and invasion in primary and metastatic melanoma cell lines. Overall, we have identified numerous epigenetic changes characterising metastatic melanoma cell lines, including EBF3-induced aggressive phenotypic behaviour with elevated EBF3 expression in metastatic melanoma, suggesting that EBF3 promoter hypermethylation may be a candidate epigenetic driver of metastasis. PMID:28030832
Molecular barcodes detect redundancy and contamination in hairpin-bisulfite PCR
Miner, Brooks E.; Stöger, Reinhard J.; Burden, Alice F.; Laird, Charles D.; Hansen, R. Scott
2004-01-01
PCR amplification of limited amounts of DNA template carries an increased risk of product redundancy and contamination. We use molecular barcoding to label each genomic DNA template with an individual sequence tag prior to PCR amplification. In addition, we include molecular ‘batch-stamps’ that effectively label each genomic template with a sample ID and analysis date. This highly sensitive method identifies redundant and contaminant sequences and serves as a reliable method for positive identification of desired sequences; we can therefore capture accurately the genomic template diversity in the sample analyzed. Although our application described here involves the use of hairpin-bisulfite PCR for amplification of double-stranded DNA, the method can readily be adapted to single-strand PCR. Useful applications will include analyses of limited template DNA for biomedical, ancient DNA and forensic purposes. PMID:15459281
Matsuyama, Tomoki; Kimura, Makoto T.; Koike, Kuniaki; Abe, Tomoko; Nakano, Takeshi; Asami, Tadao; Ebisuzaki, Toshikazu; Held, William A.; Yoshida, Shigeo; Nagase, Hiroki
2003-01-01
Understanding the role of ‘epigenetic’ changes such as DNA methylation and chromatin remodeling has now become critical in understanding many biological processes. In order to delineate the global methylation pattern in a given genomic DNA, computer software has been developed to create a virtual image of restriction landmark genomic scanning (Vi-RLGS). When using a methylation- sensitive enzyme such as NotI as the restriction landmark, the comparison between real and in silico RLGS profiles of the genome provides a methylation map of genomic NotI sites. A methylation map of the Arabidopsis genome was created that could be confirmed by a methylation-sensitive PCR assay. The method has also been applied to the mouse genome. Although a complete methylation map has not been completed, a region of methylation difference between two tissues has been tested and confirmed by bisulfite sequencing. Vi-RLGS in conjunction with real RLGS will make it possible to develop a more complete map of genomic sites that are methylated or demethylated as a consequence of normal or abnormal development. PMID:12888509
Partial bisulfite conversion for unique template sequencing.
Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael; Levy, Dan
2018-01-25
We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Differential DNA methylation and transcription profiles in date palm roots exposed to salinity
Al-Harrasi, Ibtisam; Al-Yahyai, Rashid
2018-01-01
As a salt-adaptive plant, the date palm (Phoenix dactylifera L.) requires a suitable mechanism to adapt to the stress of saline soils. There is growing evidence that DNA methylation plays an important role in regulating gene expression in response to abiotic stresses, including salinity. Thus, the present study sought to examine the differential methylation status that occurs in the date palm genome when plants are exposed to salinity, and to identify salinity responsive genes that are regulated by DNA methylation. To achieve these, whole-genome bisulfite sequencing (WGBS) was employed and mRNA was sequenced from salinity-treated and untreated roots. The WGBS analysis included 324,987,795 and 317,056,091 total reads of the control and the salinity-treated samples, respectively. The analysis covered about 81% of the total genomic DNA with about 40% of mapping efficiency of the sequenced reads and an average read depth of 17-fold coverage per DNA strand, and with a bisulfite conversion rate of around 99%. The level of methylation within the differentially methylated regions (DMRs) was significantly (p < 0.05, FDR ≤ 0.05) increased in response to salinity specifically at the mCHG and mCHH sequence contexts. Consistently, the mass spectrometry and the enzyme-linked immunosorbent assay (ELISA) showed that there was a significant (p < 0.05) increase in the global DNA methylation in response to salinity. mRNA sequencing revealed the presence of 6,405 differentially regulated genes with a significant value (p < 0.001, FDR ≤ 0.05) in response to salinity. Integration of high-resolution methylome and transcriptome analyses revealed a negative correlation between mCG methylation located within the promoters and the gene expression, while a positive correlation was noticed between mCHG/mCHH methylation rations and gene expression specifically when plants grew under control conditions. Therefore, the methylome and transcriptome relationships vary based on the methylated sequence context, the methylated region within the gene, the protein-coding ability of the gene, and the salinity treatment. These results provide insights into interplay among DNA methylation and gene expression, and highlight the effect of salinity on the nature of this relationship, which may involve other genetic and epigenetic players under salt stress conditions. The results obtained from this project provide the first draft map of the differential methylome and transcriptome of date palm when exposed to an abiotic stress. PMID:29352281
Differential DNA methylation and transcription profiles in date palm roots exposed to salinity.
Al-Harrasi, Ibtisam; Al-Yahyai, Rashid; Yaish, Mahmoud W
2018-01-01
As a salt-adaptive plant, the date palm (Phoenix dactylifera L.) requires a suitable mechanism to adapt to the stress of saline soils. There is growing evidence that DNA methylation plays an important role in regulating gene expression in response to abiotic stresses, including salinity. Thus, the present study sought to examine the differential methylation status that occurs in the date palm genome when plants are exposed to salinity, and to identify salinity responsive genes that are regulated by DNA methylation. To achieve these, whole-genome bisulfite sequencing (WGBS) was employed and mRNA was sequenced from salinity-treated and untreated roots. The WGBS analysis included 324,987,795 and 317,056,091 total reads of the control and the salinity-treated samples, respectively. The analysis covered about 81% of the total genomic DNA with about 40% of mapping efficiency of the sequenced reads and an average read depth of 17-fold coverage per DNA strand, and with a bisulfite conversion rate of around 99%. The level of methylation within the differentially methylated regions (DMRs) was significantly (p < 0.05, FDR ≤ 0.05) increased in response to salinity specifically at the mCHG and mCHH sequence contexts. Consistently, the mass spectrometry and the enzyme-linked immunosorbent assay (ELISA) showed that there was a significant (p < 0.05) increase in the global DNA methylation in response to salinity. mRNA sequencing revealed the presence of 6,405 differentially regulated genes with a significant value (p < 0.001, FDR ≤ 0.05) in response to salinity. Integration of high-resolution methylome and transcriptome analyses revealed a negative correlation between mCG methylation located within the promoters and the gene expression, while a positive correlation was noticed between mCHG/mCHH methylation rations and gene expression specifically when plants grew under control conditions. Therefore, the methylome and transcriptome relationships vary based on the methylated sequence context, the methylated region within the gene, the protein-coding ability of the gene, and the salinity treatment. These results provide insights into interplay among DNA methylation and gene expression, and highlight the effect of salinity on the nature of this relationship, which may involve other genetic and epigenetic players under salt stress conditions. The results obtained from this project provide the first draft map of the differential methylome and transcriptome of date palm when exposed to an abiotic stress.
Han, Lin; Wu, Hua-Jun; Zhu, Haiying; Kim, Kun-Yong; Marjani, Sadie L.; Riester, Markus; Euskirchen, Ghia; Zi, Xiaoyuan; Yang, Jennifer; Han, Jasper; Snyder, Michael; Park, In-Hyun; Irizarry, Rafael; Weissman, Sherman M.
2017-01-01
Abstract Conventional DNA bisulfite sequencing has been extended to single cell level, but the coverage consistency is insufficient for parallel comparison. Here we report a novel method for genome-wide CpG island (CGI) methylation sequencing for single cells (scCGI-seq), combining methylation-sensitive restriction enzyme digestion and multiple displacement amplification for selective detection of methylated CGIs. We applied this method to analyzing single cells from two types of hematopoietic cells, K562 and GM12878 and small populations of fibroblasts and induced pluripotent stem cells. The method detected 21 798 CGIs (76% of all CGIs) per cell, and the number of CGIs consistently detected from all 16 profiled single cells was 20 864 (72.7%), with 12 961 promoters covered. This coverage represents a substantial improvement over results obtained using single cell reduced representation bisulfite sequencing, with a 66-fold increase in the fraction of consistently profiled CGIs across individual cells. Single cells of the same type were more similar to each other than to other types, but also displayed epigenetic heterogeneity. The method was further validated by comparing the CpG methylation pattern, methylation profile of CGIs/promoters and repeat regions and 41 classes of known regulatory markers to the ENCODE data. Although not every minor methylation differences between cells are detectable, scCGI-seq provides a solid tool for unsupervised stratification of a heterogeneous cell population. PMID:28126923
Parrish, R Ryley; Day, Jeremy J; Lubin, Farah D
2012-07-01
DNA methylation is an epigenetic modification that is essential for the development and mature function of the central nervous system. Due to the relevance of this modification to the transcriptional control of gene expression, it is often necessary to examine changes in DNA methylation patterns with both gene and single-nucleotide resolution. Here, we describe an in-depth basic protocol for direct bisulfite sequencing of DNA isolated from brain tissue, which will permit direct assessment of methylation status at individual genes as well as individual cytosine molecules/nucleotides within a genomic region. This method yields analysis of DNA methylation patterns that is robust, accurate, and reproducible, thereby allowing insights into the role of alterations in DNA methylation in brain tissue.
Zhang, Yun; Baheti, Saurabh; Sun, Zhifu
2018-05-01
High-throughput bisulfite methylation sequencing such as reduced representation bisulfite sequencing (RRBS), Agilent SureSelect Human Methyl-Seq (Methyl-seq) or whole-genome bisulfite sequencing is commonly used for base resolution methylome research. These data are represented either by the ratio of methylated cytosine versus total coverage at a CpG site or numbers of methylated and unmethylated cytosines. Multiple statistical methods can be used to detect differentially methylated CpGs (DMCs) between conditions, and these methods are often the base for the next step of differentially methylated region identification. The ratio data have a flexibility of fitting to many linear models, but the raw count data take consideration of coverage information. There is an array of options in each datatype for DMC detection; however, it is not clear which is an optimal statistical method. In this study, we systematically evaluated four statistic methods on methylation ratio data and four methods on count-based data and compared their performances with regard to type I error control, sensitivity and specificity of DMC detection and computational resource demands using real RRBS data along with simulation. Our results show that the ratio-based tests are generally more conservative (less sensitive) than the count-based tests. However, some count-based methods have high false-positive rates and should be avoided. The beta-binomial model gives a good balance between sensitivity and specificity and is preferred method. Selection of methods in different settings, signal versus noise and sample size estimation are also discussed.
Li, Zibo; Guo, Xinwu; Tang, Lili; Peng, Limin; Chen, Ming; Luo, Xipeng; Wang, Shouman; Xiao, Zhi; Deng, Zhongping; Dai, Lizhong; Xia, Kun; Wang, Jun
2016-10-01
Circulating cell-free DNA (cfDNA) has been considered as a potential biomarker for non-invasive cancer detection. To evaluate the methylation levels of six candidate genes (EGFR, GREM1, PDGFRB, PPM1E, SOX17, and WRN) in plasma cfDNA as biomarkers for breast cancer early detection, quantitative analysis of the promoter methylation of these genes from 86 breast cancer patients and 67 healthy controls was performed by using microfluidic-PCR-based target enrichment and next-generation bisulfite sequencing technology. The predictive performance of different logistic models based on methylation status of candidate genes was investigated by means of the area under the ROC curve (AUC) and odds ratio (OR) analysis. Results revealed that EGFR, PPM1E, and 8 gene-specific CpG sites showed significantly hypermethylation in cancer patients' plasma and significantly associated with breast cancer (OR ranging from 2.51 to 9.88). The AUC values for these biomarkers were ranging from 0.66 to 0.75. Combinations of multiple hypermethylated genes or CpG sites substantially improved the predictive performance for breast cancer detection. Our study demonstrated the feasibility of quantitative measurement of candidate gene methylation in cfDNA by using microfluidic-PCR-based target enrichment and bisulfite next-generation sequencing, which is worthy of further validation and potentially benefits a broad range of applications in clinical oncology practice. Quantitative analysis of methylation pattern of plasma cfDNA by next-generation sequencing might be a valuable non-invasive tool for early detection of breast cancer.
Huh, Iksoo; Wu, Xin; Park, Taesung; Yi, Soojin V
2017-07-21
DNA methylation is one of the most extensively studied epigenetic modifications of genomic DNA. In recent years, sequencing of bisulfite-converted DNA, particularly via next-generation sequencing technologies, has become a widely popular method to study DNA methylation. This method can be readily applied to a variety of species, dramatically expanding the scope of DNA methylation studies beyond the traditionally studied human and mouse systems. In parallel to the increasing wealth of genomic methylation profiles, many statistical tools have been developed to detect differentially methylated loci (DMLs) or differentially methylated regions (DMRs) between biological conditions. We discuss and summarize several key properties of currently available tools to detect DMLs and DMRs from sequencing of bisulfite-converted DNA. However, the majority of the statistical tools developed for DML/DMR analyses have been validated using only mammalian data sets, and less priority has been placed on the analyses of invertebrate or plant DNA methylation data. We demonstrate that genomic methylation profiles of non-mammalian species are often highly distinct from those of mammalian species using examples of honey bees and humans. We then discuss how such differences in data properties may affect statistical analyses. Based on these differences, we provide three specific recommendations to improve the power and accuracy of DML and DMR analyses of invertebrate data when using currently available statistical tools. These considerations should facilitate systematic and robust analyses of DNA methylation from diverse species, thus advancing our understanding of DNA methylation. © The Author 2017. Published by Oxford University Press.
Single-cell DNA methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics.
Farlik, Matthias; Sheffield, Nathan C; Nuzzo, Angelo; Datlinger, Paul; Schönegger, Andreas; Klughammer, Johanna; Bock, Christoph
2015-03-03
Methods for single-cell genome and transcriptome sequencing have contributed to our understanding of cellular heterogeneity, whereas methods for single-cell epigenomics are much less established. Here, we describe a whole-genome bisulfite sequencing (WGBS) assay that enables DNA methylation mapping in very small cell populations (μWGBS) and single cells (scWGBS). Our assay is optimized for profiling many samples at low coverage, and we describe a bioinformatic method that analyzes collections of single-cell methylomes to infer cell-state dynamics. Using these technological advances, we studied epigenomic cell-state dynamics in three in vitro models of cellular differentiation and pluripotency, where we observed characteristic patterns of epigenome remodeling and cell-to-cell heterogeneity. The described method enables single-cell analysis of DNA methylation in a broad range of biological systems, including embryonic development, stem cell differentiation, and cancer. It can also be used to establish composite methylomes that account for cell-to-cell heterogeneity in complex tissue samples. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
DNA methylation assessment from human slow- and fast-twitch skeletal muscle fibers
Begue, Gwénaëlle; Raue, Ulrika; Jemiolo, Bozena
2017-01-01
A new application of the reduced representation bisulfite sequencing method was developed using low-DNA input to investigate the epigenetic profile of human slow- and fast-twitch skeletal muscle fibers. Successful library construction was completed with as little as 15 ng of DNA, and high-quality sequencing data were obtained with 32 ng of DNA. Analysis identified 143,160 differentially methylated CpG sites across 14,046 genes. In both fiber types, selected genes predominantly expressed in slow or fast fibers were hypomethylated, which was supported by the RNA-sequencing analysis. These are the first fiber type-specific methylation data from human skeletal muscle and provide a unique platform for future research. NEW & NOTEWORTHY This study validates a low-DNA input reduced representation bisulfite sequencing method for human muscle biopsy samples to investigate the methylation patterns at a fiber type-specific level. These are the first fiber type-specific methylation data reported from human skeletal muscle and thus provide initial insight into basal state differences in myosin heavy chain I and IIa muscle fibers among young, healthy men. PMID:28057818
Maggi, Elaine C; Gravina, Silvia; Cheng, Haiying; Piperdi, Bilal; Yuan, Ziqiang; Dong, Xiao; Libutti, Steven K; Vijg, Jan; Montagna, Cristina
2018-01-01
The goal of this study was to develop a method for whole genome cell-free DNA (cfDNA) methylation analysis in humans and mice with the ultimate goal to facilitate the identification of tumor derived DNA methylation changes in the blood. Plasma or serum from patients with pancreatic neuroendocrine tumors or lung cancer, and plasma from a murine model of pancreatic adenocarcinoma was used to develop a protocol for cfDNA isolation, library preparation and whole-genome bisulfite sequencing of ultra low quantities of cfDNA, including tumor-specific DNA. The protocol developed produced high quality libraries consistently generating a conversion rate >98% that will be applicable for the analysis of human and mouse plasma or serum to detect tumor-derived changes in DNA methylation.
Han, Lin; Wu, Hua-Jun; Zhu, Haiying; Kim, Kun-Yong; Marjani, Sadie L; Riester, Markus; Euskirchen, Ghia; Zi, Xiaoyuan; Yang, Jennifer; Han, Jasper; Snyder, Michael; Park, In-Hyun; Irizarry, Rafael; Weissman, Sherman M; Michor, Franziska; Fan, Rong; Pan, Xinghua
2017-06-02
Conventional DNA bisulfite sequencing has been extended to single cell level, but the coverage consistency is insufficient for parallel comparison. Here we report a novel method for genome-wide CpG island (CGI) methylation sequencing for single cells (scCGI-seq), combining methylation-sensitive restriction enzyme digestion and multiple displacement amplification for selective detection of methylated CGIs. We applied this method to analyzing single cells from two types of hematopoietic cells, K562 and GM12878 and small populations of fibroblasts and induced pluripotent stem cells. The method detected 21 798 CGIs (76% of all CGIs) per cell, and the number of CGIs consistently detected from all 16 profiled single cells was 20 864 (72.7%), with 12 961 promoters covered. This coverage represents a substantial improvement over results obtained using single cell reduced representation bisulfite sequencing, with a 66-fold increase in the fraction of consistently profiled CGIs across individual cells. Single cells of the same type were more similar to each other than to other types, but also displayed epigenetic heterogeneity. The method was further validated by comparing the CpG methylation pattern, methylation profile of CGIs/promoters and repeat regions and 41 classes of known regulatory markers to the ENCODE data. Although not every minor methylation differences between cells are detectable, scCGI-seq provides a solid tool for unsupervised stratification of a heterogeneous cell population. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Feng, Hao; Conneely, Karen N.; Wu, Hao
2014-01-01
DNA methylation is an important epigenetic modification that has essential roles in cellular processes including gene regulation, development and disease and is widely dysregulated in most types of cancer. Recent advances in sequencing technology have enabled the measurement of DNA methylation at single nucleotide resolution through methods such as whole-genome bisulfite sequencing and reduced representation bisulfite sequencing. In DNA methylation studies, a key task is to identify differences under distinct biological contexts, for example, between tumor and normal tissue. A challenge in sequencing studies is that the number of biological replicates is often limited by the costs of sequencing. The small number of replicates leads to unstable variance estimation, which can reduce accuracy to detect differentially methylated loci (DML). Here we propose a novel statistical method to detect DML when comparing two treatment groups. The sequencing counts are described by a lognormal-beta-binomial hierarchical model, which provides a basis for information sharing across different CpG sites. A Wald test is developed for hypothesis testing at each CpG site. Simulation results show that the proposed method yields improved DML detection compared to existing methods, particularly when the number of replicates is low. The proposed method is implemented in the Bioconductor package DSS. PMID:24561809
P-Hint-Hunt: a deep parallelized whole genome DNA methylation detection tool.
Peng, Shaoliang; Yang, Shunyun; Gao, Ming; Liao, Xiangke; Liu, Jie; Yang, Canqun; Wu, Chengkun; Yu, Wenqiang
2017-03-14
The increasing studies have been conducted using whole genome DNA methylation detection as one of the most important part of epigenetics research to find the significant relationships among DNA methylation and several typical diseases, such as cancers and diabetes. In many of those studies, mapping the bisulfite treated sequence to the whole genome has been the main method to study DNA cytosine methylation. However, today's relative tools almost suffer from inaccuracies and time-consuming problems. In our study, we designed a new DNA methylation prediction tool ("Hint-Hunt") to solve the problem. By having an optimal complex alignment computation and Smith-Waterman matrix dynamic programming, Hint-Hunt could analyze and predict the DNA methylation status. But when Hint-Hunt tried to predict DNA methylation status with large-scale dataset, there are still slow speed and low temporal-spatial efficiency problems. In order to solve the problems of Smith-Waterman dynamic programming and low temporal-spatial efficiency, we further design a deep parallelized whole genome DNA methylation detection tool ("P-Hint-Hunt") on Tianhe-2 (TH-2) supercomputer. To the best of our knowledge, P-Hint-Hunt is the first parallel DNA methylation detection tool with a high speed-up to process large-scale dataset, and could run both on CPU and Intel Xeon Phi coprocessors. Moreover, we deploy and evaluate Hint-Hunt and P-Hint-Hunt on TH-2 supercomputer in different scales. The experimental results illuminate our tools eliminate the deviation caused by bisulfite treatment in mapping procedure and the multi-level parallel program yields a 48 times speed-up with 64 threads. P-Hint-Hunt gain a deep acceleration on CPU and Intel Xeon Phi heterogeneous platform, which gives full play of the advantages of multi-cores (CPU) and many-cores (Phi).
Methylation Integration (Mint) | Informatics Technology for Cancer Research (ITCR)
A comprehensive software pipeline and set of Galaxy tools/workflows for integrative analysis of genome-wide DNA methylation and hydroxymethylation data. Data types can be either bisulfite sequencing and/or pull-down methods.
Successful amplification of DNA aboard the International Space Station.
Boguraev, Anna-Sophia; Christensen, Holly C; Bonneau, Ashley R; Pezza, John A; Nichols, Nicole M; Giraldez, Antonio J; Gray, Michelle M; Wagner, Brandon M; Aken, Jordan T; Foley, Kevin D; Copeland, D Scott; Kraves, Sebastian; Alvarez Saavedra, Ezequiel
2017-01-01
As the range and duration of human ventures into space increase, it becomes imperative that we understand the effects of the cosmic environment on astronaut health. Molecular technologies now widely used in research and medicine will need to become available in space to ensure appropriate care of astronauts. The polymerase chain reaction (PCR) is the gold standard for DNA analysis, yet its potential for use on-orbit remains under-explored. We describe DNA amplification aboard the International Space Station (ISS) through the use of a miniaturized miniPCR system. Target sequences in plasmid, zebrafish genomic DNA, and bisulfite-treated DNA were successfully amplified under a variety of conditions. Methylation-specific primers differentially amplified bisulfite-treated samples as would be expected under standard laboratory conditions. Our findings establish proof of concept for targeted detection of DNA sequences during spaceflight and lay a foundation for future uses ranging from environmental monitoring to on-orbit diagnostics.
TEA: the epigenome platform for Arabidopsis methylome study.
Su, Sheng-Yao; Chen, Shu-Hwa; Lu, I-Hsuan; Chiang, Yih-Shien; Wang, Yu-Bin; Chen, Pao-Yang; Lin, Chung-Yen
2016-12-22
Bisulfite sequencing (BS-seq) has become a standard technology to profile genome-wide DNA methylation at single-base resolution. It allows researchers to conduct genome-wise cytosine methylation analyses on issues about genomic imprinting, transcriptional regulation, cellular development and differentiation. One single data from a BS-Seq experiment is resolved into many features according to the sequence contexts, making methylome data analysis and data visualization a complex task. We developed a streamlined platform, TEA, for analyzing and visualizing data from whole-genome BS-Seq (WGBS) experiments conducted in the model plant Arabidopsis thaliana. To capture the essence of the genome methylation level and to meet the efficiency for running online, we introduce a straightforward method for measuring genome methylation in each sequence context by gene. The method is scripted in Java to process BS-Seq mapping results. Through a simple data uploading process, the TEA server deploys a web-based platform for deep analysis by linking data to an updated Arabidopsis annotation database and toolkits. TEA is an intuitive and efficient online platform for analyzing the Arabidopsis genomic DNA methylation landscape. It provides several ways to help users exploit WGBS data. TEA is freely accessible for academic users at: http://tea.iis.sinica.edu.tw .
iMETHYL: an integrative database of human DNA methylation, gene expression, and genomic variation.
Komaki, Shohei; Shiwa, Yuh; Furukawa, Ryohei; Hachiya, Tsuyoshi; Ohmomo, Hideki; Otomo, Ryo; Satoh, Mamoru; Hitomi, Jiro; Sobue, Kenji; Sasaki, Makoto; Shimizu, Atsushi
2018-01-01
We launched an integrative multi-omics database, iMETHYL (http://imethyl.iwate-megabank.org). iMETHYL provides whole-DNA methylation (~24 million autosomal CpG sites), whole-genome (~9 million single-nucleotide variants), and whole-transcriptome (>14 000 genes) data for CD4 + T-lymphocytes, monocytes, and neutrophils collected from approximately 100 subjects. These data were obtained from whole-genome bisulfite sequencing, whole-genome sequencing, and whole-transcriptome sequencing, making iMETHYL a comprehensive database.
Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi
2016-02-25
Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5' and 3' flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed obviously from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome.
USDA-ARS?s Scientific Manuscript database
Maternal obesity (OB) and excessive gestational weight gain (GWG) are strong independent contributors that augment obesity risk in offspring. However, direct evidence of epigenetic changes associated with maternal habitus remains sparse. We utilized Bisulfite Amplicon Sequencing (BSAS) to conduct t...
Nonparametric Bayesian clustering to detect bipolar methylated genomic loci.
Wu, Xiaowei; Sun, Ming-An; Zhu, Hongxiao; Xie, Hehuang
2015-01-16
With recent development in sequencing technology, a large number of genome-wide DNA methylation studies have generated massive amounts of bisulfite sequencing data. The analysis of DNA methylation patterns helps researchers understand epigenetic regulatory mechanisms. Highly variable methylation patterns reflect stochastic fluctuations in DNA methylation, whereas well-structured methylation patterns imply deterministic methylation events. Among these methylation patterns, bipolar patterns are important as they may originate from allele-specific methylation (ASM) or cell-specific methylation (CSM). Utilizing nonparametric Bayesian clustering followed by hypothesis testing, we have developed a novel statistical approach to identify bipolar methylated genomic regions in bisulfite sequencing data. Simulation studies demonstrate that the proposed method achieves good performance in terms of specificity and sensitivity. We used the method to analyze data from mouse brain and human blood methylomes. The bipolar methylated segments detected are found highly consistent with the differentially methylated regions identified by using purified cell subsets. Bipolar DNA methylation often indicates epigenetic heterogeneity caused by ASM or CSM. With allele-specific events filtered out or appropriately taken into account, our proposed approach sheds light on the identification of cell-specific genes/pathways under strong epigenetic control in a heterogeneous cell population.
Genome-wide bisulfite sensitivity profiling of yeast suggests bisulfite inhibits transcription.
Segovia, Romulo; Mathew, Veena; Tam, Annie S; Stirling, Peter C
2017-09-01
Bisulfite, in the form of sodium bisulfite or metabisulfite, is used commercially as a food preservative. Bisulfite is used in the laboratory as a single-stranded DNA mutagen in epigenomic analyses of DNA methylation. Recently it has also been used on whole yeast cells to induce mutations in exposed single-stranded regions in vivo. To understand the effects of bisulfite on live cells we conducted a genome-wide screen for bisulfite sensitive mutants in yeast. Screening the deletion mutant array, and collections of essential gene mutants we define a genetic network of bisulfite sensitive mutants. Validation of screen hits revealed hyper-sensitivity of transcription and RNA processing mutants, rather than DNA repair pathways and follow-up analyses support a role in perturbation of RNA transactions. We propose a model in which bisulfite-modified nucleotides may interfere with transcription or RNA metabolism when used in vivo. Copyright © 2017 Elsevier B.V. All rights reserved.
USDA-ARS?s Scientific Manuscript database
DNA methylation is an epigenetic mechanism central to the development and maintenance of complex mammalian tissues, but our understanding of its role in intestinal development is limited. We used whole genome bisulfite sequencing, and found that differentiation of mouse colonic intestinal stem cell...
CpG methylation differences between neurons and glia are highly conserved from mouse to human
USDA-ARS?s Scientific Manuscript database
Understanding epigenetic differences that distinguish neurons and glia is of fundamental importance to the nascent field of neuroepigenetics. A recent study used genome-wide bisulfite sequencing to survey differences in DNA methylation between these two cell types, in both humans and mice. That stud...
cuRRBS: simple and robust evaluation of enzyme combinations for reduced representation approaches.
Martin-Herranz, Daniel E; Ribeiro, António J M; Krueger, Felix; Thornton, Janet M; Reik, Wolf; Stubbs, Thomas M
2017-11-16
DNA methylation is an important epigenetic modification in many species that is critical for development, and implicated in ageing and many complex diseases, such as cancer. Many cost-effective genome-wide analyses of DNA modifications rely on restriction enzymes capable of digesting genomic DNA at defined sequence motifs. There are hundreds of restriction enzyme families but few are used to date, because no tool is available for the systematic evaluation of restriction enzyme combinations that can enrich for certain sites of interest in a genome. Herein, we present customised Reduced Representation Bisulfite Sequencing (cuRRBS), a novel and easy-to-use computational method that solves this problem. By computing the optimal enzymatic digestions and size selection steps required, cuRRBS generalises the traditional MspI-based Reduced Representation Bisulfite Sequencing (RRBS) protocol to all restriction enzyme combinations. In addition, cuRRBS estimates the fold-reduction in sequencing costs and provides a robustness value for the personalised RRBS protocol, allowing users to tailor the protocol to their experimental needs. Moreover, we show in silico that cuRRBS-defined restriction enzymes consistently out-perform MspI digestion in many biological systems, considering both CpG and CHG contexts. Finally, we have validated the accuracy of cuRRBS predictions for single and double enzyme digestions using two independent experimental datasets. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Differential DNA Methylation Analysis without a Reference Genome.
Klughammer, Johanna; Datlinger, Paul; Printz, Dieter; Sheffield, Nathan C; Farlik, Matthias; Hadler, Johanna; Fritsch, Gerhard; Bock, Christoph
2015-12-22
Genome-wide DNA methylation mapping uncovers epigenetic changes associated with animal development, environmental adaptation, and species evolution. To address the lack of high-throughput methods for DNA methylation analysis in non-model organisms, we developed an integrated approach for studying DNA methylation differences independent of a reference genome. Experimentally, our method relies on an optimized 96-well protocol for reduced representation bisulfite sequencing (RRBS), which we have validated in nine species (human, mouse, rat, cow, dog, chicken, carp, sea bass, and zebrafish). Bioinformatically, we developed the RefFreeDMA software to deduce ad hoc genomes directly from RRBS reads and to pinpoint differentially methylated regions between samples or groups of individuals (http://RefFreeDMA.computational-epigenetics.org). The identified regions are interpreted using motif enrichment analysis and/or cross-mapping to annotated genomes. We validated our method by reference-free analysis of cell-type-specific DNA methylation in the blood of human, cow, and carp. In summary, we present a cost-effective method for epigenome analysis in ecology and evolution, which enables epigenome-wide association studies in natural populations and species without a reference genome. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Using whole-genome bisulfite sequencing (WGBS), we profiled the DNA methylome of cattle sperms through comparison with three bovine somatic tissues (mammary grand, brain and blood). Large differences between them were observed in the methylation patterns of global CpGs, pericentromeric satellites, p...
DNA demethylation activates genes in seed maternal integument development in rice (Oryza sativa L.).
Wang, Yifeng; Lin, Haiyan; Tong, Xiaohong; Hou, Yuxuan; Chang, Yuxiao; Zhang, Jian
2017-11-01
DNA methylation is an important epigenetic modification that regulates various plant developmental processes. Rice seed integument determines the seed size. However, the role of DNA methylation in its development remains largely unknown. Here, we report the first dynamic DNA methylomic profiling of rice maternal integument before and after pollination by using a whole-genome bisulfite deep sequencing approach. Analysis of DNA methylation patterns identified 4238 differentially methylated regions underpin 4112 differentially methylated genes, including GW2, DEP1, RGB1 and numerous other regulators participated in maternal integument development. Bisulfite sanger sequencing and qRT-PCR of six differentially methylated genes revealed extensive occurrence of DNA hypomethylation triggered by double fertilization at IAP compared with IBP, suggesting that DNA demethylation might be a key mechanism to activate numerous maternal controlling genes. These results presented here not only greatly expanded the rice methylome dataset, but also shed novel insight into the regulatory roles of DNA methylation in rice seed maternal integument development. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
21 CFR 182.3616 - Potassium bisulfite.
Code of Federal Regulations, 2012 CFR
2012-04-01
... 21 Food and Drugs 3 2012-04-01 2012-04-01 false Potassium bisulfite. 182.3616 Section 182.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) FOOD....3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations...
21 CFR 182.3616 - Potassium bisulfite.
Code of Federal Regulations, 2013 CFR
2013-04-01
... 21 Food and Drugs 3 2013-04-01 2013-04-01 false Potassium bisulfite. 182.3616 Section 182.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) FOOD....3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations...
21 CFR 582.3616 - Potassium bisulfite.
Code of Federal Regulations, 2013 CFR
2013-04-01
... 21 Food and Drugs 6 2013-04-01 2013-04-01 false Potassium bisulfite. 582.3616 Section 582.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations...
21 CFR 182.3616 - Potassium bisulfite.
Code of Federal Regulations, 2011 CFR
2011-04-01
... 21 Food and Drugs 3 2011-04-01 2011-04-01 false Potassium bisulfite. 182.3616 Section 182.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) FOOD....3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations...
21 CFR 582.3616 - Potassium bisulfite.
Code of Federal Regulations, 2011 CFR
2011-04-01
... 21 Food and Drugs 6 2011-04-01 2011-04-01 false Potassium bisulfite. 582.3616 Section 582.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations...
21 CFR 582.3616 - Potassium bisulfite.
Code of Federal Regulations, 2012 CFR
2012-04-01
... 21 Food and Drugs 6 2012-04-01 2012-04-01 false Potassium bisulfite. 582.3616 Section 582.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations...
21 CFR 182.3616 - Potassium bisulfite.
Code of Federal Regulations, 2014 CFR
2014-04-01
... 21 Food and Drugs 3 2014-04-01 2014-04-01 false Potassium bisulfite. 182.3616 Section 182.3616...) SUBSTANCES GENERALLY RECOGNIZED AS SAFE Chemical Preservatives § 182.3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or explanation. This substance is...
21 CFR 582.3616 - Potassium bisulfite.
Code of Federal Regulations, 2014 CFR
2014-04-01
... 21 Food and Drugs 6 2014-04-01 2014-04-01 false Potassium bisulfite. 582.3616 Section 582.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations...
21 CFR 582.3616 - Potassium bisulfite.
Code of Federal Regulations, 2010 CFR
2010-04-01
... 21 Food and Drugs 6 2010-04-01 2010-04-01 false Potassium bisulfite. 582.3616 Section 582.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3616 Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations...
Genetic Perturbation of the Maize Methylome[W
Li, Qing; Hermanson, Peter J.; Zaunbrecher, Virginia M.; Song, Jawon; Wendt, Jennifer; Rosenbaum, Heidi; Madzima, Thelma F.; Sloan, Amy E.; Huang, Ji; Burgess, Daniel L.; Richmond, Todd A.; McGinnis, Karen M.; Meeley, Robert B.; Danilevskaya, Olga N.; Vaughn, Matthew W.; Kaeppler, Shawn M.; Jeddeloh, Jeffrey A.
2014-01-01
DNA methylation can play important roles in the regulation of transposable elements and genes. A collection of mutant alleles for 11 maize (Zea mays) genes predicted to play roles in controlling DNA methylation were isolated through forward- or reverse-genetic approaches. Low-coverage whole-genome bisulfite sequencing and high-coverage sequence-capture bisulfite sequencing were applied to mutant lines to determine context- and locus-specific effects of these mutations on DNA methylation profiles. Plants containing mutant alleles for components of the RNA-directed DNA methylation pathway exhibit loss of CHH methylation at many loci as well as CG and CHG methylation at a small number of loci. Plants containing loss-of-function alleles for chromomethylase (CMT) genes exhibit strong genome-wide reductions in CHG methylation and some locus-specific loss of CHH methylation. In an attempt to identify stocks with stronger reductions in DNA methylation levels than provided by single gene mutations, we performed crosses to create double mutants for the maize CMT3 orthologs, Zmet2 and Zmet5, and for the maize DDM1 orthologs, Chr101 and Chr106. While loss-of-function alleles are viable as single gene mutants, the double mutants were not recovered, suggesting that severe perturbations of the maize methylome may have stronger deleterious phenotypic effects than in Arabidopsis thaliana. PMID:25527708
Identification of Differentially Methylated Sites with Weak Methylation Effects
Tran, Hong; Zhu, Hongxiao; Wu, Xiaowei; Kim, Gunjune; Clarke, Christopher R.; Larose, Hailey; Haak, David C.; Westwood, James H.; Zhang, Liqing
2018-01-01
Deoxyribonucleic acid (DNA) methylation is an epigenetic alteration crucial for regulating stress responses. Identifying large-scale DNA methylation at single nucleotide resolution is made possible by whole genome bisulfite sequencing. An essential task following the generation of bisulfite sequencing data is to detect differentially methylated cytosines (DMCs) among treatments. Most statistical methods for DMC detection do not consider the dependency of methylation patterns across the genome, thus possibly inflating type I error. Furthermore, small sample sizes and weak methylation effects among different phenotype categories make it difficult for these statistical methods to accurately detect DMCs. To address these issues, the wavelet-based functional mixed model (WFMM) was introduced to detect DMCs. To further examine the performance of WFMM in detecting weak differential methylation events, we used both simulated and empirical data and compare WFMM performance to a popular DMC detection tool methylKit. Analyses of simulated data that replicated the effects of the herbicide glyphosate on DNA methylation in Arabidopsis thaliana show that WFMM results in higher sensitivity and specificity in detecting DMCs compared to methylKit, especially when the methylation differences among phenotype groups are small. Moreover, the performance of WFMM is robust with respect to small sample sizes, making it particularly attractive considering the current high costs of bisulfite sequencing. Analysis of empirical Arabidopsis thaliana data under varying glyphosate dosages, and the analysis of monozygotic (MZ) twins who have different pain sensitivities—both datasets have weak methylation effects of <1%—show that WFMM can identify more relevant DMCs related to the phenotype of interest than methylKit. Differentially methylated regions (DMRs) are genomic regions with different DNA methylation status across biological samples. DMRs and DMCs are essentially the same concepts, with the only difference being how methylation information across the genome is summarized. If methylation levels are determined by grouping neighboring cytosine sites, then they are DMRs; if methylation levels are calculated based on single cytosines, they are DMCs. PMID:29419727
21 CFR 182.3616 - Potassium bisulfite.
Code of Federal Regulations, 2010 CFR
2010-04-01
... 21 Food and Drugs 3 2010-04-01 2009-04-01 true Potassium bisulfite. 182.3616 Section 182.3616 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) FOOD FOR... Potassium bisulfite. (a) Product. Potassium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 182.3739 - Sodium bisulfite.
Code of Federal Regulations, 2011 CFR
2011-04-01
... 21 Food and Drugs 3 2011-04-01 2011-04-01 false Sodium bisulfite. 182.3739 Section 182.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) FOOD FOR... Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 182.3739 - Sodium bisulfite.
Code of Federal Regulations, 2012 CFR
2012-04-01
... 21 Food and Drugs 3 2012-04-01 2012-04-01 false Sodium bisulfite. 182.3739 Section 182.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) FOOD FOR... Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 182.3739 - Sodium bisulfite.
Code of Federal Regulations, 2010 CFR
2010-04-01
... 21 Food and Drugs 3 2010-04-01 2009-04-01 true Sodium bisulfite. 182.3739 Section 182.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) FOOD FOR... Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 582.3739 - Sodium bisulfite.
Code of Federal Regulations, 2012 CFR
2012-04-01
... 21 Food and Drugs 6 2012-04-01 2012-04-01 false Sodium bisulfite. 582.3739 Section 582.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3739 Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 182.3739 - Sodium bisulfite.
Code of Federal Regulations, 2013 CFR
2013-04-01
... 21 Food and Drugs 3 2013-04-01 2013-04-01 false Sodium bisulfite. 182.3739 Section 182.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) FOOD FOR... Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 582.3739 - Sodium bisulfite.
Code of Federal Regulations, 2014 CFR
2014-04-01
... 21 Food and Drugs 6 2014-04-01 2014-04-01 false Sodium bisulfite. 582.3739 Section 582.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3739 Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 582.3739 - Sodium bisulfite.
Code of Federal Regulations, 2011 CFR
2011-04-01
... 21 Food and Drugs 6 2011-04-01 2011-04-01 false Sodium bisulfite. 582.3739 Section 582.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3739 Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 582.3739 - Sodium bisulfite.
Code of Federal Regulations, 2010 CFR
2010-04-01
... 21 Food and Drugs 6 2010-04-01 2010-04-01 false Sodium bisulfite. 582.3739 Section 582.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3739 Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
21 CFR 582.3739 - Sodium bisulfite.
Code of Federal Regulations, 2013 CFR
2013-04-01
... 21 Food and Drugs 6 2013-04-01 2013-04-01 false Sodium bisulfite. 582.3739 Section 582.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) ANIMAL....3739 Sodium bisulfite. (a) Product. Sodium bisulfite. (b) [Reserved] (c) Limitations, restrictions, or...
Wan, Ma; Bennett, Brian D; Pittman, Gary S; Campbell, Michelle R; Reynolds, Lindsay M; Porter, Devin K; Crowl, Christopher L; Wang, Xuting; Su, Dan; Englert, Neal A; Thompson, Isabel J; Liu, Yongmei; Bell, Douglas A
2018-04-27
Cigarette smoke is a causal factor in cancers and cardiovascular disease. Smoking-associated differentially methylated regions (SM-DMRs) have been observed in disease studies, but the causal link between altered DNA methylation and transcriptional change is obscure. Our objectives were to finely resolve SM-DMRs and to interrogate the mechanistic link between SM-DMRs and altered transcription of enhancer noncoding RNA (eRNA) and mRNA in human circulating monocytes. We integrated SM-DMRs identified by reduced representation bisulfite sequencing (RRBS) of circulating CD14+ monocyte DNA collected from two independent human studies [ n =38 from Clinical Research Unit (CRU) and n =55 from the Multi-Ethnic Study of Atherosclerosis (MESA), about half of whom were active smokers] with gene expression for protein-coding genes and noncoding RNAs measured by RT-PCR or RNA sequencing. Candidate SM-DMRs were compared with RRBS of purified CD4+ T cells, CD8+ T cells, CD15+ granulocytes, CD19+ B cells, and CD56+ NK cells ( n =19 females, CRU). DMRs were validated using pyrosequencing or bisulfite amplicon sequencing in up to 85 CRU volunteers, who also provided saliva DNA. RRBS identified monocyte SM-DMRs frequently located in putative gene regulatory regions. The most significant monocyte DMR occurred at a poised enhancer in the aryl-hydrocarbon receptor repressor gene ( AHRR ) and it was also detected in both granulocytes and saliva DNA. To our knowledge, we identify for the first time that SM-DMRs in or near AHRR , C5orf55-EXOC-AS , and SASH1 were associated with increased noncoding eRNA as well as mRNA in monocytes. Functionally, the AHRR SM-DMR appeared to up-regulate AHRR mRNA through activating the AHRR enhancer, as suggested by increased eRNA in the monocytes, but not granulocytes, from smokers compared with nonsmokers. Our findings suggest that AHRR SM-DMR up-regulates AHRR mRNA in a monocyte-specific manner by activating the AHRR enhancer. Cell type-specific activation of enhancers at SM-DMRs may represent a mechanism driving smoking-related disease. https://doi.org/10.1289/EHP2395.
Pelch, Katherine E; Tokar, Erik J; Merrick, B Alex; Waalkes, Michael P
2015-08-01
Previous work shows altered methylation patterns in inorganic arsenic (iAs)- or cadmium (Cd)-transformed epithelial cells. Here, the methylation status near the transcriptional start site was assessed in the normal human prostate epithelial cell line (RWPE-1) that was malignantly transformed by 10μM Cd for 11weeks (CTPE) or 5μM iAs for 29weeks (CAsE-PE), at which time cells showed multiple markers of acquired cancer phenotype. Next generation sequencing of the transcriptome of CAsE-PE cells identified multiple dysregulated genes. Of the most highly dysregulated genes, five genes that can be relevant to the carcinogenic process (S100P, HYAL1, NTM, NES, ALDH1A1) were chosen for an in-depth analysis of the DNA methylation profile. DNA was isolated, bisulfite converted, and combined bisulfite restriction analysis was used to identify differentially methylated CpG sites, which was confirmed with bisulfite sequencing. Four of the five genes showed differential methylation in transformants relative to control cells that was inversely related to altered gene expression. Increased expression of HYAL1 (>25-fold) and S100P (>40-fold) in transformants was correlated with hypomethylation near the transcriptional start site. Decreased expression of NES (>15-fold) and NTM (>1000-fold) in transformants was correlated with hypermethylation near the transcriptional start site. ALDH1A1 expression was differentially expressed in transformed cells but was not differentially methylated relative to control. In conclusion, altered gene expression observed in Cd and iAs transformed cells may result from altered DNA methylation status. Published by Elsevier Inc.
Yu, Miao; Ji, Lexiang; Neumann, Drexel A.; ...
2015-07-15
Restriction-modification (R-M) systems pose a major barrier to DNA transformation and genetic engineering of bacterial species. Systematic identification of DNA methylation in R-M systems, including N 6-methyladenine (6mA), 5-methylcytosine (5mC) and N 4-methylcytosine (4mC), will enable strategies to make these species genetically tractable. Although single-molecule, real time (SMRT) sequencing technology is capable of detecting 4mC directly for any bacterial species regardless of whether an assembled genome exists or not, it is not as scalable to profiling hundreds to thousands of samples compared with the commonly used next-generation sequencing technologies. Here, we present 4mC-Tet-assisted bisulfite-sequencing (4mC-TAB-seq), a next-generation sequencing method thatmore » rapidly and cost efficiently reveals the genome-wide locations of 4mC for bacterial species with an available assembled reference genome. In 4mC-TAB-seq, both cytosines and 5mCs are read out as thymines, whereas only 4mCs are read out as cytosines, revealing their specific positions throughout the genome. We applied 4mC-TAB-seq to study the methylation of a member of the hyperthermophilc genus, Caldicellulosiruptor, in which 4mC-related restriction is a major barrier to DNA transformation from other species. Lastly, in combination with MethylC-seq, both 4mC- and 5mC-containing motifs are identified which can assist in rapid and efficient genetic engineering of these bacteria in the future.« less
Epigenetic Transgenerational Actions of Vinclozolin on Promoter Regions of the Sperm Epigenome
Guerrero-Bosagna, Carlos; Settles, Matthew; Lucker, Ben; Skinner, Michael K.
2010-01-01
Previous observations have demonstrated that embryonic exposure to the endocrine disruptor vinclozolin during gonadal sex determination promotes transgenerational adult onset disease such as male infertility, kidney disease, prostate disease, immune abnormalities and tumor development. The current study investigates genome-wide promoter DNA methylation alterations in the sperm of F3 generation rats whose F0 generation mother was exposed to vinclozolin. A methylated DNA immunoprecipitation with methyl-cytosine antibody followed by a promoter tilling microarray (MeDIP-Chip) procedure was used to identify 52 different regions with statistically significant altered methylation in the sperm promoter epigenome. Mass spectrometry bisulfite analysis was used to map the CpG DNA methylation and 16 differential DNA methylation regions were confirmed, while the remainder could not be analyzed due to bisulfite technical limitations. Analysis of these validated regions identified a consensus DNA sequence (motif) that associated with 75% of the promoters. Interestingly, only 16.8% of a random set of 125 promoters contained this motif. One candidate promoter (Fam111a) was found to be due to a copy number variation (CNV) and not a methylation change, suggesting initial alterations in the germline epigenome may promote genetic abnormalities such as induced CNV in later generations. This study identifies differential DNA methylation sites in promoter regions three generations after the initial exposure and identifies common genome features present in these regions. In addition to primary epimutations, a potential indirect genetic abnormality was identified, and both are postulated to be involved in the epigenetic transgenerational inheritance observed. This study confirms that an environmental agent has the ability to induce epigenetic transgenerational changes in the sperm epigenome. PMID:20927350
Epigenetic transgenerational actions of vinclozolin on promoter regions of the sperm epigenome.
Guerrero-Bosagna, Carlos; Settles, Matthew; Lucker, Ben; Skinner, Michael K
2010-09-30
Previous observations have demonstrated that embryonic exposure to the endocrine disruptor vinclozolin during gonadal sex determination promotes transgenerational adult onset disease such as male infertility, kidney disease, prostate disease, immune abnormalities and tumor development. The current study investigates genome-wide promoter DNA methylation alterations in the sperm of F3 generation rats whose F0 generation mother was exposed to vinclozolin. A methylated DNA immunoprecipitation with methyl-cytosine antibody followed by a promoter tilling microarray (MeDIP-Chip) procedure was used to identify 52 different regions with statistically significant altered methylation in the sperm promoter epigenome. Mass spectrometry bisulfite analysis was used to map the CpG DNA methylation and 16 differential DNA methylation regions were confirmed, while the remainder could not be analyzed due to bisulfite technical limitations. Analysis of these validated regions identified a consensus DNA sequence (motif) that associated with 75% of the promoters. Interestingly, only 16.8% of a random set of 125 promoters contained this motif. One candidate promoter (Fam111a) was found to be due to a copy number variation (CNV) and not a methylation change, suggesting initial alterations in the germline epigenome may promote genetic abnormalities such as induced CNV in later generations. This study identifies differential DNA methylation sites in promoter regions three generations after the initial exposure and identifies common genome features present in these regions. In addition to primary epimutations, a potential indirect genetic abnormality was identified, and both are postulated to be involved in the epigenetic transgenerational inheritance observed. This study confirms that an environmental agent has the ability to induce epigenetic transgenerational changes in the sperm epigenome.
COBRA-Seq: Sensitive and Quantitative Methylome Profiling
Varinli, Hilal; Statham, Aaron L.; Clark, Susan J.; Molloy, Peter L.; Ross, Jason P.
2015-01-01
Combined Bisulfite Restriction Analysis (COBRA) quantifies DNA methylation at a specific locus. It does so via digestion of PCR amplicons produced from bisulfite-treated DNA, using a restriction enzyme that contains a cytosine within its recognition sequence, such as TaqI. Here, we introduce COBRA-seq, a genome wide reduced methylome method that requires minimal DNA input (0.1–1.0 μg) and can either use PCR or linear amplification to amplify the sequencing library. Variants of COBRA-seq can be used to explore CpG-depleted as well as CpG-rich regions in vertebrate DNA. The choice of enzyme influences enrichment for specific genomic features, such as CpG-rich promoters and CpG islands, or enrichment for less CpG dense regions such as enhancers. COBRA-seq coupled with linear amplification has the additional advantage of reduced PCR bias by producing full length fragments at high abundance. Unlike other reduced representative methylome methods, COBRA-seq has great flexibility in the choice of enzyme and can be multiplexed and tuned, to reduce sequencing costs and to interrogate different numbers of sites. Moreover, COBRA-seq is applicable to non-model organisms without the reference genome and compatible with the investigation of non-CpG methylation by using restriction enzymes containing CpA, CpT, and CpC in their recognition site. PMID:26512698
Sina, Abu Ali Ibn; Foster, Matthew Thomas; Korbie, Darren; Carrascosa, Laura G; Shiddiky, Muhammad J A; Gao, Jing; Dey, Shuvashis; Trau, Matt
2017-10-07
We report a new multiplexed strategy for the electrochemical detection of regional DNA methylation across multiple regions. Using the sequence dependent affinity of bisulfite treated DNA towards gold surfaces, the method integrates the high sensitivity of a micro-fabricated multiplex device comprising a microarray of gold electrodes, with the powerful multiplexing capability of multiplex-PCR. The synergy of this combination enables the monitoring of the methylation changes across several genomic regions simultaneously from as low as 500 pg μl -1 of DNA with no sequencing requirement.
2014-01-01
Affinity capture of DNA methylation combined with high-throughput sequencing strikes a good balance between the high cost of whole genome bisulfite sequencing and the low coverage of methylation arrays. We present BayMeth, an empirical Bayes approach that uses a fully methylated control sample to transform observed read counts into regional methylation levels. In our model, inefficient capture can readily be distinguished from low methylation levels. BayMeth improves on existing methods, allows explicit modeling of copy number variation, and offers computationally efficient analytical mean and variance estimators. BayMeth is available in the Repitools Bioconductor package. PMID:24517713
Impacts of Chromatin States and Long-Range Genomic Segments on Aging and DNA Methylation
Sun, Dan; Yi, Soojin V.
2015-01-01
Understanding the fundamental dynamics of epigenome variation during normal aging is critical for elucidating key epigenetic alterations that affect development, cell differentiation and diseases. Advances in the field of aging and DNA methylation strongly support the aging epigenetic drift model. Although this model aligns with previous studies, the role of other epigenetic marks, such as histone modification, as well as the impact of sampling specific CpGs, must be evaluated. Ultimately, it is crucial to investigate how all CpGs in the human genome change their methylation with aging in their specific genomic and epigenomic contexts. Here, we analyze whole genome bisulfite sequencing DNA methylation maps of brain frontal cortex from individuals of diverse ages. Comparisons with blood data reveal tissue-specific patterns of epigenetic drift. By integrating chromatin state information, divergent degrees and directions of aging-associated methylation in different genomic regions are revealed. Whole genome bisulfite sequencing data also open a new door to investigate whether adjacent CpG sites exhibit coordinated DNA methylation changes with aging. We identified significant ‘aging-segments’, which are clusters of nearby CpGs that respond to aging by similar DNA methylation changes. These segments not only capture previously identified aging-CpGs but also include specific functional categories of genes with implications on epigenetic regulation of aging. For example, genes associated with development are highly enriched in positive aging segments, which are gradually hyper-methylated with aging. On the other hand, regions that are gradually hypo-methylated with aging (‘negative aging segments’) in the brain harbor genes involved in metabolism and protein ubiquitination. Given the importance of protein ubiquitination in proteome homeostasis of aging brains and neurodegenerative disorders, our finding suggests the significance of epigenetic regulation of this posttranslational modification pathway in the aging brain. Utilizing aging segments rather than individual CpGs will provide more comprehensive genomic and epigenomic contexts to understand the intricate associations between genomic neighborhoods and developmental and aging processes. These results complement the aging epigenetic drift model and provide new insights. PMID:26091484
Integrative Cardiac Health Project, Windber Research Institute
2014-07-01
laparoscopically placed adjustable gastric banding (LAGB) baseline (5) and one year (5), control baseline (5) and one year (5). OD260/280 ratios...coverage and detection of 3-4 million CpG sites . All samples had a bisulfite conversion rate of >98.25%; number of CpG (methylated) sites per sample...methylation) and hyper-methylated (increasing methylation) sites in the three groups were identified. For LAGB patients, a heat map based on
A colorimetric and fluorogenic probe for bisulfite using benzopyrylium as the recognition unit.
Zhang, Yun; Zhang, Xiangwen; Yang, Xiao-Feng; Zhang, Juan
2017-11-01
A coumarin-benzopyrylium (CB) platform has been developed for the colorimetric and fluorogenic detection of bisulfite. The proposed probe utilizes coumarin as the fluorophore and positively charged benzopyrylium as the reaction site. The method employs the nucleophilic addition of bisulfite to the benzopyrylium moiety of CB to inactivate the electron-deficient oxonium ion. The driving force for photo-induced electron transfer is considerably diminished, thereby promoting the emission intensity of the coumarin fluorophore. The fluorescence intensity at 510 nm is linear with bisulfite concentration over a range of 0.2-7.5 μM with a detection limit of 42 nM (3δ). CB shows a rapid response (within 30 s) and high selectivity and sensitivity for bisulfite. Preliminary studies show that CB has great potential for bisulfite detection in real samples and in living cells. Copyright © 2017 John Wiley & Sons, Ltd.
Phylogenetic and environmental diversity of DsrAB-type dissimilatory (bi)sulfite reductases
Müller, Albert Leopold; Kjeldsen, Kasper Urup; Rattei, Thomas; Pester, Michael; Loy, Alexander
2015-01-01
The energy metabolism of essential microbial guilds in the biogeochemical sulfur cycle is based on a DsrAB-type dissimilatory (bi)sulfite reductase that either catalyzes the reduction of sulfite to sulfide during anaerobic respiration of sulfate, sulfite and organosulfonates, or acts in reverse during sulfur oxidation. Common use of dsrAB as a functional marker showed that dsrAB richness in many environments is dominated by novel sequence variants and collectively represents an extensive, largely uncharted sequence assemblage. Here, we established a comprehensive, manually curated dsrAB/DsrAB database and used it to categorize the known dsrAB diversity, reanalyze the evolutionary history of dsrAB and evaluate the coverage of published dsrAB-targeted primers. Based on a DsrAB consensus phylogeny, we introduce an operational classification system for environmental dsrAB sequences that integrates established taxonomic groups with operational taxonomic units (OTUs) at multiple phylogenetic levels, ranging from DsrAB enzyme families that reflect reductive or oxidative DsrAB types of bacterial or archaeal origin, superclusters, uncultured family-level lineages to species-level OTUs. Environmental dsrAB sequences constituted at least 13 stable family-level lineages without any cultivated representatives, suggesting that major taxa of sulfite/sulfate-reducing microorganisms have not yet been identified. Three of these uncultured lineages occur mainly in marine environments, while specific habitat preferences are not evident for members of the other 10 uncultured lineages. In summary, our publically available dsrAB/DsrAB database, the phylogenetic framework, the multilevel classification system and a set of recommended primers provide a necessary foundation for large-scale dsrAB ecology studies with next-generation sequencing methods. PMID:25343514
An optimized rapid bisulfite conversion method with high recovery of cell-free DNA.
Yi, Shaohua; Long, Fei; Cheng, Juanbo; Huang, Daixin
2017-12-19
Methylation analysis of cell-free DNA is a encouraging tool for tumor diagnosis, monitoring and prognosis. Sensitivity of methylation analysis is a very important matter due to the tiny amounts of cell-free DNA available in plasma. Most current methods of DNA methylation analysis are based on the difference of bisulfite-mediated deamination of cytosine between cytosine and 5-methylcytosine. However, the recovery of bisulfite-converted DNA based on current methods is very poor for the methylation analysis of cell-free DNA. We optimized a rapid method for the crucial steps of bisulfite conversion with high recovery of cell-free DNA. A rapid deamination step and alkaline desulfonation was combined with the purification of DNA on a silica column. The conversion efficiency and recovery of bisulfite-treated DNA was investigated by the droplet digital PCR. The optimization of the reaction results in complete cytosine conversion in 30 min at 70 °C and about 65% of recovery of bisulfite-treated cell-free DNA, which is higher than current methods. The method allows high recovery from low levels of bisulfite-treated cell-free DNA, enhancing the analysis sensitivity of methylation detection from cell-free DNA.
21 CFR 573.620 - Menadione dimethylpyrimidinol bisulfite.
Code of Federal Regulations, 2013 CFR
2013-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.620 Menadione dimethylpyrimidinol bisulfite. The food additive... 21 Food and Drugs 6 2013-04-01 2013-04-01 false Menadione dimethylpyrimidinol bisulfite. 573.620...
21 CFR 573.620 - Menadione dimethylpyrimidinol bisulfite.
Code of Federal Regulations, 2014 CFR
2014-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.620 Menadione dimethylpyrimidinol bisulfite. The food additive... 21 Food and Drugs 6 2014-04-01 2014-04-01 false Menadione dimethylpyrimidinol bisulfite. 573.620...
21 CFR 573.620 - Menadione dimethylpyrimidinol bisulfite.
Code of Federal Regulations, 2011 CFR
2011-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.620 Menadione dimethylpyrimidinol bisulfite. The food additive... 21 Food and Drugs 6 2011-04-01 2011-04-01 false Menadione dimethylpyrimidinol bisulfite. 573.620...
21 CFR 573.620 - Menadione dimethylpyrimidinol bisulfite.
Code of Federal Regulations, 2012 CFR
2012-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.620 Menadione dimethylpyrimidinol bisulfite. The food additive... 21 Food and Drugs 6 2012-04-01 2012-04-01 false Menadione dimethylpyrimidinol bisulfite. 573.620...
21 CFR 573.620 - Menadione dimethylpyrimidinol bisulfite.
Code of Federal Regulations, 2010 CFR
2010-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.620 Menadione dimethylpyrimidinol bisulfite. The food additive... 21 Food and Drugs 6 2010-04-01 2010-04-01 false Menadione dimethylpyrimidinol bisulfite. 573.620...
Parker, John C.
1969-01-01
It is known that bisulfite ions can selectively deplete red blood cells of 2,3-diphosphoglycerate (2,3-DPG). Studies of the effects of bisulfite on sodium-potassium permeability and metabolism were undertaken to clarify the physiologic role of the abundant quantities of 2,3-DPG in human erythrocytes. Treatment of cells with bisulfite results in a reversible increase in the passive permeability to Na and K ions. Metabolism of glucose to lactate is increased, with a rise in the intracellular ratio of fructose diphosphate to hexose monophosphate. Cell 2,3-DPG is quantitatively converted to pyruvate and inorganic phosphate. The permeability effects of bisulfite are countered by ethacrynic acid and by such oxidizing agents as pyruvate and methylene blue. Taken together, the results suggest that the effects on Na-K flux of bisulfite are related more to the reducing potential of this anion than to its capacity to deplete cells of 2,3-DPG. PMID:5765015
Global DNA methylation analysis using methyl-sensitive amplification polymorphism (MSAP).
Yaish, Mahmoud W; Peng, Mingsheng; Rothstein, Steven J
2014-01-01
DNA methylation is a crucial epigenetic process which helps control gene transcription activity in eukaryotes. Information regarding the methylation status of a regulatory sequence of a particular gene provides important knowledge of this transcriptional control. DNA methylation can be detected using several methods, including sodium bisulfite sequencing and restriction digestion using methylation-sensitive endonucleases. Methyl-Sensitive Amplification Polymorphism (MSAP) is a technique used to study the global DNA methylation status of an organism and hence to distinguish between two individuals based on the DNA methylation status determined by the differential digestion pattern. Therefore, this technique is a useful method for DNA methylation mapping and positional cloning of differentially methylated genes. In this technique, genomic DNA is first digested with a methylation-sensitive restriction enzyme such as HpaII, and then the DNA fragments are ligated to adaptors in order to facilitate their amplification. Digestion using a methylation-insensitive isoschizomer of HpaII, MspI is used in a parallel digestion reaction as a loading control in the experiment. Subsequently, these fragments are selectively amplified by fluorescently labeled primers. PCR products from different individuals are compared, and once an interesting polymorphic locus is recognized, the desired DNA fragment can be isolated from a denaturing polyacrylamide gel, sequenced and identified based on DNA sequence similarity to other sequences available in the database. We will use analysis of met1, ddm1, and atmbd9 mutants and wild-type plants treated with a cytidine analogue, 5-azaC, or zebularine to demonstrate how to assess the genetic modulation of DNA methylation in Arabidopsis. It should be noted that despite the fact that MSAP is a reliable technique used to fish for polymorphic methylated loci, its power is limited to the restriction recognition sites of the enzymes used in the genomic DNA digestion.
21 CFR 182.3739 - Sodium bisulfite.
Code of Federal Regulations, 2014 CFR
2014-04-01
... 21 Food and Drugs 3 2014-04-01 2014-04-01 false Sodium bisulfite. 182.3739 Section 182.3739 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF HEALTH AND HUMAN SERVICES (CONTINUED) SUBSTANCES GENERALLY RECOGNIZED AS SAFE Chemical Preservatives § 182.3739 Sodium bisulfite. (a) Product. Sodium...
Genome-wide DNA methylation analysis in jejunum of Sus scrofa with intrauterine growth restriction.
Hu, Yue; Hu, Liang; Gong, Desheng; Lu, Hanlin; Xuan, Yue; Wang, Ru; Wu, De; Chen, Daiwen; Zhang, Keying; Gao, Fei; Che, Lianqiang
2018-02-01
Intrauterine growth restriction (IUGR) may elicit a series of postnatal body developmental and metabolic diseases due to their impaired growth and development in the mammalian embryo/fetus during pregnancy. In the present study, we hypothesized that IUGR may lead to abnormally regulated DNA methylation in the intestine, causing intestinal dysfunctions. We applied reduced representation bisulfite sequencing (RRBS) technology to study the jejunum tissues from four newborn IUGR piglets and their normal body weight (NBW) littermates. The results revealed extensively regional DNA methylation changes between IUGR/NBW pairs from different gilts, affecting dozens of genes. Hiseq-based bisulfite sequencing PCR (Hiseq-BSP) was used for validations of 19 genes with epigenetic abnormality, confirming three genes (AIFM1, MTMR1, and TWIST2) in extra samples. Furthermore, integrated analysis of these 19 genes with proteome data indicated that there were three main genes (BCAP31, IRAK1, and AIFM1) interacting with important immunity- or metabolism-related proteins, which could explain the potential intestinal dysfunctions of IUGR piglets. We conclude that IUGR can lead to disparate DNA methylation in the intestine and these changes may affect several important biological processes such as cell apoptosis, cell differentiation, and immunity, which provides more clues linking IUGR and its long-term complications.
21 CFR 573.625 - Menadione nicotinamide bisulfite.
Code of Federal Regulations, 2012 CFR
2012-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.625 Menadione nicotinamide bisulfite. The food additive may be safely... 21 Food and Drugs 6 2012-04-01 2012-04-01 false Menadione nicotinamide bisulfite. 573.625 Section...
21 CFR 573.625 - Menadione nicotinamide bisulfite.
Code of Federal Regulations, 2010 CFR
2010-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.625 Menadione nicotinamide bisulfite. The food additive may be safely... 21 Food and Drugs 6 2010-04-01 2010-04-01 false Menadione nicotinamide bisulfite. 573.625 Section...
21 CFR 573.625 - Menadione nicotinamide bisulfite.
Code of Federal Regulations, 2014 CFR
2014-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.625 Menadione nicotinamide bisulfite. The food additive may be safely... 21 Food and Drugs 6 2014-04-01 2014-04-01 false Menadione nicotinamide bisulfite. 573.625 Section...
21 CFR 573.625 - Menadione nicotinamide bisulfite.
Code of Federal Regulations, 2011 CFR
2011-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.625 Menadione nicotinamide bisulfite. The food additive may be safely... 21 Food and Drugs 6 2011-04-01 2011-04-01 false Menadione nicotinamide bisulfite. 573.625 Section...
21 CFR 573.625 - Menadione nicotinamide bisulfite.
Code of Federal Regulations, 2013 CFR
2013-04-01
... (CONTINUED) ANIMAL DRUGS, FEEDS, AND RELATED PRODUCTS FOOD ADDITIVES PERMITTED IN FEED AND DRINKING WATER OF ANIMALS Food Additive Listing § 573.625 Menadione nicotinamide bisulfite. The food additive may be safely... 21 Food and Drugs 6 2013-04-01 2013-04-01 false Menadione nicotinamide bisulfite. 573.625 Section...
Nair, Bindu; Elmore, Amy R
2003-01-01
Sodium Sulfite, Ammonium Sulfite, Sodium Bisulfite, Potassium Bisulfite, Ammonium Bisulfite, Sodium Metabisulfite, and Potassium Metabisulfite are inorganic salts that function as reducing agents in cosmetic formulations. All except Sodium Metabisulfite also function as hair-waving/straightening agents. In addition, Sodium Sulfite, Potassium Sulfite, Sodium Bisulfite, and Sodium Metabisulfite function as antioxidants. Although Ammonium Sulfite is not in current use, the others are widely used in hair care products. Sulfites that enter mammals via ingestion, inhalation, or injection are metabolized by sulfite oxidase to sulfate. In oral-dose animal toxicity studies, hyperplastic changes in the gastric mucosa were the most common findings at high doses. Ammonium Sulfite aerosol had an acute LC(50) of >400 mg/m(3) in guinea pigs. A single exposure to low concentrations of a Sodium Sulfite fine aerosol produced dose-related changes in the lung capacity parameters of guinea pigs. A 3-day exposure of rats to a Sodium Sulfite fine aerosol produced mild pulmonary edema and irritation of the tracheal epithelium. Severe epithelial changes were observed in dogs exposed for 290 days to 1 mg/m(3) of a Sodium Metabisulfite fine aerosol. These fine aerosols contained fine respirable particle sizes that are not found in cosmetic aerosols or pump sprays. None of the cosmetic product types, however, in which these ingredients are used are aerosolized. Sodium Bisulfite (tested at 38%) and Sodium Metabisulfite (undiluted) were not irritants to rabbits following occlusive exposures. Sodium Metabisulfite (tested at 50%) was irritating to guinea pigs following repeated exposure. In rats, Sodium Sulfite heptahydrate at large doses (up to 3.3 g/kg) produced fetal toxicity but not teratogenicity. Sodium Bisulfite, Sodium Metabisulfite, and Potassium Metabisulfite were not teratogenic for mice, rats, hamsters, or rabbits at doses up to 160 mg/kg. Generally, Sodium Sulfite, Sodium Metabisulfite, and Potassium Metabisulfite were negative in mutagenicity studies. Sodium Bisulfite produced both positive and negative results. Clinical oral and ocular-exposure studies reported no adverse effects. Sodium Sulfite was not irritating or sensitizing in clinical tests. These ingredients, however, may produce positive reactions in dermatologic patients under patch test. In evaluating the positive genotoxicity data found with Sodium Bisulfite, the equilibrium chemistry of sulfurous acid, sulfur dioxide, bisulfite, sulfite, and metabisulfite was considered. This information, however, suggests that some bisulfite may have been present in genotoxicity tests involving the other ingredients and vice versa. On that basis, the genotoxicity data did not give a clear, consistent picture. In cosmetics, however, the bisulfite form is used at very low concentrations (0.03% to 0.7%) in most products except wave sets. In wave sets, the pH ranges from 8 to 9 where the sulfite form would predominate. Skin penetration would be low due to the highly charged nature of these particles and any sulfite that did penetrate would be converted to sulfate by the enzyme sulfate oxidase. As used in cosmetics, therefore, these ingredients would not present a genotoxicity risk. The Cosmetic Ingredient Review Expert Panel concluded that Sodium Sulfite, Potassium Sulfite, Ammonium Sulfite, Sodium Bisulfite, Ammonium Bisulfite, Sodium Metabisulfite, and Potassium Metabisulfite are safe as used in cosmetic formulations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pelch, Katherine E.; Tokar, Erik J.; Merrick, B. Alex
Previous work shows altered methylation patterns in inorganic arsenic (iAs)- or cadmium (Cd)-transformed epithelial cells. Here, the methylation status near the transcriptional start site was assessed in the normal human prostate epithelial cell line (RWPE-1) that was malignantly transformed by 10 μM Cd for 11 weeks (CTPE) or 5 μM iAs for 29 weeks (CAsE-PE), at which time cells showed multiple markers of acquired cancer phenotype. Next generation sequencing of the transcriptome of CAsE-PE cells identified multiple dysregulated genes. Of the most highly dysregulated genes, five genes that can be relevant to the carcinogenic process (S100P, HYAL1, NTM, NES, ALDH1A1)more » were chosen for an in-depth analysis of the DNA methylation profile. DNA was isolated, bisulfite converted, and combined bisulfite restriction analysis was used to identify differentially methylated CpG sites, which was confirmed with bisulfite sequencing. Four of the five genes showed differential methylation in transformants relative to control cells that was inversely related to altered gene expression. Increased expression of HYAL1 (> 25-fold) and S100P (> 40-fold) in transformants was correlated with hypomethylation near the transcriptional start site. Decreased expression of NES (> 15-fold) and NTM (> 1000-fold) in transformants was correlated with hypermethylation near the transcriptional start site. ALDH1A1 expression was differentially expressed in transformed cells but was not differentially methylated relative to control. In conclusion, altered gene expression observed in Cd and iAs transformed cells may result from altered DNA methylation status. - Highlights: • Cd and iAs are known human carcinogens, yet neither appears directly mutagenic. • Prior data suggest epigenetic modification plays a role in Cd or iAs induced cancer. • Altered methylation of four misregulated genes was found in Cd or iAs transformants. • The resulting altered gene expression may be relevant to cellular transformation.« less
DNA methylome of the 20-gigabase Norway spruce genome
Ausin, Israel; Feng, Suhua; Yu, Chaowei; Liu, Wanlu; Kuo, Hsuan Yu; Jacobsen, Elise L.; Zhai, Jixian; Gallego-Bartolome, Javier; Wang, Lin; Egertsdotter, Ulrika; Street, Nathaniel R.; Jacobsen, Steven E.; Wang, Haifeng
2016-01-01
DNA methylation plays important roles in many biological processes, such as silencing of transposable elements, imprinting, and regulating gene expression. Many studies of DNA methylation have shown its essential roles in angiosperms (flowering plants). However, few studies have examined the roles and patterns of DNA methylation in gymnosperms. Here, we present genome-wide high coverage single-base resolution methylation maps of Norway spruce (Picea abies) from both needles and somatic embryogenesis culture cells via whole genome bisulfite sequencing. On average, DNA methylation levels of CG and CHG of Norway spruce were higher than most other plants studied. CHH methylation was found at a relatively low level; however, at least one copy of most of the RNA-directed DNA methylation pathway genes was found in Norway spruce, and CHH methylation was correlated with levels of siRNAs. In comparison with needles, somatic embryogenesis culture cells that are used for clonally propagating spruce trees showed lower levels of CG and CHG methylation but higher level of CHH methylation, suggesting that like in other species, these culture cells show abnormal methylation patterns. PMID:27911846
Miyake, Kunio; Kawaguchi, Akio; Miura, Ryu; Kobayashi, Sachiko; Tran, Nguyen Quoc Vuong; Kobayashi, Sumitaka; Miyashita, Chihiro; Araki, Atsuko; Kubota, Takeo; Yamagata, Zentaro; Kishi, Reiko
2018-04-04
Maternal smoking is reported to cause adverse effects on the health of the unborn child, the underlying mechanism for which is thought to involve alterations in DNA methylation. We examined the effects of maternal smoking on DNA methylation in cord blood, in 247 mother-infant pairs in the Sapporo cohort of the Hokkaido Study, using the Infinium HumanMethylation 450K BeadChip. We first identified differentially methylated CpG sites with a false discovery rate (FDR) of <0.05 and the magnitude of DNA methylation changes (|β| >0.02) from the pairwise comparisons of never-smokers (Ne-S), sustained-smokers (Su-S), and stopped-smokers (St-S). Subsequently, secondary comparisons between St-S and Su-S revealed nine common sites that mapped to ACSM3, AHRR, CYP1A1, GFI1, SHANK2, TRIM36, and the intergenic region between ANKRD9 and RCOR1 in Ne-S vs. Su-S, and one common CpG site mapping to EVC2 in Ne-S vs. St-S. Further, we verified these CpG sites and examined neighbouring sites using bisulfite next-generation sequencing, except for AHRR cg21161138. These changes in DNA methylation implicate the effect of smoking cessation. Our findings add to the current knowledge of the association between DNA methylation and maternal smoking and suggest future studies for clarifying this relationship in disease development.
Paliwal, Anupam; Temkin, Alexis M; Kerkel, Kristi; Yale, Alexander; Yotova, Iveta; Drost, Natalia; Lax, Simon; Nhan-Chang, Chia-Ling; Powell, Charles; Borczuk, Alain; Aviv, Abraham; Wapner, Ronald; Chen, Xiaowei; Nagy, Peter L; Schork, Nicholas; Do, Catherine; Torkamani, Ali; Tycko, Benjamin
2013-08-01
Allele-specific DNA methylation (ASM) is well studied in imprinted domains, but this type of epigenetic asymmetry is actually found more commonly at non-imprinted loci, where the ASM is dictated not by parent-of-origin but instead by the local haplotype. We identified loci with strong ASM in human tissues from methylation-sensitive SNP array data. Two index regions (bisulfite PCR amplicons), one between the C3orf27 and RPN1 genes in chromosome band 3q21 and the other near the VTRNA2-1 vault RNA in band 5q31, proved to be new examples of imprinted DMRs (maternal alleles methylated) while a third, between STEAP3 and C2orf76 in chromosome band 2q14, showed non-imprinted haplotype-dependent ASM. Using long-read bisulfite sequencing (bis-seq) in 8 human tissues we found that in all 3 domains the ASM is restricted to single differentially methylated regions (DMRs), each less than 2kb. The ASM in the C3orf27-RPN1 intergenic region was placenta-specific and associated with allele-specific expression of a long non-coding RNA. Strikingly, the discrete DMRs in all 3 regions overlap with binding sites for the insulator protein CTCF, which we found selectively bound to the unmethylated allele of the STEAP3-C2orf76 DMR. Methylation mapping in two additional genes with non-imprinted haplotype-dependent ASM, ELK3 and CYP2A7, showed that the CYP2A7 DMR also overlaps a CTCF site. Thus, two features of imprinted domains, highly localized DMRs and allele-specific insulator occupancy by CTCF, can also be found in chromosomal domains with non-imprinted ASM. Arguing for biological importance, our analysis of published whole genome bis-seq data from hES cells revealed multiple genome-wide association study (GWAS) peaks near CTCF binding sites with ASM.
Kerkel, Kristi; Yale, Alexander; Yotova, Iveta; Drost, Natalia; Lax, Simon; Nhan-Chang, Chia-Ling; Powell, Charles; Borczuk, Alain; Aviv, Abraham; Wapner, Ronald; Chen, Xiaowei; Nagy, Peter L.; Schork, Nicholas; Do, Catherine; Torkamani, Ali; Tycko, Benjamin
2013-01-01
Allele-specific DNA methylation (ASM) is well studied in imprinted domains, but this type of epigenetic asymmetry is actually found more commonly at non-imprinted loci, where the ASM is dictated not by parent-of-origin but instead by the local haplotype. We identified loci with strong ASM in human tissues from methylation-sensitive SNP array data. Two index regions (bisulfite PCR amplicons), one between the C3orf27 and RPN1 genes in chromosome band 3q21 and the other near the VTRNA2-1 vault RNA in band 5q31, proved to be new examples of imprinted DMRs (maternal alleles methylated) while a third, between STEAP3 and C2orf76 in chromosome band 2q14, showed non-imprinted haplotype-dependent ASM. Using long-read bisulfite sequencing (bis-seq) in 8 human tissues we found that in all 3 domains the ASM is restricted to single differentially methylated regions (DMRs), each less than 2kb. The ASM in the C3orf27-RPN1 intergenic region was placenta-specific and associated with allele-specific expression of a long non-coding RNA. Strikingly, the discrete DMRs in all 3 regions overlap with binding sites for the insulator protein CTCF, which we found selectively bound to the unmethylated allele of the STEAP3-C2orf76 DMR. Methylation mapping in two additional genes with non-imprinted haplotype-dependent ASM, ELK3 and CYP2A7, showed that the CYP2A7 DMR also overlaps a CTCF site. Thus, two features of imprinted domains, highly localized DMRs and allele-specific insulator occupancy by CTCF, can also be found in chromosomal domains with non-imprinted ASM. Arguing for biological importance, our analysis of published whole genome bis-seq data from hES cells revealed multiple genome-wide association study (GWAS) peaks near CTCF binding sites with ASM. PMID:24009515
Pervasive polymorphic imprinted methylation in the human placenta
Hanna, Courtney W.; Peñaherrera, Maria S.; Saadeh, Heba; Andrews, Simon; McFadden, Deborah E.; Kelsey, Gavin; Robinson, Wendy P.
2016-01-01
The maternal and paternal copies of the genome are both required for mammalian development, and this is primarily due to imprinted genes, those that are monoallelically expressed based on parent-of-origin. Typically, this pattern of expression is regulated by differentially methylated regions (DMRs) that are established in the germline and maintained after fertilization. There are a large number of germline DMRs that have not yet been associated with imprinting, and their function in development is unknown. In this study, we developed a genome-wide approach to identify novel imprinted DMRs in the human placenta and investigated the dynamics of these imprinted DMRs during development in somatic and extraembryonic tissues. DNA methylation was evaluated using the Illumina HumanMethylation450 array in 134 human tissue samples, publicly available reduced representation bisulfite sequencing in the human embryo and germ cells, and targeted bisulfite sequencing in term placentas. Forty-three known and 101 novel imprinted DMRs were identified in the human placenta by comparing methylation between diandric and digynic triploid conceptions in addition to female and male gametes. Seventy-two novel DMRs showed a pattern consistent with placental-specific imprinting, and this monoallelic methylation was entirely maternal in origin. Strikingly, these DMRs exhibited polymorphic imprinted methylation between placental samples. These data suggest that imprinting in human development is far more extensive and dynamic than previously reported and that the placenta preferentially maintains maternal germline-derived DNA methylation. PMID:26769960
Johnson, Michelle D; Dopierala, Justyna
2018-01-01
ABSTRACT DNA methylation is an important regulator of gene function. Fetal sex is associated with the risk of several specific pregnancy complications related to placental function. However, the association between fetal sex and placental DNA methylation remains poorly understood. We carried out whole-genome oxidative bisulfite sequencing in the placentas of two healthy female and two healthy male pregnancies generating an average genome depth of coverage of 25x. Most highly ranked differentially methylated regions (DMRs) were located on the X chromosome but we identified a 225 kb sex-specific DMR in the body of the CUB and Sushi Multiple Domains 1 (CSMD1) gene on chromosome 8. The sex-specific differential methylation pattern observed in this region was validated in additional placentas using in-solution target capture. In a new RNA-seq data set from 64 female and 67 male placentas, CSMD1 mRNA was 1.8-fold higher in male than in female placentas (P value = 8.5 × 10−7, Mann-Whitney test). Exon-level quantification of CSMD1 mRNA from these 131 placentas suggested a likely placenta-specific CSMD1 isoform not detected in the 21 somatic tissues analyzed. We show that the gene body of an autosomal gene, CSMD1, is differentially methylated in a sex- and placental-specific manner, displaying sex-specific differences in placental transcript abundance. PMID:29376485
Sun, Kun; Jiang, Peiyong; Chan, K. C. Allen; Wong, John; Cheng, Yvonne K. Y.; Liang, Raymond H. S.; Chan, Wai-kong; Ma, Edmond S. K.; Chan, Stephen L.; Cheng, Suk Hang; Chan, Rebecca W. Y.; Tong, Yu K.; Ng, Simon S. M.; Wong, Raymond S. M.; Hui, David S. C.; Leung, Tse Ngong; Leung, Tak Y.; Lai, Paul B. S.; Chiu, Rossa W. K.; Lo, Yuk Ming Dennis
2015-01-01
Plasma consists of DNA released from multiple tissues within the body. Using genome-wide bisulfite sequencing of plasma DNA and deconvolution of the sequencing data with reference to methylation profiles of different tissues, we developed a general approach for studying the major tissue contributors to the circulating DNA pool. We tested this method in pregnant women, patients with hepatocellular carcinoma, and subjects following bone marrow and liver transplantation. In most subjects, white blood cells were the predominant contributors to the circulating DNA pool. The placental contributions in the plasma of pregnant women correlated with the proportional contributions as revealed by fetal-specific genetic markers. The graft-derived contributions to the plasma in the transplant recipients correlated with those determined using donor-specific genetic markers. Patients with hepatocellular carcinoma showed elevated plasma DNA contributions from the liver, which correlated with measurements made using tumor-associated copy number aberrations. In hepatocellular carcinoma patients and in pregnant women exhibiting copy number aberrations in plasma, comparison of methylation deconvolution results using genomic regions with different copy number status pinpointed the tissue type responsible for the aberrations. In a pregnant woman diagnosed as having follicular lymphoma during pregnancy, methylation deconvolution indicated a grossly elevated contribution from B cells into the plasma DNA pool and localized B cells as the origin of the copy number aberrations observed in plasma. This method may serve as a powerful tool for assessing a wide range of physiological and pathological conditions based on the identification of perturbed proportional contributions of different tissues into plasma. PMID:26392541
Xing, Yang; Bu, Lingxi; Sun, Dafeng; Liu, Zhiping; Liu, Shijie; Jiang, Jianxin
2015-10-01
This study reports four schemes to pretreat wet furfural residues (FRs) with sodium bisulfite for production of fermentable sugar. The results showed that non-detoxified FRs (pH 2-3) had great potential to lower the cost of bioconversion. The optimal process was that unwashed FRs were first pretreated with bisulfite, and the whole slurry was then directly used for enzymatic hydrolysis. A maximum glucose yield of 99.4% was achieved from substrates pretreated with 0.1 g NaHSO3/g dry substrate (DS), at a relatively low temperature of 100 °C for 3 h. Compared with raw material, enzymatic hydrolysis at a high-solid of 16.5% (w/w) specifically showed more excellent performance with bisulfite treated FRs. Direct bisulfite pretreatment improved the accessibility of substrates and the total glucose recovery. Lignosulfonate in the non-detoxified slurry decreased the non-productive adsorption of cellulase on the substrate, thus improving enzymatic hydrolysis. Copyright © 2015 Elsevier Ltd. All rights reserved.
Detoxification of Dissolved SO2 (Bisulfite) by Terricolous Mosses
BHARALI, BHAGAWAN; BATES, JEFFREY W.
2006-01-01
• Background and Aims The widespread calcifuge moss Pleurozium schreberi is moderately tolerant of SO2, whereas Rhytidiadelphus triquetrus is limited to calcareous soils in regions of the UK that were strongly affected by SO2 pollution in the 20th century. The proposition that tolerance of SO2 by these terricolous mosses depends on metabolic detoxification of dissolved bisulfite was investigated. • Methods The capacities of the two mosses to accelerate loss of bisulfite from aqueous solutions of NaHSO3 were studied using DTNB [5, 5-dithio-(2-nitrobenzoic acid)] to assay bisulfite, and HPLC to assay sulfate in the incubation solutions. Incubations were performed for different durations, in the presence and absence of light, at a range of solution pH values, in the presence of metabolic inhibitors and with altered moss apoplastic Ca2+ and Fe3+ levels. • Key Results Bisulfite disappearance was markedly stimulated in the light and twice as great for R. triquetrus as for P. schreberi. DCMU, an inhibitor of photosynthetic electron chain transport, significantly reduced bisufite loss. • Conclusions Bisulfite (SO2) tolerance in these terricolous mosses involves extracellular oxidation using metabolic (photo-oxidative) energy, passive oxidation by adsorbed Fe3+ (only available to the calcifuge) and probably also internal metabolic detoxification. PMID:16319108
Bacteria of Porcine Skin, Xenografts, and Treatment with Neomycin Sulfate
Smith, Rodney F.; Evans, Barbara L.
1972-01-01
Homogenized 4-mm punch biopsies were taken from pigs and bacteriologically evaluated to determine the efficacy of surgical scrub procedures and the subsequent treatment of tissue with 0.5% neomycin sulfate-sodium bisulfite (neomycin-bisulfite) as a decontaminating agent. The majority of the lots of porcine skin taken directly from animals for xenografts in the treatment of burns contained viable bacteria at the time of grafting although scrubbing procedures substantially reduced the skin bacteria. The porcine bacteria consisted primarily of coagulase-negative staphylococci with most strains exhibiting caseinolytic and elastase activity. Staphylococci were the only abundant bacteria found in postscrub biopsies and in saline solutions used to wash the dermatome during its use. After an overnight exposure of grafting tissue soaked in neomycin-bisulfite, the spent neomycin-bisulfite solutions were tested for bacteriostatic and bactericidal activity by comparison to unused neomycin. All solutions tested were equal in bacteriostatic strength, but the bactericidal action of some spent solutions was decreased. Neomycin alone exerted a more lethal effect on sensitive bacteria than the neomycin-bisulfite solution. The desirability of having viable porcine skin for a xenograft necessitated using or discarding the tissue after storage in neomycin-bisulfite at 4 C for a maximum of 72 hr. Certain contaminating microorganisms were unaffected by antibiotic treatment, and the prolonged use of neomycin without bisulfite would have primarily eradicated only the porcine coagulase-negative staphylococci. Neither the presence of this group in grafting tissue nor their proteolytic activity had any observed adverse effect on xenografting success. Images PMID:4552886
2016-07-01
DNA methylation patterns are altered in numerous diseases and often correlate with clinically relevant information such as disease subtypes, prognosis and drug response. With suitable assays and after validation in large cohorts, such associations can be exploited for clinical diagnostics and personalized treatment decisions. Here we describe the results of a community-wide benchmarking study comparing the performance of all widely used methods for DNA methylation analysis that are compatible with routine clinical use. We shipped 32 reference samples to 18 laboratories in seven different countries. Researchers in those laboratories collectively contributed 21 locus-specific assays for an average of 27 predefined genomic regions, as well as six global assays. We evaluated assay sensitivity on low-input samples and assessed the assays' ability to discriminate between cell types. Good agreement was observed across all tested methods, with amplicon bisulfite sequencing and bisulfite pyrosequencing showing the best all-round performance. Our technology comparison can inform the selection, optimization and use of DNA methylation assays in large-scale validation studies, biomarker development and clinical diagnostics.
Comparative Methylome Analyses Identify Epigenetic Regulatory Loci of Human Brain Evolution
Mendizabal, Isabel; Shi, Lei; Keller, Thomas E.; Konopka, Genevieve; Preuss, Todd M.; Hsieh, Tzung-Fu; Hu, Enzhi; Zhang, Zhe; Su, Bing; Yi, Soojin V.
2016-01-01
How do epigenetic modifications change across species and how do these modifications affect evolution? These are fundamental questions at the forefront of our evolutionary epigenomic understanding. Our previous work investigated human and chimpanzee brain methylomes, but it was limited by the lack of outgroup data which is critical for comparative (epi)genomic studies. Here, we compared whole genome DNA methylation maps from brains of humans, chimpanzees and also rhesus macaques (outgroup) to elucidate DNA methylation changes during human brain evolution. Moreover, we validated that our approach is highly robust by further examining 38 human-specific DMRs using targeted deep genomic and bisulfite sequencing in an independent panel of 37 individuals from five primate species. Our unbiased genome-scan identified human brain differentially methylated regions (DMRs), irrespective of their associations with annotated genes. Remarkably, over half of the newly identified DMRs locate in intergenic regions or gene bodies. Nevertheless, their regulatory potential is on par with those of promoter DMRs. An intriguing observation is that DMRs are enriched in active chromatin loops, suggesting human-specific evolutionary remodeling at a higher-order chromatin structure. These findings indicate that there is substantial reprogramming of epigenomic landscapes during human brain evolution involving noncoding regions. PMID:27563052
Govindaraju, Gayathri; Jabeena, C A; Sethumadhavan, Devadathan Valiyamangalath; Rajaram, Nivethika; Rajavelu, Arumugam
2017-10-01
In eukaryotes, cytosine methylation regulates diverse biological processes such as gene expression, development and maintenance of genomic integrity. However, cytosine methylation and its functions in pathogenic apicomplexan protozoans remain enigmatic. To address this, here we investigated the presence of cytosine methylation in the nucleic acids of the protozoan Plasmodium falciparum. Interestingly, P. falciparum has TRDMT1, a conserved homologue of DNA methyltransferase DNMT2. However, we found that TRDMT1 did not methylate DNA, in vitro. We demonstrate that TRDMT1 methylates cytosine in the endogenous aspartic acid tRNA of P. falciparum. Through RNA bisulfite sequencing, we mapped the position of 5-methyl cytosine in aspartic acid tRNA and found methylation only at C38 position. P. falciparum proteome has significantly higher aspartic acid content and a higher proportion of proteins with poly aspartic acid repeats than other apicomplexan pathogenic protozoans. Proteins with such repeats are functionally important, with significant roles in host-pathogen interactions. Therefore, TRDMT1 mediated C38 methylation of aspartic acid tRNA might play a critical role by translational regulation of important proteins and modulate the pathogenicity of the malarial parasite. Copyright © 2017 Elsevier B.V. All rights reserved.
Pangeson, Tanapat; Sanguansermsri, Phanchana; Sanguansermsri, Torpong; Seeratanachot, Teerapat; Suwanakhon, Narutchala; Srikummool, Metawee; Kaewkong, Worasak; Mahingsa, Khwanruedee
2017-01-01
In the wild-type allele, DNA methylation levels of 10 consecutive CpG sites adjacent to the upstream 5′-breakpoint of α-thalassemia Southeast Asian (SEA) deletion are not different between placenta and leukocytes. However, no previous study has reported the map of DNA methylation in the SEA allele. This report aims to show that the SEA mutation is associated with DNA methylation changes, resulting in differential methylation between placenta and leukocytes. Methylation-sensitive high-resolution analysis was used to compare DNA methylation among placenta, leukocytes, and unmethylated control DNA. The result indicates that the DNA methylation between placenta and leukocyte DNA is different and shows that the CpG status of both is not fully unmethylated. Mapping of individual CpG sites was performed by targeted bisulfite sequencing. The DNA methylation level of the 10 consecutive CpG sites was different between placenta and leukocyte DNA. When the 10th CpG of the mutation allele was considered as a hallmark for comparing DNA methylation level, it was totally different from the unmethylated 10th CpG of the wild-type allele. Finally, the distinct DNA methylation patterns between both DNA were extracted. In total, 24 patterns were found in leukocyte samples and 9 patterns were found in placenta samples. This report shows that the large deletion is associated with DNA methylation change. In further studies for clinical application, the distinct DNA methylation pattern might be a potential marker for detecting cell-free fetal DNA. PMID:29162979
NASA Astrophysics Data System (ADS)
Chao, Jianbin; Liu, Yuhong; Zhang, Yan; Zhang, Yongbin; Huo, Fangjun; Yin, Caixia; Wang, Yu; Qin, Liping
2015-07-01
A new fluorescent enhanced probe based on (E)-9-(2-nitrovinyl)-anthracene is developed, which shows high selectivity and sensitivity for the detection of bisulfite anions at Na2HPO4 citric acid buffer solutions (pH 5.0). When addition of HSO3-, the fluorescence intensity is significantly enhanced and the probe displays apparent fluorescence color changes from non-fluorescence to blue under a UV lamp illumination, the solution color also changes from yellow to colorless. The detection limit is determined to be as low as 6.30 μM. This offers another specific colorimetric and fluorescent probe for bisulfite anions detection, furthermore it is applied in detecting the level of bisulfite in sugar samples.
Morandi, Luca; Gissi, Davide; Tarsitano, Achille; Asioli, Sofia; Gabusi, Andrea; Marchetti, Claudio; Montebugnoli, Lucio; Foschini, Maria Pia
2017-01-01
Oral squamous cell carcinoma (OSCC) is usually diagnosed at an advanced stage and is commonly preceded by oral premalignant lesions. The mortality rates have remained unchanged (50% within 5 years after diagnosis), and it is related to tobacco smoking and alcohol intake. Novel molecular markers for early diagnosis are urgently needed. The purpose of this study was to evaluate the diagnostic value of methylation level in a set of 18 genes by bisulfite next-generation sequencing. With minimally invasive oral brushing, 28 consecutive OSCC, one squamous cell carcinoma with sarcomatoid features, six high-grade squamous intraepithelial lesions (HGSIL), 30 normal contralateral mucosa from the same patients, and 65 healthy donors were evaluated for DNA methylation analyzing 18 target genes by quantitative bisulfite next-generation sequencing. We further evaluated an independent cohort (validation dataset) made of 20 normal donors, one oral fibroma, 14 oral lichen planus (OLP), three proliferative verrucous leukoplakia (PVL), and two OSCC. Comparing OSCC with normal healthy donors and contralateral mucosa in 355 CpGs, we identified the following epigenetically altered genes: ZAP70 , ITGA4 , KIF1A , PARP15 , EPHX3 , NTM , LRRTM1 , FLI1 , MIR193 , LINC00599 , PAX1 , and MIR137HG showing hypermethylation and MIR296 , TERT , and GP1BB showing hypomethylation . The behavior of ZAP70 , GP1BB , H19 , EPHX3 , and MIR193 fluctuated among different interrogated CpGs. The gap between normal and OSCC samples remained mostly the same (Kruskal-Wallis P values < 0.05), but the absolute values changed conspicuously. ROC curve analysis identified the most informative CpGs, and we correctly stratified OSCC and HGSIL from normal donors using a multiclass linear discriminant analysis in a 13-gene panel (AUC 0.981). Only the OSCC with sarcomatoid features was negative. Three contralateral mucosa were positive, a sign of a possible field cancerization. Among imprinted genes, only MIR296 showed loss of imprinting. DNMT1 , TERC , and H19 together with the global methylation of long interspersed element 1 were unchanged. In the validation dataset, values over the threshold were detected in 2/2 OSCC, in 3/3 PVL, and in 2/14 OLP. Our data highlight the importance of CpG location and correct estimation of DNA methylation level for highly accurate early diagnosis of OSCC.
NASA Astrophysics Data System (ADS)
Eldridge, Daniel L.; Mysen, Bjorn O.; Cody, George D.
2018-01-01
Bisulfite (HSO3-) and sulfite (SO32-) compounds play key roles in numerous geochemical and biochemical processes extending from the atmosphere to the subseafloor biosphere. Despite decades of spectroscopic investigations, the molecular composition of HSO3- in solution remains uncertain and, thus, the role of bisulfite in (bio)chemical and isotope fractionation processes is unclear. We report new experimental estimates for the bisulfite isomer quotient (Qi = [(HO)SO2-]/[(HS)O3-]; [] = concentration) as a function of temperature from the interpretation of Raman spectra collected from aqueous NaHSO3 solutions contained in fused silica capsules. In pure NaHSO3 solutions (1Na+:1HSO3-, stoichiometric) over [NaHSO3] = 0.2-0.4 m (moles/kg H2O), the following relationship is obtained:
Goedecke, Simon; Mühlisch, Jörg; Hempel, Georg; Frühwald, Michael C; Wünsch, Bernhard
2015-12-01
Along with histone modifications, RNA interference and delayed replication timing, DNA methylation belongs to the key processes in epigenetic regulation of gene expression. Therefore, reliable information about the methylation level of particular DNA fragments is of major interest. Herein the methylation level at two positions of the promoter region of the gene methylguanine-O(6) -DNA-Methyltransferase (MGMT) was investigated. Previously, it was demonstrated that the epigenetic status of this DNA region correlates with response to alkylating anticancer agents. An automated CGE method with LIF detection was established to separate the six DNA fragments resulting from combined bisulfite restriction analysis of the methylated and non-methylated MGMT promoter. In COBRA, the DNA was treated with bisulfite converting cytosine into uracil. During PCR uracil pairs with adenine, which changes the original recognition site of the restriction enzyme Taql. Artificial probes generated by mixing appropriate amounts of DNA after bisulfite treatment and PCR amplification were used for validation of the method. The methylation levels of these samples could be determined with high accuracy and precision. DNA samples prepared by mixing the corresponding clones first and then performing PCR amplification led to non-linear correlation between the corrected peak areas and the methylation levels. This effect is explained by slightly different PCR amplification of DNA with different sequences present in the mixture. The superiority of CGE over PAGE was clearly demonstrated. Finally, the established method was used to analyze the methylation levels of human brain tumor tissue samples. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Rapid analytical determination of glutaraldehyde concentrations
NASA Technical Reports Server (NTRS)
Frigerio, N. A.; Shaw, M. H.
1971-01-01
Technique utilizes the iodimetric procedure which adds unknown excess of bisulfite to glutaraldehyde /GA/ then titrates unreacted bisulfite with standard iodine isotope to determine GA concentrations. Technique may interest microscopists, food researchers, biochemical or medical laboratories, and drug manufacturers.
Draper, D E
1984-01-01
Bisulfite catalyzes transamination of cytidine at the N4 position; the suitability of this reaction for attaching reporter groups to selected cytidine residues in RNA molecules has been investigated. Poly(C) is nearly quantitatively converted to the poly (N4 aminoethyl-C) derivative after 3 hrs at 42 degrees C with ethylene diamine (pK1 = 7.6) and bisulfite. This derivative reacts quantitatively with N-hydroxysuccinimide esters; the linkage of a fluorescent dye, nitrobenzofurazan, to cytidine by this reaction is demonstrated. To direct the bisulfite reaction to selected cytidines within a large RNA molecule, the RNA is hybridized to complementary DNA containing a deletion. Only the cytidines in the single strand RNA loop (corresponding to the DNA deletion) are reactive. Two cytidines in the middle of a 340 base RNA fragment from 16S ribosomal RNA have been modified by this technique. Images PMID:6198634
Gries, Jasmin; Schumacher, Dirk; Arand, Julia; Lutsik, Pavlo; Markelova, Maria Rivera; Fichtner, Iduna; Walter, Jörn; Sers, Christine; Tierling, Sascha
2013-01-01
The use of next generation sequencing has expanded our view on whole mammalian methylome patterns. In particular, it provides a genome-wide insight of local DNA methylation diversity at single nucleotide level and enables the examination of single chromosome sequence sections at a sufficient statistical power. We describe a bisulfite-based sequence profiling pipeline, Bi-PROF, which is based on the 454 GS-FLX Titanium technology that allows to obtain up to one million sequence stretches at single base pair resolution without laborious subcloning. To illustrate the performance of the experimental workflow connected to a bioinformatics program pipeline (BiQ Analyzer HT) we present a test analysis set of 68 different epigenetic marker regions (amplicons) in five individual patient-derived xenograft tissue samples of colorectal cancer and one healthy colon epithelium sample as a control. After the 454 GS-FLX Titanium run, sequence read processing and sample decoding, the obtained alignments are quality controlled and statistically evaluated. Comprehensive methylation pattern interpretation (profiling) assessed by analyzing 102-104 sequence reads per amplicon allows an unprecedented deep view on pattern formation and methylation marker heterogeneity in tissues concerned by complex diseases like cancer. PMID:23803588
Harris, R. Alan; Wang, Ting; Coarfa, Cristian; Nagarajan, Raman P.; Hong, Chibo; Downey, Sara L.; Johnson, Brett E.; Fouse, Shaun D.; Delaney, Allen; Zhao, Yongjun; Olshen, Adam; Ballinger, Tracy; Zhou, Xin; Forsberg, Kevin J.; Gu, Junchen; Echipare, Lorigail; O’Geen, Henriette; Lister, Ryan; Pelizzola, Mattia; Xi, Yuanxin; Epstein, Charles B.; Bernstein, Bradley E.; Hawkins, R. David; Ren, Bing; Chung, Wen-Yu; Gu, Hongcang; Bock, Christoph; Gnirke, Andreas; Zhang, Michael Q.; Haussler, David; Ecker, Joseph; Li, Wei; Farnham, Peggy J.; Waterland, Robert A.; Meissner, Alexander; Marra, Marco A.; Hirst, Martin; Milosavljevic, Aleksandar; Costello, Joseph F.
2010-01-01
Sequencing-based DNA methylation profiling methods are comprehensive and, as accuracy and affordability improve, will increasingly supplant microarrays for genome-scale analyses. Here, four sequencing-based methodologies were applied to biological replicates of human embryonic stem cells to compare their CpG coverage genome-wide and in transposons, resolution, cost, concordance and its relationship with CpG density and genomic context. The two bisulfite methods reached concordance of 82% for CpG methylation levels and 99% for non-CpG cytosine methylation levels. Using binary methylation calls, two enrichment methods were 99% concordant, while regions assessed by all four methods were 97% concordant. To achieve comprehensive methylome coverage while reducing cost, an approach integrating two complementary methods was examined. The integrative methylome profile along with histone methylation, RNA, and SNP profiles derived from the sequence reads allowed genome-wide assessment of allele-specific epigenetic states, identifying most known imprinted regions and new loci with monoallelic epigenetic marks and monoallelic expression. PMID:20852635
DNA methylation analysis of phenotype specific stratified Indian population.
Rotti, Harish; Mallya, Sandeep; Kabekkodu, Shama Prasada; Chakrabarty, Sanjiban; Bhale, Sameer; Bharadwaj, Ramachandra; Bhat, Balakrishna K; Dedge, Amrish P; Dhumal, Vikram Ram; Gangadharan, G G; Gopinath, Puthiya M; Govindaraj, Periyasamy; Joshi, Kalpana S; Kondaiah, Paturu; Nair, Sreekumaran; Nair, S N Venugopalan; Nayak, Jayakrishna; Prasanna, B V; Shintre, Pooja; Sule, Mayura; Thangaraj, Kumarasamy; Patwardhan, Bhushan; Valiathan, Marthanda Varma Sankaran; Satyamoorthy, Kapaettu
2015-05-08
DNA methylation and its perturbations are an established attribute to a wide spectrum of phenotypic variations and disease conditions. Indian traditional system practices personalized medicine through indigenous concept of distinctly descriptive physiological, psychological and anatomical features known as prakriti. Here we attempted to establish DNA methylation differences in these three prakriti phenotypes. Following structured and objective measurement of 3416 subjects, whole blood DNA of 147 healthy male individuals belonging to defined prakriti (Vata, Pitta and Kapha) between the age group of 20-30years were subjected to methylated DNA immunoprecipitation (MeDIP) and microarray analysis. After data analysis, prakriti specific signatures were validated through bisulfite DNA sequencing. Differentially methylated regions in CpG islands and shores were significantly enriched in promoters/UTRs and gene body regions. Phenotypes characterized by higher metabolism (Pitta prakriti) in individuals showed distinct promoter (34) and gene body methylation (204), followed by Vata prakriti which correlates to motion showed DNA methylation in 52 promoters and 139 CpG islands and finally individuals with structural attributes (Kapha prakriti) with 23 and 19 promoters and CpG islands respectively. Bisulfite DNA sequencing of prakriti specific multiple CpG sites in promoters and 5'-UTR such as; LHX1 (Vata prakriti), SOX11 (Pitta prakriti) and CDH22 (Kapha prakriti) were validated. Kapha prakriti specific CDH22 5'-UTR CpG methylation was also found to be associated with higher body mass index (BMI). Differential DNA methylation signatures in three distinct prakriti phenotypes demonstrate the epigenetic basis of Indian traditional human classification which may have relevance to personalized medicine.
Garinet, Simon; Néou, Mario; de La Villéon, Bruno; Faillot, Simon; Sakat, Julien; Da Fonseca, Juliana P; Jouinot, Anne; Le Tourneau, Christophe; Kamal, Maud; Luscap-Rondof, Windy; Boeva, Valentina; Gaujoux, Sebastien; Vidaud, Michel; Pasmant, Eric; Letourneur, Franck; Bertherat, Jérôme; Assié, Guillaume
2017-09-01
Pangenomic studies identified distinct molecular classes for many cancers, with major clinical applications. However, routine use requires cost-effective assays. We assessed whether targeted next-generation sequencing (NGS) could call chromosomal alterations and DNA methylation status. A training set of 77 tumors and a validation set of 449 (43 tumor types) were analyzed by targeted NGS and single-nucleotide polymorphism (SNP) arrays. Thirty-two tumors were analyzed by NGS after bisulfite conversion, and compared to methylation array or methylation-specific multiplex ligation-dependent probe amplification. Considering allelic ratios, correlation was strong between targeted NGS and SNP arrays (r = 0.88). In contrast, considering DNA copy number, for variations of one DNA copy, correlation was weaker between read counts and SNP array (r = 0.49). Thus, we generated TARGOMICs, optimized for detecting chromosome alterations by combining allelic ratios and read counts generated by targeted NGS. Sensitivity for calling normal, lost, and gained chromosomes was 89%, 72%, and 31%, respectively. Specificity was 81%, 93%, and 98%, respectively. These results were confirmed in the validation set. Finally, TARGOMICs could efficiently align and compute proportions of methylated cytosines from bisulfite-converted DNA from targeted NGS. In conclusion, beyond calling mutations, targeted NGS efficiently calls chromosome alterations and methylation status in tumors. A single run and minor design/protocol adaptations are sufficient. Optimizing targeted NGS should expand translation of genomics to clinical routine. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Kizaki, Seiichiro; Zou, Tingting; Li, Yue; Han, Yong-Woon; Suzuki, Yuki; Harada, Yoshie; Sugiyama, Hiroshi
2016-11-07
Tet (ten-eleven translocation) family proteins oxidize 5-methylcytosine (mC) to 5-hydroxymethylcytosine (hmC), 5-formylcytosine (fC), and 5-carboxycytosine (caC), and are suggested to be involved in the active DNA demethylation pathway. In this study, we reconstituted positioned mononucleosomes using CpG-methylated 382 bp DNA containing the Widom 601 sequence and recombinant histone octamer, and subjected the nucleosome to treatment with Tet1 protein. The sites of oxidized methylcytosine were identified by bisulfite sequencing. We found that, for the oxidation reaction, Tet1 protein prefers mCs located in the linker region of the nucleosome compared with those located in the core region. © 2016 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Comparative Methylome Analyses Identify Epigenetic Regulatory Loci of Human Brain Evolution.
Mendizabal, Isabel; Shi, Lei; Keller, Thomas E; Konopka, Genevieve; Preuss, Todd M; Hsieh, Tzung-Fu; Hu, Enzhi; Zhang, Zhe; Su, Bing; Yi, Soojin V
2016-11-01
How do epigenetic modifications change across species and how do these modifications affect evolution? These are fundamental questions at the forefront of our evolutionary epigenomic understanding. Our previous work investigated human and chimpanzee brain methylomes, but it was limited by the lack of outgroup data which is critical for comparative (epi)genomic studies. Here, we compared whole genome DNA methylation maps from brains of humans, chimpanzees and also rhesus macaques (outgroup) to elucidate DNA methylation changes during human brain evolution. Moreover, we validated that our approach is highly robust by further examining 38 human-specific DMRs using targeted deep genomic and bisulfite sequencing in an independent panel of 37 individuals from five primate species. Our unbiased genome-scan identified human brain differentially methylated regions (DMRs), irrespective of their associations with annotated genes. Remarkably, over half of the newly identified DMRs locate in intergenic regions or gene bodies. Nevertheless, their regulatory potential is on par with those of promoter DMRs. An intriguing observation is that DMRs are enriched in active chromatin loops, suggesting human-specific evolutionary remodeling at a higher-order chromatin structure. These findings indicate that there is substantial reprogramming of epigenomic landscapes during human brain evolution involving noncoding regions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
NGSmethDB 2017: enhanced methylomes and differential methylation
Lebrón, Ricardo; Gómez-Martín, Cristina; Carpena, Pedro; Bernaola-Galván, Pedro; Barturen, Guillermo; Hackenberg, Michael; Oliver, José L.
2017-01-01
The 2017 update of NGSmethDB stores whole genome methylomes generated from short-read data sets obtained by bisulfite sequencing (WGBS) technology. To generate high-quality methylomes, stringent quality controls were integrated with third-part software, adding also a two-step mapping process to exploit the advantages of the new genome assembly models. The samples were all profiled under constant parameter settings, thus enabling comparative downstream analyses. Besides a significant increase in the number of samples, NGSmethDB now includes two additional data-types, which are a valuable resource for the discovery of methylation epigenetic biomarkers: (i) differentially methylated single-cytosines; and (ii) methylation segments (i.e. genome regions of homogeneous methylation). The NGSmethDB back-end is now based on MongoDB, a NoSQL hierarchical database using JSON-formatted documents and dynamic schemas, thus accelerating sample comparative analyses. Besides conventional database dumps, track hubs were implemented, which improved database access, visualization in genome browsers and comparative analyses to third-part annotations. In addition, the database can be also accessed through a RESTful API. Lastly, a Python client and a multiplatform virtual machine allow for program-driven access from user desktop. This way, private methylation data can be compared to NGSmethDB without the need to upload them to public servers. Database website: http://bioinfo2.ugr.es/NGSmethDB. PMID:27794041
Single-Cell Sequencing for Drug Discovery and Drug Development.
Wu, Hongjin; Wang, Charles; Wu, Shixiu
2017-01-01
Next-generation sequencing (NGS), particularly single-cell sequencing, has revolutionized the scale and scope of genomic and biomedical research. Recent technological advances in NGS and singlecell studies have made the deep whole-genome (DNA-seq), whole epigenome and whole-transcriptome sequencing (RNA-seq) at single-cell level feasible. NGS at the single-cell level expands our view of genome, epigenome and transcriptome and allows the genome, epigenome and transcriptome of any organism to be explored without a priori assumptions and with unprecedented throughput. And it does so with single-nucleotide resolution. NGS is also a very powerful tool for drug discovery and drug development. In this review, we describe the current state of single-cell sequencing techniques, which can provide a new, more powerful and precise approach for analyzing effects of drugs on treated cells and tissues. Our review discusses single-cell whole genome/exome sequencing (scWGS/scWES), single-cell transcriptome sequencing (scRNA-seq), single-cell bisulfite sequencing (scBS), and multiple omics of single-cell sequencing. We also highlight the advantages and challenges of each of these approaches. Finally, we describe, elaborate and speculate the potential applications of single-cell sequencing for drug discovery and drug development. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Hayatsu, H; Yamashita, Y; Yui, S; Yamagata, Y; Tomita, K; Negishi, K
1982-10-25
When guanine-, adenine- and cytosine-nucleosides and nucleotides were treated with formaldehyde and then with bisulfite, stable N-sulfomethyl compounds were formed. N2-Sulfomethylguanine, N6-sulfomethyladenine, N4-sulfomthylcytosine and N6-sulfomethyl-9-beta-D-arabinofuranosyladenine were isolated as crystals and characterized. A guanine-specific sulfomethylation was brought about by treatment and denatured single-stranded DNA with formaldehyde and then with bisulfite at pH 7 and 4 degrees C. Since native double-stranded DNA was not modified by this treatment, this new method of modification is expected to be useful as a conformational probe for polynucleotides.
Hayatsu, H; Yamashita, Y; Yui, S; Yamagata, Y; Tomita, K; Negishi, K
1982-01-01
When guanine-, adenine- and cytosine-nucleosides and nucleotides were treated with formaldehyde and then with bisulfite, stable N-sulfomethyl compounds were formed. N2-Sulfomethylguanine, N6-sulfomethyladenine, N4-sulfomthylcytosine and N6-sulfomethyl-9-beta-D-arabinofuranosyladenine were isolated as crystals and characterized. A guanine-specific sulfomethylation was brought about by treatment and denatured single-stranded DNA with formaldehyde and then with bisulfite at pH 7 and 4 degrees C. Since native double-stranded DNA was not modified by this treatment, this new method of modification is expected to be useful as a conformational probe for polynucleotides. PMID:7177848
DMRfinder: efficiently identifying differentially methylated regions from MethylC-seq data.
Gaspar, John M; Hart, Ronald P
2017-11-29
DNA methylation is an epigenetic modification that is studied at a single-base resolution with bisulfite treatment followed by high-throughput sequencing. After alignment of the sequence reads to a reference genome, methylation counts are analyzed to determine genomic regions that are differentially methylated between two or more biological conditions. Even though a variety of software packages is available for different aspects of the bioinformatics analysis, they often produce results that are biased or require excessive computational requirements. DMRfinder is a novel computational pipeline that identifies differentially methylated regions efficiently. Following alignment, DMRfinder extracts methylation counts and performs a modified single-linkage clustering of methylation sites into genomic regions. It then compares methylation levels using beta-binomial hierarchical modeling and Wald tests. Among its innovative attributes are the analyses of novel methylation sites and methylation linkage, as well as the simultaneous statistical analysis of multiple sample groups. To demonstrate its efficiency, DMRfinder is benchmarked against other computational approaches using a large published dataset. Contrasting two replicates of the same sample yielded minimal genomic regions with DMRfinder, whereas two alternative software packages reported a substantial number of false positives. Further analyses of biological samples revealed fundamental differences between DMRfinder and another software package, despite the fact that they utilize the same underlying statistical basis. For each step, DMRfinder completed the analysis in a fraction of the time required by other software. Among the computational approaches for identifying differentially methylated regions from high-throughput bisulfite sequencing datasets, DMRfinder is the first that integrates all the post-alignment steps in a single package. Compared to other software, DMRfinder is extremely efficient and unbiased in this process. DMRfinder is free and open-source software, available on GitHub ( github.com/jsh58/DMRfinder ); it is written in Python and R, and is supported on Linux.
Aberrant methylation of the M-type phospholipase A2 receptor gene in leukemic cells
2012-01-01
Background The M-type phospholipase A2 receptor (PLA2R1) plays a crucial role in several signaling pathways and may act as tumor-suppressor. This study examined the expression and methylation of the PLA2R1 gene in Jurkat and U937 leukemic cell lines and its methylation in patients with myelodysplastic syndrome (MDS) or acute leukemia. Methods Sites of methylation of the PLA2R1 locus were identified by sequencing bisulfite-modified DNA fragments. Methylation specific-high resolution melting (MS-HRM) analysis was then carried out to quantify PLA2R1 methylation at 5`-CpG sites identified with differences in methylation between healthy control subjects and leukemic patients using sequencing of bisulfite-modified genomic DNA. Results Expression of PLA2R1 was found to be completely down-regulated in Jurkat and U937 cells, accompanied by complete methylation of PLA2R1 promoter and down-stream regions; PLA2R1 was re-expressed after exposure of cells to 5-aza-2´-deoxycytidine. MS-HRM analysis of the PLA2R1 locus in patients with different types of leukemia indicated an average methylation of 28.9% ± 17.8%, compared to less than 9% in control subjects. In MDS patients the extent of PLA2R1 methylation significantly increased with disease risk. Furthermore, measurements of PLA2R1 methylation appeared useful for predicting responsiveness to the methyltransferase inhibitor, azacitidine, as a pre-emptive treatment to avoid hematological relapse in patients with high-risk MDS or acute myeloid leukemia. Conclusions The study shows for the first time that PLA2R1 gene sequences are a target of hypermethylation in leukemia, which may have pathophysiological relevance for disease evolution in MDS and leukemogenesis. PMID:23217014
Li, Chengzhe; Ai, Rizi; Wang, Mengchi; Firestein, Gary S.; Wang, Wei
2016-01-01
Motivation: DNA methylation signatures in rheumatoid arthritis (RA) have been identified in fibroblast-like synoviocytes (FLS) with Illumina HumanMethylation450 array. Since <2% of CpG sites are covered by the Illumina 450K array and whole genome bisulfite sequencing is still too expensive for many samples, computationally predicting DNA methylation levels based on 450K data would be valuable to discover more RA-related genes. Results: We developed a computational model that is trained on 14 tissues with both whole genome bisulfite sequencing and 450K array data. This model integrates information derived from the similarity of local methylation pattern between tissues, the methylation information of flanking CpG sites and the methylation tendency of flanking DNA sequences. The predicted and measured methylation values were highly correlated with a Pearson correlation coefficient of 0.9 in leave-one-tissue-out cross-validations. Importantly, the majority (76%) of the top 10% differentially methylated loci among the 14 tissues was correctly detected using the predicted methylation values. Applying this model to 450K data of RA, osteoarthritis and normal FLS, we successfully expanded the coverage of CpG sites 18.5-fold and accounts for about 30% of all the CpGs in the human genome. By integrative omics study, we identified genes and pathways tightly related to RA pathogenesis, among which 12 genes were supported by triple evidences, including 6 genes already known to perform specific roles in RA and 6 genes as new potential therapeutic targets. Availability and implementation: The source code, required data for prediction, and demo data for test are freely available at: http://wanglab.ucsd.edu/star/LR450K/. Contact: wei-wang@ucsd.edu or gfirestein@ucsd.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26883487
Code of Federal Regulations, 2011 CFR
2011-07-01
... [Bisulfite liquor/surface condensers; BPT effluent limitations for papergrade sulfite facilities where blow... range of 5.0 to 9.0 at all times. Subpart E [Bisulfite liquor/barometric condensers; BPT effluent... [Acid sulfite liquor/surface condensers; BPT effluent limitations for papergrade sulfite facilities...
Code of Federal Regulations, 2010 CFR
2010-07-01
... [Bisulfite liquor/surface condensers; BPT effluent limitations for papergrade sulfite facilities where blow... range of 5.0 to 9.0 at all times. Subpart E [Bisulfite liquor/barometric condensers; BPT effluent... [Acid sulfite liquor/surface condensers; BPT effluent limitations for papergrade sulfite facilities...
Galetzka, Danuta; Hansmann, Tamara; El Hajj, Nady; Weis, Eva; Irmscher, Benjamin; Ludwig, Marco; Schneider-Rätzke, Brigitte; Kohlschmidt, Nicolai; Beyer, Vera; Bartsch, Oliver; Zechner, Ulrich; Spix, Claudia; Haaf, Thomas
2012-01-01
We describe monozygotic twins discordant for childhood leukemia and secondary thyroid carcinoma. We used bisulfite pyrosequencing to compare the constitutive promoter methylation of BRCA1 and several other tumor suppressor genes in primary fibroblasts. The affected twin displayed an increased BRCA1 methylation (12%), compared with her sister (3%). Subsequent bisulfite plasmid sequencing demonstrated that 13% (6 of 47) BRCA1 alleles were fully methylated in the affected twin, whereas her sister displayed only single CpG errors without functional implications. This between-twin methylation difference was also found in irradiated fibroblasts and untreated saliva cells. The BRCA1 epimutation may have originated by an early somatic event in the affected twin: approximately 25% of her body cells derived from different embryonic cell lineages carry one epigenetically inactivated BRCA1 allele. This epimutation was associated with reduced basal protein levels and a higher induction of BRCA1 after DNA damage. In addition, we performed a genome-wide microarray analysis of both sisters and found several copy number variations, i.e., heterozygous deletion and reduced expression of the RSPO3 gene in the affected twin. This monozygotic twin pair represents an impressive example of epigenetic somatic mosaicism, suggesting a role for constitutive epimutations, maybe along with de novo genetic alterations in recurrent tumor development.
Galetzka, Danuta; Hansmann, Tamara; El Hajj, Nady; Weis, Eva; Irmscher, Benjamin; Ludwig, Marco; Schneider-Rätzke, Brigitte; Kohlschmidt, Nicolai; Beyer, Vera; Bartsch, Oliver; Zechner, Ulrich; Spix, Claudia; Haaf, Thomas
2012-01-01
We describe monozygotic twins discordant for childhood leukemia and secondary thyroid carcinoma. We used bisulfite pyrosequencing to compare the constitutive promoter methylation of BRCA1 and several other tumor suppressor genes in primary fibroblasts. The affected twin displayed an increased BRCA1 methylation (12%), compared with her sister (3%). Subsequent bisulfite plasmid sequencing demonstrated that 13% (6 of 47) BRCA1 alleles were fully methylated in the affected twin, whereas her sister displayed only single CpG errors without functional implications. This between-twin methylation difference was also found in irradiated fibroblasts and untreated saliva cells. The BRCA1 epimutation may have originated by an early somatic event in the affected twin: approximately 25% of her body cells derived from different embryonic cell lineages carry one epigenetically inactivated BRCA1 allele. This epimutation was associated with reduced basal protein levels and a higher induction of BRCA1 after DNA damage. In addition, we performed a genome-wide microarray analysis of both sisters and found several copy number variations, i.e., heterozygous deletion and reduced expression of the RSPO3 gene in the affected twin. This monozygotic twin pair represents an impressive example of epigenetic somatic mosaicism, suggesting a role for constitutive epimutations, maybe along with de novo genetic alterations in recurrent tumor development. PMID:22207351
Kurita, Ryoji; Yanagisawa, Hiroyuki; Kamata, Tomoyuki; Kato, Dai; Niwa, Osamu
2017-06-06
This paper reports an on-chip electrochemical assessment of the DNA methylation status in genomic DNA on a conductive nanocarbon film electrode realized with combined bisulfite restriction analysis (COBRA). The film electrode consists of sp 2 and sp 3 hybrid bonds and is fabricated with an unbalanced magnetron (UBM) sputtering method. First, we studied the effect of the sp 2 /sp 3 ratio of the UBM nanocarbon film electrode with p-aminophenol, which is a major electro-active product of the labeling enzyme from p-aminophenol phosphate. The signal current for p-aminophenol increases as the sp 2 content in the UBM nanocarbon film electrode increases because of the π-π interaction between aromatic p-aminophenol and the graphene-like sp 2 structure. Furthermore, the capacitative current at the UBM nanocarbon film electrode was successfully reduced by about 1 order of magnitude thanks to the angstrom-level surface flatness. Therefore, a high signal-to-noise ratio was achieved compared with that of conventional electrodes. Then, after performing an ELISA-like hybridization assay with a restriction enzyme, we undertook an electrochemical evaluation of the cytosine methylation status in DNA by measuring the oxidation current derived from p-aminophenol. When the target cytosine in the analyte sequence is methylated (unmethylated), the restriction enzyme of HpyCH4IV is able (unable) to cleave the sequence, that is, the detection probe cannot (can) hybridize. We succeeded in estimating the methylation ratio at a site-specific CpG site from the peak current of a cyclic voltammogram obtained from a PCR product solution ranging from 0.01 to 1 nM.
Santa-Cruz, Diego; Pacienza, Natalia; Zilli, Carla; Pagano, Eduardo; Balestrasse, Karina; Yannarelli, Gustavo
2017-08-01
Heme oxygenase-1 (HO-1) plays a protective role against oxidative stress in plants. The mechanisms regulating its expression, however, remain unclear. Here we studied the methylation state of a GC rich HO-1 promoter region and the expression of several stress-related transcription factors (TFs) in soybean plants subjected to ultraviolet-B (UV-B) radiation. Genomic DNA and total RNA were isolated from leaves of plants irradiated with 7.5 and 15kJm-2 UV-B. A 304bp HO-1 promoter region was amplified by PCR from sodium bisulfite-treated DNA, cloned into pGEMT plasmid vector and evaluated by DNA sequencing. Bisulfite sequencing analysis showed similar HO-1 promoter methylation levels in control and UV-B-treated plants (C: 3.4±1.3%; 7.5: 2.6±0.5%; 15: 3.1±1.1%). Interestingly, HO-1 promoter was strongly unmethylated in control plants. Quantitative RT-PCR analysis of TFs showed that GmMYB177, GmMYBJ6, GmWRKY21, GmNAC11, GmNAC20 and GmGT2A but not GmWRK13 and GmDREB were induced by UV-B radiation. The expression of several TFs was also enhanced by hemin, a potent and specific HO inducer, inferring that they may mediate HO-1 up-regulation. These results suggest that soybean HO-1 gene expression is not epigenetically regulated. Moreover, the low level of HO-1 promoter methylation suggests that this antioxidant enzyme can rapidly respond to environmental stress. Finally, this study has identified some stress-related TFs involved in HO-1 up-regulation under UV-B radiation. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Epigenetic Inactivation of GALR1 in Head and Neck Cancer
Misawa, Kiyoshi; Ueda, Yo; Kanazawa, Takeharu; Misawa, Yuki; Jang, Ilwhan; Brenner, John Chadwick; Ogawa, Tetsuya; Takebayashi, Satoru; Grenman, Reidar A.; Herman, James G.; Mineta, Hiroyuki; Carey, Thomas E.
2011-01-01
Purpose One copy of the GALR1 locus on 18q is often deleted and expression is absent in some head and neck squamous cell carcinoma (HNSCC) cell lines. To determine if LOH and hypermethylation might silence the GALR1 gene, promoter methylation status and gene expression were assessed in a large panel of HNSCC cell lines and tumors. Experimental Design Promoter methylation of GALR1 in 72 cell lines and 100 primary tumor samples was analyzed using methylation-specific PCR (MSP). GALR1 expression and methylation status were analyzed further by real-time PCR and bisulfite sequencing analysis. Results The GALR1 promoter was fully or partially methylated in 38 of 72 HNSCC cell lines (52.7%) but not in the majority 18/20 (90.0%) of non-malignant lines. GALR1 methylation was also found in 38/100 (38%) primary tumor specimens. Methylation correlated with decreased GALR1 expression. In tumors methylation was significantly correlated with increased tumor size (P=0.0036), lymph-node status (P=0.0414), tumor stage (P=0.0037), cyclin D1 expression (P=0.0420), and p16 methylation (P=0.0494) and survival (P=0.045). Bisulfite sequencing of 36 CpG sites upstream of the transcription start site revealed that CpG methylation within transcription factor binding sites correlated with complete suppression of GALR1 mRNA. Treatment with TSA and 5-azacytidine restored GALR1 expression. In UM-SCC-23 cells that have total silencing of GALR1, exogenous GALR1 expression and stimulation with galanin suppressed cell proliferation. Conclusions Frequent promoter hypermethylation, gene silencing, association with prognosis, and growth suppression after re-expression support the hypothesis that GALR1 is a tumor suppressor gene in HNSCC. PMID:19047085
An, Dongshan; Dong, Xiaoli; An, Annie; Park, Hyung S.; Strous, Marc; Voordouw, Gerrit
2016-01-01
Sodium bisulfite (SBS) is used as an oxygen scavenger to decrease corrosion in pipelines transporting brackish subsurface water used in the production of bitumen by steam-assisted gravity drainage. Sequencing 16S rRNA gene amplicons has indicated that SBS addition increased the fraction of the sulfate-reducing bacteria (SRB) Desulfomicrobium, as well as of Desulfocapsa, which can also grow by disproportionating sulfite into sulfide, sulfur, and sulfate. SRB use cathodic H2, formed by reduction of aqueous protons at the iron surface, or use low potential electrons from iron and aqueous protons directly for sulfate reduction. In order to reveal the effects of SBS treatment in more detail, metagenomic analysis was performed with pipe-associated solids (PAS) scraped from a pipe section upstream (PAS-616P) and downstream (PAS-821TP) of the SBS injection point. A major SBS-induced change in microbial community composition and in affiliated hynL genes for the large subunit of [NiFe] hydrogenase was the appearance of sulfur-metabolizing Epsilonproteobacteria of the genera Sulfuricurvum and Sulfurovum. These are chemolithotrophs, which oxidize sulfide or sulfur with O2 or reduce sulfur with H2. Because O2 was absent, this class likely catalyzed reduction of sulfur (S0) originating from the metabolism of bisulfite with cathodic H2 (or low potential electrons and aqueous protons) originating from the corrosion of steel (Fe0). Overall this accelerates reaction of of S0 and Fe0 to form FeS, making this class a potentially powerful contributor to microbial corrosion. The PAS-821TP metagenome also had increased fractions of Deltaproteobacteria including the SRB Desulfomicrobium and Desulfocapsa. Altogether, SBS increased the fraction of hydrogen-utilizing Delta- and Epsilonproteobacteria in brackish-water-transporting pipelines, potentially stimulating anaerobic pipeline corrosion if dosed in excess of the intended oxygen scavenger function. PMID:26858705
Ghanem, Mashhour M; Abu-Lafi, Saleh A; Hallak, Hussein O
2013-01-01
A simple, specific, accurate, and stability-indicating method was developed and validated for the quantitative determination of menadione sodium bisulfite in the injectable solution formulation. The method is based on zwitterionic hydrophilic interaction liquid chromatography (ZIC-HILIC) coupled with a photodiode array detector. The desired separation was achieved on the ZIC-HILIC column (250 mm × 4.6 mm, 5 μm) at 25°C temperature. The optimized mobile phase consisted of an isocratic solvent mixture of 200mM ammonium acetate (NH4AC) solution and acetonitrile (ACN) (20:80; v/v) pH-adjusted to 5.7 by glacial acetic acid. The mobile phase was fixed at 0.5 ml/min and the analytes were monitored at 261 nm using a photodiode array detector. The effects of the chromatographic conditions on the peak retention, peak USP tailing factor, and column efficiency were systematically optimized. Forced degradation experiments were carried out by exposing menadione sodium bisulfite standard and the injectable solution formulation to thermal, photolytic, oxidative, and acid-base hydrolytic stress conditions. The degradation products were well-resolved from the main peak and the excipients, thus proving that the method is a reliable, stability-indicating tool. The method was validated as per ICH and USP guidelines (USP34/NF29) and found to be adequate for the routine quantitative estimation of menadione sodium bisulfite in commercially available menadione sodium bisulfite injectable solution dosage forms.
Wijetunga, N. Ari; Belbin, Thomas J.; Burk, Robert D.; Whitney, Kathleen; Abadi, Maria; Greally, John M.; Einstein, Mark H.; Schlecht, Nicolas F.
2016-01-01
Objective To conduct a comprehensive mapping of the genomic DNA methylation in CDKN2A, which codes for the p16INK4A and p14ARF proteins, and 14 of the most promising DNA methylation marker candidates previously reported to be associated with progression of low-grade cervical intraepithelial neoplasia (CIN1) to cervical cancer. Methods We analyzed DNA methylation in 68 HIV-seropositive and negative women with incident CIN1, CIN2, CIN3 and invasive cervical cancer, assaying 120 CpG dinucleotide sites spanning APC, CDH1, CDH13, CDKN2A, CDKN2B, DAPK1, FHIT, GSTP1, HIC1, MGMT, MLH1, RARB, RASSF1, TERT and TIMP3 using the Illumina Infinium array. Validation was performed using high resolution mapping of the target genes with HELP-tagging for 286 CpGs, followed by fine mapping of candidate genes with targeted bisulfite sequencing. We assessed for statistical differences in DNA methylation levels for each CpG loci assayed using univariate and multivariate methods correcting for multiple comparisons. Results In our discovery sample set, we identified dose dependent differences in DNA methylation with grade of disease in CDKN2A, APC, MGMT, MLH1 and HIC1, whereas single CpG locus differences between CIN2/3 and cancer groups were seen for CDH13, DAPK1 and TERT. Only those CpGs in the gene body of CDKN2A showed a monotonic increase in methylation between persistent CIN1, CIN2, CIN3 and cancers. Conclusion Our data suggests a novel link between early cervical disease progression and DNA methylation in a region downstream of the CDKN2A transcription start site that may lead to increased p16INK4A/p14ARF expression prior to development of malignant disease. PMID:27401842
Wijetunga, N Ari; Belbin, Thomas J; Burk, Robert D; Whitney, Kathleen; Abadi, Maria; Greally, John M; Einstein, Mark H; Schlecht, Nicolas F
2016-09-01
To conduct a comprehensive mapping of the genomic DNA methylation in CDKN2A, which codes for the p16(INK4A) and p14(ARF) proteins, and 14 of the most promising DNA methylation marker candidates previously reported to be associated with progression of low-grade cervical intraepithelial neoplasia (CIN1) to cervical cancer. We analyzed DNA methylation in 68 HIV-seropositive and negative women with incident CIN1, CIN2, CIN3 and invasive cervical cancer, assaying 120 CpG dinucleotide sites spanning APC, CDH1, CDH13, CDKN2A, CDKN2B, DAPK1, FHIT, GSTP1, HIC1, MGMT, MLH1, RARB, RASSF1, TERT and TIMP3 using the Illumina Infinium array. Validation was performed using high resolution mapping of the target genes with HELP-tagging for 286 CpGs, followed by fine mapping of candidate genes with targeted bisulfite sequencing. We assessed for statistical differences in DNA methylation levels for each CpG loci assayed using univariate and multivariate methods correcting for multiple comparisons. In our discovery sample set, we identified dose dependent differences in DNA methylation with grade of disease in CDKN2A, APC, MGMT, MLH1 and HIC1, whereas single CpG locus differences between CIN2/3 and cancer groups were seen for CDH13, DAPK1 and TERT. Only those CpGs in the gene body of CDKN2A showed a monotonic increase in methylation between persistent CIN1, CIN2, CIN3 and cancers. Our data suggests a novel link between early cervical disease progression and DNA methylation in a region downstream of the CDKN2A transcription start site that may lead to increased p16(INK4A)/p14(ARF) expression prior to development of malignant disease. Copyright © 2016 Elsevier Inc. All rights reserved.
Bottacini, Francesca; Morrissey, Ruth; Roberts, Richard John; James, Kieran; van Breen, Justin; Egan, Muireann; Lambert, Jolanda; van Limpt, Kees; Knol, Jan; Motherway, Mary O’Connell; van Sinderen, Douwe
2018-01-01
Abstract Bifidobacterium breve represents one of the most abundant bifidobacterial species in the gastro-intestinal tract of breast-fed infants, where their presence is believed to exert beneficial effects. In the present study whole genome sequencing, employing the PacBio Single Molecule, Real-Time (SMRT) sequencing platform, combined with comparative genome analysis allowed the most extensive genetic investigation of this taxon. Our findings demonstrate that genes encoding Restriction/Modification (R/M) systems constitute a substantial part of the B. breve variable gene content (or variome). Using the methylome data generated by SMRT sequencing, combined with targeted Illumina bisulfite sequencing (BS-seq) and comparative genome analysis, we were able to detect methylation recognition motifs and assign these to identified B. breve R/M systems, where in several cases such assignments were confirmed by restriction analysis. Furthermore, we show that R/M systems typically impose a very significant barrier to genetic accessibility of B. breve strains, and that cloning of a methyltransferase-encoding gene may overcome such a barrier, thus allowing future functional investigations of members of this species. PMID:29294107
The sampler developed by Charles and Cahill, with Dr. Vincent Seaman, consists of a custom-built glass mist chamber in which air enters at a high flow rate and carbonyls are trapped in a solution of sodium bisulfite as carbonyl-bisulfite adducts. This reaction is rapid (on ...
DNA hypomethylation of individual sequences in aborted cloned bovine fetuses.
Chen, Tao; Jiang, Yan; Zhang, Yan-Ling; Liu, Jing-He; Hou, Yi; Schatten, Heide; Chen, Da-Yuan; Sun, Qing-Yuan
2005-09-01
Cloned bovines have a much higher abortion rate than those derived in vivo. Available evidence indicates that inappropriate epigenetic reprogramming of donor nuclei is the primary cause of cloning failure. To gain a better understanding of the DNA methylation changes associated with the high abortion rate of cloned bovines, we examined the DNA methylation status of a repeated sequence (satellite I) and the promoter regions of two single-copy genes (interleukin 3/cytokeratin) in aborted cloned fetuses, aborted fetuses derived from artificial insemination (AI), cloned adults and AI adults by bisulfite sequencing and restriction enzyme analysis. Two of four aborted cloned fetuses show very low methylation levels in the two single-copy gene promoter regions. One of the two fetuses also showed undermethylated status in the satellite I sequence. The other two aborted cloned fetuses have similar methylation levels to those of aborted AI fetuses. However, no difference in methylation was observed between cloned adults and AI adults. Our results demonstrate for the first time the undermethylated status of individual sequences in aborted cloned fetuses. These findings suggest that aberrant DNA methylation may contribute to the developmental failure of cloned bovine fetuses.
Zeb2 Regulates Cell Fate at the Exit from Epiblast State in Mouse Embryonic Stem Cells
Stryjewska, Agata; Dries, Ruben; Pieters, Tim; Verstappen, Griet; Conidi, Andrea; Coddens, Kathleen; Francis, Annick; Umans, Lieve; van IJcken, Wilfred F. J.; Berx, Geert; van Grunsven, Leo A.; Grosveld, Frank G.; Goossens, Steven; Haigh, Jody J.
2016-01-01
Abstract In human embryonic stem cells (ESCs) the transcription factor Zeb2 regulates neuroectoderm versus mesendoderm formation, but it is unclear how Zeb2 affects the global transcriptional regulatory network in these cell‐fate decisions. We generated Zeb2 knockout (KO) mouse ESCs, subjected them as embryoid bodies (EBs) to neural and general differentiation and carried out temporal RNA‐sequencing (RNA‐seq) and reduced representation bisulfite sequencing (RRBS) analysis in neural differentiation. This shows that Zeb2 acts preferentially as a transcriptional repressor associated with developmental progression and that Zeb2 KO ESCs can exit from their naïve state. However, most cells in these EBs stall in an early epiblast‐like state and are impaired in both neural and mesendodermal differentiation. Genes involved in pluripotency, epithelial‐to‐mesenchymal transition (EMT), and DNA‐(de)methylation, including Tet1, are deregulated in the absence of Zeb2. The observed elevated Tet1 levels in the mutant cells and the knowledge of previously mapped Tet1‐binding sites correlate with loss‐of‐methylation in neural‐stimulating conditions, however, after the cells initially acquired the correct DNA‐methyl marks. Interestingly, cells from such Zeb2 KO EBs maintain the ability to re‐adapt to 2i + LIF conditions even after prolonged differentiation, while knockdown of Tet1 partially rescues their impaired differentiation. Hence, in addition to its role in EMT, Zeb2 is critical in ESCs for exit from the epiblast state, and links the pluripotency network and DNA‐methylation with irreversible commitment to differentiation. Stem Cells 2017;35:611–625 PMID:27739137
2012-01-01
Background DNA cytosine methylation is an epigenetic modification that has been implicated in many biological processes. However, large-scale epigenomic studies have been applied to very few plant species, and variability in methylation among specialized tissues and its relationship to gene expression is poorly understood. Results We surveyed DNA methylation from seven distinct tissue types (vegetative bud, male inflorescence [catkin], female catkin, leaf, root, xylem, phloem) in the reference tree species black cottonwood (Populus trichocarpa). Using 5-methyl-cytosine DNA immunoprecipitation followed by Illumina sequencing (MeDIP-seq), we mapped a total of 129,360,151 36- or 32-mer reads to the P. trichocarpa reference genome. We validated MeDIP-seq results by bisulfite sequencing, and compared methylation and gene expression using published microarray data. Qualitative DNA methylation differences among tissues were obvious on a chromosome scale. Methylated genes had lower expression than unmethylated genes, but genes with methylation in transcribed regions ("gene body methylation") had even lower expression than genes with promoter methylation. Promoter methylation was more frequent than gene body methylation in all tissues except male catkins. Male catkins differed in demethylation of particular transposable element categories, in level of gene body methylation, and in expression range of genes with methylated transcribed regions. Tissue-specific gene expression patterns were correlated with both gene body and promoter methylation. Conclusions We found striking differences among tissues in methylation, which were apparent at the chromosomal scale and when genes and transposable elements were examined. In contrast to other studies in plants, gene body methylation had a more repressive effect on transcription than promoter methylation. PMID:22251412
Zeb2 Regulates Cell Fate at the Exit from Epiblast State in Mouse Embryonic Stem Cells.
Stryjewska, Agata; Dries, Ruben; Pieters, Tim; Verstappen, Griet; Conidi, Andrea; Coddens, Kathleen; Francis, Annick; Umans, Lieve; van IJcken, Wilfred F J; Berx, Geert; van Grunsven, Leo A; Grosveld, Frank G; Goossens, Steven; Haigh, Jody J; Huylebroeck, Danny
2017-03-01
In human embryonic stem cells (ESCs) the transcription factor Zeb2 regulates neuroectoderm versus mesendoderm formation, but it is unclear how Zeb2 affects the global transcriptional regulatory network in these cell-fate decisions. We generated Zeb2 knockout (KO) mouse ESCs, subjected them as embryoid bodies (EBs) to neural and general differentiation and carried out temporal RNA-sequencing (RNA-seq) and reduced representation bisulfite sequencing (RRBS) analysis in neural differentiation. This shows that Zeb2 acts preferentially as a transcriptional repressor associated with developmental progression and that Zeb2 KO ESCs can exit from their naïve state. However, most cells in these EBs stall in an early epiblast-like state and are impaired in both neural and mesendodermal differentiation. Genes involved in pluripotency, epithelial-to-mesenchymal transition (EMT), and DNA-(de)methylation, including Tet1, are deregulated in the absence of Zeb2. The observed elevated Tet1 levels in the mutant cells and the knowledge of previously mapped Tet1-binding sites correlate with loss-of-methylation in neural-stimulating conditions, however, after the cells initially acquired the correct DNA-methyl marks. Interestingly, cells from such Zeb2 KO EBs maintain the ability to re-adapt to 2i + LIF conditions even after prolonged differentiation, while knockdown of Tet1 partially rescues their impaired differentiation. Hence, in addition to its role in EMT, Zeb2 is critical in ESCs for exit from the epiblast state, and links the pluripotency network and DNA-methylation with irreversible commitment to differentiation. Stem Cells 2017;35:611-625. © 2016 The Authors Stem Cells published by Wiley Periodicals, Inc. on behalf of AlphaMed Press.
2008-03-01
overcomes bias in bisulfite PCR methylation analysis. Biotechniques, 42: 48, 50, 52 passim, 2007. 46. Warnecke, P. M., Stirzaker, C., Melki , J. R...overcomes bias in bisulfite PCR methylation analysis. Biotechniques 2007;42:48, 50, 2 passim. 36. Warnecke PM, Stirzaker C, Melki JR, Millar DS, Paul CL
NGSmethDB 2017: enhanced methylomes and differential methylation.
Lebrón, Ricardo; Gómez-Martín, Cristina; Carpena, Pedro; Bernaola-Galván, Pedro; Barturen, Guillermo; Hackenberg, Michael; Oliver, José L
2017-01-04
The 2017 update of NGSmethDB stores whole genome methylomes generated from short-read data sets obtained by bisulfite sequencing (WGBS) technology. To generate high-quality methylomes, stringent quality controls were integrated with third-part software, adding also a two-step mapping process to exploit the advantages of the new genome assembly models. The samples were all profiled under constant parameter settings, thus enabling comparative downstream analyses. Besides a significant increase in the number of samples, NGSmethDB now includes two additional data-types, which are a valuable resource for the discovery of methylation epigenetic biomarkers: (i) differentially methylated single-cytosines; and (ii) methylation segments (i.e. genome regions of homogeneous methylation). The NGSmethDB back-end is now based on MongoDB, a NoSQL hierarchical database using JSON-formatted documents and dynamic schemas, thus accelerating sample comparative analyses. Besides conventional database dumps, track hubs were implemented, which improved database access, visualization in genome browsers and comparative analyses to third-part annotations. In addition, the database can be also accessed through a RESTful API. Lastly, a Python client and a multiplatform virtual machine allow for program-driven access from user desktop. This way, private methylation data can be compared to NGSmethDB without the need to upload them to public servers. Database website: http://bioinfo2.ugr.es/NGSmethDB. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Huang, Yong-Zhen; Sun, Jia-Jie; Zhang, Liang-Zhi; Li, Cong-Jun; Womack, James E.; Li, Zhuan-Jian; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Zhao, Xin; Chen, Hong
2014-01-01
DNA methylation is a key epigenetic modification in mammals and plays important roles in muscle development. We sampled longissimus dorsi muscle (LDM) from a well-known elite native breed of Chinese Qinchuan cattle living within the same environment but displaying distinct skeletal muscle at the fetal and adult stages. We generated and provided a genome-wide landscape of DNA methylomes and their relationship with mRNA and miRNA for fetal and adult muscle studies. Integration analysis revealed a total of 77 and 1,054 negatively correlated genes with methylation in the promoter and gene body regions, respectively, in both the fetal and adult bovine libraries. Furthermore, we identified expression patterns of high-read genes that exhibit a negative correlation between methylation and expression from nine different tissues at multiple developmental stages of bovine muscle-related tissue or organs. In addition, we validated the MeDIP-Seq results by bisulfite sequencing PCR (BSP) in some of the differentially methylated promoters. Together, these results provide valuable data for future biomedical research and genomic and epigenomic studies of bovine skeletal muscle that may help uncover the molecular basis underlying economically valuable traits in cattle. This comprehensive map also provides a solid basis for exploring the epigenetic mechanisms of muscle growth and development. PMID:25306978
Resin in bisulfite pulp from Pinus radiata wood and its relationship to pitch troubles
P.J. Nelson; Richard W. Hemingway
1971-01-01
The resin content of bisulfite pulp from Pinus radiata D. Don was determined at various stages in its manufacture and the changes in the composition of the resin studied. About 50% of the resin present in the wood was removed during cooking, an additional 11% by blowpit washing, and a further 8% by screening. Resin acids and fatty acids were...
Perrier, Jean-Philippe; Sellem, Eli; Prézelin, Audrey; Gasselin, Maxime; Jouneau, Luc; Piumi, François; Al Adhami, Hala; Weber, Michaël; Fritz, Sébastien; Boichard, Didier; Le Danvic, Chrystelle; Schibler, Laurent; Jammes, Hélène; Kiefer, Hélène
2018-05-29
Spermatozoa have a remarkable epigenome in line with their degree of specialization, their unique nature and different requirements for successful fertilization. Accordingly, perturbations in the establishment of DNA methylation patterns during male germ cell differentiation have been associated with infertility in several species. While bull semen is widely used in artificial insemination, the literature describing DNA methylation in bull spermatozoa is still scarce. The purpose of this study was therefore to characterize the bull sperm methylome relative to both bovine somatic cells and the sperm of other mammals through a multiscale analysis. The quantification of DNA methylation at CCGG sites using luminometric methylation assay (LUMA) highlighted the undermethylation of bull sperm compared to the sperm of rams, stallions, mice, goats and men. Total blood cells displayed a similarly high level of methylation in bulls and rams, suggesting that undermethylation of the bovine genome was specific to sperm. Annotation of CCGG sites in different species revealed no striking bias in the distribution of genome features targeted by LUMA that could explain undermethylation of bull sperm. To map DNA methylation at a genome-wide scale, bull sperm was compared with bovine liver, fibroblasts and monocytes using reduced representation bisulfite sequencing (RRBS) and immunoprecipitation of methylated DNA followed by microarray hybridization (MeDIP-chip). These two methods exhibited differences in terms of genome coverage, and consistently, two independent sets of sequences differentially methylated in sperm and somatic cells were identified for RRBS and MeDIP-chip. Remarkably, in the two sets most of the differentially methylated sequences were hypomethylated in sperm. In agreement with previous studies in other species, the sequences that were specifically hypomethylated in bull sperm targeted processes relevant to the germline differentiation program (piRNA metabolism, meiosis, spermatogenesis) and sperm functions (cell adhesion, fertilization), as well as satellites and rDNA repeats. These results highlight the undermethylation of bull spermatozoa when compared with both bovine somatic cells and the sperm of other mammals, and raise questions regarding the dynamics of DNA methylation in bovine male germline. Whether sperm undermethylation has potential interactions with structural variation in the cattle genome may deserve further attention.
Epigenetic silencing of MicroRNA-503 regulates FANCA expression in non-small cell lung cancer cell.
Li, Ning; Zhang, Fangfang; Li, Suyun; Zhou, Suzhen
2014-02-21
It is reported that MicroRNA-503 (miR-503) regulates cell apoptosis, and thus modulates the resistance of non-small cell lung cancer cells (NSCLC) to cisplatin. However, the exact role of miR-503 in NSCLC remains unknown. In the present study, the level of miR-503 expression in NSCLC was evaluated using realtime PCR, and the DNA methylation status within miR-503 promoter was analyzed by Combined Bisulfite Restriction Analysis (COBRA) or bisulfite-treated DNA sequencing assays (BSP). We found that the expression of miR-503 was significantly decreased in NSCLC tissues compared to normal tissues. A statistically significant inverse association was found between miR-503 methylation status and expression of the miR-503 in tumor tissues (P<0.001), and expression of miR-503 was restored by the demethylating agent 5-aza-2'-deoxycytidine, suggesting that methylation was associated with the transcriptional silencing. Then, we show that miR-503 targets a homologous DNA region in the 3'-UTR region of the Fanconi anemia complementation group A protein (FANCA) gene and represses its expression at the transcriptional level. Taken together, our results suggest that miR-503 regulates the resistance of non-small cell lung cancer cells to cisplatin at least in part by targeting FANCA. Copyright © 2014 Elsevier Inc. All rights reserved.
Lin, Lin; Liu, Yong; Xu, Fengping; Huang, Jinrong; Daugaard, Tina Fuglsang; Petersen, Trine Skov; Hansen, Bettina; Ye, Lingfei; Zhou, Qing; Fang, Fang; Yang, Ling; Li, Shengting; Fløe, Lasse; Jensen, Kristopher Torp; Shrock, Ellen; Chen, Fang; Yang, Huanming; Wang, Jian; Liu, Xin; Xu, Xun; Bolund, Lars; Nielsen, Anders Lade; Luo, Yonglun
2018-01-01
Abstract Background Fusion of DNA methyltransferase domains to the nuclease-deficient clustered regularly interspaced short palindromic repeat (CRISPR) associated protein 9 (dCas9) has been used for epigenome editing, but the specificities of these dCas9 methyltransferases have not been fully investigated. Findings We generated CRISPR-guided DNA methyltransferases by fusing the catalytic domain of DNMT3A or DNMT3B to the C terminus of the dCas9 protein from Streptococcus pyogenes and validated its on-target and global off-target characteristics. Using targeted quantitative bisulfite pyrosequencing, we prove that dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B can efficiently methylate the CpG dinucleotides flanking its target sites at different genomic loci (uPA and TGFBR3) in human embryonic kidney cells (HEK293T). Furthermore, we conducted whole genome bisulfite sequencing (WGBS) to address the specificity of our dCas9 methyltransferases. WGBS revealed that although dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B did not cause global methylation changes, a substantial number (more than 1000) of the off-target differentially methylated regions (DMRs) were identified. The off-target DMRs, which were hypermethylated in cells expressing dCas9 methyltransferase and guide RNAs, were predominantly found in promoter regions, 5΄ untranslated regions, CpG islands, and DNase I hypersensitivity sites, whereas unexpected hypomethylated off-target DMRs were significantly enriched in repeated sequences. Through chromatin immunoprecipitation with massive parallel DNA sequencing analysis, we further revealed that these off-target DMRs were weakly correlated with dCas9 off-target binding sites. Using quantitative polymerase chain reaction, RNA sequencing, and fluorescence reporter cells, we also found that dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B can mediate transient inhibition of gene expression, which might be caused by dCas9-mediated de novo DNA methylation as well as interference with transcription. Conclusion Our results prove that dCas9 methyltransferases cause efficient RNA-guided methylation of specific endogenous CpGs. However, there is significant off-target methylation indicating that further improvements of the specificity of CRISPR-dCas9 based DNA methylation modifiers are required. PMID:29635374
Lin, Lin; Liu, Yong; Xu, Fengping; Huang, Jinrong; Daugaard, Tina Fuglsang; Petersen, Trine Skov; Hansen, Bettina; Ye, Lingfei; Zhou, Qing; Fang, Fang; Yang, Ling; Li, Shengting; Fløe, Lasse; Jensen, Kristopher Torp; Shrock, Ellen; Chen, Fang; Yang, Huanming; Wang, Jian; Liu, Xin; Xu, Xun; Bolund, Lars; Nielsen, Anders Lade; Luo, Yonglun
2018-03-01
Fusion of DNA methyltransferase domains to the nuclease-deficient clustered regularly interspaced short palindromic repeat (CRISPR) associated protein 9 (dCas9) has been used for epigenome editing, but the specificities of these dCas9 methyltransferases have not been fully investigated. We generated CRISPR-guided DNA methyltransferases by fusing the catalytic domain of DNMT3A or DNMT3B to the C terminus of the dCas9 protein from Streptococcus pyogenes and validated its on-target and global off-target characteristics. Using targeted quantitative bisulfite pyrosequencing, we prove that dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B can efficiently methylate the CpG dinucleotides flanking its target sites at different genomic loci (uPA and TGFBR3) in human embryonic kidney cells (HEK293T). Furthermore, we conducted whole genome bisulfite sequencing (WGBS) to address the specificity of our dCas9 methyltransferases. WGBS revealed that although dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B did not cause global methylation changes, a substantial number (more than 1000) of the off-target differentially methylated regions (DMRs) were identified. The off-target DMRs, which were hypermethylated in cells expressing dCas9 methyltransferase and guide RNAs, were predominantly found in promoter regions, 5΄ untranslated regions, CpG islands, and DNase I hypersensitivity sites, whereas unexpected hypomethylated off-target DMRs were significantly enriched in repeated sequences. Through chromatin immunoprecipitation with massive parallel DNA sequencing analysis, we further revealed that these off-target DMRs were weakly correlated with dCas9 off-target binding sites. Using quantitative polymerase chain reaction, RNA sequencing, and fluorescence reporter cells, we also found that dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B can mediate transient inhibition of gene expression, which might be caused by dCas9-mediated de novo DNA methylation as well as interference with transcription. Our results prove that dCas9 methyltransferases cause efficient RNA-guided methylation of specific endogenous CpGs. However, there is significant off-target methylation indicating that further improvements of the specificity of CRISPR-dCas9 based DNA methylation modifiers are required.
Analysis of the regulation of viral transcription.
Gloss, Bernd; Kalantari, Mina; Bernard, Hans-Ulrich
2005-01-01
Despite the small genomes and number of genes of papillomaviruses, regulation of their transcription is very complex and governed by numerous transcription factors, cis-responsive elements, and epigenetic phenomena. This chapter describes the strategies of how one can approach a systematic analysis of these factors, elements, and mechanisms. From the numerous different techniques useful for studying transcription, we describe in detail three selected protocols of approaches that have been relevant in shaping our knowledge of human papillomavirus transcription. These are DNAse I protection ("footprinting") for location of transcription-factor binding sites, electrophoretic mobility shifts ("gelshifts") for analysis of bound transcription factors, and bisulfite sequencing for analysis of DNA methylation as a prerequisite for epigenetic transcriptional regulation.
Deng, Jingyu; Liang, Han; Dong, Qiuping; Hou, Yachao; Xie, Xingming; Yu, Jun; Fan, Daiming; Hao, Xishan
2014-07-01
The methylation of B-cell CLL/lymphoma 6 member B (BCL6B) DNA promoter was detected in several malignancies. Here, we quantitatively detect the methylated status of CpG sites of BCL6B DNA promoter of 459 patients with gastric cancer (GC) by using bisulfite gene sequencing. We show that patients with three or more methylated CpG sites in the BCL6B promoter were significantly associated with poor survival. Furthermore, by using the Akaike information criterion value calculation, we show that the methylated count of BCL6B promoter was identified to be the optimal prognostic predictor of GC patients.
László, Brigitta; Ferenczi, Annamária; Madar, László; Gyöngyösi, Eszter; Szalmás, Anita; Szakács, Levente; Veress, György; Kónya, József
2016-08-01
The mechanisms that regulate papillomavirus gene expression include DNA methylation. The transcription of papillomavirus oncogenes E6 and E7 is controlled by certain regulatory elements in the LCR, which include binding sites for the E2 protein, a viral regulator of oncogene expression. In HPV-31-infected exfoliated cervical cells, the CpG methylation of the entire LCR was determined by next-generation sequencing after bisulfite modification. Six of the 22 cases had methylated CpG sites in the HPV-31 LCR, including position 7479 and/or 7485, at the promoter distal E2 binding site, thus suggesting a potential regulatory mechanism for papillomavirus transcription.
Faul, Margaret; Larsen, Rob; Levinson, Adam; Tedrow, Jason; Vounatsos, Filisaty
2013-02-15
Aldehyde-bisulfite adducts dervied from unstable parent aldehydes were reductively alkylated in a direct fashion with a variety of amines. This approach features the use of 2-picoline borane as the reducing agent and a protic solvent for the reaction media and has been successfully applied to the synthesis of a DPP-IV inhibitor and a variety of other amines.
Shi, Yan
2014-02-01
Degradation of fermentable monosaccharides is one of the primary concerns for acid prehydrolysis of lignocellulosic biomass. Recently, in our research on degradation of pure monosaccharides in aqueous SO₂ solution by gas chromatography (GC) analysis, we found that detected yield was not actual yield of each monosaccharide due to the existence of sugar-bisulfite adducts, and a new method was developed by ourselves which led to accurate detection of recovery yield of each monosaccharide in aqueous SO₂ solution by GC analysis. By the use of this method, degradation of each monosaccharide in aqueous SO₂ was investigated and results showed that sugar-bisulfite adducts have different inhibiting effect on degradation of each monosaccharide in aqueous SO₂ because of their different stability. In addition, NMR testing also demonstrated possible existence of reaction between conjugated based HSO₃(-) and aldehyde group of sugars in acid system.
Validation of SCT Methylation as a Hallmark Biomarker for Lung Cancers.
Zhang, Yu-An; Ma, Xiaotu; Sathe, Adwait; Fujimoto, Junya; Wistuba, Ignacio; Lam, Stephen; Yatabe, Yasushi; Wang, Yi-Wei; Stastny, Victor; Gao, Boning; Larsen, Jill E; Girard, Luc; Liu, Xiaoyun; Song, Kai; Behrens, Carmen; Kalhor, Neda; Xie, Yang; Zhang, Michael Q; Minna, John D; Gazdar, Adi F
2016-03-01
The human secretin gene (SCT) encodes secretin, a hormone with limited tissue distribution. Analysis of the 450k methylation array data in The Cancer Genome Atlas (TCGA) indicated that the SCT promoter region is differentially hypermethylated in lung cancer. Our purpose was to validate SCT methylation as a potential biomarker for lung cancer. We analyzed data from TCGA and developed and applied SCT-specific bisulfite DNA sequencing and quantitative methylation-specific polymerase chain reaction assays. The analyses of TCGA 450K data for 801 samples showed that SCT hypermethylation has an area under the curve (AUC) value greater than 0.98 that can be used to distinguish lung adenocarcinomas or squamous cell carcinomas from nonmalignant lung tissue. Bisulfite sequencing of lung cancer cell lines and normal blood cells allowed us to confirm that SCT methylation is highly discriminative. By applying a quantitative methylation-specific polymerase chain reaction assay, we found that SCT hypermethylation is frequently detected in all major subtypes of malignant non-small cell lung cancer (AUC = 0.92, n = 108) and small cell lung cancer (AUC = 0.93, n = 40) but is less frequent in lung carcinoids (AUC = 0.54, n = 20). SCT hypermethylation appeared in samples of lung carcinoma in situ during multistage pathogenesis and increased in invasive samples. Further analyses of TCGA 450k data showed that SCT hypermethylation is highly discriminative in most other types of malignant tumors but less frequent in low-grade malignant tumors. The only normal tissue with a high level of methylation was the placenta. Our findings demonstrated that SCT methylation is a highly discriminative biomarker for lung and other malignant tumors, is less frequent in low-grade malignant tumors (including lung carcinoids), and appears at the carcinoma in situ stage. Copyright © 2015 International Association for the Study of Lung Cancer. Published by Elsevier Inc. All rights reserved.
Jenkinson, Garrett; Abante, Jordi; Feinberg, Andrew P; Goutsias, John
2018-03-07
DNA methylation is a stable form of epigenetic memory used by cells to control gene expression. Whole genome bisulfite sequencing (WGBS) has emerged as a gold-standard experimental technique for studying DNA methylation by producing high resolution genome-wide methylation profiles. Statistical modeling and analysis is employed to computationally extract and quantify information from these profiles in an effort to identify regions of the genome that demonstrate crucial or aberrant epigenetic behavior. However, the performance of most currently available methods for methylation analysis is hampered by their inability to directly account for statistical dependencies between neighboring methylation sites, thus ignoring significant information available in WGBS reads. We present a powerful information-theoretic approach for genome-wide modeling and analysis of WGBS data based on the 1D Ising model of statistical physics. This approach takes into account correlations in methylation by utilizing a joint probability model that encapsulates all information available in WGBS methylation reads and produces accurate results even when applied on single WGBS samples with low coverage. Using the Shannon entropy, our approach provides a rigorous quantification of methylation stochasticity in individual WGBS samples genome-wide. Furthermore, it utilizes the Jensen-Shannon distance to evaluate differences in methylation distributions between a test and a reference sample. Differential performance assessment using simulated and real human lung normal/cancer data demonstrate a clear superiority of our approach over DSS, a recently proposed method for WGBS data analysis. Critically, these results demonstrate that marginal methods become statistically invalid when correlations are present in the data. This contribution demonstrates clear benefits and the necessity of modeling joint probability distributions of methylation using the 1D Ising model of statistical physics and of quantifying methylation stochasticity using concepts from information theory. By employing this methodology, substantial improvement of DNA methylation analysis can be achieved by effectively taking into account the massive amount of statistical information available in WGBS data, which is largely ignored by existing methods.
Kitamoto, Takuya; Kitamoto, Aya; Ogawa, Yuji; Honda, Yasushi; Imajo, Kento; Saito, Satoru; Yoneda, Masato; Nakamura, Takahiro; Nakajima, Atsushi; Hotta, Kikuko
2015-08-01
The pathogenesis of non-alcoholic fatty liver disease (NAFLD) is affected by epigenetic factors as well as by genetic variation. We performed targeted-bisulfite sequencing to determine the levels of DNA methylation of 4 CpG islands (CpG99, CpG71, CpG26, and CpG101) in the regulatory regions of PNPLA3, SAMM50, PARVB variant 1, and PARVB variant 2, respectively. We compared the levels of methylation of DNA in the livers of the first and second sets of patients with mild (fibrosis stages 0 and 1) or advanced (fibrosis stages 2 to 4) NAFLD and in those of patients with mild (F0 to F2) or advanced (F3 and F4) chronic hepatitis C infection. The hepatic mRNA levels of PNPLA3, SAMM50, and PARVB were measured using qPCR. CpG26, which resides in the regulatory region of PARVB variant 1, was markedly hypomethylated in the livers of patients with advanced NAFLD. Conversely, CpG99 in the regulatory region of PNPLA3 was substantially hypermethylated in these patients. These differences in DNA methylation were replicated in a second set of patients with NAFLD or chronic hepatitis C. PNPLA3 mRNA levels in the liver of the same section of a biopsy specimen used for genomic DNA preparation were lower in patients with advanced NAFLD compared with those with mild NAFLD and correlated inversely with CpG99 methylation in liver DNA. Moreover, the levels of CpG99 methylation and PNPLA3 mRNA were affected by the rs738409 genotype. Hypomethylation of CpG26 and hypermethylation of CpG99 may contribute to the severity of fibrosis in patients with NAFLD or chronic hepatitis C infection. Copyright © 2015 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
Zhang, Qian; Sun, Xiaofang; Xiao, Xinhua; Zheng, Jia; Li, Ming; Yu, Miao; Ping, Fan; Wang, Zhixin; Qi, Cuijuan; Wang, Tong; Wang, Xiaojing
2017-01-01
Maternal undernutrition is linked with an elevated risk of diabetes mellitus in offspring regardless of the postnatal dietary status. This is also found in maternal micro-nutrition deficiency, especial chromium which is a key glucose regulator. We investigated whether maternal chromium restriction contributes to the development of diabetes in offspring by affecting DNA methylation status in liver tissue. After being mated with control males, female weanling 8-week-old C57BL mice were fed a control diet (CON, 1.19 mg chromium/kg diet) or a low chromium diet (LC, 0.14 mg chromium/kg diet) during pregnancy and lactation. After weaning, some offspring were shifted to the other diet (CON-LC, or LC-CON), while others remained on the same diet (CON-CON, or LC-LC) for 29 weeks. Fasting blood glucose, serum insulin, and oral glucose tolerance test was performed to evaluate the glucose metabolism condition. Methylation differences in liver from the LC-CON group and CON-CON groups were studied by using a DNA methylation array. Bisulfite sequencing was carried out to validate the results of the methylation array. Maternal chromium limitation diet increased the body weight, blood glucose, and serum insulin levels. Even when switched to the control diet after weaning, the offspring also showed impaired glucose tolerance and insulin resistance. DNA methylation profiling of the offspring livers revealed 935 differentially methylated genes in livers of the maternal chromium restriction diet group. Pathway analysis identified the insulin signaling pathway was the main process affected by hypermethylated genes. Bisulfite sequencing confirmed that some genes in insulin signaling pathway were hypermethylated in livers of the LC-CON and LC-LC group. Accordingly, the expression of genes in insulin signaling pathway was downregulated. There findings suggest that maternal chromium restriction diet results in glucose intolerance in male offspring through alterations in DNA methylation which is associated with the insulin signaling pathway in the mice livers. PMID:28072825
Zhang, Qian; Sun, Xiaofang; Xiao, Xinhua; Zheng, Jia; Li, Ming; Yu, Miao; Ping, Fan; Wang, Zhixin; Qi, Cuijuan; Wang, Tong; Wang, Xiaojing
2017-01-01
Maternal undernutrition is linked with an elevated risk of diabetes mellitus in offspring regardless of the postnatal dietary status. This is also found in maternal micro-nutrition deficiency, especial chromium which is a key glucose regulator. We investigated whether maternal chromium restriction contributes to the development of diabetes in offspring by affecting DNA methylation status in liver tissue. After being mated with control males, female weanling 8-week-old C57BL mice were fed a control diet (CON, 1.19 mg chromium/kg diet) or a low chromium diet (LC, 0.14 mg chromium/kg diet) during pregnancy and lactation. After weaning, some offspring were shifted to the other diet (CON-LC, or LC-CON), while others remained on the same diet (CON-CON, or LC-LC) for 29 weeks. Fasting blood glucose, serum insulin, and oral glucose tolerance test was performed to evaluate the glucose metabolism condition. Methylation differences in liver from the LC-CON group and CON-CON groups were studied by using a DNA methylation array. Bisulfite sequencing was carried out to validate the results of the methylation array. Maternal chromium limitation diet increased the body weight, blood glucose, and serum insulin levels. Even when switched to the control diet after weaning, the offspring also showed impaired glucose tolerance and insulin resistance. DNA methylation profiling of the offspring livers revealed 935 differentially methylated genes in livers of the maternal chromium restriction diet group. Pathway analysis identified the insulin signaling pathway was the main process affected by hypermethylated genes. Bisulfite sequencing confirmed that some genes in insulin signaling pathway were hypermethylated in livers of the LC-CON and LC-LC group. Accordingly, the expression of genes in insulin signaling pathway was downregulated. There findings suggest that maternal chromium restriction diet results in glucose intolerance in male offspring through alterations in DNA methylation which is associated with the insulin signaling pathway in the mice livers.
Chang, Guimin; Xu, Shuping; Dhir, Rajiv; Chandran, Uma; O'Keefe, Denise S; Greenberg, Norman M; Gingrich, Jeffrey R
2010-11-15
Cell adhesion molecules (CADM) comprise a newly identified protein family whose functions include cell polarity maintenance and tumor suppression. CADM-1, CADM-3, and CADM-4 have been shown to act as tumor suppressor genes in multiple cancers including prostate cancer. However, CADM-2 expression has not been determined in prostate cancer. The CADM-2 gene was cloned and characterized and its expression in human prostatic cell lines and cancer specimens was analyzed by reverse transcription-PCR and an immunohistochemical tissue array, respectively. The effects of adenovirus-mediated CADM-2 expression on prostate cancer cells were also investigated. CADM-2 promoter methylation was evaluated by bisulfite sequencing and methylation-specific PCR. We report the initial characterization of CADM-2 isoforms: CADM-2a and CADM-2b, each with separate promoters, in human chromosome 3p12.1. Prostate cancer cell lines, LNCaP and DU145, expressed negligible CADM-2a relative to primary prostate tissue and cell lines, RWPE-1 and PPC-1, whereas expression of CADM-2b was maintained. Using immunohistochemistry, tissue array results from clinical specimens showed statistically significant decreased expression in prostate carcinoma compared with normal donor prostate, benign prostatic hyperplasia, prostatic intraepithelial neoplasia, and normal tissue adjacent to tumor (P < 0.001). Adenovirus-mediated CADM-2a expression suppressed DU145 cell proliferation in vitro and colony formation in soft agar. The decrease in CADM-2a mRNA in cancer cell lines correlated with promoter region hypermethylation as determined by bisulfite sequencing and methylation-specific PCR. Accordingly, treatment of cells with the demethylating agent 5-aza-2'-deoxycytidine alone or in combination with the histone deacetylase inhibitor trichostatin A resulted in the reactivation of CADM-2a expression. CADM-2a protein expression is significantly reduced in prostate cancer. Its expression is regulated in part by promoter methylation and implicates CADM-2 as a previously unrecognized tumor suppressor gene in a proportion of human prostate cancers. ©2010 AACR.
Correlation between ZBED6 Gene Upstream CpG Island methylation and mRNA expression in cattle.
Huang, Yong-Zhen; Zhang, Zi-Jing; He, Hua; Cao, Xiu-Kai; Song, Cheng-Chuang; Liu, Kun-Peng; Lan, Xian-Yong; Lei, Chu-Zhao; Qi, Xing-Lei; Bai, Yue-Yu; Chen, Hong
2017-04-03
DNA methylation is essential for the regulation of gene expression and important roles in muscle development. To assess the extent of epigenetic modifications and gene expression on the differentially methylated region (DMR) in ZBED6, we simultaneously examined DNA methylation and expression in six tissues from two different developmental stages (fetal bovine and adult bovine). The DNA methylation pattern was compared using bisulfite sequencing polymerase chain reaction (BSP) and combined bisulfite restriction analysis (COBRA). The result of quantitative real-time PCR (qPCR) analysis showed that ZBED6 has a broad tissue distribution and is highly expressed in adult bovine (P < 0.05 or P < 0.01). The DNA methylation level was significantly different in liver, lung and spleen between the two cattle groups (P < 0.05 or P < 0.01). The adult bovine group exhibited a significantly higher mRNA level and lower DNA methylation level than the fetal bovine group in liver, lung, and spleen. No significant association was detected between DNA methylation level and muscle, heart, and kidney at two different stages. In this study, the statistical analyses indicated that DNA methylation patterns are associated with mRNA level in some tissues, these results may be a useful parameter to investigate muscle developmental in cattle and as a model for studies in other species, potentially contributing to an improvement of growth performance selection in beef cattle breeding program.
Green, Benjamin B; Houseman, E Andres; Johnson, Kevin C; Guerin, Dylan J; Armstrong, David A; Christensen, Brock C; Marsit, Carmen J
2016-08-01
The conversion of cytosine to 5-methylcystosine (5mC) is an important regulator of gene expression. 5mC may be enzymatically converted to 5-hydroxymethylcytosine (5hmC), with a potentially distinct regulatory function. We sought to investigate these cytosine modifications and their effect on gene expression by parallel processing of genomic DNA using bisulfite and oxidative bisulfite conversion in conjunction with RNA sequencing. Although values of 5hmC across the placental genome were generally low, we identified ∼21,000 loci with consistently elevated levels of 5-hydroxymethycytosine. Absence of 5hmC was observed in CpG islands and, to a greater extent, in non-CpG island-associated regions. 5hmC was enriched within poised enhancers, and depleted within active enhancers, as defined by H3K27ac and H3K4me1 measurements. 5hmC and 5mC were significantly elevated in transcriptionally silent genes when compared with actively transcribed genes. 5hmC was positively associated with transcription in actively transcribed genes only. Our data suggest that dynamic cytosine regulation, associated with transcription, provides the most complete epigenomic landscape of the human placenta, and will be useful for future studies of the placental epigenome.-Green, B. B., Houseman, E. A., Johnson, K. C., Guerin, D. J., Armstrong, D. A., Christensen, B. C., Marsit, C. J. Hydroxymethylation is uniquely distributed within term placenta, and is associated with gene expression. © FASEB.
Green, Benjamin B.; Houseman, E. Andres; Johnson, Kevin C.; Guerin, Dylan J.; Armstrong, David A.; Christensen, Brock C.; Marsit, Carmen J.
2016-01-01
The conversion of cytosine to 5-methylcystosine (5mC) is an important regulator of gene expression. 5mC may be enzymatically converted to 5-hydroxymethylcytosine (5hmC), with a potentially distinct regulatory function. We sought to investigate these cytosine modifications and their effect on gene expression by parallel processing of genomic DNA using bisulfite and oxidative bisulfite conversion in conjunction with RNA sequencing. Although values of 5hmC across the placental genome were generally low, we identified ∼21,000 loci with consistently elevated levels of 5-hydroxymethycytosine. Absence of 5hmC was observed in CpG islands and, to a greater extent, in non-CpG island–associated regions. 5hmC was enriched within poised enhancers, and depleted within active enhancers, as defined by H3K27ac and H3K4me1 measurements. 5hmC and 5mC were significantly elevated in transcriptionally silent genes when compared with actively transcribed genes. 5hmC was positively associated with transcription in actively transcribed genes only. Our data suggest that dynamic cytosine regulation, associated with transcription, provides the most complete epigenomic landscape of the human placenta, and will be useful for future studies of the placental epigenome.—Green, B. B., Houseman, E. A., Johnson, K. C., Guerin, D. J., Armstrong, D. A., Christensen, B. C., Marsit, C. J. Hydroxymethylation is uniquely distributed within term placenta, and is associated with gene expression. PMID:27118675
Li, Yanwei; Ding, Xianlong; Wang, Xuan; He, Tingting; Zhang, Hao; Yang, Longshu; Wang, Tanliu; Chen, Linfeng; Gai, Junyi; Yang, Shouping
2017-08-10
DNA methylation is an important epigenetic modification. It can regulate the expression of many key genes without changing the primary structure of the genomic DNA, and plays a vital role in the growth and development of the organism. The genome-wide DNA methylation profile of the cytoplasmic male sterile (CMS) line in soybean has not been reported so far. In this study, genome-wide comparative analysis of DNA methylation between soybean CMS line NJCMS5A and its maintainer NJCMS5B was conducted by whole-genome bisulfite sequencing. The results showed 3527 differentially methylated regions (DMRs) and 485 differentially methylated genes (DMGs), including 353 high-credible methylated genes, 56 methylated genes coding unknown protein and 76 novel methylated genes with no known function were identified. Among them, 25 DMRs were further validated that the genome-wide DNA methylation data were reliable through bisulfite treatment, and 9 DMRs were confirmed the relationship between DNA methylation and gene expression by qRT-PCR. Finally, 8 key DMGs possibly associated with soybean CMS were identified. Genome-wide DNA methylation profile of the soybean CMS line NJCMS5A and its maintainer NJCMS5B was obtained for the first time. Several specific DMGs which participated in pollen and flower development were further identified to be probably associated with soybean CMS. This study will contribute to further understanding of the molecular mechanism behind soybean CMS.
Xu, Ning; Kwon, Soonil; Abbott, David H; Geller, David H; Dumesic, Daniel A; Azziz, Ricardo; Guo, Xiuqing; Goodarzi, Mark O
2011-01-01
The pathogenesis of polycystic ovary syndrome (PCOS) is poorly understood. PCOS-like phenotypes are produced by prenatal androgenization (PA) of female rhesus monkeys. We hypothesize that perturbation of the epigenome, through altered DNA methylation, is one of the mechanisms whereby PA reprograms monkeys to develop PCOS. Infant and adult visceral adipose tissues (VAT) harvested from 15 PA and 10 control monkeys were studied. Bisulfite treated samples were subjected to genome-wide CpG methylation analysis, designed to simultaneously measure methylation levels at 27,578 CpG sites. Analysis was carried out using Bayesian Classification with Singular Value Decomposition (BCSVD), testing all probes simultaneously in a single test. Stringent criteria were then applied to filter out invalid probes due to sequence dissimilarities between human probes and monkey DNA, and then mapped to the rhesus genome. This yielded differentially methylated loci between PA and control monkeys, 163 in infant VAT, and 325 in adult VAT (BCSVD P<0.05). Among these two sets of genes, we identified several significant pathways, including the antiproliferative role of TOB in T cell signaling and transforming growth factor-β (TGF-β) signaling. Our results suggest PA may modify DNA methylation patterns in both infant and adult VAT. This pilot study suggests that excess fetal androgen exposure in female nonhuman primates may predispose to PCOS via alteration of the epigenome, providing a novel avenue to understand PCOS in humans.
LARG at chromosome 11q23 has functional characteristics of a tumor suppressor in human breast cancer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ong, Danny C.T.; Rudduck, Christina; Chin, Koei
2008-05-06
Deletion of 11q23-q24 is frequent in a diverse variety of malignancies, including breast and colorectal carcinoma, implicating the presence of a tumor suppressor gene at that chromosomal region. We show here that LARG, from 11q23, has functional characteristics of a tumor suppressor. We examined a 6-Mb region on 11q23 by high-resolution deletion mapping, utilizing both loss of heterozygosity (LOH) analysis and microarray comparative genomic hybridization (CGH). LARG (also called ARHGEF12), identified from the analyzed region, was underexpressed in 34% of primary breast carcinomas and 80% of breast cancer cell lines including the MCF-7 line. Multiplex ligation-dependent probe amplification on 30more » primary breast cancers and six breast cancer cell lines showed that LARG had the highest frequency of deletion compared to the BCSC-1 and TSLC1 genes, two known candidate tumor suppressor genes from 11q. In vitro analysis of breast cancer cell lines that underexpress LARG showed that LARG could be reactivated by trichostatin A, a histone deacetylase inhibitor, but not by 5-Aza-2{prime}-deoxycytidine, a demethylating agent. Bisulfite sequencing and quantitative high-throughput analysis of DNA methylation confirmed the lack of CpG island methylation in LARG in breast cancer. Restoration of LARG expression in MCF-7 cells by stable transfection resulted in reduced proliferation and colony formation, suggesting that LARG has functional characteristics of a tumor suppressor gene.« less
Potential energy landscapes identify the information-theoretic nature of the epigenome
Jenkinson, Garrett; Pujadas, Elisabet; Goutsias, John; Feinberg, Andrew P.
2017-01-01
Epigenetics studies genomic modifications carrying information independent of DNA sequence heritable through cell division. In 1940, Waddington coined the term “epigenetic landscape” as a metaphor for pluripotency and differentiation, but methylation landscapes have not yet been rigorously computed. By using principles of statistical physics and information theory, we derive epigenetic energy landscapes from whole-genome bisulfite sequencing data that allow us to quantify methylation stochasticity genome-wide using Shannon’s entropy and associate entropy with chromatin structure. Moreover, we consider the Jensen-Shannon distance between sample-specific energy landscapes as a measure of epigenetic dissimilarity and demonstrate its effectiveness for discerning epigenetic differences. By viewing methylation maintenance as a communications system, we introduce methylation channels and show that higher-order chromatin organization can be predicted from their informational properties. Our results provide a fundamental understanding of the information-theoretic nature of the epigenome that leads to a powerful approach for studying its role in disease and aging. PMID:28346445
Genome-wide methylation analysis identified sexually dimorphic methylated regions in hybrid tilapia
Wan, Zi Yi; Xia, Jun Hong; Lin, Grace; Wang, Le; Lin, Valerie C. L.; Yue, Gen Hua
2016-01-01
Sexual dimorphism is an interesting biological phenomenon. Previous studies showed that DNA methylation might play a role in sexual dimorphism. However, the overall picture of the genome-wide methylation landscape in sexually dimorphic species remains unclear. We analyzed the DNA methylation landscape and transcriptome in hybrid tilapia (Oreochromis spp.) using whole genome bisulfite sequencing (WGBS) and RNA-sequencing (RNA-seq). We found 4,757 sexually dimorphic differentially methylated regions (DMRs), with significant clusters of DMRs located on chromosomal regions associated with sex determination. CpG methylation in promoter regions was negatively correlated with the gene expression level. MAPK/ERK pathway was upregulated in male tilapia. We also inferred active cis-regulatory regions (ACRs) in skeletal muscle tissues from WGBS datasets, revealing sexually dimorphic cis-regulatory regions. These results suggest that DNA methylation contribute to sex-specific phenotypes and serve as resources for further investigation to analyze the functions of these regions and their contributions towards sexual dimorphisms. PMID:27782217
An epigenetic aging clock for dogs and wolves.
Thompson, Michael J; vonHoldt, Bridgett; Horvath, Steve; Pellegrini, Matteo
2017-03-28
Several articles describe highly accurate age estimation methods based on human DNA-methylation data. It is not yet known whether similar epigenetic aging clocks can be developed based on blood methylation data from canids. Using Reduced Representation Bisulfite Sequencing, we assessed blood DNA-methylation data from 46 domesticated dogs ( Canis familiaris ) and 62 wild gray wolves ( C. lupus ). By regressing chronological dog age on the resulting CpGs, we defined highly accurate multivariate age estimators for dogs (based on 41 CpGs), wolves (67 CpGs), and both combined (115 CpGs). Age related DNA methylation changes in canids implicate similar gene ontology categories as those observed in humans suggesting an evolutionarily conserved mechanism underlying age-related DNA methylation in mammals.
An epigenetic aging clock for dogs and wolves
Thompson, Michael J.; vonHoldt, Bridgett; Horvath, Steve; Pellegrini, Matteo
2017-01-01
Several articles describe highly accurate age estimation methods based on human DNA-methylation data. It is not yet known whether similar epigenetic aging clocks can be developed based on blood methylation data from canids. Using Reduced Representation Bisulfite Sequencing, we assessed blood DNA-methylation data from 46 domesticated dogs (Canis familiaris) and 62 wild gray wolves (C. lupus). By regressing chronological dog age on the resulting CpGs, we defined highly accurate multivariate age estimators for dogs (based on 41 CpGs), wolves (67 CpGs), and both combined (115 CpGs). Age related DNA methylation changes in canids implicate similar gene ontology categories as those observed in humans suggesting an evolutionarily conserved mechanism underlying age-related DNA methylation in mammals. PMID:28373601
DOE Office of Scientific and Technical Information (OSTI.GOV)
Osmond, C.B.; Avadhani, P.N.
1970-01-01
Bisulfite compounds are well known as inhibitors of glycolate oxidase in green tissues of higher plants. In an effort to understand the relation between low glycolate oxidase activity and high P-enolpyruvate carboxylase activity in plants with the C/sub 4/ dicarboxylic acid pathway of photosynthesis, the authors have treated leaves of related species of Atriplex with these compounds. In this photosynthetic process, as well as during dark CO/sub 2/ fixation leading to acidification of Sedum leaves, they have found bisulfite compounds to be effective inhibitors of the P-enolpyruvate carboxylation system. This report provides evidence in vivo for this inhibition and describesmore » the inhibition in vitro of P-enolpyruvate carboxylation system. This report provides evidence in vivo for this inhibition and describes the inhibition in vitro of P-enolpyruvate carboxylase and NADH malate dehydrogenase. 16 references, 4 figures, 1 table.« less
Ashktorab, Hassan; Daremipouran, M; Goel, Ajay; Varma, Sudhir; Leavitt, R; Sun, Xueguang; Brim, Hassan
2014-04-01
The identification of genes that are differentially methylated in colorectal cancer (CRC) has potential value for both diagnostic and therapeutic interventions specifically in high-risk populations such as African Americans (AAs). However, DNA methylation patterns in CRC, especially in AAs, have not been systematically explored and remain poorly understood. Here, we performed DNA methylome profiling to identify the methylation status of CpG islands within candidate genes involved in critical pathways important in the initiation and development of CRC. We used reduced representation bisulfite sequencing (RRBS) in colorectal cancer and adenoma tissues that were compared with DNA methylome from a healthy AA subject's colon tissue and peripheral blood DNA. The identified methylation markers were validated in fresh frozen CRC tissues and corresponding normal tissues from AA patients diagnosed with CRC at Howard University Hospital. We identified and validated the methylation status of 355 CpG sites located within 16 gene promoter regions associated with CpG islands. Fifty CpG sites located within CpG islands-in genes ATXN7L1 (2), BMP3 (7), EID3 (15), GAS7 (1), GPR75 (24), and TNFAIP2 (1)-were significantly hypermethylated in tumor vs. normal tissues (P<0.05). The methylation status of BMP3, EID3, GAS7, and GPR75 was confirmed in an independent, validation cohort. Ingenuity pathway analysis mapped three of these markers (GAS7, BMP3 and GPR) in the insulin and TGF-β1 network-the two key pathways in CRC. In addition to hypermethylated genes, our analysis also revealed that LINE-1 repeat elements were progressively hypomethylated in the normal-adenoma-cancer sequence. We conclude that DNA methylome profiling based on RRBS is an effective method for screening aberrantly methylated genes in CRC. While previous studies focused on the limited identification of hypermethylated genes, ours is the first study to systematically and comprehensively identify novel hypermethylated genes, as well as hypomethylated LINE-1 sequences, which may serve as potential biomarkers for CRC in African Americans. Our discovered biomarkers were intimately linked to the insulin/TGF-B1 pathway, further strengthening the association of diabetic disorders with colon oncogenic transformation.
Locating Sequence on FPC Maps and Selecting a Minimal Tiling Path
Engler, Friedrich W.; Hatfield, James; Nelson, William; Soderlund, Carol A.
2003-01-01
This study discusses three software tools, the first two aid in integrating sequence with an FPC physical map and the third automatically selects a minimal tiling path given genomic draft sequence and BAC end sequences. The first tool, FSD (FPC Simulated Digest), takes a sequenced clone and adds it back to the map based on a fingerprint generated by an in silico digest of the clone. This allows verification of sequenced clone positions and the integration of sequenced clones that were not originally part of the FPC map. The second tool, BSS (Blast Some Sequence), takes a query sequence and positions it on the map based on sequence associated with the clones in the map. BSS has multiple uses as follows: (1) When the query is a file of marker sequences, they can be added as electronic markers. (2) When the query is draft sequence, the results of BSS can be used to close gaps in a sequenced clone or the physical map. (3) When the query is a sequenced clone and the target is BAC end sequences, one may select the next clone for sequencing using both sequence comparison results and map location. (4) When the query is whole-genome draft sequence and the target is BAC end sequences, the results can be used to select many clones for a minimal tiling path at once. The third tool, pickMTP, automates the majority of this last usage of BSS. Results are presented using the rice FPC map, BAC end sequences, and whole-genome shotgun from Syngenta. PMID:12915486
Cheng, Jinkui; Lai, Jinsheng; Gong, Zhizhong
2016-01-01
DNA polymerase δ plays crucial roles in DNA repair and replication as well as maintaining genomic stability. However, the function of POLD2, the second small subunit of DNA polymerase δ, has not been characterized yet in Arabidopsis (Arabidopsis thaliana). During a genetic screen for release of transcriptional gene silencing, we identified a mutation in POLD2. Whole-genome bisulfite sequencing indicated that POLD2 is not involved in the regulation of DNA methylation. POLD2 genetically interacts with Ataxia Telangiectasia-mutated and Rad3-related and DNA polymerase α. The pold2-1 mutant exhibits genomic instability with a high frequency of homologous recombination. It also exhibits hypersensitivity to DNA-damaging reagents and short telomere length. Whole-genome chromatin immunoprecipitation sequencing and RNA sequencing analyses suggest that pold2-1 changes H3K27me3 and H3K4me3 modifications, and these changes are correlated with the gene expression levels. Our study suggests that POLD2 is required for maintaining genome integrity and properly establishing the epigenetic markers during DNA replication to modulate gene expression. PMID:27208288
Defining the location of promoter-associated R-loops at near-nucleotide resolution using bisDRIP-seq
Dumelie, Jason G
2017-01-01
R-loops are features of chromatin consisting of a strand of DNA hybridized to RNA, as well as the expelled complementary DNA strand. R-loops are enriched at promoters where they have recently been shown to have important roles in modifying gene expression. However, the location of promoter-associated R-loops and the genomic domains they perturb to modify gene expression remain unclear. To resolve this issue, we developed a bisulfite-based approach, bisDRIP-seq, to map R-loops across the genome at near-nucleotide resolution in MCF-7 cells. We found the location of promoter-associated R-loops is dependent on the presence of introns. In intron-containing genes, R-loops are bounded between the transcription start site and the first exon-intron junction. In intronless genes, the 3' boundary displays gene-specific heterogeneity. Moreover, intronless genes are often associated with promoter-associated R-loop formation. Together, these studies provide a high-resolution map of R-loops and identify gene structure as a critical determinant of R-loop formation. PMID:29072160
Tan, Liping; Yu, Yongcheng; Li, Xuezhi; Zhao, Jian; Qu, Yinbo; Choo, Yuen May; Loh, Soh Kheang
2013-05-01
This study evaluates the effects of some pretreatment processes to improve the enzymatic hydrolysis of oil palm empty fruit bunch (EFB) for ethanol production. The experimental results show that the bisulfite pretreatment was practical for EFB pretreatment. Moreover, the optimum pretreatment conditions of the bisulfite pretreatment (180 °C, 30 min, 8% NaHSO3, 1% H2SO4) were identified. In the experiments, a biorefinery process of EFB was proposed to produce ethanol, xylose products, and lignosulfonates. Copyright © 2012 Elsevier Ltd. All rights reserved.
[Genome-scale sequence data processing and epigenetic analysis of DNA methylation].
Wang, Ting-Zhang; Shan, Gao; Xu, Jian-Hong; Xue, Qing-Zhong
2013-06-01
A new approach recently developed for detecting cytosine DNA methylation (mC) and analyzing the genome-scale DNA methylation profiling, is called BS-Seq which is based on bisulfite conversion of genomic DNA combined with next-generation sequencing. The method can not only provide an insight into the difference of genome-scale DNA methylation among different organisms, but also reveal the conservation of DNA methylation in all contexts and nucleotide preference for different genomic regions, including genes, exons, and repetitive DNA sequences. It will be helpful to under-stand the epigenetic impacts of cytosine DNA methylation on the regulation of gene expression and maintaining silence of repetitive sequences, such as transposable elements. In this paper, we introduce the preprocessing steps of DNA methylation data, by which cytosine (C) and guanine (G) in the reference sequence are transferred to thymine (T) and adenine (A), and cytosine in reads is transferred to thymine, respectively. We also comprehensively review the main content of the DNA methylation analysis on the genomic scale: (1) the cytosine methylation under the context of different sequences; (2) the distribution of genomic methylcytosine; (3) DNA methylation context and the preference for the nucleotides; (4) DNA- protein interaction sites of DNA methylation; (5) degree of methylation of cytosine in the different structural elements of genes. DNA methylation analysis technique provides a powerful tool for the epigenome study in human and other species, and genes and environment interaction, and founds the theoretical basis for further development of disease diagnostics and therapeutics in human.
Staňková, Helena; Hastie, Alex R; Chan, Saki; Vrána, Jan; Tulpová, Zuzana; Kubaláková, Marie; Visendi, Paul; Hayashi, Satomi; Luo, Mingcheng; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana
2016-07-01
The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC-by-BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high-resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high-resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome-scale analysis of repetitive sequences and revealed a ~800-kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone-by-clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC-contig physical map and validate sequence assembly on a chromosome-arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome-by-chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Optical mapping and its potential for large-scale sequencing projects.
Aston, C; Mishra, B; Schwartz, D C
1999-07-01
Physical mapping has been rediscovered as an important component of large-scale sequencing projects. Restriction maps provide landmark sequences at defined intervals, and high-resolution restriction maps can be assembled from ensembles of single molecules by optical means. Such optical maps can be constructed from both large-insert clones and genomic DNA, and are used as a scaffold for accurately aligning sequence contigs generated by shotgun sequencing.
Isolation of Assimilatory- and Dissimilatory-Type Sulfite Reductases from Desulfovibrio vulgaris
Lee, Jin-Po; LeGall, Jean; Peck, Harry D.
1973-01-01
Bisulfite reductase (desulfoviridin) and an assimilatory sulfite reductase have been purified from extracts of Desulfovibrio vulgaris. The bisulfite reductase has absorption maxima at 628, 580, 408, 390, and 279 nm, and a molecular weight of 226,000 by sedimentation equilibrium, and was judged to be free of other proteins by disk electrophoresis and ultracentrifugation. On gels, purified bisulfite reductase exhibited two green bands which coincided with activity and protein. The enzyme appears to be a tetramer but was shown to have two different types of subunits having molecular weights of 42,000 and 50,000. The chromophore did not form an alkaline ferrohemochromogen, was not reduced with dithionite or borohydride, and did not form a spectrally visible complex with CO. The assimilatory sulfite reductase has absorption maxima at 590, 545, 405 and 275 nm and a molecular weight of 26,800, and appears to consist of a single polypeptide chain as it is not dissociated into subunits by sodium dodecyl sulfate. By disk electrophoresis, purified sulfite reductase exhibited a single greenish-brown band which coincided with activity and protein. The sole product of the reduction was sulfide, and the chromophore was reduced by borohydride in the presence of sulfite. Carbon monoxide reacted with the reduced chromophore but it did not form a typical pyridine ferrohemochromogen. Thiosulfate, trithionate, and tetrathionate were not reduced by either enzyme preparation. In the presence of 8 M urea, the spectrum of bisulfite reductase resembles that of the sulfite reductase, thus suggesting a chemical relationship between the two chromophores. Images PMID:4725615
Chronic exposure to water pollutant trichloroethylene increased epigenetic drift in CD4(+) T cells.
Gilbert, Kathleen M; Blossom, Sarah J; Erickson, Stephen W; Reisfeld, Brad; Zurlinden, Todd J; Broadfoot, Brannon; West, Kirk; Bai, Shasha; Cooney, Craig A
2016-05-01
Autoimmune disease and CD4(+) T-cell alterations are induced in mice exposed to the water pollutant trichloroethylene (TCE). We examined here whether TCE altered gene-specific DNA methylation in CD4(+) T cells as a possible mechanism of immunotoxicity. Naive and effector/memory CD4(+) T cells from mice exposed to TCE (0.5 mg/ml in drinking water) for 40 weeks were examined by bisulfite next-generation DNA sequencing. A probabilistic model calculated from multiple genes showed that TCE decreased methylation control in CD4(+) T cells. Data from individual genes fitted to a quadratic regression model showed that TCE increased gene-specific methylation variance in both CD4 subsets. TCE increased epigenetic drift of specific CpG sites in CD4(+) T cells.
Landau, Dan A; Clement, Kendell; Ziller, Michael J; Boyle, Patrick; Fan, Jean; Gu, Hongcang; Stevenson, Kristen; Sougnez, Carrie; Wang, Lili; Li, Shuqiang; Kotliar, Dylan; Zhang, Wandi; Ghandi, Mahmoud; Garraway, Levi; Fernandes, Stacey M; Livak, Kenneth J; Gabriel, Stacey; Gnirke, Andreas; Lander, Eric S; Brown, Jennifer R; Neuberg, Donna; Kharchenko, Peter V; Hacohen, Nir; Getz, Gad; Meissner, Alexander; Wu, Catherine J
2014-12-08
Intratumoral heterogeneity plays a critical role in tumor evolution. To define the contribution of DNA methylation to heterogeneity within tumors, we performed genome-scale bisulfite sequencing of 104 primary chronic lymphocytic leukemias (CLLs). Compared with 26 normal B cell samples, CLLs consistently displayed higher intrasample variability of DNA methylation patterns across the genome, which appears to arise from stochastically disordered methylation in malignant cells. Transcriptome analysis of bulk and single CLL cells revealed that methylation disorder was linked to low-level expression. Disordered methylation was further associated with adverse clinical outcome. We therefore propose that disordered methylation plays a similar role to that of genetic instability, enhancing the ability of cancer cells to search for superior evolutionary trajectories. Copyright © 2014 Elsevier Inc. All rights reserved.
Landau, Dan A.; Clement, Kendell; Ziller, Michael J.; Boyle, Patrick; Fan, Jean; Gu, Hongcang; Stevenson, Kristen; Sougnez, Carrie; Wang, Lili; Li, Shuqiang; Kotliar, Dylan; Zhang, Wandi; Ghandi, Mahmoud; Garraway, Levi; Fernandes, Stacey M.; Livak, Kenneth J.; Gabriel, Stacey; Gnirke, Andreas; Lander, Eric S.; Brown, Jennifer R.; Neuberg, Donna; Kharchenko, Peter V.; Hacohen, Nir; Getz, Gad; Meissner, Alexander; Wu, Catherine J.
2014-01-01
SUMMARY Intra-tumoral heterogeneity plays a critical role in tumor evolution. To define the contribution of DNA methylation to heterogeneity within tumors, we performed genome-scale bisulfite sequencing of 104 primary chronic lymphocytic leukemias (CLL). Compared to 26 normal B cell samples, CLLs consistently displayed higher intra-sample variability of DNA methylation patterns across the genome, which appears to arise from stochastically disordered methylation in malignant cells. Transcriptome analysis of bulk and single CLL cells revealed that methylation disorder was linked to low-level expression. Disordered methylation was further associated with adverse clinical outcome. We therefore propose that disordered methylation plays a similar role to genetic instability, enhancing the ability of cancer cells to search for superior evolutionary trajectories. PMID:25490447
2009-01-01
Background ESTs or variable sequence reads can be available in prokaryotic studies well before a complete genome is known. Use cases include (i) transcriptome studies or (ii) single cell sequencing of bacteria. Without suitable software their further analysis and mapping would have to await finalization of the corresponding genome. Results The tool JANE rapidly maps ESTs or variable sequence reads in prokaryotic sequencing and transcriptome efforts to related template genomes. It provides an easy-to-use graphics interface for information retrieval and a toolkit for EST or nucleotide sequence function prediction. Furthermore, we developed for rapid mapping an enhanced sequence alignment algorithm which reassembles and evaluates high scoring pairs provided from the BLAST algorithm. Rapid assembly on and replacement of the template genome by sequence reads or mapped ESTs is achieved. This is illustrated (i) by data from Staphylococci as well as from a Blattabacteria sequencing effort, (ii) mapping single cell sequencing reads is shown for poribacteria to sister phylum representative Rhodopirellula Baltica SH1. The algorithm has been implemented in a web-server accessible at http://jane.bioapps.biozentrum.uni-wuerzburg.de. Conclusion Rapid prokaryotic EST mapping or mapping of sequence reads is achieved applying JANE even without knowing the cognate genome sequence. PMID:19943962
Xing, Yang; Bu, Lingxi; Zheng, Tianran; Liu, Shijie; Jiang, Jianxin
2016-12-01
Co-production of glucose, furfural and other green materials based on a lignocellulosic biorefinery is a promising way to realize the commercial application of corncob residues. An effective process was developed for glucose production using low temperature bisulfite pretreatment and high-solids enzymatic hydrolysis. Corncob residues from furfural production (FRs) were pretreated with 0.1g NaHSO 3 /g dry substrate at 100°C for 3h. Lignin was sulfonated and sulfonic groups were produced during pretreatment, which resulted in decreasing the zeta potential of the samples. Compared with raw material, bisulfite pretreatment of FRs increased the glucose yield from 18.6 to 99.45% after 72h hydrolysis at a solids loading of 12.5%. The hydrolysis residues showed a relatively high thermal stability and concentrated high derivatives. Direct pretreatment followed by enzymatic hydrolysis is an environmentally-friendly and economically-feasible method for the production of glucose and high-purity lignin, which could be further converted into high-value products. Copyright © 2016 Elsevier Ltd. All rights reserved.
Gardiner, Laura-Jayne; Bansept-Basler, Pauline; Olohan, Lisa; Joynson, Ryan; Brenchley, Rachel; Hall, Neil; O'Sullivan, Donal M; Hall, Anthony
2016-08-01
Previously we extended the utility of mapping-by-sequencing by combining it with sequence capture and mapping sequence data to pseudo-chromosomes that were organized using wheat-Brachypodium synteny. This, with a bespoke haplotyping algorithm, enabled us to map the flowering time locus in the diploid wheat Triticum monococcum L. identifying a set of deleted genes (Gardiner et al., 2014). Here, we develop this combination of gene enrichment and sliding window mapping-by-synteny analysis to map the Yr6 locus for yellow stripe rust resistance in hexaploid wheat. A 110 MB NimbleGen capture probe set was used to enrich and sequence a doubled haploid mapping population of hexaploid wheat derived from an Avalon and Cadenza cross. The Yr6 locus was identified by mapping to the POPSEQ chromosomal pseudomolecules using a bespoke pipeline and algorithm (Chapman et al., 2015). Furthermore the same locus was identified using newly developed pseudo-chromosome sequences as a mapping reference that are based on the genic sequence used for sequence enrichment. The pseudo-chromosomes allow us to demonstrate the application of mapping-by-sequencing to even poorly defined polyploidy genomes where chromosomes are incomplete and sub-genome assemblies are collapsed. This analysis uniquely enabled us to: compare wheat genome annotations; identify the Yr6 locus - defining a smaller genic region than was previously possible; associate the interval with one wheat sub-genome and increase the density of SNP markers associated. Finally, we built the pipeline in iPlant, making it a user-friendly community resource for phenotype mapping. © 2016 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Higareda-Almaraz, Juan Carlos; Ruiz-Moreno, Juan S; Klimentova, Jana; Barbieri, Daniela; Salvador-Gallego, Raquel; Ly, Regina; Valtierra-Gutierrez, Ilse A; Dinsart, Christiane; Rabinovich, Gabriel A; Stulik, Jiri; Rösl, Frank; Rincon-Orozco, Bladimiro
2016-08-24
Galectin-7 (Gal-7) is negatively regulated in cervical cancer, and appears to be a link between the apoptotic response triggered by cancer and the anti-tumoral activity of the immune system. Our understanding of how cervical cancer cells and their molecular networks adapt in response to the expression of Gal-7 remains limited. Meta-analysis of Gal-7 expression was conducted in three cervical cancer cohort studies and TCGA. In silico prediction and bisulfite sequencing were performed to inquire epigenetic alterations. To study the effect of Gal-7 on cervical cancer, we ectopically re-expressed it in the HeLa and SiHa cervical cancer cell lines, and analyzed their transcriptome and SILAC-based proteome. We also examined the tumor and microenvironment host cell transcriptomes after xenotransplantation into immunocompromised mice. Differences between samples were assessed with the Kruskall-Wallis, Dunn's Multiple Comparison and T tests. Kaplan-Meier and log-rank tests were used to determine overall survival. Gal-7 was constantly downregulated in our meta-analysis (p < 0.0001). Tumors with combined high Gal-7 and low galectin-1 expression (p = 0.0001) presented significantly better prognoses (p = 0.005). In silico and bisulfite sequencing assays showed de novo methylation in the Gal-7 promoter and first intron. Cells re-expressing Gal-7 showed a high apoptosis ratio (p < 0.05) and their xenografts displayed strong growth retardation (p < 0.001). Multiple gene modules and transcriptional regulators were modulated in response to Gal-7 reconstitution, both in cervical cancer cells and their microenvironments (FDR < 0.05 %). Most of these genes and modules were associated with tissue morphogenesis, metabolism, transport, chemokine activity, and immune response. These functional modules could exert the same effects in vitro and in vivo, even despite different compositions between HeLa and SiHa samples. Gal-7 re-expression affects the regulation of molecular networks in cervical cancer that are involved in diverse cancer hallmarks, such as metabolism, growth control, invasion and evasion of apoptosis. The effect of Gal-7 extends to the microenvironment, where networks involved in its configuration and in immune surveillance are particularly affected.
Construction of a map-based reference genome sequence for barley, Hordeum vulgare L.
Beier, Sebastian; Himmelbach, Axel; Colmsee, Christian; Zhang, Xiao-Qi; Barrero, Roberto A.; Zhang, Qisen; Li, Lin; Bayer, Micha; Bolser, Daniel; Taudien, Stefan; Groth, Marco; Felder, Marius; Hastie, Alex; Šimková, Hana; Staňková, Helena; Vrána, Jan; Chan, Saki; Muñoz-Amatriaín, María; Ounit, Rachid; Wanamaker, Steve; Schmutzer, Thomas; Aliyeva-Schnorr, Lala; Grasso, Stefano; Tanskanen, Jaakko; Sampath, Dharanya; Heavens, Darren; Cao, Sujie; Chapman, Brett; Dai, Fei; Han, Yong; Li, Hua; Li, Xuan; Lin, Chongyun; McCooke, John K.; Tan, Cong; Wang, Songbo; Yin, Shuya; Zhou, Gaofeng; Poland, Jesse A.; Bellgard, Matthew I.; Houben, Andreas; Doležel, Jaroslav; Ayling, Sarah; Lonardi, Stefano; Langridge, Peter; Muehlbauer, Gary J.; Kersey, Paul; Clark, Matthew D.; Caccamo, Mario; Schulman, Alan H.; Platzer, Matthias; Close, Timothy J.; Hansson, Mats; Zhang, Guoping; Braumann, Ilka; Li, Chengdao; Waugh, Robbie; Scholz, Uwe; Stein, Nils; Mascher, Martin
2017-01-01
Barley (Hordeum vulgare L.) is a cereal grass mainly used as animal fodder and raw material for the malting industry. The map-based reference genome sequence of barley cv. ‘Morex’ was constructed by the International Barley Genome Sequencing Consortium (IBSC) using hierarchical shotgun sequencing. Here, we report the experimental and computational procedures to (i) sequence and assemble more than 80,000 bacterial artificial chromosome (BAC) clones along the minimum tiling path of a genome-wide physical map, (ii) find and validate overlaps between adjacent BACs, (iii) construct 4,265 non-redundant sequence scaffolds representing clusters of overlapping BACs, and (iv) order and orient these BAC clusters along the seven barley chromosomes using positional information provided by dense genetic maps, an optical map and chromosome conformation capture sequencing (Hi-C). Integrative access to these sequence and mapping resources is provided by the barley genome explorer (BARLEX). PMID:28448065
GobyWeb: Simplified Management and Analysis of Gene Expression and DNA Methylation Sequencing Data
Dorff, Kevin C.; Chambwe, Nyasha; Zeno, Zachary; Simi, Manuele; Shaknovich, Rita; Campagne, Fabien
2013-01-01
We present GobyWeb, a web-based system that facilitates the management and analysis of high-throughput sequencing (HTS) projects. The software provides integrated support for a broad set of HTS analyses and offers a simple plugin extension mechanism. Analyses currently supported include quantification of gene expression for messenger and small RNA sequencing, estimation of DNA methylation (i.e., reduced bisulfite sequencing and whole genome methyl-seq), or the detection of pathogens in sequenced data. In contrast to previous analysis pipelines developed for analysis of HTS data, GobyWeb requires significantly less storage space, runs analyses efficiently on a parallel grid, scales gracefully to process tens or hundreds of multi-gigabyte samples, yet can be used effectively by researchers who are comfortable using a web browser. We conducted performance evaluations of the software and found it to either outperform or have similar performance to analysis programs developed for specialized analyses of HTS data. We found that most biologists who took a one-hour GobyWeb training session were readily able to analyze RNA-Seq data with state of the art analysis tools. GobyWeb can be obtained at http://gobyweb.campagnelab.org and is freely available for non-commercial use. GobyWeb plugins are distributed in source code and licensed under the open source LGPL3 license to facilitate code inspection, reuse and independent extensions http://github.com/CampagneLaboratory/gobyweb2-plugins. PMID:23936070
CaMV-35S promoter sequence-specific DNA methylation in lettuce.
Okumura, Azusa; Shimada, Asahi; Yamasaki, Satoshi; Horino, Takuya; Iwata, Yuji; Koizumi, Nozomu; Nishihara, Masahiro; Mishiba, Kei-ichiro
2016-01-01
We found 35S promoter sequence-specific DNA methylation in lettuce. Additionally, transgenic lettuce plants having a modified 35S promoter lost methylation, suggesting the modified sequence is subjected to the methylation machinery. We previously reported that cauliflower mosaic virus 35S promoter-specific DNA methylation in transgenic gentian (Gentiana triflora × G. scabra) plants occurs irrespective of the copy number and the genomic location of T-DNA, and causes strong gene silencing. To confirm whether 35S-specific methylation can occur in other plant species, transgenic lettuce (Lactuca sativa L.) plants with a single copy of the 35S promoter-driven sGFP gene were produced and analyzed. Among 10 lines of transgenic plants, 3, 4, and 3 lines showed strong, weak, and no expression of sGFP mRNA, respectively. Bisulfite genomic sequencing of the 35S promoter region showed hypermethylation at CpG and CpWpG (where W is A or T) sites in 9 of 10 lines. Gentian-type de novo methylation pattern, consisting of methylated cytosines at CpHpH (where H is A, C, or T) sites, was also observed in the transgenic lettuce lines, suggesting that lettuce and gentian share similar methylation machinery. Four of five transgenic lettuce lines having a single copy of a modified 35S promoter, which was modified in the proposed core target of de novo methylation in gentian, exhibited 35S hypomethylation, indicating that the modified sequence may be the target of the 35S-specific methylation machinery.
Chan, Robin F.; Shabalin, Andrey A.; Xie, Lin Y.; Adkins, Daniel E.; Zhao, Min; Turecki, Gustavo; Clark, Shaunna L.; Aberg, Karolina A.
2017-01-01
Abstract Methylome-wide association studies are typically performed using microarray technologies that only assay a very small fraction of the CG methylome and entirely miss two forms of methylation that are common in brain and likely of particular relevance for neuroscience and psychiatric disorders. The alternative is to use whole genome bisulfite (WGB) sequencing but this approach is not yet practically feasible with sample sizes required for adequate statistical power. We argue for revisiting methylation enrichment methods that, provided optimal protocols are used, enable comprehensive, adequately powered and cost-effective genome-wide investigations of the brain methylome. To support our claim we use data showing that enrichment methods approximate the sensitivity obtained with WGB methods and with slightly better specificity. However, this performance is achieved at <5% of the reagent costs. Furthermore, because many more samples can be sequenced simultaneously, projects can be completed about 15 times faster. Currently the only viable option available for comprehensive brain methylome studies, enrichment methods may be critical for moving the field forward. PMID:28334972
Integrated genome browser: visual analytics platform for genomics.
Freese, Nowlan H; Norris, David C; Loraine, Ann E
2016-07-15
Genome browsers that support fast navigation through vast datasets and provide interactive visual analytics functions can help scientists achieve deeper insight into biological systems. Toward this end, we developed Integrated Genome Browser (IGB), a highly configurable, interactive and fast open source desktop genome browser. Here we describe multiple updates to IGB, including all-new capabilities to display and interact with data from high-throughput sequencing experiments. To demonstrate, we describe example visualizations and analyses of datasets from RNA-Seq, ChIP-Seq and bisulfite sequencing experiments. Understanding results from genome-scale experiments requires viewing the data in the context of reference genome annotations and other related datasets. To facilitate this, we enhanced IGB's ability to consume data from diverse sources, including Galaxy, Distributed Annotation and IGB-specific Quickload servers. To support future visualization needs as new genome-scale assays enter wide use, we transformed the IGB codebase into a modular, extensible platform for developers to create and deploy all-new visualizations of genomic data. IGB is open source and is freely available from http://bioviz.org/igb aloraine@uncc.edu. © The Author 2016. Published by Oxford University Press.
Park, Hyung Soo; Chatterjee, Indranil; Dong, Xiaoli; Wang, Sheng-Hung; Sensen, Christoph W.; Caffrey, Sean M.; Jack, Thomas R.; Boivin, Joe; Voordouw, Gerrit
2011-01-01
Pipelines transporting brackish subsurface water, used in the production of bitumen by steam-assisted gravity drainage, are subject to frequent corrosion failures despite the addition of the oxygen scavenger sodium bisulfite (SBS). Pyrosequencing of 16S rRNA genes was used to determine the microbial community composition for planktonic samples of transported water and for sessile samples of pipe-associated solids (PAS) scraped from pipeline cutouts representing corrosion failures. These were obtained from upstream (PAS-616P) and downstream (PAS-821TP and PAS-821LP, collected under rapid-flow and stagnant conditions, respectively) of the SBS injection point. Most transported water samples had a large fraction (1.8% to 97% of pyrosequencing reads) of Pseudomonas not found in sessile pipe samples. The sessile population of PAS-616P had methanogens (Methanobacteriaceae) as the main (56%) community component, whereas Deltaproteobacteria of the genera Desulfomicrobium and Desulfocapsa were not detected. In contrast, PAS-821TP and PAS-821LP had lower fractions (41% and 0.6%) of Methanobacteriaceae archaea but increased fractions of sulfate-reducing Desulfomicrobium (18% and 48%) and of bisulfite-disproportionating Desulfocapsa (35% and 22%) bacteria. Hence, SBS injection strongly changed the sessile microbial community populations. X-ray diffraction analysis of pipeline scale indicated that iron carbonate was present both upstream and downstream, whereas iron sulfide and sulfur were found only downstream of the SBS injection point, suggesting a contribution of the bisulfite-disproportionating and sulfate-reducing bacteria in the scale to iron corrosion. Incubation of iron coupons with pipeline waters indicated iron corrosion coupled to the formation of methane. Hence, both methanogenic and sulfidogenic microbial communities contributed to corrosion of pipelines transporting these brackish waters. PMID:21856836
Single-tube analysis of DNA methylation with silica superparamagnetic beads.
Bailey, Vasudev J; Zhang, Yi; Keeley, Brian P; Yin, Chao; Pelosky, Kristen L; Brock, Malcolm; Baylin, Stephen B; Herman, James G; Wang, Tza-Huei
2010-06-01
DNA promoter methylation is a signature for the silencing of tumor suppressor genes. Most widely used methods to detect DNA methylation involve 3 separate, independent processes: DNA extraction, bisulfite conversion, and methylation detection via a PCR method, such as methylation-specific PCR (MSP). This method includes many disconnected steps with associated losses of material, potentially reducing the analytical sensitivity required for analysis of challenging clinical samples. Methylation on beads (MOB) is a new technique that integrates DNA extraction, bisulfite conversion, and PCR in a single tube via the use of silica superparamagnetic beads (SSBs) as a common DNA carrier for facilitating cell debris removal and buffer exchange throughout the entire process. In addition, PCR buffer is used to directly elute bisulfite-treated DNA from SSBs for subsequent target amplifications. The diagnostic sensitivity of MOB was evaluated by methylation analysis of the CDKN2A [cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4); also known as p16(INK4a)] promoter in serum DNA of lung cancer patients and compared with that of conventional methods. Methylation analysis consisting of DNA extraction followed by bisulfite conversion and MSP was successfully carried out within 9 h in a single tube. The median pre-PCR DNA yield was 6.61-fold higher with the MOB technique than with conventional techniques. Furthermore, MOB increased the diagnostic sensitivity in our analysis of the CDKN2A promoter in patient serum by successfully detecting methylation in 74% of cancer patients, vs the 45% detection rate obtained with conventional techniques. The MOB technique successfully combined 3 processes into a single tube, thereby allowing ease in handling and an increased detection throughput. The increased pre-PCR yield in MOB allowed efficient, diagnostically sensitive methylation detection.
Zhou, Gaofeng; Jian, Jianbo; Wang, Penghao; Li, Chengdao; Tao, Ye; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark; Yang, Huaan
2018-01-01
An ultra-high density genetic map containing 34,574 sequence-defined markers was developed in Lupinus angustifolius. Markers closely linked to nine genes of agronomic traits were identified. A physical map was improved to cover 560.5 Mb genome sequence. Lupin (Lupinus angustifolius L.) is a recently domesticated legume grain crop. In this study, we applied the restriction-site associated DNA sequencing (RADseq) method to genotype an F 9 recombinant inbred line population derived from a wild type × domesticated cultivar (W × D) cross. A high density linkage map was developed based on the W × D population. By integrating sequence-defined DNA markers reported in previous mapping studies, we established an ultra-high density consensus genetic map, which contains 34,574 markers consisting of 3508 loci covering 2399 cM on 20 linkage groups. The largest gap in the entire consensus map was 4.73 cM. The high density W × D map and the consensus map were used to develop an improved physical map, which covered 560.5 Mb of genome sequence data. The ultra-high density consensus linkage map, the improved physical map and the markers linked to genes of breeding interest reported in this study provide a common tool for genome sequence assembly, structural genomics, comparative genomics, functional genomics, QTL mapping, and molecular plant breeding in lupin.
Bilton, Timothy P.; Schofield, Matthew R.; Black, Michael A.; Chagné, David; Wilcox, Phillip L.; Dodds, Ken G.
2018-01-01
Next-generation sequencing is an efficient method that allows for substantially more markers than previous technologies, providing opportunities for building high-density genetic linkage maps, which facilitate the development of nonmodel species’ genomic assemblies and the investigation of their genes. However, constructing genetic maps using data generated via high-throughput sequencing technology (e.g., genotyping-by-sequencing) is complicated by the presence of sequencing errors and genotyping errors resulting from missing parental alleles due to low sequencing depth. If unaccounted for, these errors lead to inflated genetic maps. In addition, map construction in many species is performed using full-sibling family populations derived from the outcrossing of two individuals, where unknown parental phase and varying segregation types further complicate construction. We present a new methodology for modeling low coverage sequencing data in the construction of genetic linkage maps using full-sibling populations of diploid species, implemented in a package called GUSMap. Our model is based on the Lander–Green hidden Markov model but extended to account for errors present in sequencing data. We were able to obtain accurate estimates of the recombination fractions and overall map distance using GUSMap, while most existing mapping packages produced inflated genetic maps in the presence of errors. Our results demonstrate the feasibility of using low coverage sequencing data to produce genetic maps without requiring extensive filtering of potentially erroneous genotypes, provided that the associated errors are correctly accounted for in the model. PMID:29487138
Bilton, Timothy P; Schofield, Matthew R; Black, Michael A; Chagné, David; Wilcox, Phillip L; Dodds, Ken G
2018-05-01
Next-generation sequencing is an efficient method that allows for substantially more markers than previous technologies, providing opportunities for building high-density genetic linkage maps, which facilitate the development of nonmodel species' genomic assemblies and the investigation of their genes. However, constructing genetic maps using data generated via high-throughput sequencing technology ( e.g. , genotyping-by-sequencing) is complicated by the presence of sequencing errors and genotyping errors resulting from missing parental alleles due to low sequencing depth. If unaccounted for, these errors lead to inflated genetic maps. In addition, map construction in many species is performed using full-sibling family populations derived from the outcrossing of two individuals, where unknown parental phase and varying segregation types further complicate construction. We present a new methodology for modeling low coverage sequencing data in the construction of genetic linkage maps using full-sibling populations of diploid species, implemented in a package called GUSMap. Our model is based on the Lander-Green hidden Markov model but extended to account for errors present in sequencing data. We were able to obtain accurate estimates of the recombination fractions and overall map distance using GUSMap, while most existing mapping packages produced inflated genetic maps in the presence of errors. Our results demonstrate the feasibility of using low coverage sequencing data to produce genetic maps without requiring extensive filtering of potentially erroneous genotypes, provided that the associated errors are correctly accounted for in the model. Copyright © 2018 Bilton et al.
Comparative Analyses of DNA Methylation and Sequence Evolution Using Nasonia Genomes
Park, Jungsun; Peng, Zuogang; Zeng, Jia; Elango, Navin; Park, Taesung; Wheeler, Dave; Werren, John H.; Yi, Soojin V.
2011-01-01
The functional and evolutionary significance of DNA methylation in insect genomes remains to be resolved. Nasonia is well situated for comparative analyses of DNA methylation and genome evolution, since the genomes of a moderately distant outgroup species as well as closely related sibling species are available. Using direct sequencing of bisulfite-converted DNA, we uncovered a substantial level of DNA methylation in 17 of 18 Nasonia vitripennis genes and a strong correlation between methylation level and CpG depletion. Notably, in the sex-determining locus transformer, the exon that is alternatively spliced between the sexes is heavily methylated in both males and females, whereas other exons are only sparsely methylated. Orthologous genes of the honeybee and Nasonia show highly similar relative levels of CpG depletion, despite ∼190 My divergence. Densely and sparsely methylated genes in these species also exhibit similar functional enrichments. We found that the degree of CpG depletion is negatively correlated with substitution rates between closely related Nasonia species for synonymous, nonsynonymous, and intron sites. This suggests that mutation rates increase with decreasing levels of germ line methylation. Thus, DNA methylation is prevalent in the Nasonia genome, may participate in regulatory processes such as sex determination and alternative splicing, and is correlated with several aspects of genome and sequence evolution. PMID:21693438
Heller, G; Topakian, T; Altenberger, C; Cerny-Reiterer, S; Herndlhofer, S; Ziegler, B; Datlinger, P; Byrgazov, K; Bock, C; Mannhalter, C; Hörmann, G; Sperr, W R; Lion, T; Zielinski, C C; Valent, P; Zöchbauer-Müller, S
2016-01-01
Little is known about the impact of DNA methylation on the evolution/progression of Ph+ chronic myeloid leukemia (CML). We investigated the methylome of CML patients in chronic phase (CP-CML), accelerated phase (AP-CML) and blast crisis (BC-CML) as well as in controls by reduced representation bisulfite sequencing. Although only ~600 differentially methylated CpG sites were identified in samples obtained from CP-CML patients compared with controls, ~6500 differentially methylated CpG sites were found in samples from BC-CML patients. In the majority of affected CpG sites, methylation was increased. In CP-CML patients who progressed to AP-CML/BC-CML, we identified up to 897 genes that were methylated at the time of progression but not at the time of diagnosis. Using RNA-sequencing, we observed downregulated expression of many of these genes in BC-CML compared with CP-CML samples. Several of them are well-known tumor-suppressor genes or regulators of cell proliferation, and gene re-expression was observed by the use of epigenetic active drugs. Together, our results demonstrate that CpG site methylation clearly increases during CML progression and that it may provide a useful basis for revealing new targets of therapy in advanced CML. PMID:27211271
DRME: Count-based differential RNA methylation analysis at small sample size scenario.
Liu, Lian; Zhang, Shao-Wu; Gao, Fan; Zhang, Yixin; Huang, Yufei; Chen, Runsheng; Meng, Jia
2016-04-15
Differential methylation, which concerns difference in the degree of epigenetic regulation via methylation between two conditions, has been formulated as a beta or beta-binomial distribution to address the within-group biological variability in sequencing data. However, a beta or beta-binomial model is usually difficult to infer at small sample size scenario with discrete reads count in sequencing data. On the other hand, as an emerging research field, RNA methylation has drawn more and more attention recently, and the differential analysis of RNA methylation is significantly different from that of DNA methylation due to the impact of transcriptional regulation. We developed DRME to better address the differential RNA methylation problem. The proposed model can effectively describe within-group biological variability at small sample size scenario and handles the impact of transcriptional regulation on RNA methylation. We tested the newly developed DRME algorithm on simulated and 4 MeRIP-Seq case-control studies and compared it with Fisher's exact test. It is in principle widely applicable to several other RNA-related data types as well, including RNA Bisulfite sequencing and PAR-CLIP. The code together with an MeRIP-Seq dataset is available online (https://github.com/lzcyzm/DRME) for evaluation and reproduction of the figures shown in this article. Copyright © 2016 Elsevier Inc. All rights reserved.
DNA Methylation of Gene Expression in Acanthamoeba castellanii Encystation.
Moon, Eun-Kyung; Hong, Yeonchul; Lee, Hae-Ahm; Quan, Fu-Shi; Kong, Hyun-Hee
2017-04-01
Encystation mediating cyst specific cysteine proteinase (CSCP) of Acanthamoeba castellanii is expressed remarkably during encystation. However, the molecular mechanism involved in the regulation of CSCP gene expression remains unclear. In this study, we focused on epigenetic regulation of gene expression during encystation of Acanthamoeba . To evaluate methylation as a potential mechanism involved in the regulation of CSCP expression, we first investigated the correlation between promoter methylation status of CSCP gene and its expression. A 2,878 bp of promoter sequence of CSCP gene was amplified by PCR. Three CpG islands (island 1-3) were detected in this sequence using bioinformatics tools. Methylation of CpG island in trophozoites and cysts was measured by bisulfite sequence PCR. CSCP promoter methylation of CpG island 1 (1,633 bp) was found in 8.2% of trophozoites and 7.3% of cysts. Methylation of CpG island 2 (625 bp) was observed in 4.2% of trophozoites and 5.8% of cysts. Methylation of CpG island 3 (367 bp) in trophozoites and cysts was both 3.6%. These results suggest that DNA methylation system is present in CSCP gene expression of Acanthamoeba . In addition, the expression of encystation mediating CSCP is correlated with promoter CpG island 1 hypomethylation.
Chronic exposure to water pollutant trichloroethylene increased epigenetic drift in CD4+ T cells
Gilbert, Kathleen M; Blossom, Sarah J; Erickson, Stephen W; Reisfeld, Brad; Zurlinden, Todd J; Broadfoot, Brannon; West, Kirk; Bai, Shasha; Cooney, Craig A
2016-01-01
Aim: Autoimmune disease and CD4+ T-cell alterations are induced in mice exposed to the water pollutant trichloroethylene (TCE). We examined here whether TCE altered gene-specific DNA methylation in CD4+ T cells as a possible mechanism of immunotoxicity. Materials & methods: Naive and effector/memory CD4+ T cells from mice exposed to TCE (0.5 mg/ml in drinking water) for 40 weeks were examined by bisulfite next-generation DNA sequencing. Results: A probabilistic model calculated from multiple genes showed that TCE decreased methylation control in CD4+ T cells. Data from individual genes fitted to a quadratic regression model showed that TCE increased gene-specific methylation variance in both CD4 subsets. Conclusion: TCE increased epigenetic drift of specific CpG sites in CD4+ T cells. PMID:27092578
Detection of regional DNA methylation using DNA-graphene affinity interactions.
Haque, Md Hakimul; Gopalan, Vinod; Yadav, Sharda; Islam, Md Nazmul; Eftekhari, Ehsan; Li, Qin; Carrascosa, Laura G; Nguyen, Nam-Trung; Lam, Alfred K; Shiddiky, Muhammad J A
2017-01-15
We report a new method for the detection of regional DNA methylation using base-dependent affinity interaction (i.e., adsorption) of DNA with graphene. Due to the strongest adsorption affinity of guanine bases towards graphene, bisulfite-treated guanine-enriched methylated DNA leads to a larger amount of the adsorbed DNA on the graphene-modified electrodes in comparison to the adenine-enriched unmethylated DNA. The level of the methylation is quantified by monitoring the differential pulse voltammetric current as a function of the adsorbed DNA. The assay is sensitive to distinguish methylated and unmethylated DNA sequences at single CpG resolution by differentiating changes in DNA methylation as low as 5%. Furthermore, this method has been used to detect methylation levels in a collection of DNA samples taken from oesophageal cancer tissues. Copyright © 2016 Elsevier B.V. All rights reserved.
Kon, Tatsuya; Yoshikawa, Nobuyuki
2014-01-01
Apple latent spherical virus (ALSV) is an efficient virus-induced gene silencing vector in functional genomics analyses of a broad range of plant species. Here, an Agrobacterium-mediated inoculation (agroinoculation) system was developed for the ALSV vector, and virus-induced transcriptional gene silencing (VITGS) is described in plants infected with the ALSV vector. The cDNAs of ALSV RNA1 and RNA2 were inserted between the cauliflower mosaic virus 35S promoter and the NOS-T sequences in a binary vector pCAMBIA1300 to produce pCALSR1 and pCALSR2-XSB or pCALSR2-XSB/MN. When these vector constructs were agroinoculated into Nicotiana benthamiana plants with a construct expressing a viral silencing suppressor, the infection efficiency of the vectors was 100%. A recombinant ALSV vector carrying part of the 35S promoter sequence induced transcriptional gene silencing of the green fluorescent protein gene in a line of N. benthamiana plants, resulting in the disappearance of green fluorescence of infected plants. Bisulfite sequencing showed that cytosine residues at CG and CHG sites of the 35S promoter sequence were highly methylated in the silenced generation zero plants infected with the ALSV carrying the promoter sequence as well as in progeny. The ALSV-mediated VITGS state was inherited by progeny for multiple generations. In addition, induction of VITGS of an endogenous gene (chalcone synthase-A) was demonstrated in petunia plants infected with an ALSV vector carrying the native promoter sequence. These results suggest that ALSV-based vectors can be applied to study DNA methylation in plant genomes, and provide a useful tool for plant breeding via epigenetic modification. PMID:25426109
Wu, Yuting; Bu, Fangtian; Yu, Haixia; Li, Wanxia; Huang, Cheng; Meng, Xiaoming; Zhang, Lei; Ma, Taotao; Li, Jun
2017-01-15
Liver fibrosis, resulting from chronic and persistent injury to the liver, is a worldwide health problem. Advanced liver fibrosis results in cirrhosis, liver failure and even hepatocellular cancer (HCC), often eventually requiring liver transplantation, poses a huge health burden on the global community. However, the specific pathogenesis of liver fibrosis remains not fully understood. Numerous basic and clinical studies have provided evidence that epigenetic modifications, especially DNA methylation, might contribute to the activation of hepatic stellate cells (HSCs), the pivotal cell type responsible for the fibrous scar in liver. Here, reduced representation bisulfite sequencing (RRBS) and bisulfite pyrosequencing PCR (BSP) analysis identified hypermethylation status of Septin9 (Sept9) gene in liver fibrogenesis. Sept9 protein was dramatically decreased in livers of CCl4-treated mice and immortalized HSC-T6 cells exposed to TGF-β1. Nevertheless, the suppression of Sept9 could be blocked by DNMT3a-siRNA and DNA methyltransferase inhibitor, 5-aza-2'-deoxycytidine (5-azadC). Overexpressed Sept9 attenuated TGF-β1-induced expression of myofibroblast markers α-SMA and Col1a1, accompanied by up-regulation of cell apoptosis-related proteins. Conversely, RNAi-mediated silencing of Sept9 enhanced accumulation of extracellular matrix. These observations suggested that Sept9 contributed to alleviate liver fibrosis might partially through promoting activated HSCs apoptosis and this anti-fibrogenesis effect might be blocked by DNMT-3a mediated methylation of Sept9. Therefore, pharmacological agents that inhibit Sept9 methylation and increase its expression could be considered as valuable treatments for liver fibrosis. Copyright © 2016 Elsevier Inc. All rights reserved.
Kawaguchi, Koichiro; Kinameri, Ayumi; Suzuki, Shunsuke; Senga, Shogo; Ke, Youqiang; Fujii, Hiroshi
2016-02-15
FABPs (fatty-acid-binding proteins) are a family of low-molecular-mass intracellular lipid-binding proteins consisting of ten isoforms. FABPs are involved in binding and storing hydrophobic ligands such as long-chain fatty acids, as well as transporting these ligands to the appropriate compartments in the cell. FABP5 is overexpressed in multiple types of tumours. Furthermore, up-regulation of FABP5 is strongly associated with poor survival in triple-negative breast cancer. However, the mechanisms underlying the specific up-regulation of the FABP5 gene in these cancers remain poorly characterized. In the present study, we determined that FABP5 has a typical CpG island around its promoter region. The DNA methylation status of the CpG island in the FABP5 promoter of benign prostate cells (PNT2), prostate cancer cells (PC-3, DU-145, 22Rv1 and LNCaP) and human normal or tumour tissue was assessed by bisulfite sequencing analysis, and then confirmed by COBRA (combined bisulfite restriction analysis) and qAMP (quantitative analysis of DNA methylation using real-time PCR). These results demonstrated that overexpression of FABP5 in prostate cancer cells can be attributed to hypomethylation of the CpG island in its promoter region, along with up-regulation of the direct trans-acting factors Sp1 (specificity protein 1) and c-Myc. Together, these mechanisms result in the transcriptional activation of FABP5 expression during human prostate carcinogenesis. Importantly, silencing of Sp1, c-Myc or FABP5 expression led to a significant decrease in cell proliferation, indicating that up-regulation of FABP5 expression by Sp1 and c-Myc is critical for the proliferation of prostate cancer cells. © 2016 Authors; published by Portland Press Limited.
Jia, Zhaofeng; Liang, Yujie; Ma, Bin; Xu, Xiao; Xiong, Jianyi; Duan, Li; Wang, Daping
2017-05-17
The dedifferentiation of hyaline chondrocytes into fibroblastic chondrocytes often accompanies monolayer expansion of chondrocytes in vitro. The global DNA methylation level of chondrocytes is considered to be a suitable biomarker for the loss of the chondrocyte phenotype. However, results based on different experimental methods can be inconsistent. Therefore, it is important to establish a precise, simple, and rapid method to quantify global DNA methylation levels during chondrocyte dedifferentiation. Current genome-wide methylation analysis techniques largely rely on bisulfite genomic sequencing. Due to DNA degradation during bisulfite conversion, these methods typically require a large sample volume. Other methods used to quantify global DNA methylation levels include high-performance liquid chromatography (HPLC). However, HPLC requires complete digestion of genomic DNA. Additionally, the prohibitively high cost of HPLC instruments limits HPLC's wider application. In this study, genomic DNA (gDNA) was extracted from human chondrocytes cultured with varying number of passages. The gDNA methylation level was detected using a methylation-specific dot blot assay. In this dot blot approach, a gDNA mixture containing the methylated DNA to be detected was spotted directly onto an N + membrane as a dot inside a previously drawn circular template pattern. Compared with other gel electrophoresis-based blotting approaches and other complex blotting procedures, the dot blot method saves significant time. In addition, dot blots can detect overall DNA methylation level using a commercially available 5-mC antibody. We found that the DNA methylation level differed between the monolayer subcultures, and therefore could play a key role in chondrocyte dedifferentiation. The 5-mC dot blot is a reliable, simple, and rapid method to detect the general DNA methylation level to evaluate chondrocyte phenotype.
Wang, Kunning; Liang, Qiaoyi; Li, Xiaoxing; Tsoi, Ho; Zhang, Jingwan; Wang, Hua; Go, Minnie Y Y; Chiu, Philip W Y; Ng, Enders K W; Sung, Joseph J Y; Yu, Jun
2016-01-01
Background Using the promoter methylation assay, we have shown that MDGA2 (MAM domain containing glycosylphosphatidylinositol anchor 2) is preferentially methylated in gastric cancer. We analysed its biological effects and prognostic significance in gastric cancer. Methods MDGA2 methylation status was evaluated by combined bisulfite restriction analysis and bisulfite genomic sequencing. The effects of MDGA2 re-expression or knockdown on cell proliferation, apoptosis and the cell cycle were determined. MDGA2 interacting protein was identified by mass spectrometry and MDGA2-related cancer pathways by reporter activity and PCR array analyses. The clinical impact of MDGA2 was assessed in 218 patients with gastric cancer. Results MDGA2 was commonly silenced in gastric cancer cells (10/11) and primary gastric cancers due to promoter hypermethylation. MDGA2 significantly inhibited cell proliferation by causing G1–S cell cycle arrest and inducing cell apoptosis in vitro, and suppressed xenograft tumour growth in both subcutaneous and orthotopic xenograft mouse models (both p<0.001). The anti-tumorigenic effect of MDGA2 was mediated through direct stabilising of DNA methyltransferase 1 associated protein 1 (DMAP1), which played a tumour suppressive role in gastric cancer. This interaction activated their downstream key elements of p53/p21 signalling cascades. Moreover, promoter methylation of MDGA2 was detected in 62.4% (136/218) of gastric cancers. Multivariate analysis showed that patients with MDGA2 hypermethylation had a significantly decreased survival (p=0.005). Kaplan–Meier survival curves showed that MDGA2 hypermethylation was significantly associated with shortened survival in patients with early gastric cancer. Conclusions MDGA2 is a critical tumour suppressor in gastric carcinogenesis; its hypermethylation is an independent prognostic factor in patients with gastric cancer. PMID:26206665
High-Resolution Sequence-Function Mapping of Full-Length Proteins
Kowalsky, Caitlin A.; Klesmith, Justin R.; Stapleton, James A.; Kelly, Vince; Reichkitzer, Nolan; Whitehead, Timothy A.
2015-01-01
Comprehensive sequence-function mapping involves detailing the fitness contribution of every possible single mutation to a gene by comparing the abundance of each library variant before and after selection for the phenotype of interest. Deep sequencing of library DNA allows frequency reconstruction for tens of thousands of variants in a single experiment, yet short read lengths of current sequencers makes it challenging to probe genes encoding full-length proteins. Here we extend the scope of sequence-function maps to entire protein sequences with a modular, universal sequence tiling method. We demonstrate the approach with both growth-based selections and FACS screening, offer parameters and best practices that simplify design of experiments, and present analytical solutions to normalize data across independent selections. Using this protocol, sequence-function maps covering full sequences can be obtained in four to six weeks. Best practices introduced in this manuscript are fully compatible with, and complementary to, other recently published sequence-function mapping protocols. PMID:25790064
Yin, Changchuan
2015-04-01
To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
Gardiner, Laura-Jayne; Gawroński, Piotr; Olohan, Lisa; Schnurbusch, Thorsten; Hall, Neil; Hall, Anthony
2014-12-01
Mapping-by-sequencing analyses have largely required a complete reference sequence and employed whole genome re-sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re-sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early-flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene-rich regions of hexaploid bread wheat to design a 110-Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo-chromosomes derived from the capture probe target sequence, with a long-range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval. © 2014 The Authors.The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Davis, G L; McMullen, M D; Baysdorfer, C; Musket, T; Grant, D; Staebell, M; Xu, G; Polacco, M; Koster, L; Melia-Hancock, S; Houchins, K; Chao, S; Coe, E H
1999-01-01
We have constructed a 1736-locus maize genome map containing1156 loci probed by cDNAs, 545 probed by random genomic clones, 16 by simple sequence repeats (SSRs), 14 by isozymes, and 5 by anonymous clones. Sequence information is available for 56% of the loci with 66% of the sequenced loci assigned functions. A total of 596 new ESTs were mapped from a B73 library of 5-wk-old shoots. The map contains 237 loci probed by barley, oat, wheat, rice, or tripsacum clones, which serve as grass genome reference points in comparisons between maize and other grass maps. Ninety core markers selected for low copy number, high polymorphism, and even spacing along the chromosome delineate the 100 bins on the map. The average bin size is 17 cM. Use of bin assignments enables comparison among different maize mapping populations and experiments including those involving cytogenetic stocks, mutants, or quantitative trait loci. Integration of nonmaize markers in the map extends the resources available for gene discovery beyond the boundaries of maize mapping information into the expanse of map, sequence, and phenotype information from other grass species. This map provides a foundation for numerous basic and applied investigations including studies of gene organization, gene and genome evolution, targeted cloning, and dissection of complex traits. PMID:10388831
Mapping neurofibromatosis 1 homologous loci by fluorescence in situ hybridization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Viskochil, D.; Breidenbach, H.H.; Cawthon, R.
Neurofibromatosis 1 maps to chromosome band 17q11.2 and the NF1 gene is comprised of 59 exons that span approximately 335 kb of genomic DNA. In order to further analyze the structure of NF1 from exons 2 through 27b, we isolated a number of cosmid and bacteriophage P-1 genomic clones using NF1-exon probes under high-stringency hybridization conditions. Using tagged, intron-based primers and DNA from various clones as a template, we PCR-amplified and sequenced individual NF1 exons. The exon sequences in PCR products from several genomic clones differed from the exon sequence derived from cloned NF1 cDNAs. Clones with variant sequences weremore » mapped by fluorescence in situ hybridization under high-stringency conditions. Three clones mapped to chromosome band 15q11.2, one mapped to 14q11.2, one mapped to both 2q14.1-14.3 and 14q11.2, one mapped to 2q33-34, and one mapped to both 18q11.2 and 21q21. Even though some PCR-product sequences retained proper splice junctions and open reading frames, we have yet to identify cDNAs that correspond to the variant exon sequences. We are now sequencing clones that map to NF1-homologous loci in order to develop discriminating primer pairs for the exclusive amplification of NF1-specific sequences in our efforts to develop a comprehensive NF1 mutation screen using genomic DNA as template. The role of NF1-homologous sequences may play in neurofibromatosis 1 is not clear.« less
Shotgun Optical Maps of the Whole Escherichia coli O157:H7 Genome
Lim, Alex; Dimalanta, Eileen T.; Potamousis, Konstantinos D.; Yen, Galex; Apodoca, Jennifer; Tao, Chunhong; Lin, Jieyi; Qi, Rong; Skiadas, John; Ramanathan, Arvind; Perna, Nicole T.; Plunkett, Guy; Burland, Valerie; Mau, Bob; Hackett, Jeremiah; Blattner, Frederick R.; Anantharaman, Thomas S.; Mishra, Bhubaneswar; Schwartz, David C.
2001-01-01
We have constructed NheI and XhoI optical maps of Escherichia coli O157:H7 solely from genomic DNA molecules to provide a uniquely valuable scaffold for contig closure and sequence validation. E. coli O157:H7 is a common pathogen found in contaminated food and water. Our approach obviated the need for the analysis of clones, PCR products, and hybridizations, because maps were constructed from ensembles of single DNA molecules. Shotgun sequencing of bacterial genomes remains labor-intensive, despite advances in sequencing technology. This is partly due to manual intervention required during the last stages of finishing. The applicability of optical mapping to this problem was enhanced by advances in machine vision techniques that improved mapping throughput and created a path to full automation of mapping. Comparisons were made between maps and sequence data that characterized sequence gaps and guided nascent assemblies. PMID:11544203
The DNA Methylome of Human Peripheral Blood Mononuclear Cells
Ye, Mingzhi; Zheng, Hancheng; Yu, Jian; Wu, Honglong; Sun, Jihua; Zhang, Hongyu; Chen, Quan; Luo, Ruibang; Chen, Minfeng; He, Yinghua; Jin, Xin; Zhang, Qinghui; Yu, Chang; Zhou, Guangyu; Sun, Jinfeng; Huang, Yebo; Zheng, Huisong; Cao, Hongzhi; Zhou, Xiaoyu; Guo, Shicheng; Hu, Xueda; Li, Xin; Kristiansen, Karsten; Bolund, Lars; Xu, Jiujin; Wang, Wen; Yang, Huanming; Wang, Jian; Li, Ruiqiang; Beck, Stephan; Wang, Jun; Zhang, Xiuqing
2010-01-01
DNA methylation plays an important role in biological processes in human health and disease. Recent technological advances allow unbiased whole-genome DNA methylation (methylome) analysis to be carried out on human cells. Using whole-genome bisulfite sequencing at 24.7-fold coverage (12.3-fold per strand), we report a comprehensive (92.62%) methylome and analysis of the unique sequences in human peripheral blood mononuclear cells (PBMC) from the same Asian individual whose genome was deciphered in the YH project. PBMC constitute an important source for clinical blood tests world-wide. We found that 68.4% of CpG sites and <0.2% of non-CpG sites were methylated, demonstrating that non-CpG cytosine methylation is minor in human PBMC. Analysis of the PBMC methylome revealed a rich epigenomic landscape for 20 distinct genomic features, including regulatory, protein-coding, non-coding, RNA-coding, and repeat sequences. Integration of our methylome data with the YH genome sequence enabled a first comprehensive assessment of allele-specific methylation (ASM) between the two haploid methylomes of any individual and allowed the identification of 599 haploid differentially methylated regions (hDMRs) covering 287 genes. Of these, 76 genes had hDMRs within 2 kb of their transcriptional start sites of which >80% displayed allele-specific expression (ASE). These data demonstrate that ASM is a recurrent phenomenon and is highly correlated with ASE in human PBMCs. Together with recently reported similar studies, our study provides a comprehensive resource for future epigenomic research and confirms new sequencing technology as a paradigm for large-scale epigenomics studies. PMID:21085693
Localized periorbital edema as a clinical manifestation of sulfite sensitivity.
Park, H S; Nahm, D
1996-08-01
Sulfite is commonly used in pharmaceuticals as a preservative. We report a unique clinical presentation of localized periorbital edema on the left eye after administration of sulfite-containing dexamethasone. The patient's sulfite sensitivity was confirmed by sulfite oral provocation test: periorbital edema on the same site developed after ingestion of 200 mg sodium bisulfite. She was non-atopic and did not complain of any respiratory symptoms. Allergy skin prick test with 100 mg/ml sodium bisulfite showed a negative result. She also has aspirin-sensitive urticaria which was confirmed by oral provocation test. In conclusion, sulfite can induce a localized periorbital edema, an uncommon manifestation in sensitive patients. Further investigations are needed to clarify the pathogenetic mechanisms.
Georgi, Laura; Johnson-Cicalese, Jennifer; Honig, Josh; Das, Sushma Parankush; Rajah, Veeran D; Bhattacharya, Debashish; Bassil, Nahla; Rowland, Lisa J; Polashock, James; Vorsa, Nicholi
2013-03-01
The first genetic map of cranberry (Vaccinium macrocarpon) has been constructed, comprising 14 linkage groups totaling 879.9 cM with an estimated coverage of 82.2 %. This map, based on four mapping populations segregating for field fruit-rot resistance, contains 136 distinct loci. Mapped markers include blueberry-derived simple sequence repeat (SSR) and cranberry-derived sequence-characterized amplified region markers previously used for fingerprinting cranberry cultivars. In addition, SSR markers were developed near cranberry sequences resembling genes involved in flavonoid biosynthesis or defense against necrotrophic pathogens, or conserved orthologous set (COS) sequences. The cranberry SSRs were developed from next-generation cranberry genomic sequence assemblies; thus, the positions of these SSRs on the genomic map provide information about the genomic location of the sequence scaffold from which they were derived. The use of SSR markers near COS and other functional sequences, plus 33 SSR markers from blueberry, facilitates comparisons of this map with maps of other plant species. Regions of the cranberry map were identified that showed conservation of synteny with Vitis vinifera and Arabidopsis thaliana. Positioned on this map are quantitative trait loci (QTL) for field fruit-rot resistance (FFRR), fruit weight, titratable acidity, and sound fruit yield (SFY). The SFY QTL is adjacent to one of the fruit weight QTL and may reflect pleiotropy. Two of the FFRR QTL are in regions of conserved synteny with grape and span defense gene markers, and the third FFRR QTL spans a flavonoid biosynthetic gene.
JVM: Java Visual Mapping tool for next generation sequencing read.
Yang, Ye; Liu, Juan
2015-01-01
We developed a program JVM (Java Visual Mapping) for mapping next generation sequencing read to reference sequence. The program is implemented in Java and is designed to deal with millions of short read generated by sequence alignment using the Illumina sequencing technology. It employs seed index strategy and octal encoding operations for sequence alignments. JVM is useful for DNA-Seq, RNA-Seq when dealing with single-end resequencing. JVM is a desktop application, which supports reads capacity from 1 MB to 10 GB.
A reference linkage map for Eucalyptus
2012-01-01
Background Genetic linkage maps are invaluable resources in plant research. They provide a key tool for many genetic applications including: mapping quantitative trait loci (QTL); comparative mapping; identifying unlinked (i.e. independent) DNA markers for fingerprinting, population genetics and phylogenetics; assisting genome sequence assembly; relating physical and recombination distances along the genome and map-based cloning of genes. Eucalypts are the dominant tree species in most Australian ecosystems and of economic importance globally as plantation trees. The genome sequence of E. grandis has recently been released providing unprecedented opportunities for genetic and genomic research in the genus. A robust reference linkage map containing sequence-based molecular markers is needed to capitalise on this resource. Several high density linkage maps have recently been constructed for the main commercial forestry species in the genus (E. grandis, E. urophylla and E. globulus) using sequenced Diversity Arrays Technology (DArT) and microsatellite markers. To provide a single reference linkage map for eucalypts a composite map was produced through the integration of data from seven independent mapping experiments (1950 individuals) using a marker-merging method. Results The composite map totalled 1107 cM and contained 4101 markers; comprising 3880 DArT, 213 microsatellite and eight candidate genes. Eighty-one DArT markers were mapped to two or more linkage groups, resulting in the 4101 markers being mapped to 4191 map positions. Approximately 13% of DArT markers mapped to identical map positions, thus the composite map contained 3634 unique loci at an average interval of 0.31 cM. Conclusion The composite map represents the most saturated linkage map yet produced in Eucalyptus. As the majority of DArT markers contained on the map have been sequenced, the map provides a direct link to the E. grandis genome sequence and will serve as an important reference for progressing eucalypt research. PMID:22702473
Febrer, Melanie; Goicoechea, Jose Luis; Wright, Jonathan; McKenzie, Neil; Song, Xiang; Lin, Jinke; Collura, Kristi; Wissotski, Marina; Yu, Yeisoo; Ammiraju, Jetty S. S.; Wolny, Elzbieta; Idziak, Dominika; Betekhtin, Alexander; Kudrna, Dave; Hasterok, Robert; Wing, Rod A.; Bevan, Michael W.
2010-01-01
The pooid subfamily of grasses includes some of the most important crop, forage and turf species, such as wheat, barley and Lolium. Developing genomic resources, such as whole-genome physical maps, for analysing the large and complex genomes of these crops and for facilitating biological research in grasses is an important goal in plant biology. We describe a bacterial artificial chromosome (BAC)-based physical map of the wild pooid grass Brachypodium distachyon and integrate this with whole genome shotgun sequence (WGS) assemblies using BAC end sequences (BES). The resulting physical map contains 26 contigs spanning the 272 Mb genome. BES from the physical map were also used to integrate a genetic map. This provides an independent vaildation and confirmation of the published WGS assembly. Mapped BACs were used in Fluorescence In Situ Hybridisation (FISH) experiments to align the integrated physical map and sequence assemblies to chromosomes with high resolution. The physical, genetic and cytogenetic maps, integrated with whole genome shotgun sequence assemblies, enhance the accuracy and durability of this important genome sequence and will directly facilitate gene isolation. PMID:20976139
Restoration of distorted depth maps calculated from stereo sequences
NASA Technical Reports Server (NTRS)
Damour, Kevin; Kaufman, Howard
1991-01-01
A model-based Kalman estimator is developed for spatial-temporal filtering of noise and other degradations in velocity and depth maps derived from image sequences or cinema. As an illustration of the proposed procedures, edge information from image sequences of rigid objects is used in the processing of the velocity maps by selecting from a series of models for directional adaptive filtering. Adaptive filtering then allows for noise reduction while preserving sharpness in the velocity maps. Results from several synthetic and real image sequences are given.
Patrício, Patrícia; Ramalho-Carvalho, João; Costa-Pinheiro, Pedro; Almeida, Mafalda; Barros-Silva, João Diogo; Vieira, Joana; Dias, Paula Cristina; Lobo, Francisco; Oliveira, Jorge; Teixeira, Manuel R; Henrique, Rui; Jeronimo, Carmen
2013-01-01
Expression of PAX2 (Paired-box 2) is suppressed through promoter methylation at the later stages of embryonic development, but eventually reactivated during carcinogenesis. Pax-2 is commonly expressed in the most prevalent renal cell tumour (RCT) subtypes—clear cell RCC (ccRCC), papillary RCC (pRCC) and oncocytoma—but not in chromophobe RCC (chrRCC), which frequently displays chromosome 10 loss (to which PAX2 is mapped). Herein, we assessed the epigenetic and/or genetic alterations affecting PAX2 expression in RCTs and evaluated its potential as biomarker. We tested 120 RCTs (30 of each main subtype) and four normal kidney tissues. Pax-2 expression was assessed by immunohistochemistry and PAX2 mRNA expression levels were determined by quantitative RT-PCR. PAX2 promoter methylation status was assessed by methylation-specific PCR and bisulfite sequencing. Chromosome 10 and PAX2 copy number alterations were determined by FISH. Pax-2 immunoexpression was significantly lower in chrRCC compared to other RCT subtypes. Using a 10% immunoexpression cut-off, Pax-2 immunoreactivity discriminated chrRCC from oncocytoma with 67% sensitivity and 90% specificity. PAX2 mRNA expression was significantly lower in chrRCC, compared to ccRCC, pRCC and oncocytoma, and transcript levels correlated with immunoexpression. Whereas no promoter methylation was found in RCTs or normal kidney, 69% of chrRCC displayed chromosome 10 monosomy, correlating with Pax-2 immunoexpression. We concluded that Pax-2 expression might be used as an ancillary tool to discriminate chrRCC from oncocytomas with overlapping morphological features. The biological rationale lies on the causal relation between Pax-2 expression and chromosome 10 monosomy, but not PAX2 promoter methylation, in chrRCC. PMID:23890189
Thibon, Cécile; Böcker, Caroline; Shinkaruk, Svitlana; Moine, Virginie; Darriet, Philippe; Dubourdieu, Denis
2016-05-15
Two main precursors (S-3-(hexan-1-ol)-l-cysteine and S-3-(hexan-1-ol)-l-glutathione) of 3-sulfanylhexanol (3SH, formerly named 3-mercaptohexanol) have been identified so far in grape juice but a correlation between precursor concentrations in grape juices and 3SH concentrations in wines is not always observed. This suggests that there may be other compounds associated with the aromatic potential. In this work, S-3-(hexanal)-glutathione (Glut-3SH-Al) and its bisulfite (Glut-3SH-SO3) adduct were identified in Sauvignon blanc grape juice by liquid chromatography coupled to Fourier transform mass spectrometry experiments. A partial purification of the compounds was carried out by Medium Pressure Liquid Chromatography (MPLC) on the reverse phase using 5L of grape juice. The addition of synthetized Glut-3SH-Al and Glut-3SH-SO3 in the synthetic medium induced a significant release of 3SH after fermentation. Moreover, we demonstrate that Glut-3SH-Al and its bisulfite adduct are present in grape juice and could be considered as new direct 3SH precursors with molar conversion yields close to 0.4%. Copyright © 2016. Published by Elsevier Ltd.
Pavlova, T V; Kashuba, V I; Muravenko, O V; Yenamandra, S P; Ivanova, T A; Zabarovskaia, V I; Rakhmanaliev, E R; Petrenko, L A; Pronina, I V; Loginov, V I; Iurkevich, O Iu; Kiselev, L L; Zelenin, A V; Zabarovskiĭ, E R
2009-01-01
New comparative genome hybridization technology on NotI-microarrays is presented (Karolinska Institute International Patent WO02/086163). The method is based on comparative genome hybridization of NotI-probes from tumor and normal genomic DNA with the principle of new DNA NotI-microarrays. Using this method 181 NotI linking loci from human chromosome 3 were analyzed in 200 malignant tumor samples from different organs: kidney, lung, breast, ovary, cervical, prostate. Most frequently (more than in 30%) aberrations--deletions, methylation,--were identified in NotI-sites located in MINT24, BHLHB2, RPL15, RARbeta1, ITGA9, RBSP3, VHL, ZIC4 genes, that suggests they probably are involved in cancer development. Methylation of these genomic loci was confirmed by methylation-specific PCR and bisulfite sequencing. The results demonstrate perspective of using this method to solve some oncogenomic problems.
Nie, Y C; Yu, L J; Guan, H; Zhao, Y; Rong, H B; Jiang, B W; Zhang, T
2017-06-01
As an important part of epigenetic marker, DNA methylation involves in the gene regulation and attracts a wide spread attention in biological auxology, geratology and oncology fields. In forensic science, because of the relative stable, heritable, abundant, and age-related characteristics, DNA methylation is considered to be a useful complement to the classic genetic markers for age-prediction, tissue-identification, and monozygotic twins' discrimination. Various methods for DNA methylation detection have been validated based on methylation sensitive restriction endonuclease, bisulfite modification and methylation-CpG binding protein. In recent years, it is reported that the third generation sequencing method can be used to detect DNA methylation. This paper aims to make a review on the detection method of DNA methylation and its applications in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine.
Terragni, Jolyon; Zhang, Guoqiang; Sun, Zhiyi; Pradhan, Sriharsa; Song, Lingyun; Crawford, Gregory E; Lacey, Michelle; Ehrlich, Melanie
2014-01-01
Notch intercellular signaling is critical for diverse developmental pathways and for homeostasis in various types of stem cells and progenitor cells. Because Notch gene products need to be precisely regulated spatially and temporally, epigenetics is likely to help control expression of Notch signaling genes. Reduced representation bisulfite sequencing (RRBS) indicated significant hypomethylation in myoblasts, myotubes, and skeletal muscle vs. many nonmuscle samples at intragenic or intergenic regions of the following Notch receptor or ligand genes: NOTCH1, NOTCH2, JAG2, and DLL1. An enzymatic assay of sites in or near these genes revealed unusually high enrichment of 5-hydroxymethylcytosine (up to 81%) in skeletal muscle, heart, and cerebellum. Epigenetics studies and gene expression profiles suggest that hypomethylation and/or hydroxymethylation help control expression of these genes in heart, brain, myoblasts, myotubes, and within skeletal muscle myofibers. Such regulation could promote cell renewal, cell maintenance, homeostasis, and a poised state for repair of tissue damage. PMID:24670287
Giehr, Pascal; Kyriakopoulos, Charalampos; Ficz, Gabriella; Wolf, Verena; Walter, Jörn
2016-05-01
DNA methylation and demethylation are opposing processes that when in balance create stable patterns of epigenetic memory. The control of DNA methylation pattern formation by replication dependent and independent demethylation processes has been suggested to be influenced by Tet mediated oxidation of 5mC. Several alternative mechanisms have been proposed suggesting that 5hmC influences either replication dependent maintenance of DNA methylation or replication independent processes of active demethylation. Using high resolution hairpin oxidative bisulfite sequencing data, we precisely determine the amount of 5mC and 5hmC and model the contribution of 5hmC to processes of demethylation in mouse ESCs. We develop an extended hidden Markov model capable of accurately describing the regional contribution of 5hmC to demethylation dynamics. Our analysis shows that 5hmC has a strong impact on replication dependent demethylation, mainly by impairing methylation maintenance.
Whole genome DNA methylation: beyond genes silencing.
Tirado-Magallanes, Roberto; Rebbani, Khadija; Lim, Ricky; Pradhan, Sriharsa; Benoukraf, Touati
2017-01-17
The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation at near base pair level resolution, far beyond that of the kilobase-long canonical CpG islands that initially revealed the biological relevance of this covalent DNA modification. The latest high-resolution studies have revealed a role for very punctual DNA methylation in chromatin plasticity, gene regulation and splicing. Here, we aim to outline the major biological consequences of DNA methylation recently discovered. We also discuss the necessity of tuning DNA methylation resolution into an adequate scale to ease the integration of the methylome information with other chromatin features and transcription events such as gene expression, nucleosome positioning, transcription factors binding dynamic, gene splicing and genomic imprinting. Finally, our review sheds light on DNA methylation heterogeneity in cell population and the different approaches used for its assessment, including the contribution of single cell DNA analysis technology.
Whole genome DNA methylation: beyond genes silencing
Tirado-Magallanes, Roberto; Rebbani, Khadija; Lim, Ricky; Pradhan, Sriharsa; Benoukraf, Touati
2017-01-01
The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation at near base pair level resolution, far beyond that of the kilobase-long canonical CpG islands that initially revealed the biological relevance of this covalent DNA modification. The latest high-resolution studies have revealed a role for very punctual DNA methylation in chromatin plasticity, gene regulation and splicing. Here, we aim to outline the major biological consequences of DNA methylation recently discovered. We also discuss the necessity of tuning DNA methylation resolution into an adequate scale to ease the integration of the methylome information with other chromatin features and transcription events such as gene expression, nucleosome positioning, transcription factors binding dynamic, gene splicing and genomic imprinting. Finally, our review sheds light on DNA methylation heterogeneity in cell population and the different approaches used for its assessment, including the contribution of single cell DNA analysis technology. PMID:27895318
Enduring epigenetic landmarks define the cancer microenvironment
Pidsley, Ruth; Lawrence, Mitchell G.; Zotenko, Elena; Niranjan, Birunthi; Statham, Aaron; Song, Jenny; Chabanon, Roman M.; Qu, Wenjia; Wang, Hong; Richards, Michelle; Nair, Shalima S.; Armstrong, Nicola J.; Nim, Hieu T.; Papargiris, Melissa; Balanathan, Preetika; French, Hugh; Peters, Timothy; Norden, Sam; Ryan, Andrew; Pedersen, John; Kench, James; Daly, Roger J.; Horvath, Lisa G.; Stricker, Phillip; Frydenberg, Mark; Taylor, Renea A.; Stirzaker, Clare; Risbridger, Gail P.; Clark, Susan J.
2018-01-01
The growth and progression of solid tumors involves dynamic cross-talk between cancer epithelium and the surrounding microenvironment. To date, molecular profiling has largely been restricted to the epithelial component of tumors; therefore, features underpinning the persistent protumorigenic phenotype of the tumor microenvironment are unknown. Using whole-genome bisulfite sequencing, we show for the first time that cancer-associated fibroblasts (CAFs) from localized prostate cancer display remarkably distinct and enduring genome-wide changes in DNA methylation, significantly at enhancers and promoters, compared to nonmalignant prostate fibroblasts (NPFs). Differentially methylated regions associated with changes in gene expression have cancer-related functions and accurately distinguish CAFs from NPFs. Remarkably, a subset of changes is shared with prostate cancer epithelial cells, revealing the new concept of tumor-specific epigenome modifications in the tumor and its microenvironment. The distinct methylome of CAFs provides a novel epigenetic hallmark of the cancer microenvironment and promises new biomarkers to improve interpretation of diagnostic samples. PMID:29650553
A hybrid BAC physical map of potato: a framework for sequencing a heterozygous genome
2011-01-01
Background Potato is the world's third most important food crop, yet cultivar improvement and genomic research in general remain difficult because of the heterozygous and tetraploid nature of its genome. The development of physical map resources that can facilitate genomic analyses in potato has so far been very limited. Here we present the methods of construction and the general statistics of the first two genome-wide BAC physical maps of potato, which were made from the heterozygous diploid clone RH89-039-16 (RH). Results First, a gel electrophoresis-based physical map was made by AFLP fingerprinting of 64478 BAC clones, which were aligned into 4150 contigs with an estimated total length of 1361 Mb. Screening of BAC pools, followed by the KeyMaps in silico anchoring procedure, identified 1725 AFLP markers in the physical map, and 1252 BAC contigs were anchored the ultradense potato genetic map. A second, sequence-tag-based physical map was constructed from 65919 whole genome profiling (WGP) BAC fingerprints and these were aligned into 3601 BAC contigs spanning 1396 Mb. The 39733 BAC clones that overlap between both physical maps provided anchors to 1127 contigs in the WGP physical map, and reduced the number of contigs to around 2800 in each map separately. Both physical maps were 1.64 times longer than the 850 Mb potato genome. Genome heterozygosity and incomplete merging of BAC contigs are two factors that can explain this map inflation. The contig information of both physical maps was united in a single table that describes hybrid potato physical map. Conclusions The AFLP physical map has already been used by the Potato Genome Sequencing Consortium for sequencing 10% of the heterozygous genome of clone RH on a BAC-by-BAC basis. By layering a new WGP physical map on top of the AFLP physical map, a genetically anchored genome-wide framework of 322434 sequence tags has been created. This reference framework can be used for anchoring and ordering of genomic sequences of clone RH (and other potato genotypes), and opens the possibility to finish sequencing of the RH genome in a more efficient way via high throughput next generation approaches. PMID:22142254
Sequence-structure mapping errors in the PDB: OB-fold domains
Venclovas, Česlovas; Ginalski, Krzysztof; Kang, Chulhee
2004-01-01
The Protein Data Bank (PDB) is the single most important repository of structural data for proteins and other biologically relevant molecules. Therefore, it is critically important to keep the PDB data, as much as possible, error-free. In this study, we have analyzed PDB crystal structures possessing oligonucleotide/oligosaccharide binding (OB)-fold, one of the highly populated folds, for the presence of sequence-structure mapping errors. Using energy-based structure quality assessment coupled with sequence analyses, we have found that there are at least five OB-structures in the PDB that have regions where sequences have been incorrectly mapped onto the structure. We have demonstrated that the combination of these computation techniques is effective not only in detecting sequence-structure mapping errors, but also in providing guidance to correct them. Namely, we have used results of computational analysis to direct a revision of X-ray data for one of the PDB entries containing a fairly inconspicuous sequence-structure mapping error. The revised structure has been deposited with the PDB. We suggest use of computational energy assessment and sequence analysis techniques to facilitate structure determination when homologs having known structure are available to use as a reference. Such computational analysis may be useful in either guiding the sequence-structure assignment process or verifying the sequence mapping within poorly defined regions. PMID:15133161
Johnston, Christopher D; Skeete, Chelsey A; Fomenkov, Alexey; Roberts, Richard J; Rittling, Susan R
2017-01-01
Prevotella intermedia, a major periodontal pathogen, is increasingly implicated in human respiratory tract and cystic fibrosis lung infections. Nevertheless, the specific mechanisms employed by this pathogen remain only partially characterized and poorly understood, largely due to its total lack of genetic accessibility. Here, using Single Molecule, Real-Time (SMRT) genome and methylome sequencing, bisulfite sequencing, in addition to cloning and restriction analysis, we define the specific genetic barriers to exogenous DNA present in two of the most widespread laboratory strains, P. intermedia ATCC 25611 and P. intermedia Strain 17. We identified and characterized multiple restriction-modification (R-M) systems, some of which are considerably divergent between the two strains. We propose that these R-M systems are the root cause of the P. intermedia transformation barrier. Additionally, we note the presence of conserved Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) systems in both strains, which could provide a further barrier to exogenous DNA uptake and incorporation. This work will provide a valuable resource during the development of a genetic system for P. intermedia, which will be required for fundamental investigation of this organism's physiology, metabolism, and pathogenesis in human disease.
Skeete, Chelsey A.; Fomenkov, Alexey; Roberts, Richard J.; Rittling, Susan R.
2017-01-01
Prevotella intermedia, a major periodontal pathogen, is increasingly implicated in human respiratory tract and cystic fibrosis lung infections. Nevertheless, the specific mechanisms employed by this pathogen remain only partially characterized and poorly understood, largely due to its total lack of genetic accessibility. Here, using Single Molecule, Real-Time (SMRT) genome and methylome sequencing, bisulfite sequencing, in addition to cloning and restriction analysis, we define the specific genetic barriers to exogenous DNA present in two of the most widespread laboratory strains, P. intermedia ATCC 25611 and P. intermedia Strain 17. We identified and characterized multiple restriction-modification (R-M) systems, some of which are considerably divergent between the two strains. We propose that these R-M systems are the root cause of the P. intermedia transformation barrier. Additionally, we note the presence of conserved Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) systems in both strains, which could provide a further barrier to exogenous DNA uptake and incorporation. This work will provide a valuable resource during the development of a genetic system for P. intermedia, which will be required for fundamental investigation of this organism’s physiology, metabolism, and pathogenesis in human disease. PMID:28934361
HetMappsS: Heterozygous mapping strategy for high resolution Genotyping-by-Sequencing Markers
USDA-ARS?s Scientific Manuscript database
Reduced representation genotyping approaches, such as genotyping-by-sequencing (GBS), provide opportunities to generate high-resolution genetic maps at a low per-sample cost. However, missing data and non-uniform sequence coverage can complicate map creation in highly heterozygous species. To facili...
A clone-free, single molecule map of the domestic cow (Bos taurus) genome.
Zhou, Shiguo; Goldstein, Steve; Place, Michael; Bechner, Michael; Patino, Diego; Potamousis, Konstantinos; Ravindran, Prabu; Pape, Louise; Rincon, Gonzalo; Hernandez-Ortiz, Juan; Medrano, Juan F; Schwartz, David C
2015-08-28
The cattle (Bos taurus) genome was originally selected for sequencing due to its economic importance and unique biology as a model organism for understanding other ruminants, or mammals. Currently, there are two cattle genome sequence assemblies (UMD3.1 and Btau4.6) from groups using dissimilar assembly algorithms, which were complemented by genetic and physical map resources. However, past comparisons between these assemblies revealed substantial differences. Consequently, such discordances have engendered ambiguities when using reference sequence data, impacting genomic studies in cattle and motivating construction of a new optical map resource--BtOM1.0--to guide comparisons and improvements to the current sequence builds. Accordingly, our comprehensive comparisons of BtOM1.0 against the UMD3.1 and Btau4.6 sequence builds tabulate large-to-immediate scale discordances requiring mediation. The optical map, BtOM1.0, spanning the B. taurus genome (Hereford breed, L1 Dominette 01449) was assembled from an optical map dataset consisting of 2,973,315 (439 X; raw dataset size before assembly) single molecule optical maps (Rmaps; 1 Rmap = 1 restriction mapped DNA molecule) generated by the Optical Mapping System. The BamHI map spans 2,575.30 Mb and comprises 78 optical contigs assembled by a combination of iterative (using the reference sequence: UMD3.1) and de novo assembly techniques. BtOM1.0 is a high-resolution physical map featuring an average restriction fragment size of 8.91 Kb. Comparisons of BtOM1.0 vs. UMD3.1, or Btau4.6, revealed that Btau4.6 presented far more discordances (7,463) vs. UMD3.1 (4,754). Overall, we found that Btau4.6 presented almost double the number of discordances than UMD3.1 across most of the 6 categories of sequence vs. map discrepancies, which are: COMPLEX (misassembly), DELs (extraneous sequences), INSs (missing sequences), ITs (Inverted/Translocated sequences), ECs (extra restriction cuts) and MCs (missing restriction cuts). Alignments of UMD3.1 and Btau4.6 to BtOM1.0 reveal discordances commensurate with previous reports, and affirm the NCBI's current designation of UMD3.1 sequence assembly as the "reference assembly" and the Btau4.6 as the "alternate assembly." The cattle genome optical map, BtOM1.0, when used as a comprehensive and largely independent guide, will greatly assist improvements to existing sequence builds, and later serve as an accurate physical scaffold for studies concerning the comparative genomics of cattle breeds.
Speech processing using conditional observable maximum likelihood continuity mapping
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hogden, John; Nix, David
A computer implemented method enables the recognition of speech and speech characteristics. Parameters are initialized of first probability density functions that map between the symbols in the vocabulary of one or more sequences of speech codes that represent speech sounds and a continuity map. Parameters are also initialized of second probability density functions that map between the elements in the vocabulary of one or more desired sequences of speech transcription symbols and the continuity map. The parameters of the probability density functions are then trained to maximize the probabilities of the desired sequences of speech-transcription symbols. A new sequence ofmore » speech codes is then input to the continuity map having the trained first and second probability function parameters. A smooth path is identified on the continuity map that has the maximum probability for the new sequence of speech codes. The probability of each speech transcription symbol for each input speech code can then be output.« less
2012-01-01
Background Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. Results An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison. PMID:22908993
Long Read Alignment with Parallel MapReduce Cloud Platform
Al-Absi, Ahmed Abdulhakim; Kang, Dae-Ki
2015-01-01
Genomic sequence alignment is an important technique to decode genome sequences in bioinformatics. Next-Generation Sequencing technologies produce genomic data of longer reads. Cloud platforms are adopted to address the problems arising from storage and analysis of large genomic data. Existing genes sequencing tools for cloud platforms predominantly consider short read gene sequences and adopt the Hadoop MapReduce framework for computation. However, serial execution of map and reduce phases is a problem in such systems. Therefore, in this paper, we introduce Burrows-Wheeler Aligner's Smith-Waterman Alignment on Parallel MapReduce (BWASW-PMR) cloud platform for long sequence alignment. The proposed cloud platform adopts a widely accepted and accurate BWA-SW algorithm for long sequence alignment. A custom MapReduce platform is developed to overcome the drawbacks of the Hadoop framework. A parallel execution strategy of the MapReduce phases and optimization of Smith-Waterman algorithm are considered. Performance evaluation results exhibit an average speed-up of 6.7 considering BWASW-PMR compared with the state-of-the-art Bwasw-Cloud. An average reduction of 30% in the map phase makespan is reported across all experiments comparing BWASW-PMR with Bwasw-Cloud. Optimization of Smith-Waterman results in reducing the execution time by 91.8%. The experimental study proves the efficiency of BWASW-PMR for aligning long genomic sequences on cloud platforms. PMID:26839887
Long Read Alignment with Parallel MapReduce Cloud Platform.
Al-Absi, Ahmed Abdulhakim; Kang, Dae-Ki
2015-01-01
Genomic sequence alignment is an important technique to decode genome sequences in bioinformatics. Next-Generation Sequencing technologies produce genomic data of longer reads. Cloud platforms are adopted to address the problems arising from storage and analysis of large genomic data. Existing genes sequencing tools for cloud platforms predominantly consider short read gene sequences and adopt the Hadoop MapReduce framework for computation. However, serial execution of map and reduce phases is a problem in such systems. Therefore, in this paper, we introduce Burrows-Wheeler Aligner's Smith-Waterman Alignment on Parallel MapReduce (BWASW-PMR) cloud platform for long sequence alignment. The proposed cloud platform adopts a widely accepted and accurate BWA-SW algorithm for long sequence alignment. A custom MapReduce platform is developed to overcome the drawbacks of the Hadoop framework. A parallel execution strategy of the MapReduce phases and optimization of Smith-Waterman algorithm are considered. Performance evaluation results exhibit an average speed-up of 6.7 considering BWASW-PMR compared with the state-of-the-art Bwasw-Cloud. An average reduction of 30% in the map phase makespan is reported across all experiments comparing BWASW-PMR with Bwasw-Cloud. Optimization of Smith-Waterman results in reducing the execution time by 91.8%. The experimental study proves the efficiency of BWASW-PMR for aligning long genomic sequences on cloud platforms.
SNP discovery by high-throughput sequencing in soybean
2010-01-01
Background With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD), have been widely applied to identify genome-wide sequence variations. However, it is still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology. Results A total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty four SNP markers resulted from the validation of putative SNPs and have been selectively chosen to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 K bp. Conclusions We have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in a same genetic population, which can be applied to other crops. PMID:20701770
Mariner 9 mapping science sequence design.
NASA Technical Reports Server (NTRS)
Goldman, A. M., Jr.
1973-01-01
The primary mission of Mariner 9 was to map the Martian surface. This paper discusses in detail the design of the mapping science sequences which were executed by the spacecraft in sixty days and during which over eighty percent of the surface was photographed. The sequence design was influenced by many factors: experimenter scientific objectives, instrument capabilities, spacecraft capabilities, orbit characteristics, and data return rates, which are illustrated graphically. Typical orbits are depicted for each of the three different mapping phases lasting twenty days. Examples of typical orbital sequence plans prepared daily during mission operations are given.
Möbius, Petra; Hölzer, Martin; Felder, Marius; Nordsiek, Gabriele; Groth, Marco; Köhler, Heike; Reichwald, Kathrin; Platzer, Matthias; Marz, Manja
2015-01-01
Mycobacterium avium (M. a.) subsp. paratuberculosis (MAP)—the etiologic agent of Johne’s disease—affects cattle, sheep, and other ruminants worldwide. To decipher phenotypic differences among sheep and cattle strains (belonging to MAP-S [Type-I/III], respectively, MAP-C [Type-II]), comparative genome analysis needs data from diverse isolates originating from different geographic regions of the world. This study presents the so far best assembled genome of a MAP-S-strain: Sheep isolate JIII-386 from Germany. One newly sequenced cattle isolate (JII-1961, Germany), four published MAP strains of MAP-C and MAP-S from the United States and Australia, and M. a. subsp. hominissuis (MAH) strain 104 were used for assembly improvement and comparisons. All genomes were annotated by BacProt and results compared with NCBI (National Center for Biotechnology Information) annotation. Corresponding protein-coding sequences (CDSs) were detected, but also CDSs that were exclusively determined by either NCBI or BacProt. A new Shine–Dalgarno sequence motif (5′-AGCTGG-3′) was extracted. Novel CDSs including PE-PGRS family protein genes and about 80 noncoding RNAs exhibiting high sequence conservation are presented. Previously found genetic differences between MAP-types are partially revised. Four of ten assumed MAP-S-specific large sequence polymorphism regions (LSPSs) are still present in MAP-C strains; new LSPSs were identified. Independently of the regional origin of the strains, the number of individual CDSs and single nucleotide variants confirms the strong similarity of MAP-C strains and shows higher diversity among MAP-S strains. This study gives ambiguous results regarding the hypothesis that MAP-S is the evolutionary intermediate between MAH and MAP-C, but it clearly shows a higher similarity of MAP to MAH than to Mycobacterium intracellulare. PMID:26384038
A note on chaotic unimodal maps and applications.
Zhou, C T; He, X T; Yu, M Y; Chew, L Y; Wang, X G
2006-09-01
Based on the word-lift technique of symbolic dynamics of one-dimensional unimodal maps, we investigate the relation between chaotic kneading sequences and linear maximum-length shift-register sequences. Theoretical and numerical evidence that the set of the maximum-length shift-register sequences is a subset of the set of the universal sequence of one-dimensional chaotic unimodal maps is given. By stabilizing unstable periodic orbits on superstable periodic orbits, we also develop techniques to control the generation of long binary sequences.
Localized periorbital edema as a clinical manifestation of sulfite sensitivity.
Park, H. S.; Nahm, D.
1996-01-01
Sulfite is commonly used in pharmaceuticals as a preservative. We report a unique clinical presentation of localized periorbital edema on the left eye after administration of sulfite-containing dexamethasone. The patient's sulfite sensitivity was confirmed by sulfite oral provocation test: periorbital edema on the same site developed after ingestion of 200 mg sodium bisulfite. She was non-atopic and did not complain of any respiratory symptoms. Allergy skin prick test with 100 mg/ml sodium bisulfite showed a negative result. She also has aspirin-sensitive urticaria which was confirmed by oral provocation test. In conclusion, sulfite can induce a localized periorbital edema, an uncommon manifestation in sensitive patients. Further investigations are needed to clarify the pathogenetic mechanisms. PMID:8878807
Savio, Andrea J; Mrkonjic, Miralem; Lemire, Mathieu; Gallinger, Steven; Knight, Julia A; Bapat, Bharat
2017-01-01
Colorectal cancers (CRCs) undergo distinct genetic and epigenetic alterations. Expression of mutL homolog 1 ( MLH1 ), a mismatch repair gene that corrects DNA replication errors, is lost in up to 15% of sporadic tumours due to mutation or, more commonly, due to DNA methylation of its promoter CpG island. A single nucleotide polymorphism (SNP) in the CpG island of MLH1 ( MLH1 -93G>A or rs1800734) is associated with CpG island hypermethylation and decreased MLH1 expression in CRC tumours. Further, in peripheral blood mononuclear cell (PBMC) DNA of both CRC cases and non-cancer controls, the variant allele of rs1800734 is associated with hypomethylation at the MLH1 shore, a region upstream of its CpG island that is less dense in CpG sites . To determine whether this genotype-epigenotype association is present in other tissue types, including colorectal tumours, we assessed DNA methylation in matched normal colorectal tissue, tumour, and PBMC DNA from 349 population-based CRC cases recruited from the Ontario Familial Colorectal Cancer Registry. Using the semi-quantitative real-time PCR-based MethyLight assay, MLH1 shore methylation was significantly higher in tumour tissue than normal colon or PBMCs ( P < 0.01). When shore methylation levels were stratified by SNP genotype, normal colorectal DNA and PBMC DNA were significantly hypomethylated in association with variant SNP genotype ( P < 0.05). However, this association was lost in tumour DNA. Among distinct stages of CRC, metastatic stage IV CRC tumours incurred significant hypomethylation compared to stage I-III cases, irrespective of genotype status. Shore methylation of MLH1 was not associated with MSI status or promoter CpG island hypermethylation, regardless of genotype. To confirm these results, bisulfite sequencing was performed in matched tumour and normal colorectal specimens from six CRC cases, including two cases per genotype (wildtype, heterozygous, and homozygous variant). Bisulfite sequencing results corroborated the methylation patterns found by MethyLight, with significant hypomethylation in normal colorectal tissue of variant SNP allele carriers. These results indicate that the normal tissue types tested (colorectum and PBMC) experience dynamic genotype-associated epigenetic alterations at the MLH1 shore, whereas tumour DNA incurs aberrant hypermethylation compared to normal DNA.
Pandey, Ram Vinay; Pulverer, Walter; Kallmeyer, Rainer; Beikircher, Gabriel; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas
2016-01-01
Bisulfite (BS) conversion-based and methylation-sensitive restriction enzyme (MSRE)-based PCR methods have been the most commonly used techniques for locus-specific DNA methylation analysis. However, both methods have advantages and limitations. Thus, an integrated approach would be extremely useful to quantify the DNA methylation status successfully with great sensitivity and specificity. Designing specific and optimized primers for target regions is the most critical and challenging step in obtaining the adequate DNA methylation results using PCR-based methods. Currently, no integrated, optimized, and high-throughput methylation-specific primer design software methods are available for both BS- and MSRE-based methods. Therefore an integrated, powerful, and easy-to-use methylation-specific primer design pipeline with great accuracy and success rate will be very useful. We have developed a new web-based pipeline, called MSP-HTPrimer, to design primers pairs for MSP, BSP, pyrosequencing, COBRA, and MSRE assays on both genomic strands. First, our pipeline converts all target sequences into bisulfite-treated templates for both forward and reverse strand and designs all possible primer pairs, followed by filtering for single nucleotide polymorphisms (SNPs) and known repeat regions. Next, each primer pairs are annotated with the upstream and downstream RefSeq genes, CpG island, and cut sites (for COBRA and MSRE). Finally, MSP-HTPrimer selects specific primers from both strands based on custom and user-defined hierarchical selection criteria. MSP-HTPrimer produces a primer pair summary output table in TXT and HTML format for display and UCSC custom tracks for resulting primer pairs in GTF format. MSP-HTPrimer is an integrated, web-based, and high-throughput pipeline and has no limitation on the number and size of target sequences and designs MSP, BSP, pyrosequencing, COBRA, and MSRE assays. It is the only pipeline, which automatically designs primers on both genomic strands to increase the success rate. It is a standalone web-based pipeline, which is fully configured within a virtual machine and thus can be readily used without any configuration. We have experimentally validated primer pairs designed by our pipeline and shown a very high success rate of primer pairs: out of 66 BSP primer pairs, 63 were successfully validated without any further optimization step and using the same qPCR conditions. The MSP-HTPrimer pipeline is freely available from http://sourceforge.net/p/msp-htprimer.
2012-01-01
Background Most modern citrus cultivars have an interspecific origin. As a foundational step towards deciphering the interspecific genome structures, a reference whole genome sequence was produced by the International Citrus Genome Consortium from a haploid derived from Clementine mandarin. The availability of a saturated genetic map of Clementine was identified as an essential prerequisite to assist the whole genome sequence assembly. Clementine is believed to be a ‘Mediterranean’ mandarin × sweet orange hybrid, and sweet orange likely arose from interspecific hybridizations between mandarin and pummelo gene pools. The primary goals of the present study were to establish a Clementine reference map using codominant markers, and to perform comparative mapping of pummelo, sweet orange, and Clementine. Results Five parental genetic maps were established from three segregating populations, which were genotyped with Single Nucleotide Polymorphism (SNP), Simple Sequence Repeats (SSR) and Insertion-Deletion (Indel) markers. An initial medium density reference map (961 markers for 1084.1 cM) of the Clementine was established by combining male and female Clementine segregation data. This Clementine map was compared with two pummelo maps and a sweet orange map. The linear order of markers was highly conserved in the different species. However, significant differences in map size were observed, which suggests a variation in the recombination rates. Skewed segregations were much higher in the male than female Clementine mapping data. The mapping data confirmed that Clementine arose from hybridization between ‘Mediterranean’ mandarin and sweet orange. The results identified nine recombination break points for the sweet orange gamete that contributed to the Clementine genome. Conclusions A reference genetic map of citrus, used to facilitate the chromosome assembly of the first citrus reference genome sequence, was established. The high conservation of marker order observed at the interspecific level should allow reasonable inferences of most citrus genome sequences by mapping next-generation sequencing (NGS) data in the reference genome sequence. The genome of the haploid Clementine used to establish the citrus reference genome sequence appears to have been inherited primarily from the ‘Mediterranean’ mandarin. The high frequency of skewed allelic segregations in the male Clementine data underline the probable extent of deviation from Mendelian segregation for characters controlled by heterozygous loci in male parents. PMID:23126659
Solignac, Michel; Mougel, Florence; Vautrin, Dominique; Monnerot, Monique; Cornuet, Jean-Marie
2007-01-01
The honey bee is a key model for social behavior and this feature led to the selection of the species for genome sequencing. A genetic map is a necessary companion to the sequence. In addition, because there was originally no physical map for the honey bee genome project, a meiotic map was the only resource for organizing the sequence assembly on the chromosomes. We present the genetic (meiotic) map here and describe the main features that emerged from comparison with the sequence-based physical map. The genetic map of the honey bee is saturated and the chromosomes are oriented from the centromeric to the telomeric regions. The map is based on 2,008 markers and is about 40 Morgans (M) long, resulting in a marker density of one every 2.05 centiMorgans (cM). For the 186 megabases (Mb) of the genome mapped and assembled, this corresponds to a very high average recombination rate of 22.04 cM/Mb. Honey bee meiosis shows a relatively homogeneous recombination rate along and across chromosomes, as well as within and between individuals. Interference is higher than inferred from the Kosambi function of distance. In addition, numerous recombination hotspots are dispersed over the genome. The very large genetic length of the honey bee genome, its small physical size and an almost complete genome sequence with a relatively low number of genes suggest a very promising future for association mapping in the honey bee, particularly as the existence of haploid males allows easy bulk segregant analysis.
Sequencing of cDNA Clones from the Genetic Map of Tomato (Lycopersicon esculentum)
Ganal, Martin W.; Czihal, Rosemarie; Hannappel, Ulrich; Kloos, Dorothee-U.; Polley, Andreas; Ling, Hong-Qing
1998-01-01
The dense RFLP linkage map of tomato (Lycopersicon esculentum) contains >300 anonymous cDNA clones. Of those clones, 272 were partially or completely sequenced. The sequences were compared at the DNA and protein level to known genes in databases. For 57% of the clones, a significant match to previously described genes was found. The information will permit the conversion of those markers to STS markers and allow their use in PCR-based mapping experiments. Furthermore, it will facilitate the comparative mapping of genes across distantly related plant species by direct comparison of DNA sequences and map positions. [cDNA sequence data reported in this paper have been submitted to the EMBL database under accession nos. AA824695–AA825005 and the dbEST_Id database under accession nos. 1546519–1546862.] PMID:9724330
Draft Sequences of the Radish (Raphanus sativus L.) Genome
Kitashiba, Hiroyasu; Li, Feng; Hirakawa, Hideki; Kawanabe, Takahiro; Zou, Zhongwei; Hasegawa, Yoichi; Tonosaki, Kaoru; Shirasawa, Sachiko; Fukushima, Aki; Yokoi, Shuji; Takahata, Yoshihito; Kakizaki, Tomohiro; Ishida, Masahiko; Okamoto, Shunsuke; Sakamoto, Koji; Shirasawa, Kenta; Tabata, Satoshi; Nishio, Takeshi
2014-01-01
Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified. PMID:24848699
He, Bing; Caudy, Amy; Parsons, Lance; Rosebrock, Adam; Pane, Attilio; Raj, Sandeep; Wieschaus, Eric
2012-01-01
Heterochromatin represents a significant portion of eukaryotic genomes and has essential structural and regulatory functions. Its molecular organization is largely unknown due to difficulties in sequencing through and assembling repetitive sequences enriched in the heterochromatin. Here we developed a novel strategy using chromosomal rearrangements and embryonic phenotypes to position unmapped Drosophila melanogaster heterochromatic sequence to specific chromosomal regions. By excluding sequences that can be mapped to the assembled euchromatic arms, we identified sequences that are specific to heterochromatin and used them to design heterochromatin specific probes (“H-probes”) for microarray. By comparative genomic hybridization (CGH) analyses of embryos deficient for each chromosome or chromosome arm, we were able to map most of our H-probes to specific chromosome arms. We also positioned sequences mapped to the second and X chromosomes to finer intervals by analyzing smaller deletions with breakpoints in heterochromatin. Using this approach, we were able to map >40% (13.9 Mb) of the previously unmapped heterochromatin sequences assembled by the whole-genome sequencing effort on arm U and arm Uextra to specific locations. We also identified and mapped 110 kb of novel heterochromatic sequences. Subsequent analyses revealed that sequences located within different heterochromatic regions have distinct properties, such as sequence composition, degree of repetitiveness, and level of underreplication in polytenized tissues. Surprisingly, although heterochromatin is generally considered to be transcriptionally silent, we detected region-specific temporal patterns of transcription in heterochromatin during oogenesis and early embryonic development. Our study provides a useful approach to elucidate the molecular organization and function of heterochromatin and reveals region-specific variation of heterochromatin. PMID:22745230
A Single Molecule Scaffold for the Maize Genome
Zhou, Shiguo; Wei, Fusheng; Nguyen, John; Bechner, Mike; Potamousis, Konstantinos; Goldstein, Steve; Pape, Louise; Mehan, Michael R.; Churas, Chris; Pasternak, Shiran; Forrest, Dan K.; Wise, Roger; Ware, Doreen; Wing, Rod A.; Waterman, Michael S.; Livny, Miron; Schwartz, David C.
2009-01-01
About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/∼23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/∼2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars. PMID:19936062
[Multiplexing mapping of human cDNAs]. Final report, September 1, 1991--February 28, 1994
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
Using PCR with automated product analysis, 329 human brain cDNA sequences have been assigned to individual human chromosomes. Primers were designed from single-pass cDNA sequences expressed sequence tags (ESTs). Primers were used in PCR reactions with DNA from somatic cell hybrid mapping panels as templates, often with multiplexing. Many ESTs mapped match sequence database records. To evaluate of these matches, the position of the primers relative to the matching region (In), the BLAST scores and the Poisson probability values of the EST/sequence record match were determined. In cases where the gene product was stringently identified by the sequence match hadmore » already been mapped, the gene locus determined by EST was consistent with the previous position which strongly supports the validity of assigning unknown genes to human chromosomes based on the EST sequence matches. In the present cases mapping the ESTs to a chromosome can also be considered to have mapped the known gene product: rolipram-sensitive cAMP phosphodiesterase, chromosome 1; protein phosphatase 2A{beta}, chromosome 4; alpha-catenin, chromosome 5; the ELE1 oncogene, chromosome 10q11.2 or q2.1-q23; MXII protein, chromosome l0q24-qter; ribosomal protein L18a homologue, chromosome 14; ribosomal protein L3, chromosome 17; and moesin, Xp11-cen. There were also ESTs mapped that were closely related to non-human sequence records. These matches therefore can be considered to identify human counterparts of known gene products, or members of known gene families. Examples of these include membrane proteins, translation-associated proteins, structural proteins, and enzymes. These data then demonstrate that single pass sequence information is sufficient to design PCR primers useful for assigning cDNA sequences to human chromosomes. When the EST sequence matches previous sequence database records, the chromosome assignments of the EST can be used to make preliminary assignments of the human gene to a chromosome.« less
Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi
2016-06-15
Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After obtaining sequence reads, the short reads are mapped to a reference genome, and specific mapping patterns can be detected called read mapping profiles, which are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods for correctly clustering the read mapping profiles with respect to 5'-end processing and 3'-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA004502. yasu@bio.keio.ac.jp Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R
2004-01-01
A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Kumar, Ajay; Seetan, Raed; Mergoum, Mohamed; Tiwari, Vijay K; Iqbal, Muhammad J; Wang, Yi; Al-Azzam, Omar; Šimková, Hana; Luo, Ming-Cheng; Dvorak, Jan; Gu, Yong Q; Denton, Anne; Kilian, Andrzej; Lazo, Gerard R; Kianian, Shahryar F
2015-10-16
The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high resolution genome maps with saturated marker scaffolds to anchor and orient BAC contigs/ sequence scaffolds for whole genome assembly. Radiation hybrid (RH) mapping has proven to be an excellent tool for the development of such maps for it offers much higher and more uniform marker resolution across the length of the chromosome compared to genetic mapping and does not require marker polymorphism per se, as it is based on presence (retention) vs. absence (deletion) marker assay. In this study, a 178 line RH panel was genotyped with SSRs and DArT markers to develop the first high resolution RH maps of the entire D-genome of Ae. tauschii accession AL8/78. To confirm map order accuracy, the AL8/78-RH maps were compared with:1) a DArT consensus genetic map constructed using more than 100 bi-parental populations, 2) a RH map of the D-genome of reference hexaploid wheat 'Chinese Spring', and 3) two SNP-based genetic maps, one with anchored D-genome BAC contigs and another with anchored D-genome sequence scaffolds. Using marker sequences, the RH maps were also anchored with a BAC contig based physical map and draft sequence of the D-genome of Ae. tauschii. A total of 609 markers were mapped to 503 unique positions on the seven D-genome chromosomes, with a total map length of 14,706.7 cR. The average distance between any two marker loci was 29.2 cR which corresponds to 2.1 cM or 9.8 Mb. The average mapping resolution across the D-genome was estimated to be 0.34 Mb (Mb/cR) or 0.07 cM (cM/cR). The RH maps showed almost perfect agreement with several published maps with regard to chromosome assignments of markers. The mean rank correlations between the position of markers on AL8/78 maps and the four published maps, ranged from 0.75 to 0.92, suggesting a good agreement in marker order. With 609 mapped markers, a total of 2481 deletions for the whole D-genome were detected with an average deletion size of 42.0 Mb. A total of 520 markers were anchored to 216 Ae. tauschii sequence scaffolds, 116 of which were not anchored earlier to the D-genome. This study reports the development of first high resolution RH maps for the D-genome of Ae. tauschii accession AL8/78, which were then used for the anchoring of unassigned sequence scaffolds. This study demonstrates how RH mapping, which offered high and uniform resolution across the length of the chromosome, can facilitate the complete sequence assembly of the large and complex plant genomes.
2012-01-01
Background Brassica oleracea encompass a family of vegetables and cabbage that are among the most widely cultivated crops. In 2009, the B. oleracea Genome Sequencing Project was launched using next generation sequencing technology. None of the available maps were detailed enough to anchor the sequence scaffolds for the Genome Sequencing Project. This report describes the development of a large number of SSR and SNP markers from the whole genome shotgun sequence data of B. oleracea, and the construction of a high-density genetic linkage map using a double haploid mapping population. Results The B. oleracea high-density genetic linkage map that was constructed includes 1,227 markers in nine linkage groups spanning a total of 1197.9 cM with an average of 0.98 cM between adjacent loci. There were 602 SSR markers and 625 SNP markers on the map. The chromosome with the highest number of markers (186) was C03, and the chromosome with smallest number of markers (99) was C09. Conclusions This first high-density map allowed the assembled scaffolds to be anchored to pseudochromosomes. The map also provides useful information for positional cloning, molecular breeding, and integration of information of genes and traits in B. oleracea. All the markers on the map will be transferable and could be used for the construction of other genetic maps. PMID:23033896
Detection of microRNAs in color space.
Marco, Antonio; Griffiths-Jones, Sam
2012-02-01
Deep sequencing provides inexpensive opportunities to characterize the transcriptional diversity of known genomes. The AB SOLiD technology generates millions of short sequencing reads in color-space; that is, the raw data is a sequence of colors, where each color represents 2 nt and each nucleotide is represented by two consecutive colors. This strategy is purported to have several advantages, including increased ability to distinguish sequencing errors from polymorphisms. Several programs have been developed to map short reads to genomes in color space. However, a number of previously unexplored technical issues arise when using SOLiD technology to characterize microRNAs. Here we explore these technical difficulties. First, since the sequenced reads are longer than the biological sequences, every read is expected to contain linker fragments. The color-calling error rate increases toward the 3(') end of the read such that recognizing the linker sequence for removal becomes problematic. Second, mapping in color space may lead to the loss of the first nucleotide of each read. We propose a sequential trimming and mapping approach to map small RNAs. Using our strategy, we reanalyze three published insect small RNA deep sequencing datasets and characterize 22 new microRNAs. A bash shell script to perform the sequential trimming and mapping procedure, called SeqTrimMap, is available at: http://www.mirbase.org/tools/seqtrimmap/ antonio.marco@manchester.ac.uk Supplementary data are available at Bioinformatics online.
2014-01-01
Background Tuber melanosporum, also known in the gastronomic community as “truffle”, features one of the largest fungal genomes (125 Mb) with an exceptionally high transposable element (TE) and repetitive DNA content (>58%). The main purpose of DNA methylation in fungi is TE silencing. As obligate outcrossing organisms, truffles are bound to a sexual mode of propagation, which together with TEs is thought to represent a major force driving the evolution of DNA methylation. Thus, it was of interest to examine if and how T. melanosporum exploits DNA methylation to maintain genome integrity. Findings We performed whole-genome DNA bisulfite sequencing and mRNA sequencing on different developmental stages of T. melanosporum; namely, fruitbody (“truffle”), free-living mycelium and ectomycorrhiza. The data revealed a high rate of cytosine methylation (>44%), selectively targeting TEs rather than genes with a strong preference for CpG sites. Whole genome DNA sequencing uncovered multiple TE-enriched, copy number variant regions bearing a significant fraction of hypomethylated and expressed TEs, almost exclusively in free-living mycelium propagated in vitro. Treatment of mycelia with 5-azacytidine partially reduced DNA methylation and increased TE transcription. Our transcriptome assembly also resulted in the identification of a set of novel transcripts from 614 genes. Conclusions The datasets presented here provide valuable and comprehensive (epi)genomic information that can be of interest for evolutionary genomics studies of multicellular (filamentous) fungi, in particular Ascomycetes belonging to the subphylum, Pezizomycotina. Evidence derived from comparative methylome and transcriptome analyses indicates that a non-exhaustive and partly reversible methylation process operates in truffles. PMID:25392735
Chen, Pao-Yang; Montanini, Barbara; Liao, Wen-Wei; Morselli, Marco; Jaroszewicz, Artur; Lopez, David; Ottonello, Simone; Pellegrini, Matteo
2014-01-01
Tuber melanosporum, also known in the gastronomic community as "truffle", features one of the largest fungal genomes (125 Mb) with an exceptionally high transposable element (TE) and repetitive DNA content (>58%). The main purpose of DNA methylation in fungi is TE silencing. As obligate outcrossing organisms, truffles are bound to a sexual mode of propagation, which together with TEs is thought to represent a major force driving the evolution of DNA methylation. Thus, it was of interest to examine if and how T. melanosporum exploits DNA methylation to maintain genome integrity. We performed whole-genome DNA bisulfite sequencing and mRNA sequencing on different developmental stages of T. melanosporum; namely, fruitbody ("truffle"), free-living mycelium and ectomycorrhiza. The data revealed a high rate of cytosine methylation (>44%), selectively targeting TEs rather than genes with a strong preference for CpG sites. Whole genome DNA sequencing uncovered multiple TE-enriched, copy number variant regions bearing a significant fraction of hypomethylated and expressed TEs, almost exclusively in free-living mycelium propagated in vitro. Treatment of mycelia with 5-azacytidine partially reduced DNA methylation and increased TE transcription. Our transcriptome assembly also resulted in the identification of a set of novel transcripts from 614 genes. The datasets presented here provide valuable and comprehensive (epi)genomic information that can be of interest for evolutionary genomics studies of multicellular (filamentous) fungi, in particular Ascomycetes belonging to the subphylum, Pezizomycotina. Evidence derived from comparative methylome and transcriptome analyses indicates that a non-exhaustive and partly reversible methylation process operates in truffles.
In Vivo Control of CpG and Non-CpG DNA Methylation by DNA Methyltransferases
Arand, Julia; Spieler, David; Karius, Tommy; Branco, Miguel R.; Meilinger, Daniela; Meissner, Alexander; Jenuwein, Thomas; Xu, Guoliang; Leonhardt, Heinrich; Wolf, Verena; Walter, Jörn
2012-01-01
The enzymatic control of the setting and maintenance of symmetric and non-symmetric DNA methylation patterns in a particular genome context is not well understood. Here, we describe a comprehensive analysis of DNA methylation patterns generated by high resolution sequencing of hairpin-bisulfite amplicons of selected single copy genes and repetitive elements (LINE1, B1, IAP-LTR-retrotransposons, and major satellites). The analysis unambiguously identifies a substantial amount of regional incomplete methylation maintenance, i.e. hemimethylated CpG positions, with variant degrees among cell types. Moreover, non-CpG cytosine methylation is confined to ESCs and exclusively catalysed by Dnmt3a and Dnmt3b. This sequence position–, cell type–, and region-dependent non-CpG methylation is strongly linked to neighboring CpG methylation and requires the presence of Dnmt3L. The generation of a comprehensive data set of 146,000 CpG dyads was used to apply and develop parameter estimated hidden Markov models (HMM) to calculate the relative contribution of DNA methyltransferases (Dnmts) for de novo and maintenance DNA methylation. The comparative modelling included wild-type ESCs and mutant ESCs deficient for Dnmt1, Dnmt3a, Dnmt3b, or Dnmt3a/3b, respectively. The HMM analysis identifies a considerable de novo methylation activity for Dnmt1 at certain repetitive elements and single copy sequences. Dnmt3a and Dnmt3b contribute de novo function. However, both enzymes are also essential to maintain symmetrical CpG methylation at distinct repetitive and single copy sequences in ESCs. PMID:22761581
Ferles, Christos; Beaufort, William-Scott; Ferle, Vanessa
2017-01-01
The present study devises mapping methodologies and projection techniques that visualize and demonstrate biological sequence data clustering results. The Sequence Data Density Display (SDDD) and Sequence Likelihood Projection (SLP) visualizations represent the input symbolical sequences in a lower-dimensional space in such a way that the clusters and relations of data elements are depicted graphically. Both operate in combination/synergy with the Self-Organizing Hidden Markov Model Map (SOHMMM). The resulting unified framework is in position to analyze automatically and directly raw sequence data. This analysis is carried out with little, or even complete absence of, prior information/domain knowledge.
Quantitative DNA fiber mapping
Gray, Joe W.; Weier, Heinz-Ulrich G.
1998-01-01
The present invention relates generally to the DNA mapping and sequencing technologies. In particular, the present invention provides enhanced methods and compositions for the physical mapping and positional cloning of genomic DNA. The present invention also provides a useful analytical technique to directly map cloned DNA sequences onto individual stretched DNA molecules.
NASA Astrophysics Data System (ADS)
de Andrade, Jailson B.; Tanner, Roger L.
A method is described for the specific collection of formaldehyde as hydroxymethanesulfonate on bisulfate-coated cellulose filters. Following extraction in aqueous acid and removal on unreacted bisulfite, the hydroxymethanesulfonate is decomposed by base, and HCHO is determined by DNPH (2,4-dinitrophenylhydrazine) derivatization and HPLC. Since the collection efficiency for formaldehyde is moderately high even when sampling ambient air at high-volume flow rates, a limit of detection of 0.2 ppbv is achieved with 30 min sampling times. Interference from acetaldehyde co-collected as 1-hydroxyethanesulfonate is <5% using this procedure. The technique shows promise for both short-term airborne sampling, and as a means of collecting mg-sized samples of HCHO on an inorganic matrix for carbon isotopic analyses.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gibbons, W.R.; Westby, C.A.
1987-01-01
The authors designed and tested a new process for converting fodder beets to ethanol: continuous diffusion-fermentation. This process utilizes the simultaneous diffusion-fermentation concept of the EX-FERM design; however, it overcomes the material handling problems inherent in that system by utilizing a counterflow tubular auger system. This process also eliminates the need for roller mills or presses and dryers which are required for alcohol recovery from solid phase fermentation. The latter is the only other currently feasible procedure for producing distillably worthwhile amounts of ethanol from fodder beets, sweet sorghum, and other similar feedstocks. Results on the use of sodium metamore » bisulfite (SMB) for contamination control with fermenting fodder beet cubes are reported.« less
The anti-CMS technique for genome-wide mapping of 5-hydroxymethylcytosine.
Huang, Yun; Pastor, William A; Zepeda-Martínez, Jorge A; Rao, Anjana
2012-10-01
5-Hydroxymethylcytosine (5hmC) is a recently discovered base in the mammalian genome, produced upon oxidation of 5-methylcytosine (5mC) in a process catalyzed by TET proteins. The biological functions of 5hmC and further oxidation products of 5mC are under intense investigation, as they are likely intermediates in DNA demethylation pathways. Here we describe a novel protocol to profile 5hmC at a genome-wide scale. This approach is based on sodium bisulfite-mediated conversion of 5hmC to cytosine-5-methylenesulfonate (CMS); CMS-containing DNA fragments are then immunoprecipitated using a CMS-specific antiserum. The anti-CMS technique is highly specific with a low background, and is much less dependent on 5hmC density than anti-5hmC immunoprecipitation (IP). Moreover, it does not enrich for CA and CT repeats, as noted for 5hmC DNA IP using antibodies to 5hmC. The anti-CMS protocol takes 3 d to complete.
2012-01-01
Background The genome of Mycobacterium avium subspecies paratuberculosis (MAP) is remarkably homogeneous among the genomes of bovine, human and wildlife isolates. However, previous work in our laboratories with the bovine K-10 strain has revealed substantial differences compared to sheep isolates. To systematically characterize all genomic differences that may be associated with the specific hosts, we sequenced the genomes of three U.S. sheep isolates and also obtained an optical map. Results Our analysis of one of the isolates, MAP S397, revealed a genome 4.8 Mb in size with 4,700 open reading frames (ORFs). Comparative analysis of the MAP S397 isolate showed it acquired approximately 10 large sequence regions that are shared with the human M. avium subsp. hominissuis strain 104 and lost 2 large regions that are present in the bovine strain. In addition, optical mapping defined the presence of 7 large inversions between the bovine and ovine genomes (~ 2.36 Mb). Whole-genome sequencing of 2 additional sheep strains of MAP (JTC1074 and JTC7565) further confirmed genomic homogeneity of the sheep isolates despite the presence of polymorphisms on the nucleotide level. Conclusions Comparative sequence analysis employed here provided a better understanding of the host association, evolution of members of the M. avium complex and could help in deciphering the phenotypic differences observed among sheep and cattle strains of MAP. A similar approach based on whole-genome sequencing combined with optical mapping could be employed to examine closely related pathogens. We propose an evolutionary scenario for M. avium complex strains based on these genome sequences. PMID:22409516
BAC sequencing using pooled methods.
Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina
2015-01-01
Shotgun sequencing and assembly of a large, complex genome can be both expensive and challenging to accurately reconstruct the true genome sequence. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are main factors that plague de novo genome sequencing projects that typically result in highly fragmented assemblies and are difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.
USDA-ARS?s Scientific Manuscript database
The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high-resolution genome maps saturated with ordered markers to assist in anchoring and orienting BAC contigs/ sequence scaffolds for whole genome sequence assembly. Radiation hybrid (RH) mapping has proven to be an e...
High-density genetic map construction and comparative genome analysis in asparagus bean.
Huang, Haitao; Tan, Huaqiang; Xu, Dongmei; Tang, Yi; Niu, Yisong; Lai, Yunsong; Tie, Manman; Li, Huanxiu
2018-03-19
Genetic maps are a prerequisite for quantitative trait locus (QTL) analysis, marker-assisted selection (MAS), fine gene mapping, and assembly of genome sequences. So far, several asparagus bean linkage maps have been established using various kinds of molecular markers. However, these maps were all constructed by gel- or array-based markers. No maps based on sequencing method have been reported. In this study, an NGS-based strategy, SLAF-seq, was applied to create a high-density genetic map for asparagus bean. Through SLAF library construction and Illumina sequencing of two parents and 100 F2 individuals, a total of 55,437 polymorphic SLAF markers were developed and mined for SNP markers. The map consisted of 5,225 SNP markers in 11 LGs, spanning a total distance of 1,850.81 cM, with an average distance between markers of 0.35 cM. Comparative genome analysis with four other legume species, soybean, common bean, mung bean and adzuki bean showed that asparagus bean is genetically more related to adzuki bean. The results will provide a foundation for future genomic research, such as QTL fine mapping, comparative mapping in pulses, and offer support for assembling asparagus bean genome sequence.
The Release 6 reference sequence of the Drosophila melanogaster genome
Hoskins, Roger A.; Carlson, Joseph W.; Wan, Kenneth H.; ...
2015-01-14
Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy andmore » middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. In conclusion, further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.« less
The Release 6 reference sequence of the Drosophila melanogaster genome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hoskins, Roger A.; Carlson, Joseph W.; Wan, Kenneth H.
Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy andmore » middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. In conclusion, further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.« less
Chiu, Kuo Ping; Wong, Chee-Hong; Chen, Qiongyu; Ariyaratne, Pramila; Ooi, Hong Sain; Wei, Chia-Lin; Sung, Wing-Kin Ken; Ruan, Yijun
2006-08-25
We recently developed the Paired End diTag (PET) strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the Project Manager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management.
A SSR-based genetic linkage map of cultivated peanut (Arachis hypogaea L.)
USDA-ARS?s Scientific Manuscript database
The objective of this study was to construct a molecular linkage map of cultivated tetraploid peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Three recombinant inbre...
Sargent, D J; Rys, A; Nier, S; Simpson, D W; Tobutt, K R
2007-01-01
We have developed 46 primer pairs from exon sequences flanking polymorphic introns of 23 Fragaria gene sequences and one Malus sequence deposited in the EMBL database. Sequencing of a set of the PCR products amplified with the novel primer pairs in diploid Fragaria showed the products to be homologous to the sequences from which the primers were originally designed. By scoring the segregation of the 24 genes in two diploid Fragaria progenies FV x FN (F. vesca x F. nubicola F(2)) and 815 x 903BC (F. vesca x F. viridis BC(1)) 29 genetic loci at discrete positions on the seven linkage groups previously characterised could be mapped, bringing to 35 the total number of known function genes mapped in Fragaria. Twenty primer pairs, representing 14 genes, amplified a product of the expected size in both Malus and Prunus. To demonstrate the applicability of these gene-specific loci to comparative mapping in Rosaceae, five markers that displayed clear polymorphism between the parents of a Malus and a Prunus mapping population were selected. The markers were then scored and mapped in at least one of the two additional progenies.
Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming
2015-01-01
Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming
2015-01-01
Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
Everts-van der Wind, Annelie; Kata, Srinivas R.; Band, Mark R.; Rebeiz, Mark; Larkin, Denis M.; Everts, Robin E.; Green, Cheryl A.; Liu, Lei; Natarajan, Shreedhar; Goldammer, Tom; Lee, Jun Heon; McKay, Stephanie; Womack, James E.; Lewin, Harris A.
2004-01-01
A second-generation 5000 rad radiation hybrid (RH) map of the cattle genome was constructed primarily using cattle ESTs that were targeted to gaps in the existing cattle–human comparative map, as well as to sparsely populated map intervals. A total of 870 targeted markers were added, bringing the number of markers mapped on the RH5000 panel to 1913. Of these, 1463 have significant BLASTN hits (E < e–5) against the human genome sequence. A cattle–human comparative map was created using human genome sequence coordinates of the paired orthologs. One-hundred and ninety-five conserved segments (defined by two or more genes) were identified between the cattle and human genomes, of which 31 are newly discovered and 34 were extended singletons on the first-generation map. The new map represents an improvement of 20% genome-wide comparative coverage compared with the first-generation map. Analysis of gene content within human genome regions where there are gaps in the comparative map revealed gaps with both significantly greater and significantly lower gene content. The new, more detailed cattle–human comparative map provides an improved resource for the analysis of mammalian chromosome evolution, the identification of candidate genes for economically important traits, and for proper alignment of sequence contigs on cattle chromosomes. PMID:15231756
Transcription Factor Map Alignment of Promoter Regions
Blanco, Enrique; Messeguer, Xavier; Smith, Temple F; Guigó, Roderic
2006-01-01
We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments. PMID:16733547
Dissecting enzyme function with microfluidic-based deep mutational scanning.
Romero, Philip A; Tran, Tuan M; Abate, Adam R
2015-06-09
Natural enzymes are incredibly proficient catalysts, but engineering them to have new or improved functions is challenging due to the complexity of how an enzyme's sequence relates to its biochemical properties. Here, we present an ultrahigh-throughput method for mapping enzyme sequence-function relationships that combines droplet microfluidic screening with next-generation DNA sequencing. We apply our method to map the activity of millions of glycosidase sequence variants. Microfluidic-based deep mutational scanning provides a comprehensive and unbiased view of the enzyme function landscape. The mapping displays expected patterns of mutational tolerance and a strong correspondence to sequence variation within the enzyme family, but also reveals previously unreported sites that are crucial for glycosidase function. We modified the screening protocol to include a high-temperature incubation step, and the resulting thermotolerance landscape allowed the discovery of mutations that enhance enzyme thermostability. Droplet microfluidics provides a general platform for enzyme screening that, when combined with DNA-sequencing technologies, enables high-throughput mapping of enzyme sequence space.
Mapping Ribonucleotides Incorporated into DNA by Hydrolytic End-Sequencing.
Orebaugh, Clinton D; Lujan, Scott A; Burkholder, Adam B; Clausen, Anders R; Kunkel, Thomas A
2018-01-01
Ribonucleotides embedded within DNA render the DNA sensitive to the formation of single-stranded breaks under alkali conditions. Here, we describe a next-generation sequencing method called hydrolytic end sequencing (HydEn-seq) to map ribonucleotides inserted into the genome of Saccharomyce cerevisiae strains deficient in ribonucleotide excision repair. We use this method to map several genomic features in wild-type and replicase variant yeast strains.
Han, Yuepeng; Chagné, David; Gasic, Ksenija; Rikkerink, Erik H A; Beever, Jonathan E; Gardiner, Susan E; Korban, Schuyler S
2009-03-01
A genome-wide BAC physical map of the apple, Malus x domestica Borkh., has been recently developed. Here, we report on integrating the physical and genetic maps of the apple using a SNP-based approach in conjunction with bin mapping. Briefly, BAC clones located at ends of BAC contigs were selected, and sequenced at both ends. The BAC end sequences (BESs) were used to identify candidate SNPs. Subsequently, these candidate SNPs were genetically mapped using a bin mapping strategy for the purpose of mapping the physical onto the genetic map. Using this approach, 52 (23%) out of 228 BESs tested were successfully exploited to develop SNPs. These SNPs anchored 51 contigs, spanning approximately 37 Mb in cumulative physical length, onto 14 linkage groups. The reliability of the integration of the physical and genetic maps using this SNP-based strategy is described, and the results confirm the feasibility of this approach to construct an integrated physical and genetic maps for apple.
Anatomy of a hash-based long read sequence mapping algorithm for next generation DNA sequencing.
Misra, Sanchit; Agrawal, Ankit; Liao, Wei-keng; Choudhary, Alok
2011-01-15
Recently, a number of programs have been proposed for mapping short reads to a reference genome. Many of them are heavily optimized for short-read mapping and hence are very efficient for shorter queries, but that makes them inefficient or not applicable for reads longer than 200 bp. However, many sequencers are already generating longer reads and more are expected to follow. For long read sequence mapping, there are limited options; BLAT, SSAHA2, FANGS and BWA-SW are among the popular ones. However, resequencing and personalized medicine need much faster software to map these long sequencing reads to a reference genome to identify SNPs or rare transcripts. We present AGILE (AliGnIng Long rEads), a hash table based high-throughput sequence mapping algorithm for longer 454 reads that uses diagonal multiple seed-match criteria, customized q-gram filtering and a dynamic incremental search approach among other heuristics to optimize every step of the mapping process. In our experiments, we observe that AGILE is more accurate than BLAT, and comparable to BWA-SW and SSAHA2. For practical error rates (< 5%) and read lengths (200-1000 bp), AGILE is significantly faster than BLAT, SSAHA2 and BWA-SW. Even for the other cases, AGILE is comparable to BWA-SW and several times faster than BLAT and SSAHA2. http://www.ece.northwestern.edu/~smi539/agile.html.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Yuting, E-mail: wuyuting1302@sina.com; Bu, Fan
Liver fibrosis, resulting from chronic and persistent injury to the liver, is a worldwide health problem. Advanced liver fibrosis results in cirrhosis, liver failure and even hepatocellular cancer (HCC), often eventually requiring liver transplantation, poses a huge health burden on the global community. However, the specific pathogenesis of liver fibrosis remains not fully understood. Numerous basic and clinical studies have provided evidence that epigenetic modifications, especially DNA methylation, might contribute to the activation of hepatic stellate cells (HSCs), the pivotal cell type responsible for the fibrous scar in liver. Here, reduced representation bisulfite sequencing (RRBS) and bisulfite pyrosequencing PCR (BSP)more » analysis identified hypermethylation status of Septin9 (Sept9) gene in liver fibrogenesis. Sept9 protein was dramatically decreased in livers of CCl4-treated mice and immortalized HSC-T6 cells exposed to TGF-β1. Nevertheless, the suppression of Sept9 could be blocked by DNMT3a-siRNA and DNA methyltransferase inhibitor, 5-aza-2′-deoxycytidine (5-azadC). Overexpressed Sept9 attenuated TGF-β1-induced expression of myofibroblast markers α-SMA and Col1a1, accompanied by up-regulation of cell apoptosis-related proteins. Conversely, RNAi-mediated silencing of Sept9 enhanced accumulation of extracellular matrix. These observations suggested that Sept9 contributed to alleviate liver fibrosis might partially through promoting activated HSCs apoptosis and this anti-fibrogenesis effect might be blocked by DNMT-3a mediated methylation of Sept9. Therefore, pharmacological agents that inhibit Sept9 methylation and increase its expression could be considered as valuable treatments for liver fibrosis. - Highlights: • This is the first report of Sept9 methylation and function in liver fibrosis. • Ectopic expression of Sept9 could block the liver fibrogenesis. • DNMT3a might be responsible for the suppression of Sept9 in liver fibrosis.« less
Thorup, Casper; Schramm, Andreas; Findlay, Alyssa J; Finster, Kai W; Schreiber, Lars
2017-07-18
This study demonstrates that the deltaproteobacterium Desulfurivibrio alkaliphilus can grow chemolithotrophically by coupling sulfide oxidation to the dissimilatory reduction of nitrate and nitrite to ammonium. Key genes of known sulfide oxidation pathways are absent from the genome of D. alkaliphilus Instead, the genome contains all of the genes necessary for sulfate reduction, including a gene for a reductive-type dissimilatory bisulfite reductase (DSR). Despite this, growth by sulfate reduction was not observed. Transcriptomic analysis revealed a very high expression level of sulfate-reduction genes during growth by sulfide oxidation, while inhibition experiments with molybdate pointed to elemental sulfur/polysulfides as intermediates. Consequently, we propose that D. alkaliphilus initially oxidizes sulfide to elemental sulfur, which is then either disproportionated, or oxidized by a reversal of the sulfate reduction pathway. This is the first study providing evidence that a reductive-type DSR is involved in a sulfide oxidation pathway. Transcriptome sequencing further suggests that nitrate reduction to ammonium is performed by a novel type of periplasmic nitrate reductase and an unusual membrane-anchored nitrite reductase. IMPORTANCE Sulfide oxidation and sulfate reduction, the two major branches of the sulfur cycle, are usually ascribed to distinct sets of microbes with distinct diagnostic genes. Here we show a more complex picture, as D. alkaliphilus , with the genomic setup of a sulfate reducer, grows by sulfide oxidation. The high expression of genes typically involved in the sulfate reduction pathway suggests that these genes, including the reductive-type dissimilatory bisulfite reductases, are also involved in as-yet-unresolved sulfide oxidation pathways. Finally, D. alkaliphilus is closely related to cable bacteria, which grow by electrogenic sulfide oxidation. Since there are no pure cultures of cable bacteria, D. alkaliphilus may represent an exciting model organism in which to study the physiology of this process. Copyright © 2017 Thorup et al.
Wang, Kunning; Liang, Qiaoyi; Li, Xiaoxing; Tsoi, Ho; Zhang, Jingwan; Wang, Hua; Go, Minnie Y Y; Chiu, Philip W Y; Ng, Enders K W; Sung, Joseph J Y; Yu, Jun
2016-10-01
Using the promoter methylation assay, we have shown that MDGA2 (MAM domain containing glycosylphosphatidylinositol anchor 2) is preferentially methylated in gastric cancer. We analysed its biological effects and prognostic significance in gastric cancer. MDGA2 methylation status was evaluated by combined bisulfite restriction analysis and bisulfite genomic sequencing. The effects of MDGA2 re-expression or knockdown on cell proliferation, apoptosis and the cell cycle were determined. MDGA2 interacting protein was identified by mass spectrometry and MDGA2-related cancer pathways by reporter activity and PCR array analyses. The clinical impact of MDGA2 was assessed in 218 patients with gastric cancer. MDGA2 was commonly silenced in gastric cancer cells (10/11) and primary gastric cancers due to promoter hypermethylation. MDGA2 significantly inhibited cell proliferation by causing G1-S cell cycle arrest and inducing cell apoptosis in vitro, and suppressed xenograft tumour growth in both subcutaneous and orthotopic xenograft mouse models (both p<0.001). The anti-tumorigenic effect of MDGA2 was mediated through direct stabilising of DNA methyltransferase 1 associated protein 1 (DMAP1), which played a tumour suppressive role in gastric cancer. This interaction activated their downstream key elements of p53/p21 signalling cascades. Moreover, promoter methylation of MDGA2 was detected in 62.4% (136/218) of gastric cancers. Multivariate analysis showed that patients with MDGA2 hypermethylation had a significantly decreased survival (p=0.005). Kaplan-Meier survival curves showed that MDGA2 hypermethylation was significantly associated with shortened survival in patients with early gastric cancer. MDGA2 is a critical tumour suppressor in gastric carcinogenesis; its hypermethylation is an independent prognostic factor in patients with gastric cancer. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
USDA-ARS?s Scientific Manuscript database
Fine-mapping of causal variants is becoming feasible for complex traits in livestock GWAS, as an increasing number of animals are sequenced. Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on small reference populations of sequenced animals. ...
USDA-ARS?s Scientific Manuscript database
Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on reference populations of sequenced animals. With the implementation of the 1000 Bull Genomes Project and increasing numbers of animals sequenced, fine-mapping of causal variants is becoming f...
Ramu, P; Kassahun, B; Senthilvel, S; Ashok Kumar, C; Jayashree, B; Folkertsma, R T; Reddy, L Ananda; Kuruvinashetti, M S; Haussmann, B I G; Hash, C T
2009-11-01
The sequencing and detailed comparative functional analysis of genomes of a number of select botanical models open new doors into comparative genomics among the angiosperms, with potential benefits for improvement of many orphan crops that feed large populations. In this study, a set of simple sequence repeat (SSR) markers was developed by mining the expressed sequence tag (EST) database of sorghum. Among the SSR-containing sequences, only those sharing considerable homology with rice genomic sequences across the lengths of the 12 rice chromosomes were selected. Thus, 600 SSR-containing sorghum EST sequences (50 homologous sequences on each of the 12 rice chromosomes) were selected, with the intention of providing coverage for corresponding homologous regions of the sorghum genome. Primer pairs were designed and polymorphism detection ability was assessed using parental pairs of two existing sorghum mapping populations. About 28% of these new markers detected polymorphism in this 4-entry panel. A subset of 55 polymorphic EST-derived SSR markers were mapped onto the existing skeleton map of a recombinant inbred population derived from cross N13 x E 36-1, which is segregating for Striga resistance and the stay-green component of terminal drought tolerance. These new EST-derived SSR markers mapped across all 10 sorghum linkage groups, mostly to regions expected based on prior knowledge of rice-sorghum synteny. The ESTs from which these markers were derived were then mapped in silico onto the aligned sorghum genome sequence, and 88% of the best hits corresponded to linkage-based positions. This study demonstrates the utility of comparative genomic information in targeted development of markers to fill gaps in linkage maps of related crop species for which sufficient genomic tools are not available.
UroMark-a urinary biomarker assay for the detection of bladder cancer.
Feber, Andrew; Dhami, Pawan; Dong, Liqin; de Winter, Patricia; Tan, Wei Shen; Martínez-Fernández, Mónica; Paul, Dirk S; Hynes-Allen, Antony; Rezaee, Sheida; Gurung, Pratik; Rodney, Simon; Mehmood, Ahmed; Villacampa, Felipe; de la Rosa, Federico; Jameson, Charles; Cheng, Kar Keung; Zeegers, Maurice P; Bryan, Richard T; James, Nicholas D; Paramio, Jesus M; Freeman, Alex; Beck, Stephan; Kelly, John D
2017-01-01
Bladder cancer (BC) is one of the most common cancers in the western world and ranks as the most expensive to manage, due to the need for cystoscopic examination. BC shows frequent changes in DNA methylation, and several studies have shown the potential utility of urinary biomarkers by detecting epigenetic alterations in voided urine. The aim of this study is to develop a targeted bisulfite next-generation sequencing assay to diagnose BC from urine with high sensitivity and specificity. We defined a 150 CpG loci biomarker panel from a cohort of 86 muscle-invasive bladder cancers and 30 normal urothelium. Based on this panel, we developed the UroMark assay, a next-generation bisulphite sequencing assay and analysis pipeline for the detection of bladder cancer from urinary sediment DNA. The 150 loci UroMark assay was validated in an independent cohort ( n = 274, non-cancer ( n = 167) and bladder cancer ( n = 107)) voided urine samples with an AUC of 97%. The UroMark classifier sensitivity of 98%, specificity of 97% and NPV of 97% for the detection of primary BC was compared to non-BC urine. Epigenetic urinary biomarkers for detection of BC have the potential to revolutionise the management of this disease. In this proof of concept study, we show the development and utility of a novel high-throughput, next-generation sequencing-based biomarker for the detection of BC-specific epigenetic alterations in urine.
Aokic, Jun-ya; Kawase, Junya; Hamada, Kazuhisa; Fujimoto, Hiroshi; Yamamoto, Ikki; Usuki, Hironori
2018-01-01
Greater amberjack (Seriola dumerili) is distributed in tropical and temperate waters worldwide and is an important aquaculture fish. We carried out de novo sequencing of the greater amberjack genome to construct a reference genome sequence to identify single nucleotide polymorphisms (SNPs) for breeding amberjack by marker-assisted or gene-assisted selection as well as to identify functional genes for biological traits. We obtained 200 times coverage and constructed a high-quality genome assembly using next generation sequencing technology. The assembled sequences were aligned onto a yellowtail (Seriola quinqueradiata) radiation hybrid (RH) physical map by sequence homology. A total of 215 of the longest amberjack sequences, with a total length of 622.8 Mbp (92% of the total length of the genome scaffolds), were lined up on the yellowtail RH map. We resequenced the whole genomes of 20 greater amberjacks and mapped the resulting sequences onto the reference genome sequence. About 186,000 nonredundant SNPs were successfully ordered on the reference genome. Further, we found differences in the genome structural variations between two greater amberjack populations using BreakDancer. We also analyzed the greater amberjack transcriptome and mapped the annotated sequences onto the reference genome sequence. PMID:29785397
Li, Qiling; Li, Min; Ma, Li; Li, Wenzhi; Wu, Xuehong; Richards, Jendai; Fu, Guoxing; Xu, Wei; Bythwood, Tameka; Li, Xu; Wang, Jianxin; Song, Qing
2014-01-01
Background The use of DNA from archival formalin and paraffin embedded (FFPE) tissue for genetic and epigenetic analyses may be problematic, since the DNA is often degraded and only limited amounts may be available. Thus, it is currently not known whether genome-wide methylation can be reliably assessed in DNA from archival FFPE tissue. Methodology/Principal Findings Ovarian tissues, which were obtained and formalin-fixed and paraffin-embedded in either 1999 or 2011, were sectioned and stained with hematoxylin-eosin (H&E).Epithelial cells were captured by laser micro dissection, and their DNA subjected to whole genomic bisulfite conversion, whole genomic polymerase chain reaction (PCR) amplification, and purification. Sequencing and software analyses were performed to identify the extent of genomic methylation. We observed that 31.7% of sequence reads from the DNA in the 1999 archival FFPE tissue, and 70.6% of the reads from the 2011 sample, could be matched with the genome. Methylation rates of CpG on the Watson and Crick strands were 32.2% and 45.5%, respectively, in the 1999 sample, and 65.1% and 42.7% in the 2011 sample. Conclusions/Significance We have developed an efficient method that allows DNA methylation to be assessed in archival FFPE tissue samples. PMID:25133528
Ibragimova, Ilsiya; Maradeo, Marie E.; Dulaimi, Essel; Cairns, Paul
2013-01-01
Recent sequencing studies of clear cell (conventional) renal cell carcinoma (ccRCC) have identified inactivating point mutations in the chromatin-modifying genes PBRM1, KDM6A/UTX, KDM5C/JARID1C, SETD2, MLL2 and BAP1. To investigate whether aberrant hypermethylation is a mechanism of inactivation of these tumor suppressor genes in ccRCC, we sequenced the promoter region within a bona fide CpG island of PBRM1, KDM6A, SETD2 and BAP1 in bisulfite-modified DNA of a representative series of 50 primary ccRCC, 4 normal renal parenchyma specimens and 5 RCC cell lines. We also interrogated the promoter methylation status of KDM5C and ARID1A in the Cancer Genome Atlas (TCGA) ccRCC Infinium data set. PBRM1, KDM6A, SETD2 and BAP1 were unmethylated in all tumor and normal specimens. KDM5C and ARID1A were unmethylated in the TCGA 219 ccRCC and 119 adjacent normal specimens. Aberrant promoter hypermethylation of PBRM1, BAP1 and the other chromatin-modifying genes examined here is therefore absent or rare in ccRCC. PMID:23644518
Heritable alteration of DNA methylation induced by whole-chromosome aneuploidy in wheat.
Gao, Lihong; Diarso, Moussa; Zhang, Ai; Zhang, Huakun; Dong, Yuzhu; Liu, Lixia; Lv, Zhenling; Liu, Bao
2016-01-01
Aneuploidy causes changes in gene expression and phenotypes in all organisms studied. A previous study in the model plant Arabidopsis thaliana showed that aneuploidy-generated phenotypic changes can be inherited to euploid progenies and implicated an epigenetic underpinning of the heritable variations. Based on an analysis by amplified fragment length polymorphism and methylation-sensitive amplified fragment length polymorphism markers, we found that although genetic changes at the nucleotide sequence level were negligible, extensive changes in cytosine DNA methylation patterns occurred in all studied homeologous group 1 whole-chromosome aneuploid lines of common wheat (Triticum aestivum), with monosomic 1A showing the greatest amount of methylation changes. The changed methylation patterns were inherited by euploid progenies derived from the aneuploid parents. The aneuploidy-induced DNA methylation alterations and their heritability were verified at selected loci by bisulfite sequencing. Our data have provided empirical evidence supporting earlier suggestions that heritability of aneuploidy-generated, but aneuploidy-independent, phenotypic variations may have an epigenetic basis. That at least one type of aneuploidy - monosomic 1A - was able to cause significant epigenetic divergence of the aneuploid plants and their euploid progenies also lends support to recent suggestions that aneuploidy may have played an important and protracted role in polyploid genome evolution. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
NASA Astrophysics Data System (ADS)
Huang, Yajuan; Hu, Nan; Si, Yufeng; Li, Siping; Wu, Shuxian; Zhang, Meizhao; Wen, Haishen; Li, Jifang; Li, Yun; He, Feng
2018-06-01
Follistatin (Fst) is a hyperplasia factor that plays a crucial role in muscle development. DNA methylation, a significant process, regulates gene expression. The aim of our study is to examine the DNA methylation and expression patterns of Fst gene at five different development stages of Japanese flounder (stage A, 7 dph; stage B, 90 dph; stage C, about 180 dph; stage D, about 24 months; stage E, about 36 months). The muscle tissue of Japanese flounder was obtained at different development stages in this experiment. DNA methylation levels in the promoter and exon 2 of Fst were determined by bisulfite sequencing, and the relative expression of the Fst gene at the five stages was measured by quantitative PCR. The results showed that the lowest methylation level was at stage A and the highest methylation level was at stage B. Moreover, the highest expression level of the Fst gene was observed at stage A. The mRNA abundance was negatively correlated with DNA methylation level. Three CpG islands in the promoter region and three CpG islands in exon 2 of Fst were found in the binding sequence of the putative transcription factor. These results offered a theoretical basis for the mechanism of Fst gene regulation to muscle development at different development stages.
21 CFR 201.22 - Prescription drugs containing sulfites; required warning statements.
Code of Federal Regulations, 2012 CFR
2012-04-01
... are added to certain drug products to inhibit the oxidation of the active drug ingredient. Oxidation.... Examples of specific sulfites used to inhibit this oxidation process include sodium bisulfite, sodium...
21 CFR 201.22 - Prescription drugs containing sulfites; required warning statements.
Code of Federal Regulations, 2010 CFR
2010-04-01
... are added to certain drug products to inhibit the oxidation of the active drug ingredient. Oxidation.... Examples of specific sulfites used to inhibit this oxidation process include sodium bisulfite, sodium...
21 CFR 201.22 - Prescription drugs containing sulfites; required warning statements.
Code of Federal Regulations, 2013 CFR
2013-04-01
... are added to certain drug products to inhibit the oxidation of the active drug ingredient. Oxidation.... Examples of specific sulfites used to inhibit this oxidation process include sodium bisulfite, sodium...
21 CFR 201.22 - Prescription drugs containing sulfites; required warning statements.
Code of Federal Regulations, 2011 CFR
2011-04-01
... are added to certain drug products to inhibit the oxidation of the active drug ingredient. Oxidation.... Examples of specific sulfites used to inhibit this oxidation process include sodium bisulfite, sodium...
21 CFR 201.22 - Prescription drugs containing sulfites; required warning statements.
Code of Federal Regulations, 2014 CFR
2014-04-01
... are added to certain drug products to inhibit the oxidation of the active drug ingredient. Oxidation.... Examples of specific sulfites used to inhibit this oxidation process include sodium bisulfite, sodium...
Sun, Bo; Dong, Hongyu; He, Di; Rao, Dandan; Guan, Xiaohong
2016-02-02
Permanganate can be activated by bisulfite to generate soluble Mn(III) (noncomplexed with ligands other than H2O and OH(-)) which oxidizes organic contaminants at extraordinarily high rates. However, the generation of Mn(III) in the permanganate/bisulfite (PM/BS) process and the reactivity of Mn(III) toward emerging contaminants have never been quantified. In this work, Mn(III) generated in the PM/BS process was shown to absorb at 230-290 nm for the first time and disproportionated more easily at higher pH, and thus, the utilization rate of Mn(III) for decomposing organic contaminant was low under alkaline conditions. A Mn(III) generation and utilization model was developed to get the second-order reaction rate parameters of benzene oxidation by soluble Mn(III), and then, benzene was chosen as the reference probe to build a competition kinetics method, which was employed to obtain the second-order rate constants of organic contaminants oxidation by soluble Mn(III). The results revealed that the second-order rate constants of aniline and bisphenol A oxidation by soluble Mn(III) were in the range of 10(5)-10(6) M(-1) s(-1). With the presence of soluble Mn(III) at micromolar concentration, contaminants could be oxidized with the observed rates several orders of magnitude higher than those by common oxidation processes, implying the great potential application of the PM/BS process in water and wastewater treatment.
A collaborative exercise on DNA methylation based body fluid typing.
Jung, Sang-Eun; Cho, Sohee; Antunes, Joana; Gomes, Iva; Uchimoto, Mari L; Oh, Yu Na; Di Giacomo, Lisa; Schneider, Peter M; Park, Min Sun; van der Meer, Dieudonne; Williams, Graham; McCord, Bruce; Ahn, Hee-Jung; Choi, Dong Ho; Lee, Yang Han; Lee, Soong Deok; Lee, Hwan Young
2016-10-01
A collaborative exercise on DNA methylation based body fluid identification was conducted by seven laboratories. For this project, a multiplex methylation SNaPshot reaction composed of seven CpG markers was used for the identification of four body fluids, including blood, saliva, semen, and vaginal fluid. A total of 30 specimens were prepared and distributed to participating laboratories after thorough testing. The required experiments included four increasingly complex tasks: (1) CE of a purified single-base extension reaction product, (2) multiplex PCR and multiplex single-base extension reaction of bisulfite-modified DNA, (3) bisulfite conversion of genomic DNA, and (4) extraction of genomic DNA from body fluid samples. In tasks 2, 3 and 4, one or more mixtures were analyzed, and specimens containing both known and unknown body fluid sources were used. Six of the laboratories generated consistent body fluid typing results for specimens of bisulfite-converted DNA and genomic DNA. One laboratory failed to set up appropriate conditions for capillary analysis of reference single-base extension products. In general, variation in the values obtained for DNA methylation analysis between laboratories increased with the complexity of the required experiments. However, all laboratories concurred on the interpretation of the DNA methylation profiles produced. Although the establishment of interpretational guidelines on DNA methylation based body fluid identification has yet to be performed, this study supports the addition of DNA methylation profiling to forensic body fluid typing. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Discovery and mapping of single feature polymorphisms in wheat using Affymetrix arrays
Bernardo, Amy N; Bradbury, Peter J; Ma, Hongxiang; Hu, Shengwa; Bowden, Robert L; Buckler, Edward S; Bai, Guihua
2009-01-01
Background Wheat (Triticum aestivum L.) is a staple food crop worldwide. The wheat genome has not yet been sequenced due to its huge genome size (~17,000 Mb) and high levels of repetitive sequences; the whole genome sequence may not be expected in the near future. Available linkage maps have low marker density due to limitation in available markers; therefore new technologies that detect genome-wide polymorphisms are still needed to discover a large number of new markers for construction of high-resolution maps. A high-resolution map is a critical tool for gene isolation, molecular breeding and genomic research. Single feature polymorphism (SFP) is a new microarray-based type of marker that is detected by hybridization of DNA or cRNA to oligonucleotide probes. This study was conducted to explore the feasibility of using the Affymetrix GeneChip to discover and map SFPs in the large hexaploid wheat genome. Results Six wheat varieties of diverse origins (Ning 7840, Clark, Jagger, Encruzilhada, Chinese Spring, and Opata 85) were analyzed for significant probe by variety interactions and 396 probe sets with SFPs were identified. A subset of 164 unigenes was sequenced and 54% showed polymorphism within probes. Microarray analysis of 71 recombinant inbred lines from the cross Ning 7840/Clark identified 955 SFPs and 877 of them were mapped together with 269 simple sequence repeat markers. The SFPs were randomly distributed within a chromosome but were unevenly distributed among different genomes. The B genome had the most SFPs, and the D genome had the least. Map positions of a selected set of SFPs were validated by mapping single nucleotide polymorphism using SNaPshot and comparing with expressed sequence tags mapping data. Conclusion The Affymetrix array is a cost-effective platform for SFP discovery and SFP mapping in wheat. The new high-density map constructed in this study will be a useful tool for genetic and genomic research in wheat. PMID:19480702
Guo, Yinshan; Shi, Guangli; Liu, Zhendong; Zhao, Yuhui; Yang, Xiaoxu; Zhu, Junchi; Li, Kun; Guo, Xiuwu
2015-01-01
In this study, 149 F1 plants from the interspecific cross between 'Red Globe' (Vitis vinifera L.) and 'Shuangyou' (Vitis amurensis Rupr.) and the parent were used to construct a molecular genetic linkage map by using the specific length amplified fragment sequencing technique. DNA sequencing generated 41.282 Gb data consisting of 206,411,693 paired-end reads. The average sequencing depths were 68.35 for 'Red Globe,' 63.65 for 'Shuangyou,' and 8.01 for each progeny. In all, 115,629 high-quality specific length amplified fragments were detected, of which 42,279 were polymorphic. The genetic map was constructed using 7,199 of these polymorphic markers. These polymorphic markers were assigned to 19 linkage groups; the total length of the map was 1929.13 cm, with an average distance of 0.28 cm between each maker. To our knowledge, the genetic maps constructed in this study contain the largest number of molecular markers. These high-density genetic maps might form the basis for the fine quantitative trait loci mapping and molecular-assisted breeding of grape.
Appliation of rad-sequencing to linkage mapping in citrus
USDA-ARS?s Scientific Manuscript database
High density linkage maps can be developed for modest cost using high-throughput DNA sequencing to genotype a defined fraction (representation) of the genome. We developed linkage maps in two citrus populations using the RAD (Restriction site Associated DNA) genotyping method which involves restrict...
Mok, Calvin A; Au, Vinci; Thompson, Owen A; Edgley, Mark L; Gevirtzman, Louis; Yochem, John; Lowry, Joshua; Memar, Nadin; Wallenfang, Matthew R; Rasoloson, Dominique; Bowerman, Bruce; Schnabel, Ralf; Seydoux, Geraldine; Moerman, Donald G; Waterston, Robert H
2017-10-01
Mutants remain a powerful means for dissecting gene function in model organisms such as Caenorhabditis elegans Massively parallel sequencing has simplified the detection of variants after mutagenesis but determining precisely which change is responsible for phenotypic perturbation remains a key step. Genetic mapping paradigms in C . elegans rely on bulk segregant populations produced by crosses with the problematic Hawaiian wild isolate and an excess of redundant information from whole-genome sequencing (WGS). To increase the repertoire of available mutants and to simplify identification of the causal change, we performed WGS on 173 temperature-sensitive (TS) lethal mutants and devised a novel mapping method. The mapping method uses molecular inversion probes (MIP-MAP) in a targeted sequencing approach to genetic mapping, and replaces the Hawaiian strain with a Million Mutation Project strain with high genomic and phenotypic similarity to the laboratory wild-type strain N2 We validated MIP-MAP on a subset of the TS mutants using a competitive selection approach to produce TS candidate mapping intervals with a mean size < 3 Mb. MIP-MAP successfully uses a non-Hawaiian mapping strain and multiplexed libraries are sequenced at a fraction of the cost of WGS mapping approaches. Our mapping results suggest that the collection of TS mutants contains a diverse library of TS alleles for genes essential to development and reproduction. MIP-MAP is a robust method to genetically map mutations in both viable and essential genes and should be adaptable to other organisms. It may also simplify tracking of individual genotypes within population mixtures. Copyright © 2017 by the Genetics Society of America.
Mok, Calvin A.; Au, Vinci; Thompson, Owen A.; Edgley, Mark L.; Gevirtzman, Louis; Yochem, John; Lowry, Joshua; Memar, Nadin; Wallenfang, Matthew R.; Rasoloson, Dominique; Bowerman, Bruce; Schnabel, Ralf; Seydoux, Geraldine; Moerman, Donald G.; Waterston, Robert H.
2017-01-01
Mutants remain a powerful means for dissecting gene function in model organisms such as Caenorhabditis elegans. Massively parallel sequencing has simplified the detection of variants after mutagenesis but determining precisely which change is responsible for phenotypic perturbation remains a key step. Genetic mapping paradigms in C. elegans rely on bulk segregant populations produced by crosses with the problematic Hawaiian wild isolate and an excess of redundant information from whole-genome sequencing (WGS). To increase the repertoire of available mutants and to simplify identification of the causal change, we performed WGS on 173 temperature-sensitive (TS) lethal mutants and devised a novel mapping method. The mapping method uses molecular inversion probes (MIP-MAP) in a targeted sequencing approach to genetic mapping, and replaces the Hawaiian strain with a Million Mutation Project strain with high genomic and phenotypic similarity to the laboratory wild-type strain N2. We validated MIP-MAP on a subset of the TS mutants using a competitive selection approach to produce TS candidate mapping intervals with a mean size < 3 Mb. MIP-MAP successfully uses a non-Hawaiian mapping strain and multiplexed libraries are sequenced at a fraction of the cost of WGS mapping approaches. Our mapping results suggest that the collection of TS mutants contains a diverse library of TS alleles for genes essential to development and reproduction. MIP-MAP is a robust method to genetically map mutations in both viable and essential genes and should be adaptable to other organisms. It may also simplify tracking of individual genotypes within population mixtures. PMID:28827289
Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi
2013-11-20
With the remarkable increase of genomic sequence data of microorganisms, novel tools are needed for comprehensive analyses of the big sequence data available. The self-organizing map (SOM) is an effective tool for clustering and visualizing high-dimensional data, such as oligonucleotide composition on one map. By modifying the conventional SOM, we developed batch-learning SOM (BLSOM), which allowed classification of sequence fragments (e.g., 1 kb) according to phylotypes, solely depending on oligonucleotide composition. Metagenomics studies of uncultivable microorganisms in clinical and environmental samples should allow extensive surveys of genes important in life sciences. BLSOM is most suitable for phylogenetic assignment of metagenomic sequences, because fragmental sequences can be clustered according to phylotypes, solely depending on oligonucleotide composition. We first constructed oligonucleotide BLSOMs for all available sequences from genomes of known species, and by mapping metagenomic sequences on these large-scale BLSOMs, we can predict phylotypes of individual metagenomic sequences, revealing a microbial community structure of uncultured microorganisms, including viruses. BLSOM has shown that influenza viruses isolated from humans and birds clearly differ in oligonucleotide composition. Based on this host-dependent oligonucleotide composition, we have proposed strategies for predicting directional changes of virus sequences and for surveilling potentially hazardous strains when introduced into humans from non-human sources.
Secco, David; Wang, Chuang; Shou, Huixia; Schultz, Matthew D; Chiarenza, Serge; Nussaume, Laurent; Ecker, Joseph R; Whelan, James; Lister, Ryan
2015-07-21
Cytosine DNA methylation (mC) is a genome modification that can regulate the expression of coding and non-coding genetic elements. However, little is known about the involvement of mC in response to environmental cues. Using whole genome bisulfite sequencing to assess the spatio-temporal dynamics of mC in rice grown under phosphate starvation and recovery conditions, we identified widespread phosphate starvation-induced changes in mC, preferentially localized in transposable elements (TEs) close to highly induced genes. These changes in mC occurred after changes in nearby gene transcription, were mostly DCL3a-independent, and could partially be propagated through mitosis, however no evidence of meiotic transmission was observed. Similar analyses performed in Arabidopsis revealed a very limited effect of phosphate starvation on mC, suggesting a species-specific mechanism. Overall, this suggests that TEs in proximity to environmentally induced genes are silenced via hypermethylation, and establishes the temporal hierarchy of transcriptional and epigenomic changes in response to stress.
Targeted and genome-scale methylomics reveals gene body signatures in human cell lines
Ball, Madeleine Price; Li, Jin Billy; Gao, Yuan; Lee, Je-Hyuk; LeProust, Emily; Park, In-Hyun; Xie, Bin; Daley, George Q.; Church, George M.
2012-01-01
Cytosine methylation, an epigenetic modification of DNA, is a target of growing interest for developing high throughput profiling technologies. Here we introduce two new, complementary techniques for cytosine methylation profiling utilizing next generation sequencing technology: bisulfite padlock probes (BSPPs) and methyl sensitive cut counting (MSCC). In the first method, we designed a set of ~10,000 BSPPs distributed over the ENCODE pilot project regions to take advantage of existing expression and chromatin immunoprecipitation data. We observed a pattern of low promoter methylation coupled with high gene body methylation in highly expressed genes. Using the second method, MSCC, we gathered genome-scale data for 1.4 million HpaII sites and confirmed that gene body methylation in highly expressed genes is a consistent phenomenon over the entire genome. Our observations highlight the usefulness of techniques which are not inherently or intentionally biased in favor of only profiling particular subsets like CpG islands or promoter regions. PMID:19329998
Zhang, Aihua; Li, Huiyao; Xiao, Yun; Chen, Liping; Zhu, Xiaonian; Li, Jun; Ma, Lu; Pan, Xueli; Chen, Wen; He, Zhini
2017-07-01
To define whether aberrant methylation of DNA repair genes is associated with chronic arsenic poisoning. Hundred and two endemic arsenicosis patients and 36 healthy subjects were recruited. Methylight and bisulfite sequencing (BSP) assays were used to examine the methylation status of ERCC1, ERCC2 and XPC genes in peripheral blood lymphocytes (PBLs) and skin lesions of arsenicosis patients and NaAsO 2 -treated HaCaT cells. Hypermethylation of ERCC1 and ERCC2 and suppressed gene expression were found in PBLs and skin lesions of arsenicosis patients and was correlated with the level of arsenic exposure. Particularly, the expression of ERCC1 and ERCC2 was associated with the severity of skin lesions. In vitro studies revealed an induction of ERCC2 hypermethylation and decreased mRNA expression in response to NaAsO 2 treatment. Hypermethylation of ERCC1 and ERCC2 and concomitant suppression of gene expression might be served as the epigenetic marks associated with arsenic exposure and adverse health effects.
Distinct Trends of DNA Methylation Patterning in the Innate and Adaptive Immune Systems
Schuyler, Ronald P.; Merkel, Angelika; Raineri, Emanuele; Altucci, Lucia; Vellenga, Edo; Martens, Joost H.A.; Pourfarzad, Farzin; Kuijpers, Taco W.; Burden, Frances; Farrow, Samantha; Downes, Kate; Ouwehand, Willem H.; Clarke, Laura; Datta, Avik; Lowy, Ernesto; Flicek, Paul; Frontini, Mattia; Stunnenberg, Hendrik G.; Martín-Subero, José I.; Gut, Ivo; Heath, Simon
2018-01-01
Summary DNA methylation and the localization and post-translational modification of nucleosomes are interdependent factors that contribute to the generation of distinct phenotypes from genetically identical cells. With 112 whole-genome bisulfite sequencing datasets from the BLUEPRINT Epigenome Project, we analyzed the global development of DNA methylation patterns during lineage commitment and maturation of a range of immune system effector cells and the cancers that arise from them. We show clear trends in methylation patterns that are distinct in the innate and adaptive arms of the human immune system, both globally and in relation to consistently positioned nucleosomes. Most notable are a progressive loss of methylation in developing lymphocytes and the consistent occurrence of non-CG methylation in specific cell types. Cancer samples from the two lineages are further polarized, suggesting the involvement of distinct lineage-specific epigenetic mechanisms. We anticipate broad utility for this resource as a basis for further comparative epigenetic analyses. PMID:27851971
Distinct Trends of DNA Methylation Patterning in the Innate and Adaptive Immune Systems.
Schuyler, Ronald P; Merkel, Angelika; Raineri, Emanuele; Altucci, Lucia; Vellenga, Edo; Martens, Joost H A; Pourfarzad, Farzin; Kuijpers, Taco W; Burden, Frances; Farrow, Samantha; Downes, Kate; Ouwehand, Willem H; Clarke, Laura; Datta, Avik; Lowy, Ernesto; Flicek, Paul; Frontini, Mattia; Stunnenberg, Hendrik G; Martín-Subero, José I; Gut, Ivo; Heath, Simon
2016-11-15
DNA methylation and the localization and post-translational modification of nucleosomes are interdependent factors that contribute to the generation of distinct phenotypes from genetically identical cells. With 112 whole-genome bisulfite sequencing datasets from the BLUEPRINT Epigenome Project, we analyzed the global development of DNA methylation patterns during lineage commitment and maturation of a range of immune system effector cells and the cancers that arise from them. We show clear trends in methylation patterns that are distinct in the innate and adaptive arms of the human immune system, both globally and in relation to consistently positioned nucleosomes. Most notable are a progressive loss of methylation in developing lymphocytes and the consistent occurrence of non-CG methylation in specific cell types. Cancer samples from the two lineages are further polarized, suggesting the involvement of distinct lineage-specific epigenetic mechanisms. We anticipate broad utility for this resource as a basis for further comparative epigenetic analyses. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Lactase non-persistence is directed by DNA variation-dependent epigenetic aging
Labrie, Viviane; Buske, Orion J; Oh, Edward; Jeremian, Richie; Ptak, Carolyn; Gasiūnas, Giedrius; Maleckas, Almantas; Petereit, Rūta; Žvirbliene, Aida; Adamonis, Kęstutis; Kriukienė, Edita; Koncevičius, Karolis; Gordevičius, Juozas; Nair, Akhil; Zhang, Aiping; Ebrahimi, Sasha; Oh, Gabriel; Šikšnys, Virginijus; Kupčinskas, Limas; Brudno, Michael; Petronis, Arturas
2016-01-01
Inability to digest lactose due to lactase non-persistence is a common trait in adult mammals, with the exception of certain human populations that exhibit lactase persistence. It is not clear how the lactase gene can be dramatically downregulated with age in most individuals, but remains active in some. We performed a comprehensive epigenetic study of the human and mouse intestine using chromosome-wide DNA modification profiling and targeted bisulfite sequencing. Epigenetically-controlled regulatory elements were found to account for the differences in lactase mRNA levels between individuals, intestinal cell types and species. The importance of these regulatory elements in modulating lactase mRNA levels was confirmed by CRISPR-Cas9-induced deletions. Genetic factors contribute to epigenetic changes occurring with age at the regulatory elements, as lactase persistence- and non-persistence-DNA haplotypes demonstrated markedly different epigenetic aging. Thus, genetic factors facilitate a gradual accumulation of epigenetic changes with age to affect phenotypic outcome. PMID:27159559
Whole-genome fingerprint of the DNA methylome during human B cell differentiation.
Kulis, Marta; Merkel, Angelika; Heath, Simon; Queirós, Ana C; Schuyler, Ronald P; Castellano, Giancarlo; Beekman, Renée; Raineri, Emanuele; Esteve, Anna; Clot, Guillem; Verdaguer-Dot, Néria; Duran-Ferrer, Martí; Russiñol, Nuria; Vilarrasa-Blasi, Roser; Ecker, Simone; Pancaldi, Vera; Rico, Daniel; Agueda, Lidia; Blanc, Julie; Richardson, David; Clarke, Laura; Datta, Avik; Pascual, Marien; Agirre, Xabier; Prosper, Felipe; Alignani, Diego; Paiva, Bruno; Caron, Gersende; Fest, Thierry; Muench, Marcus O; Fomin, Marina E; Lee, Seung-Tae; Wiemels, Joseph L; Valencia, Alfonso; Gut, Marta; Flicek, Paul; Stunnenberg, Hendrik G; Siebert, Reiner; Küppers, Ralf; Gut, Ivo G; Campo, Elías; Martín-Subero, José I
2015-07-01
We analyzed the DNA methylome of ten subpopulations spanning the entire B cell differentiation program by whole-genome bisulfite sequencing and high-density microarrays. We observed that non-CpG methylation disappeared upon B cell commitment, whereas CpG methylation changed extensively during B cell maturation, showing an accumulative pattern and affecting around 30% of all measured CpG sites. Early differentiation stages mainly displayed enhancer demethylation, which was associated with upregulation of key B cell transcription factors and affected multiple genes involved in B cell biology. Late differentiation stages, in contrast, showed extensive demethylation of heterochromatin and methylation gain at Polycomb-repressed areas, and genes with apparent functional impact in B cells were not affected. This signature, which has previously been linked to aging and cancer, was particularly widespread in mature cells with an extended lifespan. Comparing B cell neoplasms with their normal counterparts, we determined that they frequently acquire methylation changes in regions already undergoing dynamic methylation during normal B cell differentiation.
De novo DNA methylation during monkey pre-implantation embryogenesis.
Gao, Fei; Niu, Yuyu; Sun, Yi Eve; Lu, Hanlin; Chen, Yongchang; Li, Siguang; Kang, Yu; Luo, Yuping; Si, Chenyang; Yu, Juehua; Li, Chang; Sun, Nianqin; Si, Wei; Wang, Hong; Ji, Weizhi; Tan, Tao
2017-04-01
Critical epigenetic regulation of primate embryogenesis entails DNA methylome changes. Here we report genome-wide composition, patterning, and stage-specific dynamics of DNA methylation in pre-implantation rhesus monkey embryos as well as male and female gametes studied using an optimized tagmentation-based whole-genome bisulfite sequencing method. We show that upon fertilization, both paternal and maternal genomes undergo active DNA demethylation, and genome-wide de novo DNA methylation is also initiated in the same period. By the 8-cell stage, remethylation becomes more pronounced than demethylation, resulting in an increase in global DNA methylation. Promoters of genes associated with oxidative phosphorylation are preferentially remethylated at the 8-cell stage, suggesting that this mode of energy metabolism may not be favored. Unlike in rodents, X chromosome inactivation is not observed during monkey pre-implantation development. Our study provides the first comprehensive illustration of the 'wax and wane' phases of DNA methylation dynamics. Most importantly, our DNA methyltransferase loss-of-function analysis indicates that DNA methylation influences early monkey embryogenesis.
De novo DNA methylation during monkey pre-implantation embryogenesis
Gao, Fei; Niu, Yuyu; Sun, Yi Eve; Lu, Hanlin; Chen, Yongchang; Li, Siguang; Kang, Yu; Luo, Yuping; Si, Chenyang; Yu, Juehua; Li, Chang; Sun, Nianqin; Si, Wei; Wang, Hong; Ji, Weizhi; Tan, Tao
2017-01-01
Critical epigenetic regulation of primate embryogenesis entails DNA methylome changes. Here we report genome-wide composition, patterning, and stage-specific dynamics of DNA methylation in pre-implantation rhesus monkey embryos as well as male and female gametes studied using an optimized tagmentation-based whole-genome bisulfite sequencing method. We show that upon fertilization, both paternal and maternal genomes undergo active DNA demethylation, and genome-wide de novo DNA methylation is also initiated in the same period. By the 8-cell stage, remethylation becomes more pronounced than demethylation, resulting in an increase in global DNA methylation. Promoters of genes associated with oxidative phosphorylation are preferentially remethylated at the 8-cell stage, suggesting that this mode of energy metabolism may not be favored. Unlike in rodents, X chromosome inactivation is not observed during monkey pre-implantation development. Our study provides the first comprehensive illustration of the 'wax and wane' phases of DNA methylation dynamics. Most importantly, our DNA methyltransferase loss-of-function analysis indicates that DNA methylation influences early monkey embryogenesis. PMID:28233770
Dormancy activation mechanism of oral cavity cancer stem cells.
Chen, Xiang; Li, Xin; Zhao, Baohong; Shang, Dehao; Zhong, Ming; Deng, Chunfu; Jia, Xinshan
2015-07-01
Radiotherapy and chemotherapy are targeted primarily at rapidly proliferating cancer cells and are unable to eliminate cancer stem cells in the G0 phase. Thus, these treatments cannot prevent the recurrence and metastasis of cancer. Understanding the mechanisms by which cancer stem cells are maintained in the dormant G0 phase, and how they become active is key to developing new cancer therapies. The current study found that the anti-cancer drug 5-fluorouracil, acting on the oral squamous cell carcinoma KB cell line, selectively killed proliferating cells while sparing cells in the G0 phase. Bisulfite sequencing PCR showed that demethylation of the Sox2 promoter led to the expression of Sox2. This then resulted in the transformation of cancer stem cells from the G0 phase to the division stage and suggested that the transformation of cancer stem cells from the G0 phase to the division stage is closely related to an epigenetic modification of the cell.
Universal sequence map (USM) of arbitrary discrete sequences
2002-01-01
Background For over a decade the idea of representing biological sequences in a continuous coordinate space has maintained its appeal but not been fully realized. The basic idea is that any sequence of symbols may define trajectories in the continuous space conserving all its statistical properties. Ideally, such a representation would allow scale independent sequence analysis – without the context of fixed memory length. A simple example would consist on being able to infer the homology between two sequences solely by comparing the coordinates of any two homologous units. Results We have successfully identified such an iterative function for bijective mappingψ of discrete sequences into objects of continuous state space that enable scale-independent sequence analysis. The technique, named Universal Sequence Mapping (USM), is applicable to sequences with an arbitrary length and arbitrary number of unique units and generates a representation where map distance estimates sequence similarity. The novel USM procedure is based on earlier work by these and other authors on the properties of Chaos Game Representation (CGR). The latter enables the representation of 4 unit type sequences (like DNA) as an order free Markov Chain transition table. The properties of USM are illustrated with test data and can be verified for other data by using the accompanying web-based tool:http://bioinformatics.musc.edu/~jonas/usm/. Conclusions USM is shown to enable a statistical mechanics approach to sequence analysis. The scale independent representation frees sequence analysis from the need to assume a memory length in the investigation of syntactic rules. PMID:11895567
USDA-ARS?s Scientific Manuscript database
High-density genetic linkage maps are essential for fine mapping QTLs controlling disease resistance traits, such as early leaf spot (ELS), late leaf spot (LLS), and Tomato spotted wilt virus (TSWV). With completion of the genome sequences of two diploid ancestors of cultivated peanut, we could use ...
Properties of the Tent map for decimal fractions with fixed precision
NASA Astrophysics Data System (ADS)
Chetverikov, V. M.
2018-01-01
The one-dimensional discrete Tent map is a well-known example of a map whose fixed points are all unstable on the segment [0,1]. This map leads to the positivity of the Lyapunov exponent for the corresponding recurrent sequence. Therefore in a situation of general position, this sequence must demonstrate the properties of deterministic chaos. However if the first term of the recurrence sequence is taken as a decimal fraction with a fixed number “k” of digits after the decimal point and all calculations are carried out accurately, then the situation turns out to be completely different. In this case, first, the Tent map does not lead to an increase in significant digits in the terms of the sequence, and secondly, demonstrates the existence of a finite number of eventually periodic orbits, which are attractors for all other decimal numbers with the number of significant digits not exceeding “k”.
Tabata, Ryo; Kamiya, Takehiro; Shigenobu, Shuji; Yamaguchi, Katsushi; Yamada, Masashi; Hasebe, Mitsuyasu; Fujiwara, Toru; Sawa, Shinichiro
2013-01-01
Next-generation sequencing (NGS) technologies enable the rapid production of an enormous quantity of sequence data. These powerful new technologies allow the identification of mutations by whole-genome sequencing. However, most reported NGS-based mapping methods, which are based on bulked segregant analysis, are costly and laborious. To address these limitations, we designed a versatile NGS-based mapping method that consists of a combination of low- to medium-coverage multiplex SOLiD (Sequencing by Oligonucleotide Ligation and Detection) and classical genetic rough mapping. Using only low to medium coverage reduces the SOLiD sequencing costs and, since just 10 to 20 mutant F2 plants are required for rough mapping, the operation is simple enough to handle in a laboratory with limited space and funding. As a proof of principle, we successfully applied this method to identify the CTR1, which is involved in boron-mediated root development, from among a population of high boron requiring Arabidopsis thaliana mutants. Our work demonstrates that this NGS-based mapping method is a moderately priced and versatile method that can readily be applied to other model organisms. PMID:23104114
A physical map of the bovine genome
Snelling, Warren M; Chiu, Readman; Schein, Jacqueline E; Hobbs, Matthew; Abbey, Colette A; Adelson, David L; Aerts, Jan; Bennett, Gary L; Bosdet, Ian E; Boussaha, Mekki; Brauning, Rudiger; Caetano, Alexandre R; Costa, Marcos M; Crawford, Allan M; Dalrymple, Brian P; Eggen, André; Everts-van der Wind, Annelie; Floriot, Sandrine; Gautier, Mathieu; Gill, Clare A; Green, Ronnie D; Holt, Robert; Jann, Oliver; Jones, Steven JM; Kappes, Steven M; Keele, John W; de Jong, Pieter J; Larkin, Denis M; Lewin, Harris A; McEwan, John C; McKay, Stephanie; Marra, Marco A; Mathewson, Carrie A; Matukumalli, Lakshmi K; Moore, Stephen S; Murdoch, Brenda; Nicholas, Frank W; Osoegawa, Kazutoyo; Roy, Alice; Salih, Hanni; Schibler, Laurent; Schnabel, Robert D; Silveri, Licia; Skow, Loren C; Smith, Timothy PL; Sonstegard, Tad S; Taylor, Jeremy F; Tellam, Ross; Van Tassell, Curtis P; Williams, John L; Womack, James E; Wye, Natasja H; Yang, George; Zhao, Shaying
2007-01-01
Background Cattle are important agriculturally and relevant as a model organism. Previously described genetic and radiation hybrid (RH) maps of the bovine genome have been used to identify genomic regions and genes affecting specific traits. Application of these maps to identify influential genetic polymorphisms will be enhanced by integration with each other and with bacterial artificial chromosome (BAC) libraries. The BAC libraries and clone maps are essential for the hybrid clone-by-clone/whole-genome shotgun sequencing approach taken by the bovine genome sequencing project. Results A bovine BAC map was constructed with HindIII restriction digest fragments of 290,797 BAC clones from animals of three different breeds. Comparative mapping of 422,522 BAC end sequences assisted with BAC map ordering and assembly. Genotypes and pedigree from two genetic maps and marker scores from three whole-genome RH panels were consolidated on a 17,254-marker composite map. Sequence similarity allowed integrating the BAC and composite maps with the bovine draft assembly (Btau3.1), establishing a comprehensive resource describing the bovine genome. Agreement between the marker and BAC maps and the draft assembly is high, although discrepancies exist. The composite and BAC maps are more similar than either is to the draft assembly. Conclusion Further refinement of the maps and greater integration into the genome assembly process may contribute to a high quality assembly. The maps provide resources to associate phenotypic variation with underlying genomic variation, and are crucial resources for understanding the biology underpinning this important ruminant species so closely associated with humans. PMID:17697342
DNA Modification Study of Major Depressive Disorder: Beyond Locus-by-Locus Comparisons
Oh, Gabriel; Wang, Sun-Chong; Pal, Mrinal; Chen, Zheng Fei; Khare, Tarang; Tochigi, Mamoru; Ng, Catherine; Yang, Yeqing A.; Kwan, Andrew; Kaminsky, Zachary A.; Mill, Jonathan; Gunasinghe, Cerisse; Tackett, Jennifer L.; Gottesman, Irving I.; Willemsen, Gonneke; de Geus, Eco J.C.; Vink, Jacqueline M.; Slagboom, P. Eline; Wray, Naomi R.; Heath, Andrew C.; Montgomery, Grant W.; Turecki, Gustavo; Martin, Nicholas G.; Boomsma, Dorret I.; McGuffin, Peter; Kustra, Rafal; Petronis, Art
2014-01-01
Background Major depressive disorder (MDD) exhibits numerous clinical and molecular features that are consistent with putative epigenetic misregulation. Despite growing interest in epigenetic studies of psychiatric diseases, the methodologies guiding such studies have not been well defined. Methods We performed DNA modification analysis in white blood cells from monozygotic twins discordant for MDD, in brain prefrontal cortex, and germline (sperm) samples from affected individuals and control subjects (total N = 304) using 8.1K CpG island microarrays and fine mapping. In addition to the traditional locus-by-locus comparisons, we explored the potential of new analytical approaches in epigenomic studies. Results In the microarray experiment, we detected a number of nominally significant DNA modification differences in MDD and validated selected targets using bisulfite pyrosequencing. Some MDD epigenetic changes, however, overlapped across brain, blood, and sperm more often than expected by chance. We also demonstrated that stratification for disease severity and age may increase the statistical power of epimutation detection. Finally, a series of new analytical approaches, such as DNA modification networks and machine-learning algorithms using binary and quantitative depression phenotypes, provided additional insights on the epigenetic contributions to MDD. Conclusions Mapping epigenetic differences in MDD (and other psychiatric diseases) is a complex task. However, combining traditional and innovative analytical strategies may lead to identification of disease-specific etiopathogenic epimutations. PMID:25108803
DNA modification study of major depressive disorder: beyond locus-by-locus comparisons.
Oh, Gabriel; Wang, Sun-Chong; Pal, Mrinal; Chen, Zheng Fei; Khare, Tarang; Tochigi, Mamoru; Ng, Catherine; Yang, Yeqing A; Kwan, Andrew; Kaminsky, Zachary A; Mill, Jonathan; Gunasinghe, Cerisse; Tackett, Jennifer L; Gottesman, Irving I; Willemsen, Gonneke; de Geus, Eco J C; Vink, Jacqueline M; Slagboom, P Eline; Wray, Naomi R; Heath, Andrew C; Montgomery, Grant W; Turecki, Gustavo; Martin, Nicholas G; Boomsma, Dorret I; McGuffin, Peter; Kustra, Rafal; Petronis, Art
2015-02-01
Major depressive disorder (MDD) exhibits numerous clinical and molecular features that are consistent with putative epigenetic misregulation. Despite growing interest in epigenetic studies of psychiatric diseases, the methodologies guiding such studies have not been well defined. We performed DNA modification analysis in white blood cells from monozygotic twins discordant for MDD, in brain prefrontal cortex, and germline (sperm) samples from affected individuals and control subjects (total N = 304) using 8.1K CpG island microarrays and fine mapping. In addition to the traditional locus-by-locus comparisons, we explored the potential of new analytical approaches in epigenomic studies. In the microarray experiment, we detected a number of nominally significant DNA modification differences in MDD and validated selected targets using bisulfite pyrosequencing. Some MDD epigenetic changes, however, overlapped across brain, blood, and sperm more often than expected by chance. We also demonstrated that stratification for disease severity and age may increase the statistical power of epimutation detection. Finally, a series of new analytical approaches, such as DNA modification networks and machine-learning algorithms using binary and quantitative depression phenotypes, provided additional insights on the epigenetic contributions to MDD. Mapping epigenetic differences in MDD (and other psychiatric diseases) is a complex task. However, combining traditional and innovative analytical strategies may lead to identification of disease-specific etiopathogenic epimutations. Copyright © 2015 Society of Biological Psychiatry. All rights reserved.
Liu, Lian; Zhang, Shao-Wu; Huang, Yufei; Meng, Jia
2017-08-31
As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high throughput sequencing techniques, such as, MeRIP-Seq, transcriptome-wide RNA methylation profile is now available in the form of count-based data, with which it is often of interests to study the dynamics at epitranscriptomic layer. However, the sample size of RNA methylation experiment is usually very small due to its costs; and additionally, there usually exist a large number of genes whose methylation level cannot be accurately estimated due to their low expression level, making differential RNA methylation analysis a difficult task. We present QNB, a statistical approach for differential RNA methylation analysis with count-based small-sample sequencing data. Compared with previous approaches such as DRME model based on a statistical test covering the IP samples only with 2 negative binomial distributions, QNB is based on 4 independent negative binomial distributions with their variances and means linked by local regressions, and in the way, the input control samples are also properly taken care of. In addition, different from DRME approach, which relies only the input control sample only for estimating the background, QNB uses a more robust estimator for gene expression by combining information from both input and IP samples, which could largely improve the testing performance for very lowly expressed genes. QNB showed improved performance on both simulated and real MeRIP-Seq datasets when compared with competing algorithms. And the QNB model is also applicable to other datasets related RNA modifications, including but not limited to RNA bisulfite sequencing, m 1 A-Seq, Par-CLIP, RIP-Seq, etc.
Methylsorb: a simple method for quantifying DNA methylation using DNA-gold affinity interactions.
Sina, Abu Ali Ibn; Carrascosa, Laura G; Palanisamy, Ramkumar; Rauf, Sakandar; Shiddiky, Muhammad J A; Trau, Matt
2014-10-21
The analysis of DNA methylation is becoming increasingly important both in the clinic and also as a research tool to unravel key epigenetic molecular mechanisms in biology. Current methodologies for the quantification of regional DNA methylation (i.e., the average methylation over a region of DNA in the genome) are largely affected by comprehensive DNA sequencing methodologies which tend to be expensive, tedious, and time-consuming for many applications. Herein, we report an alternative DNA methylation detection method referred to as "Methylsorb", which is based on the inherent affinity of DNA bases to the gold surface (i.e., the trend of the affinity interactions is adenine > cytosine ≥ guanine > thymine).1 Since the degree of gold-DNA affinity interaction is highly sequence dependent, it provides a new capability to detect DNA methylation by simply monitoring the relative adsorption of bisulfite treated DNA sequences onto a gold chip. Because the selective physical adsorption of DNA fragments to gold enable a direct read-out of regional DNA methylation, the current requirement for DNA sequencing is obviated. To demonstrate the utility of this method, we present data on the regional methylation status of two CpG clusters located in the EN1 and MIR200B genes in MCF7 and MDA-MB-231 cells. The methylation status of these regions was obtained from the change in relative mass on gold surface with respect to relative adsorption of an unmethylated DNA source and this was detected using surface plasmon resonance (SPR) in a label-free and real-time manner. We anticipate that the simplicity of this method, combined with the high level of accuracy for identifying the methylation status of cytosines in DNA, could find broad application in biology and diagnostics.
In silico modeling of epigenetic-induced changes in photoreceptor cis-regulatory elements.
Hossain, Reafa A; Dunham, Nicholas R; Enke, Raymond A; Berndsen, Christopher E
2018-01-01
DNA methylation is a well-characterized epigenetic repressor of mRNA transcription in many plant and vertebrate systems. However, the mechanism of this repression is not fully understood. The process of transcription is controlled by proteins that regulate recruitment and activity of RNA polymerase by binding to specific cis-regulatory sequences. Cone-rod homeobox (CRX) is a well-characterized mammalian transcription factor that controls photoreceptor cell-specific gene expression. Although much is known about the functions and DNA binding specificity of CRX, little is known about how DNA methylation modulates CRX binding affinity to genomic cis-regulatory elements. We used bisulfite pyrosequencing of human ocular tissues to measure DNA methylation levels of the regulatory regions of RHO , PDE6B, PAX6 , and LINE1 retrotransposon repeats. To describe the molecular mechanism of repression, we used molecular modeling to illustrate the effect of DNA methylation on human RHO regulatory sequences. In this study, we demonstrate an inverse correlation between DNA methylation in regulatory regions adjacent to the human RHO and PDE6B genes and their subsequent transcription in human ocular tissues. Docking of CRX to the DNA models shows that CRX interacts with the grooves of these sequences, suggesting changes in groove structure could regulate binding. Molecular dynamics simulations of the RHO promoter and enhancer regions show changes in the flexibility and groove width upon epigenetic modification. Models also demonstrate changes in the local dynamics of CRX binding sites within RHO regulatory sequences which may account for the repression of CRX-dependent transcription. Collectively, these data demonstrate epigenetic regulation of CRX binding sites in human retinal tissue and provide insight into the mechanism of this mode of epigenetic regulation to be tested in future experiments.
Epigenomics of Development in Populus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Strauss, Steve; Freitag, Michael; Mockler, Todd
2013-01-10
We conducted research to determine the role of epigenetic modifications during tree development using poplar (Populus trichocarpa), a model woody feedstock species. Using methylated DNA immunoprecipitation (MeDIP) or chromatin immunoprecipitation (ChIP), followed by high-throughput sequencing, we are analyzed DNA and histone methylation patterns in the P. trichocarpa genome in relation to four biological processes: bud dormancy and release, mature organ maintenance, in vitro organogenesis, and methylation suppression. Our project is now completed. We have 1) produced 22 transgenic events for a gene involved in DNA methylation suppression and studied its phenotypic consequences; 2) completed sequencing of methylated DNA from elevenmore » target tissues in wildtype P. trichocarpa; 3) updated our customized poplar genome browser using the open-source software tools (2.13) and (V2.2) of the P. trichocarpa genome; 4) produced summary data for genome methylation in P. trichocarpa, including distribution of methylation across chromosomes and in and around genes; 5) employed bioinformatic and statistical methods to analyze differences in methylation patterns among tissue types; and 6) used bisulfite sequencing of selected target genes to confirm bioinformatics and sequencing results, and gain a higher-resolution view of methylation at selected genes 7) compared methylation patterns to expression using available microarray data. Our main findings of biological significance are the identification of extensive regions of the genome that display developmental variation in DNA methylation; highly distinctive gene-associated methylation profiles in reproductive tissues, particularly male catkins; a strong whole genome/all tissue inverse association of methylation at gene bodies and promoters with gene expression; a lack of evidence that tissue specificity of gene expression is associated with gene methylation; and evidence that genome methylation is a significant impediment to tissue dedifferentiation and redifferentiation in vitro.« less
Linkage Map of Escherichia coli K-12, Edition 10: The Traditional Map
Berlyn, Mary K. B.
1998-01-01
This map is an update of the edition 9 map by Berlyn et al. (M. K. B. Berlyn, K. B. Low, and K. E. Rudd, p. 1715–1902, in F. C. Neidhardt et al., ed., Escherichia coli and Salmonella: cellular and molecular biology, 2nd ed., vol. 2, 1996). It uses coordinates established by the completed sequence, expressed as 100 minutes for the entire circular map, and adds new genes discovered and established since 1996 and eliminates those shown to correspond to other known genes. The latter are included as synonyms. An alphabetical list of genes showing map location, synonyms, the protein or RNA product of the gene, phenotypes of mutants, and reference citations is provided. In addition to genes known to correspond to gene sequences, other genes, often older, that are described by phenotype and older mapping techniques and that have not been correlated with sequences are included. PMID:9729611
2000-04-01
Genes, LOH Mapping, Chromosome 17, Physical Mapping, Genetic Mapping, CDNA Screening, Humans, Anatomical 81 Samples, Mutation Detection, Breast Cancer...According to the established model for LOH involving tumor suppressor genes, the allele remaining in the tumor sample would harbor the deleterious mutation ...sequencing on an AB1373A sequencer (Applied Biosystems, Foster City, CA). As none of the samples we have sequenced have revealed any mutations , we have
A high-resolution cattle CNV map by population-scale genome sequencing
USDA-ARS?s Scientific Manuscript database
Copy Number Variations (CNVs) are common genomic structural variations that have been linked to human diseases and phenotypic traits. Prior studies in cattle have produced low-resolution CNV maps. We constructed a draft, high-resolution map of cattle CNVs based on whole genome sequencing data from 7...
Microbial genome sequencing using optical mapping and Illumina sequencing
USDA-ARS?s Scientific Manuscript database
Introduction Optical mapping is a technique in which strands of genomic DNA are digested with one or more restriction enzymes, and a physical map of the genome constructed from the resulting image. In outline, genomic DNA is extracted from a pure culture, linearly arrayed on a specialized glass sli...
Iehisa, Julio Cesar Masaru; Ohno, Ryoko; Kimura, Tatsuro; Enoki, Hiroyuki; Nishimura, Satoru; Okamoto, Yuki; Nasuda, Shuhei; Takumi, Shigeo
2014-01-01
The large genome and allohexaploidy of common wheat have complicated construction of a high-density genetic map. Although improvements in the throughput of next-generation sequencing (NGS) technologies have made it possible to obtain a large amount of genotyping data for an entire mapping population by direct sequencing, including hexaploid wheat, a significant number of missing data points are often apparent due to the low coverage of sequencing. In the present study, a microarray-based polymorphism detection system was developed using NGS data obtained from complexity-reduced genomic DNA of two common wheat cultivars, Chinese Spring (CS) and Mironovskaya 808. After design and selection of polymorphic probes, 13,056 new markers were added to the linkage map of a recombinant inbred mapping population between CS and Mironovskaya 808. On average, 2.49 missing data points per marker were observed in the 201 recombinant inbred lines, with a maximum of 42. Around 40% of the new markers were derived from genic regions and 11% from repetitive regions. The low number of retroelements indicated that the new polymorphic markers were mainly derived from the less repetitive region of the wheat genome. Around 25% of the mapped sequences were useful for alignment with the physical map of barley. Quantitative trait locus (QTL) analyses of 14 agronomically important traits related to flowering, spikes, and seeds demonstrated that the new high-density map showed improved QTL detection, resolution, and accuracy over the original simple sequence repeat map. PMID:24972598
Iehisa, Julio Cesar Masaru; Ohno, Ryoko; Kimura, Tatsuro; Enoki, Hiroyuki; Nishimura, Satoru; Okamoto, Yuki; Nasuda, Shuhei; Takumi, Shigeo
2014-10-01
The large genome and allohexaploidy of common wheat have complicated construction of a high-density genetic map. Although improvements in the throughput of next-generation sequencing (NGS) technologies have made it possible to obtain a large amount of genotyping data for an entire mapping population by direct sequencing, including hexaploid wheat, a significant number of missing data points are often apparent due to the low coverage of sequencing. In the present study, a microarray-based polymorphism detection system was developed using NGS data obtained from complexity-reduced genomic DNA of two common wheat cultivars, Chinese Spring (CS) and Mironovskaya 808. After design and selection of polymorphic probes, 13,056 new markers were added to the linkage map of a recombinant inbred mapping population between CS and Mironovskaya 808. On average, 2.49 missing data points per marker were observed in the 201 recombinant inbred lines, with a maximum of 42. Around 40% of the new markers were derived from genic regions and 11% from repetitive regions. The low number of retroelements indicated that the new polymorphic markers were mainly derived from the less repetitive region of the wheat genome. Around 25% of the mapped sequences were useful for alignment with the physical map of barley. Quantitative trait locus (QTL) analyses of 14 agronomically important traits related to flowering, spikes, and seeds demonstrated that the new high-density map showed improved QTL detection, resolution, and accuracy over the original simple sequence repeat map. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Accurate estimation of short read mapping quality for next-generation genome sequencing
Ruffalo, Matthew; Koyutürk, Mehmet; Ray, Soumya; LaFramboise, Thomas
2012-01-01
Motivation: Several software tools specialize in the alignment of short next-generation sequencing reads to a reference sequence. Some of these tools report a mapping quality score for each alignment—in principle, this quality score tells researchers the likelihood that the alignment is correct. However, the reported mapping quality often correlates weakly with actual accuracy and the qualities of many mappings are underestimated, encouraging the researchers to discard correct mappings. Further, these low-quality mappings tend to correlate with variations in the genome (both single nucleotide and structural), and such mappings are important in accurately identifying genomic variants. Approach: We develop a machine learning tool, LoQuM (LOgistic regression tool for calibrating the Quality of short read mappings, to assign reliable mapping quality scores to mappings of Illumina reads returned by any alignment tool. LoQuM uses statistics on the read (base quality scores reported by the sequencer) and the alignment (number of matches, mismatches and deletions, mapping quality score returned by the alignment tool, if available, and number of mappings) as features for classification and uses simulated reads to learn a logistic regression model that relates these features to actual mapping quality. Results: We test the predictions of LoQuM on an independent dataset generated by the ART short read simulation software and observe that LoQuM can ‘resurrect’ many mappings that are assigned zero quality scores by the alignment tools and are therefore likely to be discarded by researchers. We also observe that the recalibration of mapping quality scores greatly enhances the precision of called single nucleotide polymorphisms. Availability: LoQuM is available as open source at http://compbio.case.edu/loqum/. Contact: matthew.ruffalo@case.edu. PMID:22962451
DNA nanomapping using CRISPR-Cas9 as a programmable nanoparticle.
Mikheikin, Andrey; Olsen, Anita; Leslie, Kevin; Russell-Pavier, Freddie; Yacoot, Andrew; Picco, Loren; Payton, Oliver; Toor, Amir; Chesney, Alden; Gimzewski, James K; Mishra, Bud; Reed, Jason
2017-11-21
Progress in whole-genome sequencing using short-read (e.g., <150 bp), next-generation sequencing technologies has reinvigorated interest in high-resolution physical mapping to fill technical gaps that are not well addressed by sequencing. Here, we report two technical advances in DNA nanotechnology and single-molecule genomics: (1) we describe a labeling technique (CRISPR-Cas9 nanoparticles) for high-speed AFM-based physical mapping of DNA and (2) the first successful demonstration of using DVD optics to image DNA molecules with high-speed AFM. As a proof of principle, we used this new "nanomapping" method to detect and map precisely BCL2-IGH translocations present in lymph node biopsies of follicular lymphoma patents. This HS-AFM "nanomapping" technique can be complementary to both sequencing and other physical mapping approaches.
Ma, Chun-Lei; Jin, Ji-Qiang; Li, Chun-Fang; Wang, Rong-Kai; Zheng, Hong-Kun; Yao, Ming-Zhe; Chen, Liang
2015-01-01
Genetic maps are important tools in plant genomics and breeding. The present study reports the large-scale discovery of single nucleotide polymorphisms (SNPs) for genetic map construction in tea plant. We developed a total of 6,042 valid SNP markers using specific-locus amplified fragment sequencing (SLAF-seq), and subsequently mapped them into the previous framework map. The final map contained 6,448 molecular markers, distributing on fifteen linkage groups corresponding to the number of tea plant chromosomes. The total map length was 3,965 cM, with an average inter-locus distance of 1.0 cM. This map is the first SNP-based reference map of tea plant, as well as the most saturated one developed to date. The SNP markers and map resources generated in this study provide a wealth of genetic information that can serve as a foundation for downstream genetic analyses, such as the fine mapping of quantitative trait loci (QTL), map-based cloning, marker-assisted selection, and anchoring of scaffolds to facilitate the process of whole genome sequencing projects for tea plant. PMID:26035838
Aita, Takuyo; Nishigaki, Koichi
2012-11-01
To visualize a bird's-eye view of an ensemble of mitochondrial genome sequences for various species, we recently developed a novel method of mapping a biological sequence ensemble into Three-Dimensional (3D) vector space. First, we represented a biological sequence of a species s by a word-composition vector x(s), where its length [absolute value]x(s)[absolute value] represents the sequence length, and its unit vector x(s)/[absolute value]x(s)[absolute value] represents the relative composition of the K-tuple words through the sequence and the size of the dimension, N=4(K), is the number of all possible words with the length of K. Second, we mapped the vector x(s) to the 3D position vector y(s), based on the two following simple principles: (1) [absolute value]y(s)[absolute value]=[absolute value]x(s)[absolute value] and (2) the angle between y(s) and y(t) maximally correlates with the angle between x(s) and x(t). The mitochondrial genome sequences for 311 species, including 177 Animalia, 85 Fungi and 49 Green plants, were mapped into 3D space by using K=7. The mapping was successful because the angles between vectors before and after the mapping highly correlated with each other (correlation coefficients were 0.92-0.97). Interestingly, the Animalia kingdom is distributed along a single arc belt (just like the Milky Way on a Celestial Globe), and the Fungi and Green plant kingdoms are distributed in a similar arc belt. These two arc belts intersect at their respective middle regions and form a cross structure just like a jet aircraft fuselage and its wings. This new mapping method will allow researchers to intuitively interpret the visual information presented in the maps in a highly effective manner. Copyright © 2012 Elsevier Inc. All rights reserved.
Efficient high-throughput sequencing of a laser microdissected chromosome arm
2013-01-01
Background Genomic sequence assemblies are key tools for a broad range of gene function and evolutionary studies. The diploid amphibian Xenopus tropicalis plays a pivotal role in these fields due to its combination of experimental flexibility, diploid genome, and early-branching tetrapod taxonomic position, having diverged from the amniote lineage ~360 million years ago. A genome assembly and a genetic linkage map have recently been made available. Unfortunately, large gaps in the linkage map attenuate long-range integrity of the genome assembly. Results We laser dissected the short arm of X. tropicalis chromosome 7 for next generation sequencing and computational mapping to the reference genome. This arm is of particular interest as it encodes the sex determination locus, but its genetic map contains large gaps which undermine available genome assemblies. Whole genome amplification of 15 laser-microdissected 7p arms followed by next generation sequencing yielded ~35 million reads, over four million of which uniquely mapped to the X. tropicalis genome. Our analysis placed more than 200 previously unmapped scaffolds on the analyzed chromosome arm, providing valuable low-resolution physical map information for de novo genome assembly. Conclusion We present a new approach for improving and validating genetic maps and sequence assemblies. Whole genome amplification of 15 microdissected chromosome arms provided sufficient high-quality material for localizing previously unmapped scaffolds and genes as well as recognizing mislocalized scaffolds. PMID:23714049
Method of removing oxides of sulfur and oxides of nitrogen from exhaust gases
Walker, Richard J.
1986-01-01
A continuous method is presented for removing both oxides of sulfur and oxides of nitrogen from combustion or exhaust gases with the regeneration of the absorbent. Exhaust gas is cleaned of particulates and HCl by a water scrub prior to contact with a liquid absorbent that includes an aqueous solution of bisulfite and sulfite ions along with a metal chelate, such as, an iron or zinc aminopolycarboxylic acid. Following contact with the combustion gases the spent absorbent is subjected to electrodialysis to transfer bisulfite ions into a sulfuric acid solution while splitting water with hydroxide and hydrogen ion migration to equalize electrical charge. The electrodialysis stack includes alternate layers of anion selective and bipolar membranes. Oxides of nitrogen are removed from the liquid absorbent by air stripping at an elevated temperature and the regenerated liquid absorbent is returned to contact with exhaust gases for removal of sulfur oxides and nitrogen oxides.
NASA Astrophysics Data System (ADS)
Dutta, Tanoy; Chandra, Falguni; Koner, Apurba L.
2018-02-01
A ;naked-eye; detection of health hazardous bisulfite (HSO3-) and hypochlorite (ClO-) using an indicator dye (Quinaldine Red, QR) in a wide range of pH is demonstrated. The molecule contains a quinoline moiety linked to an N,N-dimethylaniline moiety with a conjugated double bond. Treatment of QR with HSO3- and ClO-, in aqueous solution at near-neutral pH, resulted in a colorless product with high selectivity and sensitivity. The detection limit was 47.8 μM and 0.2 μM for HSO3- and ClO- respectively. However, ClO- was 50 times more sensitive and with 2 times faster response compared to HSO3-. The detail characterization and related analysis demonstrate the potential of QR for a rapid, robust and highly efficient colorimetric sensor for the practical applications to detect hypochlorite in water samples.
ZOOM Lite: next-generation sequencing data mapping and visualization software
Zhang, Zefeng; Lin, Hao; Ma, Bin
2010-01-01
High-throughput next-generation sequencing technologies pose increasing demands on the efficiency, accuracy and usability of data analysis software. In this article, we present ZOOM Lite, a software for efficient reads mapping and result visualization. With a kernel capable of mapping tens of millions of Illumina or AB SOLiD sequencing reads efficiently and accurately, and an intuitive graphical user interface, ZOOM Lite integrates reads mapping and result visualization into a easy to use pipeline on desktop PC. The software handles both single-end and paired-end reads, and can output both the unique mapping result or the top N mapping results for each read. Additionally, the software takes a variety of input file formats and outputs to several commonly used result formats. The software is freely available at http://bioinfor.com/zoom/lite/. PMID:20530531
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cao Daliang; Earl, Matthew A.; Luan, Shuang
2006-04-15
A new leaf-sequencing approach has been developed that is designed to reduce the number of required beam segments for step-and-shoot intensity modulated radiation therapy (IMRT). This approach to leaf sequencing is called continuous-intensity-map-optimization (CIMO). Using a simulated annealing algorithm, CIMO seeks to minimize differences between the optimized and sequenced intensity maps. Two distinguishing features of the CIMO algorithm are (1) CIMO does not require that each optimized intensity map be clustered into discrete levels and (2) CIMO is not rule-based but rather simultaneously optimizes both the aperture shapes and weights. To test the CIMO algorithm, ten IMRT patient cases weremore » selected (four head-and-neck, two pancreas, two prostate, one brain, and one pelvis). For each case, the optimized intensity maps were extracted from the Pinnacle{sup 3} treatment planning system. The CIMO algorithm was applied, and the optimized aperture shapes and weights were loaded back into Pinnacle. A final dose calculation was performed using Pinnacle's convolution/superposition based dose calculation. On average, the CIMO algorithm provided a 54% reduction in the number of beam segments as compared with Pinnacle's leaf sequencer. The plans sequenced using the CIMO algorithm also provided improved target dose uniformity and a reduced discrepancy between the optimized and sequenced intensity maps. For ten clinical intensity maps, comparisons were performed between the CIMO algorithm and the power-of-two reduction algorithm of Xia and Verhey [Med. Phys. 25(8), 1424-1434 (1998)]. When the constraints of a Varian Millennium multileaf collimator were applied, the CIMO algorithm resulted in a 26% reduction in the number of segments. For an Elekta multileaf collimator, the CIMO algorithm resulted in a 67% reduction in the number of segments. An average leaf sequencing time of less than one minute per beam was observed.« less
Guo, Yinshan; Shi, Guangli; Liu, Zhendong; Zhao, Yuhui; Yang, Xiaoxu; Zhu, Junchi; Li, Kun; Guo, Xiuwu
2015-01-01
In this study, 149 F1 plants from the interspecific cross between ‘Red Globe’ (Vitis vinifera L.) and ‘Shuangyou’ (Vitis amurensis Rupr.) and the parent were used to construct a molecular genetic linkage map by using the specific length amplified fragment sequencing technique. DNA sequencing generated 41.282 Gb data consisting of 206,411,693 paired-end reads. The average sequencing depths were 68.35 for ‘Red Globe,’ 63.65 for ‘Shuangyou,’ and 8.01 for each progeny. In all, 115,629 high-quality specific length amplified fragments were detected, of which 42,279 were polymorphic. The genetic map was constructed using 7,199 of these polymorphic markers. These polymorphic markers were assigned to 19 linkage groups; the total length of the map was 1929.13 cm, with an average distance of 0.28 cm between each maker. To our knowledge, the genetic maps constructed in this study contain the largest number of molecular markers. These high-density genetic maps might form the basis for the fine quantitative trait loci mapping and molecular-assisted breeding of grape. PMID:26089826
Kawase, Junya; Aoki, Jun-ya; Araki, Kazuo
2018-01-01
To investigate chromosome evolution in fish species, we newly mapped 181 markers that allowed us to construct a yellowtail (Seriola quinqueradiata) radiation hybrid (RH) physical map with 1,713 DNA markers, which was far denser than a previous map, and we anchored the de novo assembled sequences onto the RH physical map. Finally, we mapped a total of 13,977 expressed sequence tags (ESTs) on a genome sequence assembly aligned with the physical map. Using the high-density physical map and anchored genome sequences, we accurately compared the yellowtail genome structure with the genome structures of five model fishes to identify characteristics of the yellowtail genome. Between yellowtail and Japanese medaka (Oryzias latipes), almost all regions of the chromosomes were conserved and some blocks comprising several markers were translocated. Using the genome information of the spotted gar (Lepisosteus oculatus) as a reference, we further documented syntenic relationships and chromosomal rearrangements that occurred during evolution in four other acanthopterygian species (Japanese medaka, zebrafish, spotted green pufferfish and three-spined stickleback). The evolutionary chromosome translocation frequency was 1.5-2-times higher in yellowtail than in medaka, pufferfish, and stickleback. PMID:29290830
NASA Astrophysics Data System (ADS)
Yarnykh, V.; Korostyshevskaya, A.
2017-08-01
Macromolecular proton fraction (MPF) is a biophysical parameter describing the amount of macromolecular protons involved into magnetization exchange with water protons in tissues. MPF represents a significant interest as a magnetic resonance imaging (MRI) biomarker of myelin for clinical applications. A recent fast MPF mapping method enabled clinical translation of MPF measurements due to time-efficient acquisition based on the single-point constrained fit algorithm. However, previous MPF mapping applications utilized only 3 Tesla MRI scanners and modified pulse sequences, which are not commonly available. This study aimed to test the feasibility of MPF mapping implementation on a 1.5 Tesla clinical scanner using standard manufacturer’s sequences and compare the performance of this method between 1.5 and 3 Tesla scanners. MPF mapping was implemented on 1.5 and 3 Tesla MRI units of one manufacturer with either optimized custom-written or standard product pulse sequences. Whole-brain three-dimensional MPF maps obtained from a single volunteer were compared between field strengths and implementation options. MPF maps demonstrated similar quality at both field strengths. MPF values in segmented brain tissues and specific anatomic regions appeared in close agreement. This experiment demonstrates the feasibility of fast MPF mapping using standard sequences on 1.5 T and 3 T clinical scanners.
A second-generation anchored genetic linkage map of the tammar wallaby (Macropus eugenii)
2011-01-01
Background The tammar wallaby, Macropus eugenii, a small kangaroo used for decades for studies of reproduction and metabolism, is the model Australian marsupial for genome sequencing and genetic investigations. The production of a more comprehensive cytogenetically-anchored genetic linkage map will significantly contribute to the deciphering of the tammar wallaby genome. It has great value as a resource to identify novel genes and for comparative studies, and is vital for the ongoing genome sequence assembly and gene ordering in this species. Results A second-generation anchored tammar wallaby genetic linkage map has been constructed based on a total of 148 loci. The linkage map contains the original 64 loci included in the first-generation map, plus an additional 84 microsatellite loci that were chosen specifically to increase coverage and assist with the anchoring and orientation of linkage groups to chromosomes. These additional loci were derived from (a) sequenced BAC clones that had been previously mapped to tammar wallaby chromosomes by fluorescence in situ hybridization (FISH), (b) End sequence from BACs subsequently FISH-mapped to tammar wallaby chromosomes, and (c) tammar wallaby genes orthologous to opossum genes predicted to fill gaps in the tammar wallaby linkage map as well as three X-linked markers from a published study. Based on these 148 loci, eight linkage groups were formed. These linkage groups were assigned (via FISH-mapped markers) to all seven autosomes and the X chromosome. The sex-pooled map size is 1402.4 cM, which is estimated to provide 82.6% total coverage of the genome, with an average interval distance of 10.9 cM between adjacent markers. The overall ratio of female/male map length is 0.84, which is comparable to the ratio of 0.78 obtained for the first-generation map. Conclusions Construction of this second-generation genetic linkage map is a significant step towards complete coverage of the tammar wallaby genome and considerably extends that of the first-generation map. It will be a valuable resource for ongoing tammar wallaby genetic research and assembling the genome sequence. The sex-pooled map is available online at http://compldb.angis.org.au/. PMID:21854616
A second-generation anchored genetic linkage map of the tammar wallaby (Macropus eugenii).
Wang, Chenwei; Webley, Lee; Wei, Ke-jun; Wakefield, Matthew J; Patel, Hardip R; Deakin, Janine E; Alsop, Amber; Marshall Graves, Jennifer A; Cooper, Desmond W; Nicholas, Frank W; Zenger, Kyall R
2011-08-19
The tammar wallaby, Macropus eugenii, a small kangaroo used for decades for studies of reproduction and metabolism, is the model Australian marsupial for genome sequencing and genetic investigations. The production of a more comprehensive cytogenetically-anchored genetic linkage map will significantly contribute to the deciphering of the tammar wallaby genome. It has great value as a resource to identify novel genes and for comparative studies, and is vital for the ongoing genome sequence assembly and gene ordering in this species. A second-generation anchored tammar wallaby genetic linkage map has been constructed based on a total of 148 loci. The linkage map contains the original 64 loci included in the first-generation map, plus an additional 84 microsatellite loci that were chosen specifically to increase coverage and assist with the anchoring and orientation of linkage groups to chromosomes. These additional loci were derived from (a) sequenced BAC clones that had been previously mapped to tammar wallaby chromosomes by fluorescence in situ hybridization (FISH), (b) End sequence from BACs subsequently FISH-mapped to tammar wallaby chromosomes, and (c) tammar wallaby genes orthologous to opossum genes predicted to fill gaps in the tammar wallaby linkage map as well as three X-linked markers from a published study. Based on these 148 loci, eight linkage groups were formed. These linkage groups were assigned (via FISH-mapped markers) to all seven autosomes and the X chromosome. The sex-pooled map size is 1402.4 cM, which is estimated to provide 82.6% total coverage of the genome, with an average interval distance of 10.9 cM between adjacent markers. The overall ratio of female/male map length is 0.84, which is comparable to the ratio of 0.78 obtained for the first-generation map. Construction of this second-generation genetic linkage map is a significant step towards complete coverage of the tammar wallaby genome and considerably extends that of the first-generation map. It will be a valuable resource for ongoing tammar wallaby genetic research and assembling the genome sequence. The sex-pooled map is available online at http://compldb.angis.org.au/.
A Teaching-Learning Sequence about Weather Map Reading
ERIC Educational Resources Information Center
Mandrikas, Achilleas; Stavrou, Dimitrios; Skordoulis, Constantine
2017-01-01
In this paper a teaching-learning sequence (TLS) introducing pre-service elementary teachers (PET) to weather map reading, with emphasis on wind assignment, is presented. The TLS includes activities about recognition of wind symbols, assignment of wind direction and wind speed on a weather map and identification of wind characteristics in a…
Construction of a SNP and SSR linkage map in autotetraploid blueberry using genotyping by sequencing
USDA-ARS?s Scientific Manuscript database
A mapping population developed from a cross between two key highbush blueberry cultivars, Draper × Jewel (Vaccinium corymbosum), segregating for a number of important phenotypic traits, has been utilized to produce a genetic linkage map. Data on 233 single sequence repeat (SSR) markers and 1794 sing...
USDA-ARS?s Scientific Manuscript database
Genotyping by sequencing (GBS) provides opportunities to generate high-resolution genetic maps at a low per-sample genotyping cost, but missing data and under-calling of heterozygotes complicate the creation of GBS linkage maps for highly heterozygous species. To overcome these issues, we developed ...
Road Maps for Learning: A Bird's Eye View
ERIC Educational Resources Information Center
Dunne, Timothy T.
2011-01-01
The notion of the road map, advocated by Black, Wilson, and Yao (2011), and the associated minutiae of the construct map have several powerful features. At one level these notions assist the teacher to select and embody a suitable sequence of constructs within a specified curriculum. Whatever disparate sequenced pathways individual learners may…
Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm
Glunčić, Matko; Paar, Vladimir
2013-01-01
The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes). PMID:22977183
Belshaw, Nigel J; Elliott, Giles O; Williams, Elizabeth A; Bradburn, David M; Mills, Sarah J; Mathers, John C; Johnson, Ian T
2004-09-01
Hypermethylation of cytosine residues in the CpG islands of tumor suppressor genes is a key mechanism of colorectal carcinogenesis. Detection and quantification of CpG island methylation in human DNA isolated from stools might provide a novel strategy for the detection and investigation of colorectal neoplasia. To explore the feasibility of this approach, colorectal biopsies and fecal samples were obtained from 32 patients attending for colonoscopy or surgery, who were found to have adenomatous polyps, colorectal cancer, or no evidence of neoplasia. A further 18 fecal samples were obtained from healthy volunteers, with no bowel symptoms. Isolated DNA was modified with sodium bisulfite and analyzed by methylation-specific PCR and combined bisulfite restriction analysis for CpG island methylation of ESR1, MGMT, HPP1, p16(INK4a), APC, and MLH1. CpG island methylation was readily detectable in both mucosal and fecal DNA with methylation-specific PCR. Using combined bisulfite restriction analysis, it was established that, in volunteers from whom biopsies were available, the levels of methylation at two CpG sites within ESR1 assayed using fecal DNA were significantly correlated with methylation in DNA from colorectal mucosa. Thus, noninvasive techniques can be used to obtain quantitative information about the level of CpG island methylation in human colorectal mucosa. The methods described here could be applied to a much expanded range of genes and may be valuable both for screening purposes and to provide greater insight into the functional consequences of epigenetic changes in the colorectal mucosa of free-living individuals.
Comparison of mapping algorithms used in high-throughput sequencing: application to Ion Torrent data
2014-01-01
Background The rapid evolution in high-throughput sequencing (HTS) technologies has opened up new perspectives in several research fields and led to the production of large volumes of sequence data. A fundamental step in HTS data analysis is the mapping of reads onto reference sequences. Choosing a suitable mapper for a given technology and a given application is a subtle task because of the difficulty of evaluating mapping algorithms. Results In this paper, we present a benchmark procedure to compare mapping algorithms used in HTS using both real and simulated datasets and considering four evaluation criteria: computational resource and time requirements, robustness of mapping, ability to report positions for reads in repetitive regions, and ability to retrieve true genetic variation positions. To measure robustness, we introduced a new definition for a correctly mapped read taking into account not only the expected start position of the read but also the end position and the number of indels and substitutions. We developed CuReSim, a new read simulator, that is able to generate customized benchmark data for any kind of HTS technology by adjusting parameters to the error types. CuReSim and CuReSimEval, a tool to evaluate the mapping quality of the CuReSim simulated reads, are freely available. We applied our benchmark procedure to evaluate 14 mappers in the context of whole genome sequencing of small genomes with Ion Torrent data for which such a comparison has not yet been established. Conclusions A benchmark procedure to compare HTS data mappers is introduced with a new definition for the mapping correctness as well as tools to generate simulated reads and evaluate mapping quality. The application of this procedure to Ion Torrent data from the whole genome sequencing of small genomes has allowed us to validate our benchmark procedure and demonstrate that it is helpful for selecting a mapper based on the intended application, questions to be addressed, and the technology used. This benchmark procedure can be used to evaluate existing or in-development mappers as well as to optimize parameters of a chosen mapper for any application and any sequencing platform. PMID:24708189
Development of an Expressed Sequence Tag (EST) Resource for Wheat (Triticum aestivum L.)
Lazo, G. R.; Chao, S.; Hummel, D. D.; Edwards, H.; Crossman, C. C.; Lui, N.; Matthews, D. E.; Carollo, V. L.; Hane, D. L.; You, F. M.; Butler, G. E.; Miller, R. E.; Close, T. J.; Peng, J. H.; Lapitan, N. L. V.; Gustafson, J. P.; Qi, L. L.; Echalier, B.; Gill, B. S.; Dilbirligi, M.; Randhawa, H. S.; Gill, K. S.; Greene, R. A.; Sorrells, M. E.; Akhunov, E. D.; Dvořák, J.; Linkiewicz, A. M.; Dubcovsky, J.; Hossain, K. G.; Kalavacharla, V.; Kianian, S. F.; Mahmoud, A. A.; Miftahudin; Ma, X.-F.; Conley, E. J.; Anderson, J. A.; Pathan, M. S.; Nguyen, H. T.; McGuire, P. E.; Qualset, C. O.; Anderson, O. D.
2004-01-01
This report describes the rationale, approaches, organization, and resource development leading to a large-scale deletion bin map of the hexaploid (2n = 6x = 42) wheat genome (Triticum aestivum L.). Accompanying reports in this issue detail results from chromosome bin-mapping of expressed sequence tags (ESTs) representing genes onto the seven homoeologous chromosome groups and a global analysis of the entire mapped wheat EST data set. Among the resources developed were the first extensive public wheat EST collection (113,220 ESTs). Described are protocols for sequencing, sequence processing, EST nomenclature, and the assembly of ESTs into contigs. These contigs plus singletons (unassembled ESTs) were used for selection of distinct sequence motif unigenes. Selected ESTs were rearrayed, validated by 5′ and 3′ sequencing, and amplified for probing a series of wheat aneuploid and deletion stocks. Images and data for all Southern hybridizations were deposited in databases and were used by the coordinators for each of the seven homoeologous chromosome groups to validate the mapping results. Results from this project have established the foundation for future developments in wheat genomics. PMID:15514037
Dallol, Ashraf; Forgacs, Eva; Martinez, Alonso; Sekido, Yoshitaka; Walker, Rosemary; Kishida, Takeshi; Rabbitts, Pamela; Maher, Eamonn R; Minna, John D; Latif, Farida
2002-05-02
The human homologue of the Drosophila Roundabout gene DUTT1 (Deleted in U Twenty Twenty) or ROBO1 (Locus Link ID 6091), a member of the NCAM family of receptors, was recently cloned from the lung cancer tumour suppressor gene region 2 (LCTSGR2 or U2020 region) at 3p12. DUTT1 maps within a region of overlapping homozygous deletions characterized in both small cell lung cancer lines (SCLC) and in a breast cancer line. In this report we (a) defined the genomic organization of the DUTT1 gene, (b) performed mutation and expression analysis of DUTT1 in lung, breast and kidney cancers, (c) identified tumour specific promoter region methylation of DUTT1 in human cancers. The gene was found to contain 29 exons and spans at least 240 kb of genomic sequence. The 5' region contains a CpG island, and the poly(A)(+) tail has an atypical 5'-GATAAA-3' signal. We analysed DUTT1 for mutations in lung, breast and kidney cancers, no inactivating mutations were detected by PCR-SSCP. However, seven germline missense changes were found and characterized. DUTT1 expression was not detectable in one out of 18 breast tumour lines analysed by RT-PCR. Bisulfite sequencing of the promoter region of DUTT1 gene in the HTB-19 breast tumour cell line (not expressing DUTT1) showed complete hypermethylation of CpG sites within the promoter region of the DUTT1 gene (-244 to +27 relative to the translation start site). The expression of DUTT1 gene was reactivated in HTB-19 after treatment with the demethylating agent 5-aza-2'-deoxycytidine. The same region was also found to be hypermethylated in six out of 32 (19%) primary invasive breast carcinomas and eight out of 44 (18%) primary clear cell renal cell carcinomas (CC-RCC) and in one out of 26 (4%) primary NSCLC tumours. Furthermore 80% of breast and 75% of CC-RCC tumours showing DUTT1 methylation had allelic losses for 3p12 markers hence obeying Knudson's two hit hypothesis. Our findings suggest that DUTT1 warrants further analysis as a candidate for the tumour suppressor gene (TSG) at 3p12, a region defined by hemi and homozygous deletions and functional analysis.
Wright, Imogen A.; Travers, Simon A.
2014-01-01
The challenge presented by high-throughput sequencing necessitates the development of novel tools for accurate alignment of reads to reference sequences. Current approaches focus on using heuristics to map reads quickly to large genomes, rather than generating highly accurate alignments in coding regions. Such approaches are, thus, unsuited for applications such as amplicon-based analysis and the realignment phase of exome sequencing and RNA-seq, where accurate and biologically relevant alignment of coding regions is critical. To facilitate such analyses, we have developed a novel tool, RAMICS, that is tailored to mapping large numbers of sequence reads to short lengths (<10 000 bp) of coding DNA. RAMICS utilizes profile hidden Markov models to discover the open reading frame of each sequence and aligns to the reference sequence in a biologically relevant manner, distinguishing between genuine codon-sized indels and frameshift mutations. This approach facilitates the generation of highly accurate alignments, accounting for the error biases of the sequencing machine used to generate reads, particularly at homopolymer regions. Performance improvements are gained through the use of graphics processing units, which increase the speed of mapping through parallelization. RAMICS substantially outperforms all other mapping approaches tested in terms of alignment quality while maintaining highly competitive speed performance. PMID:24861618
Staton, Margaret; Zhebentyayeva, Tetyana; Olukolu, Bode; Fang, Guang Chen; Nelson, Dana; Carlson, John E; Abbott, Albert G
2015-10-05
Chinese chestnut (Castanea mollissima) has emerged as a model species for the Fagaceae family with extensive genomic resources including a physical map, a dense genetic map and quantitative trait loci (QTLs) for chestnut blight resistance. These resources enable comparative genomics analyses relative to model plants. We assessed the degree of conservation between the chestnut genome and other well annotated and assembled plant genomic sequences, focusing on the QTL regions of most interest to the chestnut breeding community. The integrated physical and genetic map of Chinese chestnut has been improved to now include 858 shared sequence-based markers. The utility of the integrated map has also been improved through the addition of 42,970 BAC (bacterial artificial chromosome) end sequences spanning over 26 million bases of the estimated 800 Mb chestnut genome. Synteny between chestnut and ten model plant species was conducted on a macro-syntenic scale using sequences from both individual probes and BAC end sequences across the chestnut physical map. Blocks of synteny with chestnut were found in all ten reference species, with the percent of the chestnut physical map that could be aligned ranging from 10 to 39 %. The integrated genetic and physical map was utilized to identify BACs that spanned the three previously identified QTL regions conferring blight resistance. The clones were pooled and sequenced, yielding 396 sequence scaffolds covering 13.9 Mbp. Comparative genomic analysis on a microsytenic scale, using the QTL-associated genomic sequence, identified synteny from chestnut to other plant genomes ranging from 5.4 to 12.9 % of the genome sequences aligning. On both the macro- and micro-synteny levels, the peach, grape and poplar genomes were found to be the most structurally conserved with chestnut. Interestingly, these results did not strictly follow the expectation that decreased phylogenetic distance would correspond to increased levels of genome preservation, but rather suggest the additional influence of life-history traits on preservation of synteny. The regions of synteny that were detected provide an important tool for defining and cataloging genes in the QTL regions for advancing chestnut blight resistance research.
G. S. Wang; X. J. Pan; Junyong Zhu; Roland Gleisner; D. Rockwood
2009-01-01
This study demonstrates sulfite pretreatment to overcome recalcitrance of lignocellulose (SPORL) for robust bioconversion of hardwoods. With only about 4% sodium bisulfite charge on aspen and 30-min pretreatment at temperature 180[...
Mapping the Space of Genomic Signatures
Kari, Lila; Hill, Kathleen A.; Sayem, Abu S.; Karamichalis, Rallis; Bryans, Nathaniel; Davis, Katelyn; Dattani, Nikesh S.
2015-01-01
We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity between the corresponding sequences. The graphical representation of DNA sequences utilized, Chaos Game Representation (CGR), is genome- and species-specific and can thus act as a genomic signature. Consequently, Molecular Distance Maps could inform species identification, taxonomic classifications and, to a certain extent, evolutionary history. The image distance employed, Structural Dissimilarity Index (DSSIM), implicitly compares the occurrences of oligomers of length up to k (herein k = 9) in DNA sequences. We computed DSSIM distances for more than 5 million pairs of complete mitochondrial genomes, and used Multi-Dimensional Scaling (MDS) to obtain Molecular Distance Maps that visually display the sequence relatedness in various subsets, at different taxonomic levels. This general-purpose method does not require DNA sequence alignment and can thus be used to compare similar or vastly different DNA sequences, genomic or computer-generated, of the same or different lengths. We illustrate potential uses of this approach by applying it to several taxonomic subsets: phylum Vertebrata, (super)kingdom Protista, classes Amphibia-Insecta-Mammalia, class Amphibia, and order Primates. This analysis of an extensive dataset confirms that the oligomer composition of full mtDNA sequences can be a source of taxonomic information. This method also correctly finds the mtDNA sequences most closely related to that of the anatomically modern human (the Neanderthal, the Denisovan, and the chimp), and that the sequence most different from it in this dataset belongs to a cucumber. PMID:26000734
Using optical mapping data for the improvement of vertebrate genome assemblies.
Howe, Kerstin; Wood, Jonathan M D
2015-01-01
Optical mapping is a technology that gathers long-range information on genome sequences similar to ordered restriction digest maps. Because it is not subject to cloning, amplification, hybridisation or sequencing bias, it is ideally suited to the improvement of fragmented genome assemblies that can no longer be improved by classical methods. In addition, its low cost and rapid turnaround make it equally useful during the scaffolding process of de novo assembly from high throughput sequencing reads. We describe how optical mapping has been used in practice to produce high quality vertebrate genome assemblies. In particular, we detail the efforts undertaken by the Genome Reference Consortium (GRC), which maintains the reference genomes for human, mouse, zebrafish and chicken, and uses different optical mapping platforms for genome curation.
Patrício, Patrícia; Ramalho-Carvalho, João; Costa-Pinheiro, Pedro; Almeida, Mafalda; Barros-Silva, João Diogo; Vieira, Joana; Dias, Paula Cristina; Lobo, Francisco; Oliveira, Jorge; Teixeira, Manuel R; Henrique, Rui; Jeronimo, Carmen
2013-08-01
Expression of PAX2 (Paired-box 2) is suppressed through promoter methylation at the later stages of embryonic development, but eventually reactivated during carcinogenesis. Pax-2 is commonly expressed in the most prevalent renal cell tumour (RCT) subtypes-clear cell RCC (ccRCC), papillary RCC (pRCC) and oncocytoma--but not in chromophobe RCC (chrRCC), which frequently displays chromosome 10 loss (to which PAX2 is mapped). Herein, we assessed the epigenetic and/or genetic alterations affecting PAX2 expression in RCTs and evaluated its potential as biomarker. We tested 120 RCTs (30 of each main subtype) and four normal kidney tissues. Pax-2 expression was assessed by immunohistochemistry and PAX2 mRNA expression levels were determined by quantitative RT-PCR. PAX2 promoter methylation status was assessed by methylation-specific PCR and bisulfite sequencing. Chromosome 10 and PAX2 copy number alterations were determined by FISH. Pax-2 immunoexpression was significantly lower in chrRCC compared to other RCT subtypes. Using a 10% immunoexpression cut-off, Pax-2 immunoreactivity discriminated chrRCC from oncocytoma with 67% sensitivity and 90% specificity. PAX2 mRNA expression was significantly lower in chrRCC, compared to ccRCC, pRCC and oncocytoma, and transcript levels correlated with immunoexpression. Whereas no promoter methylation was found in RCTs or normal kidney, 69% of chrRCC displayed chromosome 10 monosomy, correlating with Pax-2 immunoexpression. We concluded that Pax-2 expression might be used as an ancillary tool to discriminate chrRCC from oncocytomas with overlapping morphological features. The biological rationale lies on the causal relation between Pax-2 expression and chromosome 10 monosomy, but not PAX2 promoter methylation, in chrRCC. © 2013 The Authors. Journal of Cellular and Molecular Medicine Published by Foundation for Cellular and Molecular Medicine/Blackwell Publishing Ltd.
2013-01-01
Background As for other major crops, achieving a complete wheat genome sequence is essential for the application of genomics to breeding new and improved varieties. To overcome the complexities of the large, highly repetitive and hexaploid wheat genome, the International Wheat Genome Sequencing Consortium established a chromosome-based strategy that was validated by the construction of the physical map of chromosome 3B. Here, we present improved strategies for the construction of highly integrated and ordered wheat physical maps, using chromosome 1BL as a template, and illustrate their potential for evolutionary studies and map-based cloning. Results Using a combination of novel high throughput marker assays and an assembly program, we developed a high quality physical map representing 93% of wheat chromosome 1BL, anchored and ordered with 5,489 markers including 1,161 genes. Analysis of the gene space organization and evolution revealed that gene distribution and conservation along the chromosome results from the superimposition of the ancestral grass and recent wheat evolutionary patterns, leading to a peak of synteny in the central part of the chromosome arm and an increased density of non-collinear genes towards the telomere. With a density of about 11 markers per Mb, the 1BL physical map provides 916 markers, including 193 genes, for fine mapping the 40 QTLs mapped on this chromosome. Conclusions Here, we demonstrate that high marker density physical maps can be developed in complex genomes such as wheat to accelerate map-based cloning, gain new insights into genome evolution, and provide a foundation for reference sequencing. PMID:23800011
Oliveira, R R; Viana, A J C; Reátegui, A C E; Vincentz, M G A
2015-12-29
Determination of gene expression is an important tool to study biological processes and relies on the quality of the extracted RNA. Changes in gene expression profiles may be directly related to mutations in regulatory DNA sequences or alterations in DNA cytosine methylation, which is an epigenetic mark. Correlation of gene expression with DNA sequence or epigenetic mark polymorphism is often desirable; for this, a robust protocol to isolate high-quality RNA and DNA simultaneously from the same sample is required. Although commercial kits and protocols are available, they are mainly optimized for animal tissues and, in general, restricted to RNA or DNA extraction, not both. In the present study, we describe an efficient and accessible method to extract both RNA and DNA simultaneously from the same sample of various plant tissues, using small amounts of starting material. The protocol was efficient in the extraction of high-quality nucleic acids from several Arabidopsis thaliana tissues (e.g., leaf, inflorescence stem, flower, fruit, cotyledon, seedlings, root, and embryo) and from other tissues of non-model plants, such as Avicennia schaueriana (Acanthaceae), Theobroma cacao (Malvaceae), Paspalum notatum (Poaceae), and Sorghum bicolor (Poaceae). The obtained nucleic acids were used as templates for downstream analyses, such as mRNA sequencing, quantitative real time-polymerase chain reaction, bisulfite treatment, and others; the results were comparable to those obtained with commercial kits. We believe that this protocol could be applied to a broad range of plant species, help avoid technical and sampling biases, and facilitate several RNA- and DNA-dependent analyses.
Hatano, Takashi; Sano, Daisuke; Takahashi, Hideaki; Hyakusoku, Hiroshi; Isono, Yasuhiro; Shimada, Shoko; Sawakuma, Kae; Takada, Kentaro; Oikawa, Ritsuko; Watanabe, Yoshiyuki; Yamamoto, Hiroyuki; Itoh, Fumio; Myers, Jeffrey N; Oridate, Nobuhiko
2017-04-01
Recent studies showed that human papillomavirus (HPV) integration contributes to the genomic instability seen in HPV-associated head and neck squamous cell carcinoma (HPV-HNSCC). However, the epigenetic alterations induced after HPV integration remains unclear. To identify the molecular details of HPV16 DNA integration and the ensuing patterns of methylation in HNSCC, we performed next-generation sequencing using a target-enrichment method for the effective identification of HPV16 integration breakpoints as well as the characterization of genomic sequences adjacent to HPV16 integration breakpoints with three HPV16-related HNSCC cell lines. The DNA methylation levels of the integrated HPV16 genome and that of the adjacent human genome were also analyzed by bisulfite pyrosequencing. We found various integration loci, including novel integration sites. Integration loci were located predominantly in the intergenic region, with a significant enrichment of the microhomologous sequences between the human and HPV16 genomes at the integration breakpoints. Furthermore, various levels of methylation within both the human genome and the integrated HPV genome at the integration breakpoints in each integrant were observed. Allele-specific methylation analysis suggested that the HPV16 integrants remained hypomethylated when the flanking host genome was hypomethylated. After integration into highly methylated human genome regions, however, the HPV16 DNA became methylated. In conclusion, we found novel integration sites and methylation patterns in HPV-HNSCC using our unique method. These findings may provide insights into understanding of viral integration mechanism and virus-associated carcinogenesis of HPV-HNSCC. © 2016 UICC.
Enhancing genome assemblies by integrating non-sequence based data
2011-01-01
Introduction Many genome projects were underway before the advent of high-throughput sequencing and have thus been supported by a wealth of genome information from other technologies. Such information frequently takes the form of linkage and physical maps, both of which can provide a substantial amount of data useful in de novo sequencing projects. Furthermore, the recent abundance of genome resources enables the use of conserved synteny maps identified in related species to further enhance genome assemblies. Methods The tammar wallaby (Macropus eugenii) is a model marsupial mammal with a low coverage genome. However, we have access to extensive comparative maps containing over 14,000 markers constructed through the physical mapping of conserved loci, chromosome painting and comprehensive linkage maps. Using a custom Bioperl pipeline, information from the maps was aligned to assembled tammar wallaby contigs using BLAT. This data was used to construct pseudo paired-end libraries with intervals ranging from 5-10 MB. We then used Bambus (a program designed to scaffold eukaryotic genomes by ordering and orienting contigs through the use of paired-end data) to scaffold our libraries. To determine how map data compares to sequence based approaches to enhance assemblies, we repeated the experiment using a 0.5× coverage of unique reads from 4 KB and 8 KB Illumina paired-end libraries. Finally, we combined both the sequence and non-sequence-based data to determine how a combined approach could further enhance the quality of the low coverage de novo reconstruction of the tammar wallaby genome. Results Using the map data alone, we were able order 2.2% of the initial contigs into scaffolds, and increase the N50 scaffold size to 39 KB (36 KB in the original assembly). Using only the 0.5× paired-end sequence based data, 53% of the initial contigs were assigned to scaffolds. Combining both data sets resulted in a further 2% increase in the number of initial contigs integrated into a scaffold (55% total) but a 35% increase in N50 scaffold size over the use of sequence-based data alone. Conclusions We provide a relatively simple pipeline utilizing existing bioinformatics tools to integrate map data into a genome assembly which is available at http://www.mcb.uconn.edu/fac.php?name=paska. While the map data only contributed minimally to assigning the initial contigs to scaffolds in the new assembly, it greatly increased the N50 size. This process added structure to our low coverage assembly, greatly increasing its utility in further analyses. PMID:21554765
Enhancing genome assemblies by integrating non-sequence based data.
Heider, Thomas N; Lindsay, James; Wang, Chenwei; O'Neill, Rachel J; Pask, Andrew J
2011-05-28
Many genome projects were underway before the advent of high-throughput sequencing and have thus been supported by a wealth of genome information from other technologies. Such information frequently takes the form of linkage and physical maps, both of which can provide a substantial amount of data useful in de novo sequencing projects. Furthermore, the recent abundance of genome resources enables the use of conserved synteny maps identified in related species to further enhance genome assemblies. The tammar wallaby (Macropus eugenii) is a model marsupial mammal with a low coverage genome. However, we have access to extensive comparative maps containing over 14,000 markers constructed through the physical mapping of conserved loci, chromosome painting and comprehensive linkage maps. Using a custom Bioperl pipeline, information from the maps was aligned to assembled tammar wallaby contigs using BLAT. This data was used to construct pseudo paired-end libraries with intervals ranging from 5-10 MB. We then used Bambus (a program designed to scaffold eukaryotic genomes by ordering and orienting contigs through the use of paired-end data) to scaffold our libraries. To determine how map data compares to sequence based approaches to enhance assemblies, we repeated the experiment using a 0.5× coverage of unique reads from 4 KB and 8 KB Illumina paired-end libraries. Finally, we combined both the sequence and non-sequence-based data to determine how a combined approach could further enhance the quality of the low coverage de novo reconstruction of the tammar wallaby genome. Using the map data alone, we were able order 2.2% of the initial contigs into scaffolds, and increase the N50 scaffold size to 39 KB (36 KB in the original assembly). Using only the 0.5× paired-end sequence based data, 53% of the initial contigs were assigned to scaffolds. Combining both data sets resulted in a further 2% increase in the number of initial contigs integrated into a scaffold (55% total) but a 35% increase in N50 scaffold size over the use of sequence-based data alone. We provide a relatively simple pipeline utilizing existing bioinformatics tools to integrate map data into a genome assembly which is available at http://www.mcb.uconn.edu/fac.php?name=paska. While the map data only contributed minimally to assigning the initial contigs to scaffolds in the new assembly, it greatly increased the N50 size. This process added structure to our low coverage assembly, greatly increasing its utility in further analyses.
MOSAIK: a hash-based algorithm for accurate next-generation sequencing short-read mapping.
Lee, Wan-Ping; Stromberg, Michael P; Ward, Alistair; Stewart, Chip; Garrison, Erik P; Marth, Gabor T
2014-01-01
MOSAIK is a stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome. Uniquely among current mapping tools, MOSAIK can align reads generated by all the major sequencing technologies, including Illumina, Applied Biosystems SOLiD, Roche 454, Ion Torrent and Pacific BioSciences SMRT. Indeed, MOSAIK was the only aligner to provide consistent mappings for all the generated data (sequencing technologies, low-coverage and exome) in the 1000 Genomes Project. To provide highly accurate alignments, MOSAIK employs a hash clustering strategy coupled with the Smith-Waterman algorithm. This method is well-suited to capture mismatches as well as short insertions and deletions. To support the growing interest in larger structural variant (SV) discovery, MOSAIK provides explicit support for handling known-sequence SVs, e.g. mobile element insertions (MEIs) as well as generating outputs tailored to aid in SV discovery. All variant discovery benefits from an accurate description of the read placement confidence. To this end, MOSAIK uses a neural-network based training scheme to provide well-calibrated mapping quality scores, demonstrated by a correlation coefficient between MOSAIK assigned and actual mapping qualities greater than 0.98. In order to ensure that studies of any genome are supported, a training pipeline is provided to ensure optimal mapping quality scores for the genome under investigation. MOSAIK is multi-threaded, open source, and incorporated into our command and pipeline launcher system GKNO (http://gkno.me).
MOSAIK: A Hash-Based Algorithm for Accurate Next-Generation Sequencing Short-Read Mapping
Lee, Wan-Ping; Stromberg, Michael P.; Ward, Alistair; Stewart, Chip; Garrison, Erik P.; Marth, Gabor T.
2014-01-01
MOSAIK is a stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome. Uniquely among current mapping tools, MOSAIK can align reads generated by all the major sequencing technologies, including Illumina, Applied Biosystems SOLiD, Roche 454, Ion Torrent and Pacific BioSciences SMRT. Indeed, MOSAIK was the only aligner to provide consistent mappings for all the generated data (sequencing technologies, low-coverage and exome) in the 1000 Genomes Project. To provide highly accurate alignments, MOSAIK employs a hash clustering strategy coupled with the Smith-Waterman algorithm. This method is well-suited to capture mismatches as well as short insertions and deletions. To support the growing interest in larger structural variant (SV) discovery, MOSAIK provides explicit support for handling known-sequence SVs, e.g. mobile element insertions (MEIs) as well as generating outputs tailored to aid in SV discovery. All variant discovery benefits from an accurate description of the read placement confidence. To this end, MOSAIK uses a neural-network based training scheme to provide well-calibrated mapping quality scores, demonstrated by a correlation coefficient between MOSAIK assigned and actual mapping qualities greater than 0.98. In order to ensure that studies of any genome are supported, a training pipeline is provided to ensure optimal mapping quality scores for the genome under investigation. MOSAIK is multi-threaded, open source, and incorporated into our command and pipeline launcher system GKNO (http://gkno.me). PMID:24599324
Wang, Yan; Sun, ZhongSheng; Szyf, Moshe
2017-01-01
S-adenosyl methionine (SAM) is a ubiquitous methyl donor that was reported to have chemo- protective activity against liver cancer, however the molecular footprint of SAM is unknown. We show here that SAM selectively inhibits growth, transformation and invasiveness of hepatocellular carcinoma cell lines but not normal primary liver cells. Analysis of the transcriptome of SAM treated and untreated liver cancer cell lines HepG2 and SKhep1 and primary liver cells reveals pathways involved in cancer and metastasis that are upregulated in cancer cells and are downregulated by SAM. Analysis of the methylome using bisulfite mapping of captured promoters and enhancers reveals that SAM hyper-methylates and downregulates genes in pathways of growth and metastasis that are upregulated in liver cancer cells. Depletion of two SAM downregulated genes STMN1 and TAF15 reduces cellular transformation and invasiveness, providing evidence that SAM targets are genes important for cancer growth and invasiveness. Taken together these data provide a molecular rationale for SAM as an anticancer agent. PMID:29340097
Wang, Yan; Sun, ZhongSheng; Szyf, Moshe
2017-12-19
S-adenosyl methionine (SAM) is a ubiquitous methyl donor that was reported to have chemo- protective activity against liver cancer, however the molecular footprint of SAM is unknown. We show here that SAM selectively inhibits growth, transformation and invasiveness of hepatocellular carcinoma cell lines but not normal primary liver cells. Analysis of the transcriptome of SAM treated and untreated liver cancer cell lines HepG2 and SKhep1 and primary liver cells reveals pathways involved in cancer and metastasis that are upregulated in cancer cells and are downregulated by SAM. Analysis of the methylome using bisulfite mapping of captured promoters and enhancers reveals that SAM hyper-methylates and downregulates genes in pathways of growth and metastasis that are upregulated in liver cancer cells. Depletion of two SAM downregulated genes STMN1 and TAF15 reduces cellular transformation and invasiveness, providing evidence that SAM targets are genes important for cancer growth and invasiveness. Taken together these data provide a molecular rationale for SAM as an anticancer agent.
Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data
Degner, Jacob F.; Marioni, John C.; Pai, Athma A.; Pickrell, Joseph K.; Nkadori, Everlyne; Gilad, Yoav; Pritchard, Jonathan K.
2009-01-01
Motivation: Next-generation sequencing has become an important tool for genome-wide quantification of DNA and RNA. However, a major technical hurdle lies in the need to map short sequence reads back to their correct locations in a reference genome. Here, we investigate the impact of SNP variation on the reliability of read-mapping in the context of detecting allele-specific expression (ASE). Results: We generated 16 million 35 bp reads from mRNA of each of two HapMap Yoruba individuals. When we mapped these reads to the human genome we found that, at heterozygous SNPs, there was a significant bias toward higher mapping rates of the allele in the reference sequence, compared with the alternative allele. Masking known SNP positions in the genome sequence eliminated the reference bias but, surprisingly, did not lead to more reliable results overall. We find that even after masking, ∼5–10% of SNPs still have an inherent bias toward more effective mapping of one allele. Filtering out inherently biased SNPs removes 40% of the top signals of ASE. The remaining SNPs showing ASE are enriched in genes previously known to harbor cis-regulatory variation or known to show uniparental imprinting. Our results have implications for a variety of applications involving detection of alternate alleles from short-read sequence data. Availability: Scripts, written in Perl and R, for simulating short reads, masking SNP variation in a reference genome and analyzing the simulation output are available upon request from JFD. Raw short read data were deposited in GEO (http://www.ncbi.nlm.nih.gov/geo/) under accession number GSE18156. Contact: jdegner@uchicago.edu; marioni@uchicago.edu; gilad@uchicago.edu; pritch@uchicago.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19808877
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
Harraghy, Niamh; Homerova, Dagmar; Herrmann, Mathias; Kormanec, Jan
2008-01-01
Mapping the transcription start points of the eap, emp, and vwb promoters revealed a conserved octanucleotide sequence (COS). Deleting this sequence abolished the expression of eap, emp, and vwb. However, electrophoretic mobility shift assays gave no evidence that this sequence was a binding site for SarA or SaeR, known regulators of eap and emp.
High-Throughput Mapping of Single-Neuron Projections by Sequencing of Barcoded RNA.
Kebschull, Justus M; Garcia da Silva, Pedro; Reid, Ashlan P; Peikon, Ian D; Albeanu, Dinu F; Zador, Anthony M
2016-09-07
Neurons transmit information to distant brain regions via long-range axonal projections. In the mouse, area-to-area connections have only been systematically mapped using bulk labeling techniques, which obscure the diverse projections of intermingled single neurons. Here we describe MAPseq (Multiplexed Analysis of Projections by Sequencing), a technique that can map the projections of thousands or even millions of single neurons by labeling large sets of neurons with random RNA sequences ("barcodes"). Axons are filled with barcode mRNA, each putative projection area is dissected, and the barcode mRNA is extracted and sequenced. Applying MAPseq to the locus coeruleus (LC), we find that individual LC neurons have preferred cortical targets. By recasting neuroanatomy, which is traditionally viewed as a problem of microscopy, as a problem of sequencing, MAPseq harnesses advances in sequencing technology to permit high-throughput interrogation of brain circuits. Copyright © 2016 Elsevier Inc. All rights reserved.
Yuan, Shuai; Johnston, H. Richard; Zhang, Guosheng; Li, Yun; Hu, Yi-Juan; Qin, Zhaohui S.
2015-01-01
With rapid decline of the sequencing cost, researchers today rush to embrace whole genome sequencing (WGS), or whole exome sequencing (WES) approach as the next powerful tool for relating genetic variants to human diseases and phenotypes. A fundamental step in analyzing WGS and WES data is mapping short sequencing reads back to the reference genome. This is an important issue because incorrectly mapped reads affect the downstream variant discovery, genotype calling and association analysis. Although many read mapping algorithms have been developed, the majority of them uses the universal reference genome and do not take sequence variants into consideration. Given that genetic variants are ubiquitous, it is highly desirable if they can be factored into the read mapping procedure. In this work, we developed a novel strategy that utilizes genotypes obtained a priori to customize the universal haploid reference genome into a personalized diploid reference genome. The new strategy is implemented in a program named RefEditor. When applying RefEditor to real data, we achieved encouraging improvements in read mapping, variant discovery and genotype calling. Compared to standard approaches, RefEditor can significantly increase genotype calling consistency (from 43% to 61% at 4X coverage; from 82% to 92% at 20X coverage) and reduce Mendelian inconsistency across various sequencing depths. Because many WGS and WES studies are conducted on cohorts that have been genotyped using array-based genotyping platforms previously or concurrently, we believe the proposed strategy will be of high value in practice, which can also be applied to the scenario where multiple NGS experiments are conducted on the same cohort. The RefEditor sources are available at https://github.com/superyuan/refeditor. PMID:26267278
Generation of a Maize B Centromere Minimal Map Containing the Central Core Domain.
Ellis, Nathanael A; Douglas, Ryan N; Jackson, Caroline E; Birchler, James A; Dawe, R Kelly
2015-10-28
The maize B centromere has been used as a model for centromere epigenetics and as the basis for building artificial chromosomes. However, there are no sequence resources for this important centromere. Here we used transposon display for the centromere-specific retroelement CRM2 to identify a collection of 40 sequence tags that flank CRM2 insertion points on the B chromosome. These were confirmed to lie within the centromere by assaying deletion breakpoints from centromere misdivision derivatives (intracentromere breakages caused by centromere fission). Markers were grouped together on the basis of their association with other markers in the misdivision series and assembled into a pseudocontig containing 10.1 kb of sequence. To identify sequences that interact directly with centromere proteins, we carried out chromatin immunoprecipitation using antibodies to centromeric histone H3 (CENH3), a defining feature of functional centromeric sequences. The CENH3 chromatin immunoprecipitation map was interpreted relative to the known transmission rates of centromere misdivision derivatives to identify a centromere core domain spanning 33 markers. A subset of seven markers was mapped in additional B centromere misdivision derivatives with the use of unique primer pairs. A derivative previously shown to have no canonical centromere sequences (Telo3-3) lacks these core markers. Our results provide a molecular map of the B chromosome centromere and identify key sequences within the map that interact directly with centromeric histone H3. Copyright © 2015 Ellis et al.
Whole-genome random sequencing and assembly of Haemophilus influenzae Rd
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fleischmann, R.D.; Adams, M.D.; White, O.
1995-07-28
An approach for genome analysis based on sequencing and assembly of unselected pieces of DNA from the whole chromosome has been applied to obtain the complete nucleotide sequence (1,830,137 base pairs) of the genome from the bacterium Haemophilus influenzae Rd. This approach eliminates the need for initial mapping efforts and is therefore applicable to the vast array of microbial species for which genome maps are unavailable. The H. influenzae Rd genome sequence (Genome Sequence DataBase accession number L42023) represents the only complete genome sequence from a free-living organism. 46 refs., 4 figs., 4 tabs.
Bartels, Daniela; Kespohl, Sebastian; Albaum, Stefan; Drüke, Tanja; Goesmann, Alexander; Herold, Julia; Kaiser, Olaf; Pühler, Alfred; Pfeiffer, Friedhelm; Raddatz, Günter; Stoye, Jens; Meyer, Folker; Schuster, Stephan C
2005-04-01
We provide the graphical tool BACCardI for the construction of virtual clone maps from standard assembler output files or BLAST based sequence comparisons. This new tool has been applied to numerous genome projects to solve various problems including (a) validation of whole genome shotgun assemblies, (b) support for contig ordering in the finishing phase of a genome project, and (c) intergenome comparison between related strains when only one of the strains has been sequenced and a large insert library is available for the other. The BACCardI software can seamlessly interact with various sequence assembly packages. Genomic assemblies generated from sequence information need to be validated by independent methods such as physical maps. The time-consuming task of building physical maps can be circumvented by virtual clone maps derived from read pair information of large insert libraries.
Kenyon, Jonathan; Nickel-Meester, Gabrielle; Qing, Yulan; Santos-Guasch, Gabriela; Drake, Ellen; PingfuFu; Sun, Shuying; Bai, Xiaodong; Wald, David; Arts, Eric; Gerson, Stanton L.
2016-01-01
Normal human hematopoietic stem and progenitor cells (HPC) lose expression of MLH1, an important mismatch repair (MMR) pathway gene, with age. Loss of MMR leads to replication dependent mutational events and microsatellite instability observed in secondary acute myelogenous leukemia and other hematologic malignancies. Epigenetic CpG methylation upstream of the MLH1 promoter is a contributing factor to acquired loss of MLH1 expression in tumors of the epithelia and proximal mucosa. Using single molecule high-throughput bisulfite sequencing we have characterized the CpG methylation landscape from −938 to −337 bp upstream of the MLH1 transcriptional start site (position +0), from 30 hematopoietic colony forming cell clones (CFC) either expressing or not expressing MLH1. We identify a correlation between MLH1 promoter methylation and loss of MLH1 expression. Additionally, using the CpG site methylation frequencies obtained in this study we were able to generate a classification algorithm capable of sorting the expressing and non-expressing CFC. Thus, as has been previously described for many tumor cell types, we report for the first time a correlation between the loss of MLH1 expression and increased MLH1 promoter methylation in CFC derived from CD34+ selected hematopoietic stem and progenitor cells. PMID:27570841
Kenyon, Jonathan; Nickel-Meester, Gabrielle; Qing, Yulan; Santos-Guasch, Gabriela; Drake, Ellen; PingfuFu; Sun, Shuying; Bai, Xiaodong; Wald, David; Arts, Eric; Gerson, Stanton L
Normal human hematopoietic stem and progenitor cells (HPC) lose expression of MLH1 , an important mismatch repair (MMR) pathway gene, with age. Loss of MMR leads to replication dependent mutational events and microsatellite instability observed in secondary acute myelogenous leukemia and other hematologic malignancies. Epigenetic CpG methylation upstream of the MLH1 promoter is a contributing factor to acquired loss of MLH1 expression in tumors of the epithelia and proximal mucosa. Using single molecule high-throughput bisulfite sequencing we have characterized the CpG methylation landscape from -938 to -337 bp upstream of the MLH1 transcriptional start site (position +0), from 30 hematopoietic colony forming cell clones (CFC) either expressing or not expressing MLH1 . We identify a correlation between MLH1 promoter methylation and loss of MLH1 expression. Additionally, using the CpG site methylation frequencies obtained in this study we were able to generate a classification algorithm capable of sorting the expressing and non-expressing CFC. Thus, as has been previously described for many tumor cell types, we report for the first time a correlation between the loss of MLH1 expression and increased MLH1 promoter methylation in CFC derived from CD34 + selected hematopoietic stem and progenitor cells.
Zhang, Xiaoyang; Wang, Dongxu; Han, Yang; Duan, Feifei; Lv, Qinyan; Li, Zhanjun
2014-11-01
To determine the expression patterns of imprinted genes and their methylation status in aborted cloned porcine fetuses and placentas. RNA and DNA were prepared from fetuses and placentas that were produced by SCNT and controls from artificial insemination. The expression of 18 imprinted genes was determined by quantitative real-time PCR (q-PCR). Bisulfite sequencing PCR (BSP) was conducted to determine the methylation status of PRE-1 short interspersed repetitive element (SINE), satellite DNA and H19 differentially methylated region 3 (DMR3). The weight, imprinted gene expression and genome-wide DNA methylation patterns were compared between the mid-gestation aborted and normal control samples. The results showed hypermethylation of PRE-1 and satellite sequences, the aberrant expression of imprinted genes, and the hypomethylation of H19 DMR3 occurred in mid-gestation aborted fetuses and placentas. Cloned pigs generated by somatic cell nuclear transfer (SCNT) showed a greater ratio of early abortion during mid-gestation than did normal controls because of the incomplete epigenetic reprogramming of the donor cells. Altered expression of imprinted genes and the hypermethylation profile of the repetitive regions (PRE-1 and satellite DNA) may be associated with defective development and early abortion of cloned pigs, emphasizing the importance of epigenetics during pregnancy and implications thereof for patient-specific embryonic stem cells for human therapeutic cloning and improvement of human assisted reproduction.
MeDReaders: a database for transcription factors that bind to methylated DNA.
Wang, Guohua; Luo, Ximei; Wang, Jianan; Wan, Jun; Xia, Shuli; Zhu, Heng; Qian, Jiang; Wang, Yadong
2018-01-04
Understanding the molecular principles governing interactions between transcription factors (TFs) and DNA targets is one of the main subjects for transcriptional regulation. Recently, emerging evidence demonstrated that some TFs could bind to DNA motifs containing highly methylated CpGs both in vitro and in vivo. Identification of such TFs and elucidation of their physiological roles now become an important stepping-stone toward understanding the mechanisms underlying the methylation-mediated biological processes, which have crucial implications for human disease and disease development. Hence, we constructed a database, named as MeDReaders, to collect information about methylated DNA binding activities. A total of 731 TFs, which could bind to methylated DNA sequences, were manually curated in human and mouse studies reported in the literature. In silico approaches were applied to predict methylated and unmethylated motifs of 292 TFs by integrating whole genome bisulfite sequencing (WGBS) and ChIP-Seq datasets in six human cell lines and one mouse cell line extracted from ENCODE and GEO database. MeDReaders database will provide a comprehensive resource for further studies and aid related experiment designs. The database implemented unified access for users to most TFs involved in such methylation-associated binding actives. The website is available at http://medreader.org/. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Single molecule and single cell epigenomics.
Hyun, Byung-Ryool; McElwee, John L; Soloway, Paul D
2015-01-15
Dynamically regulated changes in chromatin states are vital for normal development and can produce disease when they go awry. Accordingly, much effort has been devoted to characterizing these states under normal and pathological conditions. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most widely used method to characterize where in the genome transcription factors, modified histones, modified nucleotides and chromatin binding proteins are found; bisulfite sequencing (BS-seq) and its variants are commonly used to characterize the locations of DNA modifications. Though very powerful, these methods are not without limitations. Notably, they are best at characterizing one chromatin feature at a time, yet chromatin features arise and function in combination. Investigators commonly superimpose separate ChIP-seq or BS-seq datasets, and then infer where chromatin features are found together. While these inferences might be correct, they can be misleading when the chromatin source has distinct cell types, or when a given cell type exhibits any cell to cell variation in chromatin state. These ambiguities can be eliminated by robust methods that directly characterize the existence and genomic locations of combinations of chromatin features in very small inputs of cells or ideally, single cells. Here we review single molecule epigenomic methods under development to overcome these limitations, the technical challenges associated with single molecule methods and their potential application to single cells. Copyright © 2014 Elsevier Inc. All rights reserved.
Single Molecule and Single Cell Epigenomics
Hyun, Byung-Ryool; McElwee, John L.; Soloway, Paul D.
2014-01-01
Dynamically regulated changes in chromatin states are vital for normal development and can produce disease when they go awry. Accordingly, much effort has been devoted to characterizing these states under normal and pathological conditions. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most widely used method to characterize where in the genome transcription factors, modified histones, modified nucleotides and chromatin binding proteins are found; bisulfite sequencing (BS-seq) and its variants are commonly used to characterize the locations of DNA modifications. Though very powerful, these methods are not without limitations. Notably, they are best at characterizing one chromatin feature at a time, yet chromatin features arise and function in combination. Investigators commonly superimpose separate ChIP-seq or BS-seq datasets, and then infer where chromatin features are found together. While these inferences might be correct, they can be misleading when the chromatin source has distinct cell types, or when a given cell type exhibits any cell to cell variation in chromatin state. These ambiguities can be eliminated by robust methods that directly characterize the existence and genomic locations of combinations of chromatin features in very small inputs of cells or ideally, single cells. Here we review single molecule epigenomic methods under development to overcome these limitations, the technical challenges associated with single molecule methods and their potential application to single cells. PMID:25204781
ERIC Educational Resources Information Center
Chu, Hui-Chun; Yang, Kai-Hsiang; Chen, Jing-Hong
2015-01-01
Concept maps have been recognized as an effective tool for students to organize their knowledge; however, in history courses, it is important for students to learn and organize historical events according to the time of their occurrence. Therefore, in this study, a time sequence-oriented concept map approach is proposed for developing a game-based…
SfiI genomic cleavage map of Escherichia coli K-12 strain MG1655.
Perkins, J D; Heath, J D; Sharma, B R; Weinstock, G M
1992-01-01
An SfiI restriction map of Escherichia coli K-12 strain MG1655 is presented. The map contains thirty-one cleavage sites separating fragments ranging in size from 407 kb to 3.7 kb. Several techniques were used in the construction of this map, including CHEF pulsed field gel electrophoresis; physical analysis of a set of twenty-six auxotrophic transposon insertions; correlation with the restriction map of Kohara and coworkers using the commercially available E. coli Gene Mapping Membranes; analysis of publicly available sequence information; and correlation of the above data with the combined genetic and physical map developed by Rudd, et al. The combination of these techniques has yielded a map in which all but one site can be localized within a range of +/- 2 kb, and over half the sites can be localized precisely by sequence data. Two sites present in the EcoSeq5 sequence database are not cleaved in MG1655 and four sites are noted to be sensitive to methylation by the dcm methylase. This map, combined with the NotI physical map of MG1655, can aid in the rapid, precise mapping of several different types of genetic alterations, including transposon mediated mutations and other insertions, inversions, deletions and duplications. Images PMID:1312707
Validation of a standardized mapping system of the hip joint for radial MRA sequencing.
Klenke, Frank M; Hoffmann, Daniel B; Cross, Brian J; Siebenrock, Klaus A
2015-03-01
Intraarticular gadolinium-enhanced magnetic resonance arthrography (MRA) is commonly applied to characterize morphological disorders of the hip. However, the reproducibility of retrieving anatomic landmarks on MRA scans and their correlation with intraarticular pathologies is unknown. A precise mapping system for the exact localization of hip pathomorphologies with radial MRA sequences is lacking. Therefore, the purpose of the study was the establishment and validation of a reproducible mapping system for radial sequences of hip MRA. Sixty-nine consecutive intraarticular gadolinium-enhanced hip MRAs were evaluated. Radial sequencing consisted of 14 cuts orientated along the axis of the femoral neck. Three orthopedic surgeons read the radial sequences independently. Each MRI was read twice with a minimum interval of 7 days from the first reading. The intra- and inter-observer reliability of the mapping procedure was determined. A clockwise system for hip MRA was established. The teardrop figure served to determine the 6 o'clock position of the acetabulum; the center of the greater trochanter served to determine the 12 o'clock position of the femoral head-neck junction. The intra- and inter-observer ICCs to retrieve the correct 6/12 o'clock positions were 0.906-0.996 and 0.978-0.988, respectively. The established mapping system for radial sequences of hip joint MRA is reproducible and easy to perform.
ACTG: novel peptide mapping onto gene models.
Choi, Seunghyuk; Kim, Hyunwoo; Paek, Eunok
2017-04-15
In many proteogenomic applications, mapping peptide sequences onto genome sequences can be very useful, because it allows us to understand origins of the gene products. Existing software tools either take the genomic position of a peptide start site as an input or assume that the peptide sequence exactly matches the coding sequence of a given gene model. In case of novel peptides resulting from genomic variations, especially structural variations such as alternative splicing, these existing tools cannot be directly applied unless users supply information about the variant, either its genomic position or its transcription model. Mapping potentially novel peptides to genome sequences, while allowing certain genomic variations, requires introducing novel gene models when aligning peptide sequences to gene structures. We have developed a new tool called ACTG (Amino aCids To Genome), which maps peptides to genome, assuming all possible single exon skipping, junction variation allowing three edit distances from the original splice sites, exon extension and frame shift. In addition, it can also consider SNVs (single nucleotide variations) during mapping phase if a user provides the VCF (variant call format) file as an input. Available at http://prix.hanyang.ac.kr/ACTG/search.jsp . eunokpaek@hanyang.ac.kr. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
NASA Astrophysics Data System (ADS)
Wang, Guanxi; Tie, Yun; Qi, Lin
2017-07-01
In this paper, we propose a novel approach based on Depth Maps and compute Multi-Scale Histograms of Oriented Gradient (MSHOG) from sequences of depth maps to recognize actions. Each depth frame in a depth video sequence is projected onto three orthogonal Cartesian planes. Under each projection view, the absolute difference between two consecutive projected maps is accumulated through a depth video sequence to form a Depth Map, which is called Depth Motion Trail Images (DMTI). The MSHOG is then computed from the Depth Maps for the representation of an action. In addition, we apply L2-Regularized Collaborative Representation (L2-CRC) to classify actions. We evaluate the proposed approach on MSR Action3D dataset and MSRGesture3D dataset. Promising experimental result demonstrates the effectiveness of our proposed method.
Petroli, César D.; Sansaloni, Carolina P.; Carling, Jason; Steane, Dorothy A.; Vaillancourt, René E.; Myburg, Alexander A.; da Silva, Orzenil Bonfim; Pappas, Georgios Joannis; Kilian, Andrzej; Grattapaglia, Dario
2012-01-01
Diversity Arrays Technology (DArT) provides a robust, high throughput, cost-effective method to query thousands of sequence polymorphisms in a single assay. Despite the extensive use of this genotyping platform for numerous plant species, little is known regarding the sequence attributes and genome-wide distribution of DArT markers. We investigated the genomic properties of the 7,680 DArT marker probes of a Eucalyptus array, by sequencing them, constructing a high density linkage map and carrying out detailed physical mapping analyses to the Eucalyptus grandis reference genome. A consensus linkage map with 2,274 DArT markers anchored to 210 microsatellites and a framework map, with improved support for ordering, displayed extensive collinearity with the genome sequence. Only 1.4 Mbp of the 75 Mbp of still unplaced scaffold sequence was captured by 45 linkage mapped but physically unaligned markers to the 11 main Eucalyptus pseudochromosomes, providing compelling evidence for the quality and completeness of the current Eucalyptus genome assembly. A highly significant correspondence was found between the locations of DArT markers and predicted gene models, while most of the 89 DArT probes unaligned to the genome correspond to sequences likely absent in E. grandis, consistent with the pan-genomic feature of this multi-Eucalyptus species DArT array. These comprehensive linkage-to-physical mapping analyses provide novel data regarding the genomic attributes of DArT markers in plant genomes in general and for Eucalyptus in particular. DArT markers preferentially target the gene space and display a largely homogeneous distribution across the genome, thereby providing superb coverage for mapping and genome-wide applications in breeding and diversity studies. Data reported on these ubiquitous properties of DArT markers will be particularly valuable to researchers working on less-studied crop species who already count on DArT genotyping arrays but for which no reference genome is yet available to allow such detailed characterization. PMID:22984541
DistMap: a toolkit for distributed short read mapping on a Hadoop cluster.
Pandey, Ram Vinay; Schlötterer, Christian
2013-01-01
With the rapid and steady increase of next generation sequencing data output, the mapping of short reads has become a major data analysis bottleneck. On a single computer, it can take several days to map the vast quantity of reads produced from a single Illumina HiSeq lane. In an attempt to ameliorate this bottleneck we present a new tool, DistMap - a modular, scalable and integrated workflow to map reads in the Hadoop distributed computing framework. DistMap is easy to use, currently supports nine different short read mapping tools and can be run on all Unix-based operating systems. It accepts reads in FASTQ format as input and provides mapped reads in a SAM/BAM format. DistMap supports both paired-end and single-end reads thereby allowing the mapping of read data produced by different sequencing platforms. DistMap is available from http://code.google.com/p/distmap/
DistMap: A Toolkit for Distributed Short Read Mapping on a Hadoop Cluster
Pandey, Ram Vinay; Schlötterer, Christian
2013-01-01
With the rapid and steady increase of next generation sequencing data output, the mapping of short reads has become a major data analysis bottleneck. On a single computer, it can take several days to map the vast quantity of reads produced from a single Illumina HiSeq lane. In an attempt to ameliorate this bottleneck we present a new tool, DistMap - a modular, scalable and integrated workflow to map reads in the Hadoop distributed computing framework. DistMap is easy to use, currently supports nine different short read mapping tools and can be run on all Unix-based operating systems. It accepts reads in FASTQ format as input and provides mapped reads in a SAM/BAM format. DistMap supports both paired-end and single-end reads thereby allowing the mapping of read data produced by different sequencing platforms. DistMap is available from http://code.google.com/p/distmap/ PMID:24009693
A high-density intraspecific SNP linkage map of pigeonpea (Cajanas cajan L. Millsp.)
Mandal, Paritra; Bhutani, Shefali; Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram Pratap; Chaudhary, A. K.; Yadav, Rekha; Gaikwad, K.; Sevanthi, Amitha Mithra; Datta, Subhojit; Raje, Ranjeet S.; Sharma, Tilak R.; Singh, Nagendra Kumar
2017-01-01
Pigeonpea (Cajanus cajan (L.) Millsp.) is a major food legume cultivated in semi-arid tropical regions including the Indian subcontinent, Africa, and Southeast Asia. It is an important source of protein, minerals, and vitamins for nearly 20% of the world population. Due to high carbon sequestration and drought tolerance, pigeonpea is an important crop for the development of climate resilient agriculture and nutritional security. However, pigeonpea productivity has remained low for decades because of limited genetic and genomic resources, and sparse utilization of landraces and wild pigeonpea germplasm. Here, we present a dense intraspecific linkage map of pigeonpea comprising 932 markers that span a total adjusted map length of 1,411.83 cM. The consensus map is based on three different linkage maps that incorporate a large number of single nucleotide polymorphism (SNP) markers derived from next generation sequencing data, using Illumina GoldenGate bead arrays, and genotyping with restriction site associated DNA (RAD) sequencing. The genotyping-by-sequencing enhanced the marker density but was met with limited success due to lack of common markers across the genotypes of mapping population. The integrated map has 547 bead-array SNP, 319 RAD-SNP, and 65 simple sequence repeat (SSR) marker loci. We also show here correspondence between our linkage map and published genome pseudomolecules of pigeonpea. The availability of a high-density linkage map will help improve the anchoring of the pigeonpea genome to its chromosomes and the mapping of genes and quantitative trait loci associated with useful agronomic traits. PMID:28654689
A novel tandem repeat sequence located on human chromosome 4p: isolation and characterization.
Kogi, M; Fukushige, S; Lefevre, C; Hadano, S; Ikeda, J E
1997-06-01
In an effort to analyze the genomic region of the distal half of human chromosome 4p, to where Huntington disease and other diseases have been mapped, we have isolated the cosmid clone (CRS447) that was likely to contain a region with specific repeat sequences. Clone CRS447 was subjected to detailed analysis, including chromosome mapping, restriction mapping, and DNA sequencing. Chromosome mapping by both a human-CHO hybrid cell panel and FISH revealed that CRS447 was predominantly located in the 4p15.1-15.3 region. CRS447 was shown to consist of tandem repeats of 4.7-kb units present on chromosome 4p. A single EcoRI unit was subcloned (pRS447), and the complete sequence was determined as 4752 nucleotides. When pRS447 was used as a probe, the number of copies of this repeat per haploid genome was estimated to be 50-70. Sequence analysis revealed that it contained two internal CA repeats and one putative ORF. Database search established that this sequence was unreported. However, two homologous STS markers were found in the database. We concluded that CRS447/pRS447 is a novel tandem repeat sequence that is mainly specific to human chromosome 4p.
2012-01-01
Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. Conclusion This study will serve as a valuable genomic resource for tetraploid cotton genome assembly, for cloning genes related to superior agronomic traits, and for further comparative genomic analyses in Gossypium. PMID:23046547
An integrated molecular cytogenetic map of Cucumis sativus L. chromosome 2.
Han, Yonghua; Zhang, Zhonghua; Huang, Sanwen; Jin, Weiwei
2011-01-27
Integration of molecular, genetic and cytological maps is still a challenge for most plant species. Recent progress in molecular and cytogenetic studies created a basis for developing integrated maps in cucumber (Cucumis sativus L.). In this study, eleven fosmid clones and three plasmids containing 45S rDNA, the centromeric satellite repeat Type III and the pericentriomeric repeat CsRP1 sequences respectively were hybridized to cucumber metaphase chromosomes to assign their cytological location on chromosome 2. Moreover, an integrated molecular cytogenetic map of cucumber chromosomes 2 was constructed by fluorescence in situ hybridization (FISH) mapping of 11 fosmid clones together with the cucumber centromere-specific Type III sequence on meiotic pachytene chromosomes. The cytogenetic map was fully integrated with genetic linkage map since each fosmid clone was anchored by a genetically mapped simple sequence repeat marker (SSR). The relationship between the genetic and physical distances along chromosome was analyzed. Recombination was not evenly distributed along the physical length of chromosome 2. Suppression of recombination was found in centromeric and pericentromeric regions. Our results also indicated that the molecular markers composing the linkage map for chromosome 2 provided excellent coverage of the chromosome.
Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J
2012-05-25
A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the 'Golden Delicious' reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
Wright, Imogen A; Travers, Simon A
2014-07-01
The challenge presented by high-throughput sequencing necessitates the development of novel tools for accurate alignment of reads to reference sequences. Current approaches focus on using heuristics to map reads quickly to large genomes, rather than generating highly accurate alignments in coding regions. Such approaches are, thus, unsuited for applications such as amplicon-based analysis and the realignment phase of exome sequencing and RNA-seq, where accurate and biologically relevant alignment of coding regions is critical. To facilitate such analyses, we have developed a novel tool, RAMICS, that is tailored to mapping large numbers of sequence reads to short lengths (<10 000 bp) of coding DNA. RAMICS utilizes profile hidden Markov models to discover the open reading frame of each sequence and aligns to the reference sequence in a biologically relevant manner, distinguishing between genuine codon-sized indels and frameshift mutations. This approach facilitates the generation of highly accurate alignments, accounting for the error biases of the sequencing machine used to generate reads, particularly at homopolymer regions. Performance improvements are gained through the use of graphics processing units, which increase the speed of mapping through parallelization. RAMICS substantially outperforms all other mapping approaches tested in terms of alignment quality while maintaining highly competitive speed performance. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
USDA-ARS?s Scientific Manuscript database
Sorghum is the second cereal crop to have a full genome completely sequenced (Nature (2009), 457:551). This achievement is widely recognized as a scientific milestone for grass genetics and genomics in general. However, the true worth of genetic information lies in translating the sequence informa...
USDA-ARS?s Scientific Manuscript database
Ongoing developments and cost decreases in next-generation sequencing (NGS) technologies have led to an increase in their application, which has greatly enhanced the fields of genetics and genomics. Mapping sequence reads onto a reference genome is a fundamental step in the analysis of NGS data. Eff...
The Organization of Repetitive DNA in the Genomes of Amazonian Lizard Species in the Family Teiidae.
Carvalho, Natalia D M; Pinheiro, Vanessa S S; Carmo, Edson J; Goll, Leonardo G; Schneider, Carlos H; Gross, Maria C
2015-01-01
Repetitive DNA is the largest fraction of the eukaryote genome and comprises tandem and dispersed sequences. It presents variations in relation to its composition, number of copies, distribution, dynamics, and genome organization, and participates in the evolutionary diversification of different vertebrate species. Repetitive sequences are usually located in the heterochromatin of centromeric and telomeric regions of chromosomes, contributing to chromosomal structures. Therefore, the aim of this study was to physically map repetitive DNA sequences (5S rDNA, telomeric sequences, tropomyosin gene 1, and retroelements Rex1 and SINE) of mitotic chromosomes of Amazonian species of teiids (Ameiva ameiva, Cnemidophorus sp. 1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin) to understand their genome organization and karyotype evolution. The mapping of repetitive sequences revealed a distinct pattern in Cnemidophorus sp. 1, whereas the other species showed all sequences interspersed in the heterochromatic region. Physical mapping of the tropomyosin 1 gene was performed for the first time in lizards and showed that in addition to being functional, this gene has a structural function similar to the mapped repetitive elements as it is located preferentially in centromeric regions and termini of chromosomes. © 2016 S. Karger AG, Basel.
Chaudhary, Sakshi; Mishra, Bharat Kumar; Vivek, Thiruvettai; Magadum, Santoshkumar; Yasin, Jeshima Khan
2016-01-01
Simple Sequence Repeats or microsatellites are resourceful molecular genetic markers. There are only few reports of SSR identification and development in pineapple. Complete genome sequence of pineapple available in the public domain can be used to develop numerous novel SSRs. Therefore, an attempt was made to identify SSRs from genomic, chloroplast, mitochondrial and EST sequences of pineapple which will help in deciphering genetic makeup of its germplasm resources. A total of 359511 SSRs were identified in pineapple (356385 from genome sequence, 45 from chloroplast sequence, 249 in mitochondrial sequence and 2832 from EST sequences). The list of EST-SSR markers and their details are available in the database. PineElm_SSRdb is an open source database available for non-commercial academic purpose at http://app.bioelm.com/ with a mapping tool which can develop circular maps of selected marker set. This database will be of immense use to breeders, researchers and graduates working on Ananas spp. and to others working on cross-species transferability of markers, investigating diversity, mapping and DNA fingerprinting.
Mapping wide row crops with video sequences acquired from a tractor moving at treatment speed.
Sainz-Costa, Nadir; Ribeiro, Angela; Burgos-Artizzu, Xavier P; Guijarro, María; Pajares, Gonzalo
2011-01-01
This paper presents a mapping method for wide row crop fields. The resulting map shows the crop rows and weeds present in the inter-row spacing. Because field videos are acquired with a camera mounted on top of an agricultural vehicle, a method for image sequence stabilization was needed and consequently designed and developed. The proposed stabilization method uses the centers of some crop rows in the image sequence as features to be tracked, which compensates for the lateral movement (sway) of the camera and leaves the pitch unchanged. A region of interest is selected using the tracked features, and an inverse perspective technique transforms the selected region into a bird's-eye view that is centered on the image and that enables map generation. The algorithm developed has been tested on several video sequences of different fields recorded at different times and under different lighting conditions, with good initial results. Indeed, lateral displacements of up to 66% of the inter-row spacing were suppressed through the stabilization process, and crop rows in the resulting maps appear straight.
Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire
2012-01-01
Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Baeßler, Bettina; Schaarschmidt, Frank; Stehning, Christian; Schnackenburg, Bernhard; Maintz, David; Bunck, Alexander C
2015-11-01
Previous studies showed that myocardial T2 relaxation times measured by cardiac T2-mapping vary significantly depending on sequence and field strength. Therefore, a systematic comparison of different T2-mapping sequences and the establishment of dedicated T2 reference values is mandatory for diagnostic decision-making. Phantom experiments using gel probes with a range of different T1 and T2 times were performed on a clinical 1.5T and 3T scanner. In addition, 30 healthy volunteers were examined at 1.5 and 3T in immediate succession. In each examination, three different T2-mapping sequences were performed at three short-axis slices: Multi Echo Spin Echo (MESE), T2-prepared balanced SSFP (T2prep), and Gradient Spin Echo with and without fat saturation (GraSEFS/GraSE). Segmented T2-Maps were generated according to the AHA 16-segment model and statistical analysis was performed. Significant intra-individual differences between mean T2 times were observed for all sequences. In general, T2prep resulted in lowest and GraSE in highest T2 times. A significant variation with field strength was observed for mean T2 in phantom as well as in vivo, with higher T2 values at 1.5T compared to 3T, regardless of the sequence used. Segmental T2 values for each sequence at 1.5 and 3T are presented. Despite a careful selection of sequence parameters and volunteers, significant variations of the measured T2 values were observed between field strengths, MR sequences and myocardial segments. Therefore, we present segmental T2 values for each sequence at 1.5 and 3T with the inherent potential to serve as reference values for future studies. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Toward an Integrated BAC Library Resource for Genome Sequencing and Analysis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Simon, M. I.; Kim, U.-J.
We developed a great deal of expertise in building large BAC libraries from a variety of DNA sources including humans, mice, corn, microorganisms, worms, and Arabidopsis. We greatly improved the technology for screening these libraries rapidly and for selecting appropriate BACs and mapping BACs to develop large overlapping contigs. We became involved in supplying BACs and BAC contigs to a variety of sequencing and mapping projects and we began to collaborate with Drs. Adams and Venter at TIGR and with Dr. Leroy Hood and his group at University of Washington to provide BACs for end sequencing and for mapping andmore » sequencing of large fragments of chromosome 16. Together with Dr. Ian Dunham and his co-workers at the Sanger Center we completed the mapping and they completed the sequencing of the first human chromosome, chromosome 22. This was published in Nature in 1999 and our BAC contigs made a major contribution to this sequencing effort. Drs. Shizuya and Ding invented an automated highly accurate BAC mapping technique. We also developed long-term collaborations with Dr. Uli Weier at UCSF in the design of BAC probes for characterization of human tumors and specific chromosome deletions and breakpoints. Finally the contribution of our work to the human genome project has been recognized in the publication both by the international consortium and the NIH of a draft sequence of the human genome in Nature last year. Dr. Shizuya was acknowledged in the authorship of that landmark paper. Dr. Simon was also an author on the Venter/Adams Celera project sequencing the human genome that was published in Science last year.« less
Stability of Adrenaline in Irrigating Solution for Intraocular Surgery.
Shibata, Yuuka; Kimura, Yasuhiro; Taogoshi, Takanori; Matsuo, Hiroaki; Kihira, Kenji
2016-01-01
Intraocular irrigating solution containing 1 µg/mL adrenaline is widely used during cataract surgery to maintain pupil dilation. Prepared intraocular irrigating solutions are recommended for use within 6 h. After the irrigating solution is admistered for dilution, the adrenaline may become oxidized, and this may result in a decrease in its biological activity. However, the stability of adrenaline in intraocular irrigating solution is not fully understood. The aim of this study was to evaluate the stability of adrenaline in clinically used irrigating solutions of varying pH. Six hours after mixing, the adrenaline percentages remaining were 90.6%±3.7 (pH 7.2), 91.1%±2.2 (pH 7.5), and 65.2%±2.8 (pH 8.0) of the initial concentration. One hour after mixing, the percentages remaining were 97.6%±2.0 (pH 7.2), 97.4%±2.7 (pH 7.5), and 95.6%±3.3 (pH 8.0). The degradation was especially remarkable and time dependent in the solution at pH 8.0. These results indicate that the concentration of adrenaline is decreased after preparation. Moreover, we investigated the influence of sodium bisulfite on adrenaline stability in irrigating solution. The percentage adrenaline remaining at 6 h after mixing in irrigating solution (pH 8.0) containing sodium bisulfite at 0.5 µg/mL (concentration in irrigating solution) or at 500 µg/mL (concentration in the undiluted adrenaline preparation) were 57.5 and 97.3%, respectively. Therefore, the low concentration of sodium bisulfite in the irrigating solution may be a cause of the adrenaline loss. In conclusion, intraocular irrigation solution with adrenaline should be prepared just prior to its use in surgery.
Speech processing using maximum likelihood continuity mapping
Hogden, John E.
2000-01-01
Speech processing is obtained that, given a probabilistic mapping between static speech sounds and pseudo-articulator positions, allows sequences of speech sounds to be mapped to smooth sequences of pseudo-articulator positions. In addition, a method for learning a probabilistic mapping between static speech sounds and pseudo-articulator position is described. The method for learning the mapping between static speech sounds and pseudo-articulator position uses a set of training data composed only of speech sounds. The said speech processing can be applied to various speech analysis tasks, including speech recognition, speaker recognition, speech coding, speech synthesis, and voice mimicry.
Speech processing using maximum likelihood continuity mapping
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hogden, J.E.
Speech processing is obtained that, given a probabilistic mapping between static speech sounds and pseudo-articulator positions, allows sequences of speech sounds to be mapped to smooth sequences of pseudo-articulator positions. In addition, a method for learning a probabilistic mapping between static speech sounds and pseudo-articulator position is described. The method for learning the mapping between static speech sounds and pseudo-articulator position uses a set of training data composed only of speech sounds. The said speech processing can be applied to various speech analysis tasks, including speech recognition, speaker recognition, speech coding, speech synthesis, and voice mimicry.
Kamel, Katarzyna A; Kroc, Magdalena; Święcicki, Wojciech
2015-01-01
Sequence tagged site (STS) markers are valuable tools for genetic and physical mapping that can be successfully used in comparative analyses among related species. Current challenges for molecular markers genotyping in plants include the lack of fast, sensitive and inexpensive methods suitable for sequence variant detection. In contrast, high resolution melting (HRM) is a simple and high-throughput assay, which has been widely applied in sequence polymorphism identification as well as in the studies of genetic variability and genotyping. The present study is the first attempt to use the HRM analysis to genotype STS markers in narrow-leafed lupin (Lupinus angustifolius L.). The sensitivity and utility of this method was confirmed by the sequence polymorphism detection based on melting curve profiles in the parental genotypes and progeny of the narrow-leafed lupin mapping population. Application of different approaches, including amplicon size and a simulated heterozygote analysis, has allowed for successful genetic mapping of 16 new STS markers in the narrow-leafed lupin genome.
Mapping Challenging Mutations by Whole-Genome Sequencing
Smith, Harold E.; Fabritius, Amy S.; Jaramillo-Lambert, Aimee; Golden, Andy
2016-01-01
Whole-genome sequencing provides a rapid and powerful method for identifying mutations on a global scale, and has spurred a renewed enthusiasm for classical genetic screens in model organisms. The most commonly characterized category of mutation consists of monogenic, recessive traits, due to their genetic tractability. Therefore, most of the mapping methods for mutation identification by whole-genome sequencing are directed toward alleles that fulfill those criteria (i.e., single-gene, homozygous variants). However, such approaches are not entirely suitable for the characterization of a variety of more challenging mutations, such as dominant and semidominant alleles or multigenic traits. Therefore, we have developed strategies for the identification of those classes of mutations, using polymorphism mapping in Caenorhabditis elegans as our model for validation. We also report an alternative approach for mutation identification from traditional recombinant crosses, and a solution to the technical challenge of sequencing sterile or terminally arrested strains where population size is limiting. The methods described herein extend the applicability of whole-genome sequencing to a broader spectrum of mutations, including classes that are difficult to map by traditional means. PMID:26945029
DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules.
Eernisse, D J
1992-04-01
DNA Translator and Aligner are molecular phylogenetics HyperCard stacks for Macintosh computers. They manipulate sequence data to provide graphical gene mapping, conversions, translations and manual multiple-sequence alignment editing. DNA Translator is able to convert documented GenBank or EMBL documented sequences into linearized, rescalable gene maps whose gene sequences are extractable by clicking on the corresponding map button or by selection from a scrolling list. Provided gene maps, complete with extractable sequences, consist of nine metazoan, one yeast, and one ciliate mitochondrial DNAs and three green plant chloroplast DNAs. Single or multiple sequences can be manipulated to aid in phylogenetic analysis. Sequences can be translated between nucleic acids and proteins in either direction with flexible support of alternate genetic codes and ambiguous nucleotide symbols. Multiple aligned sequence output from diverse sources can be converted to Nexus, Hennig86 or PHYLIP format for subsequent phylogenetic analysis. Input or output alignments can be examined with Aligner, a convenient accessory stack included in the DNA Translator package. Aligner is an editor for the manual alignment of up to 100 sequences that toggles between display of matched characters and normal unmatched sequences. DNA Translator also generates graphic displays of amino acid coding and codon usage frequency relative to all other, or only synonymous, codons for approximately 70 select organism-organelle combinations. Codon usage data is compatible with spreadsheet or UWGCG formats for incorporation of additional molecules of interest. The complete package is available via anonymous ftp and is free for non-commercial uses.
Mapping of disease-associated variants in admixed populations
2011-01-01
Recent developments in high-throughput genotyping and whole-genome sequencing will enhance the identification of disease loci in admixed populations. We discuss how a more refined estimation of ancestry benefits both admixture mapping and association mapping, making disease loci identification in admixed populations more powerful. High-throughput genotyping and sequencing will enable refined estimation of ancestry, thus enhancing disease loci identification in admixed populations PMID:21635713
Schein, Jacqueline E.; Tangen, Kristin L.; Chiu, Readman; Shin, Heesun; Lengeler, Klaus B.; MacDonald, William Kim; Bosdet, Ian; Heitman, Joseph; Jones, Steven J.M.; Marra, Marco A.; Kronstad, James W.
2002-01-01
The basidiomycete fungus Cryptococcus neoformans is an important opportunistic pathogen of humans that poses a significant threat to immunocompromised individuals. Isolates of C. neoformans are classified into serotypes (A, B, C, D, and AD) based on antigenic differences in the polysaccharide capsule that surrounds the fungal cells. Genomic and EST sequencing projects are underway for the serotype D strain JEC21 and the serotype A strain H99. As part of a genomics program for C. neoformans, we have constructed fingerprinted bacterial artificial chromosome (BAC) clone physical maps for strains H99 and JEC21 to support the genomic sequencing efforts and to provide an initial comparison of the two genomes. The BAC clones represented an estimated 10-fold redundant coverage of the genomes of each serotype and allowed the assembly of 20 contigs each for H99 and JEC21. We found that the genomes of the two strains are sufficiently distinct to prevent coassembly of the two maps when combined fingerprint data are used to construct contigs. Hybridization experiments placed 82 markers on the JEC21 map and 102 markers on the H99 map, enabling contigs to be linked with specific chromosomes identified by electrophoretic karyotyping. These markers revealed both extensive similarity in gene order (conservation of synteny) between JEC21 and H99 as well as examples of chromosomal rearrangements including inversions and translocations. Sequencing reads were generated from the ends of the BAC clones to allow correlation of genomic shotgun sequence data with physical map contigs. The BAC maps therefore represent a valuable resource for the generation, assembly, and finishing of the genomic sequence of both JEC21 and H99. The physical maps also serve as a link between map-based and sequence-based data, providing a powerful resource for continued genomic studies. [This paper is dedicated to the memory of Michael Smith, Founding Director of the Biotechnology Laboratory and the BC Cancer Agency Genome Sciences Centre. Supplemental material is available online at http://www.genome.org.] PMID:12213782
Guo, Yinshan; Xing, Huiyang; Zhao, Yuhui; Liu, Zhendong; Li, Kun; Guo, Xiuwu
2017-01-01
Genetic maps are important tools in plant genomics and breeding. We report a large-scale discovery of single nucleotide polymorphisms (SNPs) using the specific length amplified fragment sequencing (SLAF-seq) technique for the construction of high-density genetic maps for two elite wine grape cultivars, ‘Chardonnay’ and ‘Beibinghong’, and their 130 F1 plants. A total of 372.53 M paired-end reads were obtained after preprocessing. The average sequencing depth was 33.81 for ‘Chardonnay’ (the female parent), 48.20 for ‘Beibinghong’ (the male parent), and 12.66 for the F1 offspring. We detected 202,349 high-quality SLAFs of which 144,972 were polymorphic; 10,042 SNPs were used to construct a genetic map that spanned 1,969.95 cM, with an average genetic distance of 0.23 cM between adjacent markers. This genetic map contains the largest molecular marker number of the grape maps so far reported. We thus demonstrate that SLAF-seq is a promising strategy for the construction of high-density genetic maps; the map that we report here is a good potential resource for QTL mapping of genes linked to major economic and agronomic traits, map-based cloning, and marker-assisted selection of grape. PMID:28746364
Evaluation of MRI sequences for quantitative T1 brain mapping
NASA Astrophysics Data System (ADS)
Tsialios, P.; Thrippleton, M.; Glatz, A.; Pernet, C.
2017-11-01
T1 mapping constitutes a quantitative MRI technique finding significant application in brain imaging. It allows evaluation of contrast uptake, blood perfusion, volume, providing a more specific biomarker of disease progression compared to conventional T1-weighted images. While there are many techniques for T1-mapping there is a wide range of reported T1-values in tissues, raising the issue of protocols reproducibility and standardization. The gold standard for obtaining T1-maps is based on acquiring IR-SE sequence. Widely used alternative sequences are IR-SE-EPI, VFA (DESPOT), DESPOT-HIFI and MP2RAGE that speed up scanning and fitting procedures. A custom MRI phantom was used to assess the reproducibility and accuracy of the different methods. All scans were performed using a 3T Siemens Prisma scanner. The acquired data processed using two different codes. The main difference was observed for VFA (DESPOT) which grossly overestimated T1 relaxation time by 214 ms [126 270] compared to the IR-SE sequence. MP2RAGE and DESPOT-HIFI sequences gave slightly shorter time than IR-SE (~20 to 30ms) and can be considered as alternative and time-efficient methods for acquiring accurate T1 maps of the human brain, while IR-SE-EPI gave identical result, at a cost of a lower image quality.
Mapping copy number variation by population-scale genome sequencing.
Mills, Ryan E; Walter, Klaudia; Stewart, Chip; Handsaker, Robert E; Chen, Ken; Alkan, Can; Abyzov, Alexej; Yoon, Seungtai Chris; Ye, Kai; Cheetham, R Keira; Chinwalla, Asif; Conrad, Donald F; Fu, Yutao; Grubert, Fabian; Hajirasouliha, Iman; Hormozdiari, Fereydoun; Iakoucheva, Lilia M; Iqbal, Zamin; Kang, Shuli; Kidd, Jeffrey M; Konkel, Miriam K; Korn, Joshua; Khurana, Ekta; Kural, Deniz; Lam, Hugo Y K; Leng, Jing; Li, Ruiqiang; Li, Yingrui; Lin, Chang-Yun; Luo, Ruibang; Mu, Xinmeng Jasmine; Nemesh, James; Peckham, Heather E; Rausch, Tobias; Scally, Aylwyn; Shi, Xinghua; Stromberg, Michael P; Stütz, Adrian M; Urban, Alexander Eckehart; Walker, Jerilyn A; Wu, Jiantao; Zhang, Yujun; Zhang, Zhengdong D; Batzer, Mark A; Ding, Li; Marth, Gabor T; McVean, Gil; Sebat, Jonathan; Snyder, Michael; Wang, Jun; Ye, Kenny; Eichler, Evan E; Gerstein, Mark B; Hurles, Matthew E; Lee, Charles; McCarroll, Steven A; Korbel, Jan O
2011-02-03
Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.
2016-04-01
Sequence tags were mapped on the human reference genome using the Novoalign software. Only those...ends of the linear islands to create a novel junctional sequence that does not exist in the genome . Thus the PE- sequence of a fragment that breaks at... genome (Fig. 3b). Those PE-tags where one tag maps uniquely to an island and the other remains unmapped, but passes the sequence quality filter,
An optimized protocol for generation and analysis of Ion Proton sequencing reads for RNA-Seq.
Yuan, Yongxian; Xu, Huaiqian; Leung, Ross Ka-Kit
2016-05-26
Previous studies compared running cost, time and other performance measures of popular sequencing platforms. However, comprehensive assessment of library construction and analysis protocols for Proton sequencing platform remains unexplored. Unlike Illumina sequencing platforms, Proton reads are heterogeneous in length and quality. When sequencing data from different platforms are combined, this can result in reads with various read length. Whether the performance of the commonly used software for handling such kind of data is satisfactory is unknown. By using universal human reference RNA as the initial material, RNaseIII and chemical fragmentation methods in library construction showed similar result in gene and junction discovery number and expression level estimated accuracy. In contrast, sequencing quality, read length and the choice of software affected mapping rate to a much larger extent. Unspliced aligner TMAP attained the highest mapping rate (97.27 % to genome, 86.46 % to transcriptome), though 47.83 % of mapped reads were clipped. Long reads could paradoxically reduce mapping in junctions. With reference annotation guide, the mapping rate of TopHat2 significantly increased from 75.79 to 92.09 %, especially for long (>150 bp) reads. Sailfish, a k-mer based gene expression quantifier attained highly consistent results with that of TaqMan array and highest sensitivity. We provided for the first time, the reference statistics of library preparation methods, gene detection and quantification and junction discovery for RNA-Seq by the Ion Proton platform. Chemical fragmentation performed equally well with the enzyme-based one. The optimal Ion Proton sequencing options and analysis software have been evaluated.
Webb, R; Troyan, T; Sherman, D; Sherman, L A
1994-08-01
Growth of Synechococcus sp. strain PCC 7942 in iron-deficient media leads to the accumulation of an approximately 34-kDa protein. The gene encoding this protein, mapA (membrane-associated protein A), has been cloned and sequenced (GenBank accession number, L01621). The mapA transcript is not detectable in normally grown cultures but is stably accumulated by cells grown in iron-deficient media. However, the promoter sequence for this gene does not resemble other bacterial iron-regulated promoters described to date. The carboxyl-terminal region of the derived amino acid sequence of MapA resembles bacterial proteins involved in iron acquisition, whereas the amino-terminal end of MapA has a high degree of amino acid identity with the abundant, chloroplast envelope protein E37. An approach employing improved cellular fractionation techniques as well as electron microscopy and immunocytochemistry was essential in localizing MapA protein to the cytoplasmic membrane of Synechococcus sp. strain PCC 7942. When these cells were grown under iron-deficient conditions, a significant fraction of MapA could also be localized to the thylakoid membranes.
Jairin, Jirapong; Kobayashi, Tetsuya; Yamagata, Yoshiyuki; Sanada-Morimura, Sachiyo; Mori, Kazuki; Tashiro, Kosuke; Kuhara, Satoru; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Yamamoto, Kimiko; Matsumura, Masaya; Yasui, Hideshi
2013-01-01
In this study, we developed the first genetic linkage map for the major rice insect pest, the brown planthopper (BPH, Nilaparvata lugens). The linkage map was constructed by integrating linkage data from two backcross populations derived from three inbred BPH strains. The consensus map consists of 474 simple sequence repeats, 43 single-nucleotide polymorphisms, and 1 sequence-tagged site, for a total of 518 markers at 472 unique positions in 17 linkage groups. The linkage groups cover 1093.9 cM, with an average distance of 2.3 cM between loci. The average number of marker loci per linkage group was 27.8. The sex-linkage group was identified by exploiting X-linked and Y-specific markers. Our linkage map and the newly developed markers used to create it constitute an essential resource and a useful framework for future genetic analyses in BPH. PMID:23204257
Comparative fine mapping of the Wax 1 (W1) locus in hexaploid wheat.
Lu, Ping; Qin, Jinxia; Wang, Guoxin; Wang, Lili; Wang, Zhenzhong; Wu, Qiuhong; Xie, Jingzhong; Liang, Yong; Wang, Yong; Zhang, Deyun; Sun, Qixin; Liu, Zhiyong
2015-08-01
By applying comparative genomics analyses, a high-density genetic linkage map of the Wax 1 ( W1 ) locus was constructed as a framework for map-based cloning. Glaucousness is described as the scattering effect of visible light from wax deposited on the cuticle of plant aerial organs. In wheat, the wax on leaves and stems is mainly controlled by two sets of genes: glaucousness loci (W1 and W2) and non-glaucousness loci (Iw1 and Iw2). Bulked segregant analysis (BSA) and simple sequence repeat (SSR) mapping showed that Wax1 (W1) is located on chromosome arm 2BS between markers Xgwm210 and Xbarc35. By applying comparative genomics analyses, colinearity genomic regions of the W1 locus on wheat 2BS were identified in Brachypodium distachyon chromosome 5, rice chromosome 4 and sorghum chromosome 6, respectively. Four STS markers were developed using the Triticum aestivum cv. Chinese Spring 454 contig sequences and the International Wheat Genome Sequencing Consortium (IWGSC) survey sequences. W1 was mapped into a 0.93 cM genetic interval flanked by markers XWGGC3197 and XWGGC2484, which has synteny with genomic regions of 56.5 kb in Brachypodium, 390 kb in rice and 31.8 kb in sorghum. The fine genetic map can serve as a framework for chromosome landing, physical mapping and map-based cloning of the W1 in wheat.
Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W.; Howieson, John G.; Li, Chengdao
2013-01-01
Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species. PMID:23734219
Yang, Huaan; Tao, Ye; Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W; Howieson, John G; Li, Chengdao
2013-01-01
Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species.
NASA Astrophysics Data System (ADS)
Eugster, H.; Huber, F.; Nebiker, S.; Gisi, A.
2012-07-01
Stereovision based mobile mapping systems enable the efficient capturing of directly georeferenced stereo pairs. With today's camera and onboard storage technologies imagery can be captured at high data rates resulting in dense stereo sequences. These georeferenced stereo sequences provide a highly detailed and accurate digital representation of the roadside environment which builds the foundation for a wide range of 3d mapping applications and image-based geo web-services. Georeferenced stereo images are ideally suited for the 3d mapping of street furniture and visible infrastructure objects, pavement inspection, asset management tasks or image based change detection. As in most mobile mapping systems, the georeferencing of the mapping sensors and observations - in our case of the imaging sensors - normally relies on direct georeferencing based on INS/GNSS navigation sensors. However, in urban canyons the achievable direct georeferencing accuracy of the dynamically captured stereo image sequences is often insufficient or at least degraded. Furthermore, many of the mentioned application scenarios require homogeneous georeferencing accuracy within a local reference frame over the entire mapping perimeter. To achieve these demands georeferencing approaches are presented and cost efficient workflows are discussed which allows validating and updating the INS/GNSS based trajectory with independently estimated positions in cases of prolonged GNSS signal outages in order to increase the georeferencing accuracy up to the project requirements.
Isobe, Sachiko N.; Hirakawa, Hideki; Sato, Shusei; Maeda, Fumi; Ishikawa, Masami; Mori, Toshiki; Yamamoto, Yuko; Shirasawa, Kenta; Kimura, Mitsuhiro; Fukami, Masanobu; Hashizume, Fujio; Tsuji, Tomoko; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Tsuruoka, Hisano; Minami, Chiharu; Takahashi, Chika; Wada, Tsuyuko; Ono, Akiko; Kawashima, Kumiko; Nakazaki, Naomi; Kishida, Yoshie; Kohara, Mitsuyo; Nakayama, Shinobu; Yamada, Manabu; Fujishiro, Tsunakazu; Watanabe, Akiko; Tabata, Satoshi
2013-01-01
The cultivated strawberry (Fragaria× ananassa) is an octoploid (2n = 8x = 56) of the Rosaceae family whose genomic architecture is still controversial. Several recent studies support the AAA′A′BBB′B′ model, but its complexity has hindered genetic and genomic analysis of this important crop. To overcome this difficulty and to assist genome-wide analysis of F. × ananassa, we constructed an integrated linkage map by organizing a total of 4474 of simple sequence repeat (SSR) markers collected from published Fragaria sequences, including 3746 SSR markers [Fragaria vesca expressed sequence tag (EST)-derived SSR markers] derived from F. vesca ESTs, 603 markers (F. × ananassa EST-derived SSR markers) from F. × ananassa ESTs, and 125 markers (F. × ananassa transcriptome-derived SSR markers) from F. × ananassa transcripts. Along with the previously published SSR markers, these markers were mapped onto five parent-specific linkage maps derived from three mapping populations, which were then assembled into an integrated linkage map. The constructed map consists of 1856 loci in 28 linkage groups (LGs) that total 2364.1 cM in length. Macrosynteny at the chromosome level was observed between the LGs of F. × ananassa and the genome of F. vesca. Variety distinction on 129 F. × ananassa lines was demonstrated using 45 selected SSR markers. PMID:23248204
NASA Astrophysics Data System (ADS)
Urbano, Gustavo; Lázaro, Isabel; Rodríguez, Israel; Reyes, Juan Luis; Larios, Roxana; Cruz, Roel
2016-02-01
Comparative voltammetry and differential double-layer capacitance studies were performed to evaluate interfacial interactions between chalcopyrite (CuFeS2) and n-isopropyl xanthate (X) in the presence of ammonium bisulfite/39wt% SO2 and caustic starch at different pH values. Raman spectroscopy, Fourier transform infrared (FTIR) spectroscopy, contact angle measurements, and microflotation tests were used to establish the type and extent of xanthate adsorption as well as the species involved under different mineral surface conditions in this study. The results demonstrate that the species that favor a greater hydrophobicity of chalcopyrite are primarily CuX and S0, whereas oxides and hydroxides of Cu and Fe as well as an excess of starch decrease the hydrophobicity. A conditioning of the mineral surface with ammonium bisulfite/39wt% SO2 at pH 6 promotes the activation of surface and enhances the xanthate adsorption. However, this effect is diminished at pH ≥ 8, when an excess of starch is added during the preconditioning step.
McCarthy, David; Pulverer, Walter; Weinhaeusel, Andreas; Diago, Oscar R; Hogan, Daniel J; Ostertag, Derek; Hanna, Michelle M
2016-06-01
Development of a sensitive method for DNA methylation profiling and associated mutation detection in clinical samples. Formalin-fixed and paraffin-embedded tumors received by clinical laboratories often contain insufficient DNA for analysis with bisulfite or methylation sensitive restriction enzymes-based methods. To increase sensitivity, methyl-CpG DNA capture and Coupled Abscription PCR Signaling detection were combined in a new assay, MethylMeter(®). Gliomas were analyzed for MGMT methylation, glioma CpG island methylator phenotype and IDH1 R132H. MethylMeter had 100% assay success rate measuring all five biomarkers in formalin-fixed and paraffin-embedded tissue. MGMT methylation results were supported by survival and mRNA expression data. MethylMeter is a sensitive and quantitative method for multitarget DNA methylation profiling and associated mutation detection. The MethylMeter-based GliomaSTRAT assay measures methylation of four targets and one mutation to simultaneously grade gliomas and predict their response to temozolomide. This information is clinically valuable in management of gliomas.
BM-Map: Bayesian Mapping of Multireads for Next-Generation Sequencing Data
Ji, Yuan; Xu, Yanxun; Zhang, Qiong; Tsui, Kam-Wah; Yuan, Yuan; Norris, Clift; Liang, Shoudan; Liang, Han
2011-01-01
Summary Next-generation sequencing (NGS) technology generates millions of short reads, which provide valuable information for various aspects of cellular activities and biological functions. A key step in NGS applications (e.g., RNA-Seq) is to map short reads to correct genomic locations within the source genome. While most reads are mapped to a unique location, a significant proportion of reads align to multiple genomic locations with equal or similar numbers of mismatches; these are called multireads. The ambiguity in mapping the multireads may lead to bias in downstream analyses. Currently, most practitioners discard the multireads in their analysis, resulting in a loss of valuable information, especially for the genes with similar sequences. To refine the read mapping, we develop a Bayesian model that computes the posterior probability of mapping a multiread to each competing location. The probabilities are used for downstream analyses, such as the quantification of gene expression. We show through simulation studies and RNA-Seq analysis of real life data that the Bayesian method yields better mapping than the current leading methods. We provide a C++ program for downloading that is being packaged into a user-friendly software. PMID:21517792
Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma Jj; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco Cam; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric
2016-01-01
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple ( Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species.
Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma JJ; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco CAM; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric
2016-01-01
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species. PMID:27917289
Indexing a sequence for mapping reads with a single mismatch.
Crochemore, Maxime; Langiu, Alessio; Rahman, M Sohel
2014-05-28
Mapping reads against a genome sequence is an interesting and useful problem in computational molecular biology and bioinformatics. In this paper, we focus on the problem of indexing a sequence for mapping reads with a single mismatch. We first focus on a simpler problem where the length of the pattern is given beforehand during the data structure construction. This version of the problem is interesting in its own right in the context of the next generation sequencing. In the sequel, we show how to solve the more general problem. In both cases, our algorithm can construct an efficient data structure in O(n log(1+ε) n) time and space and can answer subsequent queries in O(m log log n + K) time. Here, n is the length of the sequence, m is the length of the read, 0<ε<1 and is the optimal output size.
Mapping of aldose reductase gene sequences to human chromosomes 1, 3, 7, 9, 11, and 13
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bateman, J.B.; Kojis, T.; Heinzmann, C.
1993-09-01
Aldose reductase (alditol:NAD(P)+ 1-oxidoreductase; EC 1.1.1.21) (AR) catalyzes the reduction of several aldehydes, including that of glucose, to the corresponding sugar alcohol. Using a complementary DNA clone encoding human AR, the authors mapped the gene sequences to human chromosomes 1, 3, 7, 9, 11, 13, 14, and 18 by somatic cell hybridization. By in situ hybridization analysis, sequences were localized to human chromosomes 1q32-q43, 3p12, 7q31-q35, 9q22, 11p14-p15, and 13q14-q21. As a putative functional AR gene has been mapped to chromosome 7 and a putative pseudogene to chromosome 3, the sequences on the other seven chromosomes may represent other activemore » genes, non-aldose reductase homologous sequences, or pseudogenes. 24 refs., 3 figs., 2 tabs.« less
Holtz, Yan; Ardisson, Morgane; Ranwez, Vincent; Besnard, Alban; Leroy, Philippe; Poux, Gérard; Roumet, Pierre; Viader, Véronique; Santoni, Sylvain; David, Jacques
2016-01-01
Targeted sequence capture is a promising technology which helps reduce costs for sequencing and genotyping numerous genomic regions in large sets of individuals. Bait sequences are designed to capture specific alleles previously discovered in parents or reference populations. We studied a set of 135 RILs originating from a cross between an emmer cultivar (Dic2) and a recent durum elite cultivar (Silur). Six thousand sequence baits were designed to target Dic2 vs. Silur polymorphisms discovered in a previous RNAseq study. These baits were exposed to genomic DNA of the RIL population. Eighty percent of the targeted SNPs were recovered, 65% of which were of high quality and coverage. The final high density genetic map consisted of more than 3,000 markers, whose genetic and physical mapping were consistent with those obtained with large arrays. PMID:27171472
Loy, Alexander; Lehner, Angelika; Lee, Natuschka; Adamczyk, Justyna; Meier, Harald; Ernst, Jens; Schleifer, Karl-Heinz; Wagner, Michael
2002-01-01
For cultivation-independent detection of sulfate-reducing prokaryotes (SRPs) an oligonucleotide microarray consisting of 132 16S rRNA gene-targeted oligonucleotide probes (18-mers) having hierarchical and parallel (identical) specificity for the detection of all known lineages of sulfate-reducing prokaryotes (SRP-PhyloChip) was designed and subsequently evaluated with 41 suitable pure cultures of SRPs. The applicability of SRP-PhyloChip for diversity screening of SRPs in environmental and clinical samples was tested by using samples from periodontal tooth pockets and from the chemocline of a hypersaline cyanobacterial mat from Solar Lake (Sinai, Egypt). Consistent with previous studies, SRP-PhyloChip indicated the occurrence of Desulfomicrobium spp. in the tooth pockets and the presence of Desulfonema- and Desulfomonile-like SRPs (together with other SRPs) in the chemocline of the mat. The SRP-PhyloChip results were confirmed by several DNA microarray-independent techniques, including specific PCR amplification, cloning, and sequencing of SRP 16S rRNA genes and the genes encoding the dissimilatory (bi)sulfite reductase (dsrAB). PMID:12324358
Secco, David; Wang, Chuang; Shou, Huixia; Schultz, Matthew D; Chiarenza, Serge; Nussaume, Laurent; Ecker, Joseph R; Whelan, James; Lister, Ryan
2015-01-01
Cytosine DNA methylation (mC) is a genome modification that can regulate the expression of coding and non-coding genetic elements. However, little is known about the involvement of mC in response to environmental cues. Using whole genome bisulfite sequencing to assess the spatio-temporal dynamics of mC in rice grown under phosphate starvation and recovery conditions, we identified widespread phosphate starvation-induced changes in mC, preferentially localized in transposable elements (TEs) close to highly induced genes. These changes in mC occurred after changes in nearby gene transcription, were mostly DCL3a-independent, and could partially be propagated through mitosis, however no evidence of meiotic transmission was observed. Similar analyses performed in Arabidopsis revealed a very limited effect of phosphate starvation on mC, suggesting a species-specific mechanism. Overall, this suggests that TEs in proximity to environmentally induced genes are silenced via hypermethylation, and establishes the temporal hierarchy of transcriptional and epigenomic changes in response to stress. DOI: http://dx.doi.org/10.7554/eLife.09343.001 PMID:26196146
Tuorto, Francesca; Herbst, Friederike; Alerasool, Nader; Bender, Sebastian; Popp, Oliver; Federico, Giuseppina; Reitter, Sonja; Liebers, Reinhard; Stoecklin, Georg; Gröne, Hermann-Josef; Dittmar, Gunnar; Glimm, Hanno; Lyko, Frank
2015-09-14
The Dnmt2 enzyme utilizes the catalytic mechanism of eukaryotic DNA methyltransferases to methylate several tRNAs at cytosine 38. Dnmt2 mutant mice, flies, and plants were reported to be viable and fertile, and the biological function of Dnmt2 has remained elusive. Here, we show that endochondral ossification is delayed in newborn Dnmt2-deficient mice, which is accompanied by a reduction of the haematopoietic stem and progenitor cell population and a cell-autonomous defect in their differentiation. RNA bisulfite sequencing revealed that Dnmt2 methylates C38 of tRNA Asp(GTC), Gly(GCC), and Val(AAC), thus preventing tRNA fragmentation. Proteomic analyses from primary bone marrow cells uncovered systematic differences in protein expression that are due to specific codon mistranslation by tRNAs lacking Dnmt2-dependent methylation. Our observations demonstrate that Dnmt2 plays an important role in haematopoiesis and define a novel function of C38 tRNA methylation in the discrimination of near-cognate codons, thereby ensuring accurate polypeptide synthesis. © 2015 The Authors. Published under the terms of the CC BY NC ND 4.0 license.
DNA methylation modulates H19 and IGF2 expression in porcine female eye
Wang, Dongxu; Wang, Guodong; Yang, Hao; Liu, Haibo; Li, Cuie; Li, Xiaolan; Lin, Chao; Song, Yuning; Li, Zhanjun; Liu, Dianfeng
2017-01-01
Abstract The sexually dimorphic expression of H19/IGF2 is evolutionarily conserved. To investigate whether the expression of H19/IGF2 in the female porcine eye is sex-dependent, gene expression and methylation status were evaluated using quantitative real-time PCR (qPCR) and bisulfite sequencing PCR (BSP). We hypothesized that H19/IGF2 might exhibit a different DNA methylation status in the female eye. In order to evaluate our hypothesis, parthenogenetic (PA) cells were used for analysis by qPCR and BSP. Our results showed that H19 and IGF2 were over-expressed in the female eye compared with the male eye (3-fold and 2-fold, respectively). We observed a normal monoallelic methylation pattern for H19 differentially methylated regions (DMRs). Compared with H19 DMRs, IGF2 DMRs showed a different methylation pattern in the eye. Taken together, these results suggest that elevated expression of H19/IGF2 is caused by a specific chromatin structure that is regulated by the DNA methylation status of IGF2 DMRs in the female eye. PMID:28266684
Thorup, Casper; Schramm, Andreas
2017-01-01
ABSTRACT This study demonstrates that the deltaproteobacterium Desulfurivibrio alkaliphilus can grow chemolithotrophically by coupling sulfide oxidation to the dissimilatory reduction of nitrate and nitrite to ammonium. Key genes of known sulfide oxidation pathways are absent from the genome of D. alkaliphilus. Instead, the genome contains all of the genes necessary for sulfate reduction, including a gene for a reductive-type dissimilatory bisulfite reductase (DSR). Despite this, growth by sulfate reduction was not observed. Transcriptomic analysis revealed a very high expression level of sulfate-reduction genes during growth by sulfide oxidation, while inhibition experiments with molybdate pointed to elemental sulfur/polysulfides as intermediates. Consequently, we propose that D. alkaliphilus initially oxidizes sulfide to elemental sulfur, which is then either disproportionated, or oxidized by a reversal of the sulfate reduction pathway. This is the first study providing evidence that a reductive-type DSR is involved in a sulfide oxidation pathway. Transcriptome sequencing further suggests that nitrate reduction to ammonium is performed by a novel type of periplasmic nitrate reductase and an unusual membrane-anchored nitrite reductase. PMID:28720728
Genetic and Epigenetic Inactivation of Kruppel-like Factor 4 in Medulloblastoma1
Nakahara, Yukiko; Northcott, Paul A; Li, Meihua; Kongkham, Paul N; Smith, Christian; Yan, Hai; Croul, Sidney; Ra, Young-Shin; Eberhart, Charles; Huang, Annie; Bigner, Darell; Grajkowska, Wesia; Van Meter, Timothy; Rutka, James T; Taylor, Michael D
2010-01-01
Although medulloblastoma is the most common pediatric malignant brain tumor, its molecular underpinnings are largely unknown. We have identified rare, recurrent homozygous deletions of Kruppel-like Factor 4 (KLF4) in medulloblastoma using high-resolution single nucleotide polymorphism arrays, digital karyotyping, and genomic real-time polymerase chain reaction (PCR). Furthermore, we show that there is loss of physiological KLF4 expression in more than 40% of primary medulloblastomas both at the RNA and protein levels. Medulloblastoma cell lines drastically increase the expression of KLF4 in response to the demethylating agent 5-azacytidine and demonstrate dense methylation of the promoter CpG island by bisulfite sequencing. Methylation-specific PCR targeting the KLF4 promoter demonstrates CpG methylation in approximately 16% of primary medulloblastomas. Reexpression of KLF4 in the D283 medulloblastoma cell line results in significant growth suppression both in vitro and in vivo. We conclude that KLF4 is inactivated by either genetic or epigenetic mechanisms in a large subset of medulloblastomas and that it likely functions as a tumor suppressor gene in the pathogenesis of medulloblastoma. PMID:20072650
Smith, Gilbert; Smith, Carl; Kenny, John G; Chaudhuri, Roy R; Ritchie, Michael G
2015-04-01
Epigenetic marks such as DNA methylation play important biological roles in gene expression regulation and cellular differentiation during development. To examine whether DNA methylation patterns are potentially associated with naturally occurring phenotypic differences, we examined genome-wide DNA methylation within Gasterosteus aculeatus, using reduced representation bisulfite sequencing. First, we identified highly methylated regions of the stickleback genome, finding such regions to be located predominantly within genes, and associated with genes functioning in metabolism and biosynthetic processes, cell adhesion, signaling pathways, and blood vessel development. Next, we identified putative differentially methylated regions (DMRs) of the genome between complete and low lateral plate morphs of G. aculeatus. We detected 77 DMRs that were mainly located in intergenic regions. Annotations of genes associated with these DMRs revealed potential functions in a number of known divergent adaptive phenotypes between G. aculeatus ecotypes, including cardiovascular development, growth, and neuromuscular development. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Huang, Xin; Gollin, Susanne M.; Raja, Siva; Godfrey, Tony E.
2002-01-01
Amplification of chromosomal band 11q13 is a common event in human cancer. It has been reported in about 45% of head and neck carcinomas and in other cancers including esophageal, breast, liver, lung, and bladder cancer. To understand the mechanism of 11q13 amplification and to identify the potential oncogene(s) driving it, we have fine-mapped the structure of the amplicon in oral squamous cell carcinoma cell lines and localized the proximal and distal breakpoints. A 5-Mb physical map of the region has been prepared from which sequence is available. We quantified copy number of sequence-tagged site markers at 42–550 kb intervals along the length of the amplicon and defined the amplicon core and breakpoints by using TaqMan-based quantitative microsatellite analysis. The core of the amplicon maps to a 1.5-Mb region. The proximal breakpoint localizes to two intervals between sequence-tagged site markers, 550 kb and 160 kb in size, and the distal breakpoint maps to a 250 kb interval. The cyclin D1 gene maps to the amplicon core, as do two new expressed sequence tag clusters. We have analyzed one of these expressed sequence tag clusters and now report that it contains a previously uncharacterized gene, TAOS1 (tumor amplified and overexpressed sequence 1), which is both amplified and overexpressed in oral cancer cells. The data suggest that TAOS1 may be an amplification-dependent candidate oncogene with a role in the development and/or progression of human tumors, including oral squamous cell carcinomas. The approach described here should be useful for characterizing amplified genomic regions in a wide variety of tumors. PMID:12172009
David, Fabrice P A; Yip, Yum L
2008-09-23
Sequences and structures provide valuable complementary information on protein features and functions. However, it is not always straightforward for users to gather information concurrently from the sequence and structure levels. The UniProt knowledgebase (UniProtKB) strives to help users on this undertaking by providing complete cross-references to Protein Data Bank (PDB) as well as coherent feature annotation using available structural information. In this study, SSMap - a new UniProt-PDB residue-residue level mapping - was generated. The primary objective of this mapping is not only to facilitate the two tasks mentioned above, but also to palliate a number of shortcomings of existent mappings. SSMap is the first isoform sequence-specific mapping resource and is up-to-date for UniProtKB annotation tasks. The method employed by SSMap differs from the other mapping resources in that it stresses on the correct reconstruction of the PDB sequence from structures, and on the correct attribution of a UniProtKB entry to each PDB chain by using a series of post-processing steps. SSMap was compared to other existing mapping resources in terms of the correctness of the attribution of PDB chains to UniProtKB entries, and of the quality of the pairwise alignments supporting the residue-residue mapping. It was found that SSMap shared about 80% of the mappings with other mapping sources. New and alternative mappings proposed by SSMap were mostly good as assessed by manual verification of data subsets. As for local pairwise alignments, it was shown that major discrepancies (both in terms of alignment lengths and boundaries), when present, were often due to differences in methodologies used for the mappings. SSMap provides an independent, good quality UniProt-PDB mapping. The systematic comparison conducted in this study allows the further identification of general problems in UniProt-PDB mappings so that both the coverage and the quality of the mappings can be systematically improved for the benefit of the scientific community. SSMap mapping is currently used to provide PDB cross-references in UniProtKB.
2010-10-14
High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and Massively Parallel Sequencing...Venezuelan equine encephalitis virus (VEEV) genome. We initially used a capillary electrophoresis method to gain insight into the role of the VEEV...Smith JM, Schmaljohn CS (2010) High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and
Ferrand, Guillaume; Luong, Michel; Cloos, Martijn A; Amadon, Alexis; Wackernagel, Hans
2014-08-01
Transmit arrays have been developed to mitigate the RF field inhomogeneity commonly observed in high field magnetic resonance imaging (MRI), typically above 3T. To this end, the knowledge of the RF complex-valued B1 transmit-sensitivities of each independent radiating element has become essential. This paper details a method to speed up a currently available B1-calibration method. The principle relies on slice undersampling, slice and channel interleaving and kriging, an interpolation method developed in geostatistics and applicable in many domains. It has been demonstrated that, under certain conditions, kriging gives the best estimator of a field in a region of interest. The resulting accelerated sequence allows mapping a complete set of eight volumetric field maps of the human head in about 1 min. For validation, the accuracy of kriging is first evaluated against a well-known interpolation technique based on Fourier transform as well as to a B1-maps interpolation method presented in the literature. This analysis is carried out on simulated and decimated experimental B1 maps. Finally, the accelerated sequence is compared to the standard sequence on a phantom and a volunteer. The new sequence provides B1 maps three times faster with a loss of accuracy limited potentially to about 5%.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro
Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Hayes, C; Rump, A; Cadman, M R; Harrison, M; Evans, E P; Lyon, M F; Morriss-Kay, G M; Rosenthal, A; Brown, S D
2001-12-01
The mouse doublefoot (Dbf) mutant exhibits preaxial polydactyly in association with craniofacial defects. This mutation has previously been mapped to mouse chromosome 1. We have used a positional cloning strategy, coupled with a comparative sequencing approach using available human draft sequence, to identify putative candidates for the Dbf gene in the mouse and in homologous human region. We have constructed a high-resolution genetic map of the region, localizing the mutation to a 0.4-cM (+/-0.0061) interval on mouse chromosome 1. Furthermore, we have constructed contiguous BAC/PAC clone maps across the mouse and human Dbf region. Using existing markers and additional sequence tagged sites, which we have generated, we have anchored the physical map to the genetic map. Through the comparative sequencing of these clones we have identified 35 genes within this interval, indicating that the region is gene-rich. From this we have identified several genes that are known to be differentially expressed in the developing mid-gestation mouse embryo, some in the developing embryonic limb buds. These genes include those encoding known developmental signaling molecules such as WNT proteins and IHH, and we provide evidence that these genes are candidates for the Dbf mutation.
Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro; ...
2017-03-10
Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster
Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.
1993-01-01
Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)(n) (8 Mb), (AAGAG)(n) (7 Mb) and (AATAT)(n) (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654
SUGAR: graphical user interface-based data refiner for high-throughput DNA sequencing.
Sato, Yukuto; Kojima, Kaname; Nariai, Naoki; Yamaguchi-Kabata, Yumi; Kawai, Yosuke; Takahashi, Mamoru; Mimori, Takahiro; Nagasaki, Masao
2014-08-08
Next-generation sequencers (NGSs) have become one of the main tools for current biology. To obtain useful insights from the NGS data, it is essential to control low-quality portions of the data affected by technical errors such as air bubbles in sequencing fluidics. We develop a software SUGAR (subtile-based GUI-assisted refiner) which can handle ultra-high-throughput data with user-friendly graphical user interface (GUI) and interactive analysis capability. The SUGAR generates high-resolution quality heatmaps of the flowcell, enabling users to find possible signals of technical errors during the sequencing. The sequencing data generated from the error-affected regions of a flowcell can be selectively removed by automated analysis or GUI-assisted operations implemented in the SUGAR. The automated data-cleaning function based on sequence read quality (Phred) scores was applied to a public whole human genome sequencing data and we proved the overall mapping quality was improved. The detailed data evaluation and cleaning enabled by SUGAR would reduce technical problems in sequence read mapping, improving subsequent variant analysis that require high-quality sequence data and mapping results. Therefore, the software will be especially useful to control the quality of variant calls to the low population cells, e.g., cancers, in a sample with technical errors of sequencing procedures.
A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L.) genome
2010-01-01
Background The construction of genetic linkage maps for cultivated peanut (Arachis hypogaea L.) has and continues to be an important research goal to facilitate quantitative trait locus (QTL) analysis and gene tagging for use in a marker-assisted selection in breeding. Even though a few maps have been developed, they were constructed using diploid or interspecific tetraploid populations. The most recently published intra-specific map was constructed from the cross of cultivated peanuts, in which only 135 simple sequence repeat (SSR) markers were sparsely populated in 22 linkage groups. The more detailed linkage map with sufficient markers is necessary to be feasible for QTL identification and marker-assisted selection. The objective of this study was to construct a genetic linkage map of cultivated peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Results Three recombinant inbred lines (RILs) populations were constructed from three crosses with one common female parental line Yueyou 13, a high yielding Spanish market type. The four parents were screened with 1044 primer pairs designed to amplify SSRs and 901 primer pairs produced clear PCR products. Of the 901 primer pairs, 146, 124 and 64 primer pairs (markers) were polymorphic in these populations, respectively, and used in genotyping these RIL populations. Individual linkage maps were constructed from each of the three populations and a composite map based on 93 common loci were created using JoinMap. The composite linkage maps consist of 22 composite linkage groups (LG) with 175 SSR markers (including 47 SSRs on the published AA genome maps), representing the 20 chromosomes of A. hypogaea. The total composite map length is 885.4 cM, with an average marker density of 5.8 cM. Segregation distortion in the 3 populations was 23.0%, 13.5% and 7.8% of the markers, respectively. These distorted loci tended to cluster on LG1, LG3, LG4 and LG5. There were only 15 EST-SSR markers mapped due to low polymorphism. By comparison, there were potential synteny, collinear order of some markers and conservation of collinear linkage groups among the maps and with the AA genome but not fully conservative. Conclusion A composite linkage map was constructed from three individual mapping populations with 175 SSR markers in 22 composite linkage groups. This composite genetic linkage map is among the first "true" tetraploid peanut maps produced. This map also consists of 47 SSRs that have been used in the published AA genome maps, and could be used in comparative mapping studies. The primers described in this study are PCR-based markers, which are easy to share for genetic mapping in peanuts. All 1044 primer pairs are provided as additional files and the three RIL populations will be made available to public upon request for quantitative trait loci (QTL) analysis and linkage map improvement. PMID:20105299
Predicting protein contact map using evolutionary and physical constraints by integer programming.
Wang, Zhiyong; Xu, Jinbo
2013-07-01
Protein contact map describes the pairwise spatial and functional relationship of residues in a protein and contains key information for protein 3D structure prediction. Although studied extensively, it remains challenging to predict contact map using only sequence information. Most existing methods predict the contact map matrix element-by-element, ignoring correlation among contacts and physical feasibility of the whole-contact map. A couple of recent methods predict contact map by using mutual information, taking into consideration contact correlation and enforcing a sparsity restraint, but these methods demand for a very large number of sequence homologs for the protein under consideration and the resultant contact map may be still physically infeasible. This article presents a novel method PhyCMAP for contact map prediction, integrating both evolutionary and physical restraints by machine learning and integer linear programming. The evolutionary restraints are much more informative than mutual information, and the physical restraints specify more concrete relationship among contacts than the sparsity restraint. As such, our method greatly reduces the solution space of the contact map matrix and, thus, significantly improves prediction accuracy. Experimental results confirm that PhyCMAP outperforms currently popular methods no matter how many sequence homologs are available for the protein under consideration. http://raptorx.uchicago.edu.
A high resolution radiation hybrid map of wheat chromosome 4A
USDA-ARS?s Scientific Manuscript database
Bread wheat has a large and complex allohexaploid genome with low recombination level at chromosome centromeric and peri-centromeric regions. This significantly hampers ordering of markers, contigs of physical maps and sequence scaffolds and impedes obtaining of high-quality reference genome sequenc...
Mapping and Sequencing the Human Genome
DOE R&D Accomplishments Database
1988-01-01
Numerous meetings have been held and a debate has developed in the biological community over the merits of mapping and sequencing the human genome. In response a committee to examine the desirability and feasibility of mapping and sequencing the human genome was formed to suggest options for implementing the project. The committee asked many questions. Should the analysis of the human genome be left entirely to the traditionally uncoordinated, but highly successful, support systems that fund the vast majority of biomedical research. Or should a more focused and coordinated additional support system be developed that is limited to encouraging and facilitating the mapping and eventual sequencing of the human genome. If so, how can this be done without distorting the broader goals of biological research that are crucial for any understanding of the data generated in such a human genome project. As the committee became better informed on the many relevant issues, the opinions of its members coalesced, producing a shared consensus of what should be done. This report reflects that consensus.
2014-01-01
Background Recent advancements in next-generation sequencing technology have enabled cost-effective sequencing of whole or partial genomes, permitting the discovery and characterization of molecular polymorphisms. Double-digest restriction-site associated DNA sequencing (ddRAD-seq) is a powerful and inexpensive approach to developing numerous single nucleotide polymorphism (SNP) markers and constructing a high-density genetic map. To enrich genomic resources for Japanese eel (Anguilla japonica), we constructed a ddRAD-based genetic map using an Ion Torrent Personal Genome Machine and anchored scaffolds of the current genome assembly to 19 linkage groups of the Japanese eel. Furthermore, we compared the Japanese eel genome with genomes of model fishes to infer the history of genome evolution after the teleost-specific genome duplication. Results We generated the ddRAD-based linkage map of the Japanese eel, where the maps for female and male spanned 1748.8 cM and 1294.5 cM, respectively, and were arranged into 19 linkage groups. A total of 2,672 SNP markers and 115 Simple Sequence Repeat markers provide anchor points to 1,252 scaffolds covering 151 Mb (13%) of the current genome assembly of the Japanese eel. Comparisons among the Japanese eel, medaka, zebrafish and spotted gar genomes showed highly conserved synteny among teleosts and revealed part of the eight major chromosomal rearrangement events that occurred soon after the teleost-specific genome duplication. Conclusions The ddRAD-seq approach combined with the Ion Torrent Personal Genome Machine sequencing allowed us to conduct efficient and flexible SNP genotyping. The integration of the genetic map and the assembled sequence provides a valuable resource for fine mapping and positional cloning of quantitative trait loci associated with economically important traits and for investigating comparative genomics of the Japanese eel. PMID:24669946
2012-01-01
Background Cultivated peanut (Arachis hypogaea L.) is an important crop worldwide, valued for its edible oil and digestible protein. It has a very narrow genetic base that may well derive from a relatively recent single polyploidization event. Accordingly molecular markers have low levels of polymorphism and the number of polymorphic molecular markers available for cultivated peanut is still limiting. Results Here, we report a large set of BAC-end sequences (BES), use them for developing SSR (BES-SSR) markers, and apply them in genetic linkage mapping. The majority of BESs had no detectable homology to known genes (49.5%) followed by sequences with similarity to known genes (44.3%), and miscellaneous sequences (6.2%) such as transposable element, retroelement, and organelle sequences. A total of 1,424 SSRs were identified from 36,435 BESs. Among these identified SSRs, dinucleotide (47.4%) and trinucleotide (37.1%) SSRs were predominant. The new set of 1,152 SSRs as well as about 4,000 published or unpublished SSRs were screened against two parents of a mapping population, generating 385 polymorphic loci. A genetic linkage map was constructed, consisting of 318 loci onto 21 linkage groups and covering a total of 1,674.4 cM, with an average distance of 5.3 cM between adjacent loci. Two markers related to resistance gene homologs (RGH) were mapped to two different groups, thus anchoring 1 RGH-BAC contig and 1 singleton. Conclusions The SSRs mined from BESs will be of use in further molecular analysis of the peanut genome, providing a novel set of markers, genetically anchoring BAC clones, and incorporating gene sequences into a linkage map. This will aid in the identification of markers linked to genes of interest and map-based cloning. PMID:22260238