dna sequencing progress: Topics by Science.gov

Sample records for dna sequencing progress

DNA Replication Profiling Using Deep Sequencing.

PubMed

Saayman, Xanita; Ramos-Pérez, Cristina; Brown, Grant W

2018-01-01

Profiling of DNA replication during progression through S phase allows a quantitative snapshot of replication origin usage and DNA replication fork progression. We present a method for using deep sequencing data to profile DNA replication in S. cerevisiae.
Next-Generation Sequencing-Based Detection of Circulating Tumour DNA After Allogeneic Stem Cell Transplantation for Lymphoma

PubMed Central

Herrera, Alex F.; Kim, Haesook T.; Kong, Katherine A.; Faham, Malek; Sun, Heather; Sohani, Aliyah R.; Alyea, Edwin P.; Carlton, Victoria E.; Chen, Yi-Bin; Cutler, Corey S.; Ho, Vincent T.; Koreth, John; Kotwaliwale, Chitra; Nikiforow, Sarah; Ritz, Jerome; Rodig, Scott J.; Soiffer, Robert J.; Antin, Joseph H.; Armand, Philippe

2016-01-01

Next-generation sequencing (NGS)-based circulating tumour DNA (ctDNA) detection is a promising monitoring tool for lymphoid malignancies. We evaluated whether the presence of ctDNA was associated with outcome after allogeneic haematopoietic stem cell transplantation (HSCT) in lymphoma patients. We studied 88 patients drawn from a phase 3 clinical trial of reduced-intensity conditioning HSCT in lymphoma. Conventional restaging and collection of peripheral blood samples occurred at pre-specified time points before and after HSCT and were assayed for ctDNA by sequencing of the immunoglobulin or T-cell receptor genes. Tumour clonotypes were identified in 87% of patients with adequate tumour samples. Sixteen of 19 (84%) patients with disease progression after HSCT had detectable ctDNA prior to progression at a median of 3.7 months prior to relapse/progression. Patients with detectable ctDNA 3 months after HSCT had inferior progression-free survival (PFS) (2-year PFS 58% versus 84% in ctDNA-negative patients, p=0.033). In multivariate models, detectable ctDNA was associated with increased risk of progression/death (Hazard ratio 3.9, p=0.003) and increased risk of relapse/progression (Hazard ratio 10.8, p=0.0006). Detectable ctDNA is associated with an increased risk of relapse/progression, but further validation studies are necessary to confirm these findings and determine the clinical utility of NGS-based minimal residual disease monitoring in lymphoma patients after HSCT. PMID:27711974
Next-generation sequencing-based detection of circulating tumour DNA After allogeneic stem cell transplantation for lymphoma.

PubMed

Herrera, Alex F; Kim, Haesook T; Kong, Katherine A; Faham, Malek; Sun, Heather; Sohani, Aliyah R; Alyea, Edwin P; Carlton, Victoria E; Chen, Yi-Bin; Cutler, Corey S; Ho, Vincent T; Koreth, John; Kotwaliwale, Chitra; Nikiforow, Sarah; Ritz, Jerome; Rodig, Scott J; Soiffer, Robert J; Antin, Joseph H; Armand, Philippe

2016-12-01

Next-generation sequencing (NGS)-based circulating tumour DNA (ctDNA) detection is a promising monitoring tool for lymphoid malignancies. We evaluated whether the presence of ctDNA was associated with outcome after allogeneic haematopoietic stem cell transplantation (HSCT) in lymphoma patients. We studied 88 patients drawn from a phase 3 clinical trial of reduced-intensity conditioning HSCT in lymphoma. Conventional restaging and collection of peripheral blood samples occurred at pre-specified time points before and after HSCT and were assayed for ctDNA by sequencing of the immunoglobulin or T-cell receptor genes. Tumour clonotypes were identified in 87% of patients with adequate tumour samples. Sixteen of 19 (84%) patients with disease progression after HSCT had detectable ctDNA prior to progression at a median of 3·7 months prior to relapse/progression. Patients with detectable ctDNA 3 months after HSCT had inferior progression-free survival (PFS) (2-year PFS 58% vs. 84% in ctDNA-negative patients, P = 0·033). In multivariate models, detectable ctDNA was associated with increased risk of progression/death (Hazard ratio 3·9, P = 0·003) and increased risk of relapse/progression (Hazard ratio 10·8, P = 0·0006). Detectable ctDNA is associated with an increased risk of relapse/progression, but further validation studies are necessary to confirm these findings and determine the clinical utility of NGS-based minimal residual disease monitoring in lymphoma patients after HSCT. © 2016 John Wiley & Sons Ltd.
DIVA V2.0

DOE Office of Scientific and Technical Information (OSTI.GOV)

CHEN, JOANNA; SIMIRENKO, LISA; TAPASWI, MANJIRI

The DIVA software interfaces a process in which researchers design their DNA with a web-based graphical user interface, submit their designs to a central queue, and a few weeks later receive their sequence-verified clonal constructs. Each researcher independently designs the DNA to be constructed with a web-based BioCAD tool, and presses a button to submit their designs to a central queue. Researchers have web-based access to their DNA design queues, and can track the progress of their submitted designs as they progress from "evaluation", to "waiting for reagents", to "in progress", to "complete". Researchers access their completed constructs through the central DNA repository. Along the way, all DNA construction success/failure rates are captured in a central database. Once a design has been submitted to the queue, a small number of dedicated staff evaluate the design for feasibility and provide feedback to the responsible researcher if the design is either unreasonable (e.g., encompasses a combinatorial library of a billion constructs) or small design changes could significantly facilitate the downstream implementation process. The dedicated staff then use DNA assembly design automation software to optimize the DNA construction process for the design, leveraging existing parts from the DNA repository where possible and ordering synthetic DNA where necessary. SynTrack software manages the physical locations and availability of the various requisite reagents and process inputs (e.g., DNA templates). Once all requisite process inputs are available, the design progresses from "waiting for reagents" to "in progress" in the design queue. Human-readable and machine-parseable DNA construction protocols output by the DNA assembly design automation software are then executed by the dedicated staff, exploiting lab automation devices wherever possible.
Since all the employed DNA construction methods are sequence-agnostic and standardized (they utilize the same enzymatic master mixes and reaction conditions), completely independent DNA construction tasks can be aggregated into the same multi-well plates and pursued in parallel. The resulting sets of cloned constructs can then be screened by high-throughput next-gen sequencing platforms for sequence correctness. A combination of long read-length (e.g., PacBio) and paired-end read platforms (e.g., Illumina) would be exploited depending on the particular task at hand (e.g., PacBio might be sufficient to screen a set of pooled constructs with significant gene divergence). Post sequence verification, designs for which at least one correct clone was identified will progress to a "complete" status, while designs for which no correct clones were identified will progress to a "failure" status. Depending on the failure mode (e.g., no transformants), and how many prior attempts/variations of assembly protocol have already been made for a given design, subsequent attempts may be made or the design can progress to a "permanent failure" state. All success and failure rate information will be captured during the process, including at which stage a given clonal construction procedure failed (e.g., no PCR product) and what the exact failure was (e.g., assembly piece 2 missing). This success/failure rate data can be leveraged to refine the DNA assembly design process.
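The design-queue statuses described in this record can be read as a small state machine. The following is a minimal sketch in Python, not the DIVA implementation: the class name `Design`, the `ALLOWED` table, and the choice that a retried design returns from "failure" to "in progress" are illustrative assumptions; only the status names come from the abstract.

```python
# Hypothetical sketch of the design-queue statuses named in the abstract.
# The transition table is an assumption; in particular, sending a retried
# design from "failure" back to "in progress" is a guess at how
# "subsequent attempts" might be modelled.
ALLOWED = {
    "evaluation": {"waiting for reagents"},
    "waiting for reagents": {"in progress"},
    "in progress": {"complete", "failure"},
    "failure": {"in progress", "permanent failure"},
    "complete": set(),
    "permanent failure": set(),
}


class Design:
    """A submitted DNA design that tracks its status and status history."""

    def __init__(self, name):
        self.name = name
        self.status = "evaluation"
        self.history = ["evaluation"]

    def advance(self, new_status):
        """Move to new_status, rejecting transitions the table does not allow."""
        if new_status not in ALLOWED[self.status]:
            raise ValueError(
                f"cannot move {self.name!r} from {self.status!r} to {new_status!r}"
            )
        self.status = new_status
        self.history.append(new_status)
```

Keeping the full `history` list, rather than only the current status, mirrors the abstract's point that success and failure information is captured at every stage.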
Recent patents of nanopore DNA sequencing technology: progress and challenges.

PubMed

Zhou, Jianfeng; Xu, Bingqian

2010-11-01

DNA sequencing techniques have developed rapidly in recent decades, primarily driven by the Human Genome Project. Among the proposed new techniques, nanopore sequencing has been considered a suitable candidate for single-molecule DNA sequencing with ultrahigh speed and very low cost. Several fabrication and modification techniques have been developed to produce robust and well-defined nanopore devices. Many efforts have also been made to apply nanopores to the analysis of the properties of DNA molecules. Compared with traditional sequencing techniques, nanopore sequencing has demonstrated distinctive advantages in the main practical issues, such as sample preparation, sequencing speed, cost-effectiveness, and read length. Although challenges remain, recent research on improving the capabilities of nanopores has shed light on how to achieve the ultimate goal: sequencing an individual DNA strand at the single-nucleotide level. This patent review briefly highlights recent developments and technological achievements for DNA analysis and sequencing at the single-molecule level, focusing on nanopore-based methods.
Capillary electrophoresis of Big-Dye terminator sequencing reactions for human mtDNA Control Region haplotyping in the identification of human remains.

PubMed

Montesino, Marta; Prieto, Lourdes

2012-01-01

Cycle sequencing reactions with Big-Dye terminators provide the methodology to analyze mtDNA Control Region amplicons by means of capillary electrophoresis. DNA sequencing with ddNTPs or terminators was developed by (1). The progressive automation of the method, combining the use of fluorescent-dye terminators with cycle sequencing, has made it possible to increase the sensitivity and efficiency of the method and hence has allowed its introduction into the forensic field. PCR-generated mitochondrial DNA products are the templates for sequencing reactions. Different sets of primers can be used to generate amplicons of different sizes according to the quality and quantity of the DNA extract, providing sequence data for different ranges inside the Control Region.
New developments in ancient genomics.

PubMed

Millar, Craig D; Huynen, Leon; Subramanian, Sankar; Mohandesan, Elmira; Lambert, David M

2008-07-01

Ancient DNA research is on the crest of a 'third wave' of progress due to the introduction of a new generation of DNA sequencing technologies. Here we review the advantages and disadvantages of the four new DNA sequencers that are becoming available to researchers. These machines now allow the recovery of orders of magnitude more DNA sequence data, albeit as short sequence reads. Hence, the potential reassembly of complete ancient genomes seems imminent, and when used to screen libraries of ancient sequences, these methods are cost effective. This new wealth of data is also likely to herald investigations into the functional properties of extinct genes and gene complexes and will improve our understanding of the biological basis of extinct phenotypes.
5-bp Classical Satellite DNA Loci from Chromosome-1 Instability in Cervical Neoplasia Detected by DNA Breakage Detection/Fluorescence in Situ Hybridization (DBD-FISH).

PubMed

Cortés-Gutiérrez, Elva I; Ortíz-Hernández, Brenda L; Dávila-Rodríguez, Martha I; Cerda-Flores, Ricardo M; Fernández, José Luis; López-Fernández, Carmen; Gosálvez, Jaime

2013-02-19

We aimed to evaluate the association between the progressive stages of cervical neoplasia and DNA damage in 5-bp classical satellite DNA sequences from chromosome-1 in cervical epithelium and in peripheral blood lymphocytes using DNA breakage detection/fluorescence in situ hybridization (DBD-FISH). A hospital-based unmatched case-control study was conducted in 2011 with a sample of 30 women grouped according to disease stage and selected according to histological diagnosis; 10 with low-grade squamous intraepithelial lesions (LG-SIL), 10 with high-grade SIL (HG-SIL), and 10 with no cervical lesions, from the Unidad Medica de Alta Especialidad of The Mexican Social Security Institute, IMSS, Mexico. Specific chromosome damage levels in 5-bp classical satellite DNA sequences from chromosome-1 were evaluated in cervical epithelium and peripheral blood lymphocytes using the DBD-FISH technique. Whole-genome DNA hybridization was used as a reference for the level of damage. Results of Kruskal-Wallis test showed a significant increase according to neoplastic development in both tissues. The instability of 5-bp classical satellite DNA sequences from chromosome-1 was evidenced using chromosome-orientation FISH. In conclusion, we suggest that the progression to malignant transformation involves an increase in the instability of 5-bp classical satellite DNA sequences from chromosome-1.
From famine to feast? Selecting nuclear DNA sequence loci for plant species-level phylogeny reconstruction

PubMed Central

Hughes, Colin E; Eastwood, Ruth J; Donovan Bailey, C

2005-01-01

Phylogenetic analyses of DNA sequences have prompted spectacular progress in assembling the Tree of Life. However, progress in constructing phylogenies among closely related species, at least for plants, has been less encouraging. We show that for plants, the rapid accumulation of DNA characters at higher taxonomic levels has not been matched by conventional sequence loci at the species level, leaving a lack of well-resolved gene trees that is hindering investigations of many fundamental questions in plant evolutionary biology. The most popular approach to address this problem has been to use low-copy nuclear genes as a source of DNA sequence data. However, this has had limited success because levels of variation among nuclear intron sequences across groups of closely related species are extremely variable and generally lower than conventionally used loci, and because no universally useful low-copy nuclear DNA sequence loci have been developed. This suggests that solutions will, for the most part, be lineage-specific, prompting a move away from ‘universal’ gene thinking for species-level phylogenetics. The benefits and limitations of alternative approaches to locate more variable nuclear loci are discussed and the potential of anonymous non-genic nuclear loci is highlighted. Given the virtually unlimited number of loci that can be generated using these new approaches, it is clear that effective screening will be critical for efficient selection of the most informative loci. Strategies for screening are outlined. PMID:16553318
RNA metabolism in the regulation of protein synthesis in plants. Progress report, 1975-1979

DOE Office of Scientific and Technical Information (OSTI.GOV)

Key, J L

1979-01-01

The major objectives of the research for the contract period covered by this report were (1) to gain an insight into the sequence organization of the DNA of soybean, emphasizing the arrangement of single copy or unique sequences and repetitive sequences of DNA throughout the genome, (2) to characterize soybean RNAs relative to nucleotide sequence complexity and kinetics of synthesis and turnover of poly A+ mRNA, and (3) to study ribosomal proteins directed to an analysis of possible changes in proteins which relate to the activation of 80S ribosomes and thus mRNA utilization and protein synthesis in response to environmental stimuli. Even with greatly reduced funding compared to that requested, objectives 1 and 2 were substantially accomplished. Because of reduced funding and the 20-month no cost extension, relatively little progress was made on objective 3. Accordingly objectives 1 and 2 will be summarized in some detail; a brief account of progress is presented on objective 3.
Method for phosphorothioate antisense DNA sequencing by capillary electrophoresis with UV detection.

PubMed

Froim, D; Hopkins, C E; Belenky, A; Cohen, A S

1997-11-01

The progress of antisense DNA therapy demands development of reliable and convenient methods for sequencing short single-stranded oligonucleotides. A method of phosphorothioate antisense DNA sequencing analysis using UV detection coupled to capillary electrophoresis (CE) has been developed based on a modified chain termination sequencing method. The proposed method reduces the sequencing cost since it uses affordable CE-UV instrumentation and requires no labeling with minimal sample processing before analysis. Cycle sequencing with ThermoSequenase generates quantities of sequencing products that are readily detectable by UV. Discrimination of undesired components from sequencing products in the reaction mixture, previously accomplished by fluorescent or radioactive labeling, is now achieved by bringing concentrations of undesired components below the UV detection range which yields a 'clean', well defined sequence. UV detection coupled with CE offers additional conveniences for sequencing since it can be accomplished with commercially available CE-UV equipment and is readily amenable to automation.
Method for phosphorothioate antisense DNA sequencing by capillary electrophoresis with UV detection.

PubMed Central

Froim, D; Hopkins, C E; Belenky, A; Cohen, A S

1997-01-01

The progress of antisense DNA therapy demands development of reliable and convenient methods for sequencing short single-stranded oligonucleotides. A method of phosphorothioate antisense DNA sequencing analysis using UV detection coupled to capillary electrophoresis (CE) has been developed based on a modified chain termination sequencing method. The proposed method reduces the sequencing cost since it uses affordable CE-UV instrumentation and requires no labeling with minimal sample processing before analysis. Cycle sequencing with ThermoSequenase generates quantities of sequencing products that are readily detectable by UV. Discrimination of undesired components from sequencing products in the reaction mixture, previously accomplished by fluorescent or radioactive labeling, is now achieved by bringing concentrations of undesired components below the UV detection range which yields a 'clean', well defined sequence. UV detection coupled with CE offers additional conveniences for sequencing since it can be accomplished with commercially available CE-UV equipment and is readily amenable to automation. PMID:9336449
[New hosts and vectors for genome cloning]. Progress report

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

The main goal of our project remains the development of new bacterial hosts and vectors for the stable propagation of human DNA clones in E. coli. During the past six months of our current budget period, we have (1) continued to develop new hosts that permit the stable maintenance of unstable features of human DNA, and (2) developed a series of vectors for (a) cloning large DNA inserts, (b) assessing the frequency of human sequences that are lethal to the growth of E. coli, and (c) assessing the stability of human sequences cloned in M13 for large-scale sequencing projects.
DNA nanomapping using CRISPR-Cas9 as a programmable nanoparticle.

PubMed

Mikheikin, Andrey; Olsen, Anita; Leslie, Kevin; Russell-Pavier, Freddie; Yacoot, Andrew; Picco, Loren; Payton, Oliver; Toor, Amir; Chesney, Alden; Gimzewski, James K; Mishra, Bud; Reed, Jason

2017-11-21

Progress in whole-genome sequencing using short-read (e.g., <150 bp), next-generation sequencing technologies has reinvigorated interest in high-resolution physical mapping to fill technical gaps that are not well addressed by sequencing. Here, we report two technical advances in DNA nanotechnology and single-molecule genomics: (1) we describe a labeling technique (CRISPR-Cas9 nanoparticles) for high-speed AFM-based physical mapping of DNA and (2) the first successful demonstration of using DVD optics to image DNA molecules with high-speed AFM. As a proof of principle, we used this new "nanomapping" method to detect and map precisely BCL2-IGH translocations present in lymph node biopsies of follicular lymphoma patients. This HS-AFM "nanomapping" technique can be complementary to both sequencing and other physical mapping approaches.
Support for HIV-1 Intervention Therapy

DTIC Science & Technology

1993-10-01

I. Kiselev, and E. S. Severin. 1990. Amplification of DNA 46 sequences of Epstein-Barr and human immunodeficiency viruses using DNA-polymerase from... develop and validate assays that predict or demonstrate disease progression for use in interventional trials with an emphasis on molecular biologic...to stay on the leading edge of technology development. A potential problem in obtaining quality sequence information is the occurrence of template
Comprehensive methylome analysis of ovarian tumors reveals hedgehog signaling pathway regulators as prognostic DNA methylation biomarkers.

PubMed

Huang, Rui-Lan; Gu, Fei; Kirma, Nameer B; Ruan, Jianhua; Chen, Chun-Liang; Wang, Hui-Chen; Liao, Yu-Ping; Chang, Cheng-Chang; Yu, Mu-Hsien; Pilrose, Jay M; Thompson, Ian M; Huang, Hsuan-Cheng; Huang, Tim Hui-Ming; Lai, Hung-Cheng; Nephew, Kenneth P

2013-06-01

Women with advanced stage ovarian cancer (OC) have a five-year survival rate of less than 25%. OC progression is associated with accumulation of epigenetic alterations and aberrant DNA methylation in gene promoters acts as an inactivating "hit" during OC initiation and progression. Abnormal DNA methylation in OC has been used to predict disease outcome and therapy response. To globally examine DNA methylation in OC, we used next-generation sequencing technology, MethylCap-sequencing, to screen 75 malignant and 26 normal or benign ovarian tissues. Differential DNA methylation regions (DMRs) were identified, and the Kaplan-Meier method and Cox proportional hazard model were used to correlate methylation with clinical endpoints. Functional role of specific genes identified by MethylCap-sequencing was examined in in vitro assays. We identified 577 DMRs that distinguished (p < 0.001) malignant from non-malignant ovarian tissues; of these, 63 DMRs correlated (p < 0.001) with poor progression free survival (PFS). Concordant hypermethylation and corresponding gene silencing of sonic hedgehog pathway members ZIC1 and ZIC4 in OC tumors was confirmed in a panel of OC cell lines, and ZIC1 and ZIC4 repression correlated with increased proliferation, migration and invasion. ZIC1 promoter hypermethylation correlated (p < 0.01) with poor PFS. In summary, we identified functional DNA methylation biomarkers significantly associated with clinical outcome in OC and suggest our comprehensive methylome analysis has significant translational potential for guiding the design of future clinical investigations targeting the OC epigenome. Methylation of ZIC1, a putative tumor suppressor, may be a novel determinant of OC outcome.
Complete mtDNA sequencing reveals mutations m.9185T>C and m.13513G>A in three patients with Leigh syndrome.

PubMed

Pelnena, Dita; Burnyte, Birute; Jankevics, Eriks; Lace, Baiba; Dagyte, Evelina; Grigalioniene, Kristina; Utkus, Algirdas; Krumina, Zita; Rozentale, Jolanta; Adomaitiene, Irina; Stavusis, Janis; Pliss, Liana; Inashkina, Inna

2017-12-12

The most common mitochondrial disorder in children is Leigh syndrome, which is a progressive and genetically heterogeneous neurodegenerative disorder caused by mutations in nuclear genes or mitochondrial DNA (mtDNA). In the present study, a novel and robust method of complete mtDNA sequencing, which allows amplification of the whole mitochondrial genome, was tested. Complete mtDNA sequencing was performed in a cohort of patients with suspected mitochondrial mutations. Patients from Latvia and Lithuania (n = 92 and n = 57, respectively) referred by clinical geneticists were included. The de novo point mutations m.9185T>C and m.13513G>A, respectively, were detected in two patients with lactic acidosis and neurodegenerative lesions. In one patient with neurodegenerative lesions, the mutation m.9185T>C was identified. These mutations are associated with Leigh syndrome. The present data suggest that full-length mtDNA sequencing is recommended as a supplement to nuclear gene testing and enzymatic assays to enhance mitochondrial disease diagnostics.
[New hosts and vectors for genome cloning]. Progress report, 1990--1991

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

The main goal of our project remains the development of new bacterial hosts and vectors for the stable propagation of human DNA clones in E. coli. During the past six months of our current budget period, we have (1) continued to develop new hosts that permit the stable maintenance of unstable features of human DNA, and (2) developed a series of vectors for (a) cloning large DNA inserts, (b) assessing the frequency of human sequences that are lethal to the growth of E. coli, and (c) assessing the stability of human sequences cloned in M13 for large-scale sequencing projects.
Basic quantitative polymerase chain reaction using real-time fluorescence measurements.

PubMed

Ares, Manuel

2014-10-01

This protocol uses quantitative polymerase chain reaction (qPCR) to measure the number of DNA molecules containing a specific contiguous sequence in a sample of interest (e.g., genomic DNA or cDNA generated by reverse transcription). The sample is subjected to fluorescence-based PCR amplification and, theoretically, during each cycle, two new duplex DNA molecules are produced for each duplex DNA molecule present in the sample. The progress of the reaction during PCR is evaluated by measuring the fluorescence of dsDNA-dye complexes in real time. In the early cycles, DNA duplication is not detected because inadequate amounts of DNA are made. At a certain threshold cycle, DNA-dye complexes double each cycle for 8-10 cycles, until the DNA concentration becomes so high and the primer concentration so low that the reassociation of the product strands blocks efficient synthesis of new DNA and the reaction plateaus. There are two types of measurements: (1) the relative change of the target sequence compared to a reference sequence and (2) the determination of molecule number in the starting sample. The first requires a reference sequence, and the second requires a sample of the target sequence with known numbers of the molecules of sequence to generate a standard curve. By identifying the threshold cycle at which a sample first begins to accumulate DNA-dye complexes exponentially, an estimation of the numbers of starting molecules in the sample can be extrapolated. © 2014 Cold Spring Harbor Laboratory Press.
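The two measurement types in this record can be illustrated with a short calculation. The following Python sketch is an illustration under stated assumptions, not the protocol's own analysis: the function names are hypothetical, the standard curve is a plain least-squares line of threshold cycle against log10 of the known molecule number, and the relative-change function assumes the idealized doubling per cycle described in the abstract (the commonly used 2^-ΔΔCt form, which the abstract does not name).

```python
import math


def fit_standard_curve(copies, ct_values):
    """Least-squares fit of Ct = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate starting molecule number."""
    return 10 ** ((ct - intercept) / slope)


def relative_change(ct_target_sample, ct_ref_sample,
                    ct_target_control, ct_ref_control):
    """Fold change of target relative to reference, sample versus control.

    Assumes perfect doubling every cycle, so each cycle of difference in
    threshold cycle corresponds to a factor of two.
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    return 2.0 ** -(delta_sample - delta_control)
```

Under perfect doubling the fitted slope is about -3.32 cycles per tenfold change in starting molecules; real assays usually deviate from this, which is why a measured standard curve is used rather than the ideal value.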
The DNA Bank: High-Security Bank Accounts to Protect and Share Your Genetic Identity.

PubMed

den Dunnen, Johan T

2015-07-01

With the cost of genome sequencing decreasing every day, DNA information has the potential of affecting the lives of everyone. Surprisingly, an individual has little knowledge about his own DNA information, can rarely access it, and has hardly any control over its use. This may result in preventable, life-threatening situations, and also significantly inhibits scientific progress. What we urgently need is a "DNA bank," a resource providing a secure personal account where, similar to a financial institution, you can store your DNA sequence. Using this private and secure DNA bank account, you govern your sequence-related business. For any genetic study performed, the data generated must be transferred (paid) to your DNA account. Using your account, you regulate access, knowing for what purpose (informed consent) and only for the genetic data you are willing to share. The DNA account ensures you are in the driver's seat, know what is known, and control what is happening with it. © 2015 WILEY PERIODICALS, INC.

Making the Bend: DNA Tertiary Structure and Protein-DNA Interactions

PubMed Central

Harteis, Sabrina; Schneider, Sabine

2014-01-01

DNA structure functions as an overlapping code to the DNA sequence. Rapid progress in understanding the role of DNA structure in gene regulation, DNA damage recognition and genome stability has been made. The three dimensional structure of both proteins and DNA plays a crucial role for their specific interaction, and proteins can recognise the chemical signature of DNA sequence (“base readout”) as well as the intrinsic DNA structure (“shape recognition”). These recognition mechanisms do not exist in isolation but, depending on the individual interaction partners, are combined to various extents. The driving force for the interaction between protein and DNA remains the unique thermodynamics of each individual DNA-protein pair. In this review we focus on the structures and conformations adopted by DNA, both influenced by and influencing the specific interaction with the corresponding protein binding partner, as well as their underlying thermodynamics. PMID:25026169
Plans and progress for building a Great Lakes fauna DNA barcode reference library

EPA Science Inventory

DNA reference libraries provide researchers with an important tool for assessing regional biodiversity by allowing unknown genetic sequences to be assigned identities, while also providing a means for taxonomists to validate identifications. Expanding the representation of Great...
[Hot topics of circulating tumor DNA testing in breast cancer].

PubMed

Liu, Y H; Zhou, B; Xu, L; Xin, L

2017-02-01

The progress of gene detection technologies represented by next generation sequencing (NGS) and digital PCR laid a foundation for studies of circulating tumor DNA (ctDNA) in breast cancer. In 2014, the NGS workgroup organized by the College of American Pathologists (CAP) published the College of American Pathologists' Laboratory Standards for Next-Generation Sequencing Clinical Tests, which provides a blueprint for the standardization of gene testing. In 2015, the Guidelines for Diagnostic Next-generation Sequencing published by the European Society of Human Genetics claimed that NGS is unacceptable in clinical practice before studies guided by guidelines are approved. Although existing studies show the benefits of ctDNA testing in disease monitoring and prognostic analysis, there is still a long way to go to standardize the procedure and establish strict detection criteria.
Adult cases of mitochondrial DNA depletion due to TK2 defect: an expanding spectrum.

PubMed

Béhin, A; Jardel, C; Claeys, K G; Fagart, J; Louha, M; Romero, N B; Laforêt, P; Eymard, B; Lombès, A

2012-02-28

In this study we aim to demonstrate the occurrence of adult forms of TK2 mutations causing progressive mitochondrial myopathy with significant muscle mitochondrial DNA (mtDNA) depletion. Patients' investigations included serum creatine kinase, blood lactate, electromyographic, echocardiographic, and functional respiratory analyses as well as TK2 gene sequencing and TK2 activity measurement. Mitochondrial activities and mtDNA were analyzed in the patients' muscle biopsy. The 3 adult patients with TK2 mutations presented with slowly progressive myopathy compatible with a fairly normal life during decades. Apart from its much slower progression, these patients' phenotype closely resembled that of pediatric cases including early onset, absence of CNS symptoms, generalized muscle weakness predominating on axial and proximal muscles but affecting facial, ocular, and respiratory muscles, typical mitochondrial myopathy with a mosaic pattern of COX-negative and ragged-red fibers, combined mtDNA-dependent respiratory complexes deficiency and mtDNA depletion. In accordance with the disease's relatively slow progression, the residual mtDNA content was higher than that observed in pediatric cases. That difference was not explained by the type of the TK2 mutations or by the residual TK2 activity. TK2 mutations can cause mitochondrial myopathy with a slow progression. Comparison of patients with similar mutations but different disease progression might address potential mechanisms of mtDNA maintenance modulation.
Isolation and characterization of adrenoleukodystrophy protein (ALDP) related sequences in the human genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Geraghty, M.T.; Stetten, G.; Kearns, W.

1994-09-01

X-linked adrenoleukodystrophy (ALD) is a disorder of peroxisomal β-oxidation of very long chain fatty acids. It presents either as progressive dementia in childhood or as progressive paraparesis in later years. Adrenal insufficiency occurs in both phenotypes. The gene of the ALD protein has been mapped to Xq28 and has recently been cloned and characterized. The ALD protein has significant homology to the peroxisomal membrane protein, PMP70 and belongs to the ATP binding cassette superfamily of transporters. We screened a human genomic library with an ALDP cDNA and isolated 5 different but highly similar clones containing sequences corresponding to the 3′ end of the ALDP gene. Comparison of the sequences over the region corresponding to exon 9 through the 3′ end of the ALDP gene reveals approximately 96% nucleotide identity in both exonic and intronic regions. Splice sites and open reading frames are maintained. Using both FISH and human-rodent DNA mapping panels, we positively assign these ALDP-related sequences to chromosomes 2, 16 and 22, and provisionally to 1 and 20. Southern blot of primate DNA probed with a partial ALDP cDNA (exon 2-10) shows that expansion of ALDP-related sequences occurred in higher primates (chimp, gorilla and human). Although Northern blots show multiple ALDP-hybridizing transcripts in certain tissues, we have no evidence to date for expression of these ALDP-related sequences. In conclusion, our data show there has been an unusual and recent dispersal to multiple chromosomes of structural gene sequences related to the ALDP gene. The functional significance of these sequences remains to be determined but their existence complicates PCR and mutation analysis of the ALDP gene.
Oncogenic LINE-1 Retroelements Sustain Prostate Tumor Cells and Promote Metastatic Progression

DTIC Science & Technology

2015-10-01

elements in prostate cancer contribute to its progression by activating oncogenic DNA sequences, or silencing tumor suppressor-like sequences. We have...prostate cancer cells. Experiments are ongoing to determine if PIWIL-1 expression in prostate cancer cells will reduce their growth, thereby providing...proof of principle for future gene-based therapeutics for this cancer. Subject terms: Prostate cancer, LINE-1, PIWIL-1, retrotransposons
DNA Clutch Probes for Circulating Tumor DNA Analysis.

PubMed

Das, Jagotamoy; Ivanov, Ivaylo; Sargent, Edward H; Kelley, Shana O

2016-08-31

Progress toward the development of minimally invasive liquid biopsies of disease is being bolstered by breakthroughs in the analysis of circulating tumor DNA (ctDNA): DNA released from cancer cells into the bloodstream. However, robust, sensitive, and specific methods of detecting this emerging analyte are lacking. ctDNA analysis has unique challenges, since it is imperative to distinguish circulating DNA from normal cells vs mutation-bearing sequences originating from tumors. Here we report the electrochemical detection of mutated ctDNA in samples collected from cancer patients. By developing a strategy relying on the use of DNA clutch probes (DCPs) that render specific sequences of ctDNA accessible, we were able to read out the presence of mutated ctDNA. DCPs prevent reassociation of denatured DNA strands: they make one of the two strands of a dsDNA accessible for hybridization to a probe, and they also deactivate other closely related sequences in solution. DCPs thereby ensure that only mutated sequences associate with chip-based sensors detecting hybridization events. The assay exhibits excellent sensitivity and specificity in the detection of mutated ctDNA: it detects 1 fg/μL of a target mutation in the presence of 100 pg/μL of wild-type DNA, corresponding to detecting mutations at a level of 0.01% relative to wild type. This approach allows accurate analysis of samples collected from lung cancer and melanoma patients. This work represents the first detection of ctDNA without enzymatic amplification.
Next-generation sequencing identifies major DNA methylation changes during progression of Ph+ chronic myeloid leukemia

PubMed Central

Heller, G; Topakian, T; Altenberger, C; Cerny-Reiterer, S; Herndlhofer, S; Ziegler, B; Datlinger, P; Byrgazov, K; Bock, C; Mannhalter, C; Hörmann, G; Sperr, W R; Lion, T; Zielinski, C C; Valent, P; Zöchbauer-Müller, S

2016-01-01

Little is known about the impact of DNA methylation on the evolution/progression of Ph+ chronic myeloid leukemia (CML). We investigated the methylome of CML patients in chronic phase (CP-CML), accelerated phase (AP-CML) and blast crisis (BC-CML) as well as in controls by reduced representation bisulfite sequencing. Although only ~600 differentially methylated CpG sites were identified in samples obtained from CP-CML patients compared with controls, ~6500 differentially methylated CpG sites were found in samples from BC-CML patients. In the majority of affected CpG sites, methylation was increased. In CP-CML patients who progressed to AP-CML/BC-CML, we identified up to 897 genes that were methylated at the time of progression but not at the time of diagnosis. Using RNA-sequencing, we observed downregulated expression of many of these genes in BC-CML compared with CP-CML samples. Several of them are well-known tumor-suppressor genes or regulators of cell proliferation, and gene re-expression was observed by the use of epigenetic active drugs. Together, our results demonstrate that CpG site methylation clearly increases during CML progression and that it may provide a useful basis for revealing new targets of therapy in advanced CML. PMID:27211271
Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics.

PubMed

Straub, Shannon C K; Parks, Matthew; Weitemier, Kevin; Fishbein, Mark; Cronn, Richard C; Liston, Aaron

2012-02-01

Just as Sanger sequencing did more than 20 years ago, next-generation sequencing (NGS) is poised to revolutionize plant systematics. By combining multiplexing approaches with NGS throughput, systematists may no longer need to choose between more taxa or more characters. Here we describe a genome skimming (shallow sequencing) approach for plant systematics. Through simulations, we evaluated optimal sequencing depth and performance of single-end and paired-end short read sequences for assembly of nuclear ribosomal DNA (rDNA) and plastomes and addressed the effect of divergence on reference-guided plastome assembly. We also used simulations to identify potential phylogenetic markers from low-copy nuclear loci at different sequencing depths. We demonstrated the utility of genome skimming through phylogenetic analysis of the Sonoran Desert clade (SDC) of Asclepias (Apocynaceae). Paired-end reads performed better than single-end reads. Minimum sequencing depths for high quality rDNA and plastome assemblies were 40× and 30×, respectively. Divergence from the reference significantly affected plastome assembly, but relatively similar references are available for most seed plants. Deeper rDNA sequencing is necessary to characterize intragenomic polymorphism. The low-copy fraction of the nuclear genome was readily surveyed, even at low sequencing depths. Nearly 160000 bp of sequence from three organelles provided evidence of phylogenetic incongruence in the SDC. Adoption of NGS will facilitate progress in plant systematics, as whole plastome and rDNA cistrons, partial mitochondrial genomes, and low-copy nuclear markers can now be efficiently obtained for molecular phylogenetics studies.
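The minimum depths quoted above (40× for rDNA, 30× for plastomes) can be turned into read counts with the usual coverage relation, mean depth = reads × read length / target length. The sketch below is illustrative only: the 160 kb plastome size, the 100 bp read length, and the `target_fraction` parameter (the share of all reads that come from the target) are assumptions for the example, not values taken from the study.

```python
import math


def mean_depth(n_reads, read_len_bp, target_len_bp, target_fraction=1.0):
    """Mean coverage of a target given the total number of reads sequenced.

    target_fraction is the (hypothetical) share of reads that originate from
    the target, e.g. plastome reads within a total genomic DNA library.
    """
    return n_reads * target_fraction * read_len_bp / target_len_bp


def reads_needed(target_depth, target_len_bp, read_len_bp, target_fraction=1.0):
    """Total reads required for the target to reach target_depth on average."""
    if not 0 < target_fraction <= 1:
        raise ValueError("target_fraction must be in (0, 1]")
    on_target_reads = target_depth * target_len_bp / read_len_bp
    return math.ceil(on_target_reads / target_fraction)


if __name__ == "__main__":
    # Assumed example: 30x over a 160 kb plastome with 100 bp reads,
    # if every read were plastid-derived.
    print(reads_needed(30, 160_000, 100))
```

In a genome-skimming library only a minority of reads are organellar, so the total read count scales with the inverse of `target_fraction`; that fraction varies by taxon and tissue and would have to be measured rather than assumed.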
High Mitochondrial DNA Stability in B-Cell Chronic Lymphocytic Leukemia

PubMed Central

Cerezo, María; Bandelt, Hans-Jürgen; Martín-Guerrero, Idoia; Ardanaz, Maite; Vega, Ana; Carracedo, Ángel; García-Orad, África; Salas, Antonio

2009-01-01

Background Chronic Lymphocytic Leukemia (CLL) leads to progressive accumulation of lymphocytes in the blood, bone marrow, and lymphatic tissues. Previous findings have suggested that the mtDNA could play an important role in CLL. Methodology/Principal Findings The mitochondrial DNA (mtDNA) control-region was analyzed in lymphocyte cell DNA extracts and compared with their granulocyte counterpart extract of 146 patients suffering from B-Cell CLL; B-CLL (all recruited from the Basque country). Major efforts were undertaken to rule out methodological artefacts that would render a high false positive rate for mtDNA instabilities and thus lead to erroneous interpretation of sequence instabilities. Only twenty instabilities were finally confirmed, most of them affecting the homopolymeric stretch located in the second hypervariable segment (HVS-II) around position 310, which is well known to constitute an extreme mutational hotspot of length polymorphism, as these mutations are frequently observed in the general human population. A critical revision of the findings in previous studies indicates a lack of proper methodological standards, which eventually led to an overinterpretation of the role of the mtDNA in CLL tumorigenesis. Conclusions/Significance Our results suggest that mtDNA instability is not the primary causal factor in B-CLL. A secondary role of mtDNA mutations cannot be fully ruled out under the hypothesis that the progressive accumulation of mtDNA instabilities could finally contribute to the tumoral process. Recommendations are given that would help to minimize erroneous interpretation of sequencing results in mtDNA studies in tumorigenesis. PMID:19924307
Direct non transcriptional role of NF-Y in DNA replication.

PubMed

Benatti, Paolo; Belluti, Silvia; Miotto, Benoit; Neusiedler, Julia; Dolfini, Diletta; Drac, Marjorie; Basile, Valentina; Schwob, Etienne; Mantovani, Roberto; Blow, J Julian; Imbriano, Carol

2016-04-01

NF-Y is a heterotrimeric transcription factor, which plays a pioneer role in the transcriptional control of promoters containing the CCAAT-box, among which genes involved in cell cycle regulation, apoptosis and DNA damage response. The knock-down of the sequence-specific subunit NF-YA triggers defects in S-phase progression, which lead to apoptotic cell death. Here, we report that NF-Y has a critical function in DNA replication progression, independent from its transcriptional activity. NF-YA colocalizes with early DNA replication factories, its depletion affects the loading of replisome proteins to DNA, among which Cdc45, and delays the passage from early to middle-late S phase. Molecular combing experiments are consistent with a role for NF-Y in the control of fork progression. Finally, we unambiguously demonstrate a direct non-transcriptional role of NF-Y in the overall efficiency of DNA replication, specifically in the DNA elongation process, using a Xenopus cell-free system. Our findings broaden the activity of NF-Y on a DNA metabolism other than transcription, supporting the existence of specific TFs required for proper and efficient DNA replication. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
TALE proteins search DNA using a rotationally decoupled mechanism.

PubMed

Cuculis, Luke; Abil, Zhanar; Zhao, Huimin; Schroeder, Charles M

2016-10-01

Transcription activator-like effector (TALE) proteins are a class of programmable DNA-binding proteins used extensively for gene editing. Despite recent progress, however, little is known about their sequence search mechanism. Here, we use single-molecule experiments to study TALE search along DNA. Our results show that TALEs utilize a rotationally decoupled mechanism for nonspecific search, despite remaining associated with DNA templates during the search process. Our results suggest that the protein helical structure enables TALEs to adopt a loosely wrapped conformation around DNA templates during nonspecific search, facilitating rapid one-dimensional (1D) diffusion under a range of solution conditions. Furthermore, this model is consistent with a previously reported two-state mechanism for TALE search that allows these proteins to overcome the search speed-stability paradox. Taken together, our results suggest that TALE search is unique among the broad class of sequence-specific DNA-binding proteins and supports efficient 1D search along DNA.
Saccharomycotina and Taphrinomycotina – progress in circumscription of genera

USDA-ARS's Scientific Manuscript database

Much progress has been made in understanding relationships among the yeasts. DNA barcoding (D1/D2, ITS) has provided a rapid means for species identification and phylogenetic analysis of gene sequences has shown that the Ascomycota is comprised of three major lineages, i.e, Saccharomycotina (buddin...
Scaling in nature: From DNA through heartbeats to weather

NASA Astrophysics Data System (ADS)

Havlin, S.; Buldyrev, S. V.; Bunde, A.; Goldberger, A. L.; Ivanov, P. Ch.; Peng, C.-K.; Stanley, H. E.

1999-12-01

The purpose of this talk is to describe some recent progress in applying scaling concepts to various systems in nature. We review several systems characterized by scaling laws such as DNA sequences, heartbeat rates and weather variations. We discuss the finding that the exponent α quantifying the scaling in DNA is smaller for coding than for noncoding sequences. We also discuss the application of fractal scaling analysis to the dynamics of heartbeat regulation, and report the recent finding that the scaling exponent α is smaller during sleep periods compared to wake periods. We also discuss the recent findings that suggest a universal scaling exponent characterizing the weather fluctuations.
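The abstract does not say how the exponent α is estimated. One widely used estimator for this kind of scaling is detrended fluctuation analysis (DFA), and the sketch below assumes that method with first-order (linear) detrending. The function name, the choice of window sizes, and the use of a generic numeric series are all assumptions for illustration; a DNA sequence would first have to be mapped to numbers (for example +1/−1 steps), which is not shown here.

```python
import numpy as np


def dfa_exponent(x, scales):
    """Estimate the DFA scaling exponent alpha of a 1-D numeric series.

    x      : sequence of numbers (e.g. inter-beat intervals or +1/-1 steps)
    scales : window lengths n at which the fluctuation F(n) is evaluated
    """
    x = np.asarray(x, dtype=float)
    profile = np.cumsum(x - x.mean())            # integrated, mean-centred series
    fluctuations = []
    for n in scales:
        n_win = len(profile) // n                # non-overlapping windows of length n
        segs = profile[: n_win * n].reshape(n_win, n)
        t = np.arange(n)
        # Fit a straight line to every window at once (columns = windows).
        coeffs = np.polyfit(t, segs.T, 1)        # shape (2, n_win)
        trend = np.outer(t, coeffs[0]) + coeffs[1]
        resid = segs.T - trend
        fluctuations.append(np.sqrt(np.mean(resid ** 2)))
    # alpha is the slope of log F(n) against log n.
    slope, _intercept = np.polyfit(np.log(scales), np.log(fluctuations), 1)
    return slope


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noise = rng.standard_normal(10_000)          # uncorrelated series
    print(dfa_exponent(noise, [16, 32, 64, 128, 256]))
```

Under this estimator an uncorrelated series gives α near 0.5 and its running sum gives α near 1.5, so a smaller α indicates weaker long-range correlation, which is the sense in which the values for coding and noncoding DNA or for sleep and wake periods are compared.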
Scaling in nature: from DNA through heartbeats to weather

NASA Technical Reports Server (NTRS)

Havlin, S.; Buldyrev, S. V.; Bunde, A.; Goldberger, A. L.; Peng, C. K.; Stanley, H. E.

1999-01-01

The purpose of this report is to describe some recent progress in applying scaling concepts to various systems in nature. We review several systems characterized by scaling laws such as DNA sequences, heartbeat rates and weather variations. We discuss the finding that the exponent alpha quantifying the scaling in DNA is smaller for coding than for noncoding sequences. We also discuss the application of fractal scaling analysis to the dynamics of heartbeat regulation, and report the recent finding that the scaling exponent alpha is smaller during sleep periods compared to wake periods. We also discuss the recent findings that suggest a universal scaling exponent characterizing the weather fluctuations.
Drafting human ancestry: what does the Neanderthal genome tell us about hominid evolution? Commentary on Green et al. (2010).

PubMed

Hofreiter, Michael

2011-02-01

Ten years after the first draft versions of the human genome were announced, technical progress in both DNA sequencing and ancient DNA analyses has allowed a research team around Ed Green and Svante Pääbo to complete this task from infinitely more difficult hominid samples: a few pieces of bone originating from our closest, albeit extinct, relatives, the Neanderthals. Pulling the Neanderthal sequences out of a sea of contaminating environmental DNA impregnating the bones and at the same time avoiding the problems of contamination with modern human DNA is in itself a remarkable accomplishment. However, the crucial question in the long run is, what can we learn from such genomic data about hominid evolution?
The Past, Present, and Future of Human Centromere Genomics

PubMed Central

Aldrup-MacDonald, Megan E.; Sullivan, Beth A.

2014-01-01

The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function. PMID:24683489
Neratinib Efficacy and Circulating Tumor DNA Detection of HER2 Mutations in HER2 Nonamplified Metastatic Breast Cancer.

PubMed

Ma, Cynthia X; Bose, Ron; Gao, Feng; Freedman, Rachel A; Telli, Melinda L; Kimmick, Gretchen; Winer, Eric; Naughton, Michael; Goetz, Matthew P; Russell, Christy; Tripathy, Debu; Cobleigh, Melody; Forero, Andres; Pluard, Timothy J; Anders, Carey; Niravath, Polly Ann; Thomas, Shana; Anderson, Jill; Bumb, Caroline; Banks, Kimberly C; Lanman, Richard B; Bryce, Richard; Lalani, Alshad S; Pfeifer, John; Hayes, Daniel F; Pegram, Mark; Blackwell, Kimberly; Bedard, Philippe L; Al-Kateb, Hussam; Ellis, Matthew J C

2017-10-01

Purpose: Based on promising preclinical data, we conducted a single-arm phase II trial to assess the clinical benefit rate (CBR) of neratinib, defined as complete/partial response (CR/PR) or stable disease (SD) ≥24 weeks, in HER2mut nonamplified metastatic breast cancer (MBC). Secondary endpoints included progression-free survival (PFS), toxicity, and circulating tumor DNA (ctDNA) HER2mut detection. Experimental Design: Tumor tissue positive for HER2mut was required for eligibility. Neratinib was administered 240 mg daily with prophylactic loperamide. ctDNA sequencing was performed retrospectively for 54 patients (14 positive and 40 negative for tumor HER2mut). Results: Nine of 381 tumors (2.4%) sequenced centrally harbored HER2mut (lobular 7.8% vs. ductal 1.6%; P = 0.026). Thirteen additional HER2mut cases were identified locally. Twenty-one of these 22 HER2mut cases were estrogen receptor positive. Sixteen patients [median age 58 (31-74) years and three (2-10) prior metastatic regimens] received neratinib. The CBR was 31% [90% confidence interval (CI), 13%-55%], including one CR, one PR, and three SD ≥24 weeks. Median PFS was 16 (90% CI, 8-31) weeks. Diarrhea (grade 2, 44%; grade 3, 25%) was the most common adverse event. Baseline ctDNA sequencing identified the same HER2mut in 11 of 14 tumor-positive cases (sensitivity, 79%; 90% CI, 53%-94%) and correctly assigned 32 of 32 informative negative cases (specificity, 100%; 90% CI, 91%-100%). In addition, ctDNA HER2mut variant allele frequency decreased in nine of 11 paired samples at week 4, followed by an increase upon progression. Conclusions: Neratinib is active in HER2mut, nonamplified MBC. ctDNA sequencing offers a noninvasive strategy to identify patients with HER2mut cancers for clinical trial participation. Clin Cancer Res; 23(19); 5687-95. ©2017 American Association for Cancer Research.
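The sensitivity and specificity above are simple proportions (11 of 14 and 32 of 32). The abstract does not name the interval method, but the reported 90% limits are consistent with exact (Clopper-Pearson) binomial intervals, so the sketch below assumes that method. Function names are illustrative, and the limits are found by plain bisection so that only the standard library is needed.

```python
from math import comb


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def _root_increasing(g):
    """Root on [0, 1] of a function that increases from negative to positive."""
    lo, hi = 0.0, 1.0
    for _ in range(60):                      # 60 halvings is ample for float precision
        mid = (lo + hi) / 2
        if g(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def clopper_pearson(x, n, conf=0.90):
    """Exact two-sided binomial confidence interval for x successes in n trials."""
    alpha = 1 - conf
    # Lower limit: the p at which P(X >= x) equals alpha/2.
    lower = 0.0 if x == 0 else _root_increasing(
        lambda p: (1 - binom_cdf(x - 1, n, p)) - alpha / 2)
    # Upper limit: the p at which P(X <= x) equals alpha/2.
    upper = 1.0 if x == n else _root_increasing(
        lambda p: alpha / 2 - binom_cdf(x, n, p))
    return lower, upper


if __name__ == "__main__":
    for label, x, n in [("sensitivity", 11, 14), ("specificity", 32, 32)]:
        lo, hi = clopper_pearson(x, n)
        print(f"{label}: {x}/{n} = {x / n:.0%}, 90% CI {lo:.0%}-{hi:.0%}")
```

When every case is classified correctly, as with 32 of 32, the lower limit has the closed form (α/2)^(1/n), here 0.05^(1/32) ≈ 0.911, which is why a perfect observed specificity still carries a lower bound of about 91%.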
Advances in DNA metabarcoding for food and wildlife forensic species identification.

PubMed

Staats, Martijn; Arulandhu, Alfred J; Gravendeel, Barbara; Holst-Jensen, Arne; Scholtens, Ingrid; Peelen, Tamara; Prins, Theo W; Kok, Esther

2016-07-01

Species identification using DNA barcodes has been widely adopted by forensic scientists as an effective molecular tool for tracking adulterations in food and for analysing samples from alleged wildlife crime incidents. DNA barcoding is an approach that involves sequencing of short DNA sequences from standardized regions and comparison to a reference database as a molecular diagnostic tool in species identification. In recent years, remarkable progress has been made towards developing DNA metabarcoding strategies, which involves next-generation sequencing of DNA barcodes for the simultaneous detection of multiple species in complex samples. Metabarcoding strategies can be used in processed materials containing highly degraded DNA e.g. for the identification of endangered and hazardous species in traditional medicine. This review aims to provide insight into advances of plant and animal DNA barcoding and highlights current practices and recent developments for DNA metabarcoding of food and wildlife forensic samples from a practical point of view. Special emphasis is placed on new developments for identifying species listed in the Convention on International Trade of Endangered Species (CITES) appendices for which reliable methods for species identification may signal and/or prevent illegal trade. Current technological developments and challenges of DNA metabarcoding for forensic scientists will be assessed in the light of stakeholders' needs.
Tumor Cell-Free DNA Copy Number Instability Predicts Therapeutic Response to Immunotherapy.

PubMed

Weiss, Glen J; Beck, Julia; Braun, Donald P; Bornemann-Kolatzki, Kristen; Barilla, Heather; Cubello, Rhiannon; Quan, Walter; Sangal, Ashish; Khemka, Vivek; Waypa, Jordan; Mitchell, William M; Urnovitz, Howard; Schütz, Ekkehard

2017-09-01

Purpose: Chromosomal instability is a fundamental property of cancer, which can be quantified by next-generation sequencing (NGS) from plasma/serum-derived cell-free DNA (cfDNA). We hypothesized that cfDNA could be used as a real-time surrogate for imaging analysis of disease status as a function of response to immunotherapy and as a more reliable tool than tumor biomarkers. Experimental Design: Plasma cfDNA sequences from 56 patients with diverse advanced cancers were prospectively collected and analyzed in a single-blind study for copy number variations, expressed as a quantitative chromosomal number instability (CNI) score versus 126 noncancer controls in a training set of 23 and a blinded validation set of 33. Tumor biomarker concentrations and a surrogate marker for T regulatory cells (Tregs) were comparatively analyzed. Results: Elevated CNI scores were observed in 51 of 56 patients prior to therapy. The blinded validation cohort provided an overall prediction accuracy of 83% (25/30) and a positive predictive value of CNI score for progression of 92% (11/12). The combination of CNI score before cycle (Cy) 2 and 3 yielded a correct prediction for progression in all 13 patients. The CNI score also correctly identified cases of pseudo-tumor progression from hyperprogression. Before Cy2 and Cy3, there was no significant correlation for protein tumor markers, total cfDNA, or surrogate Tregs. Conclusions: Chromosomal instability quantification in plasma cfDNA can serve as an early indicator of response to immunotherapy. The method has the potential to reduce health care costs and disease burden for cancer patients following further validation. Clin Cancer Res; 23(17); 5074-81. ©2017 American Association for Cancer Research.

Overcoming a nucleosomal barrier to replication

PubMed Central

Chang, Han-Wen; Pandey, Manjula; Kulaeva, Olga I.; Patel, Smita S.; Studitsky, Vasily M.

2016-01-01

Efficient overcoming and accurate maintenance of chromatin structure and associated histone marks during DNA replication are essential for normal functioning of the daughter cells. However, the molecular mechanisms of replication through chromatin are unknown. We have studied traversal of uniquely positioned mononucleosomes by T7 replisome in vitro. Nucleosomes present a strong, sequence-dependent barrier for replication, with particularly strong pausing of DNA polymerase at the +(31–40) and +(41–65) regions of the nucleosomal DNA. The exonuclease activity of T7 DNA polymerase increases the overall rate of progression of the replisome through a nucleosome, likely by resolving nonproductive complexes. The presence of nucleosome-free DNA upstream of the replication fork facilitates the progression of DNA polymerase through the nucleosome. After replication, at least 50% of the nucleosomes assume an alternative conformation, maintaining their original positions on the DNA. Our data suggest a previously unpublished mechanism for nucleosome maintenance during replication, likely involving transient formation of an intranucleosomal DNA loop. PMID:27847876
Tumorigenic Potential of Transit Amplifying Prostate Cells

DTIC Science & Technology

2012-06-01

by ChIP-Seq showed that in both the human prostate cell line LNCaP and in mouse prostate, NKX3.1 bound DNA fragments are significantly enriched in...progression. Cancer Cell. 2010;17(5):443–454. 29. Steadman DJ, Giuffrida D, Gelmann EP. DNA-binding sequence of the human prostate-specific...bind nucleosomal DNA and destabilize nucleosomes thereby allowing other transcription factors to access their sites (7),(8). BODY Aim 1: To
Isolation of anonymous DNA sequences from within a submicroscopic X chromosomal deletion in a patient with choroideremia, deafness, and mental retardation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nussbaum, R.L.; Lesko, J.G.; Lewis, R.A.

1987-09-01

Choroideremia, an X-chromosome linked retinal dystrophy of unknown pathogenesis, causes progressive nightblindness and eventual central blindness in affected males by the third to fourth decade of life. Choroideremia has been mapped to Xq13-21 by tight linkage to restriction fragment length polymorphism loci. The authors have recently identified two families in which choroideremia is inherited with mental retardation and deafness. In family XL-62, an interstitial deletion Xq21 is visible by cytogenetic analysis and two linked anonymous DNA markers, DXYS1 and DXS72, are deleted. In the second family, XL-45, an interstitial deletion was suspected on phenotypic grounds but could not be confirmed by high-resolution cytogenetic analysis. They used phenol-enhanced reassociation of 48,XXXX DNA in competition with excess XL-45 DNA to generate a library of cloned DNA enriched for sequences that might be deleted in XL-45. Two of the first 83 sequences characterized from the library were found to be deleted in probands from family XL-45 as well as from family XL-62. Isolation of these sequences proves that XL-45 does contain a submicroscopic deletion and provides a starting point for identifying overlapping genomic sequences that span the XL-45 deletion. Each overlapping sequence will be studied to identify exons from the choroideremia locus.
[Progress in genetic research of human height].

PubMed

Chen, Kaixu; Wang, Weilan; Zhang, Fuchun; Zheng, Xiufen

2015-08-01

It is well known that both environmental and genetic factors contribute to adult height variation in the general population. However, heritability studies have shown that the variation in height is more affected by genetic factors. Height is a typical polygenic trait which has been studied by traditional linkage analysis and association analysis to identify common DNA sequence variation associated with height, but progress has been slow. More recently, with the development of genotyping and DNA sequencing technologies, tremendous achievements have been made in genetic research of human height. Hundreds of single nucleotide polymorphisms (SNPs) associated with human height have been identified and validated with the application of genome-wide association studies (GWAS) methodology, which deepens our understanding of the genetics of human growth and development and also provides a theoretical basis and reference for studying other complex human traits. In this review, we summarize recent progress in genetic research of human height and discuss problems and prospects in this research area which may provide some insights into future genetic studies of human height.
A rapid, generally applicable method to engineer zinc fingers illustrated by targeting the HIV-1 promoter.

PubMed

Isalan, M; Klug, A; Choo, Y

2001-07-01

DNA-binding domains with predetermined sequence specificity are engineered by selection of zinc finger modules using phage display, allowing the construction of customized transcription factors. Despite remarkable progress in this field, the available protein-engineering methods are deficient in many respects, thus hampering the applicability of the technique. Here we present a rapid and convenient method that can be used to design zinc finger proteins against a variety of DNA-binding sites. This is based on a pair of pre-made zinc finger phage-display libraries, which are used in parallel to select two DNA-binding domains each of which recognizes given 5 base pair sequences, and whose products are recombined to produce a single protein that recognizes a composite (9 base pair) site of predefined sequence. Engineering using this system can be completed in less than two weeks and yields proteins that bind sequence-specifically to DNA with Kd values in the nanomolar range. To illustrate the technique, we have selected seven different proteins to bind various regions of the human immunodeficiency virus 1 (HIV-1) promoter.
DNA and histone methylation in gastric carcinogenesis

PubMed Central

Calcagno, Danielle Queiroz; Gigek, Carolina Oliveira; Chen, Elizabeth Suchi; Burbano, Rommel Rodriguez; Smith, Marília de Arruda Cardoso

2013-01-01

Epigenetic alterations contribute significantly to the development and progression of gastric cancer, one of the leading causes of cancer death worldwide. Epigenetics refers to the number of modifications of the chromatin structure that affect gene expression without altering the primary sequence of DNA, and these changes lead to transcriptional activation or silencing of the gene. Over the years, the study of epigenetic processes has increased, and novel therapeutic approaches that target DNA methylation and histone modifications have emerged. A greater understanding of epigenetics and the therapeutic potential of manipulating these processes is necessary for gastric cancer treatment. Here, we review recent research on the effects of aberrant DNA and histone methylation on the onset and progression of gastric tumors and the development of compounds that target enzymes that regulate the epigenome. PMID:23482412
DNA Sequences Proximal to Human Mitochondrial DNA Deletion Breakpoints Prevalent in Human Disease Form G-quadruplexes, a Class of DNA Structures Inefficiently Unwound by the Mitochondrial Replicative Twinkle Helicase

PubMed Central

Bharti, Sanjay Kumar; Sommers, Joshua A.; Zhou, Jun; Kaplan, Daniel L.; Spelbrink, Johannes N.; Mergny, Jean-Louis; Brosh, Robert M.

2014-01-01

Mitochondrial DNA deletions are prominent in human genetic disorders, cancer, and aging. It is thought that stalling of the mitochondrial replication machinery during DNA synthesis is a prominent source of mitochondrial genome instability; however, the precise molecular determinants of defective mitochondrial replication are not well understood. In this work, we performed a computational analysis of the human mitochondrial genome using the “Pattern Finder” G-quadruplex (G4) predictor algorithm to assess whether G4-forming sequences reside in close proximity (within 20 base pairs) to known mitochondrial DNA deletion breakpoints. We then used this information to map G4P sequences with deletions characteristic of representative mitochondrial genetic disorders and also those identified in various cancers and aging. Circular dichroism and UV spectral analysis demonstrated that mitochondrial G-rich sequences near deletion breakpoints prevalent in human disease form G-quadruplex DNA structures. A biochemical analysis of purified recombinant human Twinkle protein (gene product of c10orf2) showed that the mitochondrial replicative helicase inefficiently unwinds well characterized intermolecular and intramolecular G-quadruplex DNA substrates, as well as a unimolecular G4 substrate derived from a mitochondrial sequence that nests a deletion breakpoint described in human renal cell carcinoma. Although G4 has been implicated in the initiation of mitochondrial DNA replication, our current findings suggest that mitochondrial G-quadruplexes are also likely to be a source of instability for the mitochondrial genome by perturbing the normal progression of the mitochondrial replication machinery, including DNA unwinding by Twinkle helicase. PMID:25193669
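The computational step described above, finding G4-forming sequences within 20 base pairs of deletion breakpoints, can be sketched as a pattern scan followed by a distance filter. The abstract gives no details of the "Pattern Finder" algorithm, so this example substitutes the commonly used canonical motif (four runs of at least three guanines separated by loops of 1 to 7 nucleotides). The function names, the 0-based coordinates, and the single-strand scan are assumptions for illustration.

```python
import re

# Canonical G4 motif: four G-runs (>= 3 G each) separated by 1-7 nt loops.
# The lookahead lets a candidate be reported at every start position.
G4_PATTERN = re.compile(
    r"(?=(G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}))"
)


def find_g4(seq):
    """Return half-open (start, end) spans of candidate G4 motifs.

    Only the given strand is scanned; motifs on the complementary strand
    would appear as C-runs and need a second pass on the reverse complement.
    """
    seq = seq.upper()
    return [(m.start(1), m.end(1)) for m in G4_PATTERN.finditer(seq)]


def g4_near(seq, breakpoint, window=20):
    """Candidate motifs lying within `window` bp of a 0-based breakpoint."""
    hits = []
    for start, end in find_g4(seq):
        # Distance is 0 when the breakpoint falls inside the motif itself.
        dist = max(start - breakpoint, breakpoint - (end - 1), 0)
        if dist <= window:
            hits.append((start, end))
    return hits


if __name__ == "__main__":
    # Synthetic toy sequence, not mitochondrial data.
    toy = "A" * 30 + "GGGTGGGTGGGTGGG" + "A" * 30
    print(find_g4(toy))
    print(g4_near(toy, breakpoint=50))
```

A regular expression of this kind only flags sequence candidates; as the abstract notes, whether a G-rich sequence actually folds into a quadruplex was established experimentally by circular dichroism and UV spectral analysis.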
Separation/extraction, detection, and interpretation of DNA mixtures in forensic science (review).

PubMed

Tao, Ruiyang; Wang, Shouyu; Zhang, Jiashuo; Zhang, Jingyi; Yang, Zihao; Sheng, Xiang; Hou, Yiping; Zhang, Suhua; Li, Chengtao

2018-05-25

Interpreting mixed DNA samples containing material from multiple contributors has long been considered a major challenge in forensic casework, especially when encountering low-template DNA (LT-DNA) or high-order mixtures that may involve missing alleles (dropout) and unrelated alleles (drop-in), among others. In the last decades, extraordinary progress has been made in the analysis of mixed DNA samples, which has led to increasing attention to this research field. The advent of new methods for the separation and extraction of DNA from mixtures, novel or jointly applied genetic markers for detection and reliable interpretation approaches for estimating the weight of evidence, as well as the powerful massively parallel sequencing (MPS) technology, has greatly extended the range of mixed samples that can be correctly analyzed. Here, we summarized the investigative approaches and progress in the field of forensic DNA mixture analysis, hoping to provide some assistance to forensic practitioners and to promote further development involving this issue.
A DNA methylation map of human cancer at single base-pair resolution.

PubMed

Vidal, E; Sayols, S; Moran, S; Guillaumet-Adkins, A; Schroeder, M P; Royo, R; Orozco, M; Gut, M; Gut, I; Lopez-Bigas, N; Heyn, H; Esteller, M

2017-10-05

Although single base-pair resolution DNA methylation landscapes for embryonic and different somatic cell types provided important insights into epigenetic dynamics and cell-type specificity, such comprehensive profiling is incomplete across human cancer types. This prompted us to perform genome-wide DNA methylation profiling of 22 samples derived from normal tissues and associated neoplasms, including primary tumors and cancer cell lines. Unlike their invariant normal counterparts, cancer samples exhibited highly variable CpG methylation levels in a large proportion of the genome, involving progressive changes during tumor evolution. The whole-genome sequencing results from selected samples were replicated in a large cohort of 1112 primary tumors of various cancer types using genome-scale DNA methylation analysis. Specifically, we determined DNA hypermethylation of promoters and enhancers regulating tumor-suppressor genes, with potential cancer-driving effects. DNA hypermethylation events showed evidence of positive selection, mutual exclusivity and tissue specificity, suggesting their active participation in neoplastic transformation. Our data highlight the extensive changes in DNA methylation that occur in cancer onset, progression and dissemination.
DNA copy number changes define spatial patterns of heterogeneity in colorectal cancer

PubMed Central

Mamlouk, Soulafa; Childs, Liam Harold; Aust, Daniela; Heim, Daniel; Melching, Friederike; Oliveira, Cristiano; Wolf, Thomas; Durek, Pawel; Schumacher, Dirk; Bläker, Hendrik; von Winterfeld, Moritz; Gastl, Bastian; Möhr, Kerstin; Menne, Andrea; Zeugner, Silke; Redmer, Torben; Lenze, Dido; Tierling, Sascha; Möbs, Markus; Weichert, Wilko; Folprecht, Gunnar; Blanc, Eric; Beule, Dieter; Schäfer, Reinhold; Morkel, Markus; Klauschen, Frederick; Leser, Ulf; Sers, Christine

2017-01-01

Genetic heterogeneity between and within tumours is a major factor determining cancer progression and therapy response. Here we examined DNA sequence and DNA copy-number heterogeneity in colorectal cancer (CRC) by targeted high-depth sequencing of 100 most frequently altered genes. In 97 samples, with primary tumours and matched metastases from 27 patients, we observe inter-tumour concordance for coding mutations; in contrast, gene copy numbers are highly discordant between primary tumours and metastases as validated by fluorescent in situ hybridization. To further investigate intra-tumour heterogeneity, we dissected a single tumour into 68 spatially defined samples and sequenced them separately. We identify evenly distributed coding mutations in APC and TP53 in all tumour areas, yet highly variable gene copy numbers in numerous genes. 3D morpho-molecular reconstruction reveals two clusters with divergent copy number aberrations along the proximal–distal axis indicating that DNA copy number variations are a major source of tumour heterogeneity in CRC. PMID:28120820
Progress in ion torrent semiconductor chip based sequencing.

PubMed

Merriman, Barry; Rothberg, Jonathan M

2012-12-01

In order for next-generation sequencing to become widely used as a diagnostic in the healthcare industry, sequencing instrumentation will need to be mass produced with a high degree of quality and economy. One way to achieve this is to recast DNA sequencing in a format that fully leverages the manufacturing base created for computer chips, complementary metal-oxide semiconductor chip fabrication, which is the current pinnacle of large scale, high quality, low-cost manufacturing of high technology. To achieve this, ideally the entire sensory apparatus of the sequencer would be embodied in a standard semiconductor chip, manufactured in the same fab facilities used for logic and memory chips. Recently, such a sequencing chip, and the associated sequencing platform, has been developed and commercialized by Ion Torrent, a division of Life Technologies, Inc. Here we provide an overview of this semiconductor chip based sequencing technology, and summarize the progress made since its commercial introduction. We describe in detail the progress in chip scaling, sequencing throughput, read length, and accuracy. We also summarize the enhancements in the associated platform, including sample preparation, data processing, and engagement of the broader development community through open source and crowdsourcing initiatives. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Subangstrom Measurements of Enzyme Function Using a Biological Nanopore, SPRNT.

PubMed

Laszlo, A H; Derrington, I M; Gundlach, J H

2017-01-01

Nanopores are emerging as new single-molecule tools in the study of enzymes. Based on the progress in nanopore sequencing of DNA, a tool called Single-molecule Picometer Resolution Nanopore Tweezers (SPRNT) was developed to measure the movement of enzymes along DNA in real time. In this new method, an enzyme is loaded onto a DNA (or RNA) molecule. A single-stranded DNA end of this complex is drawn into a nanopore by an electrostatic potential that is applied across the pore. The single-stranded DNA passes through the pore's constriction until the enzyme comes into contact with the pore. Further progression of the DNA through the pore is then controlled by the enzyme. An ion current that flows through the pore's constriction is modulated by the DNA in the constriction. Analysis of ion current changes reveals the advance of the DNA with high spatiotemporal precision, thereby providing a real-time record of the enzyme's activity. Using an engineered version of the protein nanopore MspA, SPRNT has spatial resolution as small as 40pm at millisecond timescales, while simultaneously providing the DNA's sequence within the enzyme. In this chapter, SPRNT is introduced and its extraordinary potential is exemplified using the helicase Hel308. Two distinct substates are observed for each one-nucleotide advance; one of these about half-nucleotide long steps is ATP dependent and the other is ATP independent. The spatiotemporal resolution of this low-cost single-molecule technique lifts the study of enzymes to a new level of precision, enabling exploration of hitherto unobservable enzyme dynamics in real time. © 2017 Elsevier Inc. All rights reserved.
Using high-sensitivity sequencing for the detection of mutations in BTK and PLCγ2 genes in cellular and cell-free DNA and correlation with progression in patients treated with BTK inhibitors.

PubMed

Albitar, Adam; Ma, Wanlong; DeDios, Ivan; Estella, Jeffrey; Ahn, Inhye; Farooqui, Mohammed; Wiestner, Adrian; Albitar, Maher

2017-03-14

Patients with chronic lymphocytic leukemia (CLL) who develop resistance to Bruton tyrosine kinase (BTK) inhibitors are typically positive for mutations in BTK or phospholipase C gamma 2 (PLCγ2). We developed a high sensitivity (HS) assay utilizing wild-type blocking polymerase chain reaction achieved via bridged and locked nucleic acids. We used this high sensitivity assay in combination with Sanger sequencing and next generation sequencing (NGS) and tested cellular DNA and cell-free DNA (cfDNA) from patients with CLL treated with the BTK inhibitor ibrutinib. We also tested ibrutinib-naïve patients with CLL. HS testing achieved 100x greater sensitivity than Sanger. HS Sanger sequencing was capable of detecting < 1 mutant allele in a background of 1000 wild-type alleles (1:1000). Similar sensitivity was achieved with HS NGS. No BTK or PLCγ2 mutations were detected in any of the 44 ibrutinib-naïve CLL patients. We demonstrate that without the HS testing, 56% of BTK-positive samples and 85% of PLCγ2-positive samples would have been missed. With the use of HS, we were able to detect multiple mutant clones in the same sample in 37.5% of patients; most would have been missed without HS testing. We also demonstrate that with HS sequencing, plasma cfDNA is more reliable than cellular DNA in detecting mutations. Our studies indicate that wild-type blocking and HS sequencing are necessary for proper and early detection of BTK or PLCγ2 mutations in monitoring patients treated with BTK inhibitors. Furthermore, cfDNA from plasma is a very reliable sample type for testing.
A Review on the Applications of Next Generation Sequencing Technologies as Applied to Food-Related Microbiome Studies

PubMed Central

Cao, Yu; Fanning, Séamus; Proos, Sinéad; Jordan, Kieran; Srikumar, Shabarinath

2017-01-01

The development of next generation sequencing (NGS) techniques has enabled researchers to study and understand the world of microorganisms from broader and deeper perspectives. The contemporary advances in DNA sequencing technologies have not only enabled finer characterization of bacterial genomes but also provided deeper taxonomic identification of complex microbiomes, which in their genomic essence are the combined genetic material of the microorganisms inhabiting an environment, whether the environment be a particular body econiche (e.g., human intestinal contents) or a food manufacturing facility econiche (e.g., floor drain). To date, 16S rDNA sequencing, metagenomics and metatranscriptomics are the three basic sequencing strategies used in the taxonomic identification and characterization of food-related microbiomes. These sequencing strategies have used different NGS platforms for DNA and RNA sequence identification. Traditionally, 16S rDNA sequencing has played a key role in understanding the taxonomic composition of a food-related microbiome. Recently, metagenomic approaches have resulted in improved understanding of a microbiome by providing a species-level/strain-level characterization. Further, metatranscriptomic approaches have contributed to the functional characterization of the complex interactions between different microbial communities within a single microbiome. Many studies have highlighted the use of NGS techniques in investigating the microbiome of fermented foods. However, the utilization of NGS techniques in studying the microbiome of non-fermented foods is limited. This review provides a brief overview of the advances in DNA sequencing chemistries as the technology progressed from first, next and third generations and highlights how NGS provided a deeper understanding of food-related microbiomes with special focus on non-fermented foods. PMID:29033905
Analytical and Clinical Validation of a Digital Sequencing Panel for Quantitative, Highly Accurate Evaluation of Cell-Free Circulating Tumor DNA

PubMed Central

Zill, Oliver A.; Sebisanovic, Dragan; Lopez, Rene; Blau, Sibel; Collisson, Eric A.; Divers, Stephen G.; Hoon, Dave S. B.; Kopetz, E. Scott; Lee, Jeeyun; Nikolinakos, Petros G.; Baca, Arthur M.; Kermani, Bahram G.; Eltoukhy, Helmy; Talasaz, AmirAli

2015-01-01

Next-generation sequencing of cell-free circulating solid tumor DNA addresses two challenges in contemporary cancer care. First, this method of massively parallel and deep sequencing enables assessment of a comprehensive panel of genomic targets from a single sample, and second, it obviates the need for repeat invasive tissue biopsies. Digital Sequencing™ is a novel method for high-quality sequencing of circulating tumor DNA simultaneously across a comprehensive panel of over 50 cancer-related genes with a simple blood test. Here we report the analytic and clinical validation of the gene panel. Analytic sensitivity down to 0.1% mutant allele fraction is demonstrated via serial dilution studies of known samples. Near-perfect analytic specificity (> 99.9999%) enables complete coverage of many genes without the false positives typically seen with traditional sequencing assays at mutant allele frequencies or fractions below 5%. We compared digital sequencing of plasma-derived cell-free DNA to tissue-based sequencing on 165 consecutive matched samples from five outside centers in patients with stage III-IV solid tumor cancers. Clinical sensitivity of plasma-derived NGS was 85.0%, comparable to 80.7% sensitivity for tissue. The assay success rate on 1,000 consecutive samples in clinical practice was 99.8%. Digital sequencing of plasma-derived DNA is indicated in advanced cancer patients to prevent repeated invasive biopsies when the initial biopsy is inadequate, unobtainable for genomic testing, or uninformative, or when the patient’s cancer has progressed despite treatment. Its clinical utility is derived from reduction in the costs, complications and delays associated with invasive tissue biopsies for genomic testing. PMID:26474073
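The two quantities this abstract reports, mutant allele fraction and sensitivity, follow from simple ratios. The sketch below shows the arithmetic only; the function names and all counts are hypothetical and are not taken from the study.

```python
def mutant_allele_fraction(mutant_molecules, total_molecules):
    """Fraction of sequenced molecules at a locus that carry the mutant allele."""
    if total_molecules <= 0:
        raise ValueError("total_molecules must be positive")
    return mutant_molecules / total_molecules


def sensitivity(true_positives, false_negatives):
    """Proportion of truly positive cases that the assay detects."""
    total = true_positives + false_negatives
    if total <= 0:
        raise ValueError("need at least one positive case")
    return true_positives / total


# Hypothetical counts chosen for illustration: 5 mutant molecules in 5,000
# corresponds to a 0.1% mutant allele fraction.
print(f"{mutant_allele_fraction(5, 5000):.1%}")
# Hypothetical counts: 17 of 20 known-positive cases detected.
print(f"{sensitivity(17, 3):.1%}")
```

A 0.1% fraction means roughly one mutant molecule per thousand sequenced at that position, which is why very high specificity is needed to separate such calls from background error.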
New progress in snake mitochondrial gene rearrangement.

PubMed

Chen, Nian; Zhao, Shujin

2009-08-01

To further understand the evolution of snake mitochondrial genomes, the complete mitochondrial DNA (mtDNA) sequences were determined for representative species from two snake families: the Many-banded krait, the Banded krait, the Chinese cobra, the King cobra, the Hundred-pace viper, the Short-tailed mamushi, and the Chain viper. Thirteen protein-coding genes, 22-23 tRNA genes, 2 rRNA genes, and 2 control regions were identified in these mtDNAs. Duplication of the control region and translocation of the tRNAPro gene were two notable features of the snake mtDNAs. These results from the gene rearrangement comparisons confirm the correctness of traditional classification schemes and validate the utility of comparing complete mtDNA sequences for snake phylogeny reconstruction.
Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases

PubMed Central

Schadt, Eric E.; Banerjee, Onureena; Fang, Gang; Feng, Zhixing; Wong, Wing H.; Zhang, Xuegong; Kislyuk, Andrey; Clark, Tyson A.; Luong, Khai; Keren-Paz, Alona; Chess, Andrew; Kumar, Vipin; Chen-Plotkin, Alice; Sondheimer, Neal; Korlach, Jonas; Kasarskis, Andrew

2013-01-01

Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types. PMID:23093720
A putative peroxidase cDNA from turnip and analysis of the encoded protein sequence.

PubMed

Romero-Gómez, S; Duarte-Vázquez, M A; García-Almendárez, B E; Mayorga-Martínez, L; Cervantes-Avilés, O; Regalado, C

2008-12-01

A putative peroxidase cDNA was isolated from turnip roots (Brassica napus L. var. purple top white globe) by reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE). Total RNA extracted from mature turnip roots was used as a template for RT-PCR, using a degenerate primer designed to amplify the highly conserved distal motif of plant peroxidases. The resulting partial sequence was used to design the rest of the specific primers for 5' and 3' RACE. Two cDNA fragments were purified, sequenced, and aligned with the partial sequence from RT-PCR, and a complete overlapping sequence was obtained and labeled as BnPA (GenBank Accession No. AY423440, named as podC). The full-length cDNA is 1167 bp long and contains a 1077 bp open reading frame (ORF) encoding a deduced peroxidase polypeptide of 358 amino acids. The putative peroxidase (BnPA) showed a calculated Mr of 34 kDa and an isoelectric point (pI) of 4.5, with no significant identity with other reported turnip peroxidases. Sequence alignment showed that only three peroxidases have a significant identity with BnPA, namely AtP29a (84%) and AtPA2 (81%) from Arabidopsis thaliana, and HRPA2 (82%) from horseradish (Armoracia rusticana). Work is in progress to clone this gene into an adequate host to study the specific role and possible biotechnological applications of this alternative peroxidase source.
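The lengths quoted in this abstract are mutually consistent, as a short check shows. The sketch assumes that the 1077 bp ORF figure includes the stop codon; the abstract does not state this, and the function name is hypothetical.

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of the given length in base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # Assumption: the stop codon is counted in the ORF but encodes no residue.
    return codons - 1 if includes_stop else codons


cdna_bp = 1167  # full-length cDNA reported in the abstract
orf_bp = 1077   # ORF length reported in the abstract

print(orf_protein_length(orf_bp))  # 359 codons minus the stop codon
print(cdna_bp - orf_bp)            # bases left for the 5' and 3' untranslated regions
```

Under that assumption, 1077 / 3 = 359 codons gives 358 residues, matching the reported polypeptide, and 90 bp of the cDNA lie outside the ORF.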
From Conventional to Next Generation Sequencing of Epstein-Barr Virus Genomes.

PubMed

Kwok, Hin; Chiang, Alan Kwok Shing

2016-02-24

Genomic sequences of Epstein-Barr virus (EBV) have been of interest because the virus is associated with cancers, such as nasopharyngeal carcinoma, and conditions such as infectious mononucleosis. The progress of whole-genome EBV sequencing has been limited by the inefficiency and cost of the first-generation sequencing technology. With the advancement of next-generation sequencing (NGS) and target enrichment strategies, an increasing number of EBV genomes have been published. These genomes were sequenced using different approaches, either with or without EBV DNA enrichment. This review provides an overview of the EBV genomes published to date, and a description of the sequencing technology and bioinformatic analyses employed in generating these sequences. We further explored ways through which the quality of sequencing data can be improved, such as using DNA oligos for capture hybridization, and longer insert size and read length in the sequencing runs. These advances will enable large-scale genomic sequencing of EBV, which will facilitate a better understanding of the genetic variations of EBV in different geographic regions and discovery of potentially pathogenic variants in specific diseases.

The Replication Focus Targeting Sequence (RFTS) Domain Is a DNA-competitive Inhibitor of Dnmt1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Syeda, Farisa; Fagan, Rebecca L.; Wean, Matthew

Dnmt1 (DNA methyltransferase 1) is the principal enzyme responsible for maintenance of cytosine methylation at CpG dinucleotides in the mammalian genome. The N-terminal replication focus targeting sequence (RFTS) domain of Dnmt1 has been implicated in subcellular localization, protein association, and catalytic function. However, progress in understanding its function has been limited by the lack of assays for and a structure of this domain. Here, we show that the naked DNA- and polynucleosome-binding activities of Dnmt1 are inhibited by the RFTS domain, which functions by virtue of binding the catalytic domain to the exclusion of DNA. Kinetic analysis with a fluorogenic DNA substrate established the RFTS domain as a 600-fold inhibitor of Dnmt1 enzymatic activity. The crystal structure of the RFTS domain reveals a novel fold and supports a mechanism in which an RFTS-targeted Dnmt1-binding protein, such as Uhrf1, may activate Dnmt1 for DNA binding.
Cell-free DNA and next-generation sequencing in the service of personalized medicine for lung cancer

PubMed Central

Bennett, Catherine W.; Berchem, Guy; Kim, Yeoun Jin; El-Khoury, Victoria

2016-01-01

Personalized medicine has emerged as the future of cancer care to ensure that patients receive individualized treatment specific to their needs. In order to provide such care, molecular techniques that enable oncologists to diagnose, treat, and monitor tumors are necessary. In the field of lung cancer, cell free DNA (cfDNA) shows great potential as a less invasive liquid biopsy technique, and next-generation sequencing (NGS) is a promising tool for analysis of tumor mutations. In this review, we outline the evolution of cfDNA and NGS and discuss the progress of using them in a clinical setting for patients with lung cancer. We also present an analysis of the role of cfDNA as a liquid biopsy technique and NGS as an analytical tool in studying EGFR and MET, two frequently mutated genes in lung cancer. Ultimately, we hope that using cfDNA and NGS for cancer diagnosis and treatment will become standard for patients with lung cancer and across the field of oncology. PMID:27589834
The promise and challenge of high-throughput sequencing of the antibody repertoire

PubMed Central

Georgiou, George; Ippolito, Gregory C; Beausang, John; Busse, Christian E; Wardemann, Hedda; Quake, Stephen R

2014-01-01

Efforts to determine the antibody repertoire encoded by B cells in the blood or lymphoid organs using high-throughput DNA sequencing technologies have been advancing at an extremely rapid pace and are transforming our understanding of humoral immune responses. Information gained from high-throughput DNA sequencing of immunoglobulin genes (Ig-seq) can be applied to detect B-cell malignancies with high sensitivity, to discover antibodies specific for antigens of interest, to guide vaccine development and to understand autoimmunity. Rapid progress in the development of experimental protocols and informatics analysis tools is helping to reduce sequencing artifacts, to achieve more precise quantification of clonal diversity and to extract the most pertinent biological information. That said, broader application of Ig-seq, especially in clinical settings, will require the development of a standardized experimental design framework that will enable the sharing and meta-analysis of sequencing data generated by different laboratories. PMID:24441474
Technological advances in precision medicine and drug development.

PubMed

Maggi, Elaine; Patterson, Nicole E; Montagna, Cristina

New technologies are rapidly becoming available to expand the arsenal of tools accessible for precision medicine and to support the development of new therapeutics. Advances in liquid biopsies, which analyze cells, DNA, RNA, proteins, or vesicles isolated from the blood, have gained particular interest for their uses in acquiring information reflecting the biology of tumors and metastatic tissues. Through advancements in DNA sequencing that have merged unprecedented accuracy with affordable cost, personalized treatments based on genetic variations are becoming a real possibility. Extraordinary progress has been achieved in the development of biological therapies aimed to even further advance personalized treatments. We provide a summary of current and future applications of blood based liquid biopsies and how new technologies are utilized for the development of biological therapeutic treatments. We discuss current and future sequencing methods with an emphasis on how technological advances will support the progress in the field of precision medicine.
Structure and specificity of the RNA-guided endonuclease Cas9 during DNA interrogation, target binding and cleavage

PubMed Central

Josephs, Eric A.; Kocak, D. Dewran; Fitzgibbon, Christopher J.; McMenemy, Joshua; Gersbach, Charles A.; Marszalek, Piotr E.

2015-01-01

CRISPR-associated endonuclease Cas9 cuts DNA at variable target sites designated by a Cas9-bound RNA molecule. Cas9's ability to be directed by single ‘guide RNA’ molecules to target nearly any sequence has been recently exploited for a number of emerging biological and medical applications. Therefore, understanding the nature of Cas9's off-target activity is of paramount importance for its practical use. Using atomic force microscopy (AFM), we directly resolve individual Cas9 and nuclease-inactive dCas9 proteins as they bind along engineered DNA substrates. High-resolution imaging allows us to determine their relative propensities to bind with different guide RNA variants to targeted or off-target sequences. Mapping the structural properties of Cas9 and dCas9 to their respective binding sites reveals a progressive conformational transformation at DNA sites with increasing sequence similarity to its target. With kinetic Monte Carlo (KMC) simulations, these results provide evidence of a ‘conformational gating’ mechanism driven by the interactions between the guide RNA and the 14th–17th nucleotide region of the targeted DNA, the stabilities of which we find correlate significantly with reported off-target cleavage rates. KMC simulations also reveal potential methodologies to engineer guide RNA sequences with improved specificity by considering the invasion of guide RNAs into targeted DNA duplex. PMID:26384421
From Structure-Function Analyses to Protein Engineering for Practical Applications of DNA Ligase

PubMed Central

Tanabe, Maiko; Nishida, Hirokazu

2015-01-01

DNA ligases are indispensable in all living cells and ubiquitous in all organs. DNA ligases are broadly utilized in molecular biology research fields, such as genetic engineering and DNA sequencing technologies. Here we review the utilization of DNA ligases in a variety of in vitro gene manipulations, developed over the past several decades. During this period, fewer protein engineering attempts for DNA ligases have been made, as compared to those for DNA polymerases. We summarize the recent progress in the elucidation of the DNA ligation mechanisms obtained from the tertiary structures solved thus far, in each step of the ligation reaction scheme. We also present some examples of engineered DNA ligases, developed from the viewpoint of their three-dimensional structures. PMID:26508902
Demonstration of human T-cell lymphotropic virus type I (HTLV-I) from an HTLV-I seronegative south Indian patient with chronic, progressive spastic paraparesis.

PubMed

Nishimura, M; Mingioli, E; McFarlin, D E; Jacobson, S

1993-12-01

Here we describe a human T-cell lymphotropic virus type I (HTLV-I) seronegative patient from South India with a chronic, progressive spastic paraparesis from which HTLV-I has been isolated from peripheral blood lymphocytes. HTLV-I pol and tax viral sequences were detected in DNA from fresh peripheral blood lymphocytes (PBL) by polymerase chain reaction (PCR) and liquid hybridization techniques. Southern blot analysis of the PCR products demonstrated a low copy number of HTLV-I at the level of one viral copy per 10,000 fresh PBL. A long-term CD4+ T-cell line was established from PBL of this patient using recombinant interleukin-2, OKT3, and feeder cells. DNA from these cultured lines was amplified and portions of the HTLV-I long terminal repeat (U3), pol, env, and tax regions were sequenced (a total of 1,115 bp). The sequence data showed that the HTLV-I associated with this patient was 98.8% homologous to prototype HTLV-I. Southern blot analysis also confirmed the presence of full-length HTLV-I. These results indicate that HTLV-I can be demonstrated in an HTLV-I seronegative patient from South India with a chronic progressive neurological disorder.
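The sequence comparison in this abstract can be put in concrete terms. The sketch below treats the reported "98.8% homologous" figure as percent identity over the 1,115 bp sequenced, which is an assumption about how the authors computed it; the function names are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions that match between two aligned, equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


def implied_mismatches(length_bp, identity_percent):
    """Approximate number of differing positions implied by a percent identity."""
    return round(length_bp * (1 - identity_percent / 100.0))


# Figures from the abstract: 1,115 bp sequenced at 98.8% identity.
print(implied_mismatches(1115, 98.8))
# Toy aligned sequences (hypothetical) with one mismatch in four positions.
print(percent_identity("ACGT", "ACGA"))
```

Read this way, 98.8% over 1,115 bp corresponds to about 13 differing positions relative to the prototype sequence.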
The molecular biology of environmental aromatic hydrocarbons: Progress report for the period September 1, 1986 through July 31, 1987

DOE Office of Scientific and Technical Information (OSTI.GOV)

Weiss, S.B.

Our laboratory has explored the use of short DNA oligomers as targets for activated polycyclic aromatic hydrocarbons, such as benzo(a)pyrene diol epoxide (BPDE), in order to detect alterations in DNA sequence arrangement. In this model system, oligomers alkylated with (+)-BPDE are ligated into M13 viral DNA and used to transfect Escherichia coli. These cells are plated on agar and incubated at 37 °C; progeny viral clones are selected and amplified, and the isolated viral DNAs are sequenced at the site of oligomer insertion. We have devised a procedure for the preparation of unique duplex DNA oligomers such that the site of oligomer alkylation is specific for a single deoxynucleotide species in the two DNA strands. The procedure for oligomer assembly also allows us to vary the position of the alkylated residue in each of the two strands. Using our model system, the results obtained over the past year can be summarized as follows. When nonalkylated oligomer constructs are ligated into M13 viral DNA and used to transfect E. coli, no modifications in DNA sequence arrangement are detected in progeny viral DNAs. On the other hand, with oligomer constructs containing BP-adducts, two major types of modifications in DNA sequence arrangement were observed: (1) large deletions, and (2) nonhomologous (illegitimate) recombinants. Both of these DNA modifications result in the complete removal of the oligomer insert. Transfection of E. coli that are recA⁻ does not alter these DNA modifications; therefore, it appears that the deletions and recombinants induced by the alkylated inserts are not under control of the RecA gene. As the distance between the alkylated residues in the duplex strands is increased, the number of recombinant events detected is reduced. In addition to the above types of DNA modifications, restoration of the original nucleotide sequence in the alkylated construct was also observed in progeny viral DNAs. 7 refs., 6 figs., 2 tabs.
Population Based Assessment of MHC Class 1 Antigens Down Regulation as Marker in Increased Risk for Development and Progression of Breast Cancer From Benign Breast Lesions

DTIC Science & Technology

2006-01-01

isolated using a routine salting-out method (DNA E-Z Prepkit, Orchid Diagnostics Europe, St Katelijne Waver, Belgium). Sequence based typing In...electrophoresis using ethidium bromide to show the single 2 kb band before sequencing. Next, sequencing reactions were performed separately for exons 2, 3...Multiplex reverse transcription-polymerase chain reaction for simultaneous screening of 29 translocations and chromosomal aberrations in acute
Accurate RNA consensus sequencing for high-fidelity detection of transcriptional mutagenesis-induced epimutations.

PubMed

Reid-Bayliss, Kate S; Loeb, Lawrence A

2017-08-29

Transcriptional mutagenesis (TM) due to misincorporation during RNA transcription can result in mutant RNAs, or epimutations, that generate proteins with altered properties. TM has long been hypothesized to play a role in aging, cancer, and viral and bacterial evolution. However, inadequate methodologies have limited progress in elucidating a causal association. We present a high-throughput, highly accurate RNA sequencing method to measure epimutations with single-molecule sensitivity. Accurate RNA consensus sequencing (ARC-seq) uniquely combines RNA barcoding and generation of multiple cDNA copies per RNA molecule to eliminate errors introduced during cDNA synthesis, PCR, and sequencing. The stringency of ARC-seq can be scaled to accommodate the quality of input RNAs. We apply ARC-seq to directly assess transcriptome-wide epimutations resulting from RNA polymerase mutants and oxidative stress.
Characterization and modification of phage T7 DNA polymerase for use in DNA sequencing; Progress report, June 1, 1990--May 31, 1993

DOE Office of Scientific and Technical Information (OSTI.GOV)

Richardson, C.C.

1993-12-31

This project focuses on the DNA polymerase (gene 5 protein) of phage T7 for use in DNA sequence analysis. Gene 5 protein interacts with accessory proteins to acquire properties essential for DNA replication. One goal is to understand these interactions in order to modify the proteins for use in DNA sequencing. E. coli thioredoxin binds to gene 5 protein and clamps it to a primer-template. They have analyzed the binding of gene 5 protein-thioredoxin to primer-templates and have defined the optimal conditions to form an extremely stable complex with a dNTP in the polymerase catalytic site. The spatial proximity of these components has been determined using fluorescence emission anisotropy. The T7 DNA binding protein, the gene 2.5 protein, interacts with gene 5 protein and gene 4 protein to increase processivity and primer synthesis, respectively. Mutant gene 2.5 proteins have been isolated that do not interact with T7 DNA polymerase and cannot support T7 growth. The nucleotide binding site of the T7 helicase has been identified, and mutations affecting the site provide information on how the hydrolysis of NTPs fuels its unidirectional translocation. The sequence GTC has been shown to be necessary and sufficient for recognition by the T7 primase. The T7 gene 5.5 protein interacts with the E. coli nucleoid protein, H-NS, and also overcomes the phage λ rex restriction system.
Improved Analysis of Nanopore Sequence Data and Scanning Nanopore Techniques

NASA Astrophysics Data System (ADS)

Szalay, Tamas

The field of nanopore research has been driven by the need to inexpensively and rapidly sequence DNA. In order to help realize this goal, this thesis describes the PoreSeq algorithm that identifies and corrects errors in real-world nanopore sequencing data and improves the accuracy of de novo genome assembly with increasing coverage depth. The approach relies on modeling the possible sources of uncertainty that occur as DNA advances through the nanopore and then using this model to find the sequence that best explains multiple reads of the same region of DNA. PoreSeq increases nanopore sequencing read accuracy of M13 bacteriophage DNA from 85% to 99% at 100X coverage. We also use the algorithm to assemble E. coli with 30X coverage and the lambda genome at a range of coverages from 3X to 50X. Additionally, we classify sequence variants at an order of magnitude lower coverage than is possible with existing methods. This thesis also reports preliminary progress towards controlling the motion of DNA using two nanopores instead of one. The speed at which the DNA travels through the nanopore needs to be carefully controlled to facilitate the detection of individual bases. A second nanopore in close proximity to the first could be used to slow or stop the motion of the DNA in order to enable a more accurate readout. The fabrication process for a new pyramidal nanopore geometry was developed in order to facilitate the positioning of the nanopores. This thesis demonstrates that two of them can be placed close enough to interact with a single molecule of DNA, which is a prerequisite for being able to use the driving force of the pores to exert fine control over the motion of the DNA. Another strategy for reading the DNA is to trap it completely with one pore and to move the second nanopore instead. 
To that end, this thesis also shows that a single strand of immobilized DNA can be captured in a scanning nanopore and examined for a full hour, with data from many scans at many different voltages obtained in order to detect a bound protein placed partway along the molecule.
Diversification of transcription factor-DNA interactions and the evolution of gene regulatory networks.

PubMed

Rogers, Julia M; Bulyk, Martha L

2018-04-25

Sequence-specific transcription factors (TFs) bind short DNA sequences in the genome to regulate the expression of target genes. In the last decade, numerous technical advances have enabled the determination of the DNA-binding specificities of many of these factors. Large-scale screens of many TFs enabled the creation of databases of TF DNA-binding specificities, typically represented as position weight matrices (PWMs). Although great progress has been made in determining and predicting binding specificities systematically, there are still many surprises to be found when studying a particular TF's interactions with DNA in detail. Paralogous TFs' binding specificities can differ in subtle ways, in a manner that is not immediately apparent from looking at their PWMs. These differences affect gene regulatory outputs and enable TFs to rewire transcriptional networks over evolutionary time. This review discusses recent observations made in the study of TF-DNA interactions that highlight the importance of continued in-depth analysis of TF-DNA interactions and their inherent complexity. This article is categorized under: Biological Mechanisms > Regulatory Biology. © 2018 Wiley Periodicals, Inc.
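As an illustration of the position weight matrix (PWM) representation mentioned in this abstract, the following is a minimal sketch of how a PWM can be built from aligned binding sites and used to score a sequence. The base counts, the pseudocount, the uniform background frequency and the function names are all hypothetical choices made for this example; they are not taken from the review.

```python
import math

def pwm_from_counts(counts, pseudocount=1.0, background=0.25):
    """Turn per-position base counts into log2-odds scores against a uniform background."""
    pwm = []
    for column in counts:
        total = sum(column.values()) + 4 * pseudocount
        pwm.append({
            base: math.log2(((column.get(base, 0) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm

def best_site(pwm, sequence):
    """Return (score, start) of the highest-scoring window on the given strand."""
    width = len(pwm)
    scored = [
        (sum(pwm[i][sequence[start + i]] for i in range(width)), start)
        for start in range(len(sequence) - width + 1)
    ]
    return max(scored)

# Hypothetical counts from ten aligned binding sites (consensus TGAC).
toy_counts = [
    {"T": 9, "A": 1},
    {"G": 10},
    {"A": 8, "C": 2},
    {"C": 10},
]
toy_pwm = pwm_from_counts(toy_counts)
score, start = best_site(toy_pwm, "AATGACGT")
print(f"best window starts at {start} with log-odds score {score:.2f}")
```

A PWM of this kind treats positions as independent, which is one reason why subtle differences between paralogous TFs may not be visible in it.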
Challenges and progress in making DNA-based AIS early ...

EPA Pesticide Factsheets

The ability of DNA barcoding to find additional species in hard-to-sample locations or hard-to-identify samples is well established. Nevertheless, adoption of DNA barcoding into regular monitoring programs has been slow, in part due to issues of standardization and interpretation that need resolving. In this presentation, we describe our progress towards incorporating DNA-based identification into broad-spectrum aquatic invasive species early-detection monitoring in the Laurentian Great Lakes. Our work uses community biodiversity information as the basis for evaluating survey performance for various taxonomic groups. Issues we are tackling in bringing DNA-based data to bear on AIS monitoring design include: 1) Standardizing methodology and work flow from field collection and sample handling through bioinformatics post-processing; 2) Determining detection sensitivity and accounting for inter-species differences in DNA amplification and primer affinity; 3) Differentiating sequencing and barcoding errors from legitimate new finds when range and natural history information is limited; and 4) Accounting for the different nature of morphology- vs. DNA-based biodiversity information in subsequent analysis (e.g., via species accumulation curves, multi-metric indices).
A DNA methylation map of human cancer at single base-pair resolution

PubMed Central

Vidal, E; Sayols, S; Moran, S; Guillaumet-Adkins, A; Schroeder, M P; Royo, R; Orozco, M; Gut, M; Gut, I; Lopez-Bigas, N; Heyn, H; Esteller, M

2017-01-01

Although single base-pair resolution DNA methylation landscapes for embryonic and different somatic cell types provided important insights into epigenetic dynamics and cell-type specificity, such comprehensive profiling is incomplete across human cancer types. This prompted us to perform genome-wide DNA methylation profiling of 22 samples derived from normal tissues and associated neoplasms, including primary tumors and cancer cell lines. Unlike their invariant normal counterparts, cancer samples exhibited highly variable CpG methylation levels in a large proportion of the genome, involving progressive changes during tumor evolution. The whole-genome sequencing results from selected samples were replicated in a large cohort of 1112 primary tumors of various cancer types using genome-scale DNA methylation analysis. Specifically, we determined DNA hypermethylation of promoters and enhancers regulating tumor-suppressor genes, with potential cancer-driving effects. DNA hypermethylation events showed evidence of positive selection, mutual exclusivity and tissue specificity, suggesting their active participation in neoplastic transformation. Our data highlight the extensive changes in DNA methylation that occur in cancer onset, progression and dissemination. PMID:28581523
Computational optimisation of targeted DNA sequencing for cancer detection

NASA Astrophysics Data System (ADS)

Martinez, Pierre; McGranahan, Nicholas; Birkbak, Nicolai Juul; Gerlinger, Marco; Swanton, Charles

2013-12-01

Despite recent progress thanks to next-generation sequencing technologies, personalised cancer medicine is still hampered by intra-tumour heterogeneity and drug resistance. As most patients with advanced metastatic disease face poor survival, there is a need to improve early diagnosis. Analysing circulating tumour DNA (ctDNA) might represent a non-invasive method to detect mutations in patients, facilitating early detection. In this article, we define reduced gene panels from publicly available datasets as a first step to assess and optimise the potential of targeted ctDNA scans for early tumour detection. Dividing 4,467 samples into one discovery and two independent validation cohorts, we show that up to 76% of 10 cancer types harbour at least one mutation in a panel of only 25 genes, with high sensitivity across most tumour types. Our analyses demonstrate that targeting "hotspot" regions would introduce biases towards in-frame mutations and would compromise the reproducibility of tumour detection.
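One way to picture the idea of a reduced gene panel is as a coverage problem: choose a small set of genes so that as many samples as possible carry a mutation in at least one of them. The sketch below uses a greedy selection for this. The abstract does not say which optimisation procedure the authors used, so the greedy strategy, the gene names and the sample identifiers are all assumptions made for illustration.

```python
def greedy_panel(mutated_samples, panel_size):
    """Greedily pick genes that add the most not-yet-covered samples."""
    covered = set()
    panel = []
    for _ in range(panel_size):
        remaining = sorted(g for g in mutated_samples if g not in panel)
        if not remaining:
            break
        best = max(remaining, key=lambda g: len(mutated_samples[g] - covered))
        if not mutated_samples[best] - covered:
            break  # no remaining gene covers a new sample
        panel.append(best)
        covered |= mutated_samples[best]
    return panel, covered

# Hypothetical cohort of eight samples; s7 and s8 carry none of these mutations.
all_samples = {f"s{i}" for i in range(1, 9)}
toy_mutations = {
    "GENE_A": {"s1", "s2", "s3", "s4"},
    "GENE_B": {"s3", "s4", "s5"},
    "GENE_C": {"s5", "s6"},
    "GENE_D": {"s1", "s2"},
}
panel, covered = greedy_panel(toy_mutations, panel_size=2)
coverage = len(covered) / len(all_samples)
print(panel, f"{coverage:.0%} of samples covered")
```

A panel chosen on a discovery cohort in this way would still need to be checked on independent samples, which is the role of the validation cohorts described in the abstract.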
Base-Position Error Rate Analysis of Next-Generation Sequencing Applied to Circulating Tumor DNA in Non-Small Cell Lung Cancer: A Prospective Study

PubMed Central

Zonta, Eleonora; Didelot, Audrey; Combe, Pierre; Thibault, Constance; Gibault, Laure; Lours, Camille; Taly, Valérie; Laurent-Puig, Pierre

2016-01-01

Background Circulating tumor DNA (ctDNA) is an approved noninvasive biomarker to test for the presence of EGFR mutations at diagnosis or recurrence of lung cancer. However, studies evaluating ctDNA as a noninvasive “real-time” biomarker to provide prognostic and predictive information in treatment monitoring have given inconsistent results, mainly due to methodological differences. We have recently validated a next-generation sequencing (NGS) approach to detect ctDNA. Using this new approach, we evaluated the clinical usefulness of ctDNA monitoring in a prospective observational series of patients with non-small cell lung cancer (NSCLC). Methods and Findings We recruited 124 patients with newly diagnosed advanced NSCLC for ctDNA monitoring. The primary objective was to analyze the prognostic value of baseline ctDNA on overall survival. ctDNA was assessed by ultra-deep targeted NGS using our dedicated variant caller algorithm. Common mutations were validated by digital PCR. Out of the 109 patients with at least one follow-up marker mutation, plasma samples were contributive at baseline (n = 105), at first evaluation (n = 85), and at tumor progression (n = 66). We found that the presence of ctDNA at baseline was an independent marker of poor prognosis, with a median overall survival of 13.6 versus 21.5 mo (adjusted hazard ratio [HR] 1.82, 95% CI 1.01–3.55, p = 0.045) and a median progression-free survival of 4.9 versus 10.4 mo (adjusted HR 2.14, 95% CI 1.30–3.67, p = 0.002). It was also related to the presence of bone and liver metastasis. At first evaluation (E1) after treatment initiation, residual ctDNA was an early predictor of treatment benefit as judged by best radiological response and progression-free survival. Finally, negative ctDNA at E1 was associated with overall survival independently of Response Evaluation Criteria in Solid Tumors (RECIST) (HR 3.27, 95% CI 1.66–6.40, p < 0.001). 
Study population heterogeneity, over-representation of EGFR-mutated patients, and heterogeneous treatment types might limit the conclusions of this study, which require future validation in independent populations. Conclusions In this study of patients with newly diagnosed NSCLC, we found that ctDNA detection using targeted NGS was associated with poor prognosis. The heterogeneity of lung cancer molecular alterations, particularly at time of progression, impairs the ability of individual gene testing to accurately detect ctDNA in unselected patients. Further investigations are needed to evaluate the clinical impact of earlier evaluation times at 1 or 2 wk. Supporting clinical decisions, such as early treatment switching based on ctDNA positivity at first evaluation, will require dedicated interventional studies. PMID:28027313
Palaeoproteomics for human evolution studies

NASA Astrophysics Data System (ADS)

Welker, Frido

2018-06-01

The commonplace sequencing of Neanderthal, Denisovan and ancient modern human DNA continues to revolutionize our understanding of hominin phylogeny and interaction(s). The challenge with older fossils is that the progressive fragmentation of DNA even under optimal conditions, a function of time and temperature, results in ever shorter fragments of DNA. This process continues until no DNA can be sequenced or reliably aligned. Ancient proteins ultimately suffer a similar fate, but are a potential alternative source of biomolecular sequence data to investigate hominin phylogeny given their slower rate of fragmentation. In addition, ancient proteins have been proposed to potentially provide insights into in vivo biological processes and can be used to provide additional ecological information through large scale ZooMS (Zooarchaeology by Mass Spectrometry) screening of unidentifiable bone fragments. However, as initially with ancient DNA, most ancient protein research has focused on Late Pleistocene or Holocene samples from Europe. In addition, only a limited number of studies on hominin remains have been published. Here, an updated review on ancient protein analysis in human evolutionary contexts is given, including the identification of specific knowledge gaps and existing analytical limits, as well as potential avenues to overcome these.
The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome.

PubMed

Albayrak, Levent; Khanipov, Kamil; Pimenova, Maria; Golovko, George; Rojas, Mark; Pavlidis, Ioannis; Chumakov, Sergei; Aguilar, Gerardo; Chávez, Arturo; Widger, William R; Fofanov, Yuriy

2016-12-12

Low-abundance mutations in mitochondrial populations (mutations with minor allele frequency ≤ 1%) are associated with cancer, aging, and neurodegenerative disorders. While recent progress in high-throughput sequencing technology has significantly improved the heteroplasmy identification process, the ability of this technology to detect low-abundance mutations can be affected by the presence of similar sequences originating from nuclear DNA (nDNA). To determine to what extent nDNA can cause false positive low-abundance heteroplasmy calls, we have identified mitochondrial locations of all subsequences that are common or similar (one mismatch allowed) between nDNA and mitochondrial DNA (mtDNA). The analysis revealed up to a 25-fold variation in the lengths of the longest common and longest similar (one mismatch allowed) subsequences across the mitochondrial genome. The size of the longest subsequences shared between nDNA and mtDNA in several regions of the mitochondrial genome was found to be as low as 11 bases, which not only allows using these regions to design new, very specific PCR primers, but also supports the hypothesis of the non-random introduction of mtDNA into the human nuclear DNA. Analysis of the mitochondrial locations of the subsequences shared between nDNA and mtDNA suggested that even very short (36 bases) single-end sequencing reads can be used to identify low-abundance variation in 20.4% of the mitochondrial genome. For longer (76 and 150 bases) reads, the proportion of the mitochondrial genome where nDNA presence will not interfere was found to be 44.5% and 67.9%, respectively, whereas low-abundance mutations at 100% of locations can be identified using single reads 417 bases long. This observation suggests that the analysis of low-abundance variations in mitochondrial populations can be extended to a variety of large data collections such as NCBI Sequence Read Archive, European Nucleotide Archive, The Cancer Genome Atlas, and International Cancer Genome Consortium.
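The dependence on read length can be illustrated with a much simplified sketch: for each possible read start in a mitochondrial sequence, check whether a read of that length also occurs verbatim in the nuclear sequence. The toy sequences and the function name are hypothetical. Unlike the study, this sketch considers only exact matches on one strand (no mismatch allowed, no reverse complement, no circular genome).

```python
def unambiguous_fraction(mito, nuclear, read_length):
    """Fraction of read start positions in `mito` whose read is absent from `nuclear`."""
    n_starts = len(mito) - read_length + 1
    if n_starts <= 0:
        raise ValueError("read_length exceeds the mitochondrial sequence length")
    nuclear_reads = {
        nuclear[i:i + read_length]
        for i in range(len(nuclear) - read_length + 1)
    }
    unique = sum(
        1 for s in range(n_starts)
        if mito[s:s + read_length] not in nuclear_reads
    )
    return unique / n_starts

# Hypothetical toy sequences sharing a few short stretches.
toy_mito = "ACGTACGGTCA"
toy_nuclear = "TTACGTAGGTCATT"
short_reads = unambiguous_fraction(toy_mito, toy_nuclear, read_length=4)
long_reads = unambiguous_fraction(toy_mito, toy_nuclear, read_length=6)
print(short_reads, long_reads)
```

In this toy case the longer reads are unambiguous at every position while the shorter ones are not, which mirrors the trend reported above for 36-, 76-, 150- and 417-base reads.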
The Centromere: Chromatin Foundation for the Kinetochore Machinery

PubMed Central

Fukagawa, Tatsuo; Earnshaw, William C.

2014-01-01

Since discovery of the centromere-specific histone H3 variant CENP-A, centromeres have come to be defined as chromatin structures that establish the assembly site for the complex kinetochore machinery. In most organisms, centromere activity is defined epigenetically, rather than by specific DNA sequences. In this review, we describe selected classic work and recent progress in studies of centromeric chromatin with a focus on vertebrates. We consider possible roles for repetitive DNA sequences found at most centromeres, chromatin factors and modifications that assemble and activate CENP-A chromatin for kinetochore assembly, plus the use of artificial chromosomes and kinetochores to study centromere function. PMID:25203206

Digital PCR methods improve detection sensitivity and measurement precision of low abundance mtDNA deletions.

PubMed

Belmonte, Frances R; Martin, James L; Frescura, Kristin; Damas, Joana; Pereira, Filipe; Tarnopolsky, Mark A; Kaufman, Brett A

2016-04-28

Mitochondrial DNA (mtDNA) mutations are a common cause of primary mitochondrial disorders, and have also been implicated in a broad collection of conditions, including aging, neurodegeneration, and cancer. Prevalent among these pathogenic variants are mtDNA deletions, which show a strong bias for the loss of sequence in the major arc between, but not including, the heavy and light strand origins of replication. Because individual mtDNA deletions can accumulate focally, occur with multiple mixed breakpoints, and in the presence of normal mtDNA sequences, methods that detect broad-spectrum mutations with enhanced sensitivity and limited costs have both research and clinical applications. In this study, we evaluated semi-quantitative and digital PCR-based methods of mtDNA deletion detection using double-stranded reference templates or biological samples. Our aim was to describe key experimental assay parameters that will enable the analysis of low levels or small differences in mtDNA deletion load during disease progression, with limited false-positive detection. We determined that the digital PCR method significantly improved mtDNA deletion detection sensitivity through absolute quantitation, improved precision and reduced assay standard error.
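The absolute quantitation referred to here is commonly obtained in digital PCR by a Poisson correction: if a fraction p of partitions is positive, the mean number of target copies per partition is estimated as -ln(1 - p). The sketch below applies that standard formula. The partition counts, and the idea of expressing deletion load as a ratio of a deletion-specific assay to a total-mtDNA assay, are assumptions made for illustration and are not values or procedures taken from the study.

```python
import math

def copies_per_partition(positive, total):
    """Poisson-corrected mean target copies per partition."""
    if not 0 <= positive < total:
        raise ValueError("need 0 <= positive < total partitions")
    return -math.log(1 - positive / total)

# Hypothetical partition counts for two assays run on the same sample.
total_mtdna = copies_per_partition(positive=5000, total=20000)
deleted_mtdna = copies_per_partition(positive=200, total=20000)
deletion_load = deleted_mtdna / total_mtdna
print(f"estimated deletion load: {deletion_load:.2%}")
```

Because every partition is scored only as positive or negative, the estimate does not rely on a standard curve, which is one reason low-abundance targets can be measured with good precision.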
Digital PCR methods improve detection sensitivity and measurement precision of low abundance mtDNA deletions

PubMed Central

Belmonte, Frances R.; Martin, James L.; Frescura, Kristin; Damas, Joana; Pereira, Filipe; Tarnopolsky, Mark A.; Kaufman, Brett A.

2016-01-01

Mitochondrial DNA (mtDNA) mutations are a common cause of primary mitochondrial disorders, and have also been implicated in a broad collection of conditions, including aging, neurodegeneration, and cancer. Prevalent among these pathogenic variants are mtDNA deletions, which show a strong bias for the loss of sequence in the major arc between, but not including, the heavy and light strand origins of replication. Because individual mtDNA deletions can accumulate focally, occur with multiple mixed breakpoints, and in the presence of normal mtDNA sequences, methods that detect broad-spectrum mutations with enhanced sensitivity and limited costs have both research and clinical applications. In this study, we evaluated semi-quantitative and digital PCR-based methods of mtDNA deletion detection using double-stranded reference templates or biological samples. Our aim was to describe key experimental assay parameters that will enable the analysis of low levels or small differences in mtDNA deletion load during disease progression, with limited false-positive detection. We determined that the digital PCR method significantly improved mtDNA deletion detection sensitivity through absolute quantitation, improved precision and reduced assay standard error. PMID:27122135
Carbon nanostructures as immobilization platform for DNA: A review on current progress in electrochemical DNA sensors.

PubMed

Rasheed, P Abdul; Sandhyarani, N

2017-11-15

Development of a sensitive, specific and cost-effective DNA detection method is motivated by increasing demand for the early stage diagnosis of genetic diseases. Recent developments in the design and fabrication of efficient sensor platforms based on nanostructures make highly sensitive sensors, with detection limits as low as a few molecules, a realistic possibility. Electrochemical detection methods are widely used in DNA diagnostics as they provide a simple, accurate and inexpensive platform for DNA detection. In addition, electrochemical DNA sensors provide a direct electronic signal without the use of expensive signal transduction equipment and facilitate the immobilization of single stranded DNA (ssDNA) probe sequences on a wide variety of electrode substrates. A range of nanomaterials such as metal nanoparticles (MNPs), carbon based nanomaterials, quantum dots (QDs), magnetic nanoparticles and polymeric NPs have been introduced in the sensor design to enhance the sensing performance of electrochemical DNA sensors. In this review, we discuss recent progress in the design and fabrication of efficient electrochemical genosensors based on carbon nanostructures such as carbon nanotubes, graphene, graphene oxide and nanodiamonds. Copyright © 2017 Elsevier B.V. All rights reserved.
Diagnostics based on nucleic acid sequence variant profiling: PCR, hybridization, and NGS approaches.

PubMed

Khodakov, Dmitriy; Wang, Chunyan; Zhang, David Yu

2016-10-01

Nucleic acid sequence variations have been implicated in many diseases, and reliable detection and quantitation of DNA/RNA biomarkers can inform effective therapeutic action, enabling precision medicine. Nucleic acid analysis technologies being translated into the clinic can broadly be classified into hybridization, PCR, and sequencing, as well as their combinations. Here we review the molecular mechanisms of popular commercial assays, and their progress in translation into in vitro diagnostics. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
[Tonoplast transport and salt tolerance in plants]. Progress report

DOE Office of Scientific and Technical Information (OSTI.GOV)

Taiz, L.

1993-04-01

We have shown that the tonoplast V-ATPase could be specifically inhibited by antisense DNA to the catalytic (A) subunit and that cell expansion was inhibited in carrot transformants deficient in the enzyme, and have provided evidence for at least two different isoforms of the A subunit which are Golgi- and tonoplast-specific. These findings prompted a search for sequences of the isoforms of the A subunit in carrot. We have cloned and sequenced 1.0--1.5 kb fragments of three different genes for the catalytic subunit; the fragments differ greatly in their introns, but have nearly identical exons. We are using PCR to amplify and subclone carrot seedling cDNA. Thus far two bands have been amplified and are currently being subcloned for sequencing.
A feasibility study of colorectal cancer diagnosis via circulating tumor DNA derived CNV detection.

PubMed

Molparia, Bhuvan; Oliveira, Glenn; Wagner, Jennifer L; Spencer, Emily G; Torkamani, Ali

2018-01-01

Circulating tumor DNA (ctDNA) has shown great promise as a biomarker for early detection of cancer. However, due to the low abundance of ctDNA, especially at early stages, it is hard to detect at high accuracies while keeping sequencing costs low. Here we present a pilot stage study to detect large scale somatic copy number variations (CNVs), which contribute more molecules to ctDNA signal compared to point mutations, via cell free DNA sequencing. We show that it is possible to detect somatic CNVs in early stage colorectal cancer (CRC) patients and subsequently discriminate them from normal patients. With 25 normal and 24 CRC samples, we achieve 100% specificity (lower bound confidence interval: 86%) and ~79% sensitivity (95% confidence interval: 63%-95%), though the performance should be considered with caution given the limited sample size. We report a lack of concordance between the CNVs detected via cfDNA sequencing and CNVs identified in parent tissue samples. However, recent findings suggest that a lack of concordance is expected for CNVs in CRC because of their sub-clonal nature. Finally, the CNVs we detect very likely contribute to cancer progression as they lie in functionally important regions, and have been shown to be associated with CRC specifically. This study paves the path for a larger scale exploration of the potential of CNV detection for both diagnoses and prognoses of cancer.
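The reported intervals can be checked with a short calculation. The sketch below assumes that 19 of the 24 CRC samples were detected (inferred here from the stated ~79% sensitivity; the abstract does not give the count) and that all 25 normal samples were classified correctly. The abstract does not state which interval methods were used; a normal-approximation (Wald) interval for sensitivity and an exact two-sided 95% lower bound for specificity happen to reproduce the rounded figures, so those are the methods assumed here.

```python
import math

def wald_interval(successes, n, z=1.96):
    """Normal-approximation confidence interval for a proportion."""
    p = successes / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    return p - half_width, p + half_width

def exact_lower_bound_all_successes(n, alpha=0.05):
    """Exact (Clopper-Pearson) two-sided lower bound when all n trials succeed."""
    return (alpha / 2) ** (1 / n)

# Assumed counts: 19 of 24 CRC samples detected, 25 of 25 normals called normal.
sens_low, sens_high = wald_interval(19, 24)
spec_low = exact_lower_bound_all_successes(25)
print(f"sensitivity {19 / 24:.1%}, interval {sens_low:.1%} to {sens_high:.1%}")
print(f"specificity lower bound {spec_low:.1%}")
```

The width of both intervals shows why the authors caution that performance estimates from 49 samples should be read carefully.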
DNA methylation and the potential role of demethylating agents in prevention of progressive chronic kidney disease.

PubMed

Larkin, Benjamin P; Glastras, Sarah J; Chen, Hui; Pollock, Carol A; Saad, Sonia

2018-04-24

Chronic kidney disease (CKD) is a global epidemic, and its major risk factors include obesity and type 2 diabetes. Obesity not only promotes metabolic dysregulation and the development of diabetic kidney disease but also may independently lead to CKD by a variety of mechanisms, including endocrine and metabolic dysfunction, inflammation, oxidative stress, altered renal hemodynamics, and lipotoxicity. Deleterious renal effects of obesity can also be transmitted from one generation to the next, and it is increasingly recognized that offspring of obese mothers are predisposed to CKD. Epigenetic modifications are changes that regulate gene expression without altering the DNA sequence. Of these, DNA methylation is the most studied. Epigenetic imprints, particularly DNA methylation, are laid down during critical periods of fetal development, and they may provide a mechanism by which maternal-fetal transmission of chronic disease occurs. Our current review explores the evidence for the role of DNA methylation in the development of CKD, diabetic kidney disease, diabetes, and obesity. DNA methylation has been implicated in renal fibrosis (the final pathophysiologic pathway in the development of end-stage kidney disease), which supports the notion that demethylating agents may play a potential therapeutic role in preventing development and progression of CKD.
Deep sequencing reveals distinct patterns of DNA methylation in prostate cancer.

PubMed

Kim, Jung H; Dhanasekaran, Saravana M; Prensner, John R; Cao, Xuhong; Robinson, Daniel; Kalyana-Sundaram, Shanker; Huang, Christina; Shankar, Sunita; Jing, Xiaojun; Iyer, Matthew; Hu, Ming; Sam, Lee; Grasso, Catherine; Maher, Christopher A; Palanisamy, Nallasivam; Mehra, Rohit; Kominsky, Hal D; Siddiqui, Javed; Yu, Jindan; Qin, Zhaohui S; Chinnaiyan, Arul M

2011-07-01

Beginning with precursor lesions, aberrant DNA methylation marks the entire spectrum of prostate cancer progression. We mapped the global DNA methylation patterns in select prostate tissues and cell lines using MethylPlex-next-generation sequencing (M-NGS). Hidden Markov model-based next-generation sequence analysis identified ~68,000 methylated regions per sample. While global CpG island (CGI) methylation was not differential between benign adjacent and cancer samples, overall promoter CGI methylation significantly increased from ~12.6% in benign samples to 19.3% and 21.8% in localized and metastatic cancer tissues, respectively (P-value < 2 × 10⁻¹⁶). We found distinct patterns of promoter methylation around transcription start sites, where methylation occurred not only on the CGIs, but also on flanking regions and CGI sparse promoters. Among the 6691 methylated promoters in prostate tissues, 2481 differentially methylated regions (DMRs) are cancer-specific, including numerous novel DMRs. A novel cancer-specific DMR in the WFDC2 promoter showed frequent methylation in cancer (17/22 tissues, 6/6 cell lines), but not in the benign tissues (0/10) and normal PrEC cells. Integration of LNCaP DNA methylation and H3K4me3 data suggested an epigenetic mechanism for alternate transcription start site utilization, and these modifications segregated into distinct regions when present on the same promoter. Finally, we observed differences in repeat element methylation, particularly LINE-1, between ERG gene fusion-positive and -negative cancers, and we confirmed this observation using pyrosequencing on a tissue panel. This comprehensive methylome map will further our understanding of epigenetic regulation in prostate cancer progression.
CRISPR/Cas9 for genome editing: progress, implications and challenges.

PubMed

Zhang, Feng; Wen, Yan; Guo, Xiong

2014-09-15

The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) protein 9 system provides a robust and multiplexable genome editing tool, enabling researchers to precisely manipulate specific genomic elements, and facilitating the elucidation of target gene function in biology and diseases. CRISPR/Cas9 comprises a nonspecific Cas9 nuclease and a set of programmable sequence-specific CRISPR RNAs (crRNAs), which can guide Cas9 to cleave DNA and generate double-strand breaks at target sites. Subsequent cellular DNA repair processes lead to desired insertions, deletions or substitutions at target sites. The specificity of CRISPR/Cas9-mediated DNA cleavage requires target sequences matching the crRNA and a protospacer adjacent motif located downstream of the target sequences. Here, we review the molecular mechanism, applications and challenges of CRISPR/Cas9-mediated genome editing and the clinical therapeutic potential of CRISPR/Cas9 in the future. © The Author 2014. Published by Oxford University Press. All rights reserved.
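The targeting rule described in this abstract (a target sequence matching the crRNA, followed downstream by a protospacer adjacent motif) can be sketched as a simple sequence scan. The example assumes the NGG motif and 20-nucleotide protospacer commonly used with Streptococcus pyogenes Cas9; the abstract itself does not name a motif or a length. The function name and the toy sequence are hypothetical, and only the given strand is scanned.

```python
import re

def find_targets(dna, spacer_length=20):
    """List (start, protospacer, pam) for every site followed by an NGG motif."""
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % spacer_length)
    return [
        (match.start(), match.group(1), match.group(2))
        for match in pattern.finditer(dna.upper())
    ]

# Hypothetical sequence: 2 flanking bases, a 20-nt protospacer, the motif TGG, 2 more bases.
toy_dna = "TT" + "ACGT" * 5 + "TGG" + "CA"
targets = find_targets(toy_dna)
for start, protospacer, pam in targets:
    print(start, protospacer, pam)
```

A practical guide-design tool would also scan the reverse complement and check candidate sites against the rest of the genome for near matches, which this sketch does not attempt.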
DNA SEQUENCE SIMILARITY REQUIREMENTS FOR INTERSPECIFIC RECOMBINATION IN BACILLUS. (R825348)

EPA Science Inventory

The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...
Identification of species with DNA-based technology: current progress and challenges.

PubMed

Pereira, Filipe; Carneiro, João; Amorim, António

2008-01-01

One of the grand challenges of modern biology is to develop accurate and reliable technologies for a rapid screening of DNA sequence variation. This topic of research is of prime importance for the detection and identification of species in numerous fields of investigation, such as taxonomy, epidemiology, forensics, archaeology or ecology. Molecular identification is also central for the diagnosis, treatment and control of infections caused by different pathogens. In recent years, a variety of DNA-based approaches have been developed for the identification of individuals in a myriad of taxonomic groups. Here, we provide an overview of most commonly used assays, with emphasis on those based on DNA hybridizations, restriction enzymes, random PCR amplifications, species-specific PCR primers and DNA sequencing. A critical evaluation of all methods is presented focusing on their discriminatory power, reproducibility and user-friendliness. Having in mind that the current trend is to develop small-scale devices with a high-throughput capacity, we briefly review recent technological achievements for DNA analysis that offer great potentials for the identification of species.
Individual sequences in large sets of gene sequences may be distinguished efficiently by combinations of shared sub-sequences

PubMed Central

Gibbs, Mark J; Armstrong, John S; Gibbs, Adrian J

2005-01-01

Background Most current DNA diagnostic tests for identifying organisms use specific oligonucleotide probes that are complementary in sequence to, and hence only hybridise with the DNA of one target species. By contrast, in traditional taxonomy, specimens are usually identified by 'dichotomous keys' that use combinations of characters shared by different members of the target set. Using one specific character for each target is the least efficient strategy for identification. Using combinations of shared bisectionally-distributed characters is much more efficient, and this strategy is most efficient when they separate the targets in a progressively binary way. Results We have developed a practical method for finding minimal sets of sub-sequences that identify individual sequences, and could be targeted by combinations of probes, so that the efficient strategy of traditional taxonomic identification could be used in DNA diagnosis. The sizes of minimal sub-sequence sets depended mostly on sequence diversity and sub-sequence length and interactions between these parameters. We found that 201 distinct cytochrome oxidase subunit-1 (CO1) genes from moths (Lepidoptera) were distinguished using only 15 sub-sequences 20 nucleotides long, whereas only 8–10 sub-sequences 6–10 nucleotides long were required to distinguish the CO1 genes of 92 species from the 9 largest orders of insects. Conclusion The presence/absence of sub-sequences in a set of gene sequences can be used like the questions in a traditional dichotomous taxonomic key; hybridisation probes complementary to such sub-sequences should provide a very efficient means for identifying individual species, subtypes or genotypes. Sequence diversity and sub-sequence length are the major factors that determine the numbers of distinguishing sub-sequences in any set of sequences. PMID:15817134
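The presence/absence idea in this abstract can be sketched as a small set-selection problem. The code below uses a simple greedy heuristic, which is not the authors' method and is not guaranteed to find a minimal set; the function names and the toy sequences are hypothetical.

```python
from itertools import combinations


def kmers(seq, k):
    """All distinct sub-sequences of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def greedy_distinguishing_kmers(seqs, k):
    """Pick k-mers whose presence/absence patterns separate every pair.

    seqs maps a name to a sequence. Greedy heuristic (an assumption, not
    the published algorithm): repeatedly take the k-mer that separates
    the most still-unresolved pairs.
    """
    names = sorted(seqs)
    present = {n: kmers(seqs[n], k) for n in names}
    candidates = sorted(set().union(*present.values()))
    unresolved = set(combinations(names, 2))
    chosen = []
    while unresolved:
        def split(kmer):
            # Pairs in which exactly one member contains the k-mer.
            return {(a, b) for a, b in unresolved
                    if (kmer in present[a]) != (kmer in present[b])}
        best = max(candidates, key=lambda km: len(split(km)), default=None)
        gained = split(best) if best is not None else set()
        if not gained:
            raise ValueError("some sequences have identical k-mer content")
        chosen.append(best)
        unresolved -= gained
    return chosen


def signature(seq, chosen):
    """Presence/absence pattern of the chosen k-mers in one sequence."""
    return tuple(km in seq for km in chosen)


toy = {"a": "AAAA", "b": "AACC", "c": "CCCC", "d": "CCGG"}
chosen = greedy_distinguishing_kmers(toy, 2)
signatures = {name: signature(seq, chosen) for name, seq in toy.items()}
```

Each signature plays the role of a path through a dichotomous key: a sequence is identified by which of the chosen sub-sequences it contains.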
Scaling up discovery of hidden diversity in fungi: impacts of barcoding approaches.

PubMed

Yahr, Rebecca; Schoch, Conrad L; Dentinger, Bryn T M

2016-09-05

The fungal kingdom is a hyperdiverse group of multicellular eukaryotes with profound impacts on human society and ecosystem function. The challenge of documenting and describing fungal diversity is exacerbated by their typically cryptic nature, their ability to produce seemingly unrelated morphologies from a single individual and their similarity in appearance to distantly related taxa. This multiplicity of hurdles resulted in the early adoption of DNA-based comparisons to study fungal diversity, including linking curated DNA sequence data to expertly identified voucher specimens. DNA-barcoding approaches in fungi were first applied in specimen-based studies for identification and discovery of taxonomic diversity, but are now widely deployed for community characterization based on sequencing of environmental samples. Collectively, fungal barcoding approaches have yielded important advances across biological scales and research applications, from taxonomic, ecological, industrial and health perspectives. A major outstanding issue is the growing problem of 'sequences without names' that are somewhat uncoupled from the traditional framework of fungal classification based on morphology and preserved specimens. This review summarizes some of the most significant impacts of fungal barcoding, its limitations, and progress towards the challenge of effective utilization of the exponentially growing volume of data gathered from high-throughput sequencing technologies. This article is part of the themed issue 'From DNA barcodes to biomes'. © 2016 The Authors.
Simulations Meet Experiment to Reveal New Insights into DNA Intrinsic Mechanics

PubMed Central

Ben Imeddourene, Akli; Elbahnsi, Ahmad; Guéroult, Marc; Oguey, Christophe; Foloppe, Nicolas; Hartmann, Brigitte

2015-01-01

The accurate prediction of the structure and dynamics of DNA remains a major challenge in computational biology due to the dearth of precise experimental information on DNA free in solution and limitations in the DNA force-fields underpinning the simulations. A new generation of force-fields has been developed to better represent the sequence-dependent B-DNA intrinsic mechanics, in particular with respect to the BI ↔ BII backbone equilibrium, which is essential to understand the B-DNA properties. Here, the performance of MD simulations with the newly updated force-fields Parmbsc0εζOLI and CHARMM36 was tested against a large ensemble of recent NMR data collected on four DNA dodecamers involved in nucleosome positioning. We find impressive progress towards a coherent, realistic representation of B-DNA in solution, despite residual shortcomings. This improved representation allows new and deeper interpretation of the experimental observables, including regarding the behavior of facing phosphate groups in complementary dinucleotides, and their modulation by the sequence. It also provides the opportunity to extensively revisit and refine the coupling between backbone states and inter base pair parameters, which emerges as a common theme across all the complementary dinucleotides. In sum, the global agreement between simulations and experiment reveals new aspects of intrinsic DNA mechanics, a key component of DNA-protein recognition. PMID:26657165
Circulating mutational portrait of cancer: manifestation of aggressive clonal events in both early and late stages.

PubMed

Yang, Meng; Topaloglu, Umit; Petty, W Jeffrey; Pagni, Matthew; Foley, Kristie L; Grant, Stefan C; Robinson, Mac; Bitting, Rhonda L; Thomas, Alexandra; Alistar, Angela T; Desnoyers, Rodwige J; Goodman, Michael; Albright, Carol; Porosnicu, Mercedes; Vatca, Mihaela; Qasem, Shadi A; DeYoung, Barry; Kytola, Ville; Nykter, Matti; Chen, Kexin; Levine, Edward A; Staren, Edgar D; D'Agostino, Ralph B; Petro, Robin M; Blackstock, William; Powell, Bayard L; Abraham, Edward; Pasche, Boris; Zhang, Wei

2017-05-04

Solid tumors residing in tissues and organs leave footprints in circulation through circulating tumor cells (CTCs) and circulating tumor DNAs (ctDNA). Characterization of the ctDNA portraits and comparison with tumor DNA mutational portraits may reveal clinically actionable information on solid tumors that is traditionally achieved through more invasive approaches. We isolated ctDNAs from plasma of patients of 103 lung cancer and 74 other solid tumors of different tissue origins. Deep sequencing using the Guardant360 test was performed to identify mutations in 73 clinically actionable genes, and the results were associated with clinical characteristics of the patient. The mutation profiles of 37 lung cancer cases with paired ctDNA and tumor genomic DNA sequencing were used to evaluate clonal representation of tumor in circulation. Five lung cancer cases with longitudinal ctDNA sampling were monitored for cancer progression or response to treatments. Mutations in TP53, EGFR, and KRAS genes are most prevalent in our cohort. Mutation rates of ctDNA are similar in early (I and II) and late stage (III and IV) cancers. Mutation in DNA repair genes BRCA1, BRCA2, and ATM are found in 18.1% (32/177) of cases. Patients with higher mutation rates had significantly higher mortality rates. Lung cancer of never smokers exhibited significantly higher ctDNA mutation rates as well as higher EGFR and ERBB2 mutations than ever smokers. Comparative analysis of ctDNA and tumor DNA mutation data from the same patients showed that key driver mutations could be detected in plasma even when they were present at a minor clonal population in the tumor. Mutations of key genes found in the tumor tissue could remain in circulation even after frontline radiotherapy and chemotherapy suggesting these mutations represented resistance mechanisms. 
Longitudinal sampling of five lung cancer cases showed distinct changes in ctDNA mutation portraits that are consistent with cancer progression or response to EGFR drug treatment. This study demonstrates that ctDNA mutation rates in the key tumor-associated genes are clinical parameters relevant to smoking status and mortality. Mutations in ctDNA may serve as an early detection tool for cancer. This study quantitatively confirms the hypothesis that ctDNA in circulation is the result of dissemination of aggressive tumor clones and survival of resistant clones. This study supports the use of ctDNA profiling as a less-invasive approach to monitor cancer progression and to select appropriate drugs during cancer evolution.
Harnessing CRISPR-Cas systems for bacterial genome editing.

PubMed

Selle, Kurt; Barrangou, Rodolphe

2015-04-01

Manipulation of genomic sequences facilitates the identification and characterization of key genetic determinants in the investigation of biological processes. Genome editing via clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) constitutes a next-generation method for programmable and high-throughput functional genomics. CRISPR-Cas systems are readily reprogrammed to induce sequence-specific DNA breaks at target loci, resulting in fixed mutations via host-dependent DNA repair mechanisms. Although bacterial genome editing is a relatively unexplored and underrepresented application of CRISPR-Cas systems, recent studies provide valuable insights for the widespread future implementation of this technology. This review summarizes recent progress in bacterial genome editing and identifies fundamental genetic and phenotypic outcomes of CRISPR targeting in bacteria, in the context of tool development, genome homeostasis, and DNA repair. Copyright © 2015 Elsevier Ltd. All rights reserved.
DNA curvature and flexibility in vitro and in vivo

PubMed Central

Peters, Justin P.; Maher, L. James

2014-01-01

It has been more than 50 years since the elucidation of the structure of double-helical DNA. Despite active research and progress in DNA biology and biochemistry, much remains to be learned in the field of DNA biophysics. Predicting the sequence-dependent curvature and flexibility of DNA is difficult. Applicability of the conventional worm-like chain polymer model of DNA has been challenged. The fundamental forces responsible for the remarkable resistance of DNA to bending and twisting remain controversial. The apparent “softening” of DNA measured in vivo in the presence of kinking proteins and superhelical strain is incompletely understood. New methods and insights are being applied to these problems. This review places current work on DNA biophysics in historical context and illustrates the ongoing interplay between theory and experiment in this exciting field. PMID:20478077
High levels of heterozygosity found for 15 SSR loci in Solanum chacoense

USDA-ARS's Scientific Manuscript database

Genetic variation is a necessary prerequisite for improving domesticated plants through breeding; without it, breeding progress would be impossible. Genetic variation can be readily ascertained with co-dominant DNA markers, such as simple sequence repeats (SSRs). Twenty-four SSR markers specifically...
Cell-free DNA as a molecular tool for monitoring disease progression and response to therapy in breast cancer patients.

PubMed

Liang, Diana H; Ensor, Joe E; Liu, Zhe-Bin; Patel, Asmita; Patel, Tejal A; Chang, Jenny C; Rodriguez, Angel A

2016-01-01

Due to the spatial and temporal genomic heterogeneity of breast cancer, genomic sequencing obtained from a single biopsy may not capture the complete genomic profile of tumors. Thus, we propose that cell-free DNA (cfDNA) in plasma may be an alternate source of genomic information to provide comprehensive data throughout a patient's clinical course. We performed a retrospective chart review of 100 patients with stage 4 or high-risk stage 3 breast cancer. The degree of agreement between genomic alterations found in tumor DNA (tDNA) and cfDNA was determined by Cohen's kappa. Clinical disease progression was compared to mutant allele frequency using a two-sided Fisher's exact test. The presence of mutations and mutant allele frequency were correlated with progression-free survival (PFS) using a Cox proportional hazards model and a log-rank test. The most commonly found genomic alterations were mutations in TP53 and PIK3CA, and amplification of EGFR and ERBB2. PIK3CA mutation and ERBB2 amplification demonstrated robust agreement between tDNA and cfDNA (Cohen's kappa = 0.64 and 0.77, respectively). TP53 mutation and EGFR amplification demonstrated poor agreement between tDNA and cfDNA (Cohen's kappa = 0.18 and 0.33, respectively). The directional changes of TP53 and PIK3CA mutant allele frequency were closely associated with response to therapy (p = 0.002). The presence of TP53 mutation (p = 0.0004) and PIK3CA mutant allele frequency [p = 0.01, HR 1.074 (95% CI 1.018-1.134)] were excellent predictors of PFS. Identification of selected cancer-specific genomic alterations from cfDNA may be a noninvasive way to monitor disease progression, predict PFS, and offer targeted therapy.
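The agreement statistic used in this abstract, Cohen's kappa, compares observed agreement p_o with the agreement p_e expected by chance: kappa = (p_o - p_e) / (1 - p_e). A minimal sketch follows; the per-patient calls are invented toy data rather than values from the study, and the function name is hypothetical.

```python
from collections import Counter


def cohens_kappa(calls_a, calls_b):
    """Cohen's kappa for two equal-length lists of categorical calls."""
    if len(calls_a) != len(calls_b) or not calls_a:
        raise ValueError("need two non-empty lists of equal length")
    n = len(calls_a)
    # Observed agreement: fraction of samples with identical calls.
    p_o = sum(a == b for a, b in zip(calls_a, calls_b)) / n
    # Chance agreement: product of marginal frequencies, summed over labels.
    freq_a, freq_b = Counter(calls_a), Counter(calls_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if p_e == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_o - p_e) / (1 - p_e)


# Toy data (1 = alteration detected, 0 = not detected); not from the study.
tdna_calls = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
cfdna_calls = [1, 1, 0, 0, 0, 0, 1, 0, 1, 1]
kappa = cohens_kappa(tdna_calls, cfdna_calls)
```

In this toy example the two assays agree on 8 of 10 samples (p_o = 0.8) and chance agreement is 0.5, giving a kappa of about 0.6.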
Epigenetics of prostate cancer.

PubMed

McKee, Tawnya C; Tricoli, James V

2015-01-01

The introduction of novel technologies that can be applied to the investigation of the molecular underpinnings of human cancer has allowed for new insights into the mechanisms associated with tumor development and progression. They have also advanced the diagnosis, prognosis and treatment of cancer. These technologies include microarray and other analysis methods for the generation of large-scale gene expression data on both mRNA and miRNA, next-generation DNA sequencing technologies utilizing a number of platforms to perform whole genome, whole exome, or targeted DNA sequencing to determine somatic mutational differences and gene rearrangements, and a variety of proteomic analysis platforms including liquid chromatography/mass spectrometry (LC/MS) analysis to survey alterations in protein profiles in tumors. One other important advancement has been our current ability to survey the methylome of human tumors in a comprehensive fashion through the use of sequence-based and array-based methylation analysis (Bock et al., Nat Biotechnol 28:1106-1114, 2010; Harris et al., Nat Biotechnol 28:1097-1105, 2010). The focus of this chapter is to present and discuss the evidence for key genes involved in prostate tumor development, progression, or resistance to therapy that are regulated by methylation-induced silencing.

A Molecular Framework for Understanding DCIS

DTIC Science & Technology

2015-10-01

progressing from DCIS to IDC, with the purpose of reducing the need for over treatment for this disease. 2. Body Biopsies are analyzed in two...to DCIS, stroma adjacent to IDC, atypia lesions, Stroma away from disease, normal ductal epithelium and immune infiltrates. We have sequenced 500...DNA libraries we plan be able to answer the question of whether DCIS is a clonal disease, and if IDC is an independent disease or is a progression from
The identification of FANCD2 DNA binding domains reveals nuclear localization sequences.

PubMed

Niraj, Joshi; Caron, Marie-Christine; Drapeau, Karine; Bérubé, Stéphanie; Guitton-Sert, Laure; Coulombe, Yan; Couturier, Anthony M; Masson, Jean-Yves

2017-08-21

Fanconi anemia (FA) is a recessive genetic disorder characterized by congenital abnormalities, progressive bone-marrow failure, and cancer susceptibility. The FA pathway consists of at least 21 FANC genes (FANCA-FANCV), and the encoded protein products interact in a common cellular pathway to gain resistance against DNA interstrand crosslinks. After DNA damage, FANCD2 is monoubiquitinated and accumulates on chromatin. FANCD2 plays a central role in the FA pathway, using as yet unidentified DNA binding regions. Using synthetic peptide mapping and a DNA binding screen by electromobility shift assays, we found that FANCD2 bears two major DNA binding domains predominantly consisting of evolutionarily conserved lysine residues. Furthermore, one domain at the N-terminus of FANCD2 also bears nuclear localization sequences for the protein. Mutations in the bifunctional DNA binding/NLS domain lead to a reduction in FANCD2 monoubiquitination and an increase in mitomycin C sensitivity. Such phenotypes are not fully rescued by fusion with a heterologous NLS, which enables separation of the DNA binding and nuclear import functions within this domain that are necessary for FANCD2 function. Collectively, our results highlight the importance of DNA binding and NLS residues in FANCD2 for activating an efficient FA pathway. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Autonomous replication of nucleic acids by polymerization/nicking enzyme/DNAzyme cascades for the amplified detection of DNA and the aptamer-cocaine complex.

PubMed

Wang, Fuan; Freage, Lina; Orbach, Ron; Willner, Itamar

2013-09-03

The progressive development of amplified DNA sensors and aptasensors using replication/nicking enzymes/DNAzyme machineries is described. The sensing platforms are based on the tailoring of a DNA template on which the recognition of the target DNA or the formation of the aptamer-substrate complex trigger on the autonomous isothermal replication/nicking processes and the displacement of a Mg(2+)-dependent DNAzyme that catalyzes the generation of a fluorophore-labeled nucleic acid acting as readout signal for the analyses. Three different DNA sensing configurations are described, where in the ultimate configuration the target sequence is incorporated into a nucleic acid blocker structure associated with the sensing template. The target-triggered isothermal autonomous replication/nicking process on the modified template results in the formation of the Mg(2+)-dependent DNAzyme tethered to a free strand consisting of the target sequence. This activates additional template units for the nucleic acid self-replication process, resulting in the ultrasensitive detection of the target DNA (detection limit 1 aM). Similarly, amplified aptamer-based sensing platforms for cocaine are developed along these concepts. The modification of the cocaine-detection template by the addition of a nucleic acid sequence that enables the autonomous secondary coupled activation of a polymerization/nicking machinery and DNAzyme generation path leads to an improved analysis of cocaine (detection limit 10 nM).
Electrophoretic mobility shift assay reveals a novel recognition sequence for Setaria italica NAC protein.

PubMed

Puranik, Swati; Kumar, Karunesh; Srivastava, Prem S; Prasad, Manoj

2011-10-01

The NAC (NAM/ATAF1,2/CUC2) proteins are among the largest families of plant transcription factors. Members of this family have been associated with diverse plant processes and intricately regulate the expression of several genes. In spite of this immense progress, knowledge of their DNA-binding properties is still limited. In our recent publication,1 we reported isolation of a membrane-associated NAC domain protein from Setaria italica (SiNAC). Transactivation analysis revealed that it was a functionally active transcription factor, as it could stimulate expression of reporter genes in vivo. Truncation of the transmembrane region of the protein led to its nuclear localization. Here we describe expression and purification of the SiNAC DNA-binding domain. We further report identification of a novel DNA-binding site, [C/G][A/T][T/A][G/C]TC[C/G][A/T][C/G][G/C], for SiNAC by electrophoretic mobility shift assay. The SiNAC-GST protein could bind to the NAC recognition sequence in vitro as well as to sequences where some bases had been reshuffled. The results presented here contribute to our understanding of the DNA-binding specificity of SiNAC protein.
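The degenerate binding site reported in this abstract maps directly onto a regular expression, since each bracketed position lists the bases allowed there. The sketch below scans one strand of a DNA string for the motif; the function names and the test sequence are hypothetical, and a regex match says nothing about binding strength.

```python
import re

# Consensus exactly as written in the abstract.
SINAC_SITE = "[C/G][A/T][T/A][G/C]TC[C/G][A/T][C/G][G/C]"


def consensus_to_regex(consensus):
    """Turn "[C/G]"-style positions into regex character classes."""
    return consensus.replace("/", "")


def find_sites(dna, consensus=SINAC_SITE):
    """Return (start, matched_site) for each hit on the given strand."""
    # A lookahead reports overlapping matches as well.
    pattern = re.compile("(?=(%s))" % consensus_to_regex(consensus))
    return [(m.start(), m.group(1)) for m in pattern.finditer(dna.upper())]


# Made-up sequence containing one site that fits the consensus.
hits = find_sites("AAACATGTCCACGAAA")
```

Searching the reverse complement as well would be needed to cover both strands.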
Electrophoretic mobility shift assay reveals a novel recognition sequence for Setaria italica NAC protein

PubMed Central

Puranik, Swati; Kumar, Karunesh; Srivastava, Prem S

2011-01-01

The NAC (NAM/ATAF1,2/CUC2) proteins are among the largest families of plant transcription factors. Members of this family have been associated with diverse plant processes and intricately regulate the expression of several genes. In spite of this immense progress, knowledge of their DNA-binding properties is still limited. In our recent publication,1 we reported isolation of a membrane-associated NAC domain protein from Setaria italica (SiNAC). Transactivation analysis revealed that it was a functionally active transcription factor, as it could stimulate expression of reporter genes in vivo. Truncation of the transmembrane region of the protein led to its nuclear localization. Here we describe expression and purification of the SiNAC DNA-binding domain. We further report identification of a novel DNA-binding site, [C/G][A/T][T/A][G/C]TC[C/G][A/T][C/G][G/C], for SiNAC by electrophoretic mobility shift assay. The SiNAC-GST protein could bind to the NAC recognition sequence in vitro as well as to sequences where some bases had been reshuffled. The results presented here contribute to our understanding of the DNA-binding specificity of SiNAC protein. PMID:21918373
THE PHYLOGENETIC RELATIONSHIPS OF WHALE-FALL VESICOMYID CLAMS BASED ON MITOCHONDRIAL COI DNA SEQUENCES. (U915626)

EPA Science Inventory

The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...
IDENTICAL RIBOSOMAL DNA SEQUENCE DATA FROM PFIESTERIA PISCICIDA (DINOPHYCEAE) ISOLATES WITH DIFFERENT TOXICITY PHENOTYPES. (R827084)

EPA Science Inventory

The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health

PubMed Central

Martin, William F.

2017-01-01

Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. PMID:28444372
Chromosomal rearrangements involving telomeric DNA sequences in Balb/3T3 cells transfected with the Ha-ras oncogene.

PubMed

Peitl, Paulo; Mello, Stephano S; Camparoto, Marjori L; Passos, Geraldo A S; Hande, Manoor P; Cardoso, Renato S; Sakamoto-Hojo, Elza T

2002-01-01

Chromosomal instability involving telomeric DNA sequences was studied in mouse Balb/3T3 fibroblasts transfected with a mutated human c-Ha-ras-1 gene (B61 cells) and spontaneously immortalized normal parental cells (A31 cells), using fluorescence in situ hybridization (FISH). FISH analysis with a telomeric probe revealed high frequencies of chromosome alterations involving telomeric regions, mainly stable and unstable Robertsonian fusion-like configurations (RLC) (0.25 and 1.95/cell in A31 and B61 cells, respectively) and chromosome ends lacking telomeric signals in one (LTS') or both chromatids (LTS") (5.9 and 17.5/cell for A31 and B61 cells, respectively). Interstitial telomeric sequences (ITS) were also detected at both non-telomeric sites and in the centromeres of RLC. The frequencies of RLCs with ITS located in the centromeres were 3-fold higher in B61 compared with A31 cells. We demonstrated a high level of chromosome instability involving telomeric DNA sequences in ras-transfected cells overexpressing ras mRNA, which could be a consequence of rapid cell cycle progression associated with a deficient telomere capping mechanism.
Aberrant DNA methylation of WNT pathway genes in the development and progression of CIMP-negative colorectal cancer.

PubMed

Galamb, Orsolya; Kalmár, Alexandra; Péterfia, Bálint; Csabai, István; Bodor, András; Ribli, Dezső; Krenács, Tibor; Patai, Árpád V; Wichmann, Barnabás; Barták, Barbara Kinga; Tóth, Kinga; Valcz, Gábor; Spisák, Sándor; Tulassay, Zsolt; Molnár, Béla

2016-08-02

The WNT signaling pathway has an essential role in colorectal carcinogenesis and progression, which involves a cascade of genetic and epigenetic changes. We aimed to analyze DNA methylation affecting the WNT pathway genes in colorectal carcinogenesis in promoter and gene body regions using whole methylome analysis in 9 colorectal cancer, 15 adenoma, and 6 normal tumor adjacent tissue (NAT) samples by methyl capture sequencing. Functional methylation was confirmed on 5-aza-2'-deoxycytidine-treated colorectal cancer cell line datasets. In parallel with the DNA methylation analysis, mutations of WNT pathway genes (APC, β-catenin/CTNNB1) were analyzed by 454 sequencing on GS Junior platform. Most differentially methylated CpG sites were localized in gene body regions (95% of WNT pathway genes). In the promoter regions, 33 of the 160 analyzed WNT pathway genes were differentially methylated in colorectal cancer vs. normal, including hypermethylated AXIN2, CHP1, PRICKLE1, SFRP1, SFRP2, SOX17, and hypomethylated CACYBP, CTNNB1, MYC; 44 genes in adenoma vs. NAT; and 41 genes in colorectal cancer vs. adenoma comparisons. Hypermethylation of AXIN2, DKK1, VANGL1, and WNT5A gene promoters was higher, while those of SOX17, PRICKLE1, DAAM2, and MYC was lower in colon carcinoma compared to adenoma. Inverse correlation between expression and methylation was confirmed in 23 genes, including APC, CHP1, PRICKLE1, PSEN1, and SFRP1. Differential methylation affected both canonical and noncanonical WNT pathway genes in colorectal normal-adenoma-carcinoma sequence. Aberrant DNA methylation appears already in adenomas as an early event of colorectal carcinogenesis.
COMBINING DNA SEQUENCES AND MORPHOLOGY IN SYSTEMATICS: TESTING THE VALIDITY OF THE DRAGONFLY SPECIES CORDULEGASTER BILINEATA. (R826599)

EPA Science Inventory

The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...
DOE Office of Scientific and Technical Information (OSTI.GOV)

Ruggles, Kelly V.; Tang, Zuojian; Wang, Xuya

Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations and splice variants identified in cancer cells are translated. Herein we therefore describe a proteogenomic data integration tool (QUILTS) and illustrate its application to whole genome, transcriptome and global MS peptide sequence datasets generated from a pair of luminal and basal-like breast cancer patient derived xenografts (PDX). The sensitivity of proteogenomic analysis for single nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS process replicates. Despite over thirty sample replicates, only about 10% of all SNV (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNV without a detectable mRNA transcript were also observed, demonstrating that the transcriptome coverage was also incomplete (~80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, the QUILTS program integrates DNA, RNA and peptide sequencing to assess the degree to which somatic mutations are translated and therefore biologically active. By identifying gaps in sequence coverage, QUILTS benchmarks current technology and assesses progress towards whole cancer proteome and transcriptome analysis.
Cell-free circulating tumour DNA as a liquid biopsy in breast cancer.

PubMed

De Mattos-Arruda, Leticia; Caldas, Carlos

2016-03-01

Recent developments in massively parallel sequencing and digital genomic techniques support the clinical validity of cell-free circulating tumour DNA (ctDNA) as a 'liquid biopsy' in human cancer. In breast cancer, ctDNA detected in plasma can be used to non-invasively scan tumour genomes and quantify tumour burden. The applications for ctDNA in plasma include identifying actionable genomic alterations, monitoring treatment responses, unravelling therapeutic resistance, and potentially detecting disease progression before clinical and radiological confirmation. ctDNA may be used to characterise tumour heterogeneity and metastasis-specific mutations providing information to adapt the therapeutic management of patients. In this article, we review the current status of ctDNA as a 'liquid biopsy' in breast cancer. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
DNA Origami: Folded DNA-Nanodevices That Can Direct and Interpret Cell Behavior

PubMed Central

Kearney, Cathal J.; Lucas, Christopher R.; O'Brien, Fergal J.; Castro, Carlos E.

2016-01-01

DNA origami is a DNA-based nanotechnology that utilizes programmed combinations of short complementary oligonucleotides to fold a large single strand of DNA into precise 2-D and 3-D shapes. The exquisite nanoscale shape control of this inherently biocompatible material is combined with the potential to spatially address the origami structures with diverse cargos including drugs, antibodies, nucleic acid sequences, small molecules and inorganic particles. This programmable flexibility enables the fabrication of precise nanoscale devices that have already shown great potential for biomedical applications such as: drug delivery, biosensing and synthetic nanopore formation. In this Progress Report, we will review the advances in the DNA origami field since its inception several years ago and then focus on how these DNA-nanodevices can be designed to interact with cells to direct or probe their behavior. PMID:26840503
An innovative platform for quick and flexible joining of assorted DNA fragments

DOE PAGES

De Paoli, Henrique Cestari; Tuskan, Gerald A.; Yang, Xiaohan

2016-01-13

Successful synthetic biology efforts rely on conceptual and experimental designs in combination with testing of multi-gene constructs. Despite recent progress, several limitations still hinder the ability to flexibly assemble and collectively share different types of DNA segments. We describe an advanced system for joining DNA fragments from a universal library that automatically maintains open reading frames (ORFs) and does not require linkers, adaptors, sequence homology, amplification or mutation (domestication) of fragments in order to work properly. Moreover, we find that this system, which is enhanced by a unique buffer formulation, provides unforeseen capabilities for testing, and sharing, complex multi-gene circuitry assembled from different DNA fragments.
Duplication in DNA Sequences

NASA Astrophysics Data System (ADS)

Ito, Masami; Kari, Lila; Kincaid, Zachary; Seki, Shinnosuke

The duplication and repeat-deletion operations are the basis of a formal language theoretic model of errors that can occur during DNA replication. During DNA replication, subsequences of a strand of DNA may be copied several times (resulting in duplications) or skipped (resulting in repeat-deletions). As formal language operations, iterated duplication and repeat-deletion of words and languages have been well studied in the literature. However, little is known about single-step duplications and repeat-deletions. In this paper, we investigate several properties of these operations, including closure properties of language families in the Chomsky hierarchy and equations involving these operations. We also make progress toward a characterization of regular languages that are generated by duplicating a regular language.
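The two single-step operations described in this abstract can be illustrated with a small Python sketch. It assumes the usual reading of the definitions (duplication rewrites x·y·z as x·y·y·z for a non-empty factor y, and repeat-deletion is the inverse); the function names are hypothetical and are not taken from the paper.

```python
def duplications(word):
    """Return every word reachable from `word` by one duplication step.

    A step picks a non-empty factor y of word = x + y + z and yields x + y + y + z.
    """
    results = set()
    n = len(word)
    for i in range(n):
        for j in range(i + 1, n + 1):
            # word[:j] is x + y and word[i:] is y + z, so the sum is x + y + y + z
            results.add(word[:j] + word[i:])
    return results


def repeat_deletions(word):
    """Return every word reachable from `word` by one repeat-deletion step.

    A step finds an adjacent repeat y + y inside the word and removes one copy.
    """
    results = set()
    n = len(word)
    for i in range(n):
        for length in range(1, (n - i) // 2 + 1):
            first = word[i:i + length]
            second = word[i + length:i + 2 * length]
            if first == second:
                # keep x + y, drop the second copy of y, keep z
                results.add(word[:i + length] + word[i + 2 * length:])
    return results
```

For instance, `duplications("ab")` yields the three words `aab`, `abb` and `abab`, and applying `repeat_deletions` to any of them recovers `ab`.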
Analysis of Ethnic Admixture in Prostate Cancer

DTIC Science & Technology

2006-12-01

low penetrant genes have been identified as potential PCA susceptibility genes. These candidate genes include SRD5A2 (MIM 607306), CYP3A4 (MIM 124010...progression [13]. The CDH1 gene is located at 16q22.1 and consists of 16 exons spanning approximately 100 kb of genomic DNA. Several polymorphisms, germline and...upstream of the ATG start site and all 16 exons of CDH1 were screened for DNA sequence variation by denaturing high-performance liquid chromatography
Nanochannel Device with Embedded Nanopore: a New Approach for Single-Molecule DNA Analysis and Manipulation

NASA Astrophysics Data System (ADS)

Zhang, Yuning; Reisner, Walter

2012-02-01

Nanopore- and nanochannel-based devices are robust tools for biomolecular sensing and single-DNA manipulation. Nanopore-based DNA sensing has attractive features that make it a leading candidate as a single-molecule DNA sequencing technology. Nanochannel-based extension of DNA, combined with enzymatic or denaturation-based barcoding schemes, is already a powerful approach for genome analysis. We believe that there is revolutionary potential in devices that combine nanochannels with nanopore detectors. In particular, due to the fast translocation of a DNA molecule through a standard nanopore configuration, there is an unfavorable trade-off between signal and sequence resolution. With a combined nanochannel-nanopore device, based on embedding a nanopore inside a nanochannel, we can in principle gain independent control over both DNA translocation speed and sensing signal, solving the key drawback of the standard nanopore configuration. We will discuss our recent progress on device fabrication and characterization. We demonstrate that we can detect, using fluorescence microscopy, successful translocation of DNA from the nanochannel out through the nanopore, a possible method to 'select' a given barcode for further analysis. We also show that in equilibrium DNA will not escape through an embedded sub-persistence-length nanopore, suggesting that the embedded pore could be used as a nanoscale window through which to interrogate a nanochannel-extended DNA molecule.
Novel ANKH Amino Terminus Mutation (Pro5Ser) Associated With Early-Onset Calcium Pyrophosphate Disease With Associated Phosphaturia

PubMed Central

Gruber, Barry L.; Couto, Ana Rita; Armas, Jácome Bruges; Brown, Matthew A.; Finzel, Kathleen; Terkeltaub, Robert A.

2015-01-01

This report describes a 32-year-old woman presenting since childhood with progressive calcium pyrophosphate disease (CPPD), characterized by severe arthropathy and chondrocalcinosis involving multiple peripheral joints and intervertebral disks. Because ANKH mutations have been previously described in familial CPPD, the proband’s DNA was assessed at this locus by direct sequencing of promoter and coding regions and revealed 3 sequence variants in ANKH. Sequences of exon 1 revealed a novel isolated nonsynonymous mutation (c.13 C>T), altering amino acid in codon 5 from proline to serine (CCG>TCG). Sequencing of parental DNA revealed an identical mutation in the proband’s father but not the mother. Subsequent clinical evaluation demonstrated extensive chondrocalcinosis and degenerative arthropathy in the proband’s father. In summary, we report a novel mutation, not previously described, in ANKH exon 1, wherein serine replaces proline, in a case of early-onset severe CPPD associated with metabolic abnormalities, with similar findings in the proband’s father. PMID:22647861
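The link between the nucleotide change (c.13 C>T) and the amino acid change (Pro5Ser) reported above follows from simple codon arithmetic, sketched here in Python. The helper names are hypothetical, and the two-entry codon table contains only the standard genetic code assignments needed for this case.

```python
# Only the two codons mentioned in the abstract (standard genetic code).
CODON_TO_AA = {"CCG": "Pro", "TCG": "Ser"}


def codon_of(cds_position):
    """Map a 1-based coding-sequence position to (codon number, 0-based offset in codon)."""
    return (cds_position - 1) // 3 + 1, (cds_position - 1) % 3


def apply_snv(codon, offset, ref, alt):
    """Substitute `alt` for `ref` at `offset` in `codon`, checking the reference base."""
    if codon[offset] != ref:
        raise ValueError("reference base does not match the codon")
    return codon[:offset] + alt + codon[offset + 1:]


codon_number, offset = codon_of(13)            # position 13 is the first base of codon 5
mutant = apply_snv("CCG", offset, "C", "T")    # CCG becomes TCG
change = f"{CODON_TO_AA['CCG']}{codon_number}{CODON_TO_AA[mutant]}"
```

Under these assumptions `change` evaluates to `Pro5Ser`, matching the notation used in the record title.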
Novel ANKH amino terminus mutation (Pro5Ser) associated with early-onset calcium pyrophosphate disease with associated phosphaturia.

PubMed

Gruber, Barry L; Couto, Ana Rita; Armas, Jácome Bruges; Brown, Matthew A; Finzel, Kathleen; Terkeltaub, Robert A

2012-06-01

This report describes a 32-year-old woman presenting since childhood with progressive calcium pyrophosphate disease (CPPD), characterized by severe arthropathy and chondrocalcinosis involving multiple peripheral joints and intervertebral disks. Because ANKH mutations have been previously described in familial CPPD, the proband's DNA was assessed at this locus by direct sequencing of promoter and coding regions and revealed 3 sequence variants in ANKH. Sequences of exon 1 revealed a novel isolated nonsynonymous mutation (c.13 C>T), altering amino acid in codon 5 from proline to serine (CCG>TCG). Sequencing of parental DNA revealed an identical mutation in the proband's father but not the mother. Subsequent clinical evaluation demonstrated extensive chondrocalcinosis and degenerative arthropathy in the proband's father. In summary, we report a novel mutation, not previously described, in ANKH exon 1, wherein serine replaces proline, in a case of early-onset severe CPPD associated with metabolic abnormalities, with similar findings in the proband's father.

The DNA sequence of the human X chromosome

PubMed Central

Ross, Mark T.; Grafham, Darren V.; Coffey, Alison J.; Scherer, Steven; McLay, Kirsten; Muzny, Donna; Platzer, Matthias; Howell, Gareth R.; Burrows, Christine; Bird, Christine P.; Frankish, Adam; Lovell, Frances L.; Howe, Kevin L.; Ashurst, Jennifer L.; Fulton, Robert S.; Sudbrak, Ralf; Wen, Gaiping; Jones, Matthew C.; Hurles, Matthew E.; Andrews, T. Daniel; Scott, Carol E.; Searle, Stephen; Ramser, Juliane; Whittaker, Adam; Deadman, Rebecca; Carter, Nigel P.; Hunt, Sarah E.; Chen, Rui; Cree, Andrew; Gunaratne, Preethi; Havlak, Paul; Hodgson, Anne; Metzker, Michael L.; Richards, Stephen; Scott, Graham; Steffen, David; Sodergren, Erica; Wheeler, David A.; Worley, Kim C.; Ainscough, Rachael; Ambrose, Kerrie D.; Ansari-Lari, M. Ali; Aradhya, Swaroop; Ashwell, Robert I. S.; Babbage, Anne K.; Bagguley, Claire L.; Ballabio, Andrea; Banerjee, Ruby; Barker, Gary E.; Barlow, Karen F.; Barrett, Ian P.; Bates, Karen N.; Beare, David M.; Beasley, Helen; Beasley, Oliver; Beck, Alfred; Bethel, Graeme; Blechschmidt, Karin; Brady, Nicola; Bray-Allen, Sarah; Bridgeman, Anne M.; Brown, Andrew J.; Brown, Mary J.; Bonnin, David; Bruford, Elspeth A.; Buhay, Christian; Burch, Paula; Burford, Deborah; Burgess, Joanne; Burrill, Wayne; Burton, John; Bye, Jackie M.; Carder, Carol; Carrel, Laura; Chako, Joseph; Chapman, Joanne C.; Chavez, Dean; Chen, Ellson; Chen, Guan; Chen, Yuan; Chen, Zhijian; Chinault, Craig; Ciccodicola, Alfredo; Clark, Sue Y.; Clarke, Graham; Clee, Chris M.; Clegg, Sheila; Clerc-Blankenburg, Kerstin; Clifford, Karen; Cobley, Vicky; Cole, Charlotte G.; Conquer, Jen S.; Corby, Nicole; Connor, Richard E.; David, Robert; Davies, Joy; Davis, Clay; Davis, John; Delgado, Oliver; DeShazo, Denise; Dhami, Pawandeep; Ding, Yan; Dinh, Huyen; Dodsworth, Steve; Draper, Heather; Dugan-Rocha, Shannon; Dunham, Andrew; Dunn, Matthew; Durbin, K. 
James; Dutta, Ireena; Eades, Tamsin; Ellwood, Matthew; Emery-Cohen, Alexandra; Errington, Helen; Evans, Kathryn L.; Faulkner, Louisa; Francis, Fiona; Frankland, John; Fraser, Audrey E.; Galgoczy, Petra; Gilbert, James; Gill, Rachel; Glöckner, Gernot; Gregory, Simon G.; Gribble, Susan; Griffiths, Coline; Grocock, Russell; Gu, Yanghong; Gwilliam, Rhian; Hamilton, Cerissa; Hart, Elizabeth A.; Hawes, Alicia; Heath, Paul D.; Heitmann, Katja; Hennig, Steffen; Hernandez, Judith; Hinzmann, Bernd; Ho, Sarah; Hoffs, Michael; Howden, Phillip J.; Huckle, Elizabeth J.; Hume, Jennifer; Hunt, Paul J.; Hunt, Adrienne R.; Isherwood, Judith; Jacob, Leni; Johnson, David; Jones, Sally; de Jong, Pieter J.; Joseph, Shirin S.; Keenan, Stephen; Kelly, Susan; Kershaw, Joanne K.; Khan, Ziad; Kioschis, Petra; Klages, Sven; Knights, Andrew J.; Kosiura, Anna; Kovar-Smith, Christie; Laird, Gavin K.; Langford, Cordelia; Lawlor, Stephanie; Leversha, Margaret; Lewis, Lora; Liu, Wen; Lloyd, Christine; Lloyd, David M.; Loulseged, Hermela; Loveland, Jane E.; Lovell, Jamieson D.; Lozado, Ryan; Lu, Jing; Lyne, Rachael; Ma, Jie; Maheshwari, Manjula; Matthews, Lucy H.; McDowall, Jennifer; McLaren, Stuart; McMurray, Amanda; Meidl, Patrick; Meitinger, Thomas; Milne, Sarah; Miner, George; Mistry, Shailesh L.; Morgan, Margaret; Morris, Sidney; Müller, Ines; Mullikin, James C.; Nguyen, Ngoc; Nordsiek, Gabriele; Nyakatura, Gerald; O’Dell, Christopher N.; Okwuonu, Geoffery; Palmer, Sophie; Pandian, Richard; Parker, David; Parrish, Julia; Pasternak, Shiran; Patel, Dina; Pearce, Alex V.; Pearson, Danita M.; Pelan, Sarah E.; Perez, Lesette; Porter, Keith M.; Ramsey, Yvonne; Reichwald, Kathrin; Rhodes, Susan; Ridler, Kerry A.; Schlessinger, David; Schueler, Mary G.; Sehra, Harminder K.; Shaw-Smith, Charles; Shen, Hua; Sheridan, Elizabeth M.; Shownkeen, Ratna; Skuce, Carl D.; Smith, Michelle L.; Sotheran, Elizabeth C.; Steingruber, Helen E.; Steward, Charles A.; Storey, Roy; Swann, R. 
Mark; Swarbreck, David; Tabor, Paul E.; Taudien, Stefan; Taylor, Tineace; Teague, Brian; Thomas, Karen; Thorpe, Andrea; Timms, Kirsten; Tracey, Alan; Trevanion, Steve; Tromans, Anthony C.; d’Urso, Michele; Verduzco, Daniel; Villasana, Donna; Waldron, Lenee; Wall, Melanie; Wang, Qiaoyan; Warren, James; Warry, Georgina L.; Wei, Xuehong; West, Anthony; Whitehead, Siobhan L.; Whiteley, Mathew N.; Wilkinson, Jane E.; Willey, David L.; Williams, Gabrielle; Williams, Leanne; Williamson, Angela; Williamson, Helen; Wilming, Laurens; Woodmansey, Rebecca L.; Wray, Paul W.; Yen, Jennifer; Zhang, Jingkun; Zhou, Jianling; Zoghbi, Huda; Zorilla, Sara; Buck, David; Reinhardt, Richard; Poustka, Annemarie; Rosenthal, André; Lehrach, Hans; Meindl, Alfons; Minx, Patrick J.; Hillier, LaDeana W.; Willard, Huntington F.; Wilson, Richard K.; Waterston, Robert H.; Rice, Catherine M.; Vaudin, Mark; Coulson, Alan; Nelson, David L.; Weinstock, George; Sulston, John E.; Durbin, Richard; Hubbard, Tim; Gibbs, Richard A.; Beck, Stephan; Rogers, Jane; Bentley, David R.

2009-01-01

The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence. PMID:15772651
The Lipopolysaccharide and β-1,3-Glucan Binding Protein Gene Is Upregulated in White Spot Virus-Infected Shrimp (Penaeus stylirostris)

PubMed Central

Roux, Michelle M.; Pain, Arnab; Klimpel, Kurt R.; Dhar, Arun K.

2002-01-01

Pattern recognition proteins such as lipopolysaccharide and β-1,3-glucan binding protein (LGBP) play an important role in the innate immune response of crustaceans and insects. Random sequencing of cDNA clones from a hepatopancreas cDNA library of white spot virus (WSV)-infected shrimp provided a partial cDNA (PsEST-289) that showed similarity to the LGBP gene of crayfish and insects. Subsequently full-length cDNA was cloned by the 5′-RACE (rapid amplification of cDNA ends) technique and sequenced. The shrimp LGBP gene is 1,352 bases in length and is capable of encoding a polypeptide of 376 amino acids that showed significant similarity to homologous genes from crayfish, insects, earthworms, and sea urchins. Analysis of the shrimp LGBP deduced amino acid sequence identified conserved features of this gene family including a potential recognition motif for β-(1→3) linkage of polysaccharides and putative RGD cell adhesion sites. It is known that LGBP gene expression is upregulated in bacterial and fungal infection and that the binding of lipopolysaccharide and β-1,3-glucan to LGBP activates the prophenoloxidase (proPO) cascade. The temporal expression of LGBP and proPO genes in healthy and WSV-challenged Penaeus stylirostris shrimp was measured by real-time quantitative reverse transcription-PCR, and we showed that LGBP gene expression in shrimp was upregulated as the WSV infection progressed. Interestingly, the proPO expression was upregulated initially after infection followed by a downregulation as the viral infection progressed. The downward trend in the expression of proPO coincided with the detection of WSV in the infected shrimp. Our data suggest that shrimp LGBP is an inducible acute-phase protein that may play a critical role in shrimp-WSV interaction and that the WSV infection regulates the activation and/or activity of the proPO cascade in a novel way. PMID:12072514
Nuclear DNA amounts in angiosperms: progress, problems and prospects.

PubMed

Bennett, M D; Leitch, I J

2005-01-01

The nuclear DNA amount in an unreplicated haploid chromosome complement (1C-value) is a key diversity character with many uses. Angiosperm C-values have been listed for reference purposes since 1976, and pooled in an electronic database since 1997 (http://www.kew.org/cval/homepage). Such lists are cited frequently and provide data for many comparative studies. The last compilation was published in 2000, so a further supplementary list is timely to monitor progress against targets set at the first plant genome size workshop in 1997 and to facilitate new goal setting. The present work lists DNA C-values for 804 species including first values for 628 species from 88 original sources, not included in any previous compilation, plus additional values for 176 species included in a previous compilation. 1998-2002 saw striking progress in our knowledge of angiosperm C-values. At least 1700 first values for species were measured (the most in any five-year period) and familial representation rose from 30 % to 50 %. The loss of many densitometers used to measure DNA C-values proved less serious than feared, owing to the development of relatively inexpensive flow cytometers and computer-based image analysis systems. New uses of the term genome (e.g. in 'complete' genome sequencing) can cause confusion. The Arabidopsis Genome Initiative C-value for Arabidopsis thaliana (125 Mb) was a gross underestimate, and an exact C-value based on genome sequencing alone is unlikely to be obtained soon for any angiosperm. Lack of this expected benchmark poses a quandary as to what to use as the basal calibration standard for angiosperms. The next decade offers exciting prospects for angiosperm genome size research. The database (http://www.kew.org/cval/homepage) should become sufficiently representative of the global flora to answer most questions without needing new estimations. DNA amount variation will remain a key interest as an integrated strand of holistic genomics.
The preS deletion of hepatitis B virus (HBV) is associated with liver fibrosis progression in patients with chronic HBV infection.

PubMed

Li, Fan; Li, Xiaodong; Yan, Tao; Liu, Yan; Cheng, Yongqian; Xu, Zhihui; Shao, Qing; Liao, Hao; Huang, Pengyu; Li, Jin; Chen, Guo-Feng; Xu, Dongping

2018-03-01

Limited data are available regarding the association of hepatitis B virus (HBV) mutations with liver fibrosis in HBV infection. The study aimed to clarify whether HBV preS deletion mutation is associated with liver fibrosis progression. A total of 469 patients were enrolled, including 324 with chronic hepatitis B (CHB), 28 with HBV-related compensated liver cirrhosis (LC), and 117 with HBV-related decompensated LC. All CHB and compensated LC patients received liver biopsy. Fibrosis grade was assessed using METAVIR score. HBV preS deletion was determined by direct sequencing and verified by clonal sequencing. Overall preS deletion was detected in 12.6% (59/469) patients, specifically, in 7.51% (13/173), 10.60% (16/151), and 20.69% (30/145) of patients with no-to-mild liver fibrosis (F0-1), moderate-to-severe liver fibrosis (F2-3), and cirrhosis (F4), respectively (p < 0.01). Patients with preS-deleted HBV had lower serum HBV DNA and albumin levels compared to patients with wild-type HBV. The median length of preS deletion was 39-base pairs (bp) (3-204 bp) and the deletion most frequently emerged in preS2 initial region. Multivariate analysis identified the preS2 deletion rather than preS1 deletion to be an independent risk factor of significant fibrosis, i.e., METAVIR F ≥ 2 (p = 0.007). In addition, preS-deleted viral sequences were detected in the pool of intrahepatic HBV covalently closed circular DNA. HBV preS deletion is positively associated with liver fibrosis progression in chronic HBV-infected patients. HBV preS2 deletion may serve as a warning indicator for liver fibrosis progression.
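The prevalence figures quoted in this abstract can be re-derived from the stated counts. The short Python sketch below only repeats that arithmetic; the dictionary layout and variable names are illustrative assumptions, and no statistical test from the study is reproduced.

```python
# (patients with preS deletion, patients in group), as quoted in the abstract
groups = {
    "F0-1": (13, 173),
    "F2-3": (16, 151),
    "F4": (30, 145),
}

# Percentage of patients with a preS deletion in each fibrosis group
percent_by_group = {
    name: round(100 * deleted / total, 2)
    for name, (deleted, total) in groups.items()
}

# The group counts should add up to the overall figures (59 of 469)
total_deleted = sum(deleted for deleted, _ in groups.values())
total_patients = sum(total for _, total in groups.values())
overall_percent = round(100 * total_deleted / total_patients, 1)
```

The three group counts sum to 59 of 469 patients, so the per-group and overall percentages in the abstract are mutually consistent.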
BioVLAB-mCpG-SNP-EXPRESS: A system for multi-level and multi-perspective analysis and exploration of DNA methylation, sequence variation (SNPs), and gene expression from multi-omics data.

PubMed

Chae, Heejoon; Lee, Sangseon; Seo, Seokjun; Jung, Daekyoung; Chang, Hyeonsook; Nephew, Kenneth P; Kim, Sun

2016-12-01

Measuring gene expression, DNA sequence variation, and DNA methylation status is routinely done using high-throughput sequencing technologies. To analyze such multi-omics data and explore relationships, reliable bioinformatics systems are much needed. Existing systems are either for exploring curated data or for processing omics data in the form of a library such as R. Thus scientists have much difficulty in investigating relationships among gene expression, DNA sequence variation, and DNA methylation using multi-omics data. In this study, we report a system called BioVLAB-mCpG-SNP-EXPRESS for the integrated analysis of DNA methylation, sequence variation (SNPs), and gene expression for distinguishing cellular phenotypes at the pairwise and multiple phenotype levels. The system can be deployed on either the Amazon cloud or a publicly available high-performance computing node, and the data analysis and exploration of the analysis result can be conveniently done using a web-based interface. To reduce analysis complexity, all the processes are fully automated, and a graphical workflow system is integrated to show analysis progress in real time. The BioVLAB-mCpG-SNP-EXPRESS system works in three stages. First, it processes and analyzes multi-omics data supplied as raw data, i.e., FastQ files. Second, various integrated analyses such as methylation vs. gene expression and mutation vs. methylation are performed. Finally, the analysis result can be explored in a number of ways through a web interface for multi-level, multi-perspective exploration. Multi-level interpretation can be done at the gene, gene set, pathway or network level, and multi-perspective exploration can be done from the gene expression, DNA methylation, sequence variation, or relationship perspective. The utility of the system is demonstrated by analyzing a data set of 30 phenotypically distinct breast cancer cell lines.
BioVLAB-mCpG-SNP-EXPRESS is available at http://biohealth.snu.ac.kr/software/biovlab_mcpg_snp_express/. Copyright © 2016 Elsevier Inc. All rights reserved.
[Current situation and prospect of breast cancer liquid biopsy].

PubMed

Zhou, B; Xin, L; Xu, L; Ye, J M; Liu, Y H

2018-02-01

Liquid biopsy is a diagnostic approach based on analyzing body fluid samples. Peripheral blood is the most common sample; urine, saliva, pleural effusion and ascites are also used. At present, liquid biopsy is mainly used in the diagnosis and treatment of neoplasms. Compared with traditional tissue biopsy, liquid biopsy is minimally invasive, convenient to sample and easy to repeat. Liquid biopsy mainly comprises detection of circulating tumor cells and of circulating tumor DNA (ctDNA). Detection of ctDNA requires sensitive and accurate methods. Advances in next-generation sequencing (NGS) and digital PCR have promoted ctDNA research. In 2016, Nature published the results of a whole-genome sequencing study of breast cancer. The study found 1 628 mutations in 93 protein-coding genes that may be driver mutations of breast cancer. These results provided a new platform for breast cancer ctDNA studies. In recent years, many studies have used ctDNA detection to monitor therapeutic effect and guide treatment. NGS is a promising technique for accessing genetic information and guiding targeted therapy. It must be emphasized that ctDNA detection using NGS is still at the research stage. It is important to standardize ctDNA detection techniques and to perform prospective clinical studies. At present, the time is not ripe for using ctDNA detection to guide large-scale clinical practice in breast cancer.
G-quadruplex-interacting compounds alter latent DNA replication and episomal persistence of KSHV

PubMed Central

Madireddy, Advaitha; Purushothaman, Pravinkumar; Loosbroock, Christopher P.; Robertson, Erle S.; Schildkraut, Carl L.; Verma, Subhash C.

2016-01-01

Kaposi's sarcoma associated herpesvirus (KSHV) establishes life-long latent infection by persisting as an extra-chromosomal episome in the infected cells and by maintaining its genome in dividing cells. KSHV achieves this by tethering its epigenome to the host chromosome by latency associated nuclear antigen (LANA), which binds in the terminal repeat (TR) region of the viral genome. Sequence analysis of the TR, a GC-rich DNA element, identified several potential Quadruplex G-Rich Sequences (QGRS). Since quadruplexes have the tendency to obstruct DNA replication, we used G-quadruplex stabilizing compounds to examine their effect on latent DNA replication and the persistence of viral episomes. Our results showed that these G-quadruplex stabilizing compounds led to the activation of dormant origins of DNA replication, with preferential bi-directional pausing of replication forks moving out of the TR region, implicating the G-rich TR in the perturbation of episomal DNA replication. Over time, treatment with PhenDC3 led to a loss of viral episomes in the infected cells. Overall, these data show that G-quadruplex stabilizing compounds retard the progression of replication forks, leading to a reduction in DNA replication and episomal maintenance. These results suggest a potential role for G-quadruplex stabilizers in the treatment of KSHV-associated diseases. PMID:26837574
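As a rough illustration of how candidate quadruplex-forming sequences might be located in a GC-rich element, the Python sketch below scans a sequence with a widely used simplified motif: four runs of at least three guanines separated by loops of one to seven nucleotides. This pattern and the function name are assumptions made for illustration; the abstract does not state which QGRS definition or tool the authors used.

```python
import re

# Simplified G-quadruplex motif (an assumption, not the authors' definition):
# four G-runs of length >= 3, separated by loops of 1 to 7 nucleotides.
G4_PATTERN = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)


def find_g4_candidates(sequence):
    """Return (start, end, matched text) for non-overlapping motif hits on the given strand."""
    sequence = sequence.upper()
    return [(m.start(), m.end(), m.group()) for m in G4_PATTERN.finditer(sequence)]


hits = find_g4_candidates("TTGGGAGGGTGGGAGGGTT")
```

The scan covers only the strand supplied; searching the complementary strand would require the reverse complement or an equivalent C-run pattern.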
Toward high-resolution population genomics using archaeological samples

PubMed Central

Morozova, Irina; Flegontov, Pavel; Mikheyev, Alexander S.; Bruskin, Sergey; Asgharian, Hosseinali; Ponomarenko, Petr; Klyuchnikov, Vladimir; ArunKumar, GaneshPrasad; Prokhortchouk, Egor; Gankin, Yuriy; Rogaev, Evgeny; Nikolsky, Yuri; Baranova, Ancha; Elhaik, Eran; Tatarinova, Tatiana V.

2016-01-01

The term ‘ancient DNA’ (aDNA) is coming of age, with over 1,200 hits in the PubMed database, beginning in the early 1980s with the studies of ‘molecular paleontology’. Rooted in cloning and limited sequencing of DNA from ancient remains during the pre-PCR era, the field has made incredible progress since the introduction of PCR and next-generation sequencing. Over the last decade, aDNA analysis ushered in a new era in genomics and became the method of choice for reconstructing the history of organisms, their biogeography, and migration routes, with applications in evolutionary biology, population genetics, archaeogenetics, paleo-epidemiology, and many other areas. This change was brought by development of new strategies for coping with the challenges in studying aDNA due to damage and fragmentation, scarce samples, significant historical gaps, and limited applicability of population genetics methods. In this review, we describe the state-of-the-art achievements in aDNA studies, with particular focus on human evolution and demographic history. We present the current experimental and theoretical procedures for handling and analysing highly degraded aDNA. We also review the challenges in the rapidly growing field of ancient epigenomics. Advancement of aDNA tools and methods signifies a new era in population genetics and evolutionary medicine research. PMID:27436340
Precision medicine in the age of big data: The present and future role of large-scale unbiased sequencing in drug discovery and development.

PubMed

Vicini, P; Fields, O; Lai, E; Litwack, E D; Martin, A-M; Morgan, T M; Pacanowski, M A; Papaluca, M; Perez, O D; Ringel, M S; Robson, M; Sakul, H; Vockley, J; Zaks, T; Dolsten, M; Søgaard, M

2016-02-01

High-throughput molecular and functional profiling of patients is a key driver of precision medicine. DNA and RNA characterization has been enabled at unprecedented cost and scale through rapid, disruptive progress in sequencing technology, but challenges persist in data management and interpretation. We analyze the state of the art of large-scale unbiased sequencing (LUS) in drug discovery and development, including technology, application, ethical, regulatory, policy and commercial considerations, and discuss issues of LUS implementation in clinical and regulatory practice. © 2015 American Society for Clinical Pharmacology and Therapeutics.
Highly Efficient CRISPR/Cas9-Mediated Cloning and Functional Characterization of Gastric Cancer-Derived Epstein-Barr Virus Strains.

PubMed

Kanda, Teru; Furuse, Yuki; Oshitani, Hitoshi; Kiyono, Tohru

2016-05-01

The Epstein-Barr virus (EBV) is etiologically linked to approximately 10% of gastric cancers, in which viral genomes are maintained as multicopy episomes. EBV-positive gastric cancer cells are incompetent for progeny virus production, making viral DNA cloning extremely difficult. Here we describe a highly efficient strategy for obtaining bacterial artificial chromosome (BAC) clones of EBV episomes by utilizing a CRISPR/Cas9-mediated strand break of the viral genome and subsequent homology-directed repair. EBV strains maintained in two gastric cancer cell lines (SNU719 and YCCEL1) were cloned, and their complete viral genome sequences were determined. Infectious viruses of gastric cancer cell-derived EBVs were reconstituted, and the viruses established stable latent infections in immortalized keratinocytes. While Ras oncoprotein overexpression caused massive vacuolar degeneration and cell death in control keratinocytes, EBV-infected keratinocytes survived in the presence of Ras expression. These results implicate EBV infection in predisposing epithelial cells to malignant transformation by inducing resistance to oncogene-induced cell death. Recent progress in DNA-sequencing technology has accelerated EBV whole-genome sequencing, and the repertoire of sequenced EBV genomes is increasing progressively. Accordingly, the presence of EBV variant strains that may be relevant to EBV-associated diseases has begun to attract interest. Clearly, the determination of additional disease-associated viral genome sequences will facilitate the identification of any disease-specific EBV variants. We found that CRISPR/Cas9-mediated cleavage of EBV episomal DNA enabled the cloning of disease-associated viral strains with unprecedented efficiency. As a proof of concept, two gastric cancer cell-derived EBV strains were cloned, and the infection of epithelial cells with reconstituted viruses provided important clues about the mechanism of EBV-mediated epithelial carcinogenesis. 
This experimental system should contribute to establishing the relationship between viral genome variation and EBV-associated diseases. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Nuclear DNA Amounts in Angiosperms: Progress, Problems and Prospects

PubMed Central

BENNETT, M. D.; LEITCH, I. J.

2005-01-01

Contents: Introduction. Progress: improved systematic representation (species and families), (i) first estimates for species, (ii) first estimates for families. Problems: geographical representation and distribution; plant life form; obsolescence time bomb; errors and inexactitudes; genome size, ‘complete’ genome sequencing, and the euchromatic genome; the completely sequenced genome; weeding out erroneous data; what is the smallest reliable C-value for an angiosperm?; what is the minimum C-value for a free-living angiosperm and other free-living organisms? Prospects for the next ten years: holistic genomics. Literature cited. Appendix: notes to the Appendix; original references for DNA values. • Background The nuclear DNA amount in an unreplicated haploid chromosome complement (1C-value) is a key diversity character with many uses. Angiosperm C-values have been listed for reference purposes since 1976, and pooled in an electronic database since 1997 (http://www.kew.org/cval/homepage). Such lists are cited frequently and provide data for many comparative studies. The last compilation was published in 2000, so a further supplementary list is timely to monitor progress against targets set at the first plant genome size workshop in 1997 and to facilitate new goal setting. • Scope The present work lists DNA C-values for 804 species including first values for 628 species from 88 original sources, not included in any previous compilation, plus additional values for 176 species included in a previous compilation. • Conclusions 1998–2002 saw striking progress in our knowledge of angiosperm C-values. At least 1700 first values for species were measured (the most in any five-year period) and familial representation rose from 30 % to 50 %. The loss of many densitometers used to measure DNA C-values proved less serious than feared, owing to the development of relatively inexpensive flow cytometers and computer-based image analysis systems.
New uses of the term genome (e.g. in ‘complete’ genome sequencing) can cause confusion. The Arabidopsis Genome Initiative C-value for Arabidopsis thaliana (125 Mb) was a gross underestimate, and an exact C-value based on genome sequencing alone is unlikely to be obtained soon for any angiosperm. Lack of this expected benchmark poses a quandary as to what to use as the basal calibration standard for angiosperms. The next decade offers exciting prospects for angiosperm genome size research. The database (http://www.kew.org/cval/homepage) should become sufficiently representative of the global flora to answer most questions without needing new estimations. DNA amount variation will remain a key interest as an integrated strand of holistic genomics. PMID:15596457
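C-values are usually reported in picograms, whereas sequencing-based genome sizes such as the 125 Mb figure above are given in megabases. The sketch below converts between the two. It assumes the widely used factor of 1 pg ≈ 978 Mb, which is not stated in the abstract, and the function names are purely illustrative.

```python
# Assumption (not from the abstract): 1 pg of double-stranded DNA
# corresponds to roughly 978 Mb. Function names are hypothetical.
MB_PER_PG = 978.0


def mb_to_pg(megabases: float) -> float:
    """Convert a genome size in megabases to picograms of DNA."""
    return megabases / MB_PER_PG


def pg_to_mb(picograms: float) -> float:
    """Convert a DNA amount in picograms to megabases."""
    return picograms * MB_PER_PG


# The 125 Mb Arabidopsis thaliana figure quoted in the abstract,
# expressed as a 1C-value in picograms.
agi_estimate_pg = mb_to_pg(125.0)
print(f"125 Mb is about {agi_estimate_pg:.3f} pg")
```

Expressing both kinds of estimate in one unit makes it easier to compare a sequencing-based figure with a cytometric C-value for the same species.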
Identification and Targeting of Tyrosine Kinase Activity in Prostate Cancer Initiation, Progression, and Metastasis

DTIC Science & Technology

2013-12-01

University) "Effectors of the DNA damage and radiotherapy response in cancer" 9:20 pm - 9:30 pm Discussion TUESDAY 7:30 am - 8:30 am Breakfast 9:00...M. Morris , Hideki Onagi, Timothy M. Altamore, Allan B. Gamble, Christopher J. Easton Prohormone-substrate peptide sequence recognition by
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.

PubMed

Hazkani-Covo, Einat; Martin, William F

2017-05-01

Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
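The overestimation described above arises when one insertion, later split by a mobile element, is reported as several separate hits. A minimal sketch of one possible correction follows: hits on the same chromosome that lie within a chosen gap are counted as a single insertion event. The tuple layout, the `max_gap` parameter and the toy coordinates are assumptions for illustration only; they are not the authors' procedure, which would likely also check that the fragments are collinear in the organelle genome.

```python
from typing import List, Tuple

# Hypothetical hit representation: (chromosome, start, end) in nuclear coordinates.
Hit = Tuple[str, int, int]


def merge_fragmented_hits(hits: List[Hit], max_gap: int) -> List[Hit]:
    """Merge hits on the same chromosome separated by at most max_gap bases."""
    merged: List[Hit] = []
    for chrom, start, end in sorted(hits):
        if merged and merged[-1][0] == chrom and start - merged[-1][2] <= max_gap:
            prev_chrom, prev_start, prev_end = merged[-1]
            # Extend the previous event instead of counting a new insertion.
            merged[-1] = (prev_chrom, prev_start, max(prev_end, end))
        else:
            merged.append((chrom, start, end))
    return merged


# Toy data: the first two hits are assumed to be one numt split by an insertion.
raw_hits = [
    ("chr1", 100, 400),
    ("chr1", 900, 1200),
    ("chr1", 50000, 50300),
    ("chr2", 10, 200),
]
events = merge_fragmented_hits(raw_hits, max_gap=1000)
print(len(raw_hits), "raw hits,", len(events), "merged events")
```

The choice of `max_gap` is the critical parameter here: too small and fragmented insertions are still overcounted, too large and independent insertions are merged.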
TOPICAL REVIEW: The physics of chromatin

NASA Astrophysics Data System (ADS)

Schiessel, Helmut

2003-05-01

Recent progress has been made in the understanding of the physical properties of chromatin - the dense complex of DNA and histone proteins that occupies the nuclei of plant and animal cells. Here I will focus on the two lowest levels of the hierarchy of DNA folding into the chromatin complex. (i) The nucleosome, the chromatin repeating unit consisting of a globular aggregate of eight histone proteins with the DNA wrapped around it: its overcharging, the DNA unwrapping transition, the 'sliding' of the octamer along the DNA. (ii) The 30 nm chromatin fibre, the necklace-like structure of nucleosomes connected via linker DNA: its geometry, its mechanical properties under stretching and its response to changing ionic conditions. I will stress that chromatin combines two seemingly contradictory features: (1) high compaction of DNA within the nuclear envelope and, at the same time, (2) accessibility to genes, promoter regions and gene regulatory sequences.
Plasma genetic and genomic abnormalities predict treatment response and clinical outcome in advanced prostate cancer.

PubMed

Xia, Shu; Kohli, Manish; Du, Meijun; Dittmar, Rachel L; Lee, Adam; Nandy, Debashis; Yuan, Tiezheng; Guo, Yongchen; Wang, Yuan; Tschannen, Michael R; Worthey, Elizabeth; Jacob, Howard; See, William; Kilari, Deepak; Wang, Xuexia; Hovey, Raymond L; Huang, Chiang-Ching; Wang, Liang

2015-06-30

Liquid biopsies, examinations of tumor components in body fluids, have shown promise for predicting clinical outcomes. To evaluate tumor-associated genomic and genetic variations in plasma cell-free DNA (cfDNA) and their associations with treatment response and overall survival, we applied whole genome and targeted sequencing to examine the plasma cfDNAs derived from 20 patients with advanced prostate cancer. Sequencing-based genomic abnormality analysis revealed locus-specific gains or losses that were common in prostate cancer, such as 8q gains, AR amplifications, PTEN losses and TMPRSS2-ERG fusions. To estimate tumor burden in cfDNA, we developed a Plasma Genomic Abnormality (PGA) score by summing the most significant copy number variations. Cox regression analysis showed that PGA scores were significantly associated with overall survival (p < 0.04). After androgen deprivation therapy or chemotherapy, targeted sequencing showed significant mutational profile changes in genes involved in androgen biosynthesis, AR activation, DNA repair, and chemotherapy resistance. These changes may reflect the dynamic evolution of heterozygous tumor populations in response to these treatments. These results strongly support the feasibility of using non-invasive liquid biopsies as potential tools to study biological mechanisms underlying therapy-specific resistance and to predict disease progression in advanced prostate cancer.
Structure, replication efficiency and fragility of yeast ARS elements.

PubMed

Dhar, Manoj K; Sehgal, Shelly; Kaul, Sanjana

2012-05-01

DNA replication in eukaryotes initiates at specific sites known as origins of replication, or replicators. These replication origins occur throughout the genome, though the propensity of their occurrence depends on the type of organism. In eukaryotes, zones of initiation of replication spanning from about 100 to 50,000 base pairs have been reported. The characteristics of eukaryotic replication origins are best understood in the budding yeast Saccharomyces cerevisiae, where some autonomously replicating sequences, or ARS elements, confer origin activity. ARS elements are short DNA sequences of a few hundred base pairs, identified by their efficiency at initiating a replication event when cloned in a plasmid. ARS elements, although structurally diverse, maintain a basic structure composed of three domains, A, B and C. Domain A is comprised of a consensus sequence designated ACS (ARS consensus sequence), while the B domain has the DNA unwinding element and the C domain is important for DNA-protein interactions. Although there are ∼400 ARS elements in the yeast genome, not all of them are active origins of replication. Different groups within the genus Saccharomyces have ARS elements as components of replication origin. The present paper provides a comprehensive review of various aspects of ARSs, starting from their structural conservation to sequence thermodynamics. All significant and conserved functional sequence motifs within different types of ARS elements have been extensively described. Issues like silencing at ARSs, their inherent fragility and factors governing their replication efficiency have also been addressed. Progress in understanding crucial components associated with the replication machinery and timing at these ARS elements is discussed in the section entitled "The replicon revisited". Copyright © 2012 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
No exonic mutations at GJB2, GJB3, GJB4, GJB6, ARS (Component B), and LOR genes responsible for a Chinese patient affected by progressive symmetric erythrokeratodermia with pseudoainhum.

PubMed

Zhou, Fusheng; Fu, Hongyang; Liu, Linghua; Cui, Yong; Zhang, Zhengzhong; Chang, Ruixue; Yue, Zhen; Yang, Sen; Zhang, Xuejun

2014-09-01

Progressive symmetric erythrokeratodermia (PSEK) is characterized by symmetric, growing erythematous hyperkeratotic patches that appear over the body shortly after birth, particularly on the trunk, limbs, buttocks, and face, sometimes together with palmoplantar keratoderma (PPK). Mutations in the GJB2, GJB3, GJB4, GJB6, ARS (Component B), and LOR genes might contribute to PSEK manifestation. This study aimed to identify sequence alterations in these genes in a Chinese PSEK patient with pseudoainhum. Genomic DNA was purified from the patient's peripheral blood. Mutation analysis of target genes was performed by direct sequencing using an ABI 3730 sequencer. No exonic mutations were identified in the aforementioned genes. The result underlines the genetic heterogeneity of PSEK and other related erythrokeratodermas. © 2014 The International Society of Dermatology.
Genetic mutations in human rectal cancers detected by targeted sequencing.

PubMed

Bai, Jun; Gao, Jinglong; Mao, Zhijun; Wang, Jianhua; Li, Jianhui; Li, Wensheng; Lei, Yu; Li, Shuaishuai; Wu, Zhuo; Tang, Chuanning; Jones, Lindsey; Ye, Hua; Lou, Feng; Liu, Zhiyuan; Dong, Zhishou; Guo, Baishuai; Huang, Xue F; Chen, Si-Yi; Zhang, Enke

2015-10-01

Colorectal cancer (CRC) is widespread with significant mortality. Both inherited and sporadic mutations in various signaling pathways influence the development and progression of the cancer. Identifying genetic mutations in CRC is important for optimal patient treatment and many approaches currently exist to uncover these mutations, including next-generation sequencing (NGS) and commercially available kits. In the present study, we used a semiconductor-based targeted DNA-sequencing approach to sequence and identify genetic mutations in 91 human rectal cancer samples. Analysis revealed frequent mutations in KRAS (58.2%), TP53 (28.6%), APC (16.5%), FBXW7 (9.9%) and PIK3CA (9.9%), and additional mutations in BRAF, CTNNB1, ERBB2 and SMAD4 were also detected at lesser frequencies. Thirty-eight samples (41.8%) also contained two or more mutations, with common combination mutations occurring between KRAS and TP53 (42.1%), and KRAS and APC (31.6%). DNA sequencing for individual cancers is of clinical importance for targeted drug therapy and the advantages of such targeted gene sequencing over other NGS platforms or commercially available kits in sensitivity, cost and time effectiveness may aid clinicians in treating CRC patients in the near future.
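The percentages above can be sanity-checked against the cohort size. The sketch below back-calculates the sample counts implied by each reported frequency and confirms that they round to the published values. The counts are inferred, not reported in the abstract, and the use of the 38 multi-mutation samples as the denominator for the co-mutation percentages is an assumption.

```python
TOTAL_SAMPLES = 91   # rectal cancer samples, from the abstract
MULTI_MUTATION = 38  # samples with two or more mutations, from the abstract

reported_percent = {"KRAS": 58.2, "TP53": 28.6, "APC": 16.5, "FBXW7": 9.9, "PIK3CA": 9.9}
# Assumption: co-mutation percentages are relative to the 38 multi-mutation samples.
reported_co_percent = {"KRAS+TP53": 42.1, "KRAS+APC": 31.6}


def implied_count(percent: float, total: int) -> int:
    """Nearest whole number of samples consistent with a reported percentage."""
    return round(percent * total / 100)


for gene, pct in reported_percent.items():
    n = implied_count(pct, TOTAL_SAMPLES)
    print(f"{gene}: about {n}/{TOTAL_SAMPLES} samples ({100 * n / TOTAL_SAMPLES:.1f}%)")

for pair, pct in reported_co_percent.items():
    n = implied_count(pct, MULTI_MUTATION)
    print(f"{pair}: about {n}/{MULTI_MUTATION} samples ({100 * n / MULTI_MUTATION:.1f}%)")
```

Under these assumptions every back-calculated count reproduces the published percentage to one decimal place, which suggests the figures are internally consistent.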
The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.

PubMed

Schnitzler, P; Handermann, M; Szépe, O; Darai, G

1991-06-01

The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV), which has been localized between coordinates 0.678 and 0.688 of the viral genome, was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes were investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of the FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservation was detected in several regions of the FLDV TK protein when compared to the amino acid sequences of TKs of African swine fever virus, fowlpox virus, shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.
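The coordinates quoted above can be checked with simple arithmetic. Because the reading frame lies on the lower strand, its start position (1868) is numerically larger than its end position (915). The sketch below computes the inclusive length and the number of codons, and includes a small reverse-complement helper to show how a lower-strand frame would be read. The helper names and the toy sequence are illustrative assumptions; whether the stop codon is counted in the 954 bp depends on the authors' coordinate convention, which the abstract does not state.

```python
def orf_length(start_pos: int, end_pos: int) -> int:
    """Inclusive length in bp between two positions, on either strand."""
    return abs(start_pos - end_pos) + 1


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the sequence of the opposite strand, read 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


length_bp = orf_length(1868, 915)  # positions taken from the abstract
codons = length_bp // 3
print(f"ORF length: {length_bp} bp, {codons} codons")

# Toy example only: a lower-strand ORF appears reversed and complemented
# when the upper strand is read left to right.
print(reverse_complement("TTACAT"))
```

The result, 954 bp and 318 codons, matches the length and the 318 residues stated in the abstract.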
Whole-exome/genome sequencing and genomics.

PubMed

Grody, Wayne W; Thompson, Barry H; Hudgins, Louanne

2013-12-01

As medical genetics has progressed from a descriptive entity to one focused on the functional relationship between genes and clinical disorders, emphasis has been placed on genomics. Genomics, a subelement of genetics, is the study of the genome, the sum total of all the genes of an organism. The human genome, which is contained in the 23 pairs of nuclear chromosomes and in the mitochondrial DNA of each cell, comprises >6 billion nucleotides of genetic code. There are some 23,000 protein-coding genes, a surprisingly small fraction of the total genetic material, with the remainder composed of noncoding DNA, regulatory sequences, and introns. The Human Genome Project, launched in 1990, produced a draft of the genome in 2001 and then a finished sequence in 2003, on the 50th anniversary of the initial publication of Watson and Crick's paper on the double-helical structure of DNA. Since then, this mass of genetic information has been translated at an ever-increasing pace into useable knowledge applicable to clinical medicine. The recent advent of massively parallel DNA sequencing (also known as shotgun, high-throughput, and next-generation sequencing) has brought whole-genome analysis into the clinic for the first time, and most of the current applications are directed at children with congenital conditions that are undiagnosable by using standard genetic tests for single-gene disorders. Thus, pediatricians must become familiar with this technology, what it can and cannot offer, and its technical and ethical challenges. Here, we address the concepts of human genomic analysis and its clinical applicability for primary care providers.

Accelerated construction of a regional DNA-barcode reference library: Caddisflies (Trichoptera) in the Great Smoky Mountains National Park

USGS Publications Warehouse

Zhou, X.; Robinson, J.L.; Geraci, C.J.; Parker, C.R.; Flint, O.S.; Etnier, D.A.; Ruiter, D.; DeWalt, R.E.; Jacobus, L.M.; Hebert, P.D.N.

2011-01-01

Deoxyribonucleic acid (DNA) barcoding is an effective tool for species identification and lifestage association in a wide range of animal taxa. We developed a strategy for rapid construction of a regional DNA-barcode reference library and used the caddisflies (Trichoptera) of the Great Smoky Mountains National Park (GSMNP) as a model. Nearly 1000 cytochrome c oxidase subunit I (COI) sequences, representing 209 caddisfly species previously recorded from GSMNP, were obtained from the global Trichoptera Barcode of Life campaign. Most of these sequences were collected from outside the GSMNP area. Another 645 COI sequences, representing 80 species, were obtained from specimens collected in a 3-d bioblitz (short-term, intense sampling program) in GSMNP. The joint collections provided barcode coverage for 212 species, 91% of the GSMNP fauna. Inclusion of samples from other localities greatly expedited construction of the regional DNA-barcode reference library. This strategy increased intraspecific divergence and decreased average distances to nearest neighboring species, but the DNA-barcode library was able to differentiate 93% of the GSMNP Trichoptera species examined. Global barcoding projects will aid construction of regional DNA-barcode libraries, but local surveys make crucial contributions to progress by contributing rare or endemic species and full-length barcodes generated from high-quality DNA. DNA taxonomy is not a goal of our present work, but the investigation of COI divergence patterns in caddisflies is providing new insights into broader biodiversity patterns in this group and has directed attention to various issues, ranging from the need to re-evaluate species taxonomy with integrated morphological and molecular evidence to the necessity of an appropriate interpretation of barcode analyses and its implications in understanding species diversity (in contrast to a simple claim for barcoding failure).
Chapter 27 -- Breast Cancer Genomics, Section VI, Pathology and Biological Markers of Invasive Breast Cancer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Spellman, Paul T.; Heiser, Laura; Gray, Joe W.

2009-06-18

Breast cancer is predominantly a disease of the genome with cancers arising and progressing through accumulation of aberrations that alter the genome - by changing DNA sequence, copy number, and structure in ways that contribute to diverse aspects of cancer pathophysiology. Classic examples of genomic events that contribute to breast cancer pathophysiology include inherited mutations in BRCA1, BRCA2, TP53, and CHK2 that contribute to the initiation of breast cancer, amplification of ERBB2 (formerly HER2) and mutations of elements of the PI3-kinase pathway that activate aspects of epidermal growth factor receptor (EGFR) signaling and deletion of CDKN2A/B that contributes to cell cycle deregulation and genome instability. It is now apparent that accumulation of these aberrations is a time-dependent process that accelerates with age. Although American women living to an age of 85 have a 1 in 8 chance of developing breast cancer, breast cancer in women younger than 30 years is uncommon. This is consistent with a multistep cancer progression model whereby mutation and selection drive the tumor's development, analogous to traditional Darwinian evolution. In the case of cancer, the driving events are changes in sequence, copy number, and structure of DNA and alterations in chromatin structure or other epigenetic marks. Our understanding of the genetic, genomic, and epigenomic events that influence the development and progression of breast cancer is increasing at a remarkable rate through application of powerful analysis tools that enable genome-wide analysis of DNA sequence and structure, copy number, allelic loss, and epigenomic modification. Application of these techniques to elucidation of the nature and timing of these events is enriching our understanding of mechanisms that increase breast cancer susceptibility, enable tumor initiation and progression to metastatic disease, and determine therapeutic response or resistance. 
These studies also reveal the molecular differences between cancer and normal that may be exploited to therapeutic benefit or that provide targets for molecular assays that may enable early cancer detection, and predict individual disease progression or response to treatment. This chapter reviews current and future directions in genome analysis and summarizes studies that provide insights into breast cancer pathophysiology or that suggest strategies to improve breast cancer management.
A distinct first replication cycle of DNA introduced in mammalian cells

PubMed Central

Chandok, Gurangad S.; Kapoor, Kalvin K.; Brick, Rachel M.; Sidorova, Julia M.; Krasilnikova, Maria M.

2011-01-01

Many mutation events in microsatellite DNA sequences were traced to the first embryonic divisions. It was not known what makes the first replication cycles of embryonic DNA different from subsequent replication cycles. Here we demonstrate that an unusual replication mode is involved in the first cycle of replication of DNA introduced in mammalian cells. This alternative replication starts at random positions, and occurs before the chromatin is fully assembled. It is detected in various cell lines and primary cells. The presence of single-stranded regions increases the efficiency of this alternative replication mode. The alternative replication cannot progress through the A/T-rich FRA16B fragile site, while the regular replication mode is not affected by it. A/T-rich microsatellites are associated with the majority of chromosomal breakpoints in cancer. We suggest that the alternative replication mode may be initiated at the regions with immature chromatin structure in embryonic and cancer cells resulting in increased genomic instability. This work demonstrates, for the first time, differences in the replication progression during the first and subsequent replication cycles in mammalian cells. PMID:21062817
Towards a comprehensive barcode library for arctic life - Ephemeroptera, Plecoptera, and Trichoptera of Churchill, Manitoba, Canada

PubMed Central

2009-01-01

Background This study reports progress in assembling a DNA barcode reference library for Ephemeroptera, Plecoptera, and Trichoptera ("EPTs") from a Canadian subarctic site, which is the focus of a comprehensive biodiversity inventory using DNA barcoding. These three groups of aquatic insects exhibit a moderate level of species diversity, making them ideal for testing the feasibility of DNA barcoding for routine biotic surveys. We explore the correlation between the morphological species delineations, DNA barcode-based haplotype clusters delimited by a sequence threshold (2%), and a threshold-free approach to biodiversity quantification--phylogenetic diversity. Results A DNA barcode reference library is built for 112 EPT species for the focal region, consisting of 2277 COI sequences. Close correspondence was found between EPT morphospecies and haplotype clusters as designated using a standard threshold value. Similarly, the shapes of taxon accumulation curves based upon haplotype clusters were very similar to those generated using phylogenetic diversity accumulation curves, but were much more computationally efficient. Conclusion The results of this study will facilitate other lines of research on northern EPTs and also bode well for rapidly conducting initial biodiversity assessments in unknown EPT faunas. PMID:20003245
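The haplotype clusters described above are delimited by a 2% sequence threshold. One way to picture this is single-linkage clustering on pairwise distances, sketched below. The sketch assumes pre-aligned sequences of equal length, uncorrected p-distances and a "less than or equal to" cutoff. These choices, the function names and the toy sequences are assumptions for illustration; barcode studies often use a corrected distance such as Kimura 2-parameter, and the abstract does not specify the clustering algorithm.

```python
from typing import Dict, List


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def cluster_by_threshold(seqs: List[str], threshold: float = 0.02) -> List[List[int]]:
    """Single-linkage clusters of sequence indices joined at <= threshold distance."""
    parent = list(range(len(seqs)))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(seqs)):
        for j in range(i + 1, len(seqs)):
            if p_distance(seqs[i], seqs[j]) <= threshold:
                parent[find(i)] = find(j)

    clusters: Dict[int, List[int]] = {}
    for i in range(len(seqs)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy 100-bp "barcodes": the second differs from the first at 1 site (1%),
# the third differs from the first at 8 sites (8%).
base = "ACGT" * 25
toy = [base, "T" + base[1:], "T" * 10 + base[10:]]
print(cluster_by_threshold(toy))
```

With these toy sequences the first two fall into one cluster and the third forms its own, mirroring how a fixed threshold separates within-species variation from between-species divergence.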
Sequence Variants and Haplotype Analysis of Cat ERBB2 Gene: A Survey on Spontaneous Cat Mammary Neoplastic and Non-Neoplastic Lesions

PubMed Central

Santos, Sara; Bastos, Estela; Baptista, Cláudia S.; Sá, Daniela; Caloustian, Christophe; Guedes-Pinto, Henrique; Gärtner, Fátima; Gut, Ivo G.; Chaves, Raquel

2012-01-01

The human ERBB2 proto-oncogene is widely considered a key gene involved in human breast cancer onset and progression. Among spontaneous tumors, mammary tumors are the most frequent cause of cancer death in cats and the second most frequent in humans. In fact, naturally occurring tumors in domestic animals, more particularly cat mammary tumors, have been proposed as a good model for human breast cancer, but critical genetic and molecular information is still scarce. This study aimed to analyze partial sequences of the cat ERBB2 gene (between exons 17 and 20) in order to characterize a normal population and a heterogeneous mammary lesion population. Cat genomic DNA was extracted from normal frozen samples (n = 16) and from frozen and formalin-fixed paraffin-embedded mammary lesion samples (n = 41). We amplified and sequenced two cat ERBB2 DNA fragments comprising exons 17 to 20. It was possible to identify five sequence variants and six haplotypes in the total population. Two sequence variants and two haplotypes were shown to be specific for cat mammary tumor samples. Bioinformatics analysis predicts that four of the sequence variants can produce alternative transcripts or activate cryptic splicing sites. Also, a possible association was identified between clinicopathological traits and the variant haplotypes. As far as we know, this is the first attempt to examine ERBB2 genetic variations in the cat mammary genome and their possible association with the onset and progression of cat mammary tumors. The demonstration of a possible association between primary tumor size (one of the two most important prognostic factors) and the number of masses with the cat ERBB2 variant haplotypes reveals the importance of the analysis of this gene in veterinary medicine. PMID:22489125
Capturing the 'ome': the expanding molecular toolbox for RNA and DNA library construction.

PubMed

Boone, Morgane; De Koker, Andries; Callewaert, Nico

2018-04-06

All sequencing experiments and most functional genomics screens rely on the generation of libraries to comprehensively capture pools of targeted sequences. In the past decade especially, driven by the progress in the field of massively parallel sequencing, numerous studies have comprehensively assessed the impact of particular manipulations on library complexity and quality, and characterized the activities and specificities of several key enzymes used in library construction. Fortunately, careful protocol design and reagent choice can substantially mitigate many of these biases, and enable reliable representation of sequences in libraries. This review aims to guide the reader through the vast expanse of literature on the subject to promote informed library generation, independent of the application.
Recent advances in plant centromere biology.

PubMed

Feng, Chao; Liu, YaLin; Su, HanDong; Wang, HeFei; Birchler, James; Han, FangPu

2015-03-01

The centromere, which is one of the essential parts of a chromosome, controls kinetochore formation and chromosome segregation during mitosis and meiosis. While centromere function is conserved in eukaryotes, the centromeric DNA sequences evolve rapidly and have few similarities among species. The histone H3 variant CENH3 (CENP-A in human), which mostly exists in centromeric nucleosomes, is a universal active centromere mark in eukaryotes and plays an essential role in centromere identity determination. The relationship between centromeric DNA sequences and centromere identity determination is one of the intriguing questions in studying centromere formation. Due to the discoveries in the past decades, including "neocentromeres" and "centromere inactivation", it is now believed that the centromere identity is determined by epigenetic mechanisms. This review will present recent progress in plant centromere biology.
Incidence and progression of Parvovirus B19 infection and molecular changes in circulating B19V strains in children with haematological malignancy: A follow up study.

PubMed

Jain, Amita; Jain, Parul; Kumar, Archana; Prakash, Shantanu; Khan, Danish Nasar; Kant, Ravi

2018-01-01

The present study was planned to estimate the incidence of human Parvovirus B19 infection and understand its progression in children with haematological malignancy. The circulating B19V genotypes and viral mutations occurring in strains of B19V over a one-year period were also studied. Children with malignancies were enrolled consecutively and were followed up for one year. A serum sample was collected at the time of enrolment and at each follow-up visit and was tested for anti-B19V IgG and IgM as well as for B19V DNA. At least one B19V DNA-positive sample from each patient was processed for sequencing. For patients positive for B19V DNA more than once and at least 6 months apart, the last positive sample from the same patient was also sequenced to study the nucleotide change over time. We found a very high incidence of B19V infection (100%) in the study population. All the patients tested positive for at least one B19V infection parameter (either antibodies or DNA) at least once over one year of follow-up. Cumulative percent positivity of anti-B19V IgG, anti-B19V IgM and B19V DNA was 85.3%, 45.2% and 72.1% respectively. Genotype 3b was reported, with occasional nucleotide change over the one-year period. DNA clearance was delayed in spite of the appearance of IgG antibodies. Appearance of the IgM class of antibodies was either delayed or absent. To conclude, children with haematological malignancies have a high incidence of B19V infection with a late and short-lived serological response and persistence of DNA for a long duration. Copyright © 2017 Elsevier B.V. All rights reserved.
Cloning and characterization of an autonomous replication sequence from Coxiella burnetii.

PubMed Central

Suhan, M; Chen, S Y; Thompson, H A; Hoover, T A; Hill, A; Williams, J C

1994-01-01

A Coxiella burnetii chromosomal fragment capable of functioning as an origin for the replication of a kanamycin resistance (Kanr) plasmid was isolated by use of origin search methods utilizing an Escherichia coli host. The 5.8-kb fragment was subcloned into phagemid vectors and was deleted progressively by an exonuclease III-S1 technique. Plasmids containing progressively shorter DNA fragments were then tested for their capability to support replication by transformation of an E. coli polA strain. A minimal autonomous replication sequence (ARS) was delimited to 403 bp. Sequencing of the entire 5.8-kb region revealed that the minimal ARS contained two consensus DnaA boxes, three A + T-rich 21-mers, a transcriptional promoter leading rightwards, and potential integration host factor and factor of inversion stimulation binding sites. Database comparisons of deduced amino acid sequences revealed that open reading frames located around the ARS were homologous to genes often, but not always, found near bacterial chromosomal origins; these included identities with rpmH and rnpA in E. coli and identities with the 9K protein and 60K membrane protein in E. coli and Pseudomonas species. These and direct hybridization data suggested that the ARS was chromosomal and not associated with the resident plasmid QpH1. Two-dimensional agarose gel electrophoresis did not reveal the presence of initiating intermediates, indicating that the ARS did not initiate chromosome replication during laboratory growth of C. burnetii. PMID:8071197
[Research Progress on the Detection Method of DNA Methylation and Its Application in Forensic Science].

PubMed

Nie, Y C; Yu, L J; Guan, H; Zhao, Y; Rong, H B; Jiang, B W; Zhang, T

2017-06-01

As an important epigenetic marker, DNA methylation is involved in gene regulation and has attracted widespread attention in the fields of auxology, gerontology and oncology. In forensic science, because of its relatively stable, heritable, abundant, and age-related characteristics, DNA methylation is considered a useful complement to classic genetic markers for age prediction, tissue identification, and discrimination of monozygotic twins. Various methods for DNA methylation detection have been validated, based on methylation-sensitive restriction endonucleases, bisulfite modification and methyl-CpG binding proteins. In recent years, third-generation sequencing methods have been reported to detect DNA methylation. This paper reviews the detection methods for DNA methylation and their applications in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine.
Association between mitochondrial DNA variations and Alzheimer's Disease in the ADNI cohort

PubMed Central

Lakatos, Anita; Derbeneva, Olga; Younes, Danny; Keator, David; Bakken, Trygve; Lvova, Maria; Brandon, Marty; Guffanti, Guia; Reglodi, Dora; Saykin, Andrew; Weiner, Michael; Macciardi, Fabio; Schork, Nicholas; Wallace, Douglas C.; Potkin, Steven G.

2010-01-01

Despite the central role of amyloid deposition in the development of Alzheimer's disease (AD), the pathogenesis of AD still remains elusive at the molecular level. Increasing evidence suggests that compromised mitochondrial function contributes to the aging process and thus may increase the risk of AD. Dysfunctional mitochondria contribute to reactive oxygen species (ROS) which can lead to extensive macromolecule oxidative damage and the progression of amyloid pathology. Oxidative stress and amyloid toxicity leave neurons chemically vulnerable. Because the brain relies on aerobic metabolism, it is apparent that mitochondria are critical for the cerebral function. Mitochondrial DNA sequence-changes could shift cell dynamics and facilitate neuronal vulnerability. Therefore we postulated that mitochondrial DNA sequence polymorphisms may increase the risk of AD. We evaluated the role of mitochondrial haplogroups derived from 138 mitochondrial polymorphisms in 358 Caucasian ADNI subjects. Our results indicate that the mitochondrial haplogroup UK may confer genetic susceptibility to AD independently of the APOE4 allele. PMID:20538375
A grass molecular identification system for forensic botany: a critical evaluation of the strengths and limitations.

PubMed

Ward, Jodie; Gilmore, Simon R; Robertson, James; Peakall, Rod

2009-11-01

Plant material is frequently encountered in criminal investigations but often overlooked as potential evidence. We designed a DNA-based molecular identification system for 100 Australian grasses that consisted of a series of polymerase chain reaction assays that enabled the progressive identification of grasses to different taxonomic levels. The identification system was based on DNA sequence variation at four chloroplast and two mitochondrial loci. Seventeen informative indels and 68 single-nucleotide polymorphisms were utilized as molecular markers for subfamily to species-level identification. Identifying an unknown sample required a minimum of four markers to reach subfamily level and nine markers to reach species level. The accuracy of the system was confirmed by blind tests. We have demonstrated "proof of concept" of a molecular identification system for trace botanical samples. Our evaluation suggests that the adoption of a system that combines this approach with DNA sequencing could assist the morphological identification of grasses found as forensic evidence.
Status of duckweed genomics and transcriptomics.

PubMed

Wang, W; Messing, J

2015-01-01

Duckweeds belong to the smallest flowering plants that undergo fast vegetative growth in an aquatic environment. They are commonly used in wastewater treatment and animal feed. Whereas duckweeds have been studied at the biochemical level, their reduced morphology and wide environmental adaption had not been subjected to molecular analysis until recently. Here, we review the progress that has been made in using a DNA barcode system and the sequences of chloroplast and mitochondrial genomes to identify duckweed species at the species or population level. We also review analysis of the nuclear genome sequence of Spirodela that provides new insights into fundamental biological questions. Indeed, reduced gene families and missing genes are consistent with its compact morphogenesis, aquatic floating and suppression of juvenile-to-adult transition. Furthermore, deep RNA sequencing of Spirodela at the onset of dormancy and Landoltia in exposure of nutrient deficiency illustrate the molecular network for environmental adaption and stress response, constituting major progress towards a post-genome sequencing phase, where further functional genomic details can be explored. Rapid advances in sequencing technologies could continue to promote a proliferation of genome sequences for additional ecotypes as well as for other duckweed species. © 2014 German Botanical Society and The Royal Botanical Society of the Netherlands.
ESR1 Mutations in Circulating Plasma Tumor DNA from Metastatic Breast Cancer Patients.

PubMed

Chu, David; Paoletti, Costanza; Gersch, Christina; VanDenBerg, Dustin A; Zabransky, Daniel J; Cochran, Rory L; Wong, Hong Yuen; Toro, Patricia Valda; Cidado, Justin; Croessmann, Sarah; Erlanger, Bracha; Cravero, Karen; Kyker-Snowman, Kelly; Button, Berry; Parsons, Heather A; Dalton, W Brian; Gillani, Riaz; Medford, Arielle; Aung, Kimberly; Tokudome, Nahomi; Chinnaiyan, Arul M; Schott, Anne; Robinson, Dan; Jacks, Karen S; Lauring, Josh; Hurley, Paula J; Hayes, Daniel F; Rae, James M; Park, Ben Ho

2016-02-15

Mutations in the estrogen receptor (ER)α gene, ESR1, have been identified in breast cancer metastases after progression on endocrine therapies. Because of limitations of metastatic biopsies, the reported frequency of ESR1 mutations may be underestimated. Here, we show a high frequency of ESR1 mutations using circulating plasma tumor DNA (ptDNA) from patients with metastatic breast cancer. We retrospectively obtained plasma samples from eight patients with known ESR1 mutations and three patients with wild-type ESR1 identified by next-generation sequencing (NGS) of biopsied metastatic tissues. Three common ESR1 mutations were queried for using droplet digital PCR (ddPCR). In a prospective cohort, metastatic tissue and plasma were collected contemporaneously from eight ER-positive and four ER-negative patients. Tissue biopsies were sequenced by NGS, and ptDNA ESR1 mutations were analyzed by ddPCR. In the retrospective cohort, all corresponding mutations were detected in ptDNA, with two patients harboring additional ESR1 mutations not present in their metastatic tissues. In the prospective cohort, three ER-positive patients did not have adequate tissue for NGS, and no ESR1 mutations were identified in tissue biopsies from the other nine patients. In contrast, ddPCR detected seven ptDNA ESR1 mutations in 6 of 12 patients (50%). We show that ESR1 mutations can occur at a high frequency and suggest that blood can be used to identify additional mutations not found by sequencing of a single metastatic lesion. ©2015 American Association for Cancer Research.
Circulating tumour DNA and CT monitoring in patients with untreated diffuse large B-cell lymphoma: a correlative biomarker study.

PubMed

Roschewski, Mark; Dunleavy, Kieron; Pittaluga, Stefania; Moorhead, Martin; Pepin, Francois; Kong, Katherine; Shovlin, Margaret; Jaffe, Elaine S; Staudt, Louis M; Lai, Catherine; Steinberg, Seth M; Chen, Clara C; Zheng, Jianbiao; Willis, Thomas D; Faham, Malek; Wilson, Wyndham H

2015-05-01

Diffuse large-B-cell lymphoma is curable, but when treatment fails, outcome is poor. Although imaging can help to identify patients at risk of treatment failure, they are often imprecise, and radiation exposure is a potential health risk. We aimed to assess whether circulating tumour DNA encoding the clonal immunoglobulin gene sequence could be detected in the serum of patients with diffuse large-B-cell lymphoma and used to predict clinical disease recurrence after frontline treatment. We used next-generation DNA sequencing to retrospectively analyse cell-free circulating tumour DNA in patients assigned to one of three treatment protocols between May 8, 1993, and June 6, 2013. Eligible patients had diffuse large-B-cell lymphoma, no evidence of indolent lymphoma, and were previously untreated. We obtained serial serum samples and concurrent CT scans at specified times during most treatment cycles and up to 5 years of follow-up. VDJ gene segments of the rearranged immunoglobulin receptor genes were amplified and sequenced from pretreatment specimens and serum circulating tumour DNA encoding the VDJ rearrangements was quantitated. Tumour clonotypes were identified in pretreatment specimens from 126 patients who were followed up for a median of 11 years (IQR 6·8-14·2). Interim monitoring of circulating tumour DNA at the end of two treatment cycles in 108 patients showed a 5-year time to progression of 41·7% (95% CI 22·2-60·1) in patients with detectable circulating tumour DNA and 80·2% (69·6-87·3) in those without detectable circulating tumour DNA (p<0·0001). Detectable interim circulating tumour DNA had a positive predictive value of 62·5% (95% CI 40·6-81·2) and a negative predictive value of 79·8% (69·6-87·8). Surveillance monitoring of circulating tumour DNA was done in 107 patients who achieved complete remission. 
A Cox proportional hazards model showed that the hazard ratio for clinical disease progression was 228 (95% CI 51-1022) for patients who developed detectable circulating tumour DNA during surveillance compared with patients with undetectable circulating tumour DNA (p<0·0001). Surveillance circulating tumour DNA had a positive predictive value of 88·2% (95% CI 63·6-98·5) and a negative predictive value of 97·8% (92·2-99·7) and identified risk of recurrence at a median of 3·5 months (range 0-200) before evidence of clinical disease. Surveillance circulating tumour DNA identifies patients at risk of recurrence before clinical evidence of disease in most patients and results in a reduced disease burden at relapse. Interim circulating tumour DNA is a promising biomarker to identify patients at high risk of treatment failure. National Cancer Institute and Adaptive Biotechnologies. Copyright © 2015 Elsevier Ltd. All rights reserved.
Different strategies for the detection of bioagents using electrochemical and photoelectrochemical genosensors

NASA Astrophysics Data System (ADS)

Voccia, Diego; Bettazi, Francesca; Palchetti, Ilaria

2015-10-01

In recent years various kinds of biosensors for the detection of pathogens have been developed. A genosensor consists in the immobilization, onto the surface of a chosen transducer, of an oligonucleotide with a specific base sequence called capture probe. The complementary sequence (the analytical target, i.e. a specific sequence of the DNA/RNA of the pathogen) present in the sample is recognized and captured by the probe through the hybridization reaction. The evaluation of the extent of the hybridization allows one to confirm whether the sample contains the complementary sequence of the probe or not. Electrochemical transducers have received considerable attention in connection with the detection of DNA hybridization. Moreover, recently, with the emergence of novel photoelectrochemically active species and new detection schemes, photoelectrochemistry has resulted in substantial progress in its analytical performance for biosensing applications. In this paper, some examples of electrochemical genosensors for multiplexed pathogen detection are shown. Moreover, the preliminary experiments towards the development of a photoelectrochemical genosensor using a TiO2 - nanocrystal-modified ITO electrode are discussed.
Cellular HIV-1 DNA Levels in Drug Sensitive Strains Are Equivalent to Those in Drug Resistant Strains in Newly-Diagnosed Patients in Europe

PubMed Central

Demetriou, Victoria L.; van de Vijver, David A. M. C.; Kousiappa, Ioanna; Balotta, Claudia; Clotet, Bonaventura; Grossman, Zehava; Jørgensen, Louise B.; Lepej, Snjezana Z.; Levy, Itzchak; Nielsen, Claus; Paraskevis, Dimitrios; Poljak, Mario; Roman, Francois; Ruiz, Lidia; Schmidt, Jean-Claude; Vandamme, Anne-Mieke; Van Laethem, Kristel; Vercauteren, Jurgen; Kostrikis, Leondios G.

2010-01-01

Background HIV-1 genotypic drug resistance is an important threat to the success of antiretroviral therapy and transmitted resistance has reached 9% prevalence in Europe. Studies have demonstrated that HIV-1 DNA load in peripheral blood mononuclear cells (PBMC) has predictive value for disease progression, independently of CD4 counts and plasma viral load. Methodology/Principal Findings Molecular-beacon-based real-time PCR was used to measure HIV-1 second template switch (STS) DNA in PBMC in newly-diagnosed HIV-1 patients across Europe. These patients were representative of the HIV-1 epidemic in the participating countries and were carrying either drug-resistant or sensitive viral strains. The assay design was improved from a previous version to specifically detect M-group HIV-1 and human CCR5 alleles. The findings resulted in a median of 3.32 log10 HIV-1 copies/10^6 PBMC and demonstrated for the first time no correlation between cellular HIV-1 DNA load and transmitted drug-resistance. A weak association of cellular HIV-1 DNA levels with plasma viral RNA load and CD4+ T-cell counts was also reconfirmed. Co-receptor tropism for 91% of samples, whether or not they conferred resistance, was CCR5. A comparison of pol sequences derived from RNA and DNA showed a high similarity between the two. Conclusions/Significance An improved molecular-beacon-based real-time PCR assay is reported for the measurement of HIV-1 DNA in PBMC and was used to investigate the association between cellular HIV-1 DNA levels and transmitted resistance to antiretroviral therapy in newly-diagnosed patients from across Europe. The findings show no correlation between these two parameters, suggesting that transmitted resistance does not impact disease progression in HIV-1 infected individuals. The CCR5 co-receptor tropism predominance implies that both resistant and non-resistant strains behave similarly in early infection. 
Furthermore, a correlation found between RNA- and DNA-derived sequences in the pol region suggests that genotypic drug-resistance testing could be carried out on either template. PMID:20544014
Why double-stranded RNA resists condensation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tolokh, Igor S.; Pabit, Suzette; Katz, Andrea M.

2014-09-15

The addition of small amounts of multivalent cations to solutions containing double-stranded DNA leads to attraction between the negatively charged helices and eventually to condensation. Surprisingly, this effect is suppressed in double-stranded RNA, which carries the same charge as the DNA, but assumes a different double helical form. However, additional characterization of short (25 base-pairs) nucleic acid (NA) duplex structures by circular dichroism shows that measured differences in condensation are not solely determined by duplex helical geometry. Here we combine experiment, theory, and atomistic simulations to propose a mechanism that connects the observed variations in condensation of short NA duplexes with the spatial variation of cobalt hexammine (CoHex) binding at the NA duplex surface. The atomistic picture that emerged shows that CoHex distributions around the NA reveal two major NA-CoHex binding modes -- internal and external -- distinguished by the proximity of bound CoHex to the helical axis. Decreasing trends in experimentally observed condensation propensity of the four studied NA duplexes (from B-like form of homopolymeric DNA, to mixed sequence DNA, to DNA:RNA hybrid, to A-like RNA) are explained by the progressive decrease of a single quantity: the fraction of CoHex ions in the external binding mode. Thus, while NA condensation depends on a complex interplay between various structural and sequence features, our coupled experimental and theoretical results suggest a new model in which a single parameter connects the NA condensation propensity with geometry and sequence dependence of CoHex binding.
G-quadruplex-interacting compounds alter latent DNA replication and episomal persistence of KSHV.

PubMed

Madireddy, Advaitha; Purushothaman, Pravinkumar; Loosbroock, Christopher P; Robertson, Erle S; Schildkraut, Carl L; Verma, Subhash C

2016-05-05

Kaposi's sarcoma associated herpesvirus (KSHV) establishes life-long latent infection by persisting as an extra-chromosomal episome in the infected cells and by maintaining its genome in dividing cells. KSHV achieves this by tethering its epigenome to the host chromosome by latency associated nuclear antigen (LANA), which binds in the terminal repeat (TR) region of the viral genome. Sequence analysis of the TR, a GC-rich DNA element, identified several potential Quadruplex G-Rich Sequences (QGRS). Since quadruplexes have the tendency to obstruct DNA replication, we used G-quadruplex stabilizing compounds to examine their effect on latent DNA replication and the persistence of viral episomes. Our results showed that these G-quadruplex stabilizing compounds led to the activation of dormant origins of DNA replication, with preferential bi-directional pausing of replication forks moving out of the TR region, implicating the role of the G-rich TR in the perturbation of episomal DNA replication. Over time, treatment with PhenDC3 showed a loss of viral episomes in the infected cells. Overall, these data show that G-quadruplex stabilizing compounds retard the progression of replication forks leading to a reduction in DNA replication and episomal maintenance. These results suggest a potential role for G-quadruplex stabilizers in the treatment of KSHV-associated diseases. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
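The sequence-analysis step mentioned in this abstract (scanning a GC-rich element for potential quadruplex-forming G-rich sequences) can be illustrated with a minimal sketch. The abstract does not state the search criteria used, so the motif below is an assumption: a widely used textbook pattern of four runs of at least three guanines separated by loops of 1 to 7 nucleotides. The function names are hypothetical and chosen only for this example.

```python
import re

# Assumed motif (not taken from the abstract): four G-runs of >= 3 guanines
# separated by three loops of 1-7 nucleotides each.
G4_PATTERN = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)

# Translation table used to complement A/C/G/T bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_putative_g4(seq):
    """Return (start, end, motif) for non-overlapping matches on the given strand.

    Coordinates are 0-based, end-exclusive, relative to the input sequence.
    """
    seq = seq.upper()
    return [(m.start(), m.end(), m.group()) for m in G4_PATTERN.finditer(seq)]


if __name__ == "__main__":
    toy = "ATGGGAGGGTGGGAGGGCAT"  # made-up sequence, not KSHV data
    print(find_putative_g4(toy))
    # Scan the opposite strand by searching the reverse complement.
    print(find_putative_g4(reverse_complement(toy)))
```

A regular expression reports only non-overlapping candidates and does not score them; dedicated tools apply additional scoring rules, so hits from a sketch like this are candidates for follow-up, not confirmed quadruplexes.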
The clinical, histochemical, and molecular spectrum of PEO1 (Twinkle)-linked adPEO

PubMed Central

Fratter, C.; Gorman, G.S.; Stewart, J.D.; Buddles, M.; Smith, C.; Evans, J.; Seller, A.; Poulton, J.; Roberts, M.; Hanna, M.G.; Rahman, S.; Omer, S.E.; Klopstock, T.; Schoser, B.; Kornblum, C.; Czermin, B.; Lecky, B.; Blakely, E.L.; Craig, K.; Chinnery, P.F.; Turnbull, D.M.; Horvath, R.; Taylor, R.W.

2010-01-01

Background: Mutations in the Twinkle (PEO1) gene are a recognized cause of autosomal dominant progressive external ophthalmoplegia (adPEO), resulting in the accumulation of multiple mitochondrial DNA (mtDNA) deletions and cytochrome c oxidase (COX)-deficient fibers in skeletal muscle secondary to a disorder of mtDNA maintenance. Patients typically present with isolated extraocular muscle involvement, with little apparent evidence of the clinical heterogeneity documented in other mtDNA maintenance disorders, in particular POLG-related disease. Methods: We reviewed the clinical, histochemical, and molecular genetics analysis of 33 unreported patients from 26 families together with all previous cases described in the literature to define the clinical phenotype associated with PEO1 mutations. Results: Ptosis and ophthalmoparesis were almost universal clinical features among this cohort, with 52% (17/33) reporting fatigue and 33% (11/33) having mild proximal myopathy. Features consistent with CNS involvement were rarely described; however, in 24% (8/33) of the patients, cardiac abnormalities were reported. Mitochondrial histochemical changes observed in muscle showed remarkable variability, as did the secondary mtDNA deletions, which in some patients were only detected by PCR-based assays and not Southern blotting. Moreover, we report 7 novel PEO1 variants. Conclusions: Our data suggest a shared clinical phenotype with variable mild multiorgan involvement, and that the contribution of PEO1 mutations as a cause of adPEO may well be underestimated. Direct sequencing of the PEO1 gene should be considered in adPEO patients prior to muscle biopsy. GLOSSARY adPEO = autosomal dominant progressive external ophthalmoplegia; COX = cytochrome c oxidase; IOSCA = infantile-onset spinocerebellar ataxia; mtDNA = mitochondrial DNA; PEO = progressive external ophthalmoplegia; SANDO = sensory ataxic neuropathy, dysarthria, and ophthalmoparesis; SDH = succinate dehydrogenase. PMID:20479361

3-D DNA methylation phenotypes correlate with cytotoxicity levels in prostate and liver cancer cell models

PubMed Central

2013-01-01

Background The spatial organization of the genome is being evaluated as a novel indicator of toxicity in conjunction with drug-induced global DNA hypomethylation and concurrent chromatin reorganization. 3D quantitative DNA methylation imaging (3D-qDMI) was applied as a cell-by-cell high-throughput approach to investigate this matter by assessing genome topology through represented immunofluorescent nuclear distribution patterns of 5-methylcytosine (MeC) and global DNA (4,6-diamidino-2-phenylindole = DAPI) in labeled nuclei. Methods Differential progression of global DNA hypomethylation was studied by comparatively dosing zebularine (ZEB) and 5-azacytidine (AZA). Treated and untreated (control) human prostate and liver cancer cells were subjected to confocal scanning microscopy and dedicated 3D image analysis for the following features: differential nuclear MeC/DAPI load and codistribution patterns, cell similarity based on these patterns, and corresponding differences in the topology of low-intensity MeC (LIM) and low in intensity DAPI (LID) sites. Results Both agents generated a high fraction of similar MeC phenotypes across applied concentrations. ZEB exerted similar effects at 10–100-fold higher drug concentrations than its AZA analogue: concentration-dependent progression of global cytosine demethylation, validated by measuring differential MeC levels in repeat sequences using MethyLight, and the concurrent increase in nuclear LIM densities correlated with cellular growth reduction and cytotoxicity. Conclusions 3D-qDMI demonstrated the capability of quantitating dose-dependent drug-induced spatial progression of DNA demethylation in cell nuclei, independent from interphase cell-cycle stages and in conjunction with cytotoxicity. The results support the notion of DNA methylation topology being considered as a potential indicator of causal impacts on chromatin distribution with a conceivable application in epigenetic drug toxicology. PMID:23394161
Bias-Corrected Targeted Next-Generation Sequencing for Rapid, Multiplexed Detection of Actionable Alterations in Cell-Free DNA from Advanced Lung Cancer Patients.

PubMed

Paweletz, Cloud P; Sacher, Adrian G; Raymond, Chris K; Alden, Ryan S; O'Connell, Allison; Mach, Stacy L; Kuang, Yanan; Gandhi, Leena; Kirschmeier, Paul; English, Jessie M; Lim, Lee P; Jänne, Pasi A; Oxnard, Geoffrey R

2016-02-15

Tumor genotyping is a powerful tool for guiding non-small cell lung cancer (NSCLC) care; however, comprehensive tumor genotyping can be logistically cumbersome. To facilitate genotyping, we developed a next-generation sequencing (NGS) assay using a desktop sequencer to detect actionable mutations and rearrangements in cell-free plasma DNA (cfDNA). An NGS panel was developed targeting 11 driver oncogenes found in NSCLC. Targeted NGS was performed using a novel methodology that maximizes on-target reads, and minimizes artifact, and was validated on DNA dilutions derived from cell lines. Plasma NGS was then blindly performed on 48 patients with advanced, progressive NSCLC and a known tumor genotype, and explored in two patients with incomplete tumor genotyping. NGS could identify mutations present in DNA dilutions at ≥ 0.4% allelic frequency with 100% sensitivity/specificity. Plasma NGS detected a broad range of driver and resistance mutations, including ALK, ROS1, and RET rearrangements, HER2 insertions, and MET amplification, with 100% specificity. Sensitivity was 77% across 62 known driver and resistance mutations from the 48 cases; in 29 cases with common EGFR and KRAS mutations, sensitivity was similar to droplet digital PCR. In two cases with incomplete tumor genotyping, plasma NGS rapidly identified a novel EGFR exon 19 deletion and a missed case of MET amplification. Blinded to tumor genotype, this plasma NGS approach detected a broad range of targetable genomic alterations in NSCLC with no false positives including complex mutations like rearrangements and unexpected resistance mutations such as EGFR C797S. Through use of widely available vacutainers and a desktop sequencing platform, this assay has the potential to be implemented broadly for patient care and translational research. ©2015 American Association for Cancer Research.
Bias-corrected targeted next-generation sequencing for rapid, multiplexed detection of actionable alterations in cell-free DNA from advanced lung cancer patients

PubMed Central

Paweletz, Cloud P.; Sacher, Adrian G.; Raymond, Chris K.; Alden, Ryan S.; O'Connell, Allison; Mach, Stacy L.; Kuang, Yanan; Gandhi, Leena; Kirschmeier, Paul; English, Jessie M.; Lim, Lee P.; Jänne, Pasi A.; Oxnard, Geoffrey R.

2015-01-01

Purpose Tumor genotyping is a powerful tool for guiding non-small cell lung cancer (NSCLC) care, however comprehensive tumor genotyping can be logistically cumbersome. To facilitate genotyping, we developed a next-generation sequencing (NGS) assay using a desktop sequencer to detect actionable mutations and rearrangements in cell-free plasma DNA (cfDNA). Experimental Design An NGS panel was developed targeting 11 driver oncogenes found in NSCLC. Targeted NGS was performed using a novel methodology that maximizes on-target reads, and minimizes artifact, and was validated on DNA dilutions derived from cell lines. Plasma NGS was then blindly performed on 48 patients with advanced, progressive NSCLC and a known tumor genotype, and explored in two patients with incomplete tumor genotyping. Results NGS could identify mutations present in DNA dilutions at ≥0.4% allelic frequency with 100% sensitivity/specificity. Plasma NGS detected a broad range of driver and resistance mutations, including ALK, ROS1, and RET rearrangements, HER2 insertions, and MET amplification, with 100% specificity. Sensitivity was 77% across 62 known driver and resistance mutations from the 48 cases; in 29 cases with common EGFR and KRAS mutations, sensitivity was similar to droplet digital PCR. In two cases with incomplete tumor genotyping, plasma NGS rapidly identified a novel EGFR exon 19 deletion and a missed case of MET amplification. Conclusion Blinded to tumor genotype, this plasma NGS approach detected a broad range of targetable genomic alterations in NSCLC with no false positives including complex mutations like rearrangements and unexpected resistance mutations such as EGFR C797S. Through use of widely available vacutainers and a desktop sequencing platform, this assay has the potential to be implemented broadly for patient care and translational research. PMID:26459174
Characterization of RAD9 of Saccharomyces cerevisiae and evidence that its function acts posttranslationally in cell cycle arrest after DNA damage.

PubMed

Weinert, T A; Hartwell, L H

1990-12-01

In eucaryotic cells, incompletely replicated or damaged chromosomes induce cell cycle arrest in G2 before mitosis, and in the yeast Saccharomyces cerevisiae the RAD9 gene is essential for the cell cycle arrest (T.A. Weinert and L. H. Hartwell, Science 241:317-322, 1988). In this report, we extend the analysis of RAD9-dependent cell cycle control. We found that both induction of RAD9-dependent arrest in G2 and recovery from arrest could occur in the presence of the protein synthesis inhibitor cycloheximide, showing that the mechanism of RAD9-dependent control involves a posttranslational mechanism(s). We have isolated and determined the DNA sequence of the RAD9 gene, confirming the DNA sequence reported previously (R. H. Schiestl, P. Reynolds, S. Prakash, and L. Prakash, Mol. Cell. Biol. 9:1882-1886, 1989). The predicted protein sequence for the Rad9 protein bears no similarity to sequences of known proteins. We also found that synthesis of the RAD9 transcript in the cell cycle was constitutive and not induced by X-irradiation. We constructed yeast cells containing a complete deletion of the RAD9 gene; the rad9 null mutants were viable, sensitive to X- and UV irradiation, and defective for cell cycle arrest after DNA damage. Although Rad+ and rad9 delta cells had similar growth rates and cell cycle kinetics in unirradiated cells, the spontaneous rate of chromosome loss (in unirradiated cells) was elevated 7- to 21-fold in rad9 delta cells. These studies show that in the presence of induced or endogenous DNA damage, RAD9 is a negative regulator that inhibits progression from G2 in order to preserve cell viability and to maintain the fidelity of chromosome transmission.
Fine organization of genomic regions tagged to the 5S rDNA locus of the bread wheat 5B chromosome.

PubMed

Sergeeva, Ekaterina M; Shcherban, Andrey B; Adonina, Irina G; Nesterov, Michail A; Beletsky, Alexey V; Rakitin, Andrey L; Mardanov, Andrey V; Ravin, Nikolai V; Salina, Elena A

2017-11-14

The multigene family encoding the 5S rRNA, one of the most important structural and functional parts of the large ribosomal subunit, is an obligate component of all eukaryotic genomes. 5S rDNA has long been a favored target for cytological and phylogenetic studies due to the inherent peculiarities of its structural organization, such as the tandem arrays of repetitive units and their high interspecific divergence. The complex polyploid nature of the genome of bread wheat, Triticum aestivum, and the technically difficult task of sequencing clusters of tandem repeats mean that the detailed organization of extended genomic regions containing 5S rRNA genes remains unclear. This is despite the recent progress made in wheat genomic sequencing. Using pyrosequencing of BAC clones, in this work we studied the organization of two distinct 5S rDNA-tagged regions of the 5BS chromosome of bread wheat. Three BAC clones containing 5S rDNA were identified in the 5BS chromosome-specific BAC library of Triticum aestivum. Using the results of pyrosequencing and assembly, we obtained six 5S rDNA-containing contigs with a total length of 140,417 bp, and two sets (pools) of individual 5S rDNA sequences belonging to separate, but closely located genomic regions on the 5BS chromosome. Both regions are characterized by the presence of approximately 70-80 copies of 5S rDNA; however, they are completely different in their structural organization. The first region contained highly diverged short-type 5S rDNA units that were disrupted by multiple insertions of transposable elements. The second region contained the more conserved long-type 5S rDNA, organized as a single tandem array. FISH using probes specific to both 5S rDNA unit types showed differences in the distribution and intensity of signals on the chromosomes of polyploid wheat species and their diploid progenitors. 
A detailed structural organization of two closely located 5S rDNA-tagged genomic regions on the 5BS chromosome of bread wheat has been established. These two regions differ in the organization of both 5S rDNA and the neighboring sequences comprised of transposable elements, implying different modes of evolution for these regions.
DNA Damage, DNA Repair, Aging, and Neurodegeneration

PubMed Central

Maynard, Scott; Fang, Evandro Fei; Scheibye-Knudsen, Morten; Croteau, Deborah L.; Bohr, Vilhelm A.

2015-01-01

Aging in mammals is accompanied by a progressive atrophy of tissues and organs, and stochastic damage accumulation to the macromolecules DNA, RNA, proteins, and lipids. The sequence of the human genome represents our genetic blueprint, and accumulating evidence suggests that loss of genomic maintenance may causally contribute to aging. Distinct evidence for a role of imperfect DNA repair in aging is that several premature aging syndromes have underlying genetic DNA repair defects. Accumulation of DNA damage may be particularly prevalent in the central nervous system owing to the low DNA repair capacity in postmitotic brain tissue. It is generally believed that the cumulative effects of the deleterious changes that occur in aging, mostly after the reproductive phase, contribute to species-specific rates of aging. In addition to nuclear DNA damage contributions to aging, there is also abundant evidence for a causative link between mitochondrial DNA damage and the major phenotypes associated with aging. Understanding the mechanistic basis for the association of DNA damage and DNA repair with aging and age-related diseases, such as neurodegeneration, would give insight into contravening age-related diseases and promoting a healthy life span. PMID:26385091
Emerging pathogens in the fish farming industry and sequencing-based pathogen discovery.

PubMed

Tengs, Torstein; Rimstad, Espen

2017-10-01

The use of large-scale DNA/RNA sequencing has become an integral part of biomedical research. Reduced sequencing costs and the availability of efficient computational resources have led to a revolution in how problems concerning genomics and transcriptomics are addressed. Sequencing-based pathogen discovery represents one example of how genetic data can now be used in ways that were previously considered infeasible. Emerging pathogens affect both human and animal health due to a multitude of factors, including globalization, a shifting environment and an increasing human population. Fish farming represents a relevant, interesting and challenging system to study emerging pathogens. This review summarizes recent progress in pathogen discovery using sequence data, with particular emphasis on viruses in Atlantic salmon (Salmo salar). Copyright © 2017 Elsevier Ltd. All rights reserved.
Using Genome Sequence to Enable the Design of Medicines and Chemical Probes.

PubMed

Angelbello, Alicia J; Chen, Jonathan L; Childs-Disney, Jessica L; Zhang, Peiyuan; Wang, Zi-Fu; Disney, Matthew D

2018-02-28

Rapid progress in genome sequencing technology has put us firmly into a postgenomic era. A key challenge in biomedical research is harnessing genome sequence to fulfill the promise of personalized medicine. This Review describes how genome sequencing has enabled the identification of disease-causing biomolecules and how these data have been converted into chemical probes of function, preclinical lead modalities, and ultimately U.S. Food and Drug Administration (FDA)-approved drugs. In particular, we focus on the use of oligonucleotide-based modalities to target disease-causing RNAs; small molecules that target DNA, RNA, or protein; the rational repurposing of known therapeutic modalities; and the advantages of pharmacogenetics. Lastly, we discuss the remaining challenges and opportunities in the direct utilization of genome sequence to enable design of medicines.
Identification of Single-Copy Orthologous Genes between Physalis and Solanum lycopersicum and Analysis of Genetic Diversity in Physalis Using Molecular Markers

PubMed Central

Wei, Jingli; Hu, Xiaorong; Yang, Jingjing; Yang, Wencai

2012-01-01

The genus Physalis includes a number of commercially important edible and ornamental species. Their high nutritional value and potential medicinal properties have led to increased commercial interest in the products of this genus worldwide. However, the lack of molecular markers prevents detailed study of genetics and phylogeny in Physalis, which limits progress in breeding. In the present study, we compared DNA sequences between Physalis and tomato and attempted to analyze genetic diversity in Physalis using tomato markers. A BLAST search of 23180 DNA sequences derived from Physalis against the International Tomato Annotation Group (ITAG) Release 2.3 Predicted CDS (SL2.40) identified 3356 single-copy orthologous genes between them. A total of 38 accessions from at least six species of Physalis were subjected to genetic diversity analysis using 97 tomato markers and 25 SSR markers derived from P. peruviana. The majority (73.2%) of tomato markers amplified DNA fragments from at least one accession of Physalis. Diversity in Physalis at the molecular level was also detected. The average Nei’s genetic distance between accessions was 0.3806, with a range of 0.2865 to 0.7091. These results indicate that Physalis and tomato are similar at both the molecular marker and DNA sequence levels. Therefore, molecular markers developed in tomato can be used in genetic studies of Physalis. PMID:23166835
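As a rough illustration of the distance statistic reported above, the sketch below computes Nei's standard genetic distance (the 1972 form, D = -ln I) from per-locus allele frequencies. The abstract does not say which of Nei's measures was used, so this choice is an assumption, and the function name `nei_standard_distance` and its input layout are hypothetical.

```python
import math

def nei_standard_distance(freqs_x, freqs_y):
    """Nei's (1972) standard genetic distance between two populations.

    freqs_x, freqs_y: lists of loci; each locus is a list of allele
    frequencies, with alleles in the same order in both populations.
    (Assumed input layout, chosen for illustration.)
    """
    if len(freqs_x) != len(freqs_y):
        raise ValueError("both populations need the same loci")
    jx = jy = jxy = 0.0
    for locus_x, locus_y in zip(freqs_x, freqs_y):
        jx += sum(p * p for p in locus_x)    # homozygosity in X
        jy += sum(q * q for q in locus_y)    # homozygosity in Y
        jxy += sum(p * q for p, q in zip(locus_x, locus_y))
    identity = jxy / math.sqrt(jx * jy)      # Nei's genetic identity I
    if identity == 0:
        return math.inf                      # no shared alleles at any locus
    return -math.log(identity)
```

Summing over loci instead of averaging gives the same identity, because the number of loci cancels in the ratio.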
The genome of Eimeria spp., with special reference to Eimeria tenella--a coccidium from the chicken.

PubMed

Shirley, M W

2000-04-10

Eimeria spp. contain at least four genomes. The nuclear genome is best studied in the avian species Eimeria tenella and comprises about 60 Mbp DNA contained within ca. 14 chromosomes; other avian and lapine species appear to possess a nuclear genome of similar size. In addition, sequence data and hybridisation studies have provided direct evidence for extrachromosomal mitochondrial and plastid DNA genomes, and double-stranded RNA segments have also been described. The unique phenotype of "precocious" development that characterises some selected lines of Eimeria spp. not only provides the basis for the first generation of live attenuated vaccines, but offers a significant entrée into studies on the regulation of an apicomplexan life-cycle. With a view to identifying loci implicated in the trait of precocious development, a genetic linkage map of the genome of E. tenella is being constructed in this laboratory from analyses of the inheritance of over 400 polymorphic DNA markers in the progeny of a cross between complementary drug-resistant and precocious parents. Other projects that impinge directly or indirectly on the genome and/or genetics of Eimeria spp. are currently in progress in several laboratories, and include the derivation of expressed sequence tag data and the development of ancillary technologies such as transfection techniques. No large-scale genomic DNA sequencing projects have been reported.
Spatial and temporal plasticity of chromatin during programmed DNA-reorganization in Stylonychia macronuclear development

PubMed Central

Postberg, Jan; Heyse, Katharina; Cremer, Marion; Cremer, Thomas; Lipps, Hans J

2008-01-01

Background: In this study we exploit the unique genome organization of ciliates to characterize the biological function of histone modification patterns and chromatin plasticity for the processing of specific DNA sequences during a nuclear differentiation process. Ciliates are single-cell eukaryotes containing two morphologically and functionally specialized types of nuclei, the somatic macronucleus and the germline micronucleus. In the course of sexual reproduction a new macronucleus develops from a micronuclear derivative. During this process specific DNA sequences are eliminated from the genome, while sequences that will be transcribed in the mature macronucleus are retained. Results: We show by immunofluorescence microscopy, Western analyses and chromatin immunoprecipitation (ChIP) experiments that each nuclear type establishes its specific histone modification signature. Our analyses reveal that the early macronuclear anlage adopts a permissive chromatin state immediately after the fusion of two heterochromatic germline micronuclei. As macronuclear development progresses, repressive histone modifications that specify sequences to be eliminated are introduced de novo. ChIP analyses demonstrate that permissive histone modifications are associated with sequences that will be retained in the new macronucleus. Furthermore, our data support the hypothesis that a PIWI-family protein is involved in a transnuclear cross-talk and in the RNAi-dependent control of developmental chromatin reorganization. Conclusion: Based on these data we present a comprehensive analysis of the spatial and temporal pattern of histone modifications during this nuclear differentiation process. Results obtained in this study may also be relevant for our understanding of chromatin plasticity during metazoan embryogenesis. PMID:19014664
Mutation-based detection and monitoring of cell-free tumor DNA in peripheral blood of cancer patients.

PubMed

Benesova, L; Belsanova, B; Suchanek, S; Kopeckova, M; Minarikova, P; Lipska, L; Levy, M; Visokai, V; Zavoral, M; Minarik, M

2013-02-15

Prognosis of solid cancers is generally more favorable if the disease is treated early and efficiently. A key to long cancer survival is in radical surgical therapy directed at the primary tumor followed by early detection of possible progression, with swift application of subsequent therapeutic intervention reducing the risk of disease generalization. The conventional follow-up care is based on regular observation of tumor markers in combination with computed tomography/endoscopic ultrasound/magnetic resonance/positron emission tomography imaging to monitor potential tumor progression. A recent development in methodologies allowing screening for a presence of cell-free DNA (cfDNA) brings a new viable tool in early detection and management of major cancers. It is believed that cfDNA is released from tumors primarily due to necrotization, whereas the origin of nontumorous cfDNA is mostly apoptotic. The process of cfDNA detection starts with proper collection and treatment of blood and isolation and storage of blood plasma. The next important steps include cfDNA extraction from plasma and its detection and/or quantification. To distinguish tumor cfDNA from nontumorous cfDNA, specific somatic DNA mutations, previously localized in the primary tumor tissue, are identified in the extracted cfDNA. Apart from conventional mutation detection approaches, several dedicated techniques have been presented to detect low levels of cfDNA in an excess of nontumorous (nonmutated) DNA, including real-time polymerase chain reaction (PCR), "BEAMing" (beads, emulsion, amplification, and magnetics), and denaturing capillary electrophoresis. Techniques to facilitate the mutant detection, such as mutant-enriched PCR and COLD-PCR (coamplification at lower denaturation temperature PCR), are also applicable. Finally, a number of newly developed miniaturized approaches, such as single-molecule sequencing, are promising for the future. Copyright © 2012 Elsevier Inc. All rights reserved.
Stereochemical analysis of the functional significance of the conserved inverted CCAAT and TATA elements in the rat bone sialoprotein gene promoter.

PubMed

Su, Ming; Lee, Daniel; Ganss, Bernhard; Sodek, Jaro

2006-04-14

Basal transcription of the bone sialoprotein gene is mediated by highly conserved inverted CCAAT (ICE; ATTGG) and TATA elements (TTTATA) separated by precisely 21 nucleotides. Here we studied the importance of the relative position and orientation of the CCAAT and TATA elements in the proximal promoter by measuring the transcriptional activity of a series of mutated reporter constructs in transient transfection assays. Whereas inverting the TTTATA (wild type) to a TATAAA (consensus TATA) sequence increased transcription slightly, transcription was reduced when the flanking dinucleotides were also inverted. In contrast, reversing the ATTGG (wild type; ICE) to a CCAAT (RICE) sequence caused a marked reduction in transcription, whereas both transcription and NF-Y binding were progressively increased with the simultaneous inversion of flanking nucleotides (f-RICE-f). Reducing the distance between the ICE and TATA elements produced cyclical changes in transcriptional activity that correlated with progressive alterations in the relative positions of the CCAAT and TATA elements on the face of the DNA helix. Minimal transcription was observed after 5 nucleotides were deleted (equivalent to approximately one half turn of the helix), whereas transcription was fully restored after deleting 10 nucleotides (approximately one full turn of the DNA helix), transcriptional activity being progressively lost with deletions beyond 10 nucleotides. In comparison, when deletions were made with the ICE in the reversed (f-RICE-f) orientation transcriptional activity was progressively lost with no recovery. These results show that, although transcription can still occur when the CCAAT box is reversed and/or displaced relative to the TATA box, the activity is dependent upon the flexibility of the intervening DNA helix needed to align the NF-Y complex on the CCAAT box with preinitiation complex proteins that bind to the TATA box. 
Thus, the precise location and orientation of the CCAAT element are necessary for optimizing basal transcription of the bone sialoprotein gene.
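The cyclical effect of deletions can be pictured with a small calculation. The sketch below assumes roughly 10.5 base pairs per helical turn, a textbook figure for B-form DNA that is not stated in the abstract, and the function name `rotation_after_deletion` is hypothetical. It reports how far one element rotates around the helix relative to the other when a given number of nucleotides is deleted from the spacer.

```python
BP_PER_TURN = 10.5  # assumed B-DNA helical repeat; not given in the abstract

def rotation_after_deletion(deleted_nt, bp_per_turn=BP_PER_TURN):
    """Angular offset in degrees (0 to 360) introduced by deleting
    `deleted_nt` base pairs between two promoter elements."""
    return (deleted_nt * 360.0 / bp_per_turn) % 360.0

# Compare the deletions discussed in the abstract.
for n in (0, 5, 10):
    print(n, "nt deleted ->", round(rotation_after_deletion(n), 1), "degrees")
```

Under this assumption a 5-nucleotide deletion places the two elements on nearly opposite faces of the helix, while a 10-nucleotide deletion brings them almost back into register, in line with the loss and recovery of transcription reported above.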
Next-generation Sequencing-based genomic profiling: Fostering innovation in cancer care?

PubMed

Fernandes, Gustavo S; Marques, Daniel F; Girardi, Daniel M; Braghiroli, Maria Ignez F; Coudry, Renata A; Meireles, Sibele I; Katz, Artur; Hoff, Paulo M

2017-10-01

With the development of next-generation sequencing (NGS) technologies, DNA sequencing has been increasingly utilized in clinical practice. Our goal was to investigate the impact of genomic evaluation on treatment decisions for heavily pretreated patients with metastatic cancer. We analyzed metastatic cancer patients from a single institution whose cancers had progressed after all available standard-of-care therapies and whose tumors underwent next-generation sequencing analysis. We determined the percentage of patients who received any therapy directed by the test, and its efficacy. From July 2013 to December 2015, 185 consecutive patients were tested using a commercially available next-generation sequencing-based test, and 157 patients were eligible. Sixty-six patients (42.0%) were female, and 91 (58.0%) were male. The mean age at diagnosis was 52.2 years, and the mean number of pre-test lines of systemic treatment was 2.7. One hundred and seventy-seven patients (95.6%) had at least one identified gene alteration. Twenty-four patients (15.2%) underwent systemic treatment directed by the test result. Of these, one patient had a complete response, four (16.7%) had partial responses, two (8.3%) had stable disease, and 17 (70.8%) had disease progression as the best result. The median progression-free survival time with matched therapy was 1.6 months, and the median overall survival was 10 months. We identified a high prevalence of gene alterations using a next-generation sequencing test. Although some benefit was associated with the matched therapy, most of the patients had disease progression as the best response, indicating the limited biological potential and unclear clinical relevance of this practice.
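The response percentages above follow directly from the counts among the 24 patients who received matched therapy. A minimal sketch of that arithmetic is shown here; the dictionary keys and variable names are labels chosen for illustration.

```python
# Best responses among the 24 patients treated according to the test result
# (counts taken from the abstract).
best_response = {
    "complete response": 1,
    "partial response": 4,
    "stable disease": 2,
    "disease progression": 17,
}

treated = sum(best_response.values())

# Percentage of treated patients in each category, rounded to one decimal.
rates = {k: round(100.0 * v / treated, 1) for k, v in best_response.items()}

for outcome, pct in rates.items():
    print(f"{outcome}: {best_response[outcome]}/{treated} = {pct}%")
```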
Approaching the taxonomic affiliation of unidentified sequences in public databases--an example from the mycorrhizal fungi.

PubMed

Nilsson, R Henrik; Kristiansson, Erik; Ryberg, Martin; Larsson, Karl-Henrik

2005-07-18

During the last few years, DNA sequence analysis has become one of the primary means of taxonomic identification of species, particularly so for species that are minute or otherwise lack distinct, readily obtainable morphological characters. Although the number of sequences available for comparison in public databases such as GenBank increases exponentially, only a minuscule fraction of all organisms have been sequenced, leaving taxon sampling a momentous problem for sequence-based taxonomic identification. When querying GenBank with a set of unidentified sequences, a considerable proportion typically lack fully identified matches, forming an ever-mounting pile of sequences that the researcher will have to monitor manually in the hope that new, clarifying sequences have been submitted by other researchers. To alleviate these concerns, a project to automatically monitor select unidentified sequences in GenBank for taxonomic progress through repeated local BLAST searches was initiated. Mycorrhizal fungi--a field where species identification often is prohibitively complex--and the much-used ITS locus were chosen as the test bed. A Perl script package called emerencia is presented. On a regular basis, it downloads select sequences from GenBank, separates the identified sequences from those insufficiently identified, and performs BLAST searches between these two datasets, storing all results in an SQL database. On the accompanying web-service http://emerencia.math.chalmers.se, users can monitor the taxonomic progress of insufficiently identified sequences over time, either through active searches or by signing up for e-mail notification upon disclosure of better matches. Other search categories, such as listing all insufficiently identified sequences (and their present best fully identified matches) publication-wise, are also available.
The ever-increasing use of DNA sequences for identification purposes largely falls back on the assumption that public sequence databases contain a thorough sampling of taxonomically well-annotated sequences. Taxonomy, held by some to be an old-fashioned trade, has accordingly never been more important. emerencia does not automate the taxonomic process, but it does allow researchers to focus their efforts elsewhere than countless manual BLAST runs and arduous sieving of BLAST hit lists. The emerencia system is available on an open source basis for local installation with any organism and gene group as targets.
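The workflow described above (split sequences into identified and insufficiently identified sets, search one set against the other, store the results in an SQL database) can be sketched in a few lines. This is not the emerencia code, which is a Perl package; it is a minimal Python illustration under stated assumptions. The keyword list in `VAGUE_TERMS`, the table layout, and the helper names are all hypothetical, and the BLAST search itself is left out, so the best match is supplied by the caller.

```python
import sqlite3

# Hypothetical markers of a name that lacks a full species identification.
VAGUE_TERMS = ("uncultured", "unidentified", " sp.", "environmental sample")

def is_insufficiently_identified(organism_name):
    """Heuristic (assumed): True if the annotated name looks incomplete."""
    name = organism_name.lower()
    return any(term in name for term in VAGUE_TERMS)

def split_records(records):
    """records: iterable of (accession, organism_name) pairs.
    Returns (identified, insufficiently_identified) lists."""
    identified, unidentified = [], []
    for accession, organism in records:
        if is_insufficiently_identified(organism):
            unidentified.append((accession, organism))
        else:
            identified.append((accession, organism))
    return identified, unidentified

def open_store(path=":memory:"):
    """Open (or create) the result store; the schema is illustrative only."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS best_match ("
        "query TEXT, hit TEXT, hit_name TEXT, run_date TEXT)"
    )
    return conn

def record_best_match(conn, query, hit, hit_name, run_date):
    """Store one search result so progress can be compared across runs."""
    conn.execute(
        "INSERT INTO best_match VALUES (?, ?, ?, ?)",
        (query, hit, hit_name, run_date),
    )
    conn.commit()
```

Keeping one row per query and run date is what would let a later run report whether a better fully identified match has appeared for a given sequence.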
Deciphering the Epigenetic Code: An Overview of DNA Methylation Analysis Methods

PubMed Central

Umer, Muhammad

2013-01-01

Abstract Significance: Methylation of cytosine in DNA is linked with gene regulation, and this has profound implications in development, normal biology, and disease conditions in many eukaryotic organisms. A wide range of methods and approaches exist for its identification, quantification, and mapping within the genome. While the earliest approaches were nonspecific and were at best useful for quantification of total methylated cytosines in the chunk of DNA, this field has seen considerable progress and development over the past decades. Recent Advances: Methods for DNA methylation analysis differ in their coverage and sensitivity, and the method of choice depends on the intended application and desired level of information. Potential results include global methyl cytosine content, degree of methylation at specific loci, or genome-wide methylation maps. Introduction of more advanced approaches to DNA methylation analysis, such as microarray platforms and massively parallel sequencing, has brought us closer to unveiling the whole methylome. Critical Issues: Sensitive quantification of DNA methylation from degraded and minute quantities of DNA and high-throughput DNA methylation mapping of single cells still remain a challenge. Future Directions: Developments in DNA sequencing technologies as well as the methods for identification and mapping of 5-hydroxymethylcytosine are expected to augment our current understanding of epigenomics. Here we present an overview of methodologies available for DNA methylation analysis with special focus on recent developments in genome-wide and high-throughput methods. While the application focus relates to cancer research, the methods are equally relevant to broader issues of epigenetics and redox science in this special forum. Antioxid. Redox Signal. 18, 1972–1986. PMID:23121567
Toxicogenomics and Cancer Susceptibility: Advances with Next-Generation Sequencing

PubMed Central

Ning, Baitang; Su, Zhenqiang; Mei, Nan; Hong, Huixiao; Deng, Helen; Shi, Leming; Fuscoe, James C.; Tolleson, William H.

2017-01-01

The aim of this review is to comprehensively summarize the recent achievements in the field of toxicogenomics and cancer research regarding genetic-environmental interactions in carcinogenesis and detection of genetic aberrations in cancer genomes by next-generation sequencing technology. Cancer is primarily a genetic disease in which genetic factors and environmental stimuli interact to cause genetic and epigenetic aberrations in human cells. Mutations in the germline act as either high-penetrance alleles that strongly increase the risk of cancer development, or as low-penetrance alleles that mildly change an individual’s susceptibility to cancer. Somatic mutations, resulting from either DNA damage induced by exposure to environmental mutagens or from spontaneous errors in DNA replication or repair are involved in the development or progression of the cancer. Induced or spontaneous changes in the epigenome may also drive carcinogenesis. Advances in next-generation sequencing technology provide us opportunities to accurately, economically, and rapidly identify genetic variants, somatic mutations, gene expression profiles, and epigenetic alterations with single-base resolution. Whole genome sequencing, whole exome sequencing, and RNA sequencing of paired cancer and adjacent normal tissue present a comprehensive picture of the cancer genome. These new findings should benefit public health by providing insights in understanding cancer biology, and in improving cancer diagnosis and therapy. PMID:24875441
Scar-less multi-part DNA assembly design automation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hillson, Nathan J.

The present invention provides a method of designing an implementation of a DNA assembly. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding flanking homology sequences to each of the DNA oligos. In another exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding optimized overhang sequences to each of the DNA oligos.
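One way to picture the planning step for flanking homology is to take, for each junction in the requested order, the last few bases of the upstream fragment and attach them as a 5' flank to the downstream fragment. The sketch below does only that; it illustrates the general idea of flanking homology and is not the patented method. The function name `plan_flanking_homology`, the default homology length, and the output layout are assumptions.

```python
def plan_flanking_homology(fragments, homology_len=20):
    """Plan 5' homology flanks for a linear, ordered assembly.

    fragments: list of (name, sequence) in assembly order.
    Returns one dict per fragment with the flank to prepend.
    The first fragment has no upstream neighbour and gets no flank.
    """
    if homology_len <= 0:
        raise ValueError("homology_len must be positive")
    plan = []
    for i, (name, seq) in enumerate(fragments):
        if i == 0:
            flank = ""
        else:
            upstream_seq = fragments[i - 1][1]
            # Tail of the upstream fragment becomes the shared homology.
            flank = upstream_seq[-homology_len:]
        plan.append({
            "fragment": name,
            "flank_5prime": flank,
            "extended": flank + seq,
        })
    return plan
```

A real design tool would also choose flank lengths per junction and check them for secondary structure or repeats; none of that is attempted here.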
Label-free detection of real-time DNA amplification using a nanofluidic diffraction grating

NASA Astrophysics Data System (ADS)

Yasui, Takao; Ogawa, Kensuke; Kaji, Noritada; Nilsson, Mats; Ajiri, Taiga; Tokeshi, Manabu; Horiike, Yasuhiro; Baba, Yoshinobu

2016-08-01

Quantitative DNA amplification using fluorescence labeling has played an important role in the recent, rapid progress of basic medical and molecular biological research. Here we report a label-free detection of real-time DNA amplification using a nanofluidic diffraction grating. Our detection system observed intensity changes during DNA amplification of diffracted light derived from the passage of a laser beam through nanochannels embedded in a microchannel. Numerical simulations revealed that the diffracted light intensity change in the nanofluidic diffraction grating was attributed to the change of refractive index. We showed the first case reported to date for label-free detection of real-time DNA amplification, such as specific DNA sequences from tubercle bacilli (TB) and human papillomavirus (HPV). Since our developed system allows quantification of the initial concentration of amplified DNA molecules ranging from 1 fM to 1 pM, we expect that it will offer a new strategy for developing fundamental techniques of medical applications.

Genetic resources offer efficient tools for rice functional genomics research.

PubMed

Lo, Shuen-Fang; Fan, Ming-Jen; Hsing, Yue-Ie; Chen, Liang-Jwu; Chen, Shu; Wen, Ien-Chie; Liu, Yi-Lun; Chen, Ku-Ting; Jiang, Mirng-Jier; Lin, Ming-Kuang; Rao, Meng-Yen; Yu, Lin-Chih; Ho, Tuan-Hua David; Yu, Su-May

2016-05-01

Rice is an important crop and major model plant for monocot functional genomics studies. With the establishment of various genetic resources for rice genomics, the next challenge is to systematically assign functions to predicted genes in the rice genome. Compared with the robustness of genome sequencing and bioinformatics techniques, progress in understanding the function of rice genes has lagged, hampering the utilization of rice genes for cereal crop improvement. The use of transfer DNA (T-DNA) insertional mutagenesis offers the advantage of uniform distribution throughout the rice genome, but preferentially in gene-rich regions, resulting in direct gene knockout or activation of genes within 20-30 kb up- and downstream of the T-DNA insertion site and high gene tagging efficiency. Here, we summarize the recent progress in functional genomics using the T-DNA-tagged rice mutant population. We also discuss important features of T-DNA activation- and knockout-tagging and promoter-trapping of the rice genome in relation to mutant and candidate gene characterizations and how to more efficiently utilize rice mutant populations and datasets for high-throughput functional genomics and phenomics studies by forward and reverse genetics approaches. These studies may facilitate the translation of rice functional genomics research to improvements of rice and other cereal crops. © 2015 John Wiley & Sons Ltd.
Functional genomics of bio-energy plants and related patent activities.

PubMed

Jiang, Shu-Ye; Ramachandran, Srinivasan

2013-04-01

With dwindling fossil oil resources and increased economic growth of many developing countries due to globalization, energy derived sustainably from an alternative source such as bio-energy is the need of the hour. However, production of energy from biological sources is relatively expensive because the low starch and sugar contents of bioenergy plants lead to lower oil yield and reduced quality, along with lower conversion efficiency of feedstock. In this context, genetic improvement of bio-energy plants offers a viable solution. In this manuscript, we review the current status of functional genomics studies and related patent activities in bio-energy plants. Currently, the genomes of a considerable number of bio-energy plants have been sequenced or are being sequenced, and a large number of expressed sequence tags (ESTs) or cDNA sequences are available from them. These studies provide fundamental data for more reliable genome annotation and, as a result, several genomes have been annotated at a genome-wide level. In addition to this effort, various mutagenesis tools have been employed to develop mutant populations for characterization of genes that are involved in bioenergy quantitative traits. With the progress made on functional genomics of important bio-energy plants, more patents have been filed, a significant number of them focusing on genes and DNA sequences that may be involved in the improvement of bio-energy traits, including higher yield and quality of starch, sugar and oil. We also believe that these studies will lead to the generation of genetically altered plants with improved tolerance to various abiotic and biotic stresses.
Expression of disease resistance in genetically modified grapevines correlates with the contents of viral sequences in the T-DNA and global genome methylation.

PubMed

Dal Bosco, Daniela; Sinski, Iraci; Ritschel, Patrícia S; Camargo, Umberto A; Fajardo, Thor V M; Harakava, Ricardo; Quecini, Vera

2018-06-06

Increased tolerance to pathogens is an important goal in conventional and biotechnology-assisted grapevine breeding programs worldwide. Fungal and viral pathogens cause direct losses in berry production, but also affect the quality of the final products. Precision breeding strategies allow the introduction of resistance characters in elite cultivars, although the factors determining the plant's overall performance are not fully characterized. Grapevine plants expressing defense proteins, from fungal or plant origins, or of the coat protein gene of grapevine leafroll-associated virus 3 (GLRaV-3) were generated by Agrobacterium-mediated transformation of somatic embryos and shoot apical meristems. The responses of the transformed lines to pathogen challenges were investigated by biochemical, phytopathological and molecular methods. The expression of a Metarhizium anisopliae chitinase gene delayed pathogenesis and disease progression against the necrotrophic pathogen Botrytis cinerea. Modified lines expressing a Solanum nigrum osmotin-like protein also exhibited slower disease progression, but to a smaller extent. Grapevine lines carrying two hairpin-inducing constructs had lower GLRaV-3 titers when challenged by grafting, although disease symptoms and viral multiplication were detected. The levels of global genome methylation were determined for the genetically engineered lines, and correlation analyses demonstrated the association between higher levels of methylated DNA and larger portions of virus-derived sequences. Resistance expression was also negatively correlated with the contents of introduced viral sequences and genome methylation, indicating that the effectiveness of resistance strategies employing sequences of viral origin is subject to epigenetic regulation in grapevine.
Origin and composition of cell-free DNA in spent medium from human embryo culture during preimplantation development.

PubMed

Vera-Rodriguez, M; Diez-Juan, A; Jimenez-Almazan, J; Martinez, S; Navarro, R; Peinado, V; Mercader, A; Meseguer, M; Blesa, D; Moreno, I; Valbuena, D; Rubio, C; Simon, C

2018-04-01

What is the origin and composition of cell-free DNA in human embryo spent culture media? Cell-free DNA from human embryo spent culture media represents a mix of maternal and embryonic DNA, and the mixture can be more complex for mosaic embryos. In 2016, ~300 000 human embryos were chromosomally and/or genetically analyzed using preimplantation genetic testing for aneuploidies (PGT-A) or monogenic disorders (PGT-M) before transfer into the uterus. While progress in genetic techniques has enabled analysis of the full karyotype in a single cell with high sensitivity and specificity, these approaches still require an embryo biopsy. Thus, non-invasive techniques are sought as an alternative. This study was based on a total of 113 human embryos undergoing trophectoderm biopsy as part of PGT-A analysis. For each embryo, the spent culture media used between Day 3 and Day 5 of development were collected for cell-free DNA analysis. In addition to the 113 spent culture media samples, 28 media drops without embryo contact were cultured in parallel under the same conditions to use as controls. In total, 141 media samples were collected and divided into two groups: one for direct DNA quantification (53 spent culture media and 17 controls), the other for whole-genome amplification (60 spent culture media and 11 controls) and subsequent quantification. Some samples with amplified DNA (N = 56) were used for aneuploidy testing by next-generation sequencing; of those, 35 samples underwent single-nucleotide polymorphism (SNP) sequencing to detect maternal contamination. Finally, from the 35 spent culture media analyzed by SNP sequencing, 12 whole blastocysts were analyzed by fluorescence in situ hybridization (FISH) to determine the level of mosaicism in each embryo, as a possible origin for discordance between sample types. 
Trophectoderm biopsies and culture media samples (20 μl) underwent whole-genome amplification, then libraries were generated and sequenced for an aneuploidy study. For SNP sequencing, triads including trophectoderm DNA, cell-free DNA, and follicular fluid DNA were analyzed. In total, 124 SNPs were included with 90 SNPs distributed among all autosomes and 34 SNPs located on chromosome Y. Finally, 12 whole blastocysts were fixed and individual cells were analyzed by FISH using telomeric/centromeric probes for the affected chromosomes. We found a higher quantity of cell-free DNA in spent culture media co-cultured with embryos versus control media samples (P ≤ 0.001). The presence of cell-free DNA in the spent culture media enabled a chromosomal diagnosis, although results differed from those of trophectoderm biopsy analysis in most cases (67%). Discordant results were mainly attributable to a high percentage of maternal DNA in the spent culture media, with a median percentage of embryonic DNA estimated at 8%. Finally, from the discordant cases, 91.7% of whole blastocysts analyzed by FISH were mosaic and 75% of the analyzed chromosomes were concordant with the trophectoderm DNA diagnosis instead of the cell-free DNA result. This study was limited by the sample size and the number of cells analyzed by FISH. This is the first study to combine chromosomal analysis of cell-free DNA, SNP sequencing to identify maternal contamination, and whole-blastocyst analysis for detecting mosaicism. Our results provide a better understanding of the origin of cell-free DNA in spent culture media, offering an important step toward developing future non-invasive karyotyping that must rely on the specific identification of DNA released from human embryos. This work was funded by Igenomix S.L. There are no competing interests.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

2013-06-25

A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N [San Leandro, CA]; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA]; Young, Jennifer A [Berkeley, CA]; Clague, David S [Livermore, CA]

2011-01-18

A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
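The idea of segments of length n joined through a user-specified hybridizing overlap can be illustrated with a minimal sketch. It is not the patented protocol: it ignores strand orientation, temporal separation of segments, and polymerase chemistry, and the function names `tile_with_overlap` and `join_segments` are hypothetical.

```python
def tile_with_overlap(target: str, n: int, overlap: int) -> list[str]:
    """Split `target` into segments of length n (the last may be shorter);
    consecutive segments share `overlap` bases."""
    if not 0 <= overlap < n:
        raise ValueError("overlap must satisfy 0 <= overlap < n")
    step = n - overlap
    segments = []
    start = 0
    while True:
        segments.append(target[start:start + n])
        if start + n >= len(target):
            break
        start += step
    return segments


def join_segments(segments: list[str], overlap: int) -> str:
    """Rejoin segments in order, checking that each shared overlap matches."""
    joined = segments[0]
    for seg in segments[1:]:
        if overlap and joined[-overlap:] != seg[:overlap]:
            raise ValueError("segments do not share the expected overlap")
        joined += seg[overlap:]
    return joined


if __name__ == "__main__":
    target = "ATGCGTACGTTAGC"  # made-up example sequence
    parts = tile_with_overlap(target, n=6, overlap=2)
    print(parts)
    print(join_segments(parts, overlap=2) == target)
```

Choosing a different `n` or `overlap` for the same target gives an alternative tiling of the same region, which is loosely analogous to the "multiple reading frames" embodiment described above.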
Recombinational Repair of DNA Damage in Escherichia coli and Bacteriophage λ

PubMed Central

Kuzminov, Andrei

1999-01-01

Although homologous recombination and DNA repair phenomena in bacteria were initially extensively studied without regard to any relationship between the two, it is now appreciated that DNA repair and homologous recombination are related through DNA replication. In Escherichia coli, two-strand DNA damage, generated mostly during replication on a template DNA containing one-strand damage, is repaired by recombination with a homologous intact duplex, usually the sister chromosome. The two major types of two-strand DNA lesions are channeled into two distinct pathways of recombinational repair: daughter-strand gaps are closed by the RecF pathway, while disintegrated replication forks are reestablished by the RecBCD pathway. The phage λ recombination system is simpler in that its major reaction is to link two double-stranded DNA ends by using overlapping homologous sequences. The remarkable progress in understanding the mechanisms of recombinational repair in E. coli over the last decade is due to the in vitro characterization of the activities of individual recombination proteins. Putting our knowledge about recombinational repair in the broader context of DNA replication will guide future experimentation. PMID:10585965
Indel detection from DNA and RNA sequencing data with transIndel.

PubMed

Yang, Rendong; Van Etten, Jamie L; Dehm, Scott M

2018-04-19

Insertions and deletions (indels) are a major class of genomic variation associated with human disease. Indels are primarily detected from DNA sequencing (DNA-seq) data but their transcriptional consequences remain unexplored due to challenges in discriminating medium-sized and large indels from splicing events in RNA-seq data. Here, we developed transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. TransIndel exhibits competitive or superior performance over eight state-of-the-art indel detection tools on benchmarks using both synthetic and real DNA-seq data. Additionally, we applied transIndel to DNA-seq and RNA-seq datasets from 333 primary prostate cancer patients from The Cancer Genome Atlas (TCGA) and 59 metastatic prostate cancer patients from AACR-PCF Stand-Up-To-Cancer (SU2C) studies. TransIndel enhanced the taxonomy of DNA- and RNA-level alterations in prostate cancer by identifying recurrent FOXA1 indels as well as exitron splicing in genes implicated in disease progression. Our study demonstrates that transIndel is a robust tool for elucidation of medium- and large-sized indels from DNA-seq and RNA-seq data. Including RNA-seq in indel discovery efforts leads to significant improvements in sensitivity for identification of medium-sized and large indels missed by DNA-seq, and reveals non-canonical RNA-splicing events in genes associated with disease pathology.
Next-generation sequencing: hype and hope for development of personalized radiation therapy?

PubMed

Tinhofer, Ingeborg; Niehr, Franziska; Konschak, Robert; Liebs, Sandra; Munz, Matthias; Stenzinger, Albrecht; Weichert, Wilko; Keilholz, Ulrich; Budach, Volker

2015-08-28

The introduction of next-generation sequencing (NGS) in the field of cancer research has boosted worldwide efforts of genome-wide personalized oncology aiming at identifying predictive biomarkers and novel actionable targets. Despite considerable progress in understanding the molecular biology of distinct cancer entities by the use of this revolutionary technology and despite contemporaneous innovations in drug development, translation of NGS findings into improved concepts for cancer treatment remains a challenge. The aim of this article is to briefly describe the NGS platforms for DNA sequencing and, in more detail, key achievements and unresolved hurdles. A special focus will be placed on potential clinical applications of this innovative technique in the field of radiation oncology.
Childhood-onset HAM/TSP with progressive cognitive impairment.

PubMed

Zorzi, Giovanna; Mancuso, Roberta; Nardocci, Nardo; Farina, Laura; Guerini, Franca Rosa; Ferrante, Pasquale

2010-04-01

HTLV-I associated myelopathy/tropical spastic paraparesis (HAM/TSP) is a chronic myelopathy, usually with adult-onset. Very few cases of childhood-onset have been described, most presenting with progressive paraparesis and sphincteric disturbances as in the adult form. Here we report a young male with childhood-onset of HAM/TSP and progressive cognitive and behavioral disturbances. A serological screening revealed HTLV-I infection, confirmed by Western Immunoblotting analysis. Molecular characterization of amplified HTLV-I proviral DNA has been performed both in the patient and his mother by LTR sequence analysis, and HLA genotype inheritance was evaluated. Our case indicates the possibility that cognitive dysfunctions may be one manifestation of HTLV-I infection in childhood.
Detection of Hepatocyte Clones Containing Integrated Hepatitis B Virus DNA Using Inverse Nested PCR.

PubMed

Tu, Thomas; Jilbert, Allison R

2017-01-01

Chronic hepatitis B virus (HBV) infection is a major cause of liver cirrhosis and hepatocellular carcinoma (HCC), leading to ~600,000 deaths per year worldwide. Many of the steps that occur during progression from the normal liver to cirrhosis and/or HCC are unknown. Integration of HBV DNA into random sites in the host cell genome occurs as a by-product of the HBV replication cycle and forms a unique junction between virus and cellular DNA. Analyses of integrated HBV DNA have revealed that HCCs are clonal and imply that they develop from the transformation of hepatocytes, the only liver cell known to be infected by HBV. Integrated HBV DNA has also been shown, at least in some tumors, to cause insertional mutagenesis in cancer driver genes, which may facilitate the development of HCC. Studies of HBV DNA integration in the histologically normal liver have provided additional insight into HBV-associated liver disease, suggesting that hepatocytes with a survival or growth advantage undergo high levels of clonal expansion even in the absence of oncogenic transformation. Here we describe inverse nested PCR (invPCR), a highly sensitive method that allows detection, sequencing, and enumeration of virus-cell DNA junctions formed by the integration of HBV DNA. The invPCR protocol is composed of two major steps: inversion of the virus-cell DNA junction and single-molecule nested PCR. The invPCR method is highly specific and inexpensive and can be tailored to DNA extracted from large or small amounts of liver. This procedure also allows detection of genome-wide random integration of any known DNA sequence and is therefore a useful technique for molecular biology, virology, and genetic research.
Qualitative and quantitative assessment of Illumina's forensic STR and SNP kits on MiSeq FGx™.

PubMed

Sharma, Vishakha; Chow, Hoi Yan; Siegel, Donald; Wurmbach, Elisa

2017-01-01

Massively parallel sequencing (MPS) is a powerful tool transforming DNA analysis in multiple fields ranging from medicine, to environmental science, to evolutionary biology. In forensic applications, MPS offers the ability to significantly increase the discriminatory power of human identification as well as aid in mixture deconvolution. However, before the benefits of any new technology can be employed, its quality, consistency, sensitivity, and specificity must be rigorously evaluated in order to gain a detailed understanding of the technique including sources of error, error rates, and other restrictions/limitations. This extensive study assessed the performance of Illumina's MiSeq FGx MPS system and ForenSeq™ kit in nine experimental runs including 314 reaction samples. In-depth data analysis evaluated the consequences of different assay conditions on test results. Variables included: sample numbers per run, targets per run, DNA input per sample, and replications. Results are presented as heat maps revealing patterns for each locus. Data analysis focused on read numbers (allele coverage), drop-outs, drop-ins, and sequence analysis. The study revealed that loci with high read numbers performed better and resulted in fewer drop-outs and well-balanced heterozygous alleles. Several loci were prone to drop-outs which led to falsely typed homozygotes and therefore to genotype errors. Sequence analysis of allele drop-in typically revealed a single nucleotide change (deletion, insertion, or substitution). Analyses of sequences, no template controls, and spurious alleles suggest no contamination during library preparation, pooling, and sequencing, but indicate that sequencing or PCR errors may have occurred due to DNA polymerase infidelities.
Finally, we found utilizing Illumina's FGx System at recommended conditions does not guarantee 100% outcomes for all samples tested, including the positive control, and required manual editing due to low read numbers and/or allele drop-in. These findings are important for progressing towards implementation of MPS in forensic DNA testing.
Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

PubMed

Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi

2014-01-01

A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.
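The input to an alignment-free method of this kind is an oligonucleotide composition vector per sequence fragment. The sketch below computes such a vector for tetranucleotides; it does not implement the BLSOM itself, and the function name `kmer_frequencies` and the choice to skip windows containing non-ACGT characters are assumptions made for illustration.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(seq: str, k: int = 4) -> dict[str, float]:
    """Return the relative frequency of every k-mer over A, C, G, T.

    Windows containing any other character (e.g. N) are skipped.
    All 4**k k-mers are present as keys, so vectors from different
    fragments are directly comparable.
    """
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    total = sum(counts.values())
    all_kmers = ("".join(p) for p in product("ACGT", repeat=k))
    return {km: (counts[km] / total if total else 0.0) for km in all_kmers}


if __name__ == "__main__":
    freqs = kmer_frequencies("ACGTACGTTTGACA", k=4)  # made-up fragment
    top = sorted(freqs.items(), key=lambda kv: kv[1], reverse=True)[:3]
    print(len(freqs), top)
```

A clustering step such as a self-organizing map would then take one such 256-dimensional vector (1024-dimensional for pentanucleotides) per fragment as input.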
A polygalacturonase-inhibiting protein with a role in pea defence against the cyst nematode Heterodera goettingiana.

PubMed

Veronico, Pasqua; Melillo, M Teresa; Saponaro, Concetta; Leonetti, Paola; Picardi, Ernesto; Jones, John T

2011-04-01

A cDNA of 312 bp, similar to polygalacturonase-inhibiting proteins (PGIPs), was isolated by cDNA-amplified fragment length polymorphism (cDNA-AFLP) from pea roots infected with the cyst nematode Heterodera goettingiana. The deduced amino acid sequence obtained from the complete Pspgip1 coding sequence was very similar to PGIPs described from several other plant species, and was identical in both MG103738 and Progress 9 genotypes, resistant and susceptible to H. goettingiana, respectively. Reverse transcription-polymerase chain reaction (RT-PCR) expression analysis revealed the differential regulation of the Pspgip1 gene in the two genotypes in response to wounding and nematode challenge. Mechanical wounding induced Pspgip1 expression in MG103738 within 8 h, but this response was delayed in Progress 9. In contrast, the response to nematode infection was more complex. The transcription of Pspgip1 was triggered rapidly in both genotypes, but the expression level returned to levels observed in uninfected plants more quickly in susceptible than in resistant roots. In addition, in situ hybridization showed that Pspgip1 was expressed in the cortical cells damaged as a result of nematode invasion in both genotypes. However, it was specifically localized in the cells bordering the nematode-induced syncytia in resistant roots. This suggests a role for this gene in counteracting nematode establishment inside the root. © 2010 THE AUTHORS. MOLECULAR PLANT PATHOLOGY © 2010 BSPP AND BLACKWELL PUBLISHING LTD.
Progressive engineering of a homing endonuclease genome editing reagent for the murine X-linked immunodeficiency locus

PubMed Central

Wang, Yupeng; Khan, Iram F.; Boissel, Sandrine; Jarjour, Jordan; Pangallo, Joseph; Thyme, Summer; Baker, David; Scharenberg, Andrew M.; Rawlings, David J.

2014-01-01

LAGLIDADG homing endonucleases (LHEs) are compact endonucleases with 20–22 bp recognition sites, and thus are ideal scaffolds for engineering site-specific DNA cleavage enzymes for genome editing applications. Here, we describe a general approach to LHE engineering that combines rational design with directed evolution, using a yeast surface display high-throughput cleavage selection. This approach was employed to alter the binding and cleavage specificity of the I-AniI LHE to recognize a mutation in the mouse Bruton tyrosine kinase (Btk) gene causative for mouse X-linked immunodeficiency (XID), a model of human X-linked agammaglobulinemia (XLA). The required re-targeting of I-AniI involved progressive resculpting of the DNA contact interface to accommodate nine base differences from the native cleavage sequence. The enzyme emerging from the progressive engineering process was specific for the XID mutant allele versus the wild-type (WT) allele, and exhibited activity equivalent to WT I-AniI in vitro and in cellulo reporter assays. Fusion of the enzyme to a site-specific DNA binding domain of transcription activator-like effector (TALE) resulted in a further enhancement of gene editing efficiency. These results illustrate the potential of LHE enzymes as specific and efficient tools for therapeutic genome engineering. PMID:24682825
Effect of different concentration of HPV DNA probe immobilization for cervical cancer detection based IDE biosensor

NASA Astrophysics Data System (ADS)

Roshila, M. L.; Hashim, U.; Azizah, N.; Nadzirah, Sh.; Arshad, M. K. Md; Ruslinda, A. R.; Gopinath, Subash C. B.

2017-03-01

This paper describes the detection process for a Human Papillomavirus (HPV) DNA test. HPV is an extremely common viral infection that affects cells of the human cervix. The HPV types most strongly associated with cervical cancer are 16 and 18, followed by 31 and 45. The HPV DNA probe is immobilized at different concentrations to stabilize the sensitivity. A rapid and sensitive technique for HPV identification is proposed that combines basic DNA extraction with attention to DNA quality. The quality of the extracted DNA affects the efficiency of the detection procedure, which also depends on the sequence of the capture probes and on how they are attached to the support. The fabrication, surface modification, immobilization and hybridization procedures are characterized by current-voltage (I-V) measurements using a KEITHLEY 6487. The procedure is expected to provide good sensitivity and selectivity for HPV detection.
Analysis of JC virus DNA replication using a quantitative and high-throughput assay

PubMed Central

Shin, Jong; Phelan, Paul J.; Chhum, Panharith; Bashkenova, Nazym; Yim, Sung; Parker, Robert; Gagnon, David; Gjoerup, Ole; Archambault, Jacques; Bullock, Peter A.

2015-01-01

Progressive Multifocal Leukoencephalopathy (PML) is caused by lytic replication of JC virus (JCV) in specific cells of the central nervous system. Like other polyomaviruses, JCV encodes a large T-antigen helicase needed for replication of the viral DNA. Here, we report the development of a luciferase-based, quantitative and high-throughput assay of JCV DNA replication in C33A cells, which, unlike the glial cell lines Hs 683 and U87, accumulate high levels of nuclear T-ag needed for robust replication. Using this assay, we investigated the requirement for different domains of T-ag, and for specific sequences within and flanking the viral origin, in JCV DNA replication. Beyond providing validation of the assay, these studies revealed an important stimulatory role of the transcription factor NF1 in JCV DNA replication. Finally, we show that the assay can be used for inhibitor testing, highlighting its value for the identification of antiviral drugs targeting JCV DNA replication. PMID:25155200
The Ties That Bind: Mapping the Dynamic Enhancer-Promoter Interactome

DOE PAGES

Spurrell, Cailyn H.; Dickel, Diane E.; Visel, Axel

2016-11-17

Coupling chromosome conformation capture to molecular enrichment for promoter-containing DNA fragments enables the systematic mapping of interactions between individual distal regulatory sequences and their target genes. Here in this Minireview, we describe recent progress in the application of this technique and related complementary approaches to gain insight into the lineage- and cell-type-specific dynamics of interactions between regulators and gene promoters.
Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

NASA Astrophysics Data System (ADS)

Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

2017-07-01

A DNA sequence can be defined as a succession of letters representing the order of nucleotides within DNA, using the four DNA base codes adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of a sequence is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and have now become advanced, high-throughput sequencing technologies. DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing produces ambiguous results, making it difficult to determine whether a given code is A, T, G, or C. To solve this problem, in this study we introduce another representation of DNA codes, namely the Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probabilities that the bases A, T, G, C appear in Q, and PA + PT + PG + PC = 1. Using Quaternion representations, we are able to construct an improved scoring matrix for global sequence alignment by applying a dot product method. This scoring matrix produces better, higher-quality match and mismatch scores between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave to analyze our target sequence, which contains some ambiguous sequence data. The subject sequences are DNA sequences of the Streptococcus pneumoniae family obtained from GenBank, while the target DNA sequence was received from our collaborator's database. We found that the Quaternion representations improve the quality of the sequence alignment score, and we conclude that the target DNA sequence has maximum similarity with Streptococcus pneumoniae.
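A minimal Python sketch of the idea follows (the authors report using Octave). The abstract does not say how the dot product is turned into a score, so the mapping in `pair_score` (expected match/mismatch score, treating the dot product as the probability that two positions carry the same base) is an assumption, as are the function names, the uniform treatment of "N", and the gap penalty.

```python
# Quaternion order follows the abstract: (P_A, P_T, P_G, P_C), summing to 1.
BASE_QUATERNIONS = {
    "A": (1.0, 0.0, 0.0, 0.0),
    "T": (0.0, 1.0, 0.0, 0.0),
    "G": (0.0, 0.0, 1.0, 0.0),
    "C": (0.0, 0.0, 0.0, 1.0),
    # Assumption: a fully ambiguous call is modelled as uniform.
    "N": (0.25, 0.25, 0.25, 0.25),
}


def encode(seq):
    """Convert a string of base codes into a list of probability 4-tuples."""
    return [BASE_QUATERNIONS[base] for base in seq.upper()]


def pair_score(q1, q2, match=1.0, mismatch=-1.0):
    """Expected match/mismatch score from the dot product of two quaternions."""
    p_same = sum(x * y for x, y in zip(q1, q2))
    return match * p_same + mismatch * (1.0 - p_same)


def needleman_wunsch_score(a, b, gap=-2.0):
    """Global alignment score (Needleman-Wunsch, linear gap penalty)."""
    n, m = len(a), len(b)
    f = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        f[i][0] = i * gap
    for j in range(1, m + 1):
        f[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            f[i][j] = max(
                f[i - 1][j - 1] + pair_score(a[i - 1], b[j - 1]),  # align pair
                f[i - 1][j] + gap,                                 # gap in b
                f[i][j - 1] + gap,                                 # gap in a
            )
    return f[n][m]


if __name__ == "__main__":
    subject = encode("ACGT")
    target = encode("ACNT")  # third base is ambiguous
    print(needleman_wunsch_score(subject, target))
```

A position with partial evidence can be passed directly as a tuple, for example `(0.7, 0.1, 0.1, 0.1)`, instead of going through `encode`.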
HNRNPLL stabilizes mRNAs for DNA replication proteins and promotes cell cycle progression in colorectal cancer cells.

PubMed

Sakuma, Keiichiro; Sasaki, Eiichi; Kimura, Kenya; Komori, Koji; Shimizu, Yasuhiro; Yatabe, Yasushi; Aoki, Masahiro

2018-06-05

HNRNPLL (heterogeneous nuclear ribonucleoprotein L-like), an RNA-binding protein that regulates alternative splicing of pre-mRNAs, has been shown to regulate differentiation of lymphocytes, as well as metastasis of colorectal cancer cells. Here we show that HNRNPLL promotes cell cycle progression and hence proliferation of colorectal cancer cells. Functional annotation analysis of those genes whose expression levels were changed by three-fold or more in RNA sequencing analysis between SW480 cells overexpressing HNRNPLL and those knocked down for HNRNPLL revealed enrichment of DNA replication-related genes by HNRNPLL overexpression. Among 13 genes detected in the DNA replication pathway, PCNA, RFC3, and FEN1 showed reproducible upregulation by HNRNPLL overexpression both at mRNA and protein levels in SW480 and HT29 cells. Importantly, knockdown of any of these genes alone suppressed the proliferation promoting effect induced by HNRNPLL overexpression. RNA-immunoprecipitation assay presented a binding of FLAG-tagged HNRNPLL to mRNA of these genes, and HNRNPLL overexpression significantly suppressed the downregulation of these genes during 12 hours of actinomycin D treatment, suggesting a role of HNRNPLL in mRNA stability. Finally, analysis of a public RNA sequencing dataset of clinical samples suggested a link between overexpression of HNRNPLL and that of PCNA, RFC3, and FEN1. This link was further supported by immunohistochemistry of colorectal cancer clinical samples, whereas expression of CDKN1A, which is known to inhibit the cooperative function of PCNA, RFC3, and FEN1, was negatively associated with HNRNPLL expression. These results indicate that HNRNPLL stabilizes mRNAs encoding regulators of DNA replication and promotes colorectal cancer cell proliferation. This article is protected by copyright. All rights reserved.

BioPartsDB: a synthetic biology workflow web-application for education and research.

PubMed

Stracquadanio, Giovanni; Yang, Kun; Boeke, Jef D; Bader, Joel S

2016-11-15

Synthetic biology has become a widely used technology, and expanding applications in research, education and industry require progress tracking for team-based DNA synthesis projects. Although some vendors are beginning to supply multi-kilobase sequence-verified constructs, synthesis workflows starting with short oligos remain important for cost savings and pedagogical benefit. We developed BioPartsDB as an open source, extendable workflow management system for synthetic biology projects with entry points for oligos and larger DNA constructs and ending with sequence-verified clones. BioPartsDB is released under the MIT license and available for download at https://github.com/baderzone/biopartsdb. Additional documentation and video tutorials are available at https://github.com/baderzone/biopartsdb/wiki. An Amazon Web Services image is available from the AWS Market Place (ami-a01d07c8). Contact: joel.bader@jhu.edu. © The Author 2016. Published by Oxford University Press.
Histones: Controlling Tumor Signaling Circuitry

PubMed Central

Martins, Manoela D.; Castilho, Rogerio M.

2014-01-01

Epigenetic modifications constitute the next frontier in tumor biology research. Post-translation modification of histones dynamically influences gene expression independent of alterations to the DNA sequence. These mechanisms are often mediated by histone linkers or by proteins associated with the recruitment of DNA-binding proteins, HDAC I and II interacting proteins and transcriptional activators, coactivators or corepressors. Early evidence suggested that histones and their modifiers are involved in sophisticated processes that modulate tumor behavior and cellular phenotype. In this review, we discuss how recent discoveries about chromatin modifications, particularly histone acetylation, are shaping our knowledge of cell biology and our understanding of the molecular circuitry governing tumor progression and consider whether recent insights may extend to novel therapeutic approaches. Furthermore, we discuss the latest oncogenomic findings in Head and Neck Squamous Cell Carcinoma (HNSCC) from studies using Next Generation Sequencing (NGS) technology and highlight the impact of mutations identified in histones and their modifiers. PMID:25177526
[The nineteenth century roots of the contemporary biological revolution].

PubMed

Swynghedauw, Bernard

2006-01-01

The recent publication of the human genomic sequence is the most important progress in biology. It originates from four major watersheds between 1860-1865, namely the biological evolution by Darwin in 1858, the Mendel laws of heredity in 1865, the basis of physiology established by Claude Bernard also in 1865, and the discoveries of microbacteria by Louis Pasteur around 1857. Before 1860, biology did not exist as a science. After 1860, Darwin's theory progressively became a law after the discovery of DNA polymorphism and of the mechanisms of genetic mixing. Likewise, Mendel's laws were confirmed in parallel with the development of molecular genetics after the discovery of the DNA structure and the genetic code. The discovery of hormones is one example, among several, of how integrative physiology builds on Claude Bernard's basis. Finally, based on Pasteur's discovery and Pasteur Institutes, microbiology became a tool for molecular biologists.
5-AED Enhances Survival of Irradiated Mice in a G-CSF-Dependent Manner, Stimulates Innate Immune Cell Function, Reduces Radiation-induced DNA Damage and Induces Genes that Modulate Cell Cycle Progression and Apoptosis

DTIC Science & Technology

2012-01-01

modulate cell cycle progression and apoptosis. INTRODUCTION Because of the increasing threat posed by nuclear weapons [1], there is a pressing need for both...were performed using the iCycler iQ Sequence Detection System (Bio-Rad Laboratories, Hercules CA) on 96-well microtiter plates with optical caps...Thoss K, Petrow PK et al. Amelioration of murine antigen-induced arthritis by dehydroepiandrosterone (DHEA). Inflamm Res 2004;53:189–98. 56. Auci D
Inference from Samples of DNA Sequences Using a Two-Locus Model

PubMed Central

Griffiths, Robert C.

2011-01-01

Abstract Performing inference on contemporary samples of DNA sequence data is an important and challenging task. Computationally intensive methods such as importance sampling (IS) are attractive because they make full use of the available data, but in the presence of recombination the large state space of genealogies can be prohibitive. In this article, we make progress by developing an efficient IS proposal distribution for a two-locus model of sequence data. We show that the proposal developed here leads to much greater efficiency, outperforming existing IS methods that could be adapted to this model. Among several possible applications, the algorithm can be used to find maximum likelihood estimates for mutation and crossover rates, and to perform ancestral inference. We illustrate the method on previously reported sequence data covering two loci either side of the well-studied TAP2 recombination hotspot. The two loci are themselves largely non-recombining, so we obtain a gene tree at each locus and are able to infer in detail the effect of the hotspot on their joint ancestry. We summarize this joint ancestry by introducing the gene graph, a summary of the well-known ancestral recombination graph. PMID:21210733
KRAS mutation detection in colorectal cancer by a commercially available gene chip array compares well with Sanger sequencing.

PubMed

French, Deborah; Smith, Andrew; Powers, Martin P; Wu, Alan H B

2011-08-17

Binding of a ligand to the epidermal growth factor receptor (EGFR) stimulates various intracellular signaling pathways resulting in cell cycle progression, proliferation, angiogenesis and apoptosis inhibition. KRAS is involved in signaling pathways including RAF/MAPK and PI3K and mutations in this gene result in constitutive activation of these pathways, independent of EGFR activation. Seven mutations in codons 12 and 13 of KRAS comprise around 95% of the observed human mutations, rendering monoclonal antibodies against EGFR (e.g. cetuximab and panitumumab) useless in treatment of colorectal cancer. KRAS mutation testing by two different methodologies was compared: Sanger sequencing and the AutoGenomics INFINITI® assay, on DNA extracted from colorectal cancers. Out of 29 colorectal tumor samples tested, 28 were concordant between the two methodologies for the KRAS mutations that were detected in both assays, with the INFINITI® assay detecting a mutation in one sample that was indeterminate by Sanger sequencing and a third methodology, single nucleotide primer extension. This study indicates the utility of the AutoGenomics INFINITI® methodology in a clinical laboratory setting where technical expertise or access to equipment for DNA sequencing does not exist. Copyright © 2011 Elsevier B.V. All rights reserved.
Large-Scale Concatenation cDNA Sequencing

PubMed Central

Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

1997-01-01

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
5′CAG and 5′CTG Repeats Create Differential Impediment to the Progression of a Minimal Reconstituted T4 Replisome Depending on the Concentration of dNTPs

PubMed Central

Delagoutte, Emmanuelle; Baldacci, Giuseppe

2011-01-01

Instability of repetitive sequences originates from strand misalignment during repair or replicative DNA synthesis. To investigate the activity of reconstituted T4 replisomes across trinucleotide repeats (TNRs) during leading strand DNA synthesis, we developed a method to build replication miniforks containing a TNR unit of defined sequence and length. Each minifork consists of three strands, primer, leading strand template, and lagging strand template with a 5′ single-stranded (ss) tail. Each strand is prepared independently, and the minifork is assembled by hybridization of the three strands. Using these miniforks and a minimal reconstituted T4 replisome, we show that during leading strand DNA synthesis, the dNTP concentration dictates which strand of the structure-forming 5′CAG/5′CTG repeat creates the strongest impediment to the minimal replication complex. We discuss this result in the light of the known fluctuation of dNTP concentration during the cell cycle and cell growth and the known concentration balance among individual dNTPs. PMID:22096622
Non-coding stem-bulge RNAs are required for cell proliferation and embryonic development in C. elegans

PubMed Central

Kowalski, Madzia P.; Baylis, Howard A.; Krude, Torsten

2015-01-01

ABSTRACT Stem bulge RNAs (sbRNAs) are a family of small non-coding stem-loop RNAs present in Caenorhabditis elegans and other nematodes, the function of which is unknown. Here, we report the first functional characterisation of nematode sbRNAs. We demonstrate that sbRNAs from a range of nematode species are able to reconstitute the initiation of chromosomal DNA replication in the presence of replication proteins in vitro, and that conserved nucleotide sequence motifs are essential for this function. By functionally inactivating sbRNAs with antisense morpholino oligonucleotides, we show that sbRNAs are required for S phase progression, early embryonic development and the viability of C. elegans in vivo. Thus, we demonstrate a new and essential role for sbRNAs during the early development of C. elegans. sbRNAs show limited nucleotide sequence similarity to vertebrate Y RNAs, which are also essential for the initiation of DNA replication. Our results therefore establish that the essential function of small non-coding stem-loop RNAs during DNA replication extends beyond vertebrates. PMID:25908866
Cell-Free Expression and In Situ Immobilization of Parasite Proteins from Clonorchis sinensis for Rapid Identification of Antigenic Candidates

PubMed Central

Ju, Jung Won; Kim, Ho-Cheol; Shin, Hyun-Il; Kim, Yu Jung; Kim, Dong-Myung

2015-01-01

Progress towards genetic sequencing of human parasites has provided the groundwork for a post-genomic approach to develop novel antigens for the diagnosis and treatment of parasite infections. To fully utilize the genomic data, however, high-throughput methodologies are required for functional analysis of the proteins encoded in the genomic sequences. In this study, we investigated cell-free expression and in situ immobilization of parasite proteins as a novel platform for the discovery of antigenic proteins. PCR-amplified parasite DNA was immobilized on microbeads that were also functionalized to capture synthesized proteins. When the microbeads were incubated in a reaction mixture for cell-free synthesis, proteins expressed from the microbead-immobilized DNA were instantly immobilized on the same microbeads, providing a physical linkage between the genetic information and encoded proteins. This approach of in situ expression and isolation enables streamlined recovery and analysis of cell-free synthesized proteins and also allows facile identification of the genes coding antigenic proteins through direct PCR of the microbead-bound DNA. PMID:26599101
The Release 6 reference sequence of the Drosophila melanogaster genome

DOE PAGES

Hoskins, Roger A.; Carlson, Joseph W.; Wan, Kenneth H.; ...

2015-01-14

Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. In conclusion, further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.
Synthesis of DNA

DOEpatents

Mariella, Jr., Raymond P.

2008-11-18

A method of synthesizing a desired double-stranded DNA of a predetermined length and of a predetermined sequence. Preselected sequence segments that will complete the desired double-stranded DNA are determined. Preselected segment sequences of DNA that will be used to complete the desired double-stranded DNA are provided. The preselected segment sequences of DNA are assembled to produce the desired double-stranded DNA.
Nanopore Technology: A Simple, Inexpensive, Futuristic Technology for DNA Sequencing.

PubMed

Gupta, P D

2016-10-01

In health care, the importance of DNA sequencing is well established. Sanger capillary electrophoresis DNA sequencing is time-consuming and cumbersome, and therefore expensive. Because of its versatility, DNA sequencing has recently become a household name, so there is an urgent need for a simple, fast, inexpensive DNA sequencing technology. Efforts at the beginning of this century led to the development of nanopore DNA sequencing technology; it is still in its infancy, but it is nevertheless the technology of the future.
Circulating tumor DNA evaluated by Next-Generation Sequencing is predictive of tumor response and prolonged clinical benefit with nivolumab in advanced non-small cell lung cancer.

PubMed

Giroux Leprieur, Etienne; Herbretau, Guillaume; Dumenil, Coraline; Julie, Catherine; Giraud, Violaine; Labrune, Sylvie; Dumoulin, Jennifer; Tisserand, Julie; Emile, Jean-François; Blons, Hélène; Chinet, Thierry

2018-01-01

Nivolumab is an anti-PD1 antibody, given in second-line or later treatment in advanced non-small cell lung cancer (NSCLC). The objective of this study was to describe the predictive value of circulating tumor DNA (ctDNA) on the efficacy of nivolumab in advanced NSCLC. We prospectively included all consecutive patients with advanced NSCLC treated with nivolumab in our Department between June 2015 and October 2016. Plasma samples were obtained before the first injection of nivolumab and at the first tumor evaluation with nivolumab. ctDNA was analyzed by Next-Generation Sequencing (NGS), and the predominant somatic mutation was followed for each patient and correlated with tumor response, clinical benefit (administration of nivolumab for more than 6 months), and progression-free survival (PFS). Of 23 patients, 15 had evaluable NGS results at both times of analysis. ctDNA concentration at the first tumor evaluation and ctDNA change correlated with tumor response, clinical benefit and PFS. ROC curve analyses showed good diagnostic performances for tumor response and clinical benefit, both for ctDNA concentration at the first tumor evaluation (tumor response: positive predictive value (PPV) at 100.0% and negative predictive value (NPV) at 71.0%; clinical benefit: PPV at 83.3% and NPV 77.8%) and the ctDNA change (tumor response: PPV 100.0% and NPV 62.5%; clinical benefit: PPV 100.0% and NPV 80.0%). Patients without ctDNA concentration increase >9% at 2 months had a long-term benefit of nivolumab. In conclusion, NGS analysis of ctDNA allows the early detection of tumor response and long-term clinical benefit with nivolumab in NSCLC.
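The positive and negative predictive values quoted in the abstract above follow from a standard 2x2 classification table. The following is a minimal sketch of that arithmetic; the function name and the patient counts are hypothetical and are not taken from the study.

```python
from typing import Tuple


def predictive_values(tp: int, fp: int, tn: int, fn: int) -> Tuple[float, float]:
    """Return (PPV, NPV) from the four cells of a 2x2 classification table.

    PPV = TP / (TP + FP): fraction of test-positive patients who truly respond.
    NPV = TN / (TN + FN): fraction of test-negative patients who truly do not.
    A value is NaN when its denominator is zero.
    """
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    npv = tn / (tn + fn) if (tn + fn) else float("nan")
    return ppv, npv


if __name__ == "__main__":
    # Hypothetical counts chosen only to illustrate the calculation.
    ppv, npv = predictive_values(tp=8, fp=2, tn=6, fn=4)
    print(f"PPV = {ppv:.1%}, NPV = {npv:.1%}")
```

With a cohort as small as the one described (15 evaluable patients), a single reclassified patient shifts these percentages substantially, which is worth keeping in mind when reading the reported values.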
The genome-wide DNA sequence specificity of the anti-tumour drug bleomycin in human cells.

PubMed

Murray, Vincent; Chen, Jon K; Tanaka, Mark M

2016-07-01

The cancer chemotherapeutic agent, bleomycin, cleaves DNA at specific sites. For the first time, the genome-wide DNA sequence specificity of bleomycin breakage was determined in human cells. Utilising Illumina next-generation DNA sequencing techniques, over 200 million bleomycin cleavage sites were examined to elucidate the bleomycin genome-wide DNA selectivity. The genome-wide bleomycin cleavage data were analysed by four different methods to determine the cellular DNA sequence specificity of bleomycin strand breakage. For the most highly cleaved DNA sequences, the preferred site of bleomycin breakage was at 5'-GT* dinucleotide sequences (where the asterisk indicates the bleomycin cleavage site), with lesser cleavage at 5'-GC* dinucleotides. This investigation also determined longer bleomycin cleavage sequences, with preferred cleavage at 5'-GT*A and 5'- TGT* trinucleotide sequences, and 5'-TGT*A tetranucleotides. For cellular DNA, the hexanucleotide DNA sequence 5'-RTGT*AY (where R is a purine and Y is a pyrimidine) was the most highly cleaved DNA sequence. It was striking that alternating purine-pyrimidine sequences were highly cleaved by bleomycin. The highest intensity cleavage sites in cellular and purified DNA were very similar although there were some minor differences. Statistical nucleotide frequency analysis indicated a G nucleotide was present at the -3 position (relative to the cleavage site) in cellular DNA but was absent in purified DNA.
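The hexanucleotide motif 5'-RTGT*AY reported in the abstract above can be located in a sequence with a simple pattern search. The sketch below is illustrative only and is not the analysis pipeline used in the study; the function names are hypothetical, and it assumes that the asterisk marks the base immediately preceding it (the T at motif index 3).

```python
import re
from typing import List

# IUPAC codes used in the abstract: R = purine (A or G), Y = pyrimidine (C or T).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "[AG]", "Y": "[CT]"}


def motif_to_regex(motif: str) -> str:
    """Translate an IUPAC motif such as 'RTGTAY' into a regular expression."""
    return "".join(IUPAC[base] for base in motif.upper())


def find_cleavage_sites(seq: str, motif: str = "RTGTAY", cut_index: int = 3) -> List[int]:
    """Return 0-based positions of the assumed cleaved base for every motif match.

    cut_index is the position within the motif of the cleaved base (assumption:
    the T just before the asterisk in 5'-RTGT*AY). A lookahead is used so that
    overlapping motif occurrences are all reported.
    """
    pattern = re.compile("(?=(" + motif_to_regex(motif) + "))")
    return [m.start() + cut_index for m in pattern.finditer(seq.upper())]


if __name__ == "__main__":
    # Made-up sequence for illustration.
    print(find_cleavage_sites("CCATGTACGGGTGTATTT"))
```

Only one strand is scanned here; a genome-wide analysis would also need to search the reverse complement.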
Naturally occurring deletions/insertions in HBV core promoter tend to decrease in hepatitis B e antigen-positive chronic hepatitis B patients during antiviral therapy.

PubMed

Peng, Yaqin; Liu, Baoming; Hou, Jinlin; Sun, Jian; Hao, Ran; Xiang, Kuanhui; Yan, Ling; Zhang, Jiangbo; Zhuang, Hui; Li, Tong

2015-01-01

Mutations in the HBV core promoter (CP) are suggested to affect viral replication and disease progression. We investigated CP deletion/insertion mutations (Del/Ins) in hepatitis B e antigen (HBeAg)-positive chronic hepatitis B (CHB) patients before and during antiviral treatment. Direct and clone sequencing were used for detection of CP Del/Ins in 12 patients. The dynamic changes of CP Del/Ins were tracked in these cases until week 48 of treatment. The effects of Del/Ins on CP activities and hepatitis B X protein (HBx) were analysed using luciferase assay and sequence comparison, respectively. Furthermore, 292 untreated HBeAg-positive CHB cases were also analysed. Twelve cases with multi-peak PCR direct sequencing electropherograms at baseline were confirmed to have CP Del/Ins by clone sequencing, with detection rates varying from 14.8% to 93.3% of clones analysed. Follow-up studies showed the detection rates of CP Del/Ins in patients decreased from 100% (12/12) at baseline to 16.7% (2/12) at week 48 of treatment (P<0.001), in parallel with a decline in HBV DNA, hepatitis B surface antigen (HBsAg), alanine aminotransferase (ALT) and aspartate transaminase (AST) levels along with an increase in HBeAg loss. Luciferase assay results showed distinct promoter activities among Del/Ins-harbouring CP sequences. Importantly, 71.8% (148/206) of Del/Ins sequences potentially resulted in HBx carboxy-terminal truncations. CP Del/Ins mutations were also found in 27.4% (80/292) of untreated cases. A naturally occurring complex of CP Del/Ins mutants exists in untreated HBeAg-positive CHB patients. These mutations would affect HBV transcription activities and the integrity of HBx, which might correlate with disease progression. Their prevalence decreases on antiviral therapy in parallel with the decline in HBV DNA, HBsAg, ALT and AST levels.
Common fragile sites (CFS) and extremely large CFS genes are targets for human papillomavirus integrations and chromosome rearrangements in oropharyngeal squamous cell carcinoma.

PubMed

Gao, Ge; Johnson, Sarah H; Vasmatzis, George; Pauley, Christina E; Tombers, Nicole M; Kasperbauer, Jan L; Smith, David I

2017-01-01

Common fragile sites (CFS) are chromosome regions that are prone to form gaps or breaks in response to DNA replication stress. They are often found as hotspots for sister chromatid exchanges, deletions, and amplifications in different cancers. Many of the CFS regions are found to span genes whose genomic sequence is greater than 1 Mb, some of which have been demonstrated to function as important tumor suppressors. CFS regions are also hotspots for human papillomavirus (HPV) integrations in cervical cancer. We used mate-pair sequencing to examine HPV integration events and chromosomal structural variations in 34 oropharyngeal squamous cell carcinomas (OPSCC). We used endpoint PCR and Sanger sequencing to validate each HPV integration event and found that HPV integrations preferentially occurred within CFS regions, similar to what is observed in cervical cancer. We also found that many of the chromosomal alterations detected occurred at or near the cytogenetic location of CFSs. Several large genes were also found to be recurrent targets of rearrangements, independent of HPV integrations, including CSMD1 (2.1 Mb), LRP1B (1.9 Mb), and LARGE1 (0.7 Mb). Sanger sequencing revealed that the nucleotide sequences near the identified junction sites contained repetitive and AT-rich sequences that were shown to have the potential to form stem-loop DNA secondary structures that might stall DNA replication fork progression during replication stress. This could then cause increased instability in these regions, which could lead to cancer development in human cells. Our findings suggest that CFSs and some specific large genes appear to play important roles in OPSCC. © 2016 Wiley Periodicals, Inc.
Radiation induced pulmonary fibrosis as a model of progressive fibrosis: Contributions of DNA damage, inflammatory response and cellular senescence genes.

PubMed

Beach, Tyler A; Johnston, Carl J; Groves, Angela M; Williams, Jacqueline P; Finkelstein, Jacob N

2017-04-01

Purpose/Aim of Study: Studies of pulmonary fibrosis (PF) have resulted in DNA damage, inflammatory response, and cellular senescence being widely hypothesized to play a role in the progression of the disease. Utilizing these aforementioned terms, genomics databases were interrogated along with the term, "pulmonary fibrosis," to identify genes common among all 4 search terms. Findings were compared to data derived from a model of radiation-induced progressive pulmonary fibrosis (RIPF) to verify that these genes are similarly expressed, supporting the use of radiation as a model for diseases involving PF, such as human idiopathic pulmonary fibrosis (IPF). In an established model of RIPF, C57BL/6J mice were exposed to 12.5 Gy thorax irradiation and sacrificed at 24 hours, 1, 4, 12, and 32 weeks following exposure, and lung tissue was compared to age-matched controls by RNA sequencing. Of 176 PF associated gene transcripts identified by database interrogation, 146 (>82%) were present in our experimental model, throughout the progression of RIPF. Analysis revealed that nearly 85% of PF gene transcripts were associated with at least 1 other search term. Furthermore, of 22 genes common to all four terms, 16 were present experimentally in RIPF. This illustrates the validity of RIPF as a model of progressive PF/IPF based on the numbers of transcripts reported in both literature and observed experimentally. Well characterized genes and proteins are implicated in this model, supporting the hypotheses that DNA damage, inflammatory response and cellular senescence are associated with the pathogenesis of PF.
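The database interrogation described in the abstract above amounts to intersecting the gene lists returned for each search term and then checking how many of those genes appear in the experimental data. A minimal sketch of that set logic follows; the function names and gene identifiers are hypothetical placeholders, and the study's actual gene lists are not reproduced here.

```python
from typing import Dict, Iterable, Set


def genes_common_to_all(term_to_genes: Dict[str, Iterable[str]]) -> Set[str]:
    """Return the genes that appear in the hit list of every search term."""
    gene_sets = [set(genes) for genes in term_to_genes.values()]
    return set.intersection(*gene_sets) if gene_sets else set()


def fraction_present(candidates: Iterable[str], observed: Iterable[str]) -> float:
    """Fraction of candidate genes that are also found in the observed set."""
    candidate_set = set(candidates)
    if not candidate_set:
        return float("nan")
    return len(candidate_set & set(observed)) / len(candidate_set)


if __name__ == "__main__":
    # Placeholder gene identifiers, for illustration only.
    hits = {
        "pulmonary fibrosis": {"G1", "G2", "G3", "G4"},
        "DNA damage": {"G1", "G2", "G5"},
        "inflammatory response": {"G1", "G2", "G3"},
        "cellular senescence": {"G1", "G2", "G6"},
    }
    common = genes_common_to_all(hits)
    print(sorted(common), fraction_present(common, {"G1", "G9"}))
    # The abstract's own figures: 146 of 176 transcripts is just under 83%.
    print(f"{146 / 176:.1%}")
```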
Diversity arrays technology: a generic genome profiling technology on open platforms.

PubMed

Kilian, Andrzej; Wenzl, Peter; Huttner, Eric; Carling, Jason; Xia, Ling; Blois, Hélène; Caig, Vanessa; Heller-Uszynska, Katarzyna; Jaccoud, Damian; Hopper, Colleen; Aschenbrenner-Kilian, Malgorzata; Evers, Margaret; Peng, Kaiman; Cayla, Cyril; Hok, Puthick; Uszynski, Grzegorz

2012-01-01

In the last 20 years, we have observed an exponential growth of DNA sequence data and a similar increase in the volume of DNA polymorphism data generated by numerous molecular marker technologies. Most of the investment, and therefore progress, concentrated on the human genome and the genomes of selected model species. Diversity Arrays Technology (DArT), developed over a decade ago, was among the first "democratizing" genotyping technologies, as its performance was primarily driven by the level of DNA sequence variation in the species rather than by the level of financial investment. DArT also proved more robust to genome size and ploidy-level differences among the approximately 60 organisms for which DArT has been developed to date, compared to other high-throughput genotyping technologies. The success of DArT in a number of organisms, including a wide range of "orphan crops," can be attributed to the simplicity of the underlying concepts: DArT combines genome complexity reduction methods enriching for genic regions with a highly parallel assay readout on a number of "open-access" microarray platforms. The quantitative nature of the assay enabled a number of applications in which allelic frequencies can be estimated from DArT arrays. A typical DArT assay tests tens of thousands of genomic loci for polymorphism, with the final number of markers reported (hundreds to thousands) reflecting the level of DNA sequence variation in the tested loci. Detailed DArT methods, protocols, and a range of their application examples as well as DArT's evolution path are presented.

Identification and characterization of INMAP, a novel interphase nucleus and mitotic apparatus protein that is involved in spindle formation and cell cycle progression

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shen, Enzhi; Lei, Yan; Liu, Qian

2009-04-15

A novel protein that associates with interphase nucleus and mitotic apparatus (INMAP) was identified by screening a HeLa cDNA expression library with an autoimmune serum followed by tandem mass spectrometry. Its complete cDNA sequence of 1.818 kb encodes 343 amino acids with a predicted molecular mass of 38.2 kDa and numerous phosphorylation sites. The sequence is identical with nucleotides 1-1800 bp of an unnamed gene (GenBank accession no. 7022388) and highly homologous with the 3'-terminal sequence of POLR3B. A monoclonal antibody against INMAP reacted with similar proteins in S. cerevisiae, Mel and HeLa cells, suggesting that it is a conserved protein. Confocal microscopy using either GFP-INMAP fusion protein or labeling with the monoclonal antibody revealed that the protein localizes as distinct dots in the interphase nucleus, but during mitosis associates closely with the spindle. Double immunolabeling using specific antibodies showed that INMAP co-localizes with α-tubulin, γ-tubulin, and NuMA. INMAP also co-immunoprecipitated with these proteins in their native state. Stable overexpression of INMAP in HeLa cell lines leads to defects in the spindle, mitotic arrest, formation of polycentrosomal and multinuclear cells, inhibition of growth, and apoptosis. We propose that INMAP is a novel protein that plays an essential role in spindle formation and cell-cycle progression.
Sequence and Structure Dependent DNA-DNA Interactions

NASA Astrophysics Data System (ADS)

Kopchick, Benjamin; Qiu, Xiangyun

Molecular forces between dsDNA strands are largely dominated by electrostatics and have been extensively studied. Quantitative knowledge has been accumulated on how DNA-DNA interactions are modulated by varied biological constituents such as ions, cationic ligands, and proteins. Despite its central role in biology, the sequence of DNA has not received substantial attention, and "random" DNA sequences are typically used in biophysical studies. However, ~50% of the human genome is composed of non-random-sequence DNAs, particularly repetitive sequences. Furthermore, covalent modifications of DNA such as methylation play key roles in gene functions. Such DNAs with specific sequences or modifications often take on structures other than the canonical B-form. Here we present a series of quantitative measurements of DNA-DNA forces, made with the osmotic stress method, on different DNA sequences, from short repeats to the most frequent sequences in the genome, and on modifications such as bromination and methylation. We observe peculiar behaviors that appear to be strongly correlated with the incurred structural changes. We speculate on the causes in terms of differences in hydration shell and DNA surface structures.
Methylation of insulin DNA in response to proinflammatory cytokines during the progression of autoimmune diabetes in NOD mice.

PubMed

Rui, Jinxiu; Deng, Songyan; Lebastchi, Jasmin; Clark, Pamela L; Usmani-Brown, Sahar; Herold, Kevan C

2016-05-01

Type 1 diabetes is caused by the immunological destruction of pancreatic beta cells. Preclinical and clinical data indicate that there are changes in beta cell function at different stages of the disease, but the fate of beta cells has not been closely studied. We studied how immune factors affect the function and epigenetics of beta cells during disease progression and identified possible triggers of these changes. We studied FACS sorted beta cells and infiltrating lymphocytes from NOD mouse and human islets. Gene expression was measured by quantitative real-time RT-PCR (qRT-PCR) and methylation of the insulin genes was investigated by high-throughput and Sanger sequencing. To understand the role of DNA methyltransferases, Dnmt3a was knocked down with small interfering RNA (siRNA). The effects of cytokines on methylation and expression of the insulin gene were studied in humans and mice. During disease progression in NOD mice, there was an inverse relationship between the proportion of infiltrating lymphocytes and the beta cell mass. In beta cells, methylation marks in the Ins1 and Ins2 genes changed over time. Insulin gene expression appears to be most closely regulated by the methylation of Ins1 exon 2 and Ins2 exon 1. Cytokine transcription increased with age in NOD mice, and these cytokines could induce methylation marks in the insulin DNA by inducing methyltransferases. Similar changes were induced by cytokines in human beta cells in vitro. Epigenetic modification of DNA by methylation in response to immunological stressors may be a mechanism that affects insulin gene expression during the progression of type 1 diabetes.
DNA sequence-level analyses reveal potential phenotypic modifiers in a large family with psychiatric disorders.

PubMed

Ryan, Niamh M; Lihm, Jayon; Kramer, Melissa; McCarthy, Shane; Morris, Stewart W; Arnau-Soler, Aleix; Davies, Gail; Duff, Barbara; Ghiban, Elena; Hayward, Caroline; Deary, Ian J; Blackwood, Douglas H R; Lawrie, Stephen M; McIntosh, Andrew M; Evans, Kathryn L; Porteous, David J; McCombie, W Richard; Thomson, Pippa A

2018-06-07

Psychiatric disorders are a group of genetically related diseases with highly polygenic architectures. Genome-wide association analyses have made substantial progress towards understanding the genetic architecture of these disorders. More recently, exome- and whole-genome sequencing of cases and families have identified rare, high penetrant variants that provide direct functional insight. There remains, however, a gap in the heritability explained by these complementary approaches. To understand how multiple genetic variants combine to modify both severity and penetrance of a highly penetrant variant, we sequenced 48 whole genomes from a family with a high loading of psychiatric disorder linked to a balanced chromosomal translocation. The (1;11)(q42;q14.3) translocation directly disrupts three genes: DISC1, DISC2, DISC1FP and has been linked to multiple brain imaging and neurocognitive outcomes in the family. Using DNA sequence-level linkage analysis, functional annotation and population-based association, we identified common and rare variants in GRM5 (minor allele frequency (MAF) > 0.05), PDE4D (MAF > 0.2) and CNTN5 (MAF < 0.01) that may help explain the individual differences in phenotypic expression in the family. We suggest that whole-genome sequencing in large families will improve the understanding of the combined effects of the rare and common sequence variation underlying psychiatric phenotypes.
Exome-wide Sequencing Shows Low Mutation Rates and Identifies Novel Mutated Genes in Seminomas.

PubMed

Cutcutache, Ioana; Suzuki, Yuka; Tan, Iain Beehuat; Ramgopal, Subhashini; Zhang, Shenli; Ramnarayanan, Kalpana; Gan, Anna; Lee, Heng Hong; Tay, Su Ting; Ooi, Aikseng; Ong, Choon Kiat; Bolthouse, Jonathan T; Lane, Brian R; Anema, John G; Kahnoski, Richard J; Tan, Patrick; Teh, Bin Tean; Rozen, Steven G

2015-07-01

Testicular germ cell tumors are the most common cancer diagnosed in young men, and seminomas are the most common type of these cancers. There have been no exome-wide examinations of genes mutated in seminomas or of overall rates of nonsilent somatic mutations in these tumors. The objective was to analyze somatic mutations in seminomas to determine which genes are affected and to determine rates of nonsilent mutations. Eight seminomas and matched normal samples were surgically obtained from eight patients. DNA was extracted from tissue samples and exome sequenced on massively parallel Illumina DNA sequencers. Single-nucleotide polymorphism chip-based copy number analysis was also performed to assess copy number alterations. The DNA sequencing read data were analyzed to detect somatic mutations including single-nucleotide substitutions and short insertions and deletions. The detected mutations were validated by independent sequencing and further checked for subclonality. The rate of nonsynonymous somatic mutations averaged 0.31 mutations/Mb. We detected nonsilent somatic mutations in 96 genes that were not previously known to be mutated in seminomas, of which some may be driver mutations. Many of the mutations appear to have been present in subclonal populations. In addition, two genes, KIT and KRAS, were affected in two tumors each with mutations that were previously observed in other cancers and are presumably oncogenic. Our study, the first report on exome sequencing of seminomas, detected somatic mutations in 96 new genes, several of which may be targetable drivers. Furthermore, our results show that seminoma mutation rates are five times higher than previously thought, but are nevertheless low compared to other common cancers. Similar low rates are seen in other cancers that also have excellent rates of remission achieved with chemotherapy. We examined the DNA sequences of seminomas, the most common type of testicular germ cell cancer. 
Our study identified 96 new genes in which mutations occurred during seminoma development, some of which might contribute to cancer development or progression. The study also showed that the rates of DNA mutations during seminoma development are higher than previously thought, but still lower than for other common solid-organ cancers. Such low rates are also observed among other cancers that, like seminomas, show excellent rates of disease remission after chemotherapy. Copyright © 2015 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Mi-2/NuRD complex function is required for normal S phase progression and assembly of pericentric heterochromatin.

PubMed

Sims, Jennifer K; Wade, Paul A

2011-09-01

During chromosome duplication, it is essential to replicate not only the DNA sequence, but also the complex nucleoprotein structures of chromatin. Pericentric heterochromatin is critical for silencing repetitive elements and plays an essential structural role during mitosis. However, relatively little is understood about its assembly and maintenance during replication. The Mi2/NuRD chromatin remodeling complex tightly associates with actively replicating pericentric heterochromatin, suggesting a role in its assembly. Here we demonstrate that depletion of the catalytic ATPase subunit CHD4/Mi-2β in cells with a dampened DNA damage response results in a slow-growth phenotype characterized by delayed progression through S phase. Furthermore, we observe defects in pericentric heterochromatin maintenance and assembly. Our data suggest that chromatin assembly defects are sensed by an ATM-dependent intra-S phase chromatin quality checkpoint, resulting in a temporal block to the transition from early to late S phase. These findings implicate Mi-2β in the maintenance of chromatin structure and proper cell cycle progression.
Understanding genetics in neuroimaging.

PubMed

Vasquez, Marina Lipkin; Renault, Ilana Zalcberg

2015-02-01

Gene expression is a process of DNA sequence reading into protein synthesis. In cases of problems in DNA repair/apoptosis mechanisms, cells accumulate genomic abnormalities and pass them through generations of cells. The accumulation of mutations causes diseases and even tumors. In addition to cancer, many other neurologic conditions have been associated with genetic mutations. Some trials are testing patients with epigenetic treatments. Epigenetic therapy must be used with caution because epigenetic processes and changes happen constantly in normal cells, giving rise to drug off-target effects. Scientists are making progress in specifically targeting abnormal cells with minimal damage to normal ones. Copyright © 2015. Published by Elsevier Inc.
DNA as a powerful tool for morphology control, spatial positioning, and dynamic assembly of nanoparticles.

PubMed

Tan, Li Huey; Xing, Hang; Lu, Yi

2014-06-17

CONSPECTUS: Several properties of nanomaterials, such as morphologies (e.g., shapes and surface structures) and distance dependent properties (e.g., plasmonic and quantum confinement effects), make nanomaterials uniquely qualified as potential choices for future applications from catalysis to biomedicine. To realize the full potential of these nanomaterials, it is important to demonstrate fine control of the morphology of individual nanoparticles, as well as precise spatial control of the position, orientation, and distances between multiple nanoparticles. In addition, dynamic control of nanomaterial assembly in response to multiple stimuli, with minimal or no error, and the reversibility of the assemblies are also required. In this Account, we summarize recent progress of using DNA as a powerful programmable tool to realize the above goals. First, inspired by the discovery of genetic codes in biology, we have discovered DNA sequence combinations to control different morphologies of nanoparticles during their growth process and have shown that these effects are synergistic or competitive, depending on the sequence combination. The DNA, which guides the growth of the nanomaterial, is stable and retains its biorecognition ability. Second, by taking advantage of different reactivities of phosphorothioate and phosphodiester backbone, we have placed phosphorothioate at selective positions on different DNA nanostructures including DNA tetrahedrons. Bifunctional linkers have been used to conjugate phosphorothioate on one end and bind nanoparticles or proteins on the other end. In doing so, precise control of distances between two or more nanoparticles or proteins with nanometer resolution can be achieved. 
Furthermore, by developing facile methods to functionalize two hemispheres of Janus nanoparticles with two different DNA sequences regioselectively, we have demonstrated directional control of nanomaterial assembly, where DNA strands with specific hybridization serve as orthogonal linkers. Third, by using functional DNA that includes DNAzyme, aptamer, and aptazyme, dynamic control of assemblies of gold nanoparticles, quantum dots, carbon nanotubes, and iron oxide nanoparticles in response to one or more stimuli cooperatively have been achieved, resulting in colorimetric, fluorescent, electrochemical, and magnetic resonance signals for a wide range of targets, such as metal ions, small molecules, proteins, and intact cells. Fourth, by mimicking biology, we have employed DNAzymes as proofreading units to remove errors in nanoparticle assembly and further used DNAzyme cascade reactions to modify or repair DNA sequences involved in the assembly. Finally, by taking advantage of different affinities of biotin and desthiobiotin toward streptavidin, we have demonstrated reversible assembly of proteins on DNA origami.
Genomic clones for human cholinesterase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kott, M.; Venta, P.J.; Larsen, J.

1987-05-01

A human genomic library was prepared from peripheral white blood cells from a single donor by inserting an MboI partial digest into BamHI poly-linker sites of EMBL3. This library was screened using an oligolabeled human cholinesterase cDNA probe over 700 bp long. The latter probe was obtained from a human basal ganglia cDNA library. Of approximately 2 million clones screened with high stringency conditions, several positive clones were identified; two have been plaque purified. One of these clones has been partially mapped using restriction enzymes known to cut within the coded region of the cDNA for human serum cholinesterase. Hybridization of the fragments and their sizes are as expected if the genomic clone is cholinesterase. Sequencing of the DNA fragments in M13 is in progress to verify the identity of the clone and the location of introns.
A High-Throughput Process for the Solid-Phase Purification of Synthetic DNA Sequences

PubMed Central

Grajkowski, Andrzej; Cieślak, Jacek; Beaucage, Serge L.

2017-01-01

An efficient process for the purification of synthetic phosphorothioate and native DNA sequences is presented. The process is based on the use of an aminopropylated silica gel support functionalized with aminooxyalkyl functions to enable capture of DNA sequences through an oximation reaction with the keto function of a linker conjugated to the 5′-terminus of DNA sequences. Deoxyribonucleoside phosphoramidites carrying this linker, as a 5′-hydroxyl protecting group, have been synthesized for incorporation into DNA sequences during the last coupling step of a standard solid-phase synthesis protocol executed on a controlled pore glass (CPG) support. Solid-phase capture of the nucleobase- and phosphate-deprotected DNA sequences released from the CPG support is demonstrated to proceed near quantitatively. Shorter than full-length DNA sequences are first washed away from the capture support; the solid-phase purified DNA sequences are then released from this support upon reaction with tetra-n-butylammonium fluoride in dry dimethylsulfoxide (DMSO) and precipitated in tetrahydrofuran (THF). The purity of solid-phase-purified DNA sequences exceeds 98%. The simulated high-throughput and scalability features of the solid-phase purification process are demonstrated without sacrificing purity of the DNA sequences. PMID:28628204
Exome and deep sequencing of clinically aggressive neuroblastoma reveal somatic mutations that affect key pathways involved in cancer progression

PubMed Central

Lasorsa, Vito Alessandro; Formicola, Daniela; Pignataro, Piero; Cimmino, Flora; Calabrese, Francesco Maria; Mora, Jaume; Esposito, Maria Rosaria; Pantile, Marcella; Zanon, Carlo; De Mariano, Marilena; Longo, Luca; Hogarty, Michael D.; de Torres, Carmen; Tonini, Gian Paolo; Iolascon, Achille; Capasso, Mario

2016-01-01

The spectrum of somatic mutation of the most aggressive forms of neuroblastoma is not completely determined. We sought to identify potential cancer drivers in clinically aggressive neuroblastoma. Whole exome sequencing was conducted on 17 germline and tumor DNA samples from high-risk patients with adverse events within 36 months from diagnosis (HR-Event3) to identify somatic mutations. Deep targeted sequencing of 134 genes selected from the initial screening was then performed in an additional 48 germline and tumor pairs (62.5% HR-Event3 and high-risk patients), 17 HR-Event3 tumors and 17 human-derived neuroblastoma cell lines. We revealed 22 significantly mutated genes, many of which are implicated in cancer progression. Fifteen genes (68.2%) were highly expressed in neuroblastoma, supporting their involvement in the disease. CHD9, a cancer driver gene, was the most significantly altered (4.0% of cases) after ALK. Other genes (PTK2, NAV3, NAV1, FZD1 and ATRX), expressed in neuroblastoma and involved in cell invasion and migration, were mutated at frequencies ranging from 2% to 4%. Focal adhesion and regulation of actin cytoskeleton pathways were frequently disrupted (14.1% of cases), suggesting potential novel therapeutic strategies to prevent disease progression. Notably, BARD1, CHEK2 and AXIN2 were enriched in rare, potentially pathogenic, germline variants. In summary, whole exome and deep targeted sequencing identified novel cancer genes of clinically aggressive neuroblastoma. Our analyses show pathway-level implications of infrequently mutated genes in leading neuroblastoma progression. PMID:27009842
Exome and deep sequencing of clinically aggressive neuroblastoma reveal somatic mutations that affect key pathways involved in cancer progression.

PubMed

Lasorsa, Vito Alessandro; Formicola, Daniela; Pignataro, Piero; Cimmino, Flora; Calabrese, Francesco Maria; Mora, Jaume; Esposito, Maria Rosaria; Pantile, Marcella; Zanon, Carlo; De Mariano, Marilena; Longo, Luca; Hogarty, Michael D; de Torres, Carmen; Tonini, Gian Paolo; Iolascon, Achille; Capasso, Mario

2016-04-19

The spectrum of somatic mutation of the most aggressive forms of neuroblastoma is not completely determined. We sought to identify potential cancer drivers in clinically aggressive neuroblastoma. Whole exome sequencing was conducted on 17 germline and tumor DNA samples from high-risk patients with adverse events within 36 months from diagnosis (HR-Event3) to identify somatic mutations. Deep targeted sequencing of 134 genes selected from the initial screening was then performed in an additional 48 germline and tumor pairs (62.5% HR-Event3 and high-risk patients), 17 HR-Event3 tumors and 17 human-derived neuroblastoma cell lines. We revealed 22 significantly mutated genes, many of which are implicated in cancer progression. Fifteen genes (68.2%) were highly expressed in neuroblastoma, supporting their involvement in the disease. CHD9, a cancer driver gene, was the most significantly altered (4.0% of cases) after ALK. Other genes (PTK2, NAV3, NAV1, FZD1 and ATRX), expressed in neuroblastoma and involved in cell invasion and migration, were mutated at frequencies ranging from 2% to 4%. Focal adhesion and regulation of actin cytoskeleton pathways were frequently disrupted (14.1% of cases), suggesting potential novel therapeutic strategies to prevent disease progression. Notably, BARD1, CHEK2 and AXIN2 were enriched in rare, potentially pathogenic, germline variants. In summary, whole exome and deep targeted sequencing identified novel cancer genes of clinically aggressive neuroblastoma. Our analyses show pathway-level implications of infrequently mutated genes in leading neuroblastoma progression.
Microbial Genomes Multiply

NASA Technical Reports Server (NTRS)

Doolittle, Russell F.

2002-01-01

The publication of the first complete sequence of a bacterial genome in 1995 was a signal event, underscored by the fact that the article has been cited more than 2,100 times during the intervening seven years. It was a marvelous technical achievement, made possible by automatic DNA-sequencing machines. The feat is the more impressive in that complete genome sequencing has now been adopted in many different laboratories around the world. Four years ago in these columns I examined the situation after a dozen microbial genomes had been completed. Now, with upwards of 60 microbial genome sequences determined and twice that many in progress, it seems reasonable to assess just what is being learned. Are new concepts emerging about how cells work? Have there been practical benefits in the fields of medicine and agriculture? Is it feasible to determine the genomic sequence of every bacterial species on Earth? The answers to these questions may be Yes, Perhaps, and No, respectively.
ChIP-seq: advantages and challenges of a maturing technology.

PubMed

Park, Peter J

2009-10-01

Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a technique for genome-wide profiling of DNA-binding proteins, histone modifications or nucleosomes. Owing to the tremendous progress in next-generation sequencing technology, ChIP-seq offers higher resolution, less noise and greater coverage than its array-based predecessor ChIP-chip. With the decreasing cost of sequencing, ChIP-seq has become an indispensable tool for studying gene regulation and epigenetic mechanisms. In this Review, I describe the benefits and challenges in harnessing this technique with an emphasis on issues related to experimental design and data analysis. ChIP-seq experiments generate large quantities of data, and effective computational analysis will be crucial for uncovering biological mechanisms.
An improved model for whole genome phylogenetic analysis by Fourier transform.

PubMed

Yin, Changchuan; Yau, Stephen S-T

2015-10-07

DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences have undergone rearrangements, as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into the frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. 
The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
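The pipeline described in this abstract (numerical mapping, DFT power spectrum, even scaling, Euclidean distance) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the complex-number mapping in MAPPING, the linear-interpolation stretch in even_scale, and all function names are assumptions made for the example. The direct O(n^2) transform is only practical for short, non-empty sequences.

```python
import cmath
import math

# Hypothetical 2D mapping (an assumption, not the published one):
# each nucleotide is a point in the plane, stored as a complex number.
MAPPING = {"A": 1 + 0j, "T": -1 + 0j, "C": 0 + 1j, "G": 0 - 1j}


def power_spectrum(seq):
    """Return |X[k]|^2 for k = 0..n-1 using a direct O(n^2) DFT."""
    x = [MAPPING[base] for base in seq.upper()]
    n = len(x)
    spectrum = []
    for k in range(n):
        coeff = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(abs(coeff) ** 2)
    return spectrum


def even_scale(spectrum, m):
    """Stretch a spectrum of length n to length m by linear interpolation.

    This stands in for the 'even scaling' step; the exact published
    formula may differ.
    """
    n = len(spectrum)
    if n == m:
        return list(spectrum)
    scaled = []
    for k in range(m):
        q = k * (n - 1) / (m - 1)      # fractional index into the original
        lo = int(q)
        hi = min(lo + 1, n - 1)
        scaled.append(spectrum[lo] + (q - lo) * (spectrum[hi] - spectrum[lo]))
    return scaled


def dissimilarity(seq_a, seq_b):
    """Euclidean distance between evenly scaled power spectra."""
    spec_a, spec_b = power_spectrum(seq_a), power_spectrum(seq_b)
    m = max(len(spec_a), len(spec_b))
    a, b = even_scale(spec_a, m), even_scale(spec_b, m)
    return math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))
```

A pairwise matrix of dissimilarity values computed this way could then be passed to any hierarchical clustering routine; for genome-scale input an FFT library would replace the direct transform.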
Next-generation sequencing: the future of molecular genetics in poultry production and food safety.

PubMed

Diaz-Sanchez, S; Hanning, I; Pendleton, Sean; D'Souza, Doris

2013-02-01

The era of molecular biology and automation of the Sanger chain-terminator sequencing method has led to discovery and advances in diagnostics and biotechnology. The Sanger methodology dominated research for over 2 decades, leading to significant accomplishments and technological improvements in DNA sequencing. Next-generation high-throughput sequencing (HT-NGS) technologies were developed subsequently to overcome the limitations of this first-generation technology, offering higher speed, less labor, and lower cost. Various platforms developed include sequencing-by-synthesis 454 Life Sciences, Illumina (Solexa) sequencing, SOLiD sequencing (among others), and the Ion Torrent semiconductor sequencing technologies that use different detection principles. As technology advances, progress toward third-generation sequencing technologies is being reported; these include Nanopore Sequencing and real-time monitoring of PCR activity through fluorescent resonant energy transfer. The advantages of these technologies include scalability, simplicity, increasing DNA polymerase performance and yields, lower error rates, and greater economic feasibility, with the eventual goal of obtaining real-time results. These technologies can be directly applied to improve poultry production and enhance food safety. For example, sequence-based (determination of the gut microbial community, genes for metabolic pathways, or presence of plasmids) and function-based (screening for function such as antibiotic resistance, or vitamin production) metagenomic analysis can be carried out. Gut microbial flora/communities of poultry can be sequenced to determine the changes that affect health and disease along with efficacy of methods to control pathogenic growth. Thus, the purpose of this review is to provide an overview of the principles of these current technologies and their potential application to improve poultry production and food safety as well as public health.
Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer

DTIC Science & Technology

2015-09-01

EPICOPY to obtain reliable copy number variation ( CNV ) data from the methylome array data, thereby decreasing the DNA requirements in half...in the R statistical environment. Samples were assessed for good performance on the array using detection p-values, a metric implemented by...Illumina to identify probes detected with confidence. Samples less than 90% of probes detected were removed from the analysis and probes undetected in any
Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.

PubMed

Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene

2017-02-01

Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Crystal digital droplet PCR for detection and quantification of circulating EGFR sensitizing and resistance mutations in advanced non-small cell lung cancer

PubMed Central

Madic, Jordan; Remon, Jordi; Honoré, Aurélie; Girard, Romain; Rouleau, Etienne; André, Barbara; Besse, Benjamin; Droniou, Magali; Lacroix, Ludovic

2017-01-01

Over the past years, targeted therapies using tyrosine kinase inhibitors (TKI) have led to an increase in progression-free survival and response rate for a subgroup of non-small cell lung cancer (NSCLC) patients harbouring specific gene abnormalities compared with chemotherapy. However long-lasting tumor regression is rarely achieved, due to the development of resistant tumoral subclones, which requires alternative therapeutic approaches. Molecular profile at progressive disease is a challenge for making adaptive treatment decisions. The aim of this study was to monitor EGFR-mutant tumors over time based on the quantity of mutant DNA circulating in plasma (ctDNA), comparing two different methods, Crystal™ Digital™ PCR and Massive Parallel Sequencing (MPS). In plasma circulating cell free DNA (cfDNA) of 61 advanced NSCLC patients we found an overall correlation of 78% between mutated allelic fraction measured by Crystal Digital PCR and MPS. 7 additional samples with sensitizing mutations and 4 additional samples with the resistance mutation were detected with Crystal Digital PCR, but not with MPS. Monitoring levels of both mutation types over time showed a correlation between levels and trends of mutated ctDNA detected and clinical assessment of disease for the 6 patients tested. In conclusion, Crystal Digital PCR exhibited good performance for monitoring mutational status in plasma cfDNA, and also appeared as better suited to the detection of known mutations than MPS in terms of features such as time to results. PMID:28829811
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

PubMed

Yin, Changchuan

2015-04-01

To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulated annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
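To make the 3-periodicity SNR concrete, the sketch below computes it for the baseline 4D binary representation mentioned in the abstract (one 0/1 indicator sequence per nucleotide). It does not implement the GCC mapping or the annealing step. The SNR definition used here, peak power at frequency N/3 divided by the mean power over the non-zero frequencies, is an assumption chosen for illustration, as are the function names.

```python
import cmath
import math


def total_power(seq, k):
    """Summed DFT power at frequency index k over the four indicator sequences."""
    n = len(seq)
    power = 0.0
    for base in "ACGT":
        # The DFT of a 0/1 indicator sequence only sums positions holding this base.
        coeff = sum(
            cmath.exp(-2j * math.pi * k * t / n)
            for t, b in enumerate(seq)
            if b == base
        )
        power += abs(coeff) ** 2
    return power


def periodicity3_snr(seq):
    """Peak power at N/3 divided by mean power over k = 1..N-1 (assumed definition)."""
    seq = seq.upper()
    n = len(seq) - len(seq) % 3          # trim so that N/3 is an integer index
    if n < 3:
        raise ValueError("sequence must contain at least one full codon")
    seq = seq[:n]
    peak = total_power(seq, n // 3)
    mean = sum(total_power(seq, k) for k in range(1, n)) / (n - 1)
    return peak / mean if mean > 0 else 0.0
```

For a perfectly codon-periodic toy input such as "ATG" repeated 20 times, all non-DC power sits at N/3 and 2N/3, so the ratio reaches its upper bound of (N - 1) / 2. In a sliding-window (STFT-style) scan, the same function would be applied to successive windows along a genome.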

Insights about minority HIV-1 strains in transmitted drug resistance mutation dynamics and disease progression.

PubMed

Leda, Ana Rachel; Hunter, James; Oliveira, Ursula Castro; Azevedo, Inacio Junqueira; Sucupira, Maria Cecilia Araripe; Diaz, Ricardo Sobhie

2018-04-19

The presence of minority transmitted drug resistance mutations was assessed using ultra-deep sequencing and correlated with disease progression among recently HIV-1-infected individuals from Brazil. Samples at baseline during recent infection and 1 year after the establishment of the infection were analysed. Viral RNA and proviral DNA from 25 individuals were subjected to ultra-deep sequencing of the reverse transcriptase and protease regions of HIV-1. Viral strains carrying transmitted drug resistance mutations were detected in 9 out of the 25 patients, for all major antiretroviral classes, ranging from one to five mutations per patient. Ultra-deep sequencing detected strains with frequencies as low as 1.6% and only strains with frequencies >20% were detected by population plasma sequencing (three patients). Transmitted drug resistance strains with frequencies <14.8% did not persist upon established infection. The presence of transmitted drug resistance mutations was negatively correlated with the viral load and with CD4+ T cell count decay. Transmitted drug resistance mutations representing small percentages of the viral population do not persist during infection because they are negatively selected in the first year after HIV-1 seroconversion.
Single-cell genomic sequencing using Multiple Displacement Amplification.

PubMed

Lasken, Roger S

2007-10-01

Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
Comparative scaffolding and gap filling of ancient bacterial genomes applied to two ancient Yersinia pestis genomes

PubMed Central

Doerr, Daniel; Chauve, Cedric

2017-01-01

Yersinia pestis is the causative agent of the bubonic plague, a disease responsible for several dramatic historical pandemics. Progress in ancient DNA (aDNA) sequencing rendered possible the sequencing of whole genomes of important human pathogens, including the ancient Y. pestis strains responsible for outbreaks of the bubonic plague in London in the 14th century and in Marseille in the 18th century, among others. However, aDNA sequencing data are still characterized by short reads and non-uniform coverage, so assembling ancient pathogen genomes remains challenging and often prevents a detailed study of genome rearrangements. It has recently been shown that comparative scaffolding approaches can improve the assembly of ancient Y. pestis genomes at a chromosome level. In the present work, we address the last step of genome assembly, the gap-filling stage. We describe an optimization-based method AGapEs (ancestral gap estimation) to fill in inter-contig gaps using a combination of a template obtained from related extant genomes and aDNA reads. We show how this approach can be used to refine comparative scaffolding by selecting contig adjacencies supported by a mix of unassembled aDNA reads and comparative signal. We applied our method to two Y. pestis data sets from the London and Marseilles outbreaks, for which we obtained highly improved genome assemblies for both genomes, comprised of, respectively, five and six scaffolds with 95 % of the assemblies supported by ancient reads. We analysed the genome evolution between both ancient genomes in terms of genome rearrangements, and observed a high level of synteny conservation between these strains. PMID:29114402
Mapping of transcription factor binding regions in mammalian cells by ChIP: Comparison of array- and sequencing-based technologies

PubMed Central

Euskirchen, Ghia M.; Rozowsky, Joel S.; Wei, Chia-Lin; Lee, Wah Heng; Zhang, Zhengdong D.; Hartman, Stephen; Emanuelsson, Olof; Stolc, Viktor; Weissman, Sherman; Gerstein, Mark B.; Ruan, Yijun; Snyder, Michael

2007-01-01

Recent progress in mapping transcription factor (TF) binding regions can largely be credited to chromatin immunoprecipitation (ChIP) technologies. We compared strategies for mapping TF binding regions in mammalian cells using two different ChIP schemes: ChIP with DNA microarray analysis (ChIP-chip) and ChIP with DNA sequencing (ChIP-PET). We first investigated parameters central to obtaining robust ChIP-chip data sets by analyzing STAT1 targets in the ENCODE regions of the human genome, and then compared ChIP-chip to ChIP-PET. We devised methods for scoring and comparing results among various tiling arrays and examined parameters such as DNA microarray format, oligonucleotide length, hybridization conditions, and the use of competitor Cot-1 DNA. The best performance was achieved with high-density oligonucleotide arrays, oligonucleotides ≥50 bases (b), the presence of competitor Cot-1 DNA and hybridizations conducted in microfluidics stations. When target identification was evaluated as a function of array number, 80%–86% of targets were identified with three or more arrays. Comparison of ChIP-chip with ChIP-PET revealed strong agreement for the highest ranked targets with less overlap for the low ranked targets. With advantages and disadvantages unique to each approach, we found that ChIP-chip and ChIP-PET are frequently complementary in their relative abilities to detect STAT1 targets for the lower ranked targets; each method detected validated targets that were missed by the other method. The most comprehensive list of STAT1 binding regions is obtained by merging results from ChIP-chip and ChIP-sequencing. Overall, this study provides information for robust identification, scoring, and validation of TF targets using ChIP-based technologies. PMID:17568005
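The merging step mentioned in this abstract, combining target lists from two platforms into one list of binding regions, can be illustrated with a plain interval merge. This is a generic sketch, not the scoring or merging procedure used in the study: the (chromosome, start, end) tuple layout, the rule that overlapping or touching regions are fused, and the toy coordinates are all assumptions.

```python
def merge_regions(*region_lists):
    """Merge overlapping or touching (chrom, start, end) regions from several lists."""
    regions = sorted(r for regions in region_lists for r in regions)
    merged = []
    for chrom, start, end in regions:
        if merged and merged[-1][0] == chrom and start <= merged[-1][2]:
            # Extend the previous region instead of starting a new one.
            last_chrom, last_start, last_end = merged[-1]
            merged[-1] = (last_chrom, last_start, max(last_end, end))
        else:
            merged.append((chrom, start, end))
    return merged


# Toy coordinates, for illustration only.
chip_chip = [("chr1", 100, 200), ("chr1", 500, 600)]
chip_pet = [("chr1", 150, 250), ("chr2", 100, 200)]
combined = merge_regions(chip_chip, chip_pet)
```

Here the two overlapping chr1 regions collapse into a single region spanning 100 to 250, while regions seen by only one platform are kept unchanged.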
Base Preferences in Non-Templated Nucleotide Incorporation by MMLV-Derived Reverse Transcriptases

PubMed Central

Zajac, Pawel; Islam, Saiful; Hochgerner, Hannah; Lönnerberg, Peter; Linnarsson, Sten

2013-01-01

Reverse transcriptases derived from Moloney Murine Leukemia Virus (MMLV) have an intrinsic terminal transferase activity, which causes the addition of a few non-templated nucleotides at the 3´ end of cDNA, with a preference for cytosine. This mechanism can be exploited to make the reverse transcriptase switch template from the RNA molecule to a secondary oligonucleotide during first-strand cDNA synthesis, and thereby to introduce arbitrary barcode or adaptor sequences in the cDNA. Because the mechanism is relatively efficient and occurs in a single reaction, it has recently found use in several protocols for single-cell RNA sequencing. However, the base preference of the terminal transferase activity is not known in detail, which may lead to inefficiencies in template switching when starting from tiny amounts of mRNA. Here, we used fully degenerate oligos to determine the exact base preference at the template switching site up to a distance of ten nucleotides. We found a strong preference for guanosine at the first non-templated nucleotide, with a greatly reduced bias at progressively more distant positions. Based on this result, and a number of careful optimizations, we report conditions for efficient template switching for cDNA amplification from single cells. PMID:24392002
Cells Comprising the Prostate Cancer Microenvironment Lack Recurrent Clonal Somatic Genomic Aberrations

PubMed Central

Bianchi-Frias, Daniella; Basom, Ryan; Delrow, Jeffrey J; Coleman, Ilsa M; Dakhova, Olga; Qu, Xiaoyu; Fang, Min; Franco, Omar E.; Ericson, Nolan G.; Bielas, Jason H.; Hayward, Simon W.; True, Lawrence; Morrissey, Colm; Brown, Lisha; Bhowmick, Neil A.; Rowley, David; Ittmann, Michael; Nelson, Peter S.

2017-01-01

Prostate cancer-associated stroma (CAS) plays an active role in malignant transformation, tumor progression, and metastasis. Molecular analyses of CAS have demonstrated significant changes in gene expression; however, conflicting evidence exists on whether genomic alterations in benign cells comprising the tumor microenvironment (TME) underlie gene expression changes and oncogenic phenotypes. This study evaluates the nuclear and mitochondrial DNA integrity of prostate carcinoma cells, CAS, matched benign epithelium and benign epithelium-associated stroma by whole genome copy number analyses, targeted sequencing of TP53, and fluorescence in situ hybridization. Comparative genomic hybridization (aCGH) of CAS revealed a copy-neutral diploid genome with only rare and small somatic copy number aberrations (SCNAs). In contrast, several expected recurrent SCNAs were evident in the adjacent prostate carcinoma cells, including gains at 3q, 7p, and 8q, and losses at 8p and 10q. No somatic TP53 mutations were observed in CAS. Mitochondrial DNA (mtDNA) extracted from carcinoma cells and stroma identified 23 somatic mtDNA mutations in neoplastic epithelial cells but only one mutation in stroma. Finally, genomic analyses identified no SCNAs, no loss of heterozygosity (LOH) or copy-neutral LOH in cultured cancer-associated fibroblasts (CAFs), which are known to promote prostate cancer progression in vivo. PMID:26753621
Performance comparison of two commercial human whole-exome capture systems on formalin-fixed paraffin-embedded lung adenocarcinoma samples.

PubMed

Bonfiglio, Silvia; Vanni, Irene; Rossella, Valeria; Truini, Anna; Lazarevic, Dejan; Dal Bello, Maria Giovanna; Alama, Angela; Mora, Marco; Rijavec, Erika; Genova, Carlo; Cittaro, Davide; Grossi, Francesco; Coco, Simona

2016-08-30

Next Generation Sequencing (NGS) has become a valuable tool for molecular landscape characterization of cancer genomes, leading to a better understanding of tumor onset and progression, and opening new avenues in translational oncology. Formalin-fixed paraffin-embedded (FFPE) tissue is the method of choice for storage of clinical samples; however, the low quality of FFPE genomic DNA (gDNA) can limit its use for downstream applications. To investigate the suitability of FFPE specimens for NGS analysis and to establish the performance of two solution-based exome capture technologies, we compared the whole-exome sequencing (WES) data of gDNA extracted from 5 fresh frozen (FF) and 5 matched FFPE lung adenocarcinoma tissues using SeqCap EZ Human Exome v.3.0 (Roche NimbleGen) and SureSelect XT Human All Exon v.5 (Agilent Technologies). Sequencing metrics on Illumina HiSeq were optimal for both exome systems and comparable between FFPE and FF samples, with a slight increase in PCR duplicates in FFPE, mainly in Roche NimbleGen libraries. Comparison of single nucleotide variants (SNVs) between FFPE-FF pairs reached overlapping values >90% in both systems. Both WES systems showed high concordance with target re-sequencing data by Ion PGM™ in 22 lung-cancer genes, regardless of the source of samples. Exon coverage of 623 cancer-related genes revealed high coverage efficiency of both kits, suggesting WES as a valid alternative to target re-sequencing. High-quality and reliable data can be successfully obtained from WES of FFPE samples starting from a relatively low amount of input gDNA, suggesting the inclusion of NGS-based tests in the clinical context. In conclusion, our analysis suggests that the WES approach could be extended to a translational research context as well as to the clinic (e.g. to study rare malignancies), where the simultaneous analysis of the whole coding region of the genome may help in the detection of cancer-linked variants.
[The world of double helix--"it did not escape our notice"].

PubMed

Gabryelska, Marta M; Barciszewski, Jan

2013-01-01

One of the key questions of biology is the nature and mechanisms of gene function. Sixty years have passed since the right-handed model of the DNA double helix was proposed in 1953. This discovery was honored with the Nobel Prize in 1962 and became a breakthrough in understanding the mechanisms of heredity and the genetic code. Since that time a great deal of data has been gathered on the functions, structure and applications of DNA. It became the basis of modern molecular biology, chemical biology and biotechnology. Today we know that the double helix is characterized by dynamics and plasticity that depend on its nucleotide sequence. Chromatin structure and DNA-mediated charge transport play a crucial role in understanding the mechanisms of DNA damage and repair. Progress in epigenetics has made it possible to identify new DNA bases, such as 5-methylcytosine, 5-hydroxymethylcytosine, 5-formylcytosine and 5-carboxycytosine. The design of new catalytic nucleic acids and the nanotechnology field of DNA origami reveal its application potential.
Acquisition of New DNA Sequences After Infection of Chicken Cells with Avian Myeloblastosis Virus

PubMed Central

Shoyab, M.; Baluda, M. A.; Evans, R.

1974-01-01

DNA-RNA hybridization studies between 70S RNA from avian myeloblastosis virus (AMV) and an excess of DNA from (i) AMV-induced leukemic chicken myeloblasts or (ii) a mixture of normal and of congenitally infected K-137 chicken embryos producing avian leukosis viruses revealed the presence of fast- and slow-hybridizing virus-specific DNA sequences. However, the leukemic cells contained twice the level of AMV-specific DNA sequences observed in normal chicken embryonic cells. The fast-reacting sequences were two to three times more numerous in leukemic DNA than in DNA from the mixed embryos. The slow-reacting sequences had a reiteration frequency of approximately 9 and 6, in the two respective systems. Both the fast- and the slow-reacting DNA sequences in leukemic cells exhibited a higher Tm (2 C) than the respective DNA sequences in normal cells. In normal and leukemic cells the slow hybrid sequences appeared to have a Tm which was 2 C higher than that of the fast hybrid sequences. Individual non-virus-producing chicken embryos, either group-specific antigen positive or negative, contained 40 to 100 copies of the fast sequences and 2 to 6 copies of the slowly hybridizing sequences per cell genome. Normal rat cells did not contain DNA that hybridized with AMV RNA, whereas non-virus-producing rat cells transformed by B-77 avian sarcoma virus contained only the slowly reacting sequences. The results demonstrate that leukemic cells transformed by AMV contain new AMV-specific DNA sequences which were not present before infection. PMID:16789139
Homogeneity of the 16S rDNA sequence among geographically disparate isolates of Taylorella equigenitalis

PubMed Central

Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC

2006-01-01

Background At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, and they differ at a few nucleotide positions. It is therefore important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. When the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France was cloned, sequenced and compared, nucleotide sequence differences were demonstrated at six loci in the 1,469-nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501, where substitutions and deletions were noted. PMID:16398935
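The comparison described above (counting polymorphic sites and pairwise similarity among aligned 16S rDNA sequences) can be sketched as follows. This is a minimal illustration that assumes the sequences have already been aligned to equal length, with `-` marking deletions; the function names and toy sequences are hypothetical and do not come from the study.

```python
def polymorphic_sites(aligned):
    """Return 1-based alignment positions where the sequences differ.

    Assumes all sequences are pre-aligned to the same length; a gap
    character ('-') is treated as a distinct state.
    """
    if not aligned:
        return []
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("sequences must be aligned to equal length")
    sites = []
    for i in range(length):
        column = {s[i] for s in aligned}
        if len(column) > 1:
            sites.append(i + 1)
    return sites

def percent_identity(a, b):
    """Percentage of alignment columns at which two sequences agree."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)

# Toy alignment, invented for illustration only.
toy = ["ACGTACGT", "ACGTACGA", "ACCTACGT"]
print(polymorphic_sites(toy))
print(percent_identity(toy[0], toy[1]))
```

Applied to real data, the same column-wise scan would report the loci at which the strains differ, and `percent_identity` gives the kind of similarity figure quoted in the conclusion.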
Reading biological processes from nucleotide sequences

NASA Astrophysics Data System (ADS)

Murugan, Anand

Cellular processes have traditionally been investigated by techniques of imaging and biochemical analysis of the molecules involved. The recent rapid progress in our ability to manipulate and read nucleic acid sequences gives us direct access to the genetic information that directs and constrains biological processes. While sequence data is being used widely to investigate genotype-phenotype relationships and population structure, here we use sequencing to understand biophysical mechanisms. We present work on two different systems. First, in chapter 2, we characterize the stochastic genetic editing mechanism that produces diverse T-cell receptors in the human immune system. We do this by inferring statistical distributions of the underlying biochemical events that generate T-cell receptor coding sequences from the statistics of the observed sequences. This inferred model quantitatively describes the potential repertoire of T-cell receptors that can be produced by an individual, providing insight into its potential diversity and the probability of generation of any specific T-cell receptor. Then in chapter 3, we present work on understanding the functioning of regulatory DNA sequences in both prokaryotes and eukaryotes. Here we use experiments that measure the transcriptional activity of large libraries of mutagenized promoters and enhancers and infer models of the sequence-function relationship from this data. For the bacterial promoter, we infer a physically motivated 'thermodynamic' model of the interaction of DNA-binding proteins and RNA polymerase determining the transcription rate of the downstream gene. For the eukaryotic enhancers, we infer heuristic models of the sequence-function relationship and use these models to find synthetic enhancer sequences that optimize inducibility of expression. 
Both projects demonstrate the utility of sequence information in conjunction with sophisticated statistical inference techniques for dissecting underlying biophysical mechanisms.
Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

DOEpatents

McCutchen-Maloney, Sandra L.

2002-01-01

DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dipsticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such as TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding, which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera, which gives a decrease in signal.
RNA sequencing analysis to capture the transcriptome landscape during skin ulceration syndrome progression in sea cucumber Apostichopus japonicus.

PubMed

Yang, Aifu; Zhou, Zunchun; Pan, Yongjia; Jiang, Jingwei; Dong, Ying; Guan, Xiaoyan; Sun, Hongjuan; Gao, Shan; Chen, Zhong

2016-06-14

Sea cucumber Apostichopus japonicus is an important economic species in China, which is affected by various diseases; skin ulceration syndrome (SUS) is the most serious. In this study, we characterized the transcriptomes in A. japonicus challenged with Vibrio splendidus to elucidate the changes in gene expression throughout the three stages of SUS progression. RNA sequencing of 21 cDNA libraries from various tissues and developmental stages of SUS-affected A. japonicus yielded 553 million raw reads, of which 542 million high-quality reads were generated by deep-sequencing using the Illumina HiSeq™ 2000 platform. The reference transcriptome comprised a combination of the Illumina reads, 454 sequencing data and Sanger sequences obtained from the public database to generate 93,163 unigenes (average length, 1,052 bp; N50 = 1,575 bp); 33,860 were annotated. Transcriptome comparisons between healthy and SUS-affected A. japonicus revealed greater differences in gene expression profiles in the body walls (BW) than in the intestines (Int), respiratory trees (RT) and coelomocytes (C). Clustering of expression models revealed stable up-regulation as the main pattern occurring in the BW throughout the three stages of SUS progression. Significantly affected pathways were associated with signal transduction, immune system, cellular processes, development and metabolism. Ninety-two differentially expressed genes (DEGs) were divided into four functional categories: attachment/pathogen recognition (17), inflammatory reactions (38), oxidative stress response (7) and apoptosis (30). Using quantitative real-time PCR, twenty representative DEGs were selected to validate the sequencing results. The Pearson's correlation coefficient (R) of the 20 DEGs ranged from 0.811 to 0.999, which confirmed the consistency and accuracy between these two approaches. Dynamic changes in global gene expression occur during SUS progression in A. japonicus. 
Elucidation of these changes is important in clarifying the molecular mechanisms associated with the development of SUS in sea cucumber.
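The assembly statistic quoted above (N50 = 1,575 bp) follows a standard definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembled length. A minimal sketch, with a hypothetical function name and invented toy lengths:

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are taken from longest to shortest until their cumulative
    length reaches at least half of the total; the length of the last
    sequence added is the N50.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one length is required")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length

# Toy unigene lengths in bp, invented for illustration only.
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths))
```

An N50 well above the average length, as in this record, indicates that most assembled bases sit in the longer unigenes.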
Methylation patterns of repetitive DNA sequences in germ cells of Mus musculus.

PubMed

Sanford, J; Forrester, L; Chapman, V; Chandley, A; Hastie, N

1984-03-26

The major and the minor satellite sequences of Mus musculus were undermethylated in both sperm and oocyte DNAs relative to the amount of undermethylation observed in adult somatic tissue DNA. This hypomethylation was specific for satellite sequences in sperm DNA. Dispersed repetitive and low copy sequences show a high degree of methylation in sperm DNA; however, a dispersed repetitive sequence was undermethylated in oocyte DNA. This finding suggests a difference in the amount of total genomic DNA methylation between sperm and oocyte DNA. The methylation levels of the minor satellite sequences did not change during spermiogenesis, and were not associated with the onset of meiosis or a specific stage in sperm development.
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
A Molecular View of Kinetochore Assembly and Function

PubMed Central

Musacchio, Andrea; Desai, Arshad

2017-01-01

Kinetochores are large protein assemblies that connect chromosomes to microtubules of the mitotic and meiotic spindles in order to distribute the replicated genome from a mother cell to its daughters. Kinetochores also control feedback mechanisms responsible for the correction of incorrect microtubule attachments, and for the coordination of chromosome attachment with cell cycle progression. Finally, kinetochores contribute to their own preservation, across generations, at the specific chromosomal loci devoted to host them, the centromeres. They achieve this in most species by exploiting an epigenetic, DNA-sequence-independent mechanism; notable exceptions are budding yeasts where a specific sequence is associated with centromere function. In the last 15 years, extensive progress in the elucidation of the composition of the kinetochore and the identification of various physical and functional modules within its substructure has led to a much deeper molecular understanding of kinetochore organization and the origins of its functional output. Here, we provide a broad summary of this progress, focusing primarily on kinetochores of humans and budding yeast, while highlighting work from other models, and present important unresolved questions for future studies. PMID:28125021
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

PubMed

Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

2014-11-01

As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. 
To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
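The three figures of merit reported above (accuracy, F-measure and Matthews correlation coefficient) can all be computed from a binary confusion matrix. The sketch below uses the standard textbook definitions, which are assumed rather than confirmed to match the paper's exact formulas; the function name and the counts are hypothetical.

```python
import math

def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, F-measure, MCC) from confusion-matrix counts.

    tp/fn: binding nucleotides predicted as binding / non-binding.
    tn/fp: non-binding nucleotides predicted as non-binding / binding.
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f_measure = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, f_measure, mcc

# Toy counts, invented for illustration only.
accuracy, f_measure, mcc = binary_metrics(tp=40, fp=10, tn=35, fn=15)
print(f"accuracy={accuracy:.3f} F={f_measure:.3f} MCC={mcc:.3f}")
```

Unlike accuracy, MCC stays near zero for a predictor that does no better than chance on an imbalanced dataset, which is one reason it is commonly reported alongside accuracy and F-measure.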
Fractals in biology and medicine

NASA Technical Reports Server (NTRS)

Havlin, S.; Buldyrev, S. V.; Goldberger, A. L.; Mantegna, R. N.; Ossadnik, S. M.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

Our purpose is to describe some recent progress in applying fractal concepts to systems of relevance to biology and medicine. We review several biological systems characterized by fractal geometry, with a particular focus on the long-range power-law correlations found recently in DNA sequences containing noncoding material. Furthermore, we discuss the finding that the exponent alpha quantifying these long-range correlations ("fractal complexity") is smaller for coding than for noncoding sequences. We also discuss the application of fractal scaling analysis to the dynamics of heartbeat regulation, and report the recent finding that the normal heart is characterized by long-range "anticorrelations" which are absent in the diseased heart.
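The exponent alpha mentioned above is commonly estimated from a "DNA walk": the sequence is mapped to steps of +1 or -1, and the root-mean-square fluctuation F(l) of the walk over windows of length l is fitted to F(l) ~ l^alpha. The sketch below assumes a pyrimidine (+1) versus purine (-1) mapping and a plain least-squares fit on a log-log scale; the function names, window sizes and the random test sequence are illustrative choices, not the authors' procedure. An uncorrelated sequence should give alpha close to 0.5.

```python
import math
import random

def dna_walk(seq):
    """Cumulative walk: +1 for pyrimidines (C, T), -1 for purines (A, G)."""
    y = [0]
    for base in seq.upper():
        if base in "CT":
            y.append(y[-1] + 1)
        elif base in "AG":
            y.append(y[-1] - 1)
        # Ambiguous symbols (e.g. N) are skipped in this sketch.
    return y

def fluctuation(y, l):
    """Standard deviation of the walk displacement over windows of length l."""
    deltas = [y[i + l] - y[i] for i in range(len(y) - l)]
    mean = sum(deltas) / len(deltas)
    mean_sq = sum(d * d for d in deltas) / len(deltas)
    return math.sqrt(max(mean_sq - mean * mean, 0.0))

def scaling_exponent(seq, window_sizes):
    """Least-squares slope of log F(l) against log l."""
    y = dna_walk(seq)
    xs = [math.log(l) for l in window_sizes]
    ys = [math.log(fluctuation(y, l)) for l in window_sizes]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - mx) * (v - my) for x, v in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den

# Uncorrelated random sequence, generated for illustration only.
random.seed(0)
random_seq = "".join(random.choice("ACGT") for _ in range(20000))
alpha = scaling_exponent(random_seq, [4, 8, 16, 32, 64])
print(f"alpha = {alpha:.2f}")
```

Values of alpha noticeably above 0.5 would indicate the kind of long-range power-law correlation the abstract describes.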
Recent Progress in Aptamer-Based Functional Probes for Bioanalysis and Biomedicine.

PubMed

Zhang, Huimin; Zhou, Leiji; Zhu, Zhi; Yang, Chaoyong

2016-07-11

Nucleic acid aptamers are short synthetic DNA or RNA sequences that can bind to a wide range of targets with high affinity and specificity. In recent years, aptamers have attracted increasing research interest due to their unique features of high binding affinity and specificity, small size, excellent chemical stability, easy chemical synthesis, facile modification, and minimal immunogenicity. These properties make aptamers ideal recognition ligands for bioanalysis, disease diagnosis, and cancer therapy. This review highlights the recent progress in aptamer selection and the latest applications of aptamer-based functional probes in the fields of bioanalysis and biomedicine. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Initiation and termination of DNA replication during S phase in relation to cyclins D1, E and A, p21WAF1, Cdt1 and the p12 subunit of DNA polymerase δ revealed in individual cells by cytometry

PubMed Central

Darzynkiewicz, Zbigniew; Zhao, Hong; Zhang, Sufang; Lee, Marietta Y.W.T.; Lee, Ernest Y.C.; Zhang, Zhongtao

2015-01-01

During our recent studies on the mechanism of regulation of human DNA polymerase δ in preparation for DNA replication or repair, multiparameter imaging cytometry, as exemplified by laser scanning cytometry (LSC), has been used to assess changes in expression of the following nuclear proteins associated with initiation of DNA replication: cyclin A, PCNA, Ki-67, p21WAF1, DNA replication factor Cdt1 and the smallest subunit of DNA polymerase δ, p12. In the present review, rather than focusing on Pol δ, we emphasize the application of LSC in these studies and outline possibilities offered by the concurrent differential analysis of DNA replication in conjunction with expression of the nuclear proteins. A more extensive analysis of the data on the correlation between rates of EdU incorporation, likely reporting DNA replication, and expression of these proteins is presently provided. New data, specifically on the expression of cyclin D1 and cyclin E with respect to EdU incorporation as well as on the relationship between expression of cyclin A vs. p21WAF1 and Ki-67 vs. Cdt1, are also reported. Of particular interest is the observation that this approach makes it possible to assess the temporal sequence of degradation of cyclin D1, p21WAF1, Cdt1 and p12, each with respect to initiation of DNA replication and with respect to each other. The sequence of reappearance of these proteins in G2 after termination of DNA replication is also assessed. The reviewed data provide a more comprehensive presentation of potential markers whose presence or absence marks the DNA-replicating cells. Also discussed is the usefulness of these markers as indicators of proliferative activity in cancer tissues that may bear information on tumor progression and have prognostic value. PMID:26059433
Enlightenment of Yeast Mitochondrial Homoplasmy: Diversified Roles of Gene Conversion

PubMed Central

Ling, Feng; Mikawa, Tsutomu; Shibata, Takehiko

2011-01-01

Mitochondria have their own genomic DNA. Unlike the nuclear genome, each cell contains hundreds to thousands of copies of mitochondrial DNA (mtDNA). The copies of mtDNA tend to have heterogeneous sequences, due to the high frequency of mutagenesis, but are quickly homogenized within a cell (“homoplasmy”) during vegetative cell growth or through a few sexual generations. Heteroplasmy is strongly associated with mitochondrial diseases, diabetes and aging. Recent studies revealed that the yeast cell has the machinery to homogenize mtDNA, using a common DNA processing pathway with gene conversion; i.e., both genetic events are initiated by a double-stranded break, which is processed into 3′ single-stranded tails. One of the tails is base-paired with the complementary sequence of the recipient double-stranded DNA to form a D-loop (homologous pairing), in which repair DNA synthesis is initiated to restore the sequence lost by the breakage. Gene conversion generates sequence diversity, depending on the divergence between the donor and recipient sequences, especially when it occurs among a number of copies of a DNA sequence family with some sequence variations, such as in immunoglobulin diversification in chicken. MtDNA can be regarded as a sequence family, in which the members tend to be diversified by a high frequency of spontaneous mutagenesis. Thus, it would be interesting to determine why and how double-stranded breakage and D-loop formation induce sequence homogenization in mitochondria and sequence diversification in nuclear DNA. We will review the mechanisms and roles of mtDNA homoplasmy, in contrast to nuclear gene conversion, which diversifies gene and genome sequences, to provide clues toward understanding how the common DNA processing pathway results in such divergent outcomes. PMID:24710143
Clinical Implications of Promoter Hypermethylation in RASSF1A and MGMT in Retinoblastoma

PubMed Central

Choy, Kwong Wai; Lee, Tom C; Cheung, Kin Fai; Fan, Dorothy S P; Lo, Kwok Wai; Beaverson, Katherine L; Abramson, David H; Lam, Dennis S C; Yu, Christopher B O; Pang, Chi Pui

2005-01-01

Abstract We investigated the epigenetic silencing and genetic changes of the RAS-associated domain family 1A (RASSF1A) gene and the O6-methylguanine-DNA methyltransferase (MGMT) gene in retinoblastoma. We extracted DNA from microdissected tumor and normal retina tissues of the same patient in 68 retinoblastoma cases. Promoter methylation in RASSF1A and MGMT was analyzed by methylation-specific PCR, RASSF1A sequence alterations in all coding exons by direct DNA sequencing, and RASSF1A expression by RT-PCR. Cell cycle staging was analyzed by flow cytometry. We detected RASSF1A promoter hypermethylation in 82% of retinoblastomas, in tumor tissue only and not in adjacent normal retinal tissue. RASSF1A transcripts were not expressed in any of the hypermethylated samples, but they were restored after 5-aza-2′-deoxycytidine treatment with no changes in cell cycle or apoptosis. No mutation in the RASSF1A sequence was found. MGMT hypermethylation was present in 15% of the retinoblastoma samples, and the absence of MGMT hypermethylation was associated (P = .002) with retinoblastoma at advanced Reese-Ellsworth tumor stage. Our results revealed a high RASSF1A hypermethylation frequency in retinoblastoma. The correlation of MGMT inactivation by promoter hypermethylation with lower-stage disease indicated that MGMT hypermethylation provides useful prognostic information. Epigenetic mechanisms play an important role in the progression of retinoblastoma. PMID:15799820
"First generation" automated DNA sequencing technology.

PubMed

Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M

2011-10-01

Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
High-throughput sequencing of the T cell receptor β gene identifies aggressive early-stage mycosis fungoides.

PubMed

de Masson, Adele; O'Malley, John T; Elco, Christopher P; Garcia, Sarah S; Divito, Sherrie J; Lowry, Elizabeth L; Tawa, Marianne; Fisher, David C; Devlin, Phillip M; Teague, Jessica E; Leboeuf, Nicole R; Kirsch, Ilan R; Robins, Harlan; Clark, Rachael A; Kupper, Thomas S

2018-05-09

Mycosis fungoides (MF), the most common cutaneous T cell lymphoma (CTCL), is a malignancy of skin-tropic memory T cells. Most MF cases present as early stage (stage I A/B, limited to the skin), and these patients typically have a chronic, indolent clinical course. However, a small subset of early-stage cases develop progressive and fatal disease. Because outcomes can be so different, early identification of this high-risk population is an urgent unmet clinical need. We evaluated the use of next-generation high-throughput DNA sequencing of the T cell receptor β gene (TCRB) in lesional skin biopsies to predict progression and survival in a discovery cohort of 208 patients with CTCL (177 with MF) from a 15-year longitudinal observational clinical study. We compared these data to the results in an independent validation cohort of 101 CTCL patients (87 with MF). The tumor clone frequency (TCF) in lesional skin, measured by high-throughput sequencing of the TCRB gene, was an independent prognostic factor of both progression-free and overall survival in patients with CTCL and MF in particular. In early-stage patients, a TCF of >25% in the skin was a stronger predictor of progression than any other established prognostic factor (stage IB versus IA, presence of plaques, high blood lactate dehydrogenase concentration, large-cell transformation, or age). The TCF therefore may accurately predict disease progression in early-stage MF. Early identification of patients at high risk for progression could help identify candidates who may benefit from allogeneic hematopoietic stem cell transplantation before their disease becomes treatment-refractory. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
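The idea of a tumor clone frequency can be illustrated with a short sketch. It assumes, as a simplification, that the TCF is the read count of the most abundant TCRB clonotype divided by the total count of all clonotypes in the sample; the study's exact normalization is not given in the abstract, and the function name, clonotype labels and counts below are hypothetical.

```python
def tumor_clone_frequency(clonotype_counts):
    """Return (top clonotype, its fraction of all counted sequences).

    Simplifying assumption: the dominant clonotype is the tumor clone
    and the denominator is the sum over all clonotypes in the sample.
    """
    total = sum(clonotype_counts.values())
    if total == 0:
        raise ValueError("no sequences counted")
    top_clone, top_count = max(clonotype_counts.items(), key=lambda kv: kv[1])
    return top_clone, top_count / total

# Toy clonotype counts, invented for illustration only.
toy_counts = {"clone_A": 600, "clone_B": 250, "clone_C": 150}
clone, tcf = tumor_clone_frequency(toy_counts)
high_risk = tcf > 0.25  # threshold quoted in the abstract for early-stage MF
print(clone, f"{tcf:.0%}", high_risk)
```

The 25% cut-off is the one reported in the abstract; everything else in the sketch is an assumption made for illustration.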
Influence of DNA sequence on the structure of minicircles under torsional stress

PubMed Central

Wang, Qian; Irobalieva, Rossitza N.; Chiu, Wah; Schmid, Michael F.; Fogg, Jonathan M.; Zechiedrich, Lynn

2017-01-01

Abstract The sequence dependence of the conformational distribution of DNA under various levels of torsional stress is an important unsolved problem. Combining theory and coarse-grained simulations shows that the DNA sequence and a structural correlation due to topology constraints of a circle are the main factors that dictate the 3D structure of a 336 bp DNA minicircle under torsional stress. We found that DNA minicircle topoisomers can have multiple bend locations under high torsional stress and that the positions of these sharp bends are determined by the sequence, and by a positive mechanical correlation along the sequence. We showed that simulations and theory are able to provide sequence-specific information about individual DNA minicircles observed by cryo-electron tomography (cryo-ET). We provided a sequence-specific cryo-ET tomogram fitting of DNA minicircles, registering the sequence within the geometric features. Our results indicate that the conformational distribution of minicircles under torsional stress can be designed, which has important implications for using minicircle DNA for gene therapy. PMID:28609782
Analysis of DNA Sequences by an Optical Time-Integrating Correlator: Proof-of-Concept Experiments.

DTIC Science & Technology

1992-05-01

DNA ANALYSIS STRATEGY 4 2.1 Representation of DNA Bases 4 2.2 DNA Analysis Strategy 6 3.0 CUSTOM GENERATORS FOR DNA SEQUENCES 10 3.1 Hardware Design 10...of the DNA bases where each base is represented by a 7-bits long pseudorandom sequence. 5 Figure 4: Coarse analysis of a DNA sequence. 7 Figure 5: Fine...a 20-bases long database. 32 xiii LIST OF TABLES PAGE Table 1: Short representations of the DNA bases where each base is represented by 7-bits long
Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting

NASA Astrophysics Data System (ADS)

Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.

1997-05-01

Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possibly be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic diseases can be a very efficient step to reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.
Micronuclear DNA of Oxytricha nova contains sequences with autonomously replicating activity in Saccharomyces cerevisiae.

PubMed Central

Colombo, M M; Swanton, M T; Donini, P; Prescott, D M

1984-01-01

Oxytricha nova is a hypotrichous ciliate with micronuclei and macronuclei. Micronuclei, which contain large, chromosomal-sized DNA, are genetically inert but undergo meiosis and exchange during cell mating. Macronuclei, which contain only small, gene-sized DNA molecules, provide all of the nuclear RNA needed to run the cell. After cell mating the macronucleus is derived from a micronucleus, a derivation that includes excision of the genes from chromosomes and elimination of the remaining DNA. The eliminated DNA includes all of the repetitious sequences and approximately 95% of the unique sequences. We cloned large restriction fragments from the micronucleus that confer replication ability on a replication-deficient plasmid in Saccharomyces cerevisiae. Sequences that confer replication ability are called autonomously replicating sequences. The frequency and effectiveness of autonomously replicating sequences in micronuclear DNA are similar to those reported for DNAs of other organisms introduced into yeast cells. Of the 12 micronuclear fragments with autonomously replicating sequence activity, 9 also showed homology to macronuclear DNA, indicating that they contain a macronuclear gene sequence. We conclude from this that autonomously replicating sequence activity is nonrandomly distributed throughout micronuclear DNA and is preferentially associated with those regions of micronuclear DNA that contain genes. PMID:6092934
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

PubMed Central

Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

2014-01-01

As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

PubMed

El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

2013-07-01

Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
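The within-type and between-type identity figures quoted above (99.1-99.3% versus 97.4%) are means over pairwise comparisons. A minimal sketch of that calculation follows, assuming the sequences are already aligned to equal length with "-" marking gaps and that gapped columns are skipped; the abstract does not state the authors' alignment or gap handling, and the toy sequences are invented for illustration.

```python
from itertools import combinations, product


def percent_identity(a, b):
    """Percent identical positions between two aligned, equal-length sequences.

    Assumption: columns with a gap ('-') in either sequence are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        raise ValueError("no ungapped columns to compare")
    matches = sum(x == y for x, y in compared)
    return 100.0 * matches / len(compared)


def mean_identity_within(group):
    """Mean pairwise identity over all unordered pairs in one group."""
    scores = [percent_identity(a, b) for a, b in combinations(group, 2)]
    return sum(scores) / len(scores)


def mean_identity_between(group_a, group_b):
    """Mean pairwise identity over all cross-group pairs."""
    scores = [percent_identity(a, b) for a, b in product(group_a, group_b)]
    return sum(scores) / len(scores)


# Toy aligned sequences standing in for two paralog types.
type_a = ["ACGTACGT", "ACGTACGA"]
type_b = ["ACGTTCGT", "ACGTTCGT"]
within_a = mean_identity_within(type_a)
between = mean_identity_between(type_a, type_b)
divergence = 100.0 - between
```

Divergence is taken here as 100 minus mean identity, which is how a 97.4% between-type identity corresponds to the roughly 2.6% divergence in the abstract.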
Studying Different Binding and Intracellular Delivery Efficiency of ssDNA Single-Walled Carbon Nanotubes and Their Effects on LC3-Related Autophagy in Renal Mesangial Cells via miRNA-382.

PubMed

Wang, Guobao; Zhao, Tingting; Wang, Leyu; Hu, Bianxiang; Darabi, Ali; Lin, Jiansheng; Xing, Malcolm M Q; Qiu, Xiaozhong

2015-11-25

Single-walled carbon nanotubes (SWCNTs) have been used to deliver single-stranded DNA (ssDNA). An ssDNA oligonucleotide can act as an inhibitor of microRNA to regulate cellular functions. However, such ssDNA binds poorly to carbon nanotubes and is transferred into cells with low efficiency. To this end, we designed ssDNA with regulatory and functional units to form ssDNA-SWCNT hybrids to study their binding effects and transferring efficiency. The functional unit on ssDNA mimics the inhibitor (MI) of miRNA-382, which plays a crucial role in the progress of many diseases such as renal interstitial fibrosis. After verification of overexpression of miRNA-382 in a coculture system, we designed oligonucleotide sequences (GCG)5-MI, (TAT)5-MI, and N23-MI as regulatory units added to the 5'-terminal end of the functional DNA fragment, respectively. These regulatory units lead to different secondary structures and thus exhibit different affinities for SWCNTs, which in turn determine their delivery efficacy to cells. Autophagy, apoptosis and necrosis were observed in renal mesangial cells.
The Ties That Bind: Mapping the Dynamic Enhancer-Promoter Interactome.

PubMed

Spurrell, Cailyn H; Dickel, Diane E; Visel, Axel

2016-11-17

Coupling chromosome conformation capture to molecular enrichment for promoter-containing DNA fragments enables the systematic mapping of interactions between individual distal regulatory sequences and their target genes. In this Minireview, we describe recent progress in the application of this technique and related complementary approaches to gain insight into the lineage- and cell-type-specific dynamics of interactions between regulators and gene promoters. Copyright © 2016 Elsevier Inc. All rights reserved.
Lesion bypass by S. cerevisiae Pol ζ alone

PubMed Central

Stone, Jana E.; Kumar, Dinesh; Binz, Sara K.; Inase, Aki; Iwai, Shigenori; Chabes, Andrei; Burgers, Peter M.; Kunkel, Thomas A.

2011-01-01

DNA polymerase zeta (Pol ζ) participates in translesion synthesis (TLS) of DNA adducts that stall replication fork progression. Previous studies have led to the suggestion that the primary role of Pol ζ in TLS is to extend primers created when another DNA polymerase inserts nucleotides opposite lesions. Here we test the non-exclusive possibility that Pol ζ can sometimes perform TLS in the absence of any other polymerase. To do so, we quantified the efficiency with which S. cerevisiae Pol ζ bypasses abasic sites, cis-syn cyclobutane pyrimidine dimers and (6-4) photoproducts. In reactions containing dNTP concentrations that mimic those induced by DNA damage, a Pol ζ derivative with phenylalanine substituted for leucine 979 at the polymerase active site bypasses all three lesions at efficiencies between 27–73%. Wild-type Pol ζ also bypasses these lesions, with efficiencies that are lower and depend on the sequence context in which the lesion resides. The results are consistent with the hypothesis that, in addition to extending aberrant termini created by other DNA polymerases, Pol ζ has the potential to be the sole DNA polymerase involved in TLS. PMID:21622032
Affordable hands-on DNA sequencing and genotyping: an exercise for teaching DNA analysis to undergraduates.

PubMed

Shah, Kushani; Thomas, Shelby; Stein, Arnold

2013-01-01

In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs17822931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.
Breaking the 1000-gene barrier for Mimivirus using ultra-deep genome and transcriptome sequencing.

PubMed

Legendre, Matthieu; Santini, Sébastien; Rico, Alain; Abergel, Chantal; Claverie, Jean-Michel

2011-03-04

Mimivirus, a giant dsDNA virus infecting Acanthamoeba, is the prototype of the Mimiviridae family, the latest addition to the family of the nucleocytoplasmic large DNA viruses (NCLDVs). Its 1.2-Mb genome was initially predicted to encode 917 genes. A subsequent RNA-Seq analysis precisely mapped many transcript boundaries and identified 75 new genes. We now report a much deeper analysis using the SOLiD™ technology combining RNA-Seq of the Mimivirus transcriptome during the infectious cycle (202.4 Million reads), and a complete genome re-sequencing (45.3 Million reads). This study corrected the genome sequence and identified several single nucleotide polymorphisms. Our results also provided clear evidence of previously overlooked transcription units, including an important RNA polymerase subunit distantly related to Euryarchaea homologues. The total Mimivirus gene count is now 1018, 11% greater than the original annotation. This study highlights the huge progress brought about by ultra-deep sequencing for the comprehensive annotation of virus genomes, opening the door to a complete one-nucleotide resolution level description of their transcriptional activity, and to the realistic modeling of the viral genome expression at the ultimate molecular level. This work also illustrates the need to go beyond bioinformatics-only approaches for the annotation of short protein and non-coding genes in viral genomes.
Severe infantile Leigh syndrome associated with a rare mitochondrial ND6 mutation, m.14487T>C.

PubMed

Tarnopolsky, Mark; Meaney, Brandon; Robinson, Brian; Sheldon, Katherine; Boles, Richard G

2013-08-01

We describe a case of severe infantile-onset complex I deficiency in association with an apparent de novo near-homoplasmic mutation (m.14487T>C) in the mitochondrial ND6 gene, which was previously associated with Leigh syndrome and other neurological disorders. The mutation was near-homoplasmic in muscle by NextGen sequencing (99.4% mutant), homoplasmic in muscle by Sanger sequencing, and it was associated with a severe complex I deficiency in both muscle and fibroblasts. This supports previous data regarding Leigh syndrome being on the severe end of a phenotypic spectrum including progressive myoclonic epilepsy, childhood-onset dystonia, bilateral striatal necrosis, and optic atrophy, depending on the proportion of mutant heteroplasmy. While the mother in all previously reported cases was heteroplasmic, the mother and brother of this case were homoplasmic for the wild-type, m.14487T. Importantly, the current data demonstrate the potential for cases of mutations that were previously reported to be homoplasmic by Sanger sequencing to be less homoplasmic by NextGen sequencing. This case underscores the importance of considering mitochondrial DNA mutations in families with a negative family history, even in offspring of those who have tested negative for a specific mtDNA mutation. Copyright © 2013 Wiley Periodicals, Inc.
Phylogenetic characterization of a biogas plant microbial community integrating clone library 16S-rDNA sequences and metagenome sequence data obtained by 454-pyrosequencing.

PubMed

Kröber, Magdalena; Bekel, Thomas; Diaz, Naryttza N; Goesmann, Alexander; Jaenicke, Sebastian; Krause, Lutz; Miller, Dimitri; Runte, Kai J; Viehöver, Prisca; Pühler, Alfred; Schlüter, Andreas

2009-06-01

The phylogenetic structure of the microbial community residing in a fermentation sample from a production-scale biogas plant fed with maize silage, green rye and liquid manure was analysed by an integrated approach using clone library sequences and metagenome sequence data obtained by 454-pyrosequencing. Sequencing of 109 clones from a bacterial and an archaeal 16S-rDNA amplicon library revealed that the obtained nucleotide sequences are similar but not identical to 16S-rDNA database sequences derived from different anaerobic environments including digestors and bioreactors. Most of the bacterial 16S-rDNA sequences could be assigned to the phylum Firmicutes with the most abundant class Clostridia and to the class Bacteroidetes, whereas most archaeal 16S-rDNA sequences cluster close to the methanogen Methanoculleus bourgensis. Further sequences of the archaeal library most probably represent so far non-characterised species within the genus Methanoculleus. A similar result derived from phylogenetic analysis of mcrA clone sequences. The mcrA gene product encodes the alpha-subunit of methyl-coenzyme-M reductase involved in the final step of methanogenesis. BLASTn analysis applying stringent settings resulted in assignment of 16S-rDNA metagenome sequence reads to 62 16S-rDNA amplicon sequences thus enabling frequency of abundance estimations for 16S-rDNA clone library sequences. Ribosomal Database Project (RDP) Classifier processing of metagenome 16S-rDNA reads revealed abundance of the phyla Firmicutes, Bacteroidetes and Euryarchaeota and the orders Clostridiales, Bacteroidales and Methanomicrobiales. Moreover, a large fraction of 16S-rDNA metagenome reads could not be assigned to lower taxonomic ranks, demonstrating that numerous microorganisms in the analysed fermentation sample of the biogas plant are still unclassified or unknown.
DNA capture and next-generation sequencing can recover whole mitochondrial genomes from highly degraded samples for human identification

PubMed Central

2013-01-01

Background Mitochondrial DNA (mtDNA) typing can be a useful aid for identifying people from compromised samples when nuclear DNA is too damaged, degraded or below detection thresholds for routine short tandem repeat (STR)-based analysis. Standard mtDNA typing, focused on PCR amplicon sequencing of the control region (HVS I and HVS II), is limited by the resolving power of this short sequence, which misses up to 70% of the variation present in the mtDNA genome. Methods We used in-solution hybridisation-based DNA capture (using DNA capture probes prepared from modern human mtDNA) to recover mtDNA from post-mortem human remains in which the majority of DNA is both highly fragmented (<100 base pairs in length) and chemically damaged. The method ‘immortalises’ the finite quantities of DNA in valuable extracts as DNA libraries, which is followed by the targeted enrichment of endogenous mtDNA sequences and characterisation by next-generation sequencing (NGS). Results We sequenced whole mitochondrial genomes for human identification from samples where standard nuclear STR typing produced only partial profiles or demonstrably failed and/or where standard mtDNA hypervariable region sequences lacked resolving power. Multiple rounds of enrichment can substantially improve coverage and sequencing depth of mtDNA genomes from highly degraded samples. The application of this method has led to the reliable mitochondrial sequencing of human skeletal remains from unidentified World War Two (WWII) casualties approximately 70 years old and from archaeological remains (up to 2,500 years old). Conclusions This approach has potential applications in forensic science, historical human identification cases, archived medical samples, kinship analysis and population studies. In particular the methodology can be applied to any case, involving human or non-human species, where whole mitochondrial genome sequences are required to provide the highest level of maternal lineage discrimination. 
Multiple rounds of in-solution hybridisation-based DNA capture can retrieve whole mitochondrial genome sequences from even the most challenging samples. PMID:24289217
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.

PubMed

Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

2012-01-01

RDNAnalyzer is an innovative computer-based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate a DNA sequence, or users can upload sequences of interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various applications for sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides tools for sequence analyses routinely performed by biological scientists, such as DNA replication, reverse complement generation, transcription, translation, sequence-specific information such as the total number of nucleotide bases and ATGC base contents along with their respective percentages, and a sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides a user-friendly environment for sequence analysis. It is freely available. http://www.cemb.edu.pk/sw.html Abbreviations: RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.
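For readers unfamiliar with the Nussinov algorithm named above, the following is a minimal sketch of the textbook recurrence, which maximizes the number of nested base pairs. It is not RDNAnalyzer's code (the tool is written in C# and is said to extend the algorithm); the Watson-Crick-only pairing rule, the `min_loop` parameter, and the function name are assumptions made for illustration.

```python
def nussinov(seq, min_loop=3):
    """Return (max_pairs, dot_bracket) for a DNA sequence.

    Assumptions: only A-T and G-C pairs count, and at least `min_loop`
    unpaired bases must separate the two partners of a pair.
    """
    pairs = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}
    n = len(seq)
    # dp[i][j] = maximum number of nested pairs within seq[i..j]
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            for k in range(i, j - min_loop):  # case 2: base j pairs with k
                if (seq[k], seq[j]) in pairs:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best

    # Traceback: recover one optimal structure in dot-bracket notation.
    structure = ["."] * n
    stack = [(0, n - 1)]
    while stack:
        i, j = stack.pop()
        if j - i <= min_loop:
            continue  # too short to contain any pair
        if dp[i][j] == dp[i][j - 1]:
            stack.append((i, j - 1))
            continue
        for k in range(i, j - min_loop):
            if (seq[k], seq[j]) in pairs:
                left = dp[i][k - 1] if k > i else 0
                if dp[i][j] == left + 1 + dp[k + 1][j - 1]:
                    structure[k], structure[j] = "(", ")"
                    if k > i:
                        stack.append((i, k - 1))
                    stack.append((k + 1, j - 1))
                    break
    max_pairs = dp[0][n - 1] if n else 0
    return max_pairs, "".join(structure)


# A short hairpin-forming sequence: three G-C pairs closing a 4-base loop.
count, fold = nussinov("GGGAAATCCC")
```

The recurrence runs in O(n^3) time and O(n^2) memory and counts pairs only; it does not model stacking energies, so it is best read as a teaching sketch rather than a realistic folding predictor.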

Direct Detection and Sequencing of Damaged DNA Bases

PubMed Central

2011-01-01

Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications. PMID:22185597
Direct detection and sequencing of damaged DNA bases.

PubMed

Clark, Tyson A; Spittle, Kristi E; Turner, Stephen W; Korlach, Jonas

2011-12-20

Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications.
Next-generation transcriptome sequencing, SNP discovery and validation in four market classes of peanut, Arachis hypogaea L.

PubMed

Chopra, Ratan; Burow, Gloria; Farmer, Andrew; Mudge, Joann; Simpson, Charles E; Wilkins, Thea A; Baring, Michael R; Puppala, Naveen; Chamberlin, Kelly D; Burow, Mark D

2015-06-01

Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. Copy DNA (cDNA) libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, comprising 43,108 contigs; 263,840 SNP and indel variants were identified among the four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
Short report: Monitoring ESR1 mutations by circulating tumor DNA in aromatase inhibitor resistant metastatic breast cancer.

PubMed

Sefrioui, David; Perdrix, Anne; Sarafan-Vasseur, Nasrin; Dolfus, Claire; Dujon, Antoine; Picquenot, Jean-Michel; Delacour, Julien; Cornic, Marie; Bohers, Elodie; Leheurteur, Marianne; Rigal, Olivier; Tennevet, Isabelle; Thery, Jean-Christophe; Alexandru, Cristina; Guillemet, Cécile; Moldovan, Cristian; Veyret, Corinne; Frebourg, Thierry; Di Fiore, Frédéric; Clatot, Florian

2015-11-15

Acquired estrogen receptor gene (ESR1) mutations have been recently reported as a marker of resistance to aromatase inhibitors in hormone receptor positive metastatic breast cancer. We retrospectively considered seven patients treated for metastatic breast cancer with available samples from the primary tumor before any treatment, cryopreserved metastasis removed during progression and concomitant plasma samples. All seven patients were in disease progression after previous exposure to aromatase inhibitors for at least 6 months, and were assessed for ESR1 mutation detection in tumor and circulating DNA. For these patients, Sanger sequencing identified four metastases with clear ESR1 mutation and one possible, whereas digital PCR identified six mutated metastases. Then, under blind conditions and using digital PCR, corresponding circulating ESR1 mutations were successfully detected in four of these six metastatic breast cancer patients. Moreover, in two patients with serial blood samples following treatment exposure, the monitoring of circulating ESR1 mutations clearly predicted disease evolution. In the context of high interest for ESR1 mutations, our results highlight that these acquired recurrent mutations may be tracked in circulating tumor DNA and may be of clinical relevance for metastatic breast cancer patient monitoring. © 2015 UICC.
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1987-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3575113
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1990-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2333227
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1988-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3368330
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1989-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2654889
NOVEL EPIGENETIC CHANGES IN CDKN2A ARE ASSOCIATED WITH PROGRESSION OF CERVICAL INTRAEPITHELIAL NEOPLASIA

PubMed Central

Wijetunga, N. Ari; Belbin, Thomas J.; Burk, Robert D.; Whitney, Kathleen; Abadi, Maria; Greally, John M.; Einstein, Mark H.; Schlecht, Nicolas F.

2016-01-01

Objective To conduct a comprehensive mapping of the genomic DNA methylation in CDKN2A, which codes for the p16INK4A and p14ARF proteins, and 14 of the most promising DNA methylation marker candidates previously reported to be associated with progression of low-grade cervical intraepithelial neoplasia (CIN1) to cervical cancer. Methods We analyzed DNA methylation in 68 HIV-seropositive and negative women with incident CIN1, CIN2, CIN3 and invasive cervical cancer, assaying 120 CpG dinucleotide sites spanning APC, CDH1, CDH13, CDKN2A, CDKN2B, DAPK1, FHIT, GSTP1, HIC1, MGMT, MLH1, RARB, RASSF1, TERT and TIMP3 using the Illumina Infinium array. Validation was performed using high resolution mapping of the target genes with HELP-tagging for 286 CpGs, followed by fine mapping of candidate genes with targeted bisulfite sequencing. We assessed for statistical differences in DNA methylation levels for each CpG locus assayed using univariate and multivariate methods correcting for multiple comparisons. Results In our discovery sample set, we identified dose-dependent differences in DNA methylation with grade of disease in CDKN2A, APC, MGMT, MLH1 and HIC1, whereas single CpG locus differences between CIN2/3 and cancer groups were seen for CDH13, DAPK1 and TERT. Only those CpGs in the gene body of CDKN2A showed a monotonic increase in methylation between persistent CIN1, CIN2, CIN3 and cancers. Conclusion Our data suggest a novel link between early cervical disease progression and DNA methylation in a region downstream of the CDKN2A transcription start site that may lead to increased p16INK4A/p14ARF expression prior to development of malignant disease. PMID:27401842
Novel epigenetic changes in CDKN2A are associated with progression of cervical intraepithelial neoplasia.

PubMed

Wijetunga, N Ari; Belbin, Thomas J; Burk, Robert D; Whitney, Kathleen; Abadi, Maria; Greally, John M; Einstein, Mark H; Schlecht, Nicolas F

2016-09-01

To conduct a comprehensive mapping of the genomic DNA methylation in CDKN2A, which codes for the p16(INK4A) and p14(ARF) proteins, and 14 of the most promising DNA methylation marker candidates previously reported to be associated with progression of low-grade cervical intraepithelial neoplasia (CIN1) to cervical cancer. We analyzed DNA methylation in 68 HIV-seropositive and negative women with incident CIN1, CIN2, CIN3 and invasive cervical cancer, assaying 120 CpG dinucleotide sites spanning APC, CDH1, CDH13, CDKN2A, CDKN2B, DAPK1, FHIT, GSTP1, HIC1, MGMT, MLH1, RARB, RASSF1, TERT and TIMP3 using the Illumina Infinium array. Validation was performed using high resolution mapping of the target genes with HELP-tagging for 286 CpGs, followed by fine mapping of candidate genes with targeted bisulfite sequencing. We assessed for statistical differences in DNA methylation levels for each CpG locus assayed using univariate and multivariate methods correcting for multiple comparisons. In our discovery sample set, we identified dose-dependent differences in DNA methylation with grade of disease in CDKN2A, APC, MGMT, MLH1 and HIC1, whereas single CpG locus differences between CIN2/3 and cancer groups were seen for CDH13, DAPK1 and TERT. Only those CpGs in the gene body of CDKN2A showed a monotonic increase in methylation between persistent CIN1, CIN2, CIN3 and cancers. Our data suggest a novel link between early cervical disease progression and DNA methylation in a region downstream of the CDKN2A transcription start site that may lead to increased p16(INK4A)/p14(ARF) expression prior to development of malignant disease. Copyright © 2016 Elsevier Inc. All rights reserved.
Kilo-sequencing: an ordered strategy for rapid DNA sequence data acquisition.

PubMed Central

Barnes, W M; Bevan, M

1983-01-01

A strategy for rapid DNA sequence acquisition in an ordered, nonrandom manner, while retaining all of the conveniences of the dideoxy method with M13 transducing phage DNA template, is described. Target DNA 3 to 14 kb in size can be stably carried by our M13 vectors. Suitable targets are stretches of DNA which lack an enzyme recognition site which is unique on our cloning vectors and adjacent to the sequencing primer; current sites that are so useful when lacking are Pst, Xba, HindIII, BglII, EcoRI. By an in vitro procedure, we cut RF DNA once randomly and once specifically, to create thousands of deletions which start at the unique restriction site adjacent to the dideoxy sequencing primer and extend various distances across the target DNA. Phage carrying a desired size of deletions, whose DNA as template will give rise to DNA sequence data in a desired location along the target DNA, may be purified by electrophoresis alive on agarose gels. Phage running in the same location on the agarose gel thus conveniently give rise to nucleotide sequence data from the same kilobase of target DNA. Images PMID:6298723
LncRNA Structural Characteristics in Epigenetic Regulation

PubMed Central

Wang, Chenguang; Wang, Lianzong; Ding, Yu; Lu, Xiaoyan; Zhang, Guosi; Yang, Jiaxin; Zheng, Hewei; Wang, Hong; Jiang, Yongshuai; Xu, Liangde

2017-01-01

The rapid development of new generation sequencing technology has deepened the understanding of genomes and functional products. RNA-sequencing studies in mammals show that approximately 85% of the DNA sequences have RNA products, of which those longer than 200 nucleotides (nt) are called long non-coding RNAs (lncRNAs). LncRNAs have now been shown to play important epigenetic regulatory roles in key molecular processes, such as gene expression, genetic imprinting, histone modification, chromatin dynamics, and other activities, by forming specific structures and interacting with all kinds of molecules. This paper mainly discusses the correlation between the structure and function of lncRNAs in light of recent progress in epigenetic regulation, which is important to the understanding of the mechanism of lncRNAs in physiological and pathological processes. PMID:29292750
Trinucleotide repeat length and progression of illness in Huntington's disease.

PubMed

Kieburtz, K; MacDonald, M; Shih, C; Feigin, A; Steinberg, K; Bordwell, K; Zimmerman, C; Srinidhi, J; Sotack, J; Gusella, J

1994-11-01

The genetic defect causing Huntington's disease (HD) has been identified as an unstable expansion of a trinucleotide (CAG) repeat sequence within the coding region of the IT15 gene on chromosome 4. In 50 patients with manifest HD who were evaluated prospectively and uniformly, we examined the relationship between the extent of the DNA expansion and the rate of illness progression. Although the length of CAG repeats showed a strong inverse correlation with the age at onset of HD, there was no such relationship between the number of CAG repeats and the rate of clinical decline. These findings suggest that the CAG repeat length may influence or trigger the onset of HD, but other genetic, neurobiological, or environmental factors contribute to the progression of illness and the underlying pace of neuronal degeneration.
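
The repeat length discussed in this record can be illustrated in silico. The following is a minimal sketch, not the sizing method used in the study (which is not described here): it counts the units in the longest uninterrupted CAG tract of a sequence string. The function name and the example sequence are hypothetical.

```python
import re

def longest_cag_run(seq: str) -> int:
    """Return the number of CAG units in the longest uninterrupted CAG tract."""
    # Each match is a maximal run of back-to-back CAG triplets.
    runs = re.findall(r"(?:CAG)+", seq.upper())
    return max((len(run) // 3 for run in runs), default=0)

# Hypothetical example: five pure CAG units, then an interrupted CAA-CAG.
example = "TTC" + "CAG" * 5 + "CAACAG" + "CCG"
print(longest_cag_run(example))  # prints 5
```

Counting only uninterrupted units is one possible convention; other conventions would treat interrupting triplets differently.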
Silicene nanoribbon as a new DNA sequencing device

NASA Astrophysics Data System (ADS)

Alesheikh, Sara; Shahtahmassebi, Nasser; Roknabadi, Mahmood Rezaee; Pilevar Shahri, Raheleh

2018-02-01

The importance of DNA sequencing across different fields has driven the search for fast and inexpensive methods. Nanotechnology contributes to this development by introducing nanostructures that can be used for DNA sequencing. In this work we study the interaction between a zigzag silicene nanoribbon and DNA nucleobases using DFT and the non-equilibrium Green's function approach, to investigate the possibility of using zigzag silicene nanoribbons as biosensors for DNA sequencing.
Recent advances in peptide nucleic acid for cancer bionanotechnology.

PubMed

Wu, Jun-Chen; Meng, Qing-Chun; Ren, Hong-Mei; Wang, Hong-Tao; Wu, Jie; Wang, Qi

2017-06-01

Peptide nucleic acid (PNA) is an oligomer, in which the phosphate backbone has been replaced by a pseudopeptide backbone that is meant to mimic DNA. Peptide nucleic acids are of the utmost importance in the biomedical field because of their ability to hybridize with neutral nucleic acids and their special chemical and biological properties. In recent years, PNAs have emerged in nanobiotechnology for cancer diagnosis and therapy due to their high affinity and sequence selectivity toward corresponding DNA and RNA. In this review, we summarize the recent progress that has been made in cancer detection and therapy with PNA biotechnology. In addition, we emphasize nanoparticle PNA-based strategies for the efficient delivery of drugs in anticancer therapies.
Isolation and characterization of target sequences of the chicken CdxA homeobox gene.

PubMed Central

Margalit, Y; Yarus, S; Shapira, E; Gruenbaum, Y; Fainsod, A

1993-01-01

The DNA binding specificity of the chicken homeodomain protein CDXA was studied. Using a CDXA-glutathione-S-transferase fusion protein, DNA fragments containing the binding site for this protein were isolated. The sources of DNA were oligonucleotides with random sequence and chicken genomic DNA. The DNA fragments isolated were sequenced and tested in DNA binding assays. Sequencing revealed that most DNA fragments are AT rich which is a common feature of homeodomain binding sites. By electrophoretic mobility shift assays it was shown that the different target sequences isolated bind to the CDXA protein with different affinities. The specific sequences bound by the CDXA protein in the genomic fragments isolated, were determined by DNase I footprinting. From the footprinted sequences, the CDXA consensus binding site was determined. The CDXA protein binds the consensus sequence A, A/T, T, A/T, A, T, A/G. The CAUDAL binding site in the ftz promoter is also included in this consensus sequence. When tested, some of the genomic target sequences were capable of enhancing the transcriptional activity of reporter plasmids when introduced into CDXA expressing cells. This study determined the DNA sequence specificity of the CDXA protein and it also shows that this protein can further activate transcription in cells in culture. Images PMID:7909943
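
The consensus reported in this record (A, A/T, T, A/T, A, T, A/G) maps directly onto a character-class pattern. The sketch below is an illustration only: it scans the forward strand of a sequence for exact matches, whereas the study identified sites by binding assays and footprinting. The function and variable names are hypothetical.

```python
import re

# Consensus A, A/T, T, A/T, A, T, A/G written as a regular expression.
# The lookahead wrapper lets overlapping sites be reported as well.
CDXA_CONSENSUS = re.compile(r"(?=(A[AT]T[AT]AT[AG]))")

def find_consensus_sites(seq: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched 7-mer) for each forward-strand match."""
    return [(m.start(), m.group(1)) for m in CDXA_CONSENSUS.finditer(seq.upper())]

print(find_consensus_sites("GGATTTATGCC"))  # [(2, 'ATTTATG')]
```

An exact-match scan ignores the differing affinities that the abstract reports for individual target sequences; a position weight matrix would be needed to model those.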
Sequence periodicity in nucleosomal DNA and intrinsic curvature.

PubMed

Nair, T Murlidharan

2010-05-17

Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy the more rigid the DNA helix, thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understanding their sequence-dependent structures. Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequences from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. The higher order structures associated with nucleosomes of C. elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results point to the possibility that context-dependent curvature of varying degrees is associated with nucleosomal DNA.
Assessing the Fidelity of Ancient DNA Sequences Amplified From Nuclear Genes

PubMed Central

Binladen, Jonas; Wiuf, Carsten; Gilbert, M. Thomas P.; Bunce, Michael; Barnett, Ross; Larson, Greger; Greenwood, Alex D.; Haile, James; Ho, Simon Y. W.; Hansen, Anders J.; Willerslev, Eske

2006-01-01

To date, the field of ancient DNA has relied almost exclusively on mitochondrial DNA (mtDNA) sequences. However, a number of recent studies have reported the successful recovery of ancient nuclear DNA (nuDNA) sequences, thereby allowing the characterization of genetic loci directly involved in phenotypic traits of extinct taxa. It is well documented that postmortem damage in ancient mtDNA can lead to the generation of artifactual sequences. However, as yet no one has thoroughly investigated the damage spectrum in ancient nuDNA. By comparing clone sequences from 23 fossil specimens, recovered from environments ranging from permafrost to desert, we demonstrate the presence of miscoding lesion damage in both the mtDNA and nuDNA, resulting in insertion of erroneous bases during amplification. Interestingly, no significant differences in the frequency of miscoding lesion damage are recorded between mtDNA and nuDNA despite great differences in cellular copy numbers. For both mtDNA and nuDNA, we find significant positive correlations between total sequence heterogeneity and the rates of type 1 transitions (adenine → guanine and thymine → cytosine) and type 2 transitions (cytosine → thymine and guanine → adenine), respectively. Type 2 transitions are by far the most dominant and increase relative to those of type 1 with damage load. The results suggest that the deamination of cytosine (and 5-methyl cytosine) to uracil (and thymine) is the main cause of miscoding lesions in both ancient mtDNA and nuDNA sequences. We argue that the problems presented by postmortem damage, as well as problems with contamination from exogenous sources of conserved nuclear genes, allelic variation, and the reliance on single nucleotide polymorphisms, call for great caution in studies relying on ancient nuDNA sequences. PMID:16299392
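
The type 1 and type 2 transition classes defined in this record can be tallied by comparing a clone sequence against a reference, position by position. This is a minimal sketch assuming two pre-aligned, equal-length sequences without gaps; the function names and the example sequences are hypothetical and do not reproduce the authors' analysis.

```python
from collections import Counter

# Transition classes as defined in the abstract (reference base -> observed base).
TYPE1 = {("A", "G"), ("T", "C")}
TYPE2 = {("C", "T"), ("G", "A")}

def classify_substitution(ref: str, obs: str) -> str:
    """Label one aligned position as match, type 1, type 2, or other."""
    pair = (ref.upper(), obs.upper())
    if pair[0] == pair[1]:
        return "match"
    if pair in TYPE1:
        return "type 1 transition"
    if pair in TYPE2:
        return "type 2 transition"
    return "other"  # transversions and ambiguous bases

def tally_damage(reference: str, clone: str) -> Counter:
    """Count substitution classes across two aligned, equal-length sequences."""
    if len(reference) != len(clone):
        raise ValueError("sequences must be aligned to the same length")
    return Counter(classify_substitution(r, o) for r, o in zip(reference, clone))

counts = tally_damage("ACGTACGT", "GTATACGC")
print(counts["type 1 transition"], counts["type 2 transition"])  # prints 2 2
```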
[Current applications of high-throughput DNA sequencing technology in antibody drug research].

PubMed

Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong

2012-03-01

Since the publication in 2005 of a high-throughput DNA sequencing technology based on PCR carried out in oil emulsions, high-throughput DNA sequencing platforms have evolved into a robust technology for sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation for discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we present a review of current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach to the discovery and development of antibody drugs.
DNA fingerprinting, DNA barcoding, and next generation sequencing technology in plants.

PubMed

Sucher, Nikolaus J; Hennell, James R; Carles, Maria C

2012-01-01

DNA fingerprinting of plants has become an invaluable tool in forensic, scientific, and industrial laboratories all over the world. PCR has become part of virtually every variation of the plethora of approaches used for DNA fingerprinting today. DNA sequencing is increasingly used either in combination with or as a replacement for traditional DNA fingerprinting techniques. A prime example is the use of short, standardized regions of the genome as taxon barcodes for biological identification of plants. Rapid advances in "next generation sequencing" (NGS) technology are driving down the cost of sequencing and bringing large-scale sequencing projects into the reach of individual investigators. We present an overview of recent publications that demonstrate the use of "NGS" technology for DNA fingerprinting and DNA barcoding applications.

Mammalian DNA enriched for replication origins is enriched for snap-back sequences.

PubMed

Zannis-Hadjopoulos, M; Kaufmann, G; Martin, R G

1984-11-15

Using the instability of replication loops as a method for the isolation of double-stranded nascent DNA, extruded DNA enriched for replication origins was obtained and denatured. Snap-back DNA, single-stranded DNA with inverted repeats (palindromic sequences), reassociates rapidly into stem-loop structures with zero-order kinetics when conditions are changed from denaturing to renaturing, and can be assayed by chromatography on hydroxyapatite. Origin-enriched nascent DNA strands from mouse, rat and monkey cells growing either synchronously or asynchronously were purified and assayed for the presence of snap-back sequences. The results show that origin-enriched DNA is also enriched for snap-back sequences, implying that some origins for mammalian DNA replication contain or lie near palindromic sequences.
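
A snap-back sequence, as described in this record, contains an inverted repeat: a stretch followed downstream by its own reverse complement, so that a single strand can fold into a stem-loop. The sketch below shows one simple way to look for such repeats in a sequence string. It is purely illustrative (the study assayed snap-back DNA by hydroxyapatite chromatography, not computationally), and the stem length and maximum loop size are arbitrary assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def find_inverted_repeats(seq: str, stem: int = 6, max_loop: int = 20):
    """Return (left_start, right_start, left_arm) for exact inverted repeats.

    `stem` and `max_loop` are illustrative defaults, not values from the study.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * stem + 1):
        left = seq[i:i + stem]
        target = reverse_complement(left)
        for loop in range(max_loop + 1):
            j = i + stem + loop  # start of the candidate right arm
            if j + stem > len(seq):
                break
            if seq[j:j + stem] == target:
                hits.append((i, j, left))
    return hits

# Hypothetical example: a 6-base stem separated by a 4-base loop.
print(find_inverted_repeats("AAGACCTGTTTTCAGGTCAA"))  # [(2, 12, 'GACCTG')]
```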
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequences in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE PAGES

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio; ...

2016-03-09

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequences in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.
Specific minor groove solvation is a crucial determinant of DNA binding site recognition

PubMed Central

Harris, Lydia-Ann; Williams, Loren Dean; Koudelka, Gerald B.

2014-01-01

The DNA sequence preferences of nearly all sequence specific DNA binding proteins are influenced by the identities of bases that are not directly contacted by protein. Discrimination between non-contacted base sequences is commonly based on the differential abilities of DNA sequences to allow narrowing of the DNA minor groove. However, the factors that govern the propensity of minor groove narrowing are not completely understood. Here we show that the differential abilities of various DNA sequences to support formation of a highly ordered and stable minor groove solvation network are a key determinant of non-contacted base recognition by a sequence-specific binding protein. In addition, disrupting the solvent network in the non-contacted region of the binding site alters the protein's ability to recognize contacted base sequences at positions 5–6 bases away. This observation suggests that DNA solvent interactions link contacted and non-contacted base recognition by the protein. PMID:25429976
Pine Gene Discovery Project - Final Report - 08/31/1997 - 02/28/2001

DOE Office of Scientific and Technical Information (OSTI.GOV)

Whetten, R. W.; Sederoff, R. R.; Kinlaw, C.

2001-04-30

Integration of pines into the large scope of plant biology research depends on study of pines in parallel with study of annual plants, and on availability of research materials from pine to plant biologists interested in comparing pine with annual plant systems. The objectives of the Pine Gene Discovery Project were to obtain 10,000 partial DNA sequences of genes expressed in loblolly pine, to determine which of those pine genes were similar to known genes from other organisms, and to make the DNA sequences and isolated pine genes available to plant researchers to stimulate integration of pines into the wider scope of plant biology research. Those objectives have been completed, and the results are available to the public. Requests for pine genes have been received from a number of laboratories that would otherwise not have included pine in their research, indicating that progress is being made toward the goal of integrating pine research into the larger molecular biology research community.
Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma.

PubMed

Ceccarelli, Michele; Barthel, Floris P; Malta, Tathiane M; Sabedot, Thais S; Salama, Sofie R; Murray, Bradley A; Morozova, Olena; Newton, Yulia; Radenbaugh, Amie; Pagnotta, Stefano M; Anjum, Samreen; Wang, Jiguang; Manyam, Ganiraju; Zoppoli, Pietro; Ling, Shiyun; Rao, Arjun A; Grifford, Mia; Cherniack, Andrew D; Zhang, Hailei; Poisson, Laila; Carlotti, Carlos Gilberto; Tirapelli, Daniela Pretti da Cunha; Rao, Arvind; Mikkelsen, Tom; Lau, Ching C; Yung, W K Alfred; Rabadan, Raul; Huse, Jason; Brat, Daniel J; Lehman, Norman L; Barnholtz-Sloan, Jill S; Zheng, Siyuan; Hess, Kenneth; Rao, Ganesh; Meyerson, Matthew; Beroukhim, Rameen; Cooper, Lee; Akbani, Rehan; Wrensch, Margaret; Haussler, David; Aldape, Kenneth D; Laird, Peter W; Gutmann, David H; Noushmehr, Houtan; Iavarone, Antonio; Verhaak, Roel G W

2016-01-28

Therapy development for adult diffuse glioma is hindered by incomplete knowledge of somatic glioma driving alterations and suboptimal disease classification. We defined the complete set of genes associated with 1,122 diffuse grade II-III-IV gliomas from The Cancer Genome Atlas and used molecular profiles to improve disease classification, identify molecular correlations, and provide insights into the progression from low- to high-grade disease. Whole-genome sequencing data analysis determined that ATRX but not TERT promoter mutations are associated with increased telomere length. Recent advances in glioma classification based on IDH mutation and 1p/19q co-deletion status were recapitulated through analysis of DNA methylation profiles, which identified clinically relevant molecular subsets. A subtype of IDH mutant glioma was associated with DNA demethylation and poor outcome; a group of IDH-wild-type diffuse glioma showed molecular similarity to pilocytic astrocytoma and relatively favorable survival. Understanding of cohesive disease groups may aid improved clinical outcomes. Copyright © 2016 Elsevier Inc. All rights reserved.
On Statistical Modeling of Sequencing Noise in High Depth Data to Assess Tumor Evolution

NASA Astrophysics Data System (ADS)

Rabadan, Raul; Bhanot, Gyan; Marsilio, Sonia; Chiorazzi, Nicholas; Pasqualucci, Laura; Khiabanian, Hossein

2018-07-01

One cause of cancer mortality is tumor evolution to therapy-resistant disease. First line therapy often targets the dominant clone, and drug resistance can emerge from preexisting clones that gain fitness through therapy-induced natural selection. Such mutations may be identified using targeted sequencing assays by analysis of noise in high-depth data. Here, we develop a comprehensive, unbiased model for sequencing error background. We find that noise in sufficiently deep DNA sequencing data can be approximated by aggregating negative binomial distributions. Mutations with frequencies above noise may have prognostic value. We evaluate our model with simulated exponentially expanded populations as well as data from cell line and patient sample dilution experiments, demonstrating its utility in prognosticating tumor progression. Our results may have the potential to identify significant mutations that can cause recurrence. These results are relevant in the pretreatment clinical setting to determine appropriate therapy and prepare for potential recurrence pretreatment.
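
The idea of calling a mutation when its read count rises above a negative binomial noise background can be sketched as an upper-tail probability. This is a simplified illustration using a single negative binomial distribution, whereas the abstract describes aggregating several; the parameter values, function names, and the mean/size parameterization are assumptions, not values taken from the paper.

```python
import math

def nb_pmf(k: int, r: float, p: float) -> float:
    """Negative binomial pmf: P(X = k) with size r and success probability p."""
    log_pmf = (math.lgamma(k + r) - math.lgamma(r) - math.lgamma(k + 1)
               + r * math.log(p) + k * math.log1p(-p))
    return math.exp(log_pmf)

def nb_upper_tail(k: int, r: float, p: float) -> float:
    """P(X >= k): chance that noise alone yields at least k variant reads."""
    return max(0.0, 1.0 - sum(nb_pmf(i, r, p) for i in range(k)))

def params_from_mean(mean: float, size: float) -> tuple[float, float]:
    """Convert a (mean, size) description of the noise into (r, p)."""
    return size, size / (size + mean)

# Hypothetical noise model: about 10 erroneous reads expected at this depth.
r, p = params_from_mean(mean=10.0, size=5.0)
for observed in (12, 30, 60):
    print(observed, nb_upper_tail(observed, r, p))
```

A small tail probability suggests that the observed count is unlikely to be background error alone; in practice the noise parameters would have to be estimated from control data.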
Rare Compound Heterozygous Frameshift Mutations in ALMS1 Gene Identified Through Exome Sequencing in a Taiwanese Patient With Alström Syndrome.

PubMed

Tsai, Meng-Che; Yu, Hui-Wen; Liu, Tsunglin; Chou, Yen-Yin; Chiou, Yuan-Yow; Chen, Peng-Chieh

2018-01-01

Alström syndrome (AS) is a rare autosomal recessive disorder that shares clinical features with other ciliopathy-related diseases. Genetic mutation analysis is often required in making a differential diagnosis but is usually costly in time and effort when conventional Sanger sequencing is used. Herein we describe a Taiwanese patient presenting with cone-rod dystrophy and early-onset obesity that progressed to diabetes mellitus with marked insulin resistance during adolescence. Whole exome sequencing of the patient's genomic DNA identified a novel frameshift mutation in exon 15 (c.10290_10291delTA, p.Lys3431Serfs*10) and a rare mutation in exon 16 (c.10823_10824delAG, p.Arg3609Alafs*6) of the ALMS1 gene. The compound heterozygous mutations were predicted to render truncated proteins. This report highlighted the clinical utility of exome sequencing and extended the knowledge of the mutation spectrum in AS patients.
On Statistical Modeling of Sequencing Noise in High Depth Data to Assess Tumor Evolution

NASA Astrophysics Data System (ADS)

Rabadan, Raul; Bhanot, Gyan; Marsilio, Sonia; Chiorazzi, Nicholas; Pasqualucci, Laura; Khiabanian, Hossein

2017-12-01

One cause of cancer mortality is tumor evolution to therapy-resistant disease. First line therapy often targets the dominant clone, and drug resistance can emerge from preexisting clones that gain fitness through therapy-induced natural selection. Such mutations may be identified using targeted sequencing assays by analysis of noise in high-depth data. Here, we develop a comprehensive, unbiased model for sequencing error background. We find that noise in sufficiently deep DNA sequencing data can be approximated by aggregating negative binomial distributions. Mutations with frequencies above noise may have prognostic value. We evaluate our model with simulated exponentially expanded populations as well as data from cell line and patient sample dilution experiments, demonstrating its utility in prognosticating tumor progression. Our results may have the potential to identify significant mutations that can cause recurrence. These results are relevant in the pretreatment clinical setting to determine appropriate therapy and prepare for potential recurrence pretreatment.
A Method for Preparing DNA Sequencing Templates Using a DNA-Binding Microplate

PubMed Central

Yang, Yu; Hebron, Haroun R.; Hang, Jun

2009-01-01

A DNA-binding matrix was immobilized on the surface of a 96-well microplate and used for plasmid DNA preparation for DNA sequencing. The same DNA-binding plate was used for bacterial growth, cell lysis, DNA purification, and storage. In a single step using one buffer, bacterial cells were lysed by enzymes, and released DNA was captured on the plate simultaneously. After two wash steps, DNA was eluted and stored in the same plate. Inclusion of phosphates in the culture medium was found to enhance the yield of plasmid significantly. Purified DNA samples were used successfully in DNA sequencing with high consistency and reproducibility. Eleven vectors and nine libraries were tested using this method. In 10 μl sequencing reactions using 3 μl sample and 0.25 μl BigDye Terminator v3.1, the results from a 3730xl sequencer gave a success rate of 90–95% and read-lengths of 700 bases or more. The method is fully automatable and convenient for manual operation as well. It enables reproducible, high-throughput, rapid production of DNA with purity and yields sufficient for high-quality DNA sequencing at a substantially reduced cost. PMID:19568455
Dendritic Cell-Based Immunotherapy of Breast Cancer: Modulation by CpG DNA

DTIC Science & Technology

2005-09-01

tumor-associated antigens and bacterial DNA oligodeoxynucleotides containing unmethylated CpG sequences (CpG DNA) further augment the immune priming...associated antigens by cytotoxic T lymphocytes, and bacterial DNA oligodeoxynucleotides containing unmethylated CpG sequences (CpG DNA) can further...further amplify their immunostimulatory capacity and bacterial DNA oligodeoxynucleotides (ODN) containing unmethylated CpG sequences (CpG DNA) provide such
A rapid and cost-effective method for sequencing pooled cDNA clones by using a combination of transposon insertion and Gateway technology.

PubMed

Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide

2011-09-01

Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.
Biological sequence compression algorithms.

PubMed

Matsumoto, T; Sadakane, K; Imai, H

2000-01-01

Today, more and more DNA sequences are becoming available. Information about DNA sequences is stored in molecular biology databases. The size and importance of these databases will continue to grow, so this information must be stored and communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. Standard compression algorithms such as gzip or compress cannot compress DNA sequences; they only expand them in size. On the other hand, CTW (the Context Tree Weighting method) can compress DNA sequences to less than two bits per symbol. These algorithms do not use the special structures of biological sequences. Two characteristic structures of DNA sequences are known: one is palindromes, or reverse complements, and the other is approximate repeats. Several algorithms specific to DNA sequences that use these structures can compress them to less than two bits per symbol. In this paper, we improve CTW so that it can exploit the characteristic structures of DNA sequences. Before encoding the next symbol, the algorithm searches for an approximate repeat and a palindrome using hashing and dynamic programming. If there is a palindrome or an approximate repeat of sufficient length, our algorithm represents it by its length and distance. With this preprocessing, the new program achieves a slightly higher compression ratio than existing DNA-oriented compression algorithms. We also describe a new compression algorithm for protein sequences.
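
The "two bits per symbol" figure in this record is the cost of the trivial fixed-length encoding of a four-letter alphabet, which is the baseline that DNA-specific compressors try to beat. The sketch below shows only that baseline (it is not CTW and not the authors' algorithm); the function names are hypothetical, and the input is assumed to contain only A, C, G and T.

```python
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
BASES = "ACGT"

def pack_2bit(seq: str) -> tuple[int, bytes]:
    """Pack a DNA string into 2 bits per base; returns (base count, bytes)."""
    bits = 0
    for base in seq.upper():
        bits = (bits << 2) | CODE[base]  # raises KeyError on non-ACGT input
    n_bytes = (2 * len(seq) + 7) // 8
    return len(seq), bits.to_bytes(n_bytes, "big")

def unpack_2bit(n: int, data: bytes) -> str:
    """Invert pack_2bit, given the original number of bases."""
    bits = int.from_bytes(data, "big")
    return "".join(BASES[(bits >> (2 * (n - 1 - i))) & 3] for i in range(n))

n, packed = pack_2bit("ACGTTGCAAC")
print(len(packed), unpack_2bit(n, packed))  # prints 3 ACGTTGCAAC
```

Any scheme that exploits repeats or reverse complements has to encode below this fixed cost to be worthwhile, which is why the abstract uses two bits per symbol as its reference point.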
Solid-State and Biological Nanopore for Real-Time Sensing of Single Chemical and Sequencing of DNA.

PubMed

Haque, Farzin; Li, Jinghong; Wu, Hai-Chen; Liang, Xing-Jie; Guo, Peixuan

2013-02-01

Sensitivity and specificity are two most important factors to take into account for molecule sensing, chemical detection and disease diagnosis. A perfect sensitivity is to reach the level where a single molecule can be detected. An ideal specificity is to reach the level where the substance can be detected in the presence of many contaminants. The rapidly progressing nanopore technology is approaching this threshold. A wide assortment of biomotors and cellular pores in living organisms perform diverse biological functions. The elegant design of these transportation machineries has inspired the development of single molecule detection based on modulations of the individual current blockage events. The dynamic growth of nanotechnology and nanobiotechnology has stimulated rapid advances in the study of nanopore based instrumentation over the last decade, and inspired great interest in sensing of single molecules including ions, nucleotides, enantiomers, drugs, and polymers such as PEG, RNA, DNA, and polypeptides. This sensing technology has been extended to medical diagnostics and third generation high throughput DNA sequencing. This review covers current nanopore detection platforms including both biological pores and solid state counterparts. Several biological nanopores have been studied over the years, but this review will focus on the three best characterized systems including α-hemolysin and MspA, both containing a smaller channel for the detection of single-strand DNA, as well as bacteriophage phi29 DNA packaging motor connector that contains a larger channel for the passing of double stranded DNA. The advantage and disadvantage of each system are compared; their current and potential applications in nanomedicine, biotechnology, and nanotechnology are discussed.
Solid-State and Biological Nanopore for Real-Time Sensing of Single Chemical and Sequencing of DNA

PubMed Central

Haque, Farzin; Li, Jinghong; Wu, Hai-Chen; Liang, Xing-Jie; Guo, Peixuan

2013-01-01

Sensitivity and specificity are the two most important factors to take into account for molecule sensing, chemical detection and disease diagnosis. Perfect sensitivity means reaching the level where a single molecule can be detected. Ideal specificity means reaching the level where the substance can be detected in the presence of many contaminants. The rapidly progressing nanopore technology is approaching this threshold. A wide assortment of biomotors and cellular pores in living organisms perform diverse biological functions. The elegant design of these transportation machineries has inspired the development of single molecule detection based on modulations of the individual current blockage events. The dynamic growth of nanotechnology and nanobiotechnology has stimulated rapid advances in the study of nanopore based instrumentation over the last decade, and inspired great interest in sensing of single molecules including ions, nucleotides, enantiomers, drugs, and polymers such as PEG, RNA, DNA, and polypeptides. This sensing technology has been extended to medical diagnostics and third generation high throughput DNA sequencing. This review covers current nanopore detection platforms including both biological pores and solid state counterparts. Several biological nanopores have been studied over the years, but this review will focus on the three best characterized systems: α-hemolysin and MspA, both containing a smaller channel for the detection of single-stranded DNA, and the bacteriophage phi29 DNA packaging motor connector, which contains a larger channel for the passage of double-stranded DNA. The advantages and disadvantages of each system are compared, and their current and potential applications in nanomedicine, biotechnology, and nanotechnology are discussed. PMID:23504223
Comparative genomic analysis of Genlisea (corkscrew plants—Lentibulariaceae) chloroplast genomes reveals an increasing loss of the ndh genes

PubMed Central

Silva, Saura R.; Michael, Todd P.; Meer, Elliott J.; Pinheiro, Daniel G.; Miranda, Vitor F. O.

2018-01-01

In the carnivorous plant family Lentibulariaceae, all three genome compartments (nuclear, chloroplast, and mitochondria) have some of the highest rates of nucleotide substitutions across angiosperms. While the genera Genlisea and Utricularia have the smallest known flowering plant nuclear genomes, the chloroplast genomes (cpDNA) are mostly structurally conserved except for deletion and/or pseudogenization of the NAD(P)H-dehydrogenase complex (ndh) genes known to be involved in stress conditions of low light or CO2 concentrations. In order to determine how the cpDNA are changing, and to better understand the evolutionary history within the Genlisea genus, we sequenced, assembled and analyzed complete cpDNA from six species (G. aurea, G. filiformis, G. pygmaea, G. repens, G. tuberosa and G. violacea) together with the publicly available G. margaretae cpDNA. In general, the cpDNA structure among the analyzed Genlisea species is highly similar. However, we found that the plastidial ndh genes underwent a progressive process of degradation similar to the other terrestrial Lentibulariaceae cpDNA analyzed to date, but in contrast to the aquatic species. Contrary to current thinking that the terrestrial environment is a more stressful environment and thus requiring the ndh genes, we provide evidence that in the Lentibulariaceae the terrestrial forms have progressive loss while the aquatic forms have the eleven plastidial ndh genes intact. Therefore, the Lentibulariaceae system provides an important opportunity to understand the evolutionary forces that govern the transition to an aquatic environment and may provide insight into how plants manage water stress at a genome scale. PMID:29293597
Comparative genomic analysis of Genlisea (corkscrew plants-Lentibulariaceae) chloroplast genomes reveals an increasing loss of the ndh genes.

PubMed

Silva, Saura R; Michael, Todd P; Meer, Elliott J; Pinheiro, Daniel G; Varani, Alessandro M; Miranda, Vitor F O

2018-01-01

In the carnivorous plant family Lentibulariaceae, all three genome compartments (nuclear, chloroplast, and mitochondria) have some of the highest rates of nucleotide substitutions across angiosperms. While the genera Genlisea and Utricularia have the smallest known flowering plant nuclear genomes, the chloroplast genomes (cpDNA) are mostly structurally conserved except for deletion and/or pseudogenization of the NAD(P)H-dehydrogenase complex (ndh) genes known to be involved in stress conditions of low light or CO2 concentrations. In order to determine how the cpDNA are changing, and to better understand the evolutionary history within the Genlisea genus, we sequenced, assembled and analyzed complete cpDNA from six species (G. aurea, G. filiformis, G. pygmaea, G. repens, G. tuberosa and G. violacea) together with the publicly available G. margaretae cpDNA. In general, the cpDNA structure among the analyzed Genlisea species is highly similar. However, we found that the plastidial ndh genes underwent a progressive process of degradation similar to the other terrestrial Lentibulariaceae cpDNA analyzed to date, but in contrast to the aquatic species. Contrary to current thinking that the terrestrial environment is a more stressful environment and thus requiring the ndh genes, we provide evidence that in the Lentibulariaceae the terrestrial forms have progressive loss while the aquatic forms have the eleven plastidial ndh genes intact. Therefore, the Lentibulariaceae system provides an important opportunity to understand the evolutionary forces that govern the transition to an aquatic environment and may provide insight into how plants manage water stress at a genome scale.
Detection of DNA Methylation by Whole-Genome Bisulfite Sequencing.

PubMed

Li, Qing; Hermanson, Peter J; Springer, Nathan M

2018-01-01

DNA methylation plays an important role in the regulation of the expression of transposons and genes. Various methods have been developed to assay DNA methylation levels. Bisulfite sequencing is considered to be the "gold standard" for single-base resolution measurement of DNA methylation levels. Coupled with next-generation sequencing, whole-genome bisulfite sequencing (WGBS) allows DNA methylation to be evaluated at a genome-wide scale. Here, we describe a protocol for WGBS in plant species with large genomes. This protocol has been successfully applied to assay genome-wide DNA methylation levels in maize and barley. It has also been successfully coupled with sequence capture technology to assay DNA methylation levels in a targeted set of genomic regions.
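As a rough illustration of how a methylation level is commonly derived from bisulfite reads, the sketch below assumes the usual convention that unmethylated cytosines are converted and read as T while methylated cytosines are still read as C, so the level at a site is C / (C + T). The function names and the `min_coverage` parameter are hypothetical and are not taken from the protocol in this record.

```python
def methylation_level(c_reads, t_reads, min_coverage=1):
    """Per-site methylation level: fraction of reads that still show C.

    Returns None when the site has fewer than `min_coverage` informative reads.
    """
    total = c_reads + t_reads
    if total < min_coverage:
        return None
    return c_reads / total


def weighted_region_level(sites):
    """Coverage-weighted level for a region: sum(C) / sum(C + T).

    `sites` is an iterable of (c_reads, t_reads) pairs, one per cytosine.
    Returns None when the region has no informative reads.
    """
    sites = list(sites)
    c_total = sum(c for c, _ in sites)
    all_total = sum(c + t for c, t in sites)
    if all_total == 0:
        return None
    return c_total / all_total
```

For example, a site covered by 8 C reads and 2 T reads would be scored as 80% methylated under this convention.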
Single-Molecule Electrical Random Resequencing of DNA and RNA

NASA Astrophysics Data System (ADS)

Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji

2012-07-01

Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
New strategy to address DNA-methyl transferase activity in ovarian cancer cell cultures by monitoring the formation of 5-methylcytosine using HPLC-UV.

PubMed

Iglesias González, T; Blanco-González, E; Montes-Bayón, M

2016-08-15

Methylation of mammalian genomic DNA is catalyzed by DNA methyltransferases (DNMTs). Aberrant expression and activity of these enzymes have been reported to play an important role in the initiation and progression of tumors and their response to chemotherapy. Therefore, there is great interest in developing strategies to detect human DNMT activity. We propose a simple, antibody-free, label-free and non-radioactive analytical strategy in which methyltransferase activity is measured through the determination of the 5-methylcytosine (5mC) content in DNA by a previously developed chromatographic method (HPLC-UV). For this aim, a correlation between the enzyme activity and the concentration of 5mC obtained by HPLC-UV is first established under optimized conditions using both un-methylated and hemi-methylated DNA substrates and the prokaryotic methyltransferase M.SssI as a model enzyme. The methylation yield in un-methylated known sequences (a 623bp PCR-amplicon) turned out to be quantitative (110%) in experiments conducted in vitro. Methylation of hemi-methylated and low-methylated sequences could also be detected with the proposed approach. The application of the methodology to the determination of DNMT activity in nuclear extracts from human ovarian cancer cells revealed the presence of matrix effects (also confirmed by standard additions) that hampered quantitative enzyme recovery. The results show the importance of adequate sample clean-up steps. Copyright © 2016. Published by Elsevier B.V.

Metastatic EML4-ALK fusion detected by circulating DNA genotyping in an EGFR-mutated NSCLC patient and successful management by adding ALK inhibitors: a case report.

PubMed

Liang, Wenhua; He, Qihua; Chen, Ying; Chuai, Shaokun; Yin, Weiqiang; Wang, Wei; Peng, Guilin; Zhou, Caicun; He, Jianxing

2016-02-05

Rebiopsy is highly recommended to identify the mechanism of acquired resistance to EGFR-TKIs in advanced lung cancer. Recent advances in multiplex genotyping based on circulating tumor DNA (ctDNA) provide a strong and non-invasive alternative for detection of the resistance mechanism. Here we report a multiple metastatic NSCLC patient who was detected to have pure EGFR 19 exon deletion (negative for EML4-ALK and ROS1 in both IHC-based and sequencing assay) in the primary lesion and responded to first-line and second-line EGFR-TKI treatments (erlotinib then HY-15772). At 8 months, most lesions remained well controlled except for the liver metastases which presented dramatic progression. Considering the high risk of bleeding in rebiopsy of hepatic lesions, we conducted a multiplex genomic profiling with ctDNA. Results reported coexistence of EGFR mutation and EML4-ALK gene translocation in plasma which heavily indicated that ALK was the primary reason for progression of the liver lesions. This deduction was supported by the repeated response to ALK inhibitors (crizotinib then AP26113) of the hepatic metastases. This is the first report of the existence of ALK rearrangement in metastatic lesions in an EGFR mutated patient. It highlighted the feasibility and advantages of using ctDNA multiplex genotyping in identifying the heterogeneity across lesions and the resistance mechanism of targeted treatments.
DNA/RNA hybrid substrates modulate the catalytic activity of purified AID.

PubMed

Abdouni, Hala S; King, Justin J; Ghorbani, Atefeh; Fifield, Heather; Berghuis, Lesley; Larijani, Mani

2018-01-01

Activation-induced cytidine deaminase (AID) converts cytidine to uridine at Immunoglobulin (Ig) loci, initiating somatic hypermutation and class switching of antibodies. In vitro, AID acts on single stranded DNA (ssDNA), but neither double-stranded DNA (dsDNA) oligonucleotides nor RNA, and it is believed that transcription is the in vivo generator of ssDNA targeted by AID. It is also known that the Ig loci, particularly the switch (S) regions targeted by AID are rich in transcription-generated DNA/RNA hybrids. Here, we examined the binding and catalytic behavior of purified AID on DNA/RNA hybrid substrates bearing either random sequences or GC-rich sequences simulating Ig S regions. If substrates were made up of a random sequence, AID preferred substrates composed entirely of DNA over DNA/RNA hybrids. In contrast, if substrates were composed of S region sequences, AID preferred to mutate DNA/RNA hybrids over substrates composed entirely of DNA. Accordingly, AID exhibited a significantly higher affinity for binding DNA/RNA hybrid substrates composed specifically of S region sequences, than any other substrates composed of DNA. Thus, in the absence of any other cellular processes or factors, AID itself favors binding and mutating DNA/RNA hybrids composed of S region sequences. AID:DNA/RNA complex formation and supporting mutational analyses suggest that recognition of DNA/RNA hybrids is an inherent structural property of AID. Copyright © 2017 Elsevier Ltd. All rights reserved.
Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.

PubMed

Schnitzler, P; Darai, G

1989-09-01

The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.
Long-range correlations and charge transport properties of DNA sequences

NASA Astrophysics Data System (ADS)

Liu, Xiao-liang; Ren, Yi; Xie, Qiong-tao; Deng, Chao-sheng; Xu, Hui

2010-04-01

By using Hurst's analysis and transfer approach, the rescaled range functions and Hurst exponents of human chromosome 22 and enterobacteria phage lambda DNA sequences are investigated and the transmission coefficients, Landauer resistances and Lyapunov coefficients of finite segments based on above genomic DNA sequences are calculated. In a comparison with quasiperiodic and random artificial DNA sequences, we find that λ-DNA exhibits anticorrelation behavior characterized by a Hurst exponent 0.5
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].

PubMed

Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y

2017-08-01

To analyze and detect the whole genome sequence of human mitochondrial DNA (mtDNA) on the Ion Torrent PGM™ platform and to study the differences in mtDNA sequence among different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costal cartilage, nail, skeletal muscle and oral epithelium. The whole mtDNA genome was amplified with 4 pairs of primers. Libraries were constructed with the Ion Shear™ Plus Reagents kit and the Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed on the Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions in the HVⅠ region. The whole mtDNA genome was amplified successfully from all samples. The six unrelated individuals belonged to 6 different haplotypes. Different tissues from one individual showed differences in heteroplasmy. The heteroplasmy positions and the mutation positions in the HVⅠ region were verified by Sanger sequencing. A consistency check by the Kappa method showed that the mtDNA sequencing results were highly consistent across tissues. The method used in the present study for sequencing the whole human mtDNA genome can detect heteroplasmy differences among tissues with good consistency. The results provide guidance for further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine
Sequence periodicity in nucleosomal DNA and intrinsic curvature

PubMed Central

2010-01-01

Background Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy, the more rigid the DNA helix; thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understanding their sequence-dependent structures. Results Higher order DNA structures and the distribution of molecular bend loci associated with 146-base nucleosome core DNA sequences from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. Conclusions The higher order structures associated with nucleosomes of C. elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results point to the possibility that context-dependent curvature of varying degrees is associated with nucleosomal DNA. PMID:20487515
A survey of the sequence-specific interaction of damaging agents with DNA: emphasis on antitumor agents.

PubMed

Murray, V

1999-01-01

This article reviews the literature concerning the sequence specificity of DNA-damaging agents. DNA-damaging agents are widely used in cancer chemotherapy. It is important to understand fully the determinants of DNA sequence specificity so that more effective DNA-damaging agents can be developed as antitumor drugs. There are five main methods of DNA sequence specificity analysis: cleavage of end-labeled fragments, linear amplification with Taq DNA polymerase, ligation-mediated polymerase chain reaction (PCR), single-strand ligation PCR, and footprinting. The DNA sequence specificity in purified DNA and in intact mammalian cells is reviewed for several classes of DNA-damaging agent. These include agents that form covalent adducts with DNA, free radical generators, topoisomerase inhibitors, intercalators and minor groove binders, enzymes, and electromagnetic radiation. The main sites of adduct formation are at the N-7 of guanine in the major groove of DNA and the N-3 of adenine in the minor groove, whereas free radical generators abstract hydrogen from the deoxyribose sugar and topoisomerase inhibitors cause enzyme-DNA cross-links to form. Several issues involved in the determination of the DNA sequence specificity are discussed. The future directions of the field, with respect to cancer chemotherapy, are also examined.
Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing

PubMed Central

Chandran, Anandhakumar; Syed, Junetha; Taylor, Rhys D.; Kashiwazaki, Gengo; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

2016-01-01

Chemically engineered small molecules targeting specific genomic sequences play an important role in drug development research. Pyrrole-imidazole polyamides (PIPs) are a group of molecules that can bind to the DNA minor-groove and can be engineered to target specific sequences. Their biological effects rely primarily on their selective DNA binding. However, the binding mechanism of PIPs at the chromatinized genome level is poorly understood. Herein, we report a method using high-throughput sequencing to identify the DNA-alkylating sites of PIP-indole-seco-CBI conjugates. High-throughput sequencing analysis of conjugate 2 showed highly similar DNA-alkylating sites on synthetic oligos (histone-free DNA) and on human genomes (chromatinized DNA context). To our knowledge, this is the first report identifying alkylation sites across genomic DNA by alkylating PIP conjugates using high-throughput sequencing. PMID:27098039
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

PubMed

Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly, which included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
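Two of the gap properties listed in this abstract, GC content and gap length, are simple to compute once a gap sequence has been recovered. The following is a small sketch under stated assumptions: the function names are hypothetical, ambiguous bases such as N are excluded from the GC denominator, and nothing here reproduces the study's actual pipeline.

```python
def gc_content(seq):
    """Fraction of G and C among unambiguous bases (A, C, G, T).

    Returns None if the sequence contains no unambiguous bases.
    """
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        return None
    return (seq.count("G") + seq.count("C")) / counted


def summarize_gap(seq):
    """Return the gap length and GC content for one gap sequence."""
    return {"length": len(seq), "gc": gc_content(seq)}
```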
Incidence of human papilloma virus in esophageal squamous cell carcinoma in patients from the Lublin region.

PubMed

Dąbrowski, Andrzej; Kwaśniewski, Wojciech; Skoczylas, Tomasz; Bednarek, Wiesława; Kuźma, Dorota; Goździcka-Józefiak, Anna

2012-10-28

To assess the prevalence of human papilloma virus (HPV) in esophageal squamous cell carcinoma (ESCC) in the south-eastern region of Poland. The study population consisted of 56 ESCC patients and 35 controls. The controls were patients referred to our department due to other non-esophageal and non-oncological disorders with no gross or microscopic esophageal pathology as confirmed by endoscopy and histopathology. In the ESCC patients, samples were taken from normal mucosa (56 mucosa samples) and from the tumor (56 tumor samples). Tissue samples from the controls were taken from normal mucosa of the middle esophagus (35 control samples). Quantitative determination of DNA was carried out using a spectrophotometric method. Genomic DNA was isolated using the QIAamp DNA Midi Kit. HPV infection was identified following PCR amplification of the HPV gene sequence, using primers MY09 and MY11 complementary to the genome sequence of at least 33 types of HPV. The sequencing results were computationally analyzed using the basic local alignment search tool database. In tumor samples, HPV DNA was identified in 28 of 56 patients (50%). High risk HPV phenotypes (16 or/and 18) were found in 5 of 56 patients (8.9%), low risk in 19 of 56 patients (33.9%) and other types of HPV (37, 81, 97, CP6108) in 4 of 56 patients (7.1%). In mucosa samples, HPV DNA was isolated in 21 of 56 patients (37.5%). High risk HPV DNA was confirmed in 3 of 56 patients (5.3%), low risk HPV DNA in 12 of 56 patients (21.4%), and other types of HPV in 6 of 56 patients (10.7%). In control samples, HPV DNA was identified in 4 of 35 patients (11.4%) with no high risk HPV. The occurrence of HPV in ESCC patients was significantly higher than in the controls [28 of 56 (50%) vs 4 of 35 (11.4%), P < 0.001]. 
In esophageal cancer patients, both in tumor and mucosa samples, the predominant HPV phenotypes were low risk HPV, isolated 4 times more frequently than high risk phenotypes [19 of 56 (33.9%) vs 5 of 56 (8.9%), P < 0.001]. A higher prevalence of HPV was identified in female patients (71.4% vs 46.9%). Accordingly, the high risk phenotypes were isolated more frequently in female patients and this difference reached statistical significance [3 of 7 (42.9%) vs 2 of 49 (4.1%), P < 0.05]. Of the pathological characteristics, only an infiltrative pattern of macroscopic tumor type significantly correlated with the presence of HPV DNA in ESCC samples [20 of 27 (74.1%) vs 8 of 29 (27.6%) for ulcerative or protruding macroscopic type, P < 0.05]. The occurrence of total HPV DNA and both HPV high or low risk phenotypes did not significantly differ with regard to particular grades of cellular differentiation, phases in depth of tumor infiltration, grades of nodal involvement and stages of tumor progression. Low risk HPV phenotypes could be one of the co-activators or/and co-carcinogens in complex, progressive, multifactorial and multistep esophageal carcinogenesis.
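The abstract does not say which statistical test produced its P values. As one way to check the main comparison (HPV DNA in 28 of 56 tumor samples versus 4 of 35 controls), the sketch below applies a two-sided Fisher's exact test to the 2×2 table. The choice of test and the function name are assumptions made for illustration.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # The small relative tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lowest, highest + 1)
               if prob(x) <= p_observed * (1 + 1e-9))


# HPV-positive / HPV-negative counts taken from the abstract:
# tumor samples 28 / 28, control samples 4 / 31.
p_value = fisher_exact_two_sided(28, 28, 4, 31)
```

The resulting `p_value` can be compared with the reported P < 0.001, bearing in mind that the authors may have used a different test.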
[Analysis of Musculoskeletal Systems and Their Diseases. The research for musculoskeletal disease using genome editing technology].

PubMed

Suzuki, Hidetsugu; Asahara, Hiroshi

2015-08-01

Genome editing is a genetic technology by which any DNA sequence can be inserted, replaced or deleted. Genome editing has been making rapid progress recently, with the development of new techniques such as ZFN, TALEN and CRISPR/Cas9. Genome editing can be applied to various fields ranging from the production of knockout animals to gene therapy. This section summarizes these new genome editing technologies and their applications.
Cumulative mtDNA damage and mutations contribute to the progressive loss of RGCs in a rat model of glaucoma

PubMed Central

Nickerson, John M.; Gao, Feng-juan; Sun, Zhongmou; Chen, Xin-ya; Zhang, Shu-jie; Gao, Feng; Chen, Jun-yi; Luo, Yi; Wang, Yan; Sun, Xing-huai

2015-01-01

Glaucoma is a chronic neurodegenerative disease characterized by the progressive loss of retinal ganglion cells (RGCs). Mitochondrial DNA (mtDNA) alterations have been documented as a key component of many neurodegenerative disorders. However, whether mtDNA alterations contribute to the progressive loss of RGCs and the mechanism whereby this phenomenon could occur are poorly understood. We investigated mtDNA alterations in RGCs using a rat model of chronic intraocular hypertension and explored the mechanisms underlying progressive RGC loss. We demonstrate that the mtDNA damage and mutations triggered by intraocular pressure (IOP) elevation are initiating, crucial events in a cascade leading to progressive RGC loss. Damage to and mutation of mtDNA, mitochondrial dysfunction, reduced levels of mtDNA repair/replication enzymes, and elevated reactive oxygen species form a positive feedback loop that produces irreversible mtDNA damage and mutation and contributes to progressive RGC loss, which occurs even after a return to normal IOP. Furthermore, we demonstrate that mtDNA damage and mutations increase the vulnerability of RGCs to elevated IOP and glutamate levels, which are among the most common glaucoma insults. This study suggests that therapeutic approaches that target mtDNA maintenance and repair and that promote energy production may prevent the progressive death of RGCs. PMID:25478814
In vitro manipulation of gene expression in larval Schistosoma: a model for postgenomic approaches in Trematoda

PubMed Central

YOSHINO, TIMOTHY P.; DINGUIRARD, NATHALIE; DE MORAES MOURÃO, MARINA

2013-01-01

SUMMARY With rapid developments in DNA and protein sequencing technologies, combined with powerful bioinformatics tools, a continued acceleration of gene identification in parasitic helminths is predicted, potentially leading to discovery of new drug and vaccine targets, enhanced diagnostics and insights into the complex biology underlying host-parasite interactions. For the schistosome blood flukes, with the recent completion of genome sequencing and comprehensive transcriptomic datasets, massive amounts of gene sequence data have accumulated, for which, in the vast majority of cases, little is known about actual functions within the intact organism. In this review we attempt to bring together traditional in vitro cultivation approaches and recent emergent technologies of molecular genomics, transcriptomics and genetic manipulation to illustrate the considerable progress made in our understanding of trematode gene expression and function during development of the intramolluscan larval stages. Using several prominent trematode families (Schistosomatidae, Fasciolidae, Echinostomatidae), we have focused on the current status of in vitro larval isolation/cultivation as a source of valuable raw material supporting gene discovery efforts in model digeneans that include whole genome sequencing, transcript and protein expression profiling during larval development, and progress made in the in vitro manipulation of genes and their expression in larval trematodes using transgenic and RNA interference (RNAi) approaches. PMID:19961646
Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors.

PubMed

Adalsteinsson, Viktor A; Ha, Gavin; Freeman, Samuel S; Choudhury, Atish D; Stover, Daniel G; Parsons, Heather A; Gydush, Gregory; Reed, Sarah C; Rotem, Denisse; Rhoades, Justin; Loginov, Denis; Livitz, Dimitri; Rosebrock, Daniel; Leshchiner, Ignaty; Kim, Jaegil; Stewart, Chip; Rosenberg, Mara; Francis, Joshua M; Zhang, Cheng-Zhong; Cohen, Ofir; Oh, Coyin; Ding, Huiming; Polak, Paz; Lloyd, Max; Mahmud, Sairah; Helvie, Karla; Merrill, Margaret S; Santiago, Rebecca A; O'Connor, Edward P; Jeong, Seong H; Leeson, Rachel; Barry, Rachel M; Kramkowski, Joseph F; Zhang, Zhenwei; Polacek, Laura; Lohr, Jens G; Schleicher, Molly; Lipscomb, Emily; Saltzman, Andrea; Oliver, Nelly M; Marini, Lori; Waks, Adrienne G; Harshman, Lauren C; Tolaney, Sara M; Van Allen, Eliezer M; Winer, Eric P; Lin, Nancy U; Nakabayashi, Mari; Taplin, Mary-Ellen; Johannessen, Cory M; Garraway, Levi A; Golub, Todd R; Boehm, Jesse S; Wagle, Nikhil; Getz, Gad; Love, J Christopher; Meyerson, Matthew

2017-11-06

Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.
An evolution based biosensor receptor DNA sequence generation algorithm.

PubMed

Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng

2010-01-01

A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis

PubMed Central

Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

2012-01-01

RDNAnalyzer is an innovative computer-based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate a DNA sequence, or users can upload sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various applications in sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides tools for sequence analyses routinely performed by biological scientists, such as DNA replication, reverse complement generation, transcription, translation, sequence-specific information (total number of nucleotide bases, ATGC base contents along with their respective percentages) and a sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides a user-friendly environment for sequence analysis. It is freely available. Availability http://www.cemb.edu.pk/sw.html Abbreviations RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language. PMID:23055611
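As a rough illustration of the kind of analysis described in this record, the following Python sketch implements the classic Nussinov recursion for counting nested base pairs, together with reverse complement generation and base content statistics. It is not RDNAnalyzer's code (the tool itself is written in C#); the function names and the `min_loop` parameter are assumptions introduced here, and only Watson-Crick A-T and G-C pairs are considered.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def base_content(seq):
    """Return {base: (count, percentage)} for A, T, G and C."""
    n = len(seq)
    return {b: (seq.count(b), 100.0 * seq.count(b) / n if n else 0.0) for b in "ATGC"}


def nussinov_max_pairs(seq, min_loop=3):
    """Maximum number of nested Watson-Crick pairs (classic Nussinov recursion).

    min_loop is an assumed minimum number of unpaired bases inside a hairpin loop.
    """
    can_pair = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}
    n = len(seq)
    if n == 0:
        return 0
    # dp[i][j] holds the best pair count for the subsequence seq[i..j]
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case: base j stays unpaired
            for k in range(i, j - min_loop):  # case: base j pairs with base k
                if (seq[k], seq[j]) in can_pair:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]
```

For example, `nussinov_max_pairs("GGGAAATCCC")` counts the three G-C pairs of a simple hairpin stem; recovering the pairing itself would require an additional traceback step, which is omitted here.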
Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes.

PubMed

Al-Attar, Sinan; Westra, Edze R; van der Oost, John; Brouns, Stan J J

2011-04-01

Many prokaryotes contain a recently discovered defense system against mobile genetic elements. This defense system contains a unique type of repetitive DNA stretches, termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs). CRISPRs consist of identical repeated DNA sequences (repeats), interspaced by highly variable sequences referred to as spacers. The spacers originate from either phages or plasmids and comprise the prokaryotes' 'immunological memory'. CRISPR-associated (cas) genes encode conserved proteins that together with CRISPRs make up the CRISPR/Cas system, responsible for defending the prokaryotic cell against invaders. CRISPR-mediated resistance has been proposed to involve three stages: (i) CRISPR-Adaptation, in which the invader DNA is encountered by the CRISPR/Cas machinery and an invader-derived short DNA fragment is incorporated in the CRISPR array; (ii) CRISPR-Expression, in which the CRISPR array is transcribed and the transcript is processed by Cas proteins; and (iii) CRISPR-Interference, in which the invaders' nucleic acid is recognized by complementarity to the crRNA and neutralized. An application of the CRISPR/Cas system is the immunization of industry-relevant prokaryotes (or eukaryotes) against mobile-genetic invasion. In addition, the high variability of the CRISPR spacer content can be exploited for phylogenetic and evolutionary studies. Despite impressive progress during the last couple of years, the elucidation of several fundamental details will be a major challenge in future research.
Structural and Thermodynamic Signatures of DNA Recognition by Mycobacterium tuberculosis DnaA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tsodikov, Oleg V.; Biswas, Tapan

An essential protein, DnaA, binds to 9-bp DNA sites within the origin of replication oriC. These binding events are prerequisite to forming an enigmatic nucleoprotein scaffold that initiates replication. The number, sequences, positions, and orientations of these short DNA sites, or DnaA boxes, within the oriCs of different bacteria vary considerably. To investigate features of DnaA boxes that are important for binding Mycobacterium tuberculosis DnaA (MtDnaA), we have determined the crystal structures of the DNA binding domain (DBD) of MtDnaA bound to a cognate MtDnaA-box (at 2.0 Å resolution) and to a consensus Escherichia coli DnaA-box (at 2.3 Å). These structures, complemented by calorimetric equilibrium binding studies of MtDnaA DBD in a series of DnaA-box variants, reveal the main determinants of DNA recognition and establish the [T/C][T/A][G/A]TCCACA sequence as a high-affinity MtDnaA-box. Bioinformatic and calorimetric analyses indicate that DnaA-box sequences in mycobacterial oriCs generally differ from the optimal binding sequence. This sequence variation occurs commonly at the first 2 bp, making an in vivo mycobacterial DnaA-box effectively a 7-mer and not a 9-mer. We demonstrate that the decrease in the affinity of these MtDnaA-box variants for MtDnaA DBD relative to that of the highest-affinity box TTGTCCACA is less than 10-fold. The understanding of DnaA-box recognition by MtDnaA and E. coli DnaA enables one to map DnaA-box sequences in the genomes of M. tuberculosis and other eubacteria.
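The mapping of DnaA-box sequences mentioned at the end of this abstract can be sketched as a motif scan. The Python example below translates the reported high-affinity pattern [T/C][T/A][G/A]TCCACA into a regular expression and searches both strands. It is only a minimal illustration: the function names are hypothetical, and an exact-match scan is an assumption that ignores the lower-affinity variants the abstract also discusses.

```python
import re

# High-affinity MtDnaA-box from the abstract: [T/C][T/A][G/A]TCCACA.
# The lookahead lets overlapping occurrences be reported as well.
BOX = re.compile(r"(?=([TC][TA][GA]TCCACA))")
BOX_LENGTH = 9


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_dnaa_boxes(seq):
    """Return sorted (start, strand, 9-mer) tuples; start is 0-based on the forward strand."""
    seq = seq.upper()
    n = len(seq)
    hits = [(m.start(), "+", m.group(1)) for m in BOX.finditer(seq)]
    for m in BOX.finditer(reverse_complement(seq)):
        # Convert the start on the reverse strand to a forward-strand coordinate.
        hits.append((n - m.start() - BOX_LENGTH, "-", m.group(1)))
    return sorted(hits)
```

For instance, `find_dnaa_boxes("AATTGTCCACAGG")` reports the highest-affinity box TTGTCCACA at position 2 on the forward strand.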
DNA barcode goes two-dimensions: DNA QR code web server.

PubMed

Liu, Chang; Shi, Linchun; Xu, Xiaolan; Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin

2012-01-01

The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, "DNA barcode" actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interest, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications.
TaxI: a software tool for DNA barcoding using distance methods

PubMed Central

Steinke, Dirk; Vences, Miguel; Salzburger, Walter; Meyer, Axel

2005-01-01

DNA barcoding is a promising approach to the diagnosis of biological diversity in which DNA sequences serve as the primary key for information retrieval. Most existing software for evolutionary analysis of DNA sequences was designed for phylogenetic analyses and, hence, those algorithms do not offer appropriate solutions for the rapid, but precise analyses needed for DNA barcoding, and are also unable to process the often large comparative datasets. We developed a flexible software tool for DNA taxonomy, named TaxI. This program calculates sequence divergences between a query sequence (taxon to be barcoded) and each sequence of a dataset of reference sequences defined by the user. Because the analysis is based on separate pairwise alignments, this software is also able to work with sequences characterized by multiple insertions and deletions that are difficult to align in large sequence sets (i.e. thousands of sequences) by multiple alignment algorithms because of computational restrictions. Here, we demonstrate the utility of this approach with two datasets of fish larvae and juveniles from Lake Constance and juvenile land snails under different models of sequence evolution. Sets of ribosomal 16S rRNA sequences, characterized by multiple indels, performed as well as or better than cox1 sequence sets in assigning sequences to species, demonstrating the suitability of rRNA genes for DNA barcoding. PMID:16214755
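The query-versus-reference comparison described above can be sketched in a few lines of Python. The example aligns the query to each reference separately with a basic Needleman-Wunsch global alignment and then computes an uncorrected p-distance over the aligned, gap-free columns. This is not TaxI's implementation: the scoring values, the function names, and the use of the uncorrected p-distance (rather than one of the evolutionary models the abstract mentions) are all assumptions made for illustration.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def p_distance(a, b):
    """Proportion of differing sites among aligned columns without gaps."""
    x, y = global_align(a, b)
    cols = [(p, q) for p, q in zip(x, y) if p != "-" and q != "-"]
    if not cols:
        return 1.0
    return sum(p != q for p, q in cols) / len(cols)


def closest_reference(query, references):
    """Return (distance, name) of the reference with the smallest divergence."""
    return min((p_distance(query, seq), name) for name, seq in references.items())
```

Because every query-reference pair is aligned on its own, indel-rich markers do not require a multiple alignment of the whole reference set, which is the design point the abstract emphasizes.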

Blueprints for green biotech: development and application of standards for plant synthetic biology.

PubMed

Patron, Nicola J

2016-06-15

Synthetic biology aims to apply engineering principles to the design and modification of biological systems and to the construction of biological parts and devices. The ability to programme cells by providing new instructions written in DNA is a foundational technology of the field. Large-scale de novo DNA synthesis has accelerated synthetic biology by offering custom-made molecules at ever decreasing costs. However, for large fragments and for experiments in which libraries of DNA sequences are assembled in different combinations, assembly in the laboratory is still desirable. Biological assembly standards allow DNA parts, even those from multiple laboratories and experiments, to be assembled together using the same reagents and protocols. The adoption of such standards for plant synthetic biology has been cohesive for the plant science community, facilitating the application of genome editing technologies to plant systems and streamlining progress in large-scale, multi-laboratory bioengineering projects. © 2016 The Author(s). published by Portland Press Limited on behalf of the Biochemical Society.
Single nucleotide polymorphisms of DNA repair genes as predictors of radioresponse.

PubMed

Parliament, Matthew B; Murray, David

2010-10-01

Radiation therapy is a key modality in the treatment of cancer. Substantial progress has been made in unraveling the molecular events which underpin the responses of malignant and surrounding normal tissues to ionizing radiation. An understanding of the genes involved in processes such as DNA double-strand break repair, DNA damage response, cell-cycle control, apoptosis, cellular antioxidant defenses, and cytokine production, has evolved toward examination of how genetic variants, most often, single nucleotide polymorphisms (SNPs), may influence interindividual radioresponse. Experimental approaches, such as candidate SNP-association studies, genome-wide association studies, and massively parallel sequencing are being proposed to address these questions. We present a focused review of the evidence supporting an association between SNPs in DNA repair genes and radioresponse in normal tissues and tumors. Although preliminary results indicate possible associations, there are methodological weaknesses in many of the studies, and independent validation of SNPs as biomarkers of radioresponse in much larger cohorts will likely require research cooperation through international consortia. Copyright © 2010 Elsevier Inc. All rights reserved.
DNA Sequencing

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1995-04-25

A method for sequencing a strand of DNA, including the steps of: providing the strand of DNA; annealing the strand with a primer able to hybridize to the strand to give an annealed mixture; incubating the mixture with four deoxyribonucleoside triphosphates, a DNA polymerase, and at least three deoxyribonucleoside triphosphates in different amounts, under conditions favoring primer extension to form nucleic acid fragments complementary to the DNA to be sequenced; labelling the nucleic acid fragments; separating them and determining the position of the deoxyribonucleoside triphosphates by differences in the intensity of the labels, thereby to determine the DNA sequence.
High-fidelity target sequencing of individual molecules identified using barcode sequences: de novo detection and absolute quantitation of mutations in plasma cell-free DNA from cancer patients.

PubMed

Kukita, Yoji; Matoba, Ryo; Uchida, Junji; Hamakawa, Takuya; Doki, Yuichiro; Imamura, Fumio; Kato, Kikuya

2015-08-01

Circulating tumour DNA (ctDNA) is an emerging field of cancer research. However, current ctDNA analysis is usually restricted to one or a few mutation sites due to technical limitations. In the case of massively parallel DNA sequencers, the number of false positives caused by a high read error rate is a major problem. In addition, the final sequence reads do not represent the original DNA population due to the global amplification step during the template preparation. We established a high-fidelity target sequencing system of individual molecules identified in plasma cell-free DNA using barcode sequences; this system consists of the following two steps. (i) A novel target sequencing method that adds barcode sequences by adaptor ligation. This method uses linear amplification to eliminate the errors introduced during the early cycles of polymerase chain reaction. (ii) The monitoring and removal of erroneous barcode tags. This process involves the identification of individual molecules that have been sequenced and for which the number of mutations has been absolutely quantitated. Using plasma cell-free DNA from patients with gastric or lung cancer, we demonstrated that the system achieved near complete elimination of false positives and enabled de novo detection and absolute quantitation of mutations in plasma cell-free DNA. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
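The general idea of identifying individual molecules by their barcodes can be illustrated with a small Python sketch: reads sharing a barcode are collapsed into one consensus sequence, and mutant molecules are then counted once per barcode rather than once per read. This is a simplified stand-in, not the authors' pipeline. The function names are hypothetical, reads within a barcode group are assumed to be of equal length and already aligned, and the `min_reads` threshold is an assumed, much cruder substitute for the erroneous-tag removal step described in the abstract.

```python
from collections import Counter, defaultdict


def consensus_by_barcode(reads, min_reads=2):
    """Collapse (barcode, sequence) reads into one majority-vote consensus per barcode.

    Barcodes supported by fewer than min_reads reads are discarded.
    """
    groups = defaultdict(list)
    for barcode, seq in reads:
        groups[barcode].append(seq)
    consensus = {}
    for barcode, seqs in groups.items():
        if len(seqs) < min_reads:
            continue
        # Majority base at each position across the reads of one molecule.
        consensus[barcode] = "".join(
            Counter(column).most_common(1)[0][0] for column in zip(*seqs)
        )
    return consensus


def count_mutant_molecules(consensus, position, mutant_base):
    """Number of distinct molecules whose consensus carries mutant_base at position."""
    return sum(1 for seq in consensus.values() if seq[position] == mutant_base)
```

A sequencing error present in only one read of a barcode group is outvoted by the other reads, whereas a true mutation appears in every read derived from the same original molecule.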
Sequence-Dependent Diastereospecific and Diastereodivergent Crosslinking of DNA by Decarbamoylmitomycin C.

PubMed

Aguilar, William; Paz, Manuel M; Vargas, Anayatzinc; Clement, Cristina C; Cheng, Shu-Yuan; Champeil, Elise

2018-04-20

Mitomycin C (MC), a potent antitumor drug, and decarbamoylmitomycin C (DMC), a derivative lacking the carbamoyl group, form highly cytotoxic DNA interstrand crosslinks. The major interstrand crosslink formed by DMC is the C1'' epimer of the major crosslink formed by MC. The molecular basis for the stereochemical configuration exhibited by DMC was investigated using biomimetic synthesis. The formation of DNA-DNA crosslinks by DMC is diastereospecific and diastereodivergent: Only the 1''S-diastereomer of the initially formed monoadduct can form crosslinks at GpC sequences, and only the 1''R-diastereomer of the monoadduct can form crosslinks at CpG sequences. We also show that CpG and GpC sequences react with divergent diastereoselectivity in the first alkylation step: 1''S stereochemistry is favored at GpC sequences and 1''R stereochemistry is favored at CpG sequences. Therefore, the first alkylation step results, at each sequence, in the selective formation of the diastereomer able to generate an interstrand DNA-DNA crosslink after the "second arm" alkylation. Examination of the known DNA adduct pattern obtained after treatment of cancer cell cultures with DMC indicates that the GpC sequence is the major target for the formation of DNA-DNA crosslinks in vivo by this drug. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sequencing historical specimens: successful preparation of small specimens with low amounts of degraded DNA.

PubMed

Sproul, John S; Maddison, David R

2017-11-01

Despite advances that allow DNA sequencing of old museum specimens, sequencing small-bodied, historical specimens can be challenging and unreliable as many contain only small amounts of fragmented DNA. Dependable methods to sequence such specimens are especially critical if the specimens are unique. We attempt to sequence small-bodied (3-6 mm) historical specimens (including nomenclatural types) of beetles that have been housed, dried, in museums for 58-159 years, and for which few or no suitable replacement specimens exist. To better understand ideal approaches of sample preparation and produce preparation guidelines, we compared different library preparation protocols using low amounts of input DNA (1-10 ng). We also explored low-cost optimizations designed to improve library preparation efficiency and sequencing success of historical specimens with minimal DNA, such as enzymatic repair of DNA. We report successful sample preparation and sequencing for all historical specimens despite our low-input DNA approach. We provide a list of guidelines related to DNA repair, bead handling, reducing adapter dimers and library amplification. We present these guidelines to facilitate more economical use of valuable DNA and enable more consistent results in projects that aim to sequence challenging, irreplaceable historical specimens. © 2017 John Wiley & Sons Ltd.
i-rDNA: alignment-free algorithm for rapid in silico detection of ribosomal gene fragments from metagenomic sequence data sets.

PubMed

Mohammed, Monzoorul Haque; Ghosh, Tarini Shankar; Chadaram, Sudha; Mande, Sharmila S

2011-11-30

Obtaining accurate estimates of microbial diversity using rDNA profiling is the first step in most metagenomics projects. Consequently, most metagenomic projects spend considerable amounts of time, money and manpower for experimentally cloning, amplifying and sequencing the rDNA content in a metagenomic sample. In the second step, the entire genomic content of the metagenome is extracted, sequenced and analyzed. Since DNA sequences obtained in this second step also contain rDNA fragments, rapid in silico identification of these rDNA fragments would drastically reduce the cost, time and effort of current metagenomic projects by entirely bypassing the experimental steps of primer based rDNA amplification, cloning and sequencing. In this study, we present an algorithm called i-rDNA that can facilitate the rapid detection of 16S rDNA fragments from amongst millions of sequences in metagenomic data sets with high detection sensitivity. Performance evaluation with data sets/database variants simulating typical metagenomic scenarios indicates the significantly high detection sensitivity of i-rDNA. Moreover, i-rDNA can process a million sequences in less than an hour on a simple desktop with modest hardware specifications. In addition to the speed of execution, high sensitivity and low false positive rate, the utility of the algorithmic approach discussed in this paper is immense given that it would help in bypassing the entire experimental step of primer-based rDNA amplification, cloning and sequencing. Application of this algorithmic approach would thus drastically reduce the cost, time and human efforts invested in all metagenomic projects. A web-server for the i-rDNA algorithm is available at http://metagenomics.atc.tcs.com/i-rDNA/
Biosensors for DNA sequence detection

NASA Technical Reports Server (NTRS)

Vercoutere, Wenonah; Akeson, Mark

2002-01-01

DNA biosensors are being developed as alternatives to conventional DNA microarrays. These devices couple signal transduction directly to sequence recognition. Some of the most sensitive and functional technologies use fibre optics or electrochemical sensors in combination with DNA hybridization. In a shift from sequence recognition by hybridization, two emerging single-molecule techniques read sequence composition using zero-mode waveguides or electrical impedance in nanoscale pores.
DNA Sequences from Formalin-Fixed Nematodes: Integrating Molecular and Morphological Approaches to Taxonomy

PubMed Central

Thomas, W. Kelley; Vida, J. T.; Frisse, Linda M.; Mundo, Manuel; Baldwin, James G.

1997-01-01

To effectively integrate DNA sequence analysis and classical nematode taxonomy, we must be able to obtain DNA sequences from formalin-fixed specimens. Microdissected sections of nematodes were removed from specimens fixed in formalin, using standard protocols and without destroying morphological features. The fixed sections provided sufficient template for multiple polymerase chain reaction-based DNA sequence analyses. PMID:19274156
Palindromic Sequence Artifacts Generated during Next Generation Sequencing Library Preparation from Historic and Ancient DNA

PubMed Central

Star, Bastiaan; Nederbragt, Alexander J.; Hansen, Marianne H. S.; Skage, Morten; Gilfillan, Gregor D.; Bradbury, Ian R.; Pampoulie, Christophe; Stenseth, Nils Chr; Jakobsen, Kjetill S.; Jentoft, Sissel

2014-01-01

Degradation-specific processes and variation in laboratory protocols can bias the DNA sequence composition from samples of ancient or historic origin. Here, we identify a novel artifact in sequences from historic samples of Atlantic cod (Gadus morhua), which forms interrupted palindromes consisting of reverse complementary sequence at the 5′ and 3′-ends of sequencing reads. The palindromic sequences themselves have specific properties – the bases at the 5′-end align well to the reference genome, whereas extensive misalignments exist among the bases at the terminal 3′-end. The terminal 3′ bases are artificial extensions likely caused by the occurrence of hairpin loops in single stranded DNA (ssDNA), which can be ligated and amplified in particular library creation protocols. We propose that such hairpin loops allow the inclusion of erroneous nucleotides, specifically at the 3′-end of DNA strands, with the 5′-end of the same strand providing the template. We also find these palindromes in previously published ancient DNA (aDNA) datasets, albeit at varying and substantially lower frequencies. This artifact can negatively affect the yield of endogenous DNA in these types of samples and introduces sequence bias. PMID:24608104
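A read carrying the artifact described above ends in the reverse complement of its own beginning. The Python sketch below flags such reads by finding the longest 3′ terminal stretch that is the exact reverse complement of the 5′ end. It is an illustration only, not the screening procedure used in the study: the function names and the `min_len` threshold are assumptions, and exact matching ignores sequencing errors within the palindrome arms.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def terminal_palindrome_length(read, min_len=4):
    """Length of the longest 3' end that is the reverse complement of the 5' end.

    Returns 0 when no arm of at least min_len bases is found. The arms may be
    separated by an arbitrary interrupting sequence in the middle of the read.
    """
    read = read.upper()
    best = 0
    for k in range(min_len, len(read) // 2 + 1):
        if read[-k:] == reverse_complement(read[:k]):
            best = k
    return best
```

For example, the read `AACCGGTTTTTCCGGTT` consists of the arm `AACCGG`, an interrupting `TTTTT`, and the reverse complement `CCGGTT`, so the function returns 6; such reads could then be trimmed or excluded before mapping.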
A new family of satellite DNA sequences as a major component of centromeric heterochromatin in owls (Strigiformes).

PubMed

Yamada, Kazuhiko; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2004-03-01

We isolated a new family of satellite DNA sequences from HaeIII- and EcoRI-digested genomic DNA of the Blakiston's fish owl (Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved in the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Sobottka, Marcelo, E-mail: sobottka@mtm.ufsc.br; Hart, Andrew G., E-mail: ahart@dim.uchile.cl

Highlights: We propose a simple stochastic model to construct primitive DNA sequences. The model provides an explanation for Chargaff's second parity rule in primitive DNA sequences. The model is also used to predict a novel type of strand symmetry in primitive DNA sequences. We extend the results to bacterial DNA sequences and compare distributional properties intrinsic to the model to statistical estimates from 1049 bacterial genomes. We find statistical evidence that the novel type of strand symmetry holds for bacterial DNA sequences. Abstract: Chargaff's second parity rule for short oligonucleotides states that the frequency of any short nucleotide sequence on a strand is approximately equal to the frequency of its reverse complement on the same strand. Recent studies have shown that, with the exception of organellar DNA, this parity rule generally holds for double-stranded DNA genomes and fails to hold for single-stranded genomes. While Chargaff's first parity rule is fully explained by the Watson-Crick pairing in the DNA double helix, a definitive explanation for the second parity rule has not yet been determined. In this work, we propose a model based on a hidden Markov process for approximating the distributional structure of primitive DNA sequences. Then, we use the model to provide another possible theoretical explanation for Chargaff's second parity rule, and to predict novel distributional aspects of bacterial DNA sequences.
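Chargaff's second parity rule, as stated in this abstract, can be checked empirically on a single strand by comparing the count of each short word with the count of its reverse complement. The Python sketch below computes one possible summary statistic for that comparison. The statistic and the function names are assumptions introduced here for illustration and are unrelated to the hidden Markov model proposed in the paper; the input is assumed to contain only A, C, G and T.

```python
from collections import Counter


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def kmer_counts(seq, k):
    """Count all overlapping words of length k on the given strand."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def parity_deviation(seq, k=2):
    """Deviation from Chargaff's second parity rule for words of length k.

    Sums |count(w) - count(reverse complement of w)| over each pair of words
    and divides by the total number of words: 0.0 means perfect parity,
    1.0 means no word shares the strand with its reverse complement.
    """
    counts = kmer_counts(seq.upper(), k)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    words = set(counts) | {reverse_complement(w) for w in counts}
    # Each (word, reverse complement) pair is visited twice, hence the factor 2.
    diff = sum(abs(counts[w] - counts[reverse_complement(w)]) for w in words)
    return diff / (2 * total)
```

Under the rule, a long double-stranded genome would be expected to give values near zero for small `k`, while the abstract notes that single-stranded genomes generally do not follow the rule.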
A Simulation of DNA Sequencing Utilizing 3M Post-It[R] Notes

ERIC Educational Resources Information Center

Christensen, Doug

2009-01-01

An inexpensive and equipment-free approach to teaching the technical aspects of DNA sequencing. The activity described requires an instructor with a familiarity of DNA sequencing technology but provides a straightforward method of teaching the technical aspects of sequencing in the absence of expensive sequencing equipment. The final sequence…
DNA and RNA sequencing by nanoscale reading through programmable electrophoresis and nanoelectrode-gated tunneling and dielectric detection

DOEpatents

Lee, James W.; Thundat, Thomas G.

2005-06-14

An apparatus and method for performing nucleic acid (DNA and/or RNA) sequencing on a single molecule. The genetic sequence information is obtained by probing through a DNA or RNA molecule base by base at nanometer scale as though looking through a strip of movie film. This DNA sequencing nanotechnology has the theoretical capability of performing DNA sequencing at a maximal rate of about 1,000,000 bases per second. This enhanced performance is made possible by a series of innovations including: novel applications of a fine-tuned nanometer gap for passage of a single DNA or RNA molecule; thin layer microfluidics for sample loading and delivery; and programmable electric fields for precise control of DNA or RNA movement. Detection methods include nanoelectrode-gated tunneling current measurements, dielectric molecular characterization, and atomic force microscopy/electrostatic force microscopy (AFM/EFM) probing for nanoscale reading of the nucleic acid sequences.
The sequence specificity of UV-induced DNA damage in a systematically altered DNA sequence.

PubMed

Khoe, Clairine V; Chung, Long H; Murray, Vincent

2018-06-01

The sequence specificity of UV-induced DNA damage was investigated in a specifically designed DNA plasmid using two procedures: end-labelling and linear amplification. Absorption of UV photons by DNA leads to dimerisation of pyrimidine bases and produces two major photoproducts, cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). A previous study had determined that two hexanucleotide sequences, 5'-GCTC*AC and 5'-TATT*AA, were high intensity UV-induced DNA damage sites. The UV clone plasmid was constructed by systematically altering each nucleotide of these two hexanucleotide sequences. One of the main goals of this study was to determine the influence of single nucleotide alterations on the intensity of UV-induced DNA damage. The sequence 5'-GCTC*AC was designed to examine the sequence specificity of 6-4PPs and the highest intensity 6-4PP damage sites were found at 5'-GTTC*CC nucleotides. The sequence 5'-TATT*AA was devised to investigate the sequence specificity of CPDs and the highest intensity CPD damage sites were found at 5'-TTTT*CG nucleotides. It was proposed that the tetranucleotide DNA sequence, 5'-YTC*Y (where Y is T or C), was the consensus sequence for the highest intensity UV-induced 6-4PP adduct sites; while it was 5'-YTT*C for the highest intensity UV-induced CPD damage sites. These consensus tetranucleotides are composed entirely of consecutive pyrimidines and must have a DNA conformation that is highly productive for the absorption of UV photons. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.
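The consensus tetranucleotides reported above use the IUPAC code Y (T or C). As a small illustration, the Python sketch below expands such a degenerate motif into a regular expression and lists where it occurs in a sequence. The helper name and the partial IUPAC table are assumptions made here, and the asterisk used in the abstract's notation is ignored in this sketch.

```python
import re

# Partial IUPAC nucleotide code table (only the symbols needed for this sketch).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "Y": "[CT]", "R": "[AG]", "N": "[ACGT]"}


def motif_positions(seq, motif):
    """Return 0-based start positions of a degenerate motif, overlaps included."""
    pattern = re.compile("(?=" + "".join(IUPAC[c] for c in motif) + ")")
    return [m.start() for m in pattern.finditer(seq.upper())]
```

With this helper, `motif_positions(sequence, "YTCY")` and `motif_positions(sequence, "YTTC")` would list candidate sites matching the two consensus sequences named in the abstract; whether a given site is actually a high-intensity damage site is an experimental question that the scan cannot answer.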
Optical Materials with a Genome: Nanophotonics with DNA-Stabilized Silver Clusters

NASA Astrophysics Data System (ADS)

Copp, Stacy M.

Fluorescent silver clusters with unique rod-like geometries are stabilized by DNA. The sizes and colors of these clusters, or AgN-DNA, are selected by DNA base sequence, which can tune peak emission from blue-green into the near-infrared. Combined with DNA nanostructures, AgN-DNA promise exciting applications in nanophotonics and sensing. Until recently, however, a lack of understanding of the mechanisms controlling AgN-DNA fluorescence has challenged such applications. This dissertation discusses progress toward understanding the role of DNA as a "genome" for silver clusters and toward using DNA to achieve atomic-scale precision of silver cluster size and nanometer-scale precision of silver cluster position on a DNA breadboard. We also investigate sensitivity of AgN-DNA to local solvent environment, with an eye toward applications in chemical and biochemical sensing. Using robotic techniques to generate large data sets, we show that fluorescent silver clusters are templated by certain DNA base motifs that select "magic-sized" cluster cores of enhanced stabilities. The linear arrangement of bases on the phosphate backbone imposes a unique rod-like geometry on the clusters. Harnessing machine learning and bioinformatics techniques, we also demonstrate that sequences of DNA templates can be selected to stabilize silver clusters with desired optical properties, including high fluorescence intensity and specific fluorescence wavelengths, with much higher rates of success as compared to current strategies. The discovered base motifs can be also used to design modular DNA host strands that enable individual silver clusters with atomically precise sizes to bind at specific programmed locations on a DNA nanostructure. We show that DNA-mediated nanoscale arrangement enables near-field coupling of distinct clusters, demonstrated by dual-color cluster assemblies exhibiting resonant energy transfer. 
These results demonstrate a new degree of control over the optical properties and relative positions of nanoparticles, selected almost solely by the sequence of DNA. AgN-DNA are promising chemical and biochemical sensors due to the sensitivity of their fluorescence to local environment. However, the mechanisms behind many sensing schemes are not understood, and the nature of the excited state of the silver cluster itself remains unknown. To probe the fluorescence mechanisms of AgN-DNA, we investigate the behavior of purified solutions of these clusters in various solvents. We find that standard models for fluorophore solvatochromism, including the Lippert-Mataga model, do not describe AgN-DNA fluorescence because such models neglect specific interactions between the cluster and surrounding solvent molecules. Fluorescence colors are well-modeled by Mie-Gans theory, suggesting that the local dielectric environment of the cluster does play a role in fluorescence, although additional specific solvent interactions and cluster shape changes may also determine fluorescence color and intensity. These results suggest that AgN-DNA may be sensitive to changes in local dielectric environment on nanometer length scales and may also act as sensors for small molecules with affinity for DNA.
DNA Replication Origins and Fork Progression at Mammalian Telomeres

PubMed Central

Higa, Mitsunori; Fujita, Masatoshi; Yoshida, Kazumasa

2017-01-01

Telomeres are essential chromosomal regions that prevent critical shortening of linear chromosomes and genomic instability in eukaryotic cells. The bulk of telomeric DNA is replicated by semi-conservative DNA replication in the same way as the rest of the genome. However, recent findings revealed that replication of telomeric repeats is a potential cause of chromosomal instability, because DNA replication through telomeres is challenged by the repetitive telomeric sequences and specific structures that hamper the replication fork. In this review, we summarize current understanding of the mechanisms by which telomeres are faithfully and safely replicated in mammalian cells. Various telomere-associated proteins ensure efficient telomere replication at different steps, such as licensing of replication origins, passage of replication forks, proper fork restart after replication stress, and dissolution of post-replicative structures. In particular, shelterin proteins have central roles in the control of telomere replication. Through physical interactions, accessory proteins are recruited to maintain telomere integrity during DNA replication. Dormant replication origins and/or homology-directed repair may rescue inappropriate fork stalling or collapse that can cause defects in telomere structure and functions. PMID:28350373
Torque measurements reveal sequence-specific cooperative transitions in supercoiled DNA

PubMed Central

Oberstrass, Florian C.; Fernandes, Louis E.; Bryant, Zev

2012-01-01

B-DNA becomes unstable under superhelical stress and is able to adopt a wide range of alternative conformations including strand-separated DNA and Z-DNA. Localized sequence-dependent structural transitions are important for the regulation of biological processes such as DNA replication and transcription. To directly probe the effect of sequence on structural transitions driven by torque, we have measured the torsional response of a panel of DNA sequences using single molecule assays that employ nanosphere rotational probes to achieve high torque resolution. The responses of Z-forming d(pGpC)n sequences match our predictions based on a theoretical treatment of cooperative transitions in helical polymers. “Bubble” templates containing 50–100 bp mismatch regions show cooperative structural transitions similar to B-DNA, although less torque is required to disrupt strand–strand interactions. Our mechanical measurements, including direct characterization of the torsional rigidity of strand-separated DNA, establish a framework for quantitative predictions of the complex torsional response of arbitrary sequences in their biological context. PMID:22474350
Ancient DNA from Pleistocene fossils: Preservation, recovery, and utility of ancient genetic information for Quaternary research

NASA Astrophysics Data System (ADS)

Yang, Hong

Until recently, recovery and analysis of genetic information encoded in ancient DNA sequences from Pleistocene fossils were impossible. Recent advances in molecular biology have offered technical tools to obtain ancient DNA sequences from well-preserved Quaternary fossils and opened the possibility of directly studying genetic changes in fossil species to address various biological and paleontological questions. Ancient DNA studies involving Pleistocene fossil material and ancient DNA degradation and preservation in Quaternary deposits are reviewed. The molecular technology applied to isolate, amplify, and sequence ancient DNA is also presented. Authentication of ancient DNA sequences and technical problems associated with modern and ancient DNA contamination are discussed. As illustrated in recent studies on ancient DNA from proboscideans, it is apparent that fossil DNA sequence data can shed light on many aspects of Quaternary research such as systematics and phylogeny, conservation biology, evolutionary theory, molecular taphonomy, and forensic sciences. Improvement of molecular techniques and a better understanding of DNA degradation during fossilization are likely to build on current strengths and to overcome existing problems, making fossil DNA data a unique source of information for Quaternary scientists.
Enantiospecific recognition of DNA sequences by a proflavine Tröger base.

PubMed

Bailly, C; Laine, W; Demeunynck, M; Lhomme, J

2000-07-05

The DNA interaction of a chiral Tröger base derived from proflavine was investigated by DNA melting temperature measurements and complementary biochemical assays. DNase I footprinting experiments demonstrate that the binding of the proflavine-based Tröger base is both enantio- and sequence-specific. The (+)-isomer poorly interacts with DNA in a non-sequence-selective fashion. In sharp contrast, the corresponding (-)-isomer recognizes preferentially certain DNA sequences containing both A·T and G·C base pairs, such as the motifs 5'-GTT·AAC and 5'-ATGA·TCAT. This is the first experimental demonstration that acridine-type Tröger bases can be used for enantiospecific recognition of DNA sequences. Copyright 2000 Academic Press.

Sensitive detection of mercury and copper ions by fluorescent DNA/Ag nanoclusters in guanine-rich DNA hybridization

NASA Astrophysics Data System (ADS)

Peng, Jun; Ling, Jian; Zhang, Xiu-Qing; Bai, Hui-Ping; Zheng, Liyan; Cao, Qiu-E.; Ding, Zhong-Tao

2015-02-01

In this work, we designed a new fluorescent oligonucleotide-stabilized silver nanocluster (DNA/AgNCs) probe for sensitive detection of mercury and copper ions. This probe contains two tailored DNA sequences. One is a signal probe that contains a cytosine-rich sequence template for AgNCs synthesis and link sequences at both ends. The other is a guanine-rich sequence for signal enhancement and a link sequence complementary to the link sequence of the signal probe. After hybridization, the fluorescence of the hybridized double-strand DNA/AgNCs is enhanced 200-fold, based on the fluorescence enhancement effect of DNA/AgNCs in proximity to a guanine-rich DNA sequence. The double-strand DNA/AgNCs probe is brighter and more stable than single-strand DNA/AgNCs and, more importantly, can be used as a novel fluorescent probe for detecting mercury and copper ions. Mercury and copper ions in the ranges of 6.0-160.0 and 6-240 nM, respectively, can be linearly detected, with detection limits of 2.1 and 3.4 nM, respectively. Our results indicate that the analytical parameters of the method for mercury and copper ion detection are much better than those obtained using single-strand DNA/AgNCs.
Ab initio DNA synthesis by Bst polymerase in the presence of nicking endonucleases Nt.AlwI, Nb.BbvCI, and Nb.BsmI.

PubMed

Antipova, Valeriya N; Zheleznaya, Lyudmila A; Zyrina, Nadezhda V

2014-08-01

In the absence of added DNA, thermophilic DNA polymerases synthesize, from free dNTPs, double-stranded DNA consisting of numerous repetitive units (ab initio DNA synthesis). The addition of a thermophilic restriction endonuclease (REase) or nicking endonuclease (NEase) effectively stimulates ab initio DNA synthesis and determines the nucleotide sequence of the reaction products. We have found that the NEases Nt.AlwI, Nb.BbvCI, and Nb.BsmI, which have non-palindromic recognition sites, stimulate the synthesis of sequences organized mainly as palindromes. Moreover, the nucleotide sequence of the palindromes appeared to depend on the NEase recognition/cleavage modes. Thus, the heterodimeric Nb.BbvCI stimulated the synthesis of palindromes composed of two recognition sites of this NEase, which were separated by AT-rich sequences or (A)n(T)m spacers. Palindromic DNA sequences obtained in ab initio DNA synthesis with the monomeric NEases Nb.BsmI and Nt.AlwI contained, along with the sites of these NEases, randomly synthesized sequences consisting of blocks of short repeats. These findings could help in investigating the potential of highly productive ab initio DNA synthesis for the creation of DNA molecules with a desired sequence. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase.

PubMed

Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V

2006-10-15

The 16,937-nucleotide sequence of the linear mitochondrial DNA (mtDNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scyphozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities, with transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

PubMed Central

Benslimane, A A; Dron, M; Hartmann, C; Rode, A

1986-01-01

Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology with one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (64% and 60%, respectively). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammals. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. PMID:3774553
Deoxyribozymes: Selection Design and Serendipity in the Development of DNA Catalysts

PubMed Central

Silverman, Scott K.

2009-01-01

CONSPECTUS One of the chemist’s key motivations is to explore the forefront of catalysis. In this Account, we describe our laboratory’s efforts at one such forefront: the use of DNA as a catalyst. Natural biological catalysts include both protein enzymes and RNA enzymes (ribozymes), whereas nature apparently uses DNA solely for genetic information storage. Nevertheless, the chemical similarities between RNA and DNA naturally lead to laboratory examination of DNA as a catalyst, especially because DNA is more stable than RNA and is less costly and easier to synthesize. Many catalytically active DNA sequences (deoxyribozymes, also called DNAzymes) have been identified in the laboratory by in vitro selection, in which many random DNA sequences are evaluated in parallel to find those rare sequences that have a desired functional ability. Since 2001, our research group has pursued new deoxyribozymes for various chemical reactions. We consider DNA simply as a large biopolymer that can adopt intricate three-dimensional structure and, in the presence of appropriate metal ions, generate the chemical complexity required to achieve catalysis. Our initial efforts focused on deoxyribozymes that ligate two RNA substrates. In these studies, we used only substrates that are readily obtained biochemically. Highly active deoxyribozymes have been identified, with emergent questions regarding chemical selectivity during RNA phosphodiester bond formation. Deoxyribozymes allow synthesis of interesting RNA products, such as branches and lariats, that are otherwise challenging to prepare. Our experiments have demonstrated that deoxyribozymes can have very high rate enhancements and chemical selectivities. We have also shown how the in vitro selection process itself can be directed towards desired goals, such as selective formation of native 3′–5′ RNA linkages. 
A final lesson is that unanticipated selection outcomes can be very interesting, highlighting the importance of allowing such opportunities in future experiments. More recently, we have begun using non-oligonucleotide substrates in our efforts with deoxyribozymes. We have especially focused on developing DNA catalysts for reactions of small molecules or amino acid side chains. For example, new deoxyribozymes have the catalytic power to create a nucleopeptide linkage between a tyrosine or serine side chain and the 5′-terminus of an RNA strand. Although considerable further work remains to establish DNA as a practical catalyst for small molecules and full-length proteins, the progress to date is very promising. The many lessons learned during the experiments described in this Account will help us and others to realize the full catalytic power of DNA. PMID:19572701
Improved Yield of High Molecular Weight DNA Coincides with Increased Microbial Diversity Access from Iron Oxide Cemented Sub-Surface Clay Environments

PubMed Central

Hurt, Richard A.; Robeson, Michael S.; Shakya, Migun; Moberly, James G.; Vishnivetskaya, Tatiana A.; Gu, Baohua; Elias, Dwayne A.

2014-01-01

Despite over three decades of progress, extraction of high molecular weight (HMW) DNA from high clay soils or iron oxide cemented clay has remained challenging. HMW DNA is desirable for next generation sequencing as it yields the most comprehensive coverage. Several DNA extraction procedures were compared from samples that exhibit strong nucleic acid adsorption. pH manipulation or use of alternative ion solutions offered no improvement in nucleic acid recovery. Lysis by liquid N2 grinding in concentrated guanidine followed by concentrated sodium phosphate extraction supported HMW DNA recovery from clays high in iron oxides. DNA recovered using 1 M sodium phosphate buffer (PB) as a competitive desorptive wash was 15.22±2.33 µg DNA/g clay, with most DNA consisting of >20 Kb fragments, compared to 2.46±0.25 µg DNA/g clay with the Powerlyzer system (MoBio). Increasing PB concentration in the lysis reagent coincided with increasing DNA fragment length during initial extraction. Rarefaction plots of 16S rRNA (V1–V3 region) pyrosequencing from A-horizon and clay soils showed an ∼80% and ∼400% larger accessed diversity compared to the Powerlyzer soil DNA system, respectively. The observed diversity from the Firmicutes showed the strongest increase with >3-fold more operational taxonomic units (OTU) recovered. PMID:25033199
Large-scale chromosome folding versus genomic DNA sequences: A discrete double Fourier transform technique.

PubMed

Chechetkin, V R; Lobzin, V V

2017-08-07

The use of state-of-the-art techniques combining imaging methods and high-throughput genomic mapping tools has led to significant progress in detailing the chromosome architecture of various organisms. However, a gap still remains between the rapidly growing structural data on chromosome folding and the large-scale genome organization. Could part of the information on chromosome folding be obtained directly from the underlying genomic DNA sequences abundantly stored in the databanks? To answer this question, we developed an original discrete double Fourier transform (DDFT). DDFT serves for the detection of large-scale genome regularities associated with domains/units at the different levels of hierarchical chromosome folding. The method is versatile and can be applied to both genomic DNA sequences and corresponding physico-chemical parameters such as base-pairing free energy. The latter characteristic is closely related to replication and transcription and can also be used for the assessment of temperature or supercoiling effects on chromosome folding. We tested the method on the genome of E. coli K-12 and found good correspondence with the annotated domains/units established experimentally. As a brief illustration of further abilities of DDFT, a study of large-scale genome organization for bacteriophage PHIX174 and the bacterium Caulobacter crescentus was also added. The combined experimental, modeling, and bioinformatic DDFT analysis should yield more complete knowledge of chromosome architecture and genome organization. Copyright © 2017 Elsevier Ltd. All rights reserved.
Statistical genetics concepts and approaches in schizophrenia and related neuropsychiatric research.

PubMed

Schork, Nicholas J; Greenwood, Tiffany A; Braff, David L

2007-01-01

Statistical genetics is a research field that focuses on mathematical models and statistical inference methodologies that relate genetic variations (ie, naturally occurring human DNA sequence variations or "polymorphisms") to particular traits or diseases (phenotypes) usually from data collected on large samples of families or individuals. The ultimate goal of such analysis is the identification of genes and genetic variations that influence disease susceptibility. Although of extreme interest and importance, the fact that many genes and environmental factors contribute to neuropsychiatric diseases of public health importance (eg, schizophrenia, bipolar disorder, and depression) complicates relevant studies and suggests that very sophisticated mathematical and statistical modeling may be required. In addition, large-scale contemporary human DNA sequencing and related projects, such as the Human Genome Project and the International HapMap Project, as well as the development of high-throughput DNA sequencing and genotyping technologies have provided statistical geneticists with a great deal of very relevant and appropriate information and resources. Unfortunately, the use of these resources and their interpretation are not straightforward when applied to complex, multifactorial diseases such as schizophrenia. In this brief and largely nonmathematical review of the field of statistical genetics, we describe many of the main concepts, definitions, and issues that motivate contemporary research. We also provide a discussion of the most pressing contemporary problems that demand further research if progress is to be made in the identification of genes and genetic variations that predispose to complex neuropsychiatric diseases.
One fungus, one name promotes progressive plant pathology.

PubMed

Wingfield, Michael J; De Beer, Z Wilhelm; Slippers, Bernard; Wingfield, Brenda D; Groenewald, Johannes Z; Lombard, Lorenzo; Crous, Pedro W

2012-08-01

The robust and reliable identification of fungi underpins virtually every element of plant pathology, from disease diagnosis to studies of biology, management/control, quarantine and, even more recently, comparative genomics. Most plant diseases are caused by fungi, typically pleomorphic organisms, for which the taxonomy and, in particular, a dual nomenclature system have frustrated and confused practitioners of plant pathology. The emergence of DNA sequencing has revealed cryptic taxa and revolutionized our understanding of relationships in the fungi. The impacts on plant pathology at every level are already immense and will continue to grow rapidly as new DNA sequencing technologies continue to emerge. DNA sequence comparisons, used to resolve a dual nomenclature problem for the first time only 19 years ago, have made it possible to approach a natural classification for the fungi and to abandon the confusing dual nomenclature system. The journey to a one fungus, one name taxonomic reality has been long and arduous, but its time has come. This will inevitably have a positive impact on plant pathology, plant pathologists and future students of this hugely important discipline on which the world depends for food security and plant health in general. This contemporary review highlights the problems of a dual nomenclature, especially its impact on plant pathogenic fungi, and charts the road to a one fungus, one name system that is rapidly drawing near. © 2011 The Authors. Molecular Plant Pathology © 2011 BSPP and Blackwell Publishing Ltd.
Inflammation, cancer, and targets of ginseng.

PubMed

Hofseth, Lorne J; Wargovich, Michael J

2007-01-01

Chronic inflammation is associated with a high cancer risk. At the molecular level, free radicals and aldehydes, produced during chronic inflammation, can induce deleterious gene mutation and posttranslational modifications of key cancer-related proteins. Other products of inflammation, including cytokines, growth factors, and transcription factors such as nuclear factor kappaB, control the expression of cancer genes (e.g., suppressor genes and oncogenes) and key inflammatory enzymes such as inducible nitric oxide synthase and cyclooxygenase-2. These enzymes in turn directly influence reactive oxygen species and eicosanoid levels. The procancerous outcome of chronic inflammation is increased DNA damage, increased DNA synthesis, cellular proliferation, disruption of DNA repair pathways and cellular milieu, inhibition of apoptosis, and promotion of angiogenesis and invasion. Chronic inflammation is also associated with immunosuppression, which is a risk factor for cancer. Current treatment strategies for reactive species overload diseases are frequently aimed at treating or preventing the cause of inflammation. Although these strategies have led to some progress in combating reactive species overload diseases and associated cancers, exposure often occurs again after eradication, treatment to eradicate the cause fails, or the treatment has long-term side effects. Therefore, the identification of molecules and pathways involved in chronic inflammation and cancer is critical to the design of agents that may help in preventing the progression of reactive species overload disease and cancer associated with disease progression. Here, we use ginseng as an example of an antiinflammatory molecule that targets many of the key players in the inflammation-to-cancer sequence.
Developmental modulation of DNA methylation in the fungus Phycomyces blakesleeanus.

PubMed Central

Antequera, F; Tamame, M; Vilanueva, J R; Santos, T

1985-01-01

DNA methylation is a rather sparse event among fungi. Phycomyces blakesleeanus seems to be one of the few exceptions in this context. 5-Methylcytosine represents 2.9% of the total cytosine in spore DNA and is located in approximately the same amount at any of the four CA, CT, CC or CG dinucleotides. A progressive and gradual drop in total 5-methylcytosine parallels the development of the fungus. This demethylation is non random but sequence specific and is not accounted for equally by the four different methylated dinucleotides, CG being much less affected (20% demethylated) than CA, CT and CC (more than 90% demethylated at the same time). "De novo" methylation to restore the initial pattern probably takes place during spore maturation. By using specific hybridization probes we have been able to show that the rRNA genes are not significantly methylated at any stage of development, regardless of their transcription status. PMID:2997714
Epigenetic regulation of vascular smooth muscle cell function in atherosclerosis.

PubMed

Findeisen, Hannes M; Kahles, Florian K; Bruemmer, Dennis

2013-04-01

Epigenetics involve heritable and acquired changes in gene transcription that occur independently of the DNA sequence. Epigenetic mechanisms constitute a hierarchic upper-level of transcriptional control through complex modifications of chromosomal components and nuclear structures. These modifications include, for example, DNA methylation or post-translational modifications of core histones; they are mediated by various chromatin-modifying enzymes; and ultimately they define the accessibility of a transcriptional complex to its target DNA. Integrating epigenetic mechanisms into the pathophysiologic concept of complex and multifactorial diseases such as atherosclerosis may significantly enhance our understanding of related mechanisms and provide promising therapeutic approaches. Although still in its infancy, intriguing scientific progress has begun to elucidate the role of epigenetic mechanisms in vascular biology, particularly in the control of smooth muscle cell phenotypes. In this review, we will summarize epigenetic pathways in smooth muscle cells, focusing on mechanisms involved in the regulation of vascular remodeling.
Cloning, structure, and chromosome localization of the mouse glutaryl-CoA dehydrogenase gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Koeller, D.M.; DiGiulio, A.; Frerman, F.E.

Glutaryl-CoA dehydrogenase (GCDH) is a nuclear-encoded, mitochondrial matrix enzyme. In humans, deficiency of GCDH leads to glutaric acidemia type I, an inherited disorder of amino acid metabolism characterized by a progressive neurodegenerative disease. In this report we describe the cloning and structure of the mouse GCDH (Gcdh) gene and cDNA and its chromosomal localization. The mouse Gcdh cDNA is 1.75 kb long and contains an open reading frame of 438 amino acids. The amino acid sequences of mouse, human, and pig GCDH are highly conserved. The mouse Gcdh gene contains 11 exons and spans 7 kb of genomic DNA. Gcdh was mapped by backcross analysis to mouse chromosome 8 within a region that is homologous to a region of human chromosome 19, where the human gene was previously mapped. 14 refs., 3 figs.
Epigenetics in myeloid derived suppressor cells: a sheathed sword towards cancer

PubMed Central

Zhang, Chao; Wang, Shuo; Liu, Yufeng; Yang, Cheng

2016-01-01

Myeloid-derived suppressor cells (MDSCs), a heterogeneous population of cells composed of progenitors and precursors to myeloid cells, are deemed to participate in the development of tumor-favoring immunosuppressive microenvironment. Thus, the regulatory strategies targeting MDSCs' expansion, differentiation, accumulation and function could possibly be effective “weapons” in anti-tumor immunotherapies. Epigenetic mechanisms, which involve DNA modification, covalent histone modification and RNA interference, result in the heritable down-regulation or silencing of gene expression without a change in DNA sequences. Epigenetic modification of MDSC's functional plasticity leads to the remodeling of its characteristics, therefore reframing the microenvironment towards countering tumor growth and metastasis. This review summarized the pertinent findings on the DNA methylation, covalent histone modification, microRNAs and small interfering RNAs targeting MDSC in cancer genesis, progression and metastasis. The potentials as well as possible obstacles in translating into anti-cancer therapeutics were also discussed. PMID:27458169
Genotoxin Induced Mutagenesis in the Model Plant Physcomitrella patens

PubMed Central

Holá, Marcela; Kozák, Jaroslav; Vágnerová, Radka; Angelis, Karel J.

2013-01-01

The moss Physcomitrella patens is unique for the high frequency of homologous recombination, haploid state, and filamentous growth during early stages of the vegetative growth, which makes it an excellent model plant to study DNA damage responses. We used single cell gel electrophoresis (comet) assay to determine kinetics of response to Bleomycin induced DNA oxidative damage and single and double strand breaks in wild type and mutant lig4 Physcomitrella lines. Moreover, APT gene when inactivated by induced mutations was used as selectable marker to ascertain mutational background at nucleotide level by sequencing of the APT locus. We show that extensive repair of DSBs occurs also in the absence of the functional LIG4, whereas repair of SSBs is seriously compromised. From analysis of induced mutations we conclude that their accumulation rather than remaining lesions in DNA and blocking progression through cell cycle is incompatible with normal plant growth and development and leads to sensitive phenotype. PMID:24383055
Torque Teno Virus in HIV-infected transgender in Surakarta, Indonesia

NASA Astrophysics Data System (ADS)

Hartono; Agung Prasetyo, Afiono; Fanani, Mohammad

2018-05-01

Torque Teno Virus (TTV) is a circular single-stranded DNA virus that may co-infect with human immunodeficiency virus (HIV), especially in high-risk communities, e.g. transgender people performing high-risk behavior. TTV shows increased viremia in HIV patients and may influence HIV clinical progression. Blood samples collected from transgender people performing high-risk behavior in Surakarta were tested by serological and molecular assays to detect the presence of HIV infection. The HIV-positive blood samples were then tested by nested polymerase chain reaction (PCR) to detect the presence of TTV DNA. The amplified PCR products were molecularly cloned and subjected to sequence analysis. TTV DNA was detected in 40.0% of HIV-positive samples. Molecular characterization revealed that the most prevalent genogroup was genogroup 3, followed by genogroups 2 and 1, respectively. TTV was detected at a high infection rate in HIV-infected transgender people performing high-risk behavior in Surakarta.
Bridging epigenomics and complex disease: the basics.

PubMed

Teperino, Raffaele; Lempradl, Adelheid; Pospisilik, J Andrew

2013-05-01

The DNA sequence largely defines gene expression and phenotype. However, it is becoming increasingly clear that an additional chromatin-based regulatory network imparts both stability and plasticity to genome output, modifying phenotype independently of the genetic blueprint. Indeed, alterations in this "epigenetic" control layer underlie, at least in part, the reason for monozygotic twins being discordant for disease. Functionally, this regulatory layer comprises post-translational modifications of DNA and histones, as well as small and large noncoding RNAs. Together these regulate gene expression by changing chromatin organization and DNA accessibility. Successive technological advances over the past decade have enabled researchers to map the chromatin state with increasing accuracy and comprehensiveness, catapulting genetic research into a genome-wide era. Here, aiming particularly at the genomics/epigenomics newcomer, we review the epigenetic basis that has helped drive the technological shift and how this progress is shaping our understanding of complex disease.
Next-Generation Sequencing Platforms

NASA Astrophysics Data System (ADS)

Mardis, Elaine R.

2013-06-01

Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.

Regulatory link between DNA methylation and active demethylation in Arabidopsis

PubMed Central

Lei, Mingguang; Zhang, Huiming; Julian, Russell; Tang, Kai; Xie, Shaojun; Zhu, Jian-Kang

2015-01-01

De novo DNA methylation through the RNA-directed DNA methylation (RdDM) pathway and active DNA demethylation play important roles in controlling genome-wide DNA methylation patterns in plants. Little is known about how cells manage the balance between DNA methylation and active demethylation activities. Here, we report the identification of a unique RdDM target sequence, where DNA methylation is required for maintaining proper active DNA demethylation of the Arabidopsis genome. In a genetic screen for cellular antisilencing factors, we isolated several REPRESSOR OF SILENCING 1 (ros1) mutant alleles, as well as many RdDM mutants, which showed drastically reduced ROS1 gene expression and, consequently, transcriptional silencing of two reporter genes. A helitron transposon element (TE) in the ROS1 gene promoter negatively controls ROS1 expression, whereas DNA methylation of an RdDM target sequence between ROS1 5′ UTR and the promoter TE region antagonizes this helitron TE in regulating ROS1 expression. This RdDM target sequence is also targeted by ROS1, and defective DNA demethylation in loss-of-function ros1 mutant alleles causes DNA hypermethylation of this sequence and concomitantly causes increased ROS1 expression. Our results suggest that this sequence in the ROS1 promoter region serves as a DNA methylation monitoring sequence (MEMS) that senses DNA methylation and active DNA demethylation activities. Therefore, the ROS1 promoter functions like a thermostat (i.e., methylstat) to sense DNA methylation levels and regulates DNA methylation by controlling ROS1 expression. PMID:25733903
Attomole-level Genomics with Single-molecule Direct DNA, cDNA and RNA Sequencing Technologies.

PubMed

Ozsolak, Fatih

2016-01-01

With the introduction of next-generation sequencing (NGS) technologies in 2005, the domination of microarrays in genomics quickly came to an end due to NGS's superior technical performance and cost advantages. By enabling genetic analysis capabilities that were not possible previously, NGS technologies have started to play an integral role in all areas of biomedical research. This chapter outlines the low-quantity DNA and cDNA sequencing capabilities and applications developed with the Helicos single molecule DNA sequencing technology.
A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

PubMed Central

Walker, M D; Park, C W; Rosen, A; Aronheim, A

1990-01-01

Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. PMID:2181401
Highly multiplexed targeted DNA sequencing from single nuclei.

PubMed

Leung, Marco L; Wang, Yong; Kim, Charissa; Gao, Ruli; Jiang, Jerry; Sei, Emi; Navin, Nicholas E

2016-02-01

Single-cell DNA sequencing methods are challenged by poor physical coverage, high technical error rates and low throughput. To address these issues, we developed a single-cell DNA sequencing protocol that combines flow-sorting of single nuclei, time-limited multiple-displacement amplification (MDA), low-input library preparation, DNA barcoding, targeted capture and next-generation sequencing (NGS). This approach represents a major improvement over our previous single nucleus sequencing (SNS) Nature Protocols paper in terms of generating higher-coverage data (>90%), thereby enabling the detection of genome-wide variants in single mammalian cells at base-pair resolution. Furthermore, by pooling 48-96 single-cell libraries together for targeted capture, this approach can be used to sequence many single-cell libraries in parallel in a single reaction. This protocol greatly reduces the cost of single-cell DNA sequencing, and it can be completed in 5-6 d by advanced users. This single-cell DNA sequencing protocol has broad applications for studying rare cells and complex populations in diverse fields of biological research and medicine.
Cumulative mtDNA damage and mutations contribute to the progressive loss of RGCs in a rat model of glaucoma.

PubMed

Wu, Ji-Hong; Zhang, Sheng-Hai; Nickerson, John M; Gao, Feng-Juan; Sun, Zhongmou; Chen, Xin-Ya; Zhang, Shu-Jie; Gao, Feng; Chen, Jun-Yi; Luo, Yi; Wang, Yan; Sun, Xing-Huai

2015-02-01

Glaucoma is a chronic neurodegenerative disease characterized by the progressive loss of retinal ganglion cells (RGCs). Mitochondrial DNA (mtDNA) alterations have been documented as a key component of many neurodegenerative disorders. However, whether mtDNA alterations contribute to the progressive loss of RGCs and the mechanism whereby this phenomenon could occur are poorly understood. We investigated mtDNA alterations in RGCs using a rat model of chronic intraocular hypertension and explored the mechanisms underlying progressive RGC loss. We demonstrate that the mtDNA damage and mutations triggered by intraocular pressure (IOP) elevation are initiating, crucial events in a cascade leading to progressive RGC loss. Damage to and mutation of mtDNA, mitochondrial dysfunction, reduced levels of mtDNA repair/replication enzymes, and elevated reactive oxygen species form a positive feedback loop that produces irreversible mtDNA damage and mutation and contributes to progressive RGC loss, which occurs even after a return to normal IOP. Furthermore, we demonstrate that mtDNA damage and mutations increase the vulnerability of RGCs to elevated IOP and glutamate levels, which are among the most common glaucoma insults. This study suggests that therapeutic approaches that target mtDNA maintenance and repair and that promote energy production may prevent the progressive death of RGCs. Copyright © 2014 Elsevier Inc. All rights reserved.
The Giardia genome project database.

PubMed

McArthur, A G; Morrison, H G; Nixon, J E; Passamaneck, N Q; Kim, U; Hinkle, G; Crocker, M K; Holder, M E; Farr, R; Reich, C I; Olsen, G E; Aley, S B; Adam, R D; Gillin, F D; Sogin, M L

2000-08-15

The Giardia genome project database provides an online resource for Giardia lamblia (WB strain, clone C6) genome sequence information. The database includes edited single-pass reads, the results of BLASTX searches, and details of progress towards sequencing the entire 12 million-bp Giardia genome. Pre-sorted BLASTX results can be retrieved based on keyword searches and BLAST searches of the high throughput Giardia data can be initiated from the web site or through NCBI. Descriptions of the genomic DNA libraries, project protocols and summary statistics are also available. Although the Giardia genome project is ongoing, new sequences are made available on a bi-monthly basis to ensure that researchers have access to information that may assist them in the search for genes and their biological function. The current URL of the Giardia genome project database is www.mbl.edu/Giardia.
Trinucleotide repeat length and progression of illness in Huntington's disease.

PubMed Central

Kieburtz, K; MacDonald, M; Shih, C; Feigin, A; Steinberg, K; Bordwell, K; Zimmerman, C; Srinidhi, J; Sotack, J; Gusella, J

1994-01-01

The genetic defect causing Huntington's disease (HD) has been identified as an unstable expansion of a trinucleotide (CAG) repeat sequence within the coding region of the IT15 gene on chromosome 4. In 50 patients with manifest HD who were evaluated prospectively and uniformly, we examined the relationship between the extent of the DNA expansion and the rate of illness progression. Although the length of CAG repeats showed a strong inverse correlation with the age at onset of HD, there was no such relationship between the number of CAG repeats and the rate of clinical decline. These findings suggest that the CAG repeat length may influence or trigger the onset of HD, but other genetic, neurobiological, or environmental factors contribute to the progression of illness and the underlying pace of neuronal degeneration. PMID:7853373
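The repeat length discussed in this record can be illustrated with a small Python sketch. It is only an illustration under stated assumptions: the function name `longest_cag_run` is hypothetical, the input is assumed to be a plain nucleotide string, and the study itself does not describe any software of this kind.

```python
import re


def longest_cag_run(seq: str) -> int:
    """Return the number of CAG units in the longest uninterrupted CAG run.

    Hypothetical helper for illustration; not taken from the cited study.
    """
    # Find every maximal stretch made only of consecutive CAG triplets.
    runs = re.findall(r"(?:CAG)+", seq.upper())
    # Each run's length in bases divided by 3 gives its repeat count.
    return max((len(run) // 3 for run in runs), default=0)
```

A call such as `longest_cag_run("ATGCAGCAGCAGTTTCAG")` scans both CAG stretches and reports the longer one.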
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as a cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE PAGES

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...

2017-07-18

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as a cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

PubMed Central

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as a cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
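Two of the gap properties named in this record, GC content and gap length, are simple to compute. The Python sketch below is a hedged illustration only: the function names `gc_content` and `summarize_gap` are hypothetical, ambiguous bases such as N are assumed to be excluded from the GC denominator, and nothing here reproduces the authors' actual pipeline.

```python
def gc_content(seq: str) -> float:
    """Fraction of unambiguous bases (A, C, G, T) that are G or C.

    Assumption for this sketch: ambiguous symbols such as N are ignored.
    """
    seq = seq.upper()
    unambiguous = sum(seq.count(base) for base in "ACGT")
    if unambiguous == 0:
        return 0.0  # avoid dividing by zero for empty or all-N input
    return (seq.count("G") + seq.count("C")) / unambiguous


def summarize_gap(seq: str) -> dict:
    """Report the gap length and GC content for one gap sequence."""
    return {"length": len(seq), "gc": gc_content(seq)}
```

Applied to each gap sequence in turn, `summarize_gap` would give a per-gap table that could be compared with the genome-wide average GC content.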
Mapping the binding site of aflatoxin B1 in DNA: systematic analysis of the reactivity of aflatoxin B1 with guanines in different DNA sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Benasutti, M.; Ejadi, S.; Whitlow, M.D.

The mutagenic and carcinogenic chemical aflatoxin B1 (AFB1) reacts almost exclusively at the N(7)-position of guanine following activation to its reactive form, the 8,9-epoxide (AFB1 oxide). In general N(7)-guanine adducts yield DNA strand breaks when heated in base, a property that serves as the basis for the Maxam-Gilbert DNA sequencing reaction specific for guanine. Using DNA sequencing methods, other workers have shown that AFB1 oxide gives strand breaks at positions of guanines; however, the guanine bands varied in intensity. This phenomenon has been used to infer that AFB1 oxide prefers to react with guanines in some sequence contexts more than in others and has been referred to as sequence specificity of binding. Herein, data on the reaction of AFB1 oxide with several synthetic DNA polymers with different sequences are presented, and (following hydrolysis) adduct levels are determined by high-pressure liquid chromatography. These results reveal that for AFB1 oxide (1) the N(7)-guanine adduct is the major adduct found in all of the DNA polymers, (2) adduct levels vary in different sequences, and, thus, sequence specificity is also observed by this more direct method, and (3) the intensity of bands in DNA sequencing gels is likely to reflect adduct levels formed at the N(7)-position of guanine. Knowing this, a reinvestigation of the reactivity of guanines in different DNA sequences using DNA sequencing methods was undertaken. Methods are developed to determine the X (5'-side) base and the Y (3'-side) base are most influential in determining guanine reactivity. These rules in conjunction with molecular modeling studies were used to assess the binding sites that might be utilized by AFB1 oxide in its reaction with DNA.
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members. This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
The past, present and future of mitochondrial genomics: have we sequenced enough mtDNAs?

PubMed

Smith, David Roy

2016-01-01

The year 2014 saw more than a thousand new mitochondrial genome sequences deposited in GenBank, an almost 15% increase from the previous year. Hundreds of peer-reviewed articles accompanied these genomes, making mitochondrial DNAs (mtDNAs) the most sequenced and reported type of eukaryotic chromosome. These mtDNA data have advanced a wide range of scientific fields, from forensics to anthropology to medicine to molecular evolution. But for many biological lineages, mtDNAs are so well sampled that newly published genomes are arguably no longer contributing significantly to the progression of science, and in some cases they are tying up valuable resources, particularly journal editors and referees. Is it time to acknowledge that as a research community we have published enough mitochondrial genome papers? Here, I address this question, exploring the history, milestones and impacts of mitochondrial genomics, the benefits and drawbacks of continuing to publish mtDNAs at a high rate and what the future may hold for such an important and popular genetic marker. I highlight groups for which mtDNAs are still poorly sampled, thus meriting further investigation, and recommend that more energy be spent characterizing aspects of mitochondrial genomes apart from the DNA sequence, such as their chromosomal and transcriptional architectures. Ultimately, one should be mindful before writing a mitochondrial genome paper. Consider perhaps sending the sequence directly to GenBank instead, and be sure to annotate it correctly before submission. © The Author 2015. Published by Oxford University Press.
Affordable Hands-On DNA Sequencing and Genotyping: An Exercise for Teaching DNA Analysis to Undergraduates

ERIC Educational Resources Information Center

Shah, Kushani; Thomas, Shelby; Stein, Arnold

2013-01-01

In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C…
Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.

PubMed

van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J

2017-10-01

Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially (P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is limited. By sequencing a number of infections with known follow-up for up to 3 years, we gained initial insights into the genetic diversity of HPV16 and the effects of the viral genome on the persistence of infections. A SNP comparison between sequences obtained from clearing and persistent infections did not identify strongly acting DNA variations responsible for these infection outcomes. In addition, we identified an HPV16 reinfection event where sequencing of initial and follow-up samples showed different HPV16 variants. Based on conventional genotyping, this infection would incorrectly be considered a persistent HPV16 infection. In the context of vaccine efficacy and monitoring studies, such infections could potentially cause reduced reported efficacy or efficiency. Copyright © 2017 van der Weele et al.
DNA Barcode Goes Two-Dimensions: DNA QR Code Web Server

PubMed Central

Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin

2012-01-01

The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, “DNA barcode” actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interest, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications. PMID:22574113
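To make the idea of compression efficiency concrete, the Python sketch below packs a DNA sequence into two bits per base before it would be handed to any barcode symbology. This is an assumed, generic scheme for illustration: the names `pack_dna` and `unpack_dna`, the 4-byte length prefix, and the restriction to uppercase A, C, G and T are all choices made here, and the record does not state which encoding the web server actually uses.

```python
_CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
_BASES = "ACGT"


def pack_dna(seq: str) -> bytes:
    """Pack an A/C/G/T string into bytes, four bases per byte.

    A 4-byte big-endian length prefix records how many bases are real,
    so padding in the last byte can be discarded when unpacking.
    """
    out = bytearray(len(seq).to_bytes(4, "big"))
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | _CODE[base]  # KeyError on non-ACGT input
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def unpack_dna(data: bytes) -> str:
    """Invert pack_dna, dropping the padding bases in the final byte."""
    n = int.from_bytes(data[:4], "big")
    bases = []
    for byte in data[4:]:
        for shift in (6, 4, 2, 0):
            bases.append(_BASES[(byte >> shift) & 3])
    return "".join(bases[:n])
```

Under this scheme a sequence of n bases occupies 4 + ceil(n / 4) bytes instead of n bytes as plain text, which is the kind of saving that lets longer barcode regions fit within a single two-dimensional symbol.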
High-resolution characterization of sequence signatures due to non-random cleavage of cell-free DNA.

PubMed

Chandrananda, Dineika; Thorne, Natalie P; Bahlo, Melanie

2015-06-17

High-throughput sequencing of cell-free DNA fragments found in human plasma has been used to non-invasively detect fetal aneuploidy, monitor organ transplants and investigate tumor DNA. However, many biological properties of this extracellular genetic material remain unknown. Research that further characterizes circulating DNA could substantially increase its diagnostic value by allowing the application of more sophisticated bioinformatics tools that lead to an improved signal to noise ratio in the sequencing data. In this study, we investigate various features of cell-free DNA in plasma using deep-sequencing data from two pregnant women (>70X, >50X) and compare them with matched cellular DNA. We utilize a descriptive approach to examine how the biological cleavage of cell-free DNA affects different sequence signatures such as fragment lengths, sequence motifs at fragment ends and the distribution of cleavage sites along the genome. We show that the size distributions of these cell-free DNA molecules are dependent on their autosomal and mitochondrial origin as well as the genomic location within chromosomes. DNA mapping to particular microsatellites and alpha repeat elements displays unique size signatures. By correlating the mapping locations of cell-free fragments with ENCODE annotation of chromatin organization, we show that these fragments occur in clusters along the genome, localize to nucleosomal arrays and are preferentially cleaved at linker regions. Our work further demonstrates that cell-free autosomal DNA cleavage is sequence dependent. The region spanning up to 10 positions on either side of the DNA cleavage site shows a consistent pattern of preference for specific nucleotides. This sequence motif is present in cleavage sites localized to nucleosomal cores and linker regions but is absent in nucleosome-free mitochondrial DNA. These background signals in cell-free DNA sequencing data stem from the non-random biological cleavage of these fragments. This sequence structure can be harnessed to improve bioinformatics algorithms, in particular for CNV and structural variant detection. Descriptive measures for cell-free DNA features developed here could also be used in biomarker analysis to monitor the changes that occur during different pathological conditions.
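One of the sequence signatures mentioned in this record, the motif at fragment ends, can be tallied with a few lines of Python. The sketch is illustrative only: the function name `end_motif_counts`, the default motif length of 4, and the assumption that fragments arrive as plain strings read from their 5' ends are choices made here, not details taken from the study.

```python
from collections import Counter


def end_motif_counts(fragments, k=4):
    """Count the k-base motif at the 5' end of each fragment.

    Hypothetical helper: fragments shorter than k are skipped.
    """
    counts = Counter()
    for fragment in fragments:
        fragment = fragment.upper()
        if len(fragment) >= k:
            counts[fragment[:k]] += 1
    return counts
```

Comparing such counts between cell-free and matched cellular DNA is one simple way to see whether cleavage favours particular nucleotides, in the spirit of the descriptive approach outlined above.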
Nuclear targeting of viral and non-viral DNA.

PubMed

Chowdhury, E H

2009-07-01

The nuclear envelope presents a major barrier to transgene delivery and expression using a non-viral vector. Viruses are capable of overcoming this barrier and delivering their genetic material efficiently into the nucleus by virtue of specialized protein components with unique amino acid sequences that recognize the cellular nuclear transport machinery. However, considering the safety issues in clinical gene therapy for treating critical human diseases, non-viral systems are highly promising compared with their viral counterparts. This review summarizes the progress in exploring the nuclear trafficking mechanisms of the prominent viral vectors and the technological innovations for the nuclear delivery of non-viral DNA that mimic the natural processes evolved by viruses as well as by many cellular proteins.
Analysis of DNA Sequences by An Optical Time-Integrating Correlator: Proof-Of-Concept Experiments.

DTIC Science & Technology

1992-05-01

TABLES xv LIST OF ABBREVIATIONS xvii 1.0 INTRODUCTION 1 2.0 DNA ANALYSIS STRATEGY 4 2.1 Representation of DNA Bases 4 2.2 DNA Analysis Strategy 6 3.0...Zehnder architecture. 3 Figure 3: Short representations of the DNA bases where each base is represented by a 7-bits long pseudorandom sequence. 5... DNA bases where each base is represented by 7-bits long pseudorandom sequences. 4 Table 2: Long representations of the DNA bases with 255-bits maximum
SNP discovery through de novo deep sequencing using the next generation of DNA sequencers

USDA-ARS's Scientific Manuscript database

The production of high volumes of DNA sequence data using new technologies has permitted more efficient identification of single nucleotide polymorphisms in vertebrate genomes. This chapter presented practical methodology for production and analysis of DNA sequence data for SNP discovery....

A simple procedure for parallel sequence analysis of both strands of 5'-labeled DNA.

PubMed

Razvi, F; Gargiulo, G; Worcel, A

1983-08-01

Ligation of a 5'-labeled DNA restriction fragment results in a circular DNA molecule carrying the two 32Ps at the reformed restriction site. Double digestion of the circular DNA with the original enzyme and a second restriction enzyme that cleaves near the labeled site allows direct chemical sequencing of one 5'-labeled DNA strand. A similar double digestion, using an isoschizomer that cleaves differently at the 32P-labeled site, allows direct sequencing of the now 3'-labeled complementary DNA strand. It is possible to directly sequence both strands of cloned DNA inserts by using the above protocol and a multiple cloning site vector that provides the necessary restriction sites. The simultaneous and parallel visualization of both DNA strands eliminates sequence ambiguities. In addition, the labeled circular molecules are particularly useful for single-hit DNA cleavage studies and DNA footprint analysis. As an example, we show here an analysis of the micrococcal nuclease-induced breaks on the two strands of the somatic 5S RNA gene of Xenopus borealis, which suggests that the enzyme may recognize and cleave small AT-containing palindromes along the DNA helix.
A Glimpse into the Satellite DNA Library in Characidae Fish (Teleostei, Characiformes)

PubMed Central

Utsunomia, Ricardo; Ruiz-Ruano, Francisco J.; Silva, Duílio M. Z. A.; Serrano, Érica A.; Rosa, Ivana F.; Scudeler, Patrícia E. S.; Hashimoto, Diogo T.; Oliveira, Claudio; Camacho, Juan Pedro M.; Foresti, Fausto

2017-01-01

Satellite DNA (satDNA) is an abundant fraction of repetitive DNA in eukaryotic genomes and plays an important role in genome organization and evolution. In general, satDNA sequences follow a concerted evolutionary pattern through the intragenomic homogenization of different repeat units. In addition, the satDNA library hypothesis predicts that related species share a series of satDNA variants descended from a common ancestor species, with differential amplification of different satDNA variants. The finding of the same satDNA family in species belonging to different genera within Characidae fish provided the opportunity to test both the concerted evolution and library hypotheses. For this purpose, we analyzed sequence variation and abundance of this satDNA family in ten species, by a combination of next generation sequencing (NGS), PCR and Sanger sequencing, and fluorescence in situ hybridization (FISH). We found extensive between-species variation in the number and size of pericentromeric FISH signals. At the genomic level, the analysis of thousands of DNA sequences obtained by Illumina sequencing and PCR amplification allowed us to define 150 haplotypes, which were linked in a common minimum spanning tree in which different patterns of concerted evolution were apparent. This also provided a glimpse into the satDNA library of this group of species. Consistent with the library hypothesis, different variants of this satDNA showed large differences in abundance between species, from highly abundant to merely relictual variants. PMID:28855916
Refined annotation and assembly of the Tetrahymena thermophila genome sequence through EST analysis, comparative genomic hybridization, and targeted gap closure

PubMed Central

Coyne, Robert S; Thiagarajan, Mathangi; Jones, Kristie M; Wortman, Jennifer R; Tallon, Luke J; Haas, Brian J; Cassidy-Hanley, Donna M; Wiley, Emily A; Smith, Joshua J; Collins, Kathleen; Lee, Suzanne R; Couvillion, Mary T; Liu, Yifan; Garg, Jyoti; Pearlman, Ronald E; Hamilton, Eileen P; Orias, Eduardo; Eisen, Jonathan A; Methé, Barbara A

2008-01-01

Background: Tetrahymena thermophila, a widely studied model for cellular and molecular biology, is a binucleated single-celled organism with a germline micronucleus (MIC) and somatic macronucleus (MAC). The recent draft MAC genome assembly revealed low sequence repetitiveness, a result of the epigenetic removal of invasive DNA elements found only in the MIC genome. Such low repetitiveness makes complete closure of the MAC genome a feasible goal; achieving it would require standard closure methods as well as removal of minor MIC contamination of the MAC genome assembly. Highly accurate preliminary annotation of Tetrahymena's coding potential was hindered by the lack of both comparative genomic sequence information from close relatives and significant amounts of cDNA evidence, thus limiting the value of the genomic information and also leaving unanswered certain questions, such as the frequency of alternative splicing. Results: We addressed the problem of MIC contamination using comparative genomic hybridization with purified MIC and MAC DNA probes against a whole genome oligonucleotide microarray, allowing the identification of 763 genome scaffolds likely to contain MIC-limited DNA sequences. We also employed standard genome closure methods to essentially finish over 60% of the MAC genome. For the improvement of annotation, we have sequenced and analyzed over 60,000 verified EST reads from a variety of cellular growth and development conditions. Using this EST evidence, a combination of automated and manual reannotation efforts led to updates that affect 16% of the current protein-coding gene models. By comparing EST abundance, many genes showing apparent differential expression between these conditions were identified. Rare instances of alternative splicing and uses of the non-standard amino acid selenocysteine were also identified. Conclusion: We report here significant progress in genome closure and reannotation of Tetrahymena thermophila. Our experience to date suggests that complete closure of the MAC genome is attainable. Using the new EST evidence, automated and manual curation has resulted in substantial improvements to the over 24,000 gene models, which will be valuable to researchers studying this model organism as well as for comparative genomics purposes. PMID:19036158
Single-molecule detection of epidermal growth factor receptor mutations in plasma by microfluidics digital PCR in non-small cell lung cancer patients.

PubMed

Yung, Tony K F; Chan, K C Allen; Mok, Tony S K; Tong, Joanna; To, Ka-Fai; Lo, Y M Dennis

2009-03-15

We aim to develop a digital PCR-based method for the quantitative detection of the two common epidermal growth factor receptor (EGFR) mutations (in-frame deletion at exon 19 and L858R at exon 21) in the plasma and tumor tissues of patients suffering from non-small cell lung cancers. These two mutations account for >85% of clinically important EGFR mutations associated with responsiveness to tyrosine kinase inhibitors. DNA samples were analyzed using a microfluidics system that simultaneously performed 9,180 PCRs at nanoliter scale. A single-mutant DNA molecule in a clinical specimen could be detected and the quantities of mutant and wild-type sequences were precisely determined. Exon 19 deletion and L858R mutation were detectable in 6 (17%) and 9 (26%) of 35 pretreatment plasma samples, respectively. When compared with the sequencing results of the tumor samples, the sensitivity and specificity of plasma EGFR mutation analysis were 92% and 100%, respectively. The plasma concentration of the mutant sequences correlated well with the clinical response. Decreased concentration was observed in all patients with partial or complete clinical remission, whereas persistence of mutation was observed in a patient with cancer progression. In one patient, tyrosine kinase inhibitor was stopped after an initial response and the tumor-associated EGFR mutation reemerged 4 weeks after stopping treatment. The sensitive detection and accurate quantification of low abundance EGFR mutations in tumor tissues and plasma by microfluidics digital PCR would be useful for predicting treatment response, monitoring disease progression and early detection of treatment failure associated with acquired drug resistance.
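The abstract does not spell out how counts of positive reactions are turned into molecule counts. The following is a minimal sketch of the Poisson correction commonly used in digital PCR, under the assumption that the 9,180 reactions are equal-volume partitions and that template molecules distribute among them at random. The function names and any example counts are hypothetical and are not taken from the study.

```python
import math

def copies_per_partition(positive: int, total: int) -> float:
    """Mean number of target molecules per partition.

    Uses the Poisson relation lambda = -ln(1 - positive/total), which corrects
    for partitions that received more than one molecule.
    """
    if not 0 <= positive < total:
        raise ValueError("positive must satisfy 0 <= positive < total")
    return -math.log(1.0 - positive / total)

def estimated_copies(positive: int, total: int) -> float:
    """Estimated number of target molecules loaded across all partitions."""
    return copies_per_partition(positive, total) * total

def mutant_fraction(mutant_positive: int, wildtype_positive: int, total: int) -> float:
    """Fraction of mutant molecules among mutant plus wild-type molecules."""
    mutant = estimated_copies(mutant_positive, total)
    wildtype = estimated_copies(wildtype_positive, total)
    if mutant + wildtype == 0:
        return 0.0  # nothing detected in either channel
    return mutant / (mutant + wildtype)
```

With few positive partitions the correction is negligible (one positive partition out of 9,180 corresponds to very nearly one molecule); it matters as the panel approaches saturation.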
Detection of IDH1 mutation in human gliomas: comparison of immunohistochemistry and sequencing.

PubMed

Takano, Shingo; Tian, Wei; Matsuda, Masahide; Yamamoto, Tetsuya; Ishikawa, Eiichi; Kaneko, Mika Kato; Yamazaki, Kentaro; Kato, Yukinari; Matsumura, Akira

2011-04-01

Isocitrate dehydrogenase 1 (IDH1) mutations have recently been identified as early and frequent genetic alterations in astrocytomas, oligodendrogliomas, and oligoastrocytomas, as well as secondary glioblastomas, whereas primary glioblastomas very rarely contain IDH1 mutations. Furthermore, a specific monoclonal antibody, IMab-1, which recognizes IDH1-R132H (the most frequent IDH1 mutation), has been generated. IMab-1 has been reported to react with the IDH1-R132H protein, but not the wild-type IDH1 or the other IDH1 mutant proteins in Western-blot analysis. However, the importance of immunohistochemistry using IMab-1 has not yet been elucidated. In this study, we compared the findings from IMab-1 immunohistochemistry and direct DNA sequencing using 49 glioma samples. IMab-1 detected 12 out of 49 cases; however, only nine cases were found to be IDH1-R132H by direct DNA sequencing because of a small population of IDH1-R132H mutation-possessing tumor cells, indicating that IMab-1 immunohistochemistry is useful for detecting IDH1-R132H. We conducted immunohistochemical detection in 52 cases of grade III astrocytomas. The median time to progression (TTP) was significantly longer in the cases with the IDH1 mutation (86.7 months) compared to the cases without the IDH1 mutation (wild type, 10.4 months) (p < 0.01). In conclusion, the anti-IDH1-R132H-specific monoclonal antibody IMab-1 is very useful for detecting IDH1-R132H in immunohistochemistry, and predicting the time to progression in grade III anaplastic astrocytomas. Therefore, IMab-1 is likely to be useful for the diagnosis of mutation-bearing gliomas and for determining the treatment strategy of grade III gliomas.
Short, interspersed, and repetitive DNA sequences in Spiroplasma species.

PubMed

Nur, I; LeBlanc, D J; Tully, J G

1987-03-01

Small fragments of DNA from an 8-kbp plasmid, pRA1, from a plant pathogenic strain of Spiroplasma citri were shown previously to be present in the chromosomal DNA of at least two species of Spiroplasma. We describe here the shot-gun cloning of chromosomal DNA from S. citri Maroc and the identification of two distinct sequences exhibiting homology to pRA1. Further subcloning experiments provided specific molecular probes for the identification of these two sequences in chromosomal DNA from three distinct plant pathogenic species of Spiroplasma. The results of Southern blot hybridization indicated that each of the pRA1-associated sequences is present as multiple copies in short, dispersed, and repetitive sequences in the chromosomes of these three strains. None of the sequences was detectable in chromosomal DNA from an additional nine Spiroplasma strains examined.
Laser Desorption Mass Spectrometry for DNA Sequencing and Analysis

NASA Astrophysics Data System (ADS)

Chen, C. H. Winston; Taranenko, N. I.; Golovlev, V. V.; Isola, N. R.; Allman, S. L.

1998-03-01

Rapid DNA sequencing and/or analysis is critically important for biomedical research. In the past, gel electrophoresis has been the primary tool for DNA analysis and sequencing. However, gel electrophoresis is a time-consuming and labor-intensive process. Recently, we have developed and used laser desorption mass spectrometry (LDMS) to sequence ss-DNA longer than 100 nucleotides. With LDMS, we succeeded in sequencing DNA in seconds instead of the hours or days required by gel electrophoresis. In addition to sequencing, we also applied LDMS to the detection of DNA probes for hybridization. LDMS was also used to detect short tandem repeats for forensic applications. Clinical applications for disease diagnosis, such as cystic fibrosis caused by base deletion and point mutation, have also been demonstrated. Experimental details will be presented at the meeting.
DNA Barcoding of Metazoan Zooplankton Copepods from South Korea

PubMed Central

Ryu, Shi Hyun; Kim, Sang Ki; Lee, Jin Hee; Lim, Young Jin; Lee, Jimin; Jun, Jumin; Kwak, Myounghai; Lee, Young-Sup; Hwang, Jae-Sam; Venmathi Maran, Balu Alagar; Chang, Cheon Young; Kim, Il-Hoi; Hwang, Ui Wook

2016-01-01

Copepods, small aquatic crustaceans, are the most abundant metazoan zooplankton and outnumber every other group of multicellular animals on earth. Despite their ecological and biological importance in aquatic environments, their morphological plasticity, which originates from their various lifestyles and their incomparable capacity to adapt to a variety of environments, has made species identification challenging, even for expert taxonomists. Molecular approaches to species identification have allowed rapid detection, discrimination, and identification of cryptic or sibling species based on DNA sequence data. We examined sequence variation of a partial mitochondrial cytochrome C oxidase I gene (COI) from 133 copepod individuals collected from the Korean Peninsula, in order to identify and discriminate 94 copepod species covering the six copepod orders Calanoida, Cyclopoida, Harpacticoida, Monstrilloida, Poecilostomatoida and Siphonostomatoida. The results showed a clear gap, with a ca. 20-fold difference between the average within-species sequence divergence (2.42%) and the average between-species sequence divergence (42.79%) in COI, suggesting the plausible utility of this gene in delimiting copepod species. With the COI barcoding data for the 94 copepod species, each species could be distinguished from the others very clearly, with only four exceptions: Mesocyclops dissimilis–Mesocyclops pehpeiensis (0.26% K2P distance) and Oithona davisae–Oithona similis (1.1%) in Cyclopoida, Ostrincola japonica–Pseudomyicola spinosus (1.5%) in Poecilostomatoida, and Hatschekia japonica–Caligus quadratus (5.2%) in Siphonostomatoida. Thus, the results strongly indicate that COI may be a useful tool for identifying copepod species and represent initial progress toward the construction of a comprehensive DNA barcode database for copepods inhabiting the Korean Peninsula. PMID:27383475
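The divergences above are reported as K2P (Kimura two-parameter) distances. As a minimal sketch of how such a distance is computed for one pair of aligned sequences, the function below applies the standard formula d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)), where P and Q are the proportions of transitions and transversions. The function name is hypothetical, and skipping gapped or ambiguous sites is an assumption of this sketch rather than something the abstract states.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1: str, seq2: str) -> float:
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: ignore gaps and ambiguous bases
        compared += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; every other change is a transversion.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError when the sequences are too divergent (saturated).
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))
```

Multiplying the result by 100 gives the percentage form used in the abstract.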
PriC-mediated DNA replication restart requires PriC complex formation with the single-stranded DNA-binding protein.

PubMed

Wessel, Sarah R; Marceau, Aimee H; Massoni, Shawn C; Zhou, Ruobo; Ha, Taekjip; Sandler, Steven J; Keck, James L

2013-06-14

Frequent collisions between cellular DNA replication complexes (replisomes) and obstacles such as damaged DNA or frozen protein complexes make DNA replication fork progression surprisingly sporadic. These collisions can lead to the ejection of replisomes prior to completion of replication, which, if left unrepaired, results in bacterial cell death. As such, bacteria have evolved DNA replication restart mechanisms that function to reload replisomes onto abandoned DNA replication forks. Here, we define a direct interaction between PriC, a key Escherichia coli DNA replication restart protein, and the single-stranded DNA-binding protein (SSB), a protein that is ubiquitously associated with DNA replication forks. PriC/SSB complex formation requires evolutionarily conserved residues from both proteins, including a pair of Arg residues from PriC and the C terminus of SSB. In vitro, disruption of the PriC/SSB interface by sequence changes in either protein blocks the first step of DNA replication restart, reloading of the replicative DnaB helicase onto an abandoned replication fork. Consistent with the critical role of PriC/SSB complex formation in DNA replication restart, PriC variants that cannot bind SSB are non-functional in vivo. Single-molecule experiments demonstrate that PriC binding to SSB alters SSB/DNA complexes, exposing single-stranded DNA and creating a platform for other proteins to bind. These data lead to a model in which PriC interaction with SSB remodels SSB/DNA structures at abandoned DNA replication forks to create a DNA structure that is competent for DnaB loading.
Constructing DNA Barcode Sets Based on Particle Swarm Optimization.

PubMed

Wang, Bin; Zheng, Xuedong; Zhou, Shihua; Zhou, Changjun; Wei, Xiaopeng; Zhang, Qiang; Wei, Ziqi

2018-01-01

Following the completion of the human genome project, a large amount of high-throughput bio-data was generated. To analyze these data, massively parallel sequencing, namely next-generation sequencing, was rapidly developed. DNA barcodes are used to identify the ownership between sequences and samples when they are attached at the beginning or end of sequencing reads. Constructing DNA barcode sets provides the candidate DNA barcodes for this application. To increase the accuracy of DNA barcode sets, a particle swarm optimization (PSO) algorithm has been modified and used to construct the DNA barcode sets in this paper. Compared with the extant results, some lower bounds of DNA barcode sets are improved. The results show that the proposed algorithm is effective in constructing DNA barcode sets.
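The abstract does not state which constraints the barcode sets must satisfy. A common requirement is a minimum pairwise Hamming distance, so that a limited number of sequencing errors cannot turn one barcode into another. The sketch below checks that property and builds a set with a simple greedy scan. It is not the particle swarm optimization described in the paper, and all names are hypothetical.

```python
from itertools import combinations, product

def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length barcodes differ."""
    if len(a) != len(b):
        raise ValueError("barcodes must have equal length")
    return sum(x != y for x, y in zip(a, b))

def min_pairwise_distance(barcodes: list[str]) -> int:
    """Smallest Hamming distance between any two barcodes in the set."""
    return min(hamming(a, b) for a, b in combinations(barcodes, 2))

def greedy_barcode_set(length: int, min_dist: int) -> list[str]:
    """Scan all sequences in lexicographic order, keeping each one that is
    at least min_dist away from every barcode kept so far."""
    chosen: list[str] = []
    for candidate in ("".join(p) for p in product("ACGT", repeat=length)):
        if all(hamming(candidate, kept) >= min_dist for kept in chosen):
            chosen.append(candidate)
    return chosen
```

The size of the greedy set is only a lower bound on the largest possible set for a given length and distance, which is the kind of bound an optimization method such as PSO aims to improve.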
Gene Discovery in Bladder Cancer Progression using cDNA Microarrays

PubMed Central

Sanchez-Carbayo, Marta; Socci, Nicholas D.; Lozano, Juan Jose; Li, Wentian; Charytonowicz, Elizabeth; Belbin, Thomas J.; Prystowsky, Michael B.; Ortiz, Angel R.; Childs, Geoffrey; Cordon-Cardo, Carlos

2003-01-01

To identify gene expression changes along progression of bladder cancer, we compared the expression profiles of early-stage and advanced bladder tumors using cDNA microarrays containing 17,842 known genes and expressed sequence tags. The application of bootstrapping techniques to hierarchical clustering segregated early-stage and invasive transitional carcinomas into two main clusters. Multidimensional analysis confirmed these clusters and more importantly, it separated carcinoma in situ from papillary superficial lesions and subgroups within early-stage and invasive tumors displaying different overall survival. Additionally, it recognized early-stage tumors showing gene profiles similar to invasive disease. Different techniques including standard t-test, single-gene logistic regression, and support vector machine algorithms were applied to identify relevant genes involved in bladder cancer progression. Cytokeratin 20, neuropilin-2, p21, and p33ING1 were selected among the top ranked molecular targets differentially expressed and validated by immunohistochemistry using tissue microarrays (n = 173). Their expression patterns were significantly associated with pathological stage, tumor grade, and altered retinoblastoma (RB) expression. Moreover, p33ING1 expression levels were significantly associated with overall survival. Analysis of the annotation of the most significant genes revealed the relevance of critical genes and pathways during bladder cancer progression, including the overexpression of oncogenic genes such as DEK in superficial tumors or immune response genes such as Cd86 antigen in invasive disease. Gene profiling successfully classified bladder tumors based on their progression and clinical outcome. The present study has identified molecular biomarkers of potential clinical significance and critical molecular targets associated with bladder cancer progression. PMID:12875971
The effect of silver nanoparticles on seasonal change in arctic tundra bacterial and fungal assemblages.

PubMed

Kumar, Niraj; Palmer, Gerald R; Shah, Vishal; Walker, Virginia K

2014-01-01

The impact of silver nanoparticles (NPs) and microparticles (MPs) on bacterial and fungal assemblages was studied in soils collected from a low arctic site. Two different concentrations (0.066% and 6.6%) of Ag NPs and Ag MPs were tested in microcosms that were exposed to temperatures mimicking a winter to summer transition. Toxicity was monitored by differential respiration, phospholipid fatty acid analysis, polymerase chain reaction-denaturing gradient gel electrophoresis and DNA sequencing. Notwithstanding the effect of Ag MPs, nanosilver had an obvious, additional impact on the microbial community, underscoring the importance of particle size in toxicity. This impact was evidenced by levels of differential respiration in 0.066% Ag NP-treated soil that were only half that of control soils, a decrease in signature bacterial fatty acids, and changes in both richness and evenness in bacterial and fungal DNA sequence assemblages. Prominent after Ag NP-treatment were Hypocreales fungi, which increased to 70%, from only 1% of fungal sequences under control conditions. Genera within this Order known for their antioxidant properties (Cordyceps/Isaria) dominated the fungal assemblage after NP addition. In contrast, sequences attributed to the nitrogen-fixing Rhizobiales bacteria appeared vulnerable to Ag NP-mediated toxicity. This combination of physiological, biochemical and molecular studies clearly demonstrates that Ag NPs can severely disrupt the natural seasonal progression of tundra assemblages.
A DNA Barcode Library for North American Ephemeroptera: Progress and Prospects

PubMed Central

Webb, Jeffrey M.; Jacobus, Luke M.; Funk, David H.; Zhou, Xin; Kondratieff, Boris; Geraci, Christy J.; DeWalt, R. Edward; Baird, Donald J.; Richard, Barton; Phillips, Iain; Hebert, Paul D. N.

2012-01-01

DNA barcoding of aquatic macroinvertebrates holds much promise as a tool for taxonomic research and for providing the reliable identifications needed for water quality assessment programs. A prerequisite for identification using barcodes is a reliable reference library. We gathered 4165 sequences from the barcode region of the mitochondrial cytochrome c oxidase subunit I gene representing 264 nominal and 90 provisional species of mayflies (Insecta: Ephemeroptera) from Canada, Mexico, and the United States. No species shared barcode sequences and all can be identified with barcodes with the possible exception of some Caenis. Minimum interspecific distances ranged from 0.3–24.7% (mean: 12.5%), while the average intraspecific divergence was 1.97%. The latter value was inflated by the presence of very high divergences in some taxa. In fact, nearly 20% of the species included two or three haplotype clusters showing greater than 5.0% sequence divergence and some values are as high as 26.7%. Many of the species with high divergences are polyphyletic and likely represent species complexes. Indeed, many of these polyphyletic species have numerous synonyms and individuals in some barcode clusters show morphological attributes characteristic of the synonymized species. In light of our findings, it is imperative that type or topotype specimens be sequenced to correctly associate barcode clusters with morphological species concepts and to determine the status of currently synonymized species. PMID:22666447
The Role of microRNA miR-101 in Prostate Cancer Progression

DTIC Science & Technology

2012-09-01

genome-wide mapping of PcG binding in human fibroblasts, human ES cells, mouse ES cells, and Drosophila 37-41. All of the studies demonstrated that...development. Mamm Genome 2002; 13(9): 493-503. 15. Simon J, Chiang A, Bender W, Shimell MJ, O’Connor M. Elements of the Drosophila bithorax complex... sequencing analysis of the miR-203 genomic region revealed cancer-specific DNA methylation in a region proximal to miR-203 in prostate cancer tissues
Fanconi anemia proteins in telomere maintenance.

PubMed

Sarkar, Jaya; Liu, Yie

2016-07-01

Mammalian chromosome ends are protected by nucleoprotein structures called telomeres. Telomeres ensure genome stability by preventing chromosome termini from being recognized as DNA damage. Telomere length homeostasis is essential for telomere maintenance because critical shortening or over-lengthening of telomeres may lead to a DNA damage response or a delay in DNA replication, and hence genome instability. Due to their repetitive DNA sequence, unique architecture, bound shelterin proteins, and high propensity to form alternate/secondary DNA structures, telomeres are like common fragile sites and pose an inherent challenge to the progression of the DNA replication, repair, and recombination apparatus. It is conceivable that the longer the telomeres are, the greater the severity of such challenges. Recent studies have linked excessively long telomeres with increased tumorigenesis. Here we discuss telomere abnormalities in a rare recessive chromosomal instability disorder called Fanconi Anemia and the role of the Fanconi Anemia pathway in telomere biology. Reports suggest that Fanconi Anemia proteins play a role in maintaining long telomeres, including processing telomeric joint molecule intermediates. We speculate that ablation of the Fanconi Anemia pathway would lead to inadequate resolution of aberrant structural barriers at excessively long telomeres, thereby causing replicative burden on the cell. Published by Elsevier B.V.
Genetic instability associated with loop or stem–loop structures within transcription units can be independent of nucleotide excision repair

PubMed Central

Burns, John A; Chowdhury, Moinuddin A; Cartularo, Laura; Berens, Christian; Scicchitano, David A

2018-01-01

Simple sequence repeats (SSRs) are found throughout the genome, and under some conditions can change in length over time. Germline and somatic expansions of trinucleotide repeats are associated with a series of severely disabling illnesses, including Huntington's disease. The underlying mechanisms that effect SSR expansions and contractions have been experimentally elusive, but models suggesting a role for DNA repair have been proposed, in particular the involvement of transcription-coupled nucleotide excision repair (TCNER) that removes transcription-blocking DNA damage from the transcribed strand of actively expressed genes. If the formation of secondary DNA structures that are associated with SSRs were to block RNA polymerase progression, TCNER could be activated, resulting in the removal of the aberrant structure and a concomitant change in the region's length. To test this, TCNER activity in primary human fibroblasts was assessed on defined DNA substrates containing extrahelical DNA loops that lack discernible internal base pairs or DNA stem–loops that contain base pairs within the stem. The results show that both structures impede transcription elongation, but there is no corresponding evidence that nucleotide excision repair (NER) or TCNER operates to remove them. PMID:29474673
Winnowing DNA for rare sequences: highly specific sequence and methylation based enrichment.

PubMed

Thompson, Jason D; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

2012-01-01

Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue.
Pulling out the 1%: Whole-Genome Capture for the Targeted Enrichment of Ancient DNA Sequencing Libraries

PubMed Central

Carpenter, Meredith L.; Buenrostro, Jason D.; Valdiosera, Cristina; Schroeder, Hannes; Allentoft, Morten E.; Sikora, Martin; Rasmussen, Morten; Gravel, Simon; Guillén, Sonia; Nekhrizov, Georgi; Leshtakov, Krasimir; Dimitrova, Diana; Theodossiev, Nikola; Pettener, Davide; Luiselli, Donata; Sandoval, Karla; Moreno-Estrada, Andrés; Li, Yingrui; Wang, Jun; Gilbert, M. Thomas P.; Willerslev, Eske; Greenleaf, William J.; Bustamante, Carlos D.

2013-01-01

Most ancient specimens contain very low levels of endogenous DNA, precluding the shotgun sequencing of many interesting samples because of cost. Ancient DNA (aDNA) libraries often contain <1% endogenous DNA, with the majority of sequencing capacity taken up by environmental DNA. Here we present a capture-based method for enriching the endogenous component of aDNA sequencing libraries. By using biotinylated RNA baits transcribed from genomic DNA libraries, we are able to capture DNA fragments from across the human genome. We demonstrate this method on libraries created from four Iron Age and Bronze Age human teeth from Bulgaria, as well as bone samples from seven Peruvian mummies and a Bronze Age hair sample from Denmark. Prior to capture, shotgun sequencing of these libraries yielded an average of 1.2% of reads mapping to the human genome (including duplicates). After capture, this fraction increased substantially, with up to 59% of reads mapped to human and enrichment ranging from 6- to 159-fold. Furthermore, we maintained coverage of the majority of regions sequenced in the precapture library. Intersection with the 1000 Genomes Project reference panel yielded an average of 50,723 SNPs (range 3,062–147,243) for the postcapture libraries sequenced with 1 million reads, compared with 13,280 SNPs (range 217–73,266) for the precapture libraries, increasing resolution in population genetic analyses. Our whole-genome capture approach makes it less costly to sequence aDNA from specimens containing very low levels of endogenous DNA, enabling the analysis of larger numbers of samples. PMID:24568772
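The fold-enrichment figures above compare the fraction of reads mapping to the human genome before and after capture. The arithmetic can be sketched as follows, assuming enrichment is defined as the simple ratio of the two mapped fractions. The function names are hypothetical and the read counts in the usage note are invented for illustration, not data from the study.

```python
def mapped_fraction(mapped_reads: int, total_reads: int) -> float:
    """Fraction of sequenced reads that map to the target genome."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return mapped_reads / total_reads

def fold_enrichment(pre_fraction: float, post_fraction: float) -> float:
    """Ratio of the post-capture mapped fraction to the pre-capture one."""
    if pre_fraction <= 0:
        raise ValueError("pre_fraction must be positive")
    return post_fraction / pre_fraction
```

For instance, a hypothetical library with 12,000 of 1,000,000 reads mapping before capture and 300,000 of 1,000,000 after capture would show roughly 25-fold enrichment.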
Biological nanopore MspA for DNA sequencing

NASA Astrophysics Data System (ADS)

Manrao, Elizabeth A.

Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. 
With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.
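The description of polymerase-driven stepping implies that the recorded current can be reduced to a series of discrete levels. A minimal sketch of that reduction follows, assuming a simple threshold rule: a new level starts whenever a sample departs from the running mean of the current level by more than a chosen amount. The function name, the threshold rule, and the sample values in the usage note are hypothetical. The dissertation's actual level-finding method is not described in the abstract, and mapping levels to bases is not attempted here.

```python
def segment_levels(trace: list[float], threshold: float) -> list[float]:
    """Split an ionic-current trace into consecutive levels.

    Returns the mean current of each detected level, in order of appearance.
    """
    if not trace:
        return []
    levels: list[float] = []
    current = [trace[0]]
    for sample in trace[1:]:
        mean = sum(current) / len(current)
        if abs(sample - mean) > threshold:
            levels.append(mean)   # close the level that just ended
            current = [sample]    # start a new level at this sample
        else:
            current.append(sample)
    levels.append(sum(current) / len(current))
    return levels
```

Applied to the toy trace `[1.0, 1.0, 1.0, 5.0, 5.0, 2.0, 2.0]` with a threshold of 1.0, the function returns three levels with means 1.0, 5.0 and 2.0. Real traces are noisy, so a practical implementation would need a statistically grounded change-point test.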
Dynamic molecular analysis and clinical correlates of tumor evolution within a phase II trial of panitumumab-based therapy in metastatic colorectal cancer.

PubMed

Siena, S; Sartore-Bianchi, A; Garcia-Carbonero, R; Karthaus, M; Smith, D; Tabernero, J; Van Cutsem, E; Guan, X; Boedigheimer, M; Ang, A; Twomey, B; Bach, B A; Jung, A S; Bardelli, A

2018-01-01

Mutations in rat sarcoma (RAS) genes may be a mechanism of secondary resistance in epidermal growth factor receptor inhibitor-treated patients. Tumor-tissue biopsy testing has been the standard for evaluating mutational status; however, plasma testing of cell-free DNA has been shown to be a more sensitive method for detecting clonal evolution. Archival pre- and post-treatment tumor biopsy samples from a phase II study of panitumumab in combination with irinotecan in patients with metastatic colorectal cancer (mCRC) that also collected plasma samples before, during, and after treatment were analyzed for emergence of mutations during/post-treatment by next-generation sequencing and BEAMing. The rate of emergence of tumor tissue RAS mutations was 9.5% by next-generation sequencing (n = 21) and 6.3% by BEAMing (n = 16). Plasma testing of cell-free DNA by BEAMing revealed a mutant RAS emergence rate of 36.7% (n = 39). Exploratory outcomes analysis of plasma samples indicated that patients who had emergent RAS mutations at progression had similar median progression-free survival to those patients who remained wild-type at progression. Serial analysis of plasma samples showed that the first detected emergence of RAS mutations preceded progression by a median of 3.6 months (range, -0.3 to 7.5 months) and that there did not appear to be a mutant RAS allele frequency threshold that could predict near-term outcomes. This first prospective analysis in mCRC showed that serial plasma biopsies are more inclusive than tissue biopsies for evaluating global tumor heterogeneity; however, the clinical utility of plasma testing in mCRC remains to be further explored. NCT00891930. © The Author 2017. Published by Oxford University Press on behalf of the European Society for Medical Oncology.

Effects of sequence on DNA wrapping around histones

NASA Astrophysics Data System (ADS)

Ortiz, Vanessa

2011-03-01

A central question in biophysics is whether the sequence of a DNA strand affects its mechanical properties. In epigenetics, these are thought to influence nucleosome positioning and gene expression. Theoretical and experimental attempts to answer this question have been hindered by an inability to directly resolve DNA structure and dynamics at the base-pair level. In our previous studies we used a detailed model of DNA to measure the effects of sequence on the stability of naked DNA under bending. Sequence was shown to influence DNA's ability to form kinks, which arise when certain motifs slide past others to form non-native contacts. Here, we have now included histone-DNA interactions to see if the results obtained for naked DNA are transferable to the problem of nucleosome positioning. Different DNA sequences interacting with the histone protein complex are studied, and their equilibrium and mechanical properties are compared among themselves and with the naked case. NLM training grant to the Computation and Informatics in Biology and Medicine Training Program (NLM T15LM007359).
A high-throughput and quantitative method to assess the mutagenic potential of translesion DNA synthesis

PubMed Central

Taggart, David J.; Camerlengo, Terry L.; Harrison, Jason K.; Sherrer, Shanen M.; Kshetry, Ajay K.; Taylor, John-Stephen; Huang, Kun; Suo, Zucai

2013-01-01

Cellular genomes are constantly damaged by endogenous and exogenous agents that covalently and structurally modify DNA to produce DNA lesions. Although most lesions are mended by various DNA repair pathways in vivo, a significant number of damage sites persist during genomic replication. Our understanding of the mutagenic outcomes derived from these unrepaired DNA lesions has been hindered by the low throughput of existing sequencing methods. Therefore, we have developed a cost-effective high-throughput short oligonucleotide sequencing assay that uses next-generation DNA sequencing technology for the assessment of the mutagenic profiles of translesion DNA synthesis catalyzed by any error-prone DNA polymerase. The vast amount of sequencing data produced were aligned and quantified by using our novel software. As an example, the high-throughput short oligonucleotide sequencing assay was used to analyze the types and frequencies of mutations upstream, downstream and at a site-specifically placed cis–syn thymidine–thymidine dimer generated individually by three lesion-bypass human Y-family DNA polymerases. PMID:23470999
Expanding the functionality and applications of nanopore sensors

NASA Astrophysics Data System (ADS)

Venta, Kimberly E.

Nanopore sensors have developed into powerful tools for single-molecule studies since their inception two decades ago. Nanopore sensors function as nanoscale Coulter counters, by monitoring ionic current modulations as particles pass through a nanopore. While nanopore sensors can be used to study any nanoscale particle, their most notable application is as a low cost, fast alternative to current DNA sequencing technologies. In recent years, significant progress has been made toward the goal of nanopore-based DNA sequencing, which requires an ambitious combination of a low-noise and high-bandwidth nanopore measurement system and spatial resolution. In this dissertation, nanopore sensors in thin membranes are developed to improve dimensional resolution, and these membranes are used in parallel with a high-bandwidth amplifier. Using this nanopore sensor system, the signals of three DNA homopolymers are differentiated for the first time in solid-state nanopores. The nanopore noise is also reduced through the addition of a layer of SU8, a spin-on polymer, to the supporting chip structure. By increasing the temporal and spatial resolution of nanopore sensors, studies of shorter molecules are now possible. Nanopore sensors are beginning to be used for the study and characterization of nanoparticles. Nanoparticles have found many uses from biomedical imaging to next-generation solar cells. However, further insights into the formation and characterization of nanoparticles would aid in developing improved synthesis methods leading to more effective and customizable nanoparticles. This dissertation presents two methods of employing nanopore sensors to benefit nanoparticle characterization and fabrication. Nanopores were used to study the formation of individual nanoparticles and serve as nanoparticle growth templates that could be exploited to create custom nanoparticle arrays. Additionally, nanopore sensors were used to characterize the surface charge density of anisotropic nanopores, which previously could not be reliably measured. Current nanopore sensor resolution levels have facilitated innovative research on nanoscale systems, including studies of DNA and nanoparticle characterization. Further nanopore system improvements will enable vastly improved DNA sequencing capabilities and open the door to additional nanopore sensing applications.
Combinatorial Pooling Enables Selective Sequencing of the Barley Gene Space

PubMed Central

Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R.; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J.

2013-01-01

For the vast majority of species, including many economically or ecologically important organisms, progress in biological research is hampered by the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach de novo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundreds of millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding. PMID:23592960
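The deconvolution idea in the abstract above can be illustrated with a toy sketch. It assumes a much simpler design than the published protocol: each clone is placed in its own unique combination of pools, and a read seen in exactly that combination is assigned to that clone. The function names, the pool counts and the clone labels are hypothetical, and overlapping clones and sequencing errors, which a real pipeline has to handle, are ignored.

```python
from itertools import combinations


def assign_signatures(clones, n_pools, pools_per_clone):
    """Give every clone a distinct set of pools (its pooling signature)."""
    signatures = combinations(range(n_pools), pools_per_clone)
    design = {clone: frozenset(sig) for clone, sig in zip(clones, signatures)}
    if len(design) < len(clones):
        raise ValueError("not enough pool combinations for all clones")
    return design


def deconvolve(observed_pools, design):
    """Return the clones whose signature equals the pools a read was seen in."""
    observed = frozenset(observed_pools)
    return [clone for clone, sig in design.items() if sig == observed]


# Hypothetical example: 4 clones, 4 pools, each clone present in 2 pools.
design = assign_signatures(["BAC1", "BAC2", "BAC3", "BAC4"], n_pools=4, pools_per_clone=2)
print(deconvolve([2, 0], design))  # a read found in pools 0 and 2
```

With n pools and k pools per clone, this toy design can label up to "n choose k" clones, which is why pooling needs far fewer libraries than one barcode per clone.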
Combinatorial pooling enables selective sequencing of the barley gene space.

PubMed

Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J

2013-04-01

For the vast majority of species, including many economically or ecologically important organisms, progress in biological research is hampered by the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach de novo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundreds of millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding.
An extended sequence specificity for UV-induced DNA damage.

PubMed

Chung, Long H; Murray, Vincent

2018-01-01

The sequence specificity of UV-induced DNA damage was determined with a higher precision and accuracy than previously reported. UV light induces two major damage adducts: cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). Employing capillary electrophoresis with laser-induced fluorescence and taking advantage of the distinct properties of the CPDs and 6-4PPs, we studied the sequence specificity of UV-induced DNA damage in a purified DNA sequence using two approaches: end-labelling and a polymerase stop/linear amplification assay. A mitochondrial DNA sequence that contained a random nucleotide composition was employed as the target DNA sequence. With previous methodology, the UV sequence specificity was determined at a dinucleotide or trinucleotide level; however, in this paper, we have extended the UV sequence specificity to a hexanucleotide level. With the end-labelling technique (for 6-4PPs), the consensus sequence was found to be 5'-GCTC*AC (where C* is the breakage site); while with the linear amplification procedure, it was 5'-TCTT*AC. With end-labelling, the dinucleotide frequency of occurrence was highest for 5'-TC*, 5'-TT* and 5'-CC*; whereas it was 5'-TT* for linear amplification. The influence of neighbouring nucleotides on the degree of UV-induced DNA damage was also examined. The core sequences consisted of pyrimidine nucleotides 5'-CTC* and 5'-CTT* while an A at position "1" and C at position "2" enhanced UV-induced DNA damage. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
Multiplex analysis of DNA

DOEpatents

Church, George M.; Kieffer-Higgins, Stephen

1992-01-01

This invention features vectors and a method for sequencing DNA. The method includes the steps of: a) ligating the DNA into a vector comprising a tag sequence, the tag sequence includes at least 15 bases, wherein the tag sequence will not hybridize to the DNA under stringent hybridization conditions and is unique in the vector, to form a hybrid vector, b) treating the hybrid vector in a plurality of vessels to produce fragments comprising the tag sequence, wherein the fragments differ in length and terminate at a fixed known base or bases, wherein the fixed known base or bases differs in each vessel, c) separating the fragments from each vessel according to their size, d) hybridizing the fragments with an oligonucleotide able to hybridize specifically with the tag sequence, and e) detecting the pattern of hybridization of the tag sequence, wherein the pattern reflects the nucleotide sequence of the DNA.
BiQ Analyzer HT: locus-specific analysis of DNA methylation by high-throughput bisulfite sequencing

PubMed Central

Lutsik, Pavlo; Feuerbach, Lars; Arand, Julia; Lengauer, Thomas; Walter, Jörn; Bock, Christoph

2011-01-01

Bisulfite sequencing is a widely used method for measuring DNA methylation in eukaryotic genomes. The assay provides single-base pair resolution and, given sufficient sequencing depth, its quantitative accuracy is excellent. High-throughput sequencing of bisulfite-converted DNA can be applied either genome wide or targeted to a defined set of genomic loci (e.g. using locus-specific PCR primers or DNA capture probes). Here, we describe BiQ Analyzer HT (http://biq-analyzer-ht.bioinf.mpi-inf.mpg.de/), a user-friendly software tool that supports locus-specific analysis and visualization of high-throughput bisulfite sequencing data. The software facilitates the shift from time-consuming clonal bisulfite sequencing to the more quantitative and cost-efficient use of high-throughput sequencing for studying locus-specific DNA methylation patterns. In addition, it is useful for locus-specific visualization of genome-wide bisulfite sequencing data. PMID:21565797
A DNA sequence analysis package for the IBM personal computer.

PubMed Central

Lagrimini, L M; Brentano, S T; Donelson, J E

1984-01-01

We present here a collection of DNA sequence analysis programs, called "PC Sequence" (PCS), which are designed to run on the IBM Personal Computer (PC). These programs are written in IBM PC compiled BASIC and take full advantage of the IBM PC's speed, error handling, and graphics capabilities. For a modest initial expense in hardware any laboratory can use these programs to quickly perform computer analysis on DNA sequences. They are written with the novice user in mind and require very little training or previous experience with computers. Also provided are a text editing program for creating and modifying DNA sequence files and a communications program which enables the PC to communicate with and collect information from mainframe computers and DNA sequence databases. PMID:6546433
Genomic sequencing of Pleistocene cave bears

DOE Office of Scientific and Technical Information (OSTI.GOV)

Noonan, James P.; Hofreiter, Michael; Smith, Doug

2005-04-01

Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of ~1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.
Characterization of North American Armillaria species: Genetic relationships determined by ribosomal DNA sequences and AFLP markers

Treesearch

M. -S. Kim; N. B. Klopfenstein; J. W. Hanna; G. I. McDonald

2006-01-01

Phylogenetic and genetic relationships among 10 North American Armillaria species were analysed using sequence data from ribosomal DNA (rDNA), including intergenic spacer (IGS-1), internal transcribed spacers with associated 5.8S (ITS + 5.8S), and nuclear large subunit rDNA (nLSU), and amplified fragment length polymorphism (AFLP) markers. Based on rDNA sequence data,...
Fractal landscape analysis of DNA walks

NASA Technical Reports Server (NTRS)

Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H. E.

1992-01-01

By mapping nucleotide sequences onto a "DNA walk", we uncovered remarkably long-range power law correlations [Nature 356 (1992) 168] that imply a new scale invariant property of DNA. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences, but not in cDNA sequences or intron-less genes. In this paper, we present more explicit evidence to support our findings.
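As a rough illustration of what mapping a sequence onto a "DNA walk" can look like, the sketch below builds a one-dimensional walk from a nucleotide string. The step rule used here (+1 for a pyrimidine, -1 for a purine) is an assumption that the abstract itself does not spell out, and the function name dna_walk is hypothetical.

```python
from itertools import accumulate


def dna_walk(seq):
    """Return the cumulative displacement of a one-dimensional DNA walk.

    Assumed step rule: +1 for a pyrimidine (C, T), -1 for a purine (A, G).
    """
    steps = []
    for base in seq.upper():
        if base in "CT":
            steps.append(1)
        elif base in "AG":
            steps.append(-1)
        else:
            raise ValueError(f"unexpected base: {base!r}")
    return list(accumulate(steps))


print(dna_walk("ACGTTC"))  # [-1, 0, -1, 0, 1, 2]
```

Long-range correlation analyses then look at how the fluctuation of this displacement grows with window length; that step is not shown here.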
[Genome-scale sequence data processing and epigenetic analysis of DNA methylation].

PubMed

Wang, Ting-Zhang; Shan, Gao; Xu, Jian-Hong; Xue, Qing-Zhong

2013-06-01

BS-Seq is a recently developed approach for detecting cytosine DNA methylation (mC) and for genome-scale DNA methylation profiling; it is based on bisulfite conversion of genomic DNA combined with next-generation sequencing. The method not only provides insight into differences in genome-scale DNA methylation among organisms, but also reveals the conservation of DNA methylation in all contexts and the nucleotide preference for different genomic regions, including genes, exons, and repetitive DNA sequences. It helps in understanding the epigenetic impact of cytosine DNA methylation on the regulation of gene expression and on maintaining the silence of repetitive sequences, such as transposable elements. In this paper, we introduce the preprocessing steps for DNA methylation data, in which cytosine (C) and guanine (G) in the reference sequence are converted to thymine (T) and adenine (A), respectively, and cytosine in reads is converted to thymine. We also comprehensively review the main content of DNA methylation analysis on the genomic scale: (1) cytosine methylation in different sequence contexts; (2) the distribution of genomic methylcytosine; (3) DNA methylation context and nucleotide preference; (4) DNA-protein interaction sites of DNA methylation; (5) the degree of cytosine methylation in the different structural elements of genes. DNA methylation analysis provides a powerful tool for epigenome studies in human and other species and for the study of gene-environment interactions, and lays the theoretical basis for further development of disease diagnostics and therapeutics in humans.
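The preprocessing step described in this abstract (in silico conversion of the reference and of the reads) can be sketched in a few lines. This is a minimal illustration, not the pipeline reviewed in the paper; the helper names c_to_t and g_to_a and the example sequences are made up, and exact string matching stands in for a real short-read aligner.

```python
def c_to_t(seq):
    """Convert every cytosine to thymine (forward-strand conversion)."""
    return seq.upper().replace("C", "T")


def g_to_a(seq):
    """Convert every guanine to adenine (reverse-strand conversion)."""
    return seq.upper().replace("G", "A")


reference = "ACGTCGGA"
ref_ct = c_to_t(reference)  # reference with C converted to T
ref_ga = g_to_a(reference)  # reference with G converted to A

# A bisulfite read in which the first C stayed C (methylated)
# and the second C was converted to T (unmethylated).
read = "ACGTTGGA"
read_ct = c_to_t(read)

# After conversion the read matches the C-to-T reference regardless of
# methylation state; the original read is consulted afterwards to call it.
print(ref_ct.find(read_ct))  # 0
```

Converting both sides removes the C/T ambiguity during alignment, so methylated and unmethylated reads map to the same place.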
Characterization of biochemical properties of Bacillus subtilis RecQ helicase.

PubMed

Qin, Wei; Liu, Na-Nv; Wang, Lijun; Zhou, Min; Ren, Hua; Bugnard, Elisabeth; Liu, Jie-Lin; Zhang, Lin-Hu; Vendôme, Jeremie; Hu, Jin-Shan; Xi, Xu Guang

2014-12-01

RecQ family helicases function as safeguards of the genome. Unlike Escherichia coli, the Gram-positive Bacillus subtilis bacterium possesses two RecQ-like homologues, RecQ[Bs] and RecS, which are required for the repair of DNA double-strand breaks. RecQ[Bs] also binds to the forked DNA to ensure a smooth progression of the cell cycle. Here we present the first biochemical analysis of recombinant RecQ[Bs]. RecQ[Bs] binds weakly to single-stranded DNA (ssDNA) and blunt-ended double-stranded DNA (dsDNA) but strongly to forked dsDNA. The protein exhibits a DNA-stimulated ATPase activity and ATP- and Mg(2+)-dependent DNA helicase activity with a 3' → 5' polarity. Molecular modeling shows that RecQ[Bs] shares high sequence and structure similarity with E. coli RecQ. Surprisingly, RecQ[Bs] resembles the truncated Saccharomyces cerevisiae Sgs1 and human RecQ helicases more than RecQ[Ec] with regard to its enzymatic activities. Specifically, RecQ[Bs] unwinds forked dsDNA and DNA duplexes with a 3'-overhang but is inactive on blunt-ended dsDNA and 5'-overhung duplexes. Interestingly, RecQ[Bs] unwinds blunt-ended DNA with structural features, including nicks, gaps, 5'-flaps, Kappa joints, synthetic replication forks, and Holliday junctions. We discuss these findings in the context of RecQ[Bs]'s possible functions in preserving genomic stability. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Extracting DNA words based on the sequence features: non-uniform distribution and integrity.

PubMed

Li, Zhi; Cao, Hongyan; Cui, Yuehua; Zhang, Yanbo

2016-01-25

DNA sequence can be viewed as an unknown language with words as its functional units. Given that most sequence alignment algorithms such as the motif discovery algorithms depend on the quality of background information about sequences, it is necessary to develop an ab initio algorithm for extracting the "words" based only on the DNA sequences. We considered that non-uniform distribution and integrity were two important features of a word, based on which we developed an ab initio algorithm to extract "DNA words" that have potential functional meaning. A Kolmogorov-Smirnov test was used for consistency test of uniform distribution of DNA sequences, and the integrity was judged by the sequence and position alignment. Two random base sequences were adopted as negative control, and an English book was used as positive control to verify our algorithm. We applied our algorithm to the genomes of Saccharomyces cerevisiae and 10 strains of Escherichia coli to show the utility of the methods. The results provide strong evidence that the algorithm is a promising tool for building a DNA dictionary ab initio. Our method provides a fast way for large scale screening of important DNA elements and offers potential insights into the understanding of a genome.
CpG PatternFinder: a Windows-based utility program for easy and rapid identification of the CpG methylation status of DNA.

PubMed

Xu, Yi-Hua; Manoharan, Herbert T; Pitot, Henry C

2007-09-01

The bisulfite genomic sequencing technique is one of the most widely used techniques to study sequence-specific DNA methylation because of its unambiguous ability to reveal DNA methylation status to the order of a single nucleotide. One characteristic feature of the bisulfite genomic sequencing technique is that a number of sample sequence files will be produced from a single DNA sample. The PCR products of bisulfite-treated DNA samples cannot be sequenced directly because they are heterogeneous in nature; therefore they should be cloned into suitable plasmids and then sequenced. This procedure generates an enormous number of sample DNA sequence files as well as adding extra bases belonging to the plasmids to the sequence, which will cause problems in the final sequence comparison. Finding the methylation status for each CpG in each sample sequence is not an easy job. As a result CpG PatternFinder was developed for this purpose. The main functions of the CpG PatternFinder are: (i) to analyze the reference sequence to obtain CpG and non-CpG-C residue position information; (ii) to tailor sample sequence files (delete insertions and mark deletions from the sample sequence files) based on a configuration of ClustalW multiple alignment; (iii) to align sample sequence files with a reference file to obtain bisulfite conversion efficiency and CpG methylation status; and (iv) to produce graphics, highlighted aligned sequence text and a summary report which can be easily exported to Microsoft Office suite. CpG PatternFinder is designed to operate cooperatively with BioEdit, a freeware program available on the internet. It can handle up to 100 files of sample DNA sequences simultaneously, and the total CpG pattern analysis process can be finished in minutes. CpG PatternFinder is an ideal software tool for DNA methylation studies to determine the differential methylation pattern in a large number of individuals in a population. Previously we developed the CpG Analyzer program; CpG PatternFinder is our further effort to create software tools for DNA methylation studies.
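Functions (i) and (iii) listed in this abstract can be illustrated with a small sketch. It is not the CpG PatternFinder code: the function name cpg_report is hypothetical, the read is assumed to be already aligned to the reference without gaps, and non-CpG cytosines are assumed to be unmethylated so that their C-to-T conversion rate can serve as the conversion efficiency.

```python
def cpg_report(reference, read):
    """Classify each reference CpG and estimate bisulfite conversion efficiency.

    Assumes `read` is aligned to `reference` position by position (no gaps).
    """
    reference, read = reference.upper(), read.upper()
    if len(reference) != len(read):
        raise ValueError("reference and read must be aligned to equal length")
    cpg_status = {}
    converted = informative = 0
    for i, base in enumerate(reference):
        if base != "C":
            continue
        if reference[i + 1:i + 2] == "G":
            # CpG cytosine: a retained C means methylated, a T means unmethylated.
            if read[i] == "C":
                cpg_status[i] = "methylated"
            elif read[i] == "T":
                cpg_status[i] = "unmethylated"
            else:
                cpg_status[i] = "unknown"
        elif read[i] in ("C", "T"):
            # Non-CpG cytosine: used only to measure conversion efficiency.
            informative += 1
            converted += read[i] == "T"
    efficiency = converted / informative if informative else None
    return cpg_status, efficiency


status, efficiency = cpg_report("ACGTCATCGC", "ACGTTATTGT")
print(status)      # {1: 'methylated', 7: 'unmethylated'}
print(efficiency)  # 1.0
```

Positions are zero-based offsets into the reference; a tool working on cloned sequences would first trim plasmid bases and resolve insertions and deletions, as function (ii) in the abstract describes.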
DNA-based watermarks using the DNA-Crypt algorithm.

PubMed

Heider, Dominik; Barnekow, Angelika

2007-05-29

The aim of this paper is to demonstrate the application of watermarks based on DNA sequences to identify the unauthorized use of genetically modified organisms (GMOs) protected by patents. Predicted mutations in the genome can be corrected by the DNA-Crypt program leaving the encrypted information intact. Existing DNA cryptographic and steganographic algorithms use synthetic DNA sequences to store binary information; however, although these sequences can be used for authentication, they may change the target DNA sequence when introduced into living organisms. The DNA-Crypt algorithm and image steganography are based on the same watermark-hiding principle, namely using the least significant base in case of DNA-Crypt and the least significant bit in case of the image steganography. It can be combined with binary encryption algorithms like AES, RSA or Blowfish. DNA-Crypt is able to correct mutations in the target DNA with several mutation correction codes such as the Hamming-code or the WDH-code. Mutations which can occur infrequently may destroy the encrypted information; however, an integrated fuzzy controller decides on a set of heuristics based on three input dimensions, and recommends whether or not to use a correction code. These three input dimensions are the length of the sequence, the individual mutation rate and the stability over time, which is represented by the number of generations. In silico experiments using the Ypt7 in Saccharomyces cerevisiae show that the DNA watermarks produced by DNA-Crypt do not alter the translation of mRNA into protein. The program is able to store watermarks in living organisms and can maintain the original information by correcting mutations itself. Pairwise or multiple sequence alignments show that DNA-Crypt produces few mismatches between the sequences similar to all steganographic algorithms.
DNA-based watermarks using the DNA-Crypt algorithm

PubMed Central

Heider, Dominik; Barnekow, Angelika

2007-01-01

Background The aim of this paper is to demonstrate the application of watermarks based on DNA sequences to identify the unauthorized use of genetically modified organisms (GMOs) protected by patents. Predicted mutations in the genome can be corrected by the DNA-Crypt program leaving the encrypted information intact. Existing DNA cryptographic and steganographic algorithms use synthetic DNA sequences to store binary information; however, although these sequences can be used for authentication, they may change the target DNA sequence when introduced into living organisms. Results The DNA-Crypt algorithm and image steganography are based on the same watermark-hiding principle, namely using the least significant base in case of DNA-Crypt and the least significant bit in case of the image steganography. It can be combined with binary encryption algorithms like AES, RSA or Blowfish. DNA-Crypt is able to correct mutations in the target DNA with several mutation correction codes such as the Hamming-code or the WDH-code. Mutations which can occur infrequently may destroy the encrypted information; however, an integrated fuzzy controller decides on a set of heuristics based on three input dimensions, and recommends whether or not to use a correction code. These three input dimensions are the length of the sequence, the individual mutation rate and the stability over time, which is represented by the number of generations. In silico experiments using the Ypt7 in Saccharomyces cerevisiae show that the DNA watermarks produced by DNA-Crypt do not alter the translation of mRNA into protein. Conclusion The program is able to store watermarks in living organisms and can maintain the original information by correcting mutations itself. Pairwise or multiple sequence alignments show that DNA-Crypt produces few mismatches between the sequences similar to all steganographic algorithms. PMID:17535434
Conserved Sequences at the Origin of Adenovirus DNA Replication

PubMed Central

Stillman, Bruce W.; Topp, William C.; Engler, Jeffrey A.

1982-01-01

The origin of adenovirus DNA replication lies within an inverted sequence repetition at either end of the linear, double-stranded viral DNA. Initiation of DNA replication is primed by a deoxynucleoside that is covalently linked to a protein, which remains bound to the newly synthesized DNA. We demonstrate that virion-derived DNA-protein complexes from five human adenovirus serological subgroups (A to E) can act as a template for both the initiation and the elongation of DNA replication in vitro, using nuclear extracts from adenovirus type 2 (Ad2)-infected HeLa cells. The heterologous template DNA-protein complexes were not as active as the homologous Ad2 DNA, most probably due to inefficient initiation by Ad2 replication factors. In an attempt to identify common features which may permit this replication, we have also sequenced the inverted terminal repeated DNA from human adenovirus serotypes Ad4 (group E), Ad9 and Ad10 (group D), and Ad31 (group A), and we have compared these to previously determined sequences from Ad2 and Ad5 (group C), Ad7 (group B), and Ad12 and Ad18 (group A) DNA. In all cases, the sequence around the origin of DNA replication can be divided into two structural domains: a proximal A · T-rich region which is partially conserved among these serotypes, and a distal G · C-rich region which is less well conserved. The G · C-rich region contains sequences similar to sequences present in papovavirus replication origins. The two domains may reflect a dual mechanism for initiation of DNA replication: adenovirus-specific protein priming of replication, and subsequent utilization of this primer by host replication factors for completion of DNA synthesis. PMID:7143575
Hardware Acceleration Of Multi-Deme Genetic Algorithm for DNA Codeword Searching

DTIC Science & Technology

2008-01-01

C and G are complementary to each other. A Watson-Crick complement of a DNA sequence is another DNA sequence which replaces all the A with T or vice...versa and replaces all the T with A or vice versa, and also switches the 5’ and 3’ ends. A DNA sequence binds most stably with its Watson-Crick ...bind with 5 Watson-Crick pairs. The length of the longest complementary sequence between two flexible DNA strands, A and B, is the same as the
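The Watson-Crick complement mentioned in this excerpt is commonly computed as a reverse complement: swap A with T and C with G, then reverse the string so that the 5' and 3' ends are exchanged. The sketch below shows that standard operation; the function name is hypothetical and it is not taken from the report's hardware implementation.

```python
# Translation table for the base pairing A<->T and C<->G.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def watson_crick_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


print(watson_crick_complement("AACGT"))  # ACGTT
```

Applying the function twice returns the original sequence, which is a quick sanity check on any implementation.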

Combined subtraction hybridization and polymerase chain reaction amplification procedure for isolation of strain-specific Rhizobium DNA sequences.

PubMed Central

Bjourson, A J; Stone, C E; Cooper, J E

1992-01-01

A novel subtraction hybridization procedure, incorporating a combination of four separation strategies, was developed to isolate unique DNA sequences from a strain of Rhizobium leguminosarum bv. trifolii. Sau3A-digested DNA from this strain, i.e., the probe strain, was ligated to a linker and hybridized in solution with an excess of pooled subtracter DNA from seven other strains of the same biovar which had been restricted, ligated to a different, biotinylated, subtracter-specific linker, and amplified by polymerase chain reaction to incorporate dUTP. Subtracter DNA and subtracter-probe hybrids were removed by phenol-chloroform extraction of a streptavidin-biotin-DNA complex. NENSORB chromatography of the sequences remaining in the aqueous layer captured biotinylated subtracter DNA which may have escaped removal by phenol-chloroform treatment. Any traces of contaminating subtracter DNA were removed by digestion with uracil DNA glycosylase. Finally, remaining sequences were amplified by polymerase chain reaction with a probe strain-specific primer, labelled with 32P, and tested for specificity in dot blot hybridizations against total genomic target DNA from each strain in the subtracter pool. Two rounds of subtraction-amplification were sufficient to remove cross-hybridizing sequences and to give a probe which hybridized only with homologous target DNA. The method is applicable to the isolation of DNA and RNA sequences from both procaryotic and eucaryotic cells. PMID:1637166
Protein alignment algorithms with an efficient backtracking routine on multiple GPUs.

PubMed

Blazewicz, Jacek; Frohmberg, Wojciech; Kierzynka, Michal; Pesch, Erwin; Wojciechowski, Pawel

2011-05-20

Pairwise sequence alignment methods are widely used in biological research. The increasing number of sequences is perceived as one of the upcoming challenges for sequence alignment methods in the near future. To overcome this challenge several GPU (Graphics Processing Unit) computing approaches have been proposed lately. These solutions show the great potential of the GPU platform but in most cases address the problem of sequence database scanning and computing only the alignment score whereas the alignment itself is omitted. Thus, the need arose to implement the global and semiglobal Needleman-Wunsch, and Smith-Waterman algorithms with a backtracking procedure which is needed to construct the alignment. In this paper we present a solution that performs the alignment of every given sequence pair, which is a required step for progressive multiple sequence alignment methods, as well as for DNA recognition at the DNA assembly stage. Performed tests show that the implementation, with performance up to 6.3 GCUPS on a single GPU for affine gap penalties, is very efficient in comparison to other CPU and GPU-based solutions. Moreover, multi-GPU support with load balancing makes the application very scalable. The article shows that the backtracking procedure of the sequence alignment algorithms may be designed to fit in with the GPU architecture. Therefore, our algorithm, apart from scores, is able to compute pairwise alignments. This opens a wide range of new possibilities, allowing other methods from the area of molecular biology to take advantage of the new computational architecture. Performed tests show that the efficiency of the implementation is excellent. Moreover, the speed of our GPU-based algorithms can be almost linearly increased when using more than one graphics card.
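To illustrate why the backtracking step matters, the following is a minimal CPU sketch of global Needleman-Wunsch alignment that returns the alignment itself and not only the score. It is not the authors' GPU implementation: it assumes a simple linear gap penalty (the paper reports affine gap penalties), and the function name and scoring values are hypothetical choices for illustration.

```python
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -2):
    """Global alignment with a linear gap penalty (assumed here for brevity).

    Returns (score, aligned_a, aligned_b).
    """
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def sub(i: int, j: int) -> int:
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill phase: this alone yields the alignment score.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Backtracking phase: walk from the bottom-right cell to the origin.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(needleman_wunsch("ACGT", "AGT"))
```

The fill phase is what score-only tools compute; the backtracking loop needs the full score matrix (or equivalent pointers), which is the part the abstract describes as having to be adapted to the GPU architecture.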
Plans and progress for building a Great Lakes fauna DNA ...

EPA Pesticide Factsheets

DNA reference libraries provide researchers with an important tool for assessing regional biodiversity by allowing unknown genetic sequences to be assigned identities, while also providing a means for taxonomists to validate identifications. Expanding the representation of Great Lakes species in such reference libraries is an explicit component of research at EPA’s Mid-Continent Ecology Division. Our DNA reference library building efforts began in 2012 with the goal of providing barcodes for at least 5 specimens of each native and nonindigenous fish and aquatic invertebrate species currently present in the Great Lakes. The approach is to pull taxonomically validated specimens for sequencing from EPA-led sampling efforts of adult/juvenile fish, larval fish, benthic macroinvertebrates, and zooplankton, while also soliciting aid from state and federal agencies for tissue from “shopping list” organisms. The barcodes we generate are made available through the publicly accessible BOLD (Barcode of Life) database, and help inform a planned Great Lakes biodiversity inventory. To date, our submissions to BOLD are limited to fishes; of the 88 fish species listed as being present within Lake Superior, roughly half were successfully barcoded, while only 22 species met the desired quota of 5 barcoded specimens per species. As we continue to generate genomic information from our collections and the taxonomic representations become more complete, we will continue to
G Quadruplex in Plants: A Ubiquitous Regulatory Element and Its Biological Relevance.

PubMed

Yadav, Vikas; Hemansi; Kim, Nayun; Tuteja, Narendra; Yadav, Puja

2017-01-01

G quadruplexes (G4) are higher-order DNA and RNA secondary structures formed by G-rich sequences that are built around tetrads of hydrogen-bonded guanine bases. Potential G4-forming sequences have been identified in G-rich eukaryotic non-telomeric and telomeric genomic regions. Functionally, G4 formation is known to be involved in chromatin remodeling and gene regulation, and has been associated with genomic instability, genetic diseases and cancer progression. The natural role and biological validation of G4 structures are starting to be explored, and are of particular interest for therapeutic interventions in human diseases. However, the existence and physiological role of G4 DNA and G4 RNA in plant species have not been much investigated yet and are therefore of great interest for the development of improved crop varieties for sustainable agriculture. In this context, several recent studies suggest that these highly diverse G4 structures in plants can be employed to regulate expression of genes involved in several pathophysiological conditions, including responses to biotic and abiotic stresses as well as DNA damage. In the current review, we summarize the recent findings regarding the emerging functional significance of G4 structures in plants and discuss their potential value in the development of improved crop varieties.
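As a rough illustration of how potential G4-forming sequences are commonly located in a genome, the sketch below scans a sequence with a regular expression. The pattern used (four runs of at least three guanines separated by loops of 1 to 7 bases) is a widely used heuristic and is an assumption here; the review above does not specify a search pattern, and the function name is hypothetical.

```python
import re

# Heuristic motif (an assumption, not taken from the review):
# G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+
_G4_PATTERN = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")


def find_g4_motifs(seq: str):
    """Return (start, end, motif) for non-overlapping matches on the given strand."""
    seq = seq.upper()
    return [(m.start(), m.end(), m.group()) for m in _G4_PATTERN.finditer(seq)]


if __name__ == "__main__":
    for hit in find_g4_motifs("TTGGGAGGGTGGGAGGGTT"):
        print(hit)
```

Only the strand passed in is scanned; to cover the complementary strand one would also search its reverse complement (or search for the equivalent C-rich pattern).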
Impact of Maternal Diet on the Epigenome during In Utero Life and the Developmental Programming of Diseases in Childhood and Adulthood

PubMed Central

Lee, Ho-Sun

2015-01-01

Exposure to environmental factors in early life can influence developmental processes and long-term health in humans. Early life nutrition and maternal diet are well-known examples of conditions shown to influence the risk of developing metabolic diseases, including type 2 diabetes mellitus and cardiovascular diseases, in adulthood. It is increasingly accepted that environmental compounds, including nutrients, can produce changes in the genome activity that, in spite of not altering the DNA sequence, can produce important, stable and, in some instances, transgenerational alterations in the phenotype. Epigenetics refers to changes in gene function that cannot be explained by changes in the DNA sequence, with DNA methylation patterns/histone modifications that can make important contributions to epigenetic memory. The epigenome can be considered as an interface between the genome and the environment that is central to the generation of phenotypes and their stability throughout the life course. To better understand the role of maternal health and nutrition in the initiation and progression of diseases in childhood and adulthood, it is necessary to identify the physiological and/or pathological roles of specific nutrients on the epigenome and how dietary interventions in utero and early life could modulate disease risk through epigenomic alteration. PMID:26593940
Influence of Neuroblastoma Stage on Serum-Based Detection of MYCN Amplification

PubMed Central

Combaret, Valerie; Hogarty, Michael D; London, Wendy B; McGrady, Patrick; Iacono, Isabelle; Brejon, Stephanie; Swerts, Katrien; Noguera, Rosa; Gross, Nicole; Rousseau, Raphael; Puisieux, Alain

2010-01-01

Background: MYCN oncogene amplification has been defined as the most important prognostic factor for neuroblastoma, the most common solid extracranial neoplasm in children. High copy numbers are strongly associated with rapid tumor progression and poor outcome, independently of tumor stage or patient age, and this has become an important factor in treatment stratification. Procedure: By real-time quantitative PCR analysis, we evaluated the clinical relevance of circulating MYCN DNA in 267 patients with locoregional or metastatic neuroblastoma in children less than 18 months of age. Results: For patients in this age group with INSS stage 4 or 4S NB and stage 3 patients, serum-based determination of MYCN DNA sequences had good sensitivity (85%, 83% and 75% respectively) and high specificity (100%) when compared to direct tumor gene determination. In contrast, the approach showed low sensitivity in patients with stage 1 and 2 disease. Conclusion: Our results show that the sensitivity of the serum-based MYCN DNA sequence determination depends on the stage of the disease. However, this simple, reproducible assay may represent a reasonably sensitive and very specific tool to assess tumor MYCN status in cases with stage 3 and metastatic disease for whom a wait and see strategy is often recommended. PMID:19301388
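For readers unfamiliar with the two measures reported above, the sketch below shows how sensitivity and specificity are computed when a serum assay is compared against a reference (here, direct tumor gene determination). The counts in the example are invented purely for illustration and are not the study's data; the function names are hypothetical.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of reference-positive cases that the assay also calls positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of reference-negative cases that the assay also calls negative."""
    return true_neg / (true_neg + false_pos)


if __name__ == "__main__":
    # Hypothetical counts, NOT taken from the study.
    print(f"sensitivity = {sensitivity(17, 3):.2f}")
    print(f"specificity = {specificity(50, 0):.2f}")
```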
Progressive genome-wide introgression in agricultural Campylobacter coli

PubMed Central

Sheppard, Samuel K; Didelot, Xavier; Jolley, Keith A; Darling, Aaron E; Pascoe, Ben; Meric, Guillaume; Kelly, David J; Cody, Alison; Colles, Frances M; Strachan, Norval J C; Ogden, Iain D; Forbes, Ken; French, Nigel P; Carter, Philip; Miller, William G; McCarthy, Noel D; Owen, Robert; Litrup, Eva; Egholm, Michael; Affourtit, Jason P; Bentley, Stephen D; Parkhill, Julian; Maiden, Martin C J; Falush, Daniel

2013-01-01

Hybridization between distantly related organisms can facilitate rapid adaptation to novel environments, but is potentially constrained by epistatic fitness interactions among cell components. The zoonotic pathogens Campylobacter coli and C. jejuni differ from each other by around 15% at the nucleotide level, corresponding to an average of nearly 40 amino acids per protein-coding gene. Using whole genome sequencing, we show that a single C. coli lineage, which has successfully colonized an agricultural niche, has been progressively accumulating C. jejuni DNA. Members of this lineage belong to two groups, the ST-828 and ST-1150 clonal complexes. The ST-1150 complex is less frequently isolated and has undergone a substantially greater amount of introgression leading to replacement of up to 23% of the C. coli core genome as well as import of novel DNA. By contrast, the more commonly isolated ST-828 complex bacteria have 10–11% introgressed DNA, and C. jejuni and nonagricultural C. coli lineages each have <2%. Thus, the C. coli that colonize agriculture, and consequently cause most human disease, have hybrid origin, but this cross-species exchange has so far not had a substantial impact on the gene pools of either C. jejuni or nonagricultural C. coli. These findings also indicate remarkable interchangeability of basic cellular machinery after a prolonged period of independent evolution. PMID:23279096
Epigenetic-Mediated Downregulation of μ-Protocadherin in Colorectal Tumours

PubMed Central

Bujko, Mateusz; Kober, Paulina; Statkiewicz, Małgorzata; Mikula, Michal; Ligaj, Marcin; Zwierzchowski, Lech; Ostrowski, Jerzy; Siedlecki, Janusz Aleksander

2015-01-01

Carcinogenesis involves altered cellular interaction and tissue morphology that partly arise from aberrant expression of cadherins. Mucin-like protocadherin is implicated in intercellular adhesion and its expression was found to be decreased in colorectal cancer (CRC). This study compared MUPCDH (CDHR5) expression in three key types of colorectal tissue samples: normal mucosa, adenoma, and carcinoma. A gradual decrease of mRNA levels and protein expression was observed in progressive stages of colorectal carcinogenesis, which is consistent with reports of increasing MUPCDH 5′ promoter region DNA methylation. High MUPCDH methylation was also observed in HCT116 and SW480 CRC cell lines, which showed low gene expression levels compared to COLO205 and HT29 cell lines, which lack DNA methylation at the MUPCDH locus. Furthermore, HCT116 and SW480 showed lower levels of RNA polymerase II and histone H3 lysine 4 trimethylation (H3K4me3) as well as higher levels of H3K27 trimethylation at the MUPCDH promoter. MUPCDH expression was, however, restored in HCT116 and SW480 cells in the presence of 5-Aza-2′-deoxycytidine (DNA methyltransferase inhibitor). Results indicate that μ-protocadherin downregulation occurs during early stages of tumourigenesis and progression into the adenoma-carcinoma sequence. Epigenetic mechanisms are involved in this silencing. PMID:25972897
Sequence Dependent Interactions Between DNA and Single-Walled Carbon Nanotubes

NASA Astrophysics Data System (ADS)

Roxbury, Daniel

It is known that single-stranded DNA adopts a helical wrap around a single-walled carbon nanotube (SWCNT), forming a water-dispersible hybrid molecule. The ability to sort mixtures of SWCNTs based on chirality (electronic species) has recently been demonstrated using special short DNA sequences that recognize certain matching SWCNTs of specific chirality. This thesis investigates the intricacies of DNA-SWCNT sequence-specific interactions through both experimental and molecular simulation studies. The DNA-SWCNT binding strengths were experimentally quantified by studying the kinetics of DNA replacement by a surfactant on the surface of particular SWCNTs. Recognition ability was found to correlate strongly with measured binding strength, e.g. DNA sequence (TAT)4 was found to bind 20 times stronger to the (6,5)-SWCNT than sequence (TAT)4T. Next, using replica exchange molecular dynamics (REMD) simulations, equilibrium structures formed by (a) single-strands and (b) multiple-strands of 12-mer oligonucleotides adsorbed on various SWCNTs were explored. A number of structural motifs were discovered in which the DNA strand wraps around the SWCNT and 'stitches' to itself via hydrogen bonding. Great variability among equilibrium structures was observed and shown to be directly influenced by DNA sequence and SWCNT type. For example, the (6,5)-SWCNT DNA recognition sequence, (TAT)4, was found to wrap in a tight single-stranded right-handed helical conformation. In contrast, DNA sequence T12 forms a beta-barrel left-handed structure on the same SWCNT. These are the first theoretical indications that DNA-based SWCNT selectivity can arise on a molecular level. In a biomedical collaboration with the Mayo Clinic, pathways for DNA-SWCNT internalization into healthy human endothelial cells were explored. 
Through absorbance spectroscopy, TEM imaging, and confocal fluorescence microscopy, we showed that intracellular concentrations of SWCNTs far exceeded those of the incubation solution, which suggested an energy-dependent pathway. Additionally, by means of pharmacological inhibition and vector-induced gene knockout studies, the DNA-SWCNTs were shown to enter the cells via Rac1-mediated macropinocytosis.
Development of a Novel Technology for Label Free DNA Sequencing

DTIC Science & Technology

2012-05-21

of the C-H bond stretch vibrations in the planes of the corresponding DNA bases, and in the higher-frequency side, sequence-identifier region is...composed of the N-H bond stretch vibrations in the planes of the corresponding DNA bases. In addition, the sequence-identifier dividing region almost...regions are localized at the corresponding DNA bases and exhibit a definable dependence on the sequence form of the codons under study. Final
Flow cytometry for enrichment and titration in massively parallel DNA sequencing

PubMed Central

Sandberg, Julia; Ståhl, Patrik L.; Ahmadian, Afshin; Bjursell, Magnus K.; Lundeberg, Joakim

2009-01-01

Massively parallel DNA sequencing is revolutionizing genomics research throughout the life sciences. However, the reagent costs and labor requirements in current sequencing protocols are still substantial, although improvements are continuously being made. Here, we demonstrate an effective alternative to existing sample titration protocols for the Roche/454 system using Fluorescence Activated Cell Sorting (FACS) technology to determine the optimal DNA-to-bead ratio prior to large-scale sequencing. Our method, which eliminates the need for the costly pilot sequencing of samples during titration is capable of rapidly providing accurate DNA-to-bead ratios that are not biased by the quantification and sedimentation steps included in current protocols. Moreover, we demonstrate that FACS sorting can be readily used to highly enrich fractions of beads carrying template DNA, with near total elimination of empty beads and no downstream sacrifice of DNA sequencing quality. Automated enrichment by FACS is a simple approach to obtain pure samples for bead-based sequencing systems, and offers an efficient, low-cost alternative to current enrichment protocols. PMID:19304748
A DNA sequence obtained by replacement of the dopamine RNA aptamer bases is not an aptamer.

PubMed

Álvarez-Martos, Isabel; Ferapontova, Elena E

2017-08-05

The unique specificity of aptamer-ligand biorecognition and binding facilitates bioanalysis and biosensor development, contributing to discrimination of structurally related molecules, such as dopamine and other catecholamine neurotransmitters. The aptamer sequence capable of specific binding of dopamine is a 57-nucleotide-long RNA sequence reported in 1997 (Biochemistry, 1997, 36, 9726). Later, it was suggested that the DNA homologue of the RNA aptamer retains the specificity of dopamine binding (Biochem. Biophys. Res. Commun., 2009, 388, 732). Here, we show that the DNA sequence obtained by the replacement of the RNA aptamer bases with their DNA analogues is not capable of specific biorecognition of dopamine, in contrast to the original RNA aptamer sequence. This DNA sequence binds dopamine and structurally related catecholamine neurotransmitters non-specifically, as any DNA sequence does, and thus is not an aptamer and cannot be used for either in vivo or in situ analysis of dopamine in the presence of structurally related neurotransmitters. Copyright © 2017 Elsevier Inc. All rights reserved.
Method for sequencing DNA base pairs

DOEpatents

Sessler, Andrew M.; Dawson, John

1993-01-01

The base pairs of a DNA structure are sequenced with the use of a scanning tunneling microscope (STM). The DNA structure is scanned by the STM probe tip, and, as it is being scanned, the DNA structure is separately subjected to a sequence of infrared radiation from four different sources, each source being selected to preferentially excite one of the four different bases in the DNA structure. Each particular base is subjected to this sequence of infrared radiation from the four different sources while that base is being scanned. The DNA structure as a whole is imaged separately for each exposure to radiation from a single one of the sources.
Nuclear counterparts of the cytoplasmic mitochondrial 12S rRNA gene: a problem of ancient DNA and molecular phylogenies.

PubMed

van der Kuyl, A C; Kuiken, C L; Dekker, J T; Perizonius, W R; Goudsmit, J

1995-06-01

Monkey mummy bones and teeth originating from the North Saqqara Baboon Galleries (Egypt), soft tissue from a mummified baboon in a museum collection, and nineteenth/twentieth-century skin fragments from mangabeys were used for DNA extraction and PCR amplification of part of the mitochondrial 12S rRNA gene. Sequences aligning with the 12S rRNA gene were recovered but were only distantly related to contemporary monkey mitochondrial 12S rRNA sequences. However, many of these sequences were identical or closely related to human nuclear DNA sequences resembling mitochondrial 12S rRNA (isolated from a cell line depleted in mitochondria) and therefore have to be considered contamination. Subsequently, in a separate study, we were able to recover genuine mitochondrial 12S rRNA sequences from many extant species of nonhuman Old World primates, as well as sequences closely resembling the human nuclear integrations. Analysis of all sequences by the neighbor-joining (NJ) method indicated that mitochondrial DNA sequences and their nuclear counterparts can be divided into two distinct clusters. One cluster contained all contemporary cytoplasmic mitochondrial DNA sequences and approximately half of the monkey nuclear mitochondrial-like sequences. A second cluster contained most human nuclear sequences and the other half of monkey nuclear sequences, with a separate branch leading to human and gorilla mitochondrial and nuclear sequences. Sequences recovered from ancient materials were equally divided between the two clusters. These results constitute a warning when working with ancient DNA or performing phylogenetic analysis using mitochondrial DNA as a target sequence: nuclear counterparts of mitochondrial genes may lead to faulty interpretation of results.
Sequence independent amplification of DNA

DOEpatents

Bohlander, S.K.

1998-03-24

The present invention is a rapid sequence-independent amplification procedure (SIA). Even minute amounts of DNA from various sources can be amplified independent of any sequence requirements of the DNA or any a priori knowledge of any sequence characteristics of the DNA to be amplified. This method allows, for example, the sequence independent amplification of microdissected chromosomal material and the reliable construction of high quality fluorescent in situ hybridization (FISH) probes from YACs or from other sources. These probes can be used to localize YACs on metaphase chromosomes but also--with high efficiency--in interphase nuclei. 25 figs.
Sequence independent amplification of DNA

DOEpatents

Bohlander, Stefan K.

1998-01-01

The present invention is a rapid sequence-independent amplification procedure (SIA). Even minute amounts of DNA from various sources can be amplified independent of any sequence requirements of the DNA or any a priori knowledge of any sequence characteristics of the DNA to be amplified. This method allows, for example the sequence independent amplification of microdissected chromosomal material and the reliable construction of high quality fluorescent in situ hybridization (FISH) probes from YACs or from other sources. These probes can be used to localize YACs on metaphase chromosomes but also--with high efficiency--in interphase nuclei.
UV-Visible Spectroscopy-Based Quantification of Unlabeled DNA Bound to Gold Nanoparticles.

PubMed

Baldock, Brandi L; Hutchison, James E

2016-12-20

DNA-functionalized gold nanoparticles have been increasingly applied as sensitive and selective analytical probes and biosensors. The DNA ligands bound to a nanoparticle dictate its reactivity, making it essential to know the type and number of DNA strands bound to the nanoparticle surface. Existing methods used to determine the number of DNA strands per gold nanoparticle (AuNP) require that the sequences be fluorophore-labeled, which may affect the DNA surface coverage and reactivity of the nanoparticle and/or require specialized equipment and other fluorophore-containing reagents. We report a UV-visible-based method to conveniently and inexpensively determine the number of DNA strands attached to AuNPs of different core sizes. When this method is used in tandem with a fluorescence dye assay, it is possible to determine the ratio of two unlabeled sequences of different lengths bound to AuNPs. Two sizes of citrate-stabilized AuNPs (5 and 12 nm) were functionalized with mixtures of short (5 base) and long (32 base) disulfide-terminated DNA sequences, and the ratios of sequences bound to the AuNPs were determined using the new method. The long DNA sequence was present as a lower proportion of the ligand shell than in the ligand exchange mixture, suggesting it had a lower propensity to bind the AuNPs than the short DNA sequence. The ratio of DNA sequences bound to the AuNPs was not the same for the large and small AuNPs, which suggests that the radius of curvature had a significant influence on the assembly of DNA strands onto the AuNPs.
Ancient DNA sequence revealed by error-correcting codes.

PubMed

Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo

2015-07-10

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.
Ancient DNA sequence revealed by error-correcting codes

PubMed Central

Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo

2015-01-01

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228
Multiplex picoliter-droplet digital PCR for quantitative assessment of DNA integrity in clinical samples.

PubMed

Didelot, Audrey; Kotsopoulos, Steve K; Lupo, Audrey; Pekin, Deniz; Li, Xinyu; Atochin, Ivan; Srinivasan, Preethi; Zhong, Qun; Olson, Jeff; Link, Darren R; Laurent-Puig, Pierre; Blons, Hélène; Hutchison, J Brian; Taly, Valerie

2013-05-01

Assessment of DNA integrity and quantity remains a bottleneck for high-throughput molecular genotyping technologies, including next-generation sequencing. In particular, DNA extracted from paraffin-embedded tissues, a major potential source of tumor DNA, varies widely in quality, leading to unpredictable sequencing data. We describe a picoliter droplet-based digital PCR method that enables simultaneous detection of DNA integrity and the quantity of amplifiable DNA. Using a multiplex assay, we detected 4 different target lengths (78, 159, 197, and 550 bp). Assays were validated with human genomic DNA fragmented to sizes of 170 bp to 3000 bp. The technique was validated with DNA quantities as low as 1 ng. We evaluated 12 DNA samples extracted from paraffin-embedded lung adenocarcinoma tissues. One sample contained no amplifiable DNA. The fractions of amplifiable DNA for the 11 other samples were between 0.05% and 10.1% for 78-bp fragments and ≤1% for longer fragments. Four samples were chosen for enrichment and next-generation sequencing. The quality of the sequencing data was in agreement with the results of the DNA-integrity test. Specifically, DNA with low integrity yielded sequencing results with lower levels of coverage and uniformity and had higher levels of false-positive variants. The development of DNA-quality assays will enable researchers to downselect samples or process more DNA to achieve reliable genome sequencing with the highest possible efficiency of cost and effort, as well as minimize the waste of precious samples. © 2013 American Association for Clinical Chemistry.
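The abstract does not give its analysis formulas, but droplet digital PCR results are conventionally converted to copy numbers with a Poisson correction, since a positive droplet may contain more than one template. The sketch below shows that standard calculation and a derived "amplifiable fraction"; the function names, the droplet counts and the input copy number are hypothetical, and the way the authors actually computed their fractions is an assumption.

```python
import math


def copies_per_droplet(positive: int, total: int) -> float:
    """Mean template copies per droplet, assuming Poisson loading of droplets."""
    if total <= 0 or not 0 <= positive < total:
        raise ValueError("need 0 <= positive < total")
    # P(droplet is negative) = exp(-lambda)  =>  lambda = -ln(1 - p_positive)
    return -math.log(1.0 - positive / total)


def amplifiable_fraction(positive: int, total: int, input_copies: float) -> float:
    """Measured amplifiable copies divided by the copies expected from input mass."""
    measured = copies_per_droplet(positive, total) * total
    return measured / input_copies


if __name__ == "__main__":
    # Hypothetical run: 1,200 positive droplets out of 1,000,000,
    # with 30,000 genome copies loaded according to mass-based quantification.
    print(f"{amplifiable_fraction(1200, 1_000_000, 30_000):.3%}")
```

Running the same calculation for each amplicon length (78, 159, 197 and 550 bp in the study) would give a length-dependent profile of DNA integrity.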

Detecting and Estimating Contamination of Human DNA Samples in Sequencing and Array-Based Genotype Data

PubMed Central

Jun, Goo; Flickinger, Matthew; Hetrick, Kurt N.; Romm, Jane M.; Doheny, Kimberly F.; Abecasis, Gonçalo R.; Boehnke, Michael; Kang, Hyun Min

2012-01-01

DNA sample contamination is a serious problem in DNA sequencing studies and may result in systematic genotype misclassification and false positive associations. Although methods exist to detect and filter out cross-species contamination, few methods to detect within-species sample contamination are available. In this paper, we describe methods to identify within-species DNA sample contamination based on (1) a combination of sequencing reads and array-based genotype data, (2) sequence reads alone, and (3) array-based genotype data alone. Analysis of sequencing reads allows contamination detection after sequence data is generated but prior to variant calling; analysis of array-based genotype data allows contamination detection prior to generation of costly sequence data. Through a combination of analysis of in silico and experimentally contaminated samples, we show that our methods can reliably detect and estimate levels of contamination as low as 1%. We evaluate the impact of DNA contamination on genotype accuracy and propose effective strategies to screen for and prevent DNA contamination in sequencing studies. PMID:23103226
Integrated sequencing of exome and mRNA of large-sized single cells.

PubMed

Wang, Lily Yan; Guo, Jiajie; Cao, Wei; Zhang, Meng; He, Jiankui; Li, Zhoufang

2018-01-10

With current approaches to single-cell DNA-RNA integrated sequencing, it is difficult to call SNPs because a large amount of DNA and RNA is lost during DNA-RNA separation. Here, we performed simultaneous single-cell exome and transcriptome sequencing on individual mouse oocytes. Using microinjection, we kept the nuclei intact to avoid DNA loss, while retaining the cytoplasm inside the cell membrane, to maximize the amount of DNA and RNA captured from the single cell. We then conducted exome-sequencing on the isolated nuclei and mRNA-sequencing on the enucleated cytoplasm. For single oocytes, exome-seq can cover up to 92% of the exome region with an average sequencing depth of 10+, while mRNA-sequencing reveals more than 10,000 expressed genes in enucleated cytoplasm, with similar performance for intact oocytes. This approach provides unprecedented opportunities to study DNA-RNA regulation, such as RNA editing at the single-nucleotide level in oocytes. In the future, this method can also be applied to other large cells, including neurons, large dendritic cells and large tumour cells, for integrated exome and transcriptome sequencing.
A simple method for semi-random DNA amplicon fragmentation using the methylation-dependent restriction enzyme MspJI.

PubMed

Shinozuka, Hiroshi; Cogan, Noel O I; Shinozuka, Maiko; Marshall, Alexis; Kay, Pippa; Lin, Yi-Han; Spangenberg, German C; Forster, John W

2015-04-11

Fragmentation at random nucleotide locations is an essential process for preparation of DNA libraries to be used on massively parallel short-read DNA sequencing platforms. Although instruments for physical shearing, such as the Covaris S2 focused-ultrasonicator system, and products for enzymatic shearing, such as the Nextera technology and NEBNext dsDNA Fragmentase kit, are commercially available, a simple and inexpensive method is desirable for high-throughput sequencing library preparation. MspJI is a recently characterised restriction enzyme which recognises the sequence motif CNNR (where R = G or A) when the first base is modified to 5-methylcytosine or 5-hydroxymethylcytosine. A semi-random enzymatic DNA amplicon fragmentation method was developed based on the unique cleavage properties of MspJI. In this method, random incorporation of 5-methyl-2'-deoxycytidine-5'-triphosphate is achieved through DNA amplification with DNA polymerase, followed by DNA digestion with MspJI. Due to the recognition sequence of the enzyme, DNA amplicons are fragmented in a relatively sequence-independent manner. The size range of the resulting fragments could be controlled through optimisation of the 5-methyl-2'-deoxycytidine-5'-triphosphate concentration in the reaction mixture. A library suitable for sequencing using the Illumina MiSeq platform was prepared and processed using the proposed method. Alignment of generated short reads to a reference sequence demonstrated a relatively high level of random fragmentation. The proposed method may be performed with standard laboratory equipment. Although the uniformity of coverage was slightly inferior to that of the Covaris physical shearing procedure, due to efficiencies of cost and labour, the method may be more suitable than existing approaches for implementation in large-scale sequencing activities, such as bacterial artificial chromosome (BAC)-based genome sequence assembly, pan-genomic studies and locus-targeted genotyping-by-sequencing.
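The CNNR recognition motif is short and degenerate, which is why fragmentation is described as relatively sequence-independent. The sketch below lists every candidate CNNR site on one strand of a sequence. It is illustrative only: it does not model methylation status (the enzyme acts only where the first C is modified) or the actual cut positions, and the function name is hypothetical.

```python
import re

# C, any two bases, then a purine (R = A or G). The lookahead lets
# overlapping candidate sites be reported as well.
_CNNR = re.compile(r"(?=(C[ACGT]{2}[AG]))")


def find_cnnr_sites(seq: str):
    """Return (position, motif) for every CNNR occurrence on the given strand."""
    seq = seq.upper()
    return [(m.start(), m.group(1)) for m in _CNNR.finditer(seq)]


if __name__ == "__main__":
    sites = find_cnnr_sites("CCAGGTTACGTAG")
    print(len(sites), "candidate sites:", sites)
```

Because only a randomly chosen subset of cytosines is methylated during amplification, only a subset of these candidate sites would actually be recognised in any one molecule, which is consistent with the abstract's description of tuning fragment size via the methylated-dCTP concentration.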
Genomics dataset of unidentified disclosed isolates.

PubMed

Rekadwad, Bhagwan N

2016-09-01

Analysis of DNA sequences is necessary for higher hierarchical classification of organisms. It gives clues about the characteristics of organisms and their taxonomic position. This dataset was chosen to find complexities in the unidentified DNA in disclosed patents. A total of 17 unidentified DNA sequences were thoroughly analyzed. Quick response (QR) codes were generated, and the AT/GC content of the DNA sequences was analyzed. The QR codes are helpful for quick identification of isolates, and AT/GC content is helpful for studying their stability at different temperatures. Additionally, a dataset on cleavage codes and enzyme codes from a restriction digestion study, which is helpful for performing studies using short DNA sequences, is reported. The dataset disclosed here provides new data for the exploration of unique DNA sequences for evaluation, identification, comparison and analysis.
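As a minimal sketch of the AT/GC content calculation mentioned above (the function name is hypothetical, and ambiguous bases such as N are simply ignored, which is an assumption rather than the author's stated procedure):

```python
def gc_content(seq):
    """Return the fraction of G and C among the unambiguous bases of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T")
    gc = sum(b in "GC" for b in bases)
    return gc / len(bases)


if __name__ == "__main__":
    seq = "ATGCGC"
    gc = gc_content(seq)
    print(f"GC = {gc:.2f}, AT = {1 - gc:.2f}")
```

AT content is the complement of GC content, so a single function is enough for both values.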
Winnowing DNA for Rare Sequences: Highly Specific Sequence and Methylation Based Enrichment

PubMed Central

Thompson, Jason D.; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

2012-01-01

Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential, such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that, through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000-fold enrichment of target DNA with single-nucleotide specificity, and 100-fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue. PMID:22355378
Low-Energy Electron-Induced Strand Breaks in Telomere-Derived DNA Sequences-Influence of DNA Sequence and Topology.

PubMed

Rackwitz, Jenny; Bald, Ilko

2018-03-26

During cancer radiation therapy high-energy radiation is used to reduce tumour tissue. The irradiation produces a shower of secondary low-energy (<20 eV) electrons, which are able to damage DNA very efficiently by dissociative electron attachment. Recently, it was suggested that low-energy electron-induced DNA strand breaks strongly depend on the specific DNA sequence with a high sensitivity of G-rich sequences. Here, we use DNA origami platforms to expose G-rich telomere sequences to low-energy (8.8 eV) electrons to determine absolute cross sections for strand breakage and to study the influence of sequence modifications and topology of telomeric DNA on the strand breakage. We find that the telomeric DNA 5'-(TTA GGG)₂ is more sensitive to low-energy electrons than an intermixed sequence 5'-(TGT GTG A)₂, confirming the unique electronic properties resulting from G-stacking. With increasing length of the oligonucleotide (i.e., going from 5'-(GGG ATT)₂ to 5'-(GGG ATT)₄), both the variety of topology and the electron-induced strand break cross sections increase. Addition of K⁺ ions decreases the strand break cross section for all sequences that are able to fold G-quadruplexes or G-intermediates, whereas the strand break cross section for the intermixed sequence remains unchanged. These results indicate that telomeric DNA is rather sensitive towards low-energy electron-induced strand breakage, suggesting significant telomere shortening that can also occur during cancer radiation therapy. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Dynamic distribution patterns of ribosomal DNA and chromosomal evolution in Paphiopedilum, a lady's slipper orchid

PubMed Central

2011-01-01

Background Paphiopedilum is a horticulturally and ecologically important genus of ca. 80 species of lady's slipper orchids native to Southeast Asia. These plants have long been of interest regarding their chromosomal evolution, which involves a progressive aneuploid series based on either fission or fusion of centromeres. Chromosome number is positively correlated with genome size, so rearrangement processes must include either insertion or deletion of DNA segments. We have conducted Fluorescence In Situ Hybridization (FISH) studies using 5S and 25S ribosomal DNA (rDNA) probes to survey for rearrangements, duplications, and phylogenetically-correlated variation within Paphiopedilum. We further studied sequence variation of the non-transcribed spacers of 5S rDNA (5S-NTS) to examine their complex duplication history, including the possibility that concerted evolutionary forces may homogenize diversity. Results 5S and 25S rDNA loci among Paphiopedilum species, representing all key phylogenetic lineages, exhibit a considerable diversity that correlates well with recognized evolutionary groups. 25S rDNA signals range from 2 (representing 1 locus) to 9, the latter representing hemizygosity. 5S loci display extensive structural variation, and show from 2 specific signals to many, both major and minor and highly dispersed. The dispersed signals mainly occur at centromeric and subtelomeric positions, which are hotspots for chromosomal breakpoints. Phylogenetic analysis of cloned 5S rDNA non-transcribed spacer (5S-NTS) sequences showed evidence for both ancient and recent post-speciation duplication events, as well as interlocus and intralocus diversity. Conclusions Paphiopedilum species display many chromosomal rearrangements - for example, duplications, translocations, and inversions - but only weak concerted evolutionary forces among highly duplicated 5S arrays, which suggests that double-strand break repair processes are dynamic and ongoing. 
These results make the genus a model system for the study of complex chromosomal evolution in plants. PMID:21910890
Dynamic distribution patterns of ribosomal DNA and chromosomal evolution in Paphiopedilum, a lady's slipper orchid.

PubMed

Lan, Tianying; Albert, Victor A

2011-09-12

Paphiopedilum is a horticulturally and ecologically important genus of ca. 80 species of lady's slipper orchids native to Southeast Asia. These plants have long been of interest regarding their chromosomal evolution, which involves a progressive aneuploid series based on either fission or fusion of centromeres. Chromosome number is positively correlated with genome size, so rearrangement processes must include either insertion or deletion of DNA segments. We have conducted Fluorescence In Situ Hybridization (FISH) studies using 5S and 25S ribosomal DNA (rDNA) probes to survey for rearrangements, duplications, and phylogenetically-correlated variation within Paphiopedilum. We further studied sequence variation of the non-transcribed spacers of 5S rDNA (5S-NTS) to examine their complex duplication history, including the possibility that concerted evolutionary forces may homogenize diversity. 5S and 25S rDNA loci among Paphiopedilum species, representing all key phylogenetic lineages, exhibit a considerable diversity that correlates well with recognized evolutionary groups. 25S rDNA signals range from 2 (representing 1 locus) to 9, the latter representing hemizygosity. 5S loci display extensive structural variation, and show from 2 specific signals to many, both major and minor and highly dispersed. The dispersed signals mainly occur at centromeric and subtelomeric positions, which are hotspots for chromosomal breakpoints. Phylogenetic analysis of cloned 5S rDNA non-transcribed spacer (5S-NTS) sequences showed evidence for both ancient and recent post-speciation duplication events, as well as interlocus and intralocus diversity. Paphiopedilum species display many chromosomal rearrangements--for example, duplications, translocations, and inversions--but only weak concerted evolutionary forces among highly duplicated 5S arrays, which suggests that double-strand break repair processes are dynamic and ongoing. 
These results make the genus a model system for the study of complex chromosomal evolution in plants.
Improved multiple displacement amplification (iMDA) and ultraclean reagents.

PubMed

Motley, S Timothy; Picuri, John M; Crowder, Chris D; Minich, Jeremiah J; Hofstadler, Steven A; Eshoo, Mark W

2014-06-06

Next-generation sequencing sample preparation requires nanogram to microgram quantities of DNA; however, many relevant samples comprise only a few cells. Genomic analysis of these samples requires a whole genome amplification method that is unbiased and free of exogenous DNA contamination. To address these challenges we have developed protocols for the production of DNA-free consumables including reagents and have improved upon multiple displacement amplification (iMDA). A specialized ethylene oxide treatment was developed that renders free DNA and DNA present within Gram positive bacterial cells undetectable by qPCR. To reduce DNA contamination in amplification reagents, a combination of ion exchange chromatography, filtration, and lot testing protocols was developed. Our multiple displacement amplification protocol employs a second strand-displacing DNA polymerase, improved buffers, improved reaction conditions and DNA-free reagents. The iMDA protocol, when used in combination with DNA-free laboratory consumables and reagents, significantly improved efficiency and accuracy of amplification and sequencing of specimens with moderate to low levels of DNA. The sensitivity and specificity of sequencing of amplified DNA prepared using iMDA was compared to that of DNA obtained with two commercial whole genome amplification kits using 10 fg (~1-2 bacterial cells worth) of bacterial genomic DNA as a template. Analysis showed >99% of the iMDA reads mapped to the template organism whereas only 0.02% of the reads from the commercial kits mapped to the template. To assess the ability of iMDA to achieve balanced genomic coverage, a non-stochastic amount of bacterial genomic DNA (1 pg) was amplified and sequenced, and data obtained were compared to sequencing data obtained directly from genomic DNA. The iMDA DNA and genomic DNA sequencing had comparable coverage: 99.98% of the reference genome at ≥1X coverage and 99.9% at ≥5X coverage, while maintaining both balance and representation of the genome. The iMDA protocol in combination with DNA-free laboratory consumables significantly improved the ability to sequence specimens with low levels of DNA. iMDA has broad utility in metagenomics, diagnostics, ancient DNA analysis, pre-implantation embryo screening, single-cell genomics, whole genome sequencing of unculturable organisms, and forensic applications for both human and microbial targets.
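The coverage figures quoted above (fraction of the reference genome at ≥1X and ≥5X) are breadth-of-coverage values. A minimal sketch of how such a value could be computed from a per-base depth list follows; the function name and the toy depths are hypothetical, and this is not the authors' pipeline.

```python
def breadth_of_coverage(depths, min_depth):
    """Return the fraction of reference positions covered at >= min_depth.

    depths is a sequence with one integer read depth per reference base.
    """
    if not depths:
        raise ValueError("depths must not be empty")
    covered = sum(d >= min_depth for d in depths)
    return covered / len(depths)


if __name__ == "__main__":
    toy_depths = [0, 1, 5, 10]  # illustrative values only
    for threshold in (1, 5):
        frac = breadth_of_coverage(toy_depths, threshold)
        print(f">= {threshold}X: {frac:.2%}")
```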
Strong minor groove base conservation in sequence logos implies DNA distortion or base flipping during replication and transcription initiation.

PubMed

Schneider, T D

2001-12-01

The sequence logo for DNA binding sites of the bacteriophage P1 replication protein RepA shows unusually high sequence conservation (approximately 2 bits) at a minor groove that faces RepA. However, B-form DNA can support only 1 bit of sequence conservation via contacts into the minor groove. The high conservation in RepA sites therefore implies a distorted DNA helix with direct or indirect contacts to the protein. Here I show that a high minor groove conservation signature also appears in sequence logos of sites for other replication origin binding proteins (Rts1, DnaA, P4 alpha, EBNA1, ORC) and promoter binding proteins (sigma(70), sigma(D) factors). This finding implies that DNA binding proteins generally use non-B-form DNA distortion such as base flipping to initiate replication and transcription.
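The conservation values in bits that a sequence logo displays can be sketched as 2 minus the Shannon entropy of each alignment column. The code below is a simplified illustration: the function name and example sites are hypothetical, and the small-sample correction normally applied when only a few sites are available is deliberately omitted.

```python
from collections import Counter
from math import log2


def information_per_position(sites):
    """Return the information content (bits) of each column of aligned sites.

    Each column contributes 2 - H, where H is the Shannon entropy of the
    observed base frequencies. No small-sample correction is applied.
    """
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    bits = []
    for i in range(length):
        counts = Counter(s[i] for s in sites)
        total = sum(counts.values())
        entropy = -sum((c / total) * log2(c / total) for c in counts.values())
        bits.append(2.0 - entropy)
    return bits


if __name__ == "__main__":
    # Column 1 is invariant, column 2 is a two-way choice, column 3 is uniform.
    print(information_per_position(["AAT", "AGC", "AAG", "AGA"]))
```

A column that distinguishes only between two pairs of bases carries 1 bit, which is the minor-groove limit for B-form DNA that the abstract refers to, whereas a fully conserved base carries 2 bits.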
Molecular design of sequence specific DNA alkylating agents.

PubMed

Minoshima, Masafumi; Bando, Toshikazu; Shinohara, Ken-ichi; Sugiyama, Hiroshi

2009-01-01

Sequence-specific DNA alkylating agents are of great interest as a novel approach to cancer chemotherapy. We designed conjugates between pyrrole (Py)-imidazole (Im) polyamides and the DNA-alkylating chlorambucil moiety, attached at different positions. Sequence-specific DNA alkylation by the conjugates was investigated using high-resolution denaturing polyacrylamide gel electrophoresis (PAGE). The results showed that polyamide-chlorambucil conjugates alkylate DNA at flanking adenines in the recognition sequences of Py-Im polyamides; however, the reactivities and alkylation sites were influenced by the position of conjugation. In addition, we synthesized a conjugate between a Py-Im polyamide and another alkylating agent, 1-(chloromethyl)-5-hydroxy-1,2-dihydro-3H-benz[e]indole (seco-CBI). The DNA alkylation reactivities of both alkylating polyamides were almost comparable. In contrast, cytotoxicities against cell lines differed greatly. These comparative studies should promote the development of appropriate sequence-specific DNA alkylating polyamides against specific cancer cells.
The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

PubMed

Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P; Panitz, Frank; Bendixen, Christian; Nielsen, Rasmus; Willerslev, Eske

2007-02-14

The invention of the Genome Sequencer 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources. We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequencer 20 DNA Sequencing System (GS20, Roche/454 Life Sciences). Each DNA sequence is subsequently traced back to its individual source through 5'-tag analysis. We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (misassignment rate <0.4%). Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in a single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regard to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5'-primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial analyses, population genetics, and phylogenetics.
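Tracing each read back to its source through its 5' tag amounts to a simple demultiplexing step. The sketch below assumes exact-match dinucleotide tags at the very start of each read; the function name, tag table and reads are hypothetical, and the handling of sequencing anomalies described in the abstract is not reproduced.

```python
def demultiplex(reads, tag_to_sample, tag_length=2):
    """Assign reads to samples by their leading tag.

    Returns (assigned, unassigned): assigned maps sample name to a list of
    reads with the tag removed; unassigned holds reads whose tag is unknown.
    """
    assigned = {}
    unassigned = []
    for read in reads:
        tag = read[:tag_length].upper()
        sample = tag_to_sample.get(tag)
        if sample is None:
            unassigned.append(read)
        else:
            assigned.setdefault(sample, []).append(read[tag_length:])
    return assigned, unassigned


if __name__ == "__main__":
    tags = {"AC": "specimen_1", "TG": "specimen_2"}  # illustrative tags
    reads = ["ACGGTT", "TGCCAA", "GGAAAA"]
    print(demultiplex(reads, tags))
```

Counting the reads per tag in the returned dictionary is also a direct way to look for the tag-dependent representation bias the authors describe.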
Independent effects of leaf growth and light on the development of the plastid and its DNA content in Zea species.

PubMed

Zheng, Qi; Oldenburg, Delene J; Bendich, Arnold J

2011-05-01

In maize (Zea mays L.), chloroplast development progresses from the basal meristem to the mature leaf tip, and light is required for maturation to photosynthetic competence. During chloroplast greening, it was found that chloroplast DNA (cpDNA) is extensively degraded, falling to undetectable levels in many individual chloroplasts for three maize cultivars, as well as Zea mexicana (the ancestor of cultivated maize) and the perennial species Zea diploperennis. In dark-grown maize seedlings, the proplastid-to-etioplast transition is characterized by plastid enlargement, cpDNA replication, and the retention of high levels of cpDNA. When dark-grown seedlings are transferred to white light, the DNA content per plastid increases slightly during the first 4 h of illumination and then declines rapidly to a minimum at 24 h during the etioplast-to-chloroplast transition. Plastid autofluorescence (from chlorophyll) continues to increase as cpDNA declines, whereas plastid size remains constant. It is concluded that the increase in cpDNA that accompanies plastid enlargement is a consequence of cell and leaf growth, rather than illumination, whereas light stimulates photosynthetic capacity and cpDNA instability. When cpDNA from total tissue was monitored by blot hybridization and real-time quantitative PCR, no decline following transfer from dark to light was observed. The lack of agreement between DNA per plastid and cpDNA per cell may be attributed to nupts (nuclear sequences of plastid origin).
Utility of 16S rDNA Sequencing for Identification of Rare Pathogenic Bacteria.

PubMed

Loong, Shih Keng; Khor, Chee Sieng; Jafar, Faizatul Lela; AbuBakar, Sazaly

2016-11-01

Phenotypic identification systems are established methods for laboratory identification of bacteria causing human infections. Here, the utility of phenotypic identification systems was compared against the 16S rDNA identification method on clinical isolates obtained during a 5-year study period, with special emphasis on isolates that gave unsatisfactory identification. One hundred and eighty-seven clinical bacterial isolates were tested with commercial phenotypic identification systems and 16S rDNA sequencing. Isolate identities determined using phenotypic identification systems and 16S rDNA sequencing were compared for similarity at genus and species level, with 16S rDNA sequencing as the reference method. Phenotypic identification systems identified ~46% (86/187) of the isolates with identity similar to that identified using 16S rDNA sequencing. Approximately 39% (73/187) and ~15% (28/187) of the isolates showed different genus identity and could not be identified using the phenotypic identification systems, respectively. Both methods succeeded in determining the species identities of 55 isolates; however, only ~69% (38/55) of the isolates matched at species level. 16S rDNA sequencing could not determine the species of ~20% (37/187) of the isolates. 16S rDNA sequencing is more useful than the phenotypic identification systems for the identification of rare and difficult-to-identify bacterial species. The 16S rDNA sequencing method, however, has limitations for species-level identification of some bacteria, highlighting the need for better bacterial pathogen identification tools. © 2016 Wiley Periodicals, Inc.
Performance of the ForenSeq™ DNA Signature Prep kit on highly degraded samples.

PubMed

Fattorini, Paolo; Previderé, Carlo; Carboni, Ilaria; Marrubini, Giorgio; Sorçaburu-Cigliero, Solange; Grignani, Pierangela; Bertoglio, Barbara; Vatta, Paolo; Ricci, Ugo

2017-04-01

Next generation sequencing (NGS) is the emerging technology in forensic genomics laboratories. It offers higher resolution to address most problems of human identification, greater efficiency and potential ability to interrogate very challenging forensic casework samples. In this study, a trial set of DNA samples was artificially degraded by progressive aqueous hydrolysis, and analyzed together with the corresponding unmodified DNA sample and control sample 2800M, to test the performance and reliability of the ForenSeq™ DNA Signature Prep kit using the MiSeq Sequencer (Illumina). The results of replicate tests performed on the unmodified sample (1.0 ng) and on scalar dilutions (1.0, 0.5 and 0.1 ng) of the reference sample 2800M showed the robustness and the reliability of the NGS approach even from sub-optimal amounts of high quality DNA. The degraded samples showed a very limited number of reads/sample, 2.9- to 10.2-fold lower than that reported for the least concentrated 2800M DNA dilution (0.1 ng). In addition, it was impossible to assign up to 78.2% of the genotypes in the degraded samples as the software identified the corresponding loci as "low coverage" (< 50x). Amplification artifacts such as allelic imbalances, allele drop-outs and a single allele drop-in were also scored in the degraded samples. However, the ForenSeq™ DNA Sequencing kit, on the Illumina MiSeq, was able to generate data which led to the correct typing of 5.1-44.8% and 10.9-58.7% of 58 of the STRs and 92 SNPs, respectively. In all trial samples, the SNP markers showed higher chances to be typed correctly compared to the STRs. This NGS approach showed very promising results in terms of ability to recover genetic information from heavily degraded DNA samples for which the conventional PCR/CE approach gave no results. The frequency of genetic mistyping was very low, reaching the value of 1.4% for only one of the degraded samples. However, these results suggest that further validation studies and a definition of interpretation criteria for NGS data are needed before implementation of this technique in forensic genetics. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Using circulating cell-free DNA to monitor personalized cancer therapy.

PubMed

Oellerich, Michael; Schütz, Ekkehard; Beck, Julia; Kanzow, Philipp; Plowman, Piers N; Weiss, Glen J; Walson, Philip D

2017-05-01

High-quality genomic analysis is critical for personalized pharmacotherapy in patients with cancer. Tumor-specific genomic alterations can be identified in cell-free DNA (cfDNA) from patient blood samples and can complement biopsies for real-time molecular monitoring of treatment, detection of recurrence, and tracking resistance. cfDNA can be especially useful when tumor tissue is unavailable or insufficient for testing. For blood-based genomic profiling, next-generation sequencing (NGS) and droplet digital PCR (ddPCR) have been successfully applied. The US Food and Drug Administration (FDA) recently approved the first such "liquid biopsy" test for EGFR mutations in patients with non-small cell lung cancer (NSCLC). Such non-invasive methods allow for the identification of specific resistance mutations selected by treatment, such as EGFR T790M, in patients with NSCLC treated with gefitinib. Chromosomal aberration pattern analysis by low coverage whole genome sequencing is a more universal approach based on genomic instability. Gains and losses of chromosomal regions have been detected in plasma tumor-specific cfDNA as copy number aberrations and can be used to compute a genomic copy number instability (CNI) score of cfDNA. A specific CNI index obtained by massive parallel sequencing discriminated those patients with prostate cancer from both healthy controls and men with benign prostatic disease. Furthermore, androgen receptor gene aberrations in cfDNA were associated with therapeutic resistance in metastatic castration resistant prostate cancer. Change in CNI score has been shown to serve as an early predictor of response to standard chemotherapy for various other cancer types (e.g. NSCLC, colorectal cancer, pancreatic ductal adenocarcinomas). CNI scores have also been shown to predict therapeutic responses to immunotherapy. Serial genomic profiling can detect resistance mutations up to 16 weeks before radiographic progression. 
There is a potential for cost savings when ineffective use of expensive new anticancer drugs is avoided or halted. Challenges for routine implementation of liquid biopsy tests include the necessity of specialized personnel, instrumentation, and software, as well as further development of quality management (e.g. external quality control). Validation of blood-based tumor genomic profiling in additional multicenter outcome studies is necessary; however, cfDNA monitoring can provide clinically important actionable information for precision oncology approaches.
Mapping the Space of Genomic Signatures

PubMed Central

Kari, Lila; Hill, Kathleen A.; Sayem, Abu S.; Karamichalis, Rallis; Bryans, Nathaniel; Davis, Katelyn; Dattani, Nikesh S.

2015-01-01

We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity between the corresponding sequences. The graphical representation of DNA sequences utilized, Chaos Game Representation (CGR), is genome- and species-specific and can thus act as a genomic signature. Consequently, Molecular Distance Maps could inform species identification, taxonomic classifications and, to a certain extent, evolutionary history. The image distance employed, Structural Dissimilarity Index (DSSIM), implicitly compares the occurrences of oligomers of length up to k (herein k = 9) in DNA sequences. We computed DSSIM distances for more than 5 million pairs of complete mitochondrial genomes, and used Multi-Dimensional Scaling (MDS) to obtain Molecular Distance Maps that visually display the sequence relatedness in various subsets, at different taxonomic levels. This general-purpose method does not require DNA sequence alignment and can thus be used to compare similar or vastly different DNA sequences, genomic or computer-generated, of the same or different lengths. We illustrate potential uses of this approach by applying it to several taxonomic subsets: phylum Vertebrata, (super)kingdom Protista, classes Amphibia-Insecta-Mammalia, class Amphibia, and order Primates. This analysis of an extensive dataset confirms that the oligomer composition of full mtDNA sequences can be a source of taxonomic information. 
This method also correctly finds the mtDNA sequences most closely related to that of the anatomically modern human (the Neanderthal, the Denisovan, and the chimp), and that the sequence most different from it in this dataset belongs to a cucumber. PMID:26000734
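The Chaos Game Representation underlying the method can be sketched in a few lines: starting from the centre of the unit square, each base moves the current point halfway towards that base's corner. The corner layout used here (A bottom-left, C top-left, G top-right, T bottom-right) is a common convention and an assumption about the paper's setup; the function names are hypothetical, and the DSSIM image distance and MDS steps are not implemented.

```python
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_points(seq):
    """Return the list of CGR points (x, y) for seq, skipping non-ACGT bases."""
    x, y = 0.5, 0.5
    points = []
    for base in seq.upper():
        if base not in CORNERS:
            continue
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


def cgr_grid(seq, k):
    """Count CGR points in a 2^k x 2^k grid (rows indexed by y, columns by x).

    The first k - 1 points are skipped because they do not yet encode a
    full k-mer.
    """
    size = 2 ** k
    grid = [[0] * size for _ in range(size)]
    for i, (x, y) in enumerate(cgr_points(seq)):
        if i >= k - 1:
            row = min(int(y * size), size - 1)
            col = min(int(x * size), size - 1)
            grid[row][col] += 1
    return grid


if __name__ == "__main__":
    print(cgr_points("AG"))
    print(cgr_grid("ACGTACGT", 2))
```

Each cell of the grid corresponds to one k-mer, which is why comparing such images implicitly compares oligomer occurrences, as the abstract notes.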
The number of reduced alignments between two DNA sequences

PubMed Central

2014-01-01

Background In this study we consider DNA sequences as mathematical strings. Total and reduced alignments between two DNA sequences have been considered in the literature to measure their similarity. Results for explicit representations of some alignments have been already obtained. Results We present exact, explicit and computable formulas for the number of different possible alignments between two DNA sequences and a new formula for a class of reduced alignments. Conclusions A unified approach for a wide class of alignments between two DNA sequences has been provided. The formula is computable and, if complemented by software development, will provide a deeper insight into the theory of sequence alignment and give rise to new comparison methods. AMS Subject Classification Primary 92B05, 33C20, secondary 39A14, 65Q30 PMID:24684679
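For orientation only, the classical count of total alignments between two strings of lengths m and n can be computed with a three-term recurrence, in which each alignment column is a pairing of two characters, a gap in the first string, or a gap in the second. This is a well-known textbook recurrence (the Delannoy numbers), offered here as an assumption about what "total alignments" means; it is not the explicit formula of the paper, and the reduced alignments it studies are not covered. The function name is hypothetical.

```python
def count_alignments(m, n):
    """Count gapped alignments of two strings of lengths m and n.

    Uses f(i, j) = f(i-1, j) + f(i, j-1) + f(i-1, j-1) with f(i, 0) =
    f(0, j) = 1, keeping only one row of the table in memory.
    """
    if m < 0 or n < 0:
        raise ValueError("lengths must be non-negative")
    prev = [1] * (n + 1)
    for _ in range(m):
        cur = [1]
        for j in range(1, n + 1):
            cur.append(prev[j] + cur[j - 1] + prev[j - 1])
        prev = cur
    return prev[n]


if __name__ == "__main__":
    for size in range(1, 5):
        print(size, count_alignments(size, size))
```

The rapid growth of these counts illustrates why closed-form, computable formulas are of interest.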
Novel numerical and graphical representation of DNA sequences and proteins.

PubMed

Randić, M; Novic, M; Vikić-Topić, D; Plavsić, D

2006-12-01

We have introduced novel numerical and graphical representations of DNA, which offer a simple and unique characterization of DNA sequences. The numerical representation of a DNA sequence is given as a sequence of real numbers derived from a unique graphical representation of the standard genetic code. There is no loss of information on the primary structure of a DNA sequence associated with this numerical representation. The novel representations are illustrated with the coding sequences of the first exon of beta-globin gene of half a dozen species in addition to human. The method can be extended to proteins as is exemplified by humanin, a 24-aa peptide that has recently been identified as a specific inhibitor of neuronal cell death induced by familial Alzheimer's disease mutant genes.
Gene Identification Algorithms Using Exploratory Statistical Analysis of Periodicity

NASA Astrophysics Data System (ADS)

Mukherjee, Shashi Bajaj; Sen, Pradip Kumar

2010-10-01

The study of periodic patterns would be expected to be a standard line of attack for recognizing DNA sequences in gene identification and similar problems, yet surprisingly little significant work has been done in this direction. This paper studies statistical properties of the DNA sequences of complete genomes using a new technique. A DNA sequence is converted to a numeric sequence using various types of mappings, and a standard Fourier technique is applied to study the periodicity. Distinct statistical behaviour of the periodicity parameters is found in coding and non-coding sequences, which can be used to distinguish between these parts. Here, DNA sequences of Drosophila melanogaster were analyzed with significant accuracy.
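One common way to turn a DNA sequence into numeric sequences for Fourier analysis is the binary indicator mapping (one 0/1 sequence per base). The sketch below sums the spectral power of the four indicator sequences at a chosen period; it is an illustration of the general idea, assuming this particular mapping, and is not claimed to be the mapping or statistic used by the authors. The function name is hypothetical.

```python
import cmath


def indicator_power(seq, period=3):
    """Return the summed Fourier power of the four base-indicator
    sequences at frequency 1/period."""
    seq = seq.upper()
    total = 0.0
    for base in "ACGT":
        # Fourier coefficient of the 0/1 indicator sequence for this base.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * pos / period)
            for pos, b in enumerate(seq)
            if b == base
        )
        total += abs(coeff) ** 2
    return total


if __name__ == "__main__":
    print(indicator_power("ATGATGATG"))  # strongly period-3
    print(indicator_power("AAAAAAAAA"))  # no period-3 signal
```

Protein-coding regions typically show an elevated period-3 component because of codon structure, which is the kind of contrast between coding and non-coding sequence the abstract describes.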

Sequencing of adenine in DNA by scanning tunneling microscopy

NASA Astrophysics Data System (ADS)

Tanaka, Hiroyuki; Taniguchi, Masateru

2017-08-01

The development of DNA sequencing technology utilizing the detection of a tunnel current is important for next-generation sequencer technologies based on single-molecule analysis technology. Using a scanning tunneling microscope, we previously reported that dI/dV measurements and dI/dV mapping revealed that the guanine base (purine base) of DNA adsorbed onto the Cu(111) surface has a characteristic peak at V_s = -1.6 V. If, in addition to guanine, the other purine base of DNA, namely, adenine, can be distinguished, then by reading all the purine bases of each single strand of a DNA double helix, the entire base sequence of the original double helix can be determined due to the complementarity of the DNA base pair. Therefore, the ability to read adenine is important from the viewpoint of sequencing. Here, we report on the identification of adenine by STM topographic and spectroscopic measurements using a synthetic DNA oligomer and viral DNA.
Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments.

PubMed

Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

2013-09-24

Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousands of years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp.
Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments

PubMed Central

Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

2013-01-01

Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousands of years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp. PMID:24019490
Crystal structure of MboIIA methyltransferase.

PubMed

Osipiuk, Jerzy; Walsh, Martin A; Joachimiak, Andrzej

2003-09-15

DNA methyltransferases (MTases) are sequence-specific enzymes which transfer a methyl group from S-adenosyl-L-methionine (AdoMet) to the amino group of either cytosine or adenine within a recognized DNA sequence. Methylation of a base in a specific DNA sequence protects DNA from nucleolytic cleavage by restriction enzymes recognizing the same DNA sequence. We have determined at 1.74 Å resolution the crystal structure of a beta-class DNA MTase MboIIA (M.MboIIA) from the bacterium Moraxella bovis, the smallest DNA MTase determined to date. M.MboIIA methylates the 3' adenine of the pentanucleotide sequence 5'-GAAGA-3'. The protein crystallizes with two molecules in the asymmetric unit which we propose to resemble the dimer when M.MboIIA is not bound to DNA. The overall structure of the enzyme closely resembles that of M.RsrI. However, the cofactor-binding pocket in M.MboIIA forms a closed structure which is in contrast to the open-form structures of other known MTases.
Analysis of JC virus DNA replication using a quantitative and high-throughput assay

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shin, Jong; Phelan, Paul J.; Chhum, Panharith

2014-11-15

Progressive Multifocal Leukoencephalopathy (PML) is caused by lytic replication of JC virus (JCV) in specific cells of the central nervous system. Like other polyomaviruses, JCV encodes a large T-antigen helicase needed for replication of the viral DNA. Here, we report the development of a luciferase-based, quantitative and high-throughput assay of JCV DNA replication in C33A cells, which, unlike the glial cell lines Hs 683 and U87, accumulate high levels of nuclear T-ag needed for robust replication. Using this assay, we investigated the requirement for different domains of T-ag, and for specific sequences within and flanking the viral origin, in JCV DNA replication. Beyond providing validation of the assay, these studies revealed an important stimulatory role of the transcription factor NF1 in JCV DNA replication. Finally, we show that the assay can be used for inhibitor testing, highlighting its value for the identification of antiviral drugs targeting JCV DNA replication. Highlights: • Development of a high-throughput screening assay for JCV DNA replication using C33A cells. • Evidence that T-ag fails to accumulate in the nuclei of established glioma cell lines. • Evidence that NF-1 directly promotes JCV DNA replication in C33A cells. • Proof-of-concept that the HTS assay can be used to identify pharmacological inhibitors of JCV DNA replication.
Coordination of tRNA transcription with export at nuclear pore complexes in budding yeast.

PubMed

Chen, Miao; Gartenberg, Marc R

2014-05-01

tRNAs are encoded by RNA polymerase III-transcribed genes that reside at seemingly random intervals along the chromosomes of budding yeast. Existing evidence suggests that the genes congregate together at the nucleolus and/or centromeres. In this study, we re-examined spatial and temporal aspects of tRNA gene (tDNA) expression. We show that tDNA transcription fluctuates during cell cycle progression. In M phase, when tRNA synthesis peaks, tDNAs localize at nuclear pore complexes (NPCs). Docking of a tDNA requires the DNA sequence of the contacted gene, nucleoporins Nup60 and Nup2, and cohesin. Characterization of mutants that block NPC localization revealed that docking is a consequence of elevated tDNA transcription. NPC-tDNA contact falters in the absence of the principal exportin of nascent tRNA, Los1, and genetic assays indicate that gating of tDNAs at NPCs favors cytoplasmic accumulation of functional tRNA. Collectively, the data suggest that tDNAs associate with NPCs to coordinate RNA polymerase III transcription with the nuclear export of pre-tRNA. The M-phase specificity of NPC contact reflects a regulatory mechanism that may have evolved, in part, to avoid collisions between DNA replication forks and transcribing RNA polymerase III machinery at NPCs.
Coordination of tRNA transcription with export at nuclear pore complexes in budding yeast

PubMed Central

Chen, Miao; Gartenberg, Marc R.

2014-01-01

tRNAs are encoded by RNA polymerase III-transcribed genes that reside at seemingly random intervals along the chromosomes of budding yeast. Existing evidence suggests that the genes congregate together at the nucleolus and/or centromeres. In this study, we re-examined spatial and temporal aspects of tRNA gene (tDNA) expression. We show that tDNA transcription fluctuates during cell cycle progression. In M phase, when tRNA synthesis peaks, tDNAs localize at nuclear pore complexes (NPCs). Docking of a tDNA requires the DNA sequence of the contacted gene, nucleoporins Nup60 and Nup2, and cohesin. Characterization of mutants that block NPC localization revealed that docking is a consequence of elevated tDNA transcription. NPC–tDNA contact falters in the absence of the principal exportin of nascent tRNA, Los1, and genetic assays indicate that gating of tDNAs at NPCs favors cytoplasmic accumulation of functional tRNA. Collectively, the data suggest that tDNAs associate with NPCs to coordinate RNA polymerase III transcription with the nuclear export of pre-tRNA. The M-phase specificity of NPC contact reflects a regulatory mechanism that may have evolved, in part, to avoid collisions between DNA replication forks and transcribing RNA polymerase III machinery at NPCs. PMID:24788517
Genomics dataset on unclassified published organism (patent US 7547531).

PubMed

Khan Shawan, Mohammad Mahfuz Ali; Hasan, Md Ashraful; Hossain, Md Mozammel; Hasan, Md Mahmudul; Parvin, Afroza; Akter, Salina; Uddin, Kazi Rasel; Banik, Subrata; Morshed, Mahbubul; Rahman, Md Nazibur; Rahman, S M Badier

2016-12-01

Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism, and is therefore crucial for establishing the hierarchical classification of that organism. This dataset (patent US 7547531) was chosen to simplify the complex raw data buried in undisclosed DNA sequences, which helps to open doors for new collaborations. A total of 48 unidentified DNA sequences from patent US 7547531 were selected and their complete sequences were retrieved from the NCBI BioSample database. A quick response (QR) code for each DNA sequence was constructed with the DNA BarID tool. The QR code is useful for the identification and comparison of isolates with other organisms. The AT/GC content of the DNA sequences was determined using the ENDMEMO GC Content Calculator, which indicates their stability at different temperatures. The highest GC content was observed in GP445188 (62.5%), followed by GP445198 (61.8%) and GP445189 (59.44%), while the lowest was in GP445178 (24.39%). In addition, the New England BioLabs (NEB) database was used to identify the cleavage code indicating the 5', 3' and blunt ends, and the enzyme code indicating the methylation sites of the DNA sequences is also shown. These data will be helpful for the construction of the organisms' hierarchical classification, determination of their phylogenetic and taxonomic position and revelation of their molecular characteristics.
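As a rough illustration of the GC-content figures quoted in the record above, the percentage can be computed directly from a sequence string. This is a minimal sketch, not the ENDMEMO calculator's implementation: the function name `gc_content` is hypothetical, and the choice to ignore ambiguous symbols such as N is an assumption made here.

```python
def gc_content(seq):
    """Return the GC content of a DNA sequence as a percentage.

    Assumption: only unambiguous bases (A, C, G, T) are counted;
    any other symbol, such as N, is ignored.
    """
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Toy sequences, not taken from the patent dataset.
print(gc_content("GGCCAATT"))
print(gc_content("gcgcgcat"))
```

A higher GC fraction generally corresponds to a higher duplex melting temperature, which is the sense in which the record links GC content to thermal stability.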
Fluorescent DNA-templated silver nanoclusters

NASA Astrophysics Data System (ADS)

Lin, Ruoqian

Because of the ultra-small size and biocompatibility of silver nanoclusters, they have attracted much research interest for their applications in biolabeling. Among the many ways of synthesizing silver nanoclusters, the DNA-templated method is particularly attractive: the high tunability of DNA sequences provides another degree of freedom for controlling the chemical and photophysical properties. However, systematic studies of how DNA sequences and concentrations control the photophysical properties are still lacking. The aim of this thesis is to investigate the mechanisms by which silver clusters bind to single-stranded DNAs. We report the synthesis and characterization of DNA-templated silver nanoclusters and provide a systematic interrogation of the effects of DNA concentrations and sequences, including lengths and secondary structures. We performed a series of syntheses utilizing five different sequences to explore the optimal synthesis conditions. By characterizing samples with UV-vis and fluorescence spectroscopy, we identified the most suitable reactant ratio and synthesis conditions. Two of the sequences were chosen for further concentration-dependence and sequence-dependence studies. We found that cytosine-rich sequences are more likely to produce silver nanoclusters with stronger fluorescence signals; however, sequences with hairpin secondary structures are more capable of stabilizing silver nanoclusters. In addition, the fluorescence peak emission intensities and wavelengths of the DNA-templated silver clusters have sequence-dependent fingerprints. This could potentially be applied to sequence sensing in the future. However, none of the current conclusions is yet firmly established; it remains difficult to formulate general rules for DNA strand design and silver nanocluster production. Further investigation of more sequences could resolve these questions in the future.
Cloning and sequence analysis of complementary DNA encoding an aberrantly rearranged human T-cell gamma chain.

PubMed Central

Dialynas, D P; Murre, C; Quertermous, T; Boss, J M; Leiden, J M; Seidman, J G; Strominger, J L

1986-01-01

Complementary DNA (cDNA) encoding a human T-cell gamma chain has been cloned and sequenced. At the junction of the variable and joining regions, there is an apparent deletion of two nucleotides in the human cDNA sequence relative to the murine gamma-chain cDNA sequence, resulting simultaneously in the generation of an in-frame stop codon and in a translational frameshift. For this reason, the sequence presented here encodes an aberrantly rearranged human T-cell gamma chain. There are several surprising differences between the deduced human and murine gamma-chain amino acid sequences. These include poor homology in the variable region, poor homology in a discrete segment of the constant region precisely bounded by the expected junctions of exon CII, and the presence in the human sequence of five potential sites for N-linked glycosylation. PMID:3458221
'DNA Strider': a 'C' program for the fast analysis of DNA and protein sequences on the Apple Macintosh family of computers.

PubMed Central

Marck, C

1988-01-01

DNA Strider is a new integrated DNA and protein sequence analysis program written in the C language for the Macintosh Plus, SE and II computers. It has been designed as an easy to learn and use program as well as a fast and efficient tool for day-to-day sequence analysis work. The program consists of a multi-window sequence editor and various DNA and protein analysis functions. The editor may use 4 different types of sequences (DNA, degenerate DNA, RNA and one-letter coded protein) and can handle simultaneously 6 sequences of any type, up to 32.5 kB each. Negative numbering of the bases is allowed for DNA sequences. All classical restriction and translation analysis functions are present and can be performed in any order on any open sequence or part of a sequence. The main feature of the program is that the same analysis function can be repeated several times on different sequences, thus generating multiple windows on the screen. Many graphic capabilities have been incorporated, such as a graphic restriction map, a hydrophobicity profile and the CAI plot (codon adaptation index according to Sharp and Li). The restriction site search uses a newly designed fast hexamer look-ahead algorithm. Typical runtime for the search of all sites with a library of 130 restriction endonucleases is 1 second per 10,000 bases. The circular graphic restriction map of the pBR322 plasmid can therefore be computed from its sequence and displayed on the Macintosh Plus screen within 2 seconds, and its multiline restriction map obtained in a scrolling window within 5 seconds. PMID:2832831
Sequence-dependent DNA deformability studied using molecular dynamics simulations.

PubMed

Fujii, Satoshi; Kono, Hidetoshi; Takenaka, Shigeori; Go, Nobuhiro; Sarai, Akinori

2007-01-01

Proteins recognize specific DNA sequences not only through direct contact between amino acids and bases, but also indirectly based on the sequence-dependent conformation and deformability of the DNA (indirect readout). We used molecular dynamics simulations to analyze the sequence-dependent DNA conformations of all 136 possible tetrameric sequences sandwiched between CGCG sequences. The deformability of dimeric steps obtained by the simulations is consistent with that by the crystal structures. The simulation results further showed that the conformation and deformability of the tetramers can highly depend on the flanking base pairs. The conformations of xATx tetramers show the most rigidity and are not affected by the flanking base pairs and the xYRx show by contrast the greatest flexibility and change their conformations depending on the base pairs at both ends, suggesting tetramers with the same central dimer can show different deformabilities. These results suggest that analysis of dimeric steps alone may overlook some conformational features of DNA and provide insight into the mechanism of indirect readout during protein-DNA recognition. Moreover, the sequence dependence of DNA conformation and deformability may be used to estimate the contribution of indirect readout to the specificity of protein-DNA recognition as well as nucleosome positioning and large-scale behavior of nucleic acids.
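The figure of 136 tetrameric sequences in the record above follows from treating a duplex sequence and its reverse complement as the same step: of the 4^4 = 256 tetramers, 16 are their own reverse complement, giving (256 - 16)/2 + 16 = 136. The short sketch below checks this count by enumeration; the function names are hypothetical and the code is unrelated to the authors' simulation setup.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string over the alphabet A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


def unique_duplex_kmers(k):
    """Set of k-mers that are distinct once a sequence and its reverse
    complement are treated as the same double-stranded step."""
    kmers = ("".join(p) for p in product("ACGT", repeat=k))
    # Represent each duplex by the lexicographically smaller of its two strands.
    return {min(kmer, reverse_complement(kmer)) for kmer in kmers}


print(len(unique_duplex_kmers(4)))  # tetramers
print(len(unique_duplex_kmers(2)))  # dimeric steps
```

The same enumeration gives the 10 unique dimeric steps commonly used in analyses of base-pair step deformability.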
CROSS-DISCIPLINARY PHYSICS AND RELATED AREAS OF SCIENCE AND TECHNOLOGY: Characteristics of alternating current hopping conductivity in DNA sequences

NASA Astrophysics Data System (ADS)

Ma, Song-Shan; Xu, Hui; Wang, Huan-You; Guo, Rui

2009-08-01

This paper presents a model to describe the alternating current (AC) conductivity of DNA sequences, in which DNA is considered as a one-dimensional (1D) disordered system and electrons are transported via hopping between localized states. It finds that the AC conductivity in DNA sequences increases as the frequency of the external electric field rises, and that it takes the form $\sigma_{ac}(\omega) \sim \omega^{2} \ln^{2}(1/\omega)$. The AC conductivity of DNA sequences also increases with temperature, although this dependence is weak. Meanwhile, the AC conductivity in an off-diagonally correlated case is much larger than that in the uncorrelated case of the Anderson limit at low temperatures, which indicates that the off-diagonal correlations in DNA sequences have a great effect on the AC conductivity, while at high temperature the off-diagonal correlations no longer play a vital role in electric transport. In addition, the proportion of nucleotide pairs p also plays an important role in AC electron transport of DNA sequences. For p < 0.5, the conductivity of a DNA sequence decreases with the increase of p, while for p >= 0.5, the conductivity increases with the increase of p.
Sperm DNA damage or progressive motility: which one is the better predictor of fertilization in vitro?

PubMed

Simon, Luke; Lewis, Sheena E M

2011-06-01

Sperm progressive motility has been reported to be one of the key factors influencing in vitro fertilization rates. However, recent studies have shown that sperm DNA fragmentation is a more robust predictor of assisted reproductive outcomes including reduced fertilization rates, embryo quality, and pregnancy rates. This study aimed to compare the usefulness of sperm progressive motility and DNA damage as predictive tools of in vitro fertilization rates. Here, 136 couples provided 1,767 eggs with an overall fertilization rate of 64.2%. The fertilization rate in vitro correlated with both sperm progressive motility (r² = 0.236; P = 0.002) and DNA fragmentation (r² = -0.318; P < 0.001). The relative risk of a poor fertilization rate was 9.5 times higher in sperm of men with high DNA fragmentation (>40%) compared with 2.6 times in sperm with poor motility (<40%). Further, sperm DNA fragmentation gave a higher specificity (93.3%) in predicting the fertilization rate than progressive motility (77.8%). Finally, the odds ratio to determine fertilization rate (>70%) was 4.81 (1.89-12.65) using progressive motility compared with 24.18 (5.21-154.51) using DNA fragmentation. This study shows that fertilization rates are directly dependent upon both sperm progressive motility and DNA fragmentation, but sperm DNA fragmentation is a much stronger test.
Phylogenetic relationships of the Gomphales based on nuc-25S-rDNA, mit-12S-rDNA, and mit-atp6-DNA combined sequences

Treesearch

Admir J. Giachini; Kentaro Hosaka; Eduardo Nouhra; Joseph Spatafora; James M. Trappe

2010-01-01

Phylogenetic relationships among Geastrales, Gomphales, Hysterangiales, and Phallales were estimated via combined sequences: nuclear large subunit ribosomal DNA (nuc-25S-rDNA), mitochondrial small subunit ribosomal DNA (mit-12S-rDNA), and mitochondrial atp6 DNA (mit-atp6-DNA). Eighty-one taxa comprising 19 genera and 58 species...
Impacts of Genome-Wide Analyses on Our Understanding of Human Herpesvirus Diversity and Evolution.

PubMed

Renner, Daniel W; Szpara, Moriah L

2018-01-01

Until fairly recently, genome-wide evolutionary dynamics and within-host diversity were more commonly examined in the context of small viruses than in the context of large double-stranded DNA viruses such as herpesviruses. The high mutation rates and more compact genomes of RNA viruses have inspired the investigation of population dynamics for these species, and recent data now suggest that herpesviruses might also be considered candidates for population modeling. High-throughput sequencing (HTS) and bioinformatics have expanded our understanding of herpesviruses through genome-wide comparisons of sequence diversity, recombination, allele frequency, and selective pressures. Here we discuss recent data on the mechanisms that generate herpesvirus genomic diversity and underlie the evolution of these virus families. We focus on human herpesviruses, with key insights drawn from veterinary herpesviruses and other large DNA virus families. We consider the impacts of cell culture on herpesvirus genomes and how to accurately describe the viral populations under study. The need for a strong foundation of high-quality genomes is also discussed, since it underlies all secondary genomic analyses such as RNA sequencing (RNA-Seq), chromatin immunoprecipitation, and ribosome profiling. Areas where we foresee future progress, such as the linking of viral genetic differences to phenotypic or clinical outcomes, are highlighted as well. Copyright © 2017 Renner and Szpara.
Genome-wide base-resolution mapping of DNA methylation in single cells using single-cell bisulfite sequencing (scBS-seq).

PubMed

Clark, Stephen J; Smallwood, Sébastien A; Lee, Heather J; Krueger, Felix; Reik, Wolf; Kelsey, Gavin

2017-03-01

DNA methylation (DNAme) is an important epigenetic mark in diverse species. Our current understanding of DNAme is based on measurements from bulk cell samples, which obscures intercellular differences and prevents analyses of rare cell types. Thus, the ability to measure DNAme in single cells has the potential to make important contributions to the understanding of several key biological processes, such as embryonic development, disease progression and aging. We have recently reported a method for generating genome-wide DNAme maps from single cells, using single-cell bisulfite sequencing (scBS-seq), allowing the quantitative measurement of DNAme at up to 50% of CpG dinucleotides throughout the mouse genome. Here we present a detailed protocol for scBS-seq that includes our most recent developments to optimize recovery of CpGs, mapping efficiency and success rate; reduce hands-on time; and increase sample throughput with the option of using an automated liquid handler. We provide step-by-step instructions for each stage of the method, comprising cell lysis and bisulfite (BS) conversion, preamplification and adaptor tagging, library amplification, sequencing and, lastly, alignment and methylation calling. An individual with relevant molecular biology expertise can complete library preparation within 3 d. Subsequent computational steps require 1-3 d for someone with bioinformatics expertise.
Method for performing site-specific affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

1999-01-01

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.
Miniaturized reaction vessel system, method for performing site-specific biochemical reactions and affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

2000-01-01

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.
DNABIT Compress - Genome compression algorithm.

PubMed

Rajarajeswari, Pothuraju; Apparao, Allam

2011-01-22

Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, "DNABIT Compress", for DNA sequences, based on a novel scheme of assigning binary bits to smaller segments of DNA bases in order to compress both repetitive and non-repetitive DNA sequences. Our proposed algorithm achieves the best compression ratio for DNA sequences of larger genomes. The significantly better compression results show that the "DNABIT Compress" algorithm performs best among the compression algorithms compared. While achieving the best compression ratios for DNA sequences (genomes), our new DNABIT Compress algorithm significantly improves on the running time of all previous DNA compression programs. Assigning binary bits (a unique bit code) to exact-repeat and reverse-repeat fragments of a DNA sequence is also a concept introduced in this algorithm for the first time in DNA compression. The proposed algorithm achieves a best compression ratio of 1.58 bits/base, whereas the best existing methods could not achieve a ratio below 1.72 bits/base.
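For orientation, the simplest way of assigning binary bits to DNA bases is a fixed 2-bit code per base. The sketch below shows that baseline only; it is not the DNABIT Compress algorithm, which the record describes as additionally coding exact and reverse repeats to get below 2 bits/base. The names `pack` and `unpack` and the particular bit assignment are assumptions made for this illustration.

```python
# Baseline 2-bit packing of DNA bases (4 bases per byte).
# Not DNABIT Compress: a fixed code like this always costs 2 bits/base.

CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}  # arbitrary assignment
BASES = "ACGT"


def pack(seq):
    """Pack a DNA string into bytes, 2 bits per base, padding the last byte with zeros."""
    out = bytearray()
    for start in range(0, len(seq), 4):
        chunk = seq[start:start + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def unpack(data, length):
    """Recover the first `length` bases from packed bytes."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 0b11])
    return "".join(bases[:length])


packed = pack("GATTACA")
print(len(packed), unpack(packed, 7))
```

The original length has to be stored alongside the packed bytes, because the zero padding in the last byte is indistinguishable from trailing A bases.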