Pyle, Angela; Hudson, Gavin; Wilson, Ian J; Coxhead, Jonathan; Smertenko, Tania; Herbert, Mary; Santibanez-Koref, Mauro; Chinnery, Patrick F
2015-05-01
Recent reports have questioned the accepted dogma that mammalian mitochondrial DNA (mtDNA) is strictly maternally inherited. In humans, the argument hinges on detecting a signature of inter-molecular recombination in mtDNA sequences sampled at the population level, inferring a paternal source for the mixed haplotypes. However, interpreting these data is fraught with difficulty, and direct experimental evidence is lacking. Using extreme-high depth mtDNA re-sequencing up to ~1.2 million-fold coverage, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans, thus excluding a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals. Our findings indicate that an active mechanism eliminates paternal mtDNA which likely acts at the molecular level. PMID:25973765
Berger, C; Berger, B; Parson, W
2012-01-01
In recent years, evidence from domestic dogs has increasingly been analyzed by forensic DNA testing. Canine hairs in particular have proved most suitable and practical due to the high rate of hair transfer occurring between dogs and humans. Starting with the description of a contamination-free sample handling procedure, we give a detailed workflow for sequencing hypervariable segments (HVS) of the mtDNA control region from canine evidence. After the hair material is lysed and the DNA extracted by phenol/chloroform, the amplification and sequencing strategy comprises the HVS I and II of the canine control region and is optimized for DNA of medium-to-low quality and quantity. The sequencing procedure is based on the Sanger Big-dye deoxy-terminator method and the separation of the sequencing reaction products is performed on a conventional multicolor fluorescence detection capillary electrophoresis platform. Finally, software-aided base calling and sequence interpretation are illustrated with examples.
Fractal landscape analysis of DNA walks
NASA Technical Reports Server (NTRS)
Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H. E.
1992-01-01
By mapping nucleotide sequences onto a "DNA walk", we uncovered remarkably long-range power law correlations [Nature 356 (1992) 168] that imply a new scale invariant property of DNA. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences, but not in cDNA sequences or intron-less genes. In this paper, we present more explicit evidence to support our findings.
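The "DNA walk" mapping can be sketched in a few lines. The abstract does not spell out the step rule, so this minimal sketch assumes the usual purine/pyrimidine convention (step +1 for a pyrimidine, C or T, and -1 for a purine, A or G); the function name `dna_walk` is hypothetical.

```python
from itertools import accumulate

# Assumed step rule (not stated in the abstract): pyrimidine = +1, purine = -1.
PYRIMIDINES = {"C", "T"}
PURINES = {"A", "G"}


def dna_walk(sequence):
    """Return the walker's cumulative displacement after each nucleotide."""
    steps = []
    for base in sequence.upper():
        if base in PYRIMIDINES:
            steps.append(1)
        elif base in PURINES:
            steps.append(-1)
        else:
            raise ValueError(f"unexpected symbol: {base!r}")
    # Running sum of the +1/-1 steps gives the walk itself.
    return list(accumulate(steps))
```

For example, `dna_walk("ACGTTC")` gives `[-1, 0, -1, 0, 1, 2]`. Long-range correlation would show up in how the fluctuations of such a walk grow with window length, which this sketch does not compute.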
Evidence for recombination in scorpion mitochondrial DNA (Scorpiones: Buthidae).
Gantenbein, Benjamin; Fet, Victor; Gantenbein-Ritter, Iris A; Balloux, François
2005-04-07
There has been very little undisputed evidence for recombination in animal mitochondrial DNA (mtDNA) provided so far. Previous unpublished results suggestive of mtDNA recombination in the scorpion family Buthidae, together with cytological evidence for a unique mechanism of mitochondrial fusion in that family, prompted us to investigate this group in more detail. First, we sequenced the complete mtDNA genome of Mesobuthus gibbosus, and chose two genes opposing each other (16S and coxI). We then sequenced 150 individuals from the natural populations of four species of Buthidae (Old World genera Buthus and Mesobuthus). We observed strong evidence for widespread recombination through highly significant negative correlations between linkage disequilibrium and physical distance in three out of four species. The evidence is further confirmed when using five other tests for recombination and by the presence of a high amount of homoplasy in phylogenetic trees.
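The test for a negative correlation between linkage disequilibrium (LD) and physical distance can be illustrated as follows. This is a sketch under assumptions: the abstract does not say which LD measure or correlation statistic was used, so the standard r² statistic for two biallelic sites and a plain Pearson correlation are used here, and both function names are hypothetical.

```python
import math


def r_squared(haplotypes, site_a, site_b):
    """LD r^2 between two biallelic sites in a list of aligned haplotypes."""
    n = len(haplotypes)
    # Use the allele carried by the first haplotype as the reference allele.
    ref_a = haplotypes[0][site_a]
    ref_b = haplotypes[0][site_b]
    p_a = sum(h[site_a] == ref_a for h in haplotypes) / n
    p_b = sum(h[site_b] == ref_b for h in haplotypes) / n
    p_ab = sum(h[site_a] == ref_a and h[site_b] == ref_b for h in haplotypes) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both sites must be polymorphic")
    d = p_ab - p_a * p_b  # classical disequilibrium coefficient D
    return d * d / denom


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs)
    sy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(sx * sy)
```

One would compute `r_squared` for every pair of polymorphic sites, pair each value with the distance between the sites, and pass the two lists to `pearson`; under strictly clonal inheritance no decline of LD with distance is expected.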
Ahmed, Ikhlak; Sarazin, Alexis; Bowler, Chris; Colot, Vincent; Quesneville, Hadi
2011-09-01
Transposable elements (TEs) and their relics play major roles in genome evolution. However, mobilization of TEs is usually deleterious and strongly repressed. In plants and mammals, this repression is typically associated with DNA methylation, but the relationship between this epigenetic mark and TE sequences has not been investigated systematically. Here, we present an improved annotation of TE sequences and use it to analyze genome-wide DNA methylation maps obtained at single-nucleotide resolution in Arabidopsis. We show that although the majority of TE sequences are methylated, ∼26% are not. Moreover, a significant fraction of TE sequences densely methylated at CG, CHG and CHH sites (where H = A, T or C) have no or few matching small interfering RNA (siRNAs) and are therefore unlikely to be targeted by the RNA-directed DNA methylation (RdDM) machinery. We provide evidence that these TE sequences acquire DNA methylation through spreading from adjacent siRNA-targeted regions. Further, we show that although both methylated and unmethylated TE sequences located in euchromatin tend to be more abundant closer to genes, this trend is least pronounced for methylated, siRNA-targeted TE sequences located 5' to genes. Based on these and other findings, we propose that spreading of DNA methylation through promoter regions explains at least in part the negative impact of siRNA-targeted TE sequences on neighboring gene expression.
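The three methylation contexts named above (CG, CHG and CHH, where H = A, T or C) can be made concrete with a small classifier. This is an illustrative sketch for the top strand only; the function names are hypothetical and the code is not taken from the study.

```python
def cytosine_context(sequence, index):
    """Classify the cytosine at `index` as 'CG', 'CHG' or 'CHH'.

    Returns None when too few downstream bases are available or when a
    downstream base is not one of A, C, G, T.
    """
    seq = sequence.upper()
    if seq[index] != "C":
        raise ValueError("position is not a cytosine")
    downstream = seq[index + 1:index + 3]
    if len(downstream) >= 1 and downstream[0] == "G":
        return "CG"
    if len(downstream) < 2 or any(b not in "ACGT" for b in downstream):
        return None
    # The first downstream base is now known to be H (A, C or T).
    return "CHG" if downstream[1] == "G" else "CHH"


def count_contexts(sequence):
    """Count cytosines per context along the top strand of `sequence`."""
    counts = {"CG": 0, "CHG": 0, "CHH": 0}
    for i, base in enumerate(sequence.upper()):
        if base == "C":
            context = cytosine_context(sequence, i)
            if context is not None:
                counts[context] += 1
    return counts
```

For instance, `count_contexts("CGCAGCTT")` finds one cytosine in each context. A genome-wide analysis would also scan the reverse complement, which this sketch leaves out.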
Wang, Jing; McCord, Bruce
2011-06-01
A common problem in the analysis of forensic DNA evidence is the presence of environmentally degraded and inhibited DNA. Such samples produce a variety of interpretational problems such as allele imbalance, allele dropout and sequence specific inhibition. In an attempt to develop methods to enhance the recovery of this type of evidence, magnetic bead hybridization has been applied to extract and preconcentrate DNA sequences containing short tandem repeat (STR) alleles of interest. In this work, genomic DNA was fragmented by heating, and sequences associated with STR alleles were selectively hybridized to allele-specific biotinylated probes. Each particular biotinylated probe-DNA complex was bound to streptavidin-coated magnetic beads, enabling enrichment of target DNA sequences. Experiments conducted using degraded DNA samples, as well as samples containing a large concentration of inhibitory substances, showed good specificity and recovery of missing alleles. Based on the favorable results obtained with these specific probes, this method should prove useful as a tool to improve the recovery of alleles from degraded and inhibited DNA samples. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Varela, Eduardo S; Lima, João P M S; Galdino, Alexsandro S; Pinto, Luciano da S; Bezerra, Walderly M; Nunes, Edson P; Alves, Maria A O; Grangeiro, Thalles B
2004-01-01
The complete sequences of nuclear ribosomal DNA (nrDNA) internal transcribed spacer regions (ITS/5.8S) were determined for species belonging to six genera from the subtribe Diocleinae as well as for the anomalous genera Calopogonium and Pachyrhizus. Phylogenetic trees constructed by distance matrix, maximum parsimony and maximum likelihood methods showed that Calopogonium and Pachyrhizus were outside the clade Diocleinae (Canavalia, Camptosema, Cratylia, Dioclea, Cymbosema, and Galactia). This finding supports previous morphological, phytochemical, and molecular evidence that Calopogonium and Pachyrhizus do not belong to the subtribe Diocleinae. Within the true Diocleinae clade, the clustering of genera and species was congruent with morphology-based classifications, suggesting that ITS/5.8S sequences can provide enough informative sites to allow resolution below the genus level. This is the first evidence of the phylogeny of subtribe Diocleinae based on nuclear DNA sequences.
D.J. Glass; N. Takebayashi; L. Olson; D.L. Taylor
2013-01-01
The number of sequences from both formally described taxa and uncultured environmental DNA deposited in the International Nucleotide Sequence Databases has increased substantially over the last two decades. Although the majority of these sequences represent authentic gene copies, there is evidence of DNA artifacts in these databases as well. These include lab artifacts...
Isolation and characterization of DNA from archaeological bone.
Hagelberg, E; Clegg, J B
1991-04-22
DNA was extracted from human and animal bones recovered from archaeological sites and mitochondrial DNA sequences were amplified from the extracts using the polymerase chain reaction. Evidence is presented that the amplified sequences are authentic and do not represent contamination by extraneous DNA. The results show that significant amounts of genetic information can survive for long periods in bone, and have important implications for evolutionary genetics, anthropology and forensic science.
Repair of DNA damage caused by cytosine deamination in mitochondrial DNA of forensic case samples.
Gorden, Erin M; Sturk-Andreaggi, Kimberly; Marshall, Charla
2018-05-01
DNA sequence damage from cytosine deamination is well documented in degraded samples, such as those from ancient and forensic contexts. This study examined the effect of a DNA repair treatment on mitochondrial DNA (mtDNA) from aged and degraded skeletal samples. DNA extracts from 21 non-probative, degraded skeletal samples (aged 50-70 years) were utilized for the analysis. A portion of each sample extract was subjected to DNA repair using a commercial repair kit, the New England BioLabs' NEBNext FFPE DNA Repair Kit (Ipswich, MA). MtDNA was enriched using PCR and targeted capture in a side-by-side experiment of untreated and repaired DNA. Sequencing was performed using both traditional (Sanger-type; STS) and next-generation sequencing (NGS) methods. Although cytosine deamination was evident in the mtDNA sequence data, the observed level of damaged bases varied by sequencing method as well as by enrichment type. The STS PCR amplicon data did not show evidence of cytosine deamination that could be distinguished from background signal in either the untreated or repaired sample set. However, the same PCR amplicons showed 850 C → T/G → A substitutions consistent with cytosine deamination with variant frequencies (VFs) of up to 25% when sequenced using NGS methods. The occurrence of base misincorporation due to cytosine deamination was reduced by 98% (to 10) in the NGS amplicon data after repair. The NGS capture data indicated low levels (1-2%) of cytosine deamination in mtDNA fragments that was effectively mitigated by DNA repair. The observed difference in the level of cytosine deamination between the PCR and capture enrichment methods can be attributed to the greater propensity for stochastic effects from the PCR enrichment technique employed (e.g., low template input, increased PCR cycles). Altogether these results indicate that DNA repair may be required when sequencing PCR-amplified DNA from degraded forensic case samples with NGS methods.
Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
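Counting deamination-like substitutions and checking the reported reduction can be sketched as below. The helper names are hypothetical and the code assumes a read already aligned to the reference with no gaps; it is not the pipeline used in the study.

```python
def deamination_like_mismatches(reference, read):
    """Count C>T and G>A mismatches between two aligned, equal-length strings."""
    if len(reference) != len(read):
        raise ValueError("sequences must be aligned to the same length")
    counts = {"C>T": 0, "G>A": 0}
    for ref_base, read_base in zip(reference.upper(), read.upper()):
        if ref_base == "C" and read_base == "T":
            counts["C>T"] += 1
        elif ref_base == "G" and read_base == "A":
            counts["G>A"] += 1
    return counts


def percent_reduction(before, after):
    """Percentage drop from `before` to `after`."""
    return 100.0 * (before - after) / before
```

With the figures quoted in the abstract, `percent_reduction(850, 10)` evaluates to roughly 98.8, in line with the stated reduction of 98%.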
Nakamura, Ryohei; Uno, Ayako; Kumagai, Masahiko; Fukushima, Hiroto S.; Morishita, Shinichi; Takeda, Hiroyuki
2017-01-01
The heavily methylated vertebrate genomes are punctuated by stretches of poorly methylated DNA sequences that usually mark gene regulatory regions. It is known that the methylation state of these regions confers transcriptional control over their associated genes. Given its governance over the transcriptome, cellular functions and identity, the genome-wide DNA methylation pattern is tightly regulated and evidently predefined. However, how the methylation pattern is determined in vivo remains enigmatic. Based on in silico and in vitro evidence, recent studies proposed that the regional hypomethylated state is primarily determined by local DNA sequence, e.g., high CpG density and presence of specific transcription factor binding sites. Nonetheless, the dependency of DNA methylation on nucleotide sequence has not been carefully validated in vertebrates in vivo. Herein, with the use of medaka (Oryzias latipes) as a model, the sequence dependency of DNA methylation was intensively tested in vivo. Our statistical modeling confirmed the strong statistical association between nucleotide sequence pattern and methylation state in the medaka genome. However, by manipulating the methylation state of a number of genomic sequences and reintegrating them into medaka embryos, we demonstrated that artificially conferred DNA methylation states were predominantly and robustly maintained in vivo, regardless of their sequences and endogenous states. This feature was also observed in the medaka transgene that had passed across generations. Thus, despite the observed statistical association, nucleotide sequence was unable to autonomously determine its own methylation state in medaka in vivo. Our results argue against the notion that nucleotide sequence governs DNA methylation, and instead suggest the involvement of other epigenetic factors in defining and maintaining the DNA methylation landscape. Further investigation in other vertebrate models in vivo will be needed for the generalization of our observations made in medaka. PMID:29267279
Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.
Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene
2017-02-01
Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The sizes of sequence strings found in the rDNA-like signal intergrade into the sizes of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggest that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Widespread recombination in published animal mtDNA sequences.
Tsaousis, A D; Martin, D P; Ladoukakis, E D; Posada, D; Zouros, E
2005-04-01
Mitochondrial DNA (mtDNA) recombination has been observed in several animal species, but there are doubts as to whether it is common or only occurs under special circumstances. Animal mtDNA sequences retrieved from public databases were unambiguously aligned and rigorously tested for evidence of recombination. At least 30 recombination events were detected among 186 alignments examined. Recombinant sequences were found in invertebrates and vertebrates, including primates. It appears that mtDNA recombination may occur regularly in the animal cell but rarely produces new haplotypes because of homoplasmy. Common animal mtDNA recombination would necessitate a reexamination of phylogenetic and biohistorical inference based on the assumption of clonal mtDNA transmission. Recombination may also have an important role in producing and purging mtDNA mutations and thus in mtDNA-based diseases and senescence.
A complete Neandertal mitochondrial genome sequence determined by high-throughput sequencing
Green, Richard E.; Malaspinas, Anna-Sapfo; Krause, Johannes; Briggs, Adrian W.; Johnson, Philip L. F.; Uhler, Caroline; Meyer, Matthias; Good, Jeffrey M.; Maricic, Tomislav; Stenzel, Udo; Prüfer, Kay; Siebauer, Michael; Burbano, Hernán A.; Ronan, Michael; Rothberg, Jonathan M.; Egholm, Michael; Rudan, Pavao; Brajković, Dejana; Kućan, Željko; Gušić, Ivan; Wikström, Mårten; Laakkonen, Liisa; Kelso, Janet; Slatkin, Montgomery; Pääbo, Svante
2008-01-01
A complete mitochondrial (mt) genome sequence was reconstructed from a 38,000-year-old Neandertal individual using 8,341 mtDNA sequences identified among 4.8 Gb of DNA generated from ~0.3 grams of bone. Analysis of the assembled sequence unequivocally establishes that the Neandertal mtDNA falls outside the variation of extant human mtDNAs and allows an estimate of the divergence date between the two mtDNA lineages of 660,000±140,000 years. Of the 13 proteins encoded in the mtDNA, subunit 2 of cytochrome c oxidase of the mitochondrial electron transport chain has experienced the largest number of amino acid substitutions in human ancestors since the separation from Neandertals. There is evidence that purifying selection in the Neandertal mtDNA was reduced compared to other primate lineages suggesting that the effective population size of Neandertals was small. PMID:18692465
Noninvasive diagnosis of fetal aneuploidy by shotgun sequencing DNA from maternal blood
Fan, H. Christina; Blumenfeld, Yair J.; Chitkara, Usha; Hudgins, Louanne; Quake, Stephen R.
2008-01-01
We directly sequenced cell-free DNA with high-throughput shotgun sequencing technology from plasma of pregnant women, obtaining, on average, 5 million sequence tags per patient sample. This enabled us to measure the over- and underrepresentation of chromosomes from an aneuploid fetus. The sequencing approach is polymorphism-independent and therefore universally applicable for the noninvasive detection of fetal aneuploidy. Using this method, we successfully identified all nine cases of trisomy 21 (Down syndrome), two cases of trisomy 18 (Edward syndrome), and one case of trisomy 13 (Patau syndrome) in a cohort of 18 normal and aneuploid pregnancies; trisomy was detected at gestational ages as early as the 14th week. Direct sequencing also allowed us to study the characteristics of cell-free plasma DNA, and we found evidence that this DNA is enriched for sequences from nucleosomes. PMID:18838674
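Measuring over- or underrepresentation of a chromosome from sequence-tag counts can be sketched as follows. The abstract does not give the statistic, so this sketch assumes a simple z-score of the sample's tag fraction against a set of reference (euploid) samples; the function names and the numbers in the usage note are hypothetical.

```python
from statistics import mean, stdev


def chromosome_fraction(tag_counts, chromosome):
    """Fraction of all mapped sequence tags that fall on `chromosome`."""
    total = sum(tag_counts.values())
    return tag_counts[chromosome] / total


def z_score(sample_fraction, reference_fractions):
    """Standardised deviation of a sample from the reference distribution.

    A large positive value suggests overrepresentation (e.g. a trisomy),
    a large negative value underrepresentation.
    """
    return (sample_fraction - mean(reference_fractions)) / stdev(reference_fractions)
```

As a made-up illustration, a sample fraction of 0.016 against reference fractions of 0.013, 0.014 and 0.015 gives a z-score of about 2. A real analysis would also need to handle sequencing biases such as GC content, which this sketch ignores.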
Extracting DNA words based on the sequence features: non-uniform distribution and integrity.
Li, Zhi; Cao, Hongyan; Cui, Yuehua; Zhang, Yanbo
2016-01-25
DNA sequence can be viewed as an unknown language with words as its functional units. Given that most sequence alignment algorithms such as the motif discovery algorithms depend on the quality of background information about sequences, it is necessary to develop an ab initio algorithm for extracting the "words" based only on the DNA sequences. We considered that non-uniform distribution and integrity were two important features of a word, based on which we developed an ab initio algorithm to extract "DNA words" that have potential functional meaning. A Kolmogorov-Smirnov test was used to test consistency with a uniform distribution of DNA sequences, and the integrity was judged by the sequence and position alignment. Two random base sequences were adopted as negative control, and an English book was used as positive control to verify our algorithm. We applied our algorithm to the genomes of Saccharomyces cerevisiae and 10 strains of Escherichia coli to show the utility of the methods. The results provide strong evidence that the algorithm is a promising tool for building a DNA dictionary ab initio. Our method provides a fast way for large scale screening of important DNA elements and offers potential insights into the understanding of a genome.
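The uniformity check can be illustrated with a one-sample Kolmogorov-Smirnov statistic. The abstract does not describe how the test was set up, so this sketch assumes it is applied to the start positions of a candidate word along a sequence of known length; the function name is hypothetical and only the statistic D is returned, not a p-value.

```python
def ks_statistic_uniform(positions, length):
    """One-sample KS statistic D of `positions` against Uniform(0, length)."""
    if not positions:
        raise ValueError("at least one position is required")
    scaled = sorted(p / length for p in positions)
    n = len(scaled)
    d = 0.0
    for i, x in enumerate(scaled, start=1):
        # Compare the uniform CDF (x itself) with the empirical CDF
        # just after and just before the i-th ordered observation.
        d = max(d, i / n - x, x - (i - 1) / n)
    return d
```

Evenly spread positions such as `[25, 50, 75]` on a sequence of length 100 give D = 0.25, whereas positions clustered at one end give a value close to 1, which is the kind of non-uniformity the abstract associates with a word.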
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sobottka, Marcelo, E-mail: sobottka@mtm.ufsc.br; Hart, Andrew G., E-mail: ahart@dim.uchile.cl
Highlights: (1) We propose a simple stochastic model to construct primitive DNA sequences. (2) The model provides an explanation for Chargaff's second parity rule in primitive DNA sequences. (3) The model is also used to predict a novel type of strand symmetry in primitive DNA sequences. (4) We extend the results to bacterial DNA sequences and compare distributional properties intrinsic to the model to statistical estimates from 1049 bacterial genomes. (5) We find statistical evidence that the novel type of strand symmetry holds for bacterial DNA sequences. Abstract: Chargaff's second parity rule for short oligonucleotides states that the frequency of any short nucleotide sequence on a strand is approximately equal to the frequency of its reverse complement on the same strand. Recent studies have shown that, with the exception of organellar DNA, this parity rule generally holds for double-stranded DNA genomes and fails to hold for single-stranded genomes. While Chargaff's first parity rule is fully explained by the Watson-Crick pairing in the DNA double helix, a definitive explanation for the second parity rule has not yet been determined. In this work, we propose a model based on a hidden Markov process for approximating the distributional structure of primitive DNA sequences. Then, we use the model to provide another possible theoretical explanation for Chargaff's second parity rule, and to predict novel distributional aspects of bacterial DNA sequences.
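Chargaff's second parity rule as stated above can be checked empirically on a single strand. The sketch below simply tabulates each k-mer count next to the count of its reverse complement; it is an illustration of the rule, not the hidden Markov model proposed in the paper, and the function names are hypothetical.

```python
from collections import Counter

# Translation table for complementing A/C/G/T.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(word):
    """Reverse complement of a DNA word written 5' to 3'."""
    return word.translate(COMPLEMENT)[::-1]


def kmer_counts(sequence, k):
    """Count every overlapping k-mer on the given strand."""
    seq = sequence.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def parity_table(sequence, k):
    """Map each observed k-mer to (its count, count of its reverse complement)."""
    counts = kmer_counts(sequence, k)
    return {word: (counts[word], counts.get(reverse_complement(word), 0))
            for word in counts}
```

On a long double-stranded genome the two numbers in each pair are expected to be close when the rule holds; on a toy string such as `"AACGTT"` with `k = 2` they match exactly because the string is its own reverse complement.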
In silico evidence for sequence-dependent nucleosome sliding
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lequieu, Joshua; Schwartz, David C.; de Pablo, Juan J.
Nucleosomes represent the basic building block of chromatin and provide an important mechanism by which cellular processes are controlled. The locations of nucleosomes across the genome are not random but instead depend on both the underlying DNA sequence and the dynamic action of other proteins within the nucleus. These processes are central to cellular function, and the molecular details of the interplay between DNA sequence and nucleosome dynamics remain poorly understood. In this work, we investigate this interplay in detail by relying on a molecular model, which permits development of a comprehensive picture of the underlying free energy surfaces and the corresponding dynamics of nucleosome repositioning. The mechanism of nucleosome repositioning is shown to be strongly linked to DNA sequence and directly related to the binding energy of a given DNA sequence to the histone core. It is also demonstrated that chromatin remodelers can override DNA-sequence preferences by exerting torque, and the histone H4 tail is then identified as a key component by which DNA-sequence, histone modifications, and chromatin remodelers could in fact be coupled.
Spooner, David M; Ruess, Holly; Iorizzo, Massimo; Senalik, Douglas; Simon, Philipp
2017-02-01
We explored the phylogenetic utility of entire plastid DNA sequences in Daucus and compared the results with prior phylogenetic results using plastid and nuclear DNA sequences. We used Illumina sequencing to obtain full plastid sequences of 37 accessions of 20 Daucus taxa and outgroups, analyzed the data with phylogenetic methods, and examined evidence for mitochondrial DNA transfer to the plastid (DcMP). Our phylogenetic trees of the entire data set were highly resolved, with 100% bootstrap support for most of the external and many of the internal clades, except for the clade of D. carota and its most closely related species D. syrticus. Subsets of the data, including regions traditionally used as phylogenetically informative regions, provide various degrees of soft congruence with the entire data set. There are areas of hard incongruence, however, with phylogenies using nuclear data. We extended knowledge of a mitochondrial to plastid DNA insertion sequence previously named DcMP and identified the first instance in flowering plants of a sequence of potential nuclear genome origin inserted into the plastid genome. There is a relationship of inverted repeat junction classes and repeat DNA to phylogeny, but no such relationship with nonsynonymous mutations. Our data have allowed us to (1) produce a well-resolved plastid phylogeny of Daucus, (2) evaluate subsets of the entire plastid data for phylogeny, (3) examine evidence for plastid and nuclear DNA phylogenetic incongruence, and (4) examine mitochondrial and nuclear DNA insertion into the plastid. © 2017 Spooner et al. Published by the Botanical Society of America. This work is licensed under a Creative Commons public domain license (CC0 1.0).
Berry, Neil; Jenkins, Adrian; Martin, Javier; Davis, Clare; Wood, David; Schild, Geoffrey; Bottiger, Margareta; Holmes, Harvey; Minor, Philip; Almond, Neil
2005-02-25
Inoculation of live experimental oral poliovirus vaccines (OPV CHAT) during the 1950s in central Africa has been proposed to account for the introduction of HIV into human populations. For this to have occurred, it would have been necessary for chimpanzee rather than macaque kidney epithelial cells to have been included in the preparation of early OPV materials. Theoretically, this could have led to contamination with a progenitor of HIV-1 derived from a related simian immunodeficiency virus of chimpanzees (SIVCPZ). In this article we present further detailed analyses of two samples of OPV, CHAT 10A-11 and CHAT 6039/Yugo, which were used in early human trials of poliovirus vaccination. Recovery of poliovirus by culture techniques confirmed the biological viability of the vaccines and sequence analysis of poliovirus RNA specifically identified the presence of the CHAT strain. Independent nested sets of oligonucleotide primers specific for HIV-1/SIVCPZ and HIV-2/SIVMAC/SIVSM phylogenetic lineages, respectively, indicated no evidence of HIV/SIV RNA in either vaccine preparation, at a sensitivity of 100 RNA equivalents/ml. Analysis of cellular substrate by the amplification of two distinct regions of mitochondrial DNA (D-loop control region and 12S ribosomal sequences) revealed no evidence of chimpanzee cellular sequences. However, this approach positively identified rhesus and cynomolgus macaque DNA for the CHAT 10A-11 and CHAT 6039/Yugo vaccine preparations, respectively. Analysis of multiple clones of mtDNA 12S rDNA indicated a relatively high number of nuclear mitochondrial DNA sequences (numts) in the CHAT 10A-11 material, but confirmed the macaque origin of cellular substrate used in vaccine preparation. These data reinforce earlier findings on this topic, providing no evidence to support the contention that poliovirus vaccination was responsible for the introduction of HIV into humans and for sparking the AIDS pandemic.
Lam, Kathy N; Charles, Trevor C
2015-01-01
Clone libraries provide researchers with a powerful resource to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, and allowed the mining of novel enzymes. Libraries are often constructed by cloning large inserts into cosmid or fosmid vectors. Recently, there have been reports of GC bias in fosmid metagenomic libraries, and it was speculated to be a result of fragmentation and loss of AT-rich sequences during cloning. However, evidence in the literature suggests that transcriptional activity or gene product toxicity may play a role. To explore possible mechanisms responsible for sequence bias in clone libraries, we constructed a cosmid library from a human microbiome sample and sequenced DNA from different steps during library construction: crude extract DNA, size-selected DNA, and cosmid library DNA. We confirmed a GC bias in the final cosmid library, and we provide evidence that the bias is not due to fragmentation and loss of AT-rich sequences but is likely occurring after DNA is introduced into Escherichia coli. To investigate the influence of strong constitutive transcription, we searched the sequence data for promoters and found that rpoD/σ(70) promoter sequences were underrepresented in the cosmid library. Furthermore, when we examined the genomes of taxa that were differentially abundant in the cosmid library relative to the original sample, we found the bias to be more correlated with the number of rpoD/σ(70) consensus sequences in the genome than with simple GC content. The GC bias of metagenomic libraries does not appear to be due to DNA fragmentation. Rather, analysis of promoter sequences provides support for the hypothesis that strong constitutive transcription from sequences recognized as rpoD/σ(70) consensus-like in E. coli may lead to instability, causing loss of the plasmid or loss of the insert DNA that gives rise to the transcription. 
Despite widespread use of E. coli to propagate foreign DNA in metagenomic libraries, the effects of in vivo transcriptional activity on clone stability are not well understood. Further work is required to tease apart the effects of transcription from those of gene product toxicity.
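Comparing GC content across the stages of library construction can be sketched as below. The function names and the idea of summarising each stage by its mean per-read GC fraction are assumptions made for illustration; the abstract does not describe how the bias was quantified.

```python
def gc_content(sequence):
    """Fraction of G and C among the unambiguous bases of `sequence`."""
    seq = sequence.upper()
    unambiguous = sum(seq.count(base) for base in "ACGT")
    if unambiguous == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return (seq.count("G") + seq.count("C")) / unambiguous


def mean_gc_by_stage(reads_by_stage):
    """Mean per-read GC fraction for each named stage.

    `reads_by_stage` maps a stage label (for example 'crude extract',
    'size-selected', 'cosmid library') to a list of read sequences.
    """
    return {stage: sum(gc_content(r) for r in reads) / len(reads)
            for stage, reads in reads_by_stage.items()}
```

A higher mean in the final library than in the crude extract would be consistent with the GC bias described above, although GC content alone cannot separate fragmentation from the promoter-driven instability the authors propose.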
An immunoassay for the study of DNA-binding activities of herpes simplex virus protein ICP8.
Lee, C K; Knipe, D M
1985-06-01
An immunoassay was used to examine the interaction between a herpes simplex virus protein, ICP8, and various types of DNA. The advantage of this assay is that the protein is not subjected to harsh purification procedures. We characterized the binding of ICP8 to both single-stranded (ss) and double-stranded (ds) DNA. ICP8 bound ss DNA fivefold more efficiently than ds DNA, and both binding activities were most efficient in 150 mM NaCl. Two lines of evidence indicate that the binding activities were not identical: (i) ds DNA failed to compete with ss DNA binding even with a large excess of ds DNA; (ii) Scatchard plots of DNA binding with various amounts of DNA were fundamentally different for ss DNA and ds DNA. However, the two activities were related in that ss DNA efficiently competed with the binding of ds DNA. We conclude that the ds DNA-binding activity of ICP8 is probably distinct from the ss DNA-binding activity. No evidence for sequence-specific ds DNA binding was obtained for either the entire herpes simplex virus genome or cloned viral sequences.
Armstrong, Miles R; Husmeier, Dirk; Phillips, Mark S; Blok, Vivian C
2007-06-01
The discovery that the potato cyst nematode Globodera pallida has a multipartite mitochondrial DNA (mtDNA) composed, at least in part, of six small circular mtDNAs (scmtDNAs) raised a number of questions concerning the population-level processes that might act on such a complex genome. Here we report our observations on the distribution of some scmtDNAs among a sample of European and South American G. pallida populations. We describe the occurrence of sequence variants of scmtDNA IV in population P4A from South America, and the observation that particular sequence variants are common to the individuals within a single cyst. Evidence for recombination of sequence variants of scmtDNA IV in P4A is also reported. The mosaic structure of P4A scmtDNA IV sequences was revealed using several detection methods and recombination breakpoints were independently detected by maximum likelihood and Bayesian MCMC methods.
Statistical properties of DNA sequences
NASA Technical Reports Server (NTRS)
Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Simons, M.; Stanley, H. E.
1995-01-01
We review evidence supporting the idea that the DNA sequence in genes containing non-coding regions is correlated, and that the correlation is remarkably long range--indeed, nucleotides thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationarity" feature of the sequence of base pairs by applying a new algorithm called detrended fluctuation analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and non-coding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to every DNA sequence (33301 coding and 29453 non-coding) in the entire GenBank database. Finally, we describe briefly some recent work showing that the non-coding sequences have certain statistical features in common with natural and artificial languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts. These statistical properties of non-coding sequences support the possibility that non-coding regions of DNA may carry biological information.
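The abstract above names detrended fluctuation analysis (DFA) without describing it. The following is a minimal sketch of the usual first-order DFA procedure, not the authors' implementation: the step rule (pyrimidine +1, purine -1), the box sizes, and all function names are assumptions chosen for illustration. An exponent near 0.5 is what an uncorrelated sequence would give; values clearly above 0.5 would point to long-range correlation.

```python
import math


def dna_walk_steps(seq):
    """Map a DNA string to +1/-1 steps (assumed rule: pyrimidine +1, anything else -1)."""
    return [1 if base in "CT" else -1 for base in seq.upper()]


def dfa_fluctuation(steps, box):
    """Root-mean-square fluctuation F(box) around a linear trend fitted in each box."""
    mean = sum(steps) / len(steps)
    profile, total = [], 0.0
    for u in steps:  # integrate the mean-centred steps into a "walk" profile
        total += u - mean
        profile.append(total)
    x_mean = (box - 1) / 2
    sxx = sum((x - x_mean) ** 2 for x in range(box))
    n_boxes = len(profile) // box  # trailing points that do not fill a box are ignored
    squared = 0.0
    for b in range(n_boxes):
        ys = profile[b * box:(b + 1) * box]
        y_mean = sum(ys) / box
        # least-squares slope of the local trend inside this box
        slope = sum((x - x_mean) * (y - y_mean) for x, y in enumerate(ys)) / sxx
        squared += sum((y - y_mean - slope * (x - x_mean)) ** 2
                       for x, y in enumerate(ys))
    return math.sqrt(squared / (n_boxes * box))


def dfa_exponent(steps, boxes=(8, 16, 32, 64, 128)):
    """Slope of log F(n) against log n over the chosen (assumed) box sizes."""
    xs = [math.log(n) for n in boxes]
    ys = [math.log(dfa_fluctuation(steps, n)) for n in boxes]
    xm, ym = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - xm) * (y - ym) for x, y in zip(xs, ys))
    return num / sum((x - xm) ** 2 for x in xs)
```

Detrending inside each box is what lets the method cope with the patchy, non-stationary base composition mentioned in the abstract; the sketch assumes a sequence much longer than the largest box.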
Simon, J W; Slabas, A R
1998-09-18
The GenBank database was searched using the E. coli malonyl CoA:ACP transacylase (MCAT) sequence, for plant protein/cDNA sequences corresponding to MCAT, a component of plant fatty acid synthetase (FAS), for which the plant cDNA has not been isolated. A 272-bp Zea mays EST sequence (GenBank accession number: AA030706) was identified which has strong homology to the E. coli MCAT. A PCR derived cDNA probe from Zea mays was used to screen a Brassica napus (rape) cDNA library. This resulted in the isolation of a 1200-bp cDNA clone which encodes an open reading frame corresponding to a protein of 351 amino acids. The protein shows 47% homology to the E. coli MCAT amino acid sequence in the coding region for the mature protein. Expression of a plasmid (pMCATrap2) containing the plant cDNA sequence in Fab D89, an E. coli mutant defective in MCAT activity, restores growth, demonstrating functional complementation and direct function of the cloned cDNA. This is the first functional evidence supporting the identification of a plant cDNA for MCAT.
Evidence for recombination of mitochondrial DNA in triploid crucian carp.
Guo, Xinhong; Liu, Shaojun; Liu, Yun
2006-03-01
In this study, we report the complete mitochondrial DNA (mtDNA) sequences of the allotetraploid and triploid crucian carp and compare the complete mtDNA sequences between the triploid crucian carp and its female parent Japanese crucian carp and between the triploid crucian carp and its male parent allotetraploid. Our results indicate that the complete mtDNA nucleotide identity (98%) between the triploid crucian carp and its male parent allotetraploid was higher than that (93%) between the triploid crucian carp and its female parent Japanese crucian carp. Moreover, the presence of a pattern of identity and difference at synonymous sites of mitochondrial genomes between the triploid crucian carp and its parents provides direct evidence that triploid crucian carp possesses a recombinant mtDNA fragment (12,759 bp) derived from the paternal fish. These results suggest that mtDNA recombination was derived from the fusion of the maternal and paternal mtDNAs. Compared with the haploid egg with one set of genome from the Japanese crucian carp, the diploid sperm with two sets of genomes from the allotetraploid could more easily make its mtDNA fuse with the mtDNA of the haploid egg. In addition, the triple hybrid nature of the triploid crucian carp probably facilitated mtDNA recombination. In summary, our results provide the first evidence of mtDNA recombination in polyploid fish.
Sequencing to Station in 12 Months (Targeting Orbital 5 Launch, March 30th)
NASA Technical Reports Server (NTRS)
Smith, David J.; Burton, Aaron Steven
2015-01-01
The Biomolecule Sequencer is a Commercial Off-The-Shelf device developed by Oxford Nanopore Technologies and implements a method of DNA sequencing unlike any other current sequencers. The device measures changes in electrical current through a nanopore depending on the sequence of the DNA strand that is passing through it. Since the technology is built on nanometer-scale ion pores, the hardware itself is exceptionally small (3 x 1 x 5/8 inches), lightweight (less than 120 grams with USB cable), and powered only by a USB connection. The sequencing device is permanent, while the flow cells, to which the samples are added, are periodically replaced. The goal of our upcoming technology demonstration on ISS is to provide evidence that DNA sequencing in space is possible, which holds the exciting potential to enable the identification of microorganisms, monitor changes in microbes and humans in response to spaceflight, and possibly aid in the detection of DNA-based life elsewhere in the universe.
Pfeiffer, H; Hühne, J; Ortmann, C; Waterkamp, K; Brinkmann, B
1999-01-01
The analysis of mitochondrial DNA (mtDNA) from shed hairs has gained high importance in forensic casework since telogen hairs are one of the most common types of evidence left at the crime scene. In this systematic study of hair shafts from 20 individuals, the correlation of mtDNA recovery with hair morphology (length, diameter, volume, colour), with sex, and with body localisation (head, armpit, pubis) was investigated. The highest average success rate of hypervariable region 1 (HV 1) sequencing was found in head hair shafts (75%) followed by pubic (66%) and axillary hair shafts (52%). No statistically significant correlation between morphological parameters or sex and the success rate of sequencing was found. MtDNA sequences of buccal cells, head, pubic and axillary hair shafts did not show intraindividual differences. Heteroplasmic base positions were observed neither in the hair shafts nor in control samples of buccal cells.
A reanalysis of the indirect evidence for recombination in human mitochondrial DNA.
Piganeau, G; Eyre-Walker, A
2004-04-01
In an attempt to resolve the controversy about whether recombination occurs in human mtDNA, we have analysed three recently published data sets of complete mtDNA sequences along with 10 RFLP data sets. We have analysed the relationship between linkage disequilibrium (LD) and distance between sites under a variety of conditions using two measures of LD, r² and |D'|. We find that there is a negative correlation between r² and distance in the majority of data sets, but no overall trend for |D'|. Five out of six mtDNA sequence data sets show an excess of homoplasy, but this could be due to either recombination or hypervariable sites. Two additional recombination detection methods used, Geneconv and Maximum Chi-Square, showed nonsignificant results. The overall significance of these findings is hard to quantify because of nonindependence, but our results suggest a lack of evidence for recombination in human mtDNA.
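The two LD measures named in this abstract have standard textbook definitions for a pair of biallelic sites. The sketch below computes them from a list of haplotypes; it is an illustration under assumed conventions (alleles coded 0/1, a hypothetical function name `ld_stats`), not the authors' analysis pipeline.

```python
def ld_stats(haplotypes):
    """Return (r_squared, abs_d_prime) for two biallelic sites.

    haplotypes: list of (allele_at_site_1, allele_at_site_2) pairs, alleles coded 0 or 1.
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n                     # frequency of allele 1 at site 1
    p_b = sum(b for _, b in haplotypes) / n                     # frequency of allele 1 at site 2
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    if p_a in (0.0, 1.0) or p_b in (0.0, 1.0):
        raise ValueError("both sites must be polymorphic")
    d = p_ab - p_a * p_b                                        # raw disequilibrium coefficient D
    r_squared = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    return r_squared, abs(d) / d_max


# Only three of the four possible haplotypes are present, so |D'| is 1
# even though r-squared is well below 1.
sample = [(1, 1)] * 4 + [(1, 0)] * 2 + [(0, 0)] * 4
r2, d_prime = ld_stats(sample)
```

The example shows one reason the two measures can behave differently: |D'| stays at its maximum until all four haplotypes appear, whereas r² also reflects the allele frequencies.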
Modeling the integration of bacterial rRNA fragments into the human cancer genome.
Sieber, Karsten B; Gajer, Pawel; Dunning Hotopp, Julie C
2016-03-21
Cancer is a disease driven by the accumulation of genomic alterations, including the integration of exogenous DNA into the human somatic genome. We previously identified in silico evidence of DNA fragments from a Pseudomonas-like bacteria integrating into the 5'-UTR of four proto-oncogenes in stomach cancer sequencing data. The functional and biological consequences of these bacterial DNA integrations remain unknown. Modeling of these integrations suggests that the previously identified sequences cover most of the sequence flanking the junction between the bacterial and human DNA. Further examination of these reads reveals that these integrations are rich in guanine nucleotides and the integrated bacterial DNA may have complex transcript secondary structures. The models presented here lay the foundation for future experiments to test if bacterial DNA integrations alter the transcription of the human genes.
Previously unknown and highly divergent ssDNA viruses populate the oceans.
Labonté, Jessica M; Suttle, Curtis A
2013-11-01
Single-stranded DNA (ssDNA) viruses are economically important pathogens of plants and animals, and are widespread in oceans; yet, the diversity and evolutionary relationships among marine ssDNA viruses remain largely unknown. Here we present the results from a metagenomic study of composite samples from temperate (Saanich Inlet, 11 samples; Strait of Georgia, 85 samples) and subtropical (46 samples, Gulf of Mexico) seawater. Most sequences (84%) had no evident similarity to sequenced viruses. In total, 608 putative complete genomes of ssDNA viruses were assembled, almost doubling the number of ssDNA viral genomes in databases. These comprised 129 genetically distinct groups, each represented by at least one complete genome that had no recognizable similarity to each other or to other virus sequences. Given that the seven recognized families of ssDNA viruses have considerable sequence homology within them, this suggests that many of these genetic groups may represent new viral families. Moreover, nearly 70% of the sequences were similar to one of these genomes, indicating that most of the sequences could be assigned to a genetically distinct group. Most sequences fell within 11 well-defined gene groups, each sharing a common gene. Some of these encoded putative replication and coat proteins that had similarity to sequences from viruses infecting eukaryotes, suggesting that these were likely from viruses infecting eukaryotic phytoplankton and zooplankton.
CRITICA: coding region identification tool invoking comparative analysis
NASA Technical Reports Server (NTRS)
Badger, J. H.; Olsen, G. J.; Woese, C. R. (Principal Investigator)
1999-01-01
Gene recognition is essential to understanding existing and future DNA sequence data. CRITICA (Coding Region Identification Tool Invoking Comparative Analysis) is a suite of programs for identifying likely protein-coding sequences in DNA by combining comparative analysis of DNA sequences with more common noncomparative methods. In the comparative component of the analysis, regions of DNA are aligned with related sequences from the DNA databases; if the translation of the aligned sequences has greater amino acid identity than expected for the observed percentage nucleotide identity, this is interpreted as evidence for coding. CRITICA also incorporates noncomparative information derived from the relative frequencies of hexanucleotides in coding frames versus other contexts (i.e., dicodon bias). The dicodon usage information is derived by iterative analysis of the data, such that CRITICA is not dependent on the existence or accuracy of coding sequence annotations in the databases. This independence makes the method particularly well suited for the analysis of novel genomes. CRITICA was tested by analyzing the available Salmonella typhimurium DNA sequences. Its predictions were compared with the DNA sequence annotations and with the predictions of GenMark. CRITICA proved to be more accurate than GenMark, and moreover, many of its predictions that would seem to be errors instead reflect problems in the sequence databases. The source code of CRITICA is freely available by anonymous FTP (rdp.life.uiuc.edu in /pub/critica) and on the World Wide Web (http://rdpwww.life.uiuc.edu).
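The comparative idea in this abstract can be illustrated by computing, for an ungapped pairwise alignment read in one frame, both the nucleotide identity and the identity of the translated codons. This is only a sketch of the two observed quantities: the expectation model CRITICA uses is not given in the abstract and is not reproduced here, and the function name and example sequences are hypothetical. The codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def identities(seq_a, seq_b):
    """Return (nucleotide_identity, amino_acid_identity) for two aligned, ungapped,
    in-frame DNA strings of equal length (a multiple of three)."""
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 or not seq_a:
        raise ValueError("sequences must be non-empty, equal length, multiple of 3")
    nt_same = sum(1 for x, y in zip(seq_a, seq_b) if x == y)
    codons_a = [seq_a[i:i + 3] for i in range(0, len(seq_a), 3)]
    codons_b = [seq_b[i:i + 3] for i in range(0, len(seq_b), 3)]
    aa_same = sum(1 for x, y in zip(codons_a, codons_b)
                  if CODON_TABLE[x] == CODON_TABLE[y])
    return nt_same / len(seq_a), aa_same / len(codons_a)


# Both sequences encode Leu-Ala-Lys but differ at four of nine nucleotides,
# all of them synonymous changes.
nt_id, aa_id = identities("CTGGCAAAA", "TTAGCGAAG")
```

When substitutions concentrate at synonymous positions, amino acid identity stays high while nucleotide identity drops, which is the kind of signal the abstract describes as evidence for coding.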
Filloux, Denis; Murrell, Sasha; Koohapitagtam, Maneerat; Golden, Michael; Julian, Charlotte; Galzi, Serge; Uzest, Marilyne; Rodier-Goud, Marguerite; D’Hont, Angélique; Vernerey, Marie Stephanie; Wilkin, Paul; Peterschmitt, Michel; Winter, Stephan; Murrell, Ben; Martin, Darren P.; Roumagnac, Philippe
2015-01-01
Endogenous viral sequences are essentially ‘fossil records’ that can sometimes reveal the genomic features of long extinct virus species. Although numerous known instances exist of single-stranded DNA (ssDNA) genomes becoming stably integrated within the genomes of bacteria and animals, there remain very few examples of such integration events in plants. The best studied of these events are those which yielded the geminivirus-related DNA elements found within the nuclear genomes of various Nicotiana species. Although other ssDNA virus-like sequences are included within the draft genomes of various plant species, it is not entirely certain that these are not contaminants. The Nicotiana geminivirus-related DNA elements therefore remain the only definitively proven instances of endogenous plant ssDNA virus sequences. Here, we characterize two new classes of endogenous plant virus sequence that are also apparently derived from ancient geminiviruses in the genus Begomovirus. These two endogenous geminivirus-like elements (EGV1 and EGV2) are present in the Dioscorea spp. of the Enantiophyllum clade. We used fluorescence in situ hybridization to confirm that the EGV1 sequences are integrated in the D. alata genome and showed that one or two ancestral EGV sequences likely became integrated more than 1.4 million years ago during or before the diversification of the Asian and African Enantiophyllum Dioscorea spp. Unexpectedly, we found evidence of natural selection actively favouring the maintenance of EGV-expressed replication-associated protein (Rep) amino acid sequences, which clearly indicates that functional EGV Rep proteins were probably expressed for prolonged periods following endogenization. Further, the detection in D. alata of EGV gene transcripts, small 21–24 nt RNAs that are apparently derived from these transcripts, and expressed Rep proteins, provides evidence that some EGV genes are possibly still functionally expressed in at least some of the Enantiophyllum clade species. PMID:27774276
Evidence for recombination of mtDNA in the marine mussel Mytilus trossulus from the Baltic.
Burzyński, Artur; Zbawicka, Małgorzata; Skibinski, David O F; Wenne, Roman
2003-03-01
A number of studies have claimed that recombination occurs in animal mtDNA, although this evidence is controversial. Ladoukakis and Zouros (2001) provided strong evidence for mtDNA recombination in the COIII gene in gonadal tissue in the marine mussel Mytilus galloprovincialis from the Black Sea. The recombinant molecules they reported had not however become established in the population from which experimental animals were sampled. In the present study, we provide further evidence of the generality of mtDNA recombination in Mytilus by reporting recombinant mtDNA molecules in a related mussel species, Mytilus trossulus, from the Baltic. The mtDNA region studied begins in the 16S rRNA gene and terminates in the cytochrome b gene and includes a major noncoding region that may be analogous to the D-loop region observed in other animals. Many bivalve species, including some Mytilus species, are unusual in that they have two mtDNA genomes, one of which is inherited maternally (F genome) the other inherited paternally (M genome). Two recombinant variants reported in the present study have population frequencies of 5% and 36% and appear to be mosaic for F-like and M-like sequences. However, both variants have the noncoding region from the M genome, and both are transmitted to sperm like the M genome. We speculate that acquisition of the noncoding region by the recombinant molecules has conferred a paternal role on mtDNA genomes that otherwise resemble the F genome in sequence.
To Clone or Not To Clone: Method Analysis for Retrieving Consensus Sequences In Ancient DNA Samples
Winters, Misa; Barta, Jodi Lynn; Monroe, Cara; Kemp, Brian M.
2011-01-01
The challenges associated with the retrieval and authentication of ancient DNA (aDNA) evidence are principally due to post-mortem damage which makes ancient samples particularly prone to contamination from “modern” DNA sources. The necessity for authentication of results has led many aDNA researchers to adopt methods considered to be “gold standards” in the field, including cloning aDNA amplicons as opposed to directly sequencing them. However, no standardized protocol has emerged regarding the necessary number of clones to sequence, how a consensus sequence is most appropriately derived, or how results should be reported in the literature. In addition, there has been no systematic demonstration of the degree to which direct sequences are affected by damage or whether direct sequencing would provide disparate results from a consensus of clones. To address this issue, a comparative study was designed to examine both cloned and direct sequences amplified from ∼3,500 year-old ancient northern fur seal DNA extracts. Majority rules and the Consensus Confidence Program were used to generate consensus sequences for each individual from the cloned sequences, which exhibited damage at 31 of 139 base pairs across all clones. In no instance did the consensus of clones differ from the direct sequence. This study demonstrates that, when appropriate, cloning need not be the default method, but instead, should be used as a measure of authentication on a case-by-case basis, especially when this practice adds time and cost to studies where it may be superfluous. PMID:21738625
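A majority-rule consensus of aligned clone sequences, as mentioned in this abstract, can be sketched in a few lines. The function names, the "N" placeholder for columns without a majority, and the 50% threshold are assumptions for illustration; the Consensus Confidence Program named in the abstract is not reproduced here.

```python
from collections import Counter


def majority_consensus(clones, min_fraction=0.5):
    """Majority-rule consensus of equal-length, pre-aligned clone sequences.

    A column gets the most common base only if it occurs in more than
    min_fraction of the clones; otherwise the column is reported as 'N'.
    """
    if not clones or len({len(c) for c in clones}) != 1:
        raise ValueError("clones must be a non-empty list of equal-length strings")
    consensus = []
    for column in zip(*clones):
        base, count = Counter(column).most_common(1)[0]
        consensus.append(base if count / len(clones) > min_fraction else "N")
    return "".join(consensus)


def variable_sites(clones):
    """Positions (0-based) where the clones disagree, e.g. candidate damage sites."""
    return [i for i, column in enumerate(zip(*clones)) if len(set(column)) > 1]


# Three hypothetical clones; the second position differs in one clone.
clones = ["ACGT", "ACGT", "ATGT"]
consensus = majority_consensus(clones)
damaged = variable_sites(clones)
```

Comparing such a consensus with the directly sequenced amplicon, position by position, is the kind of check the study describes.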
Sites of instability in the human TCF3 (E2A) gene adopt G-quadruplex DNA structures in vitro
Williams, Jonathan D.; Fleetwood, Sara; Berroyer, Alexandra; Kim, Nayun; Larson, Erik D.
2015-01-01
The formation of highly stable four-stranded DNA, called G-quadruplex (G4), promotes site-specific genome instability. G4 DNA structures fold from repetitive guanine sequences, and increasing experimental evidence connects G4 sequence motifs with specific gene rearrangements. The human transcription factor 3 (TCF3) gene (also termed E2A) is subject to genetic instability associated with severe disease, most notably a common translocation event t(1;19) associated with acute lymphoblastic leukemia. The sites of instability in TCF3 are not randomly distributed, but focused to certain sequences. We asked if G4 DNA formation could explain why TCF3 is prone to recombination and mutagenesis. Here we demonstrate that sequences surrounding the major t(1;19) break site and a region associated with copy number variations both contain G4 sequence motifs. The motifs identified readily adopt G4 DNA structures that are stable enough to interfere with DNA synthesis in physiological salt conditions in vitro. When introduced into the yeast genome, TCF3 G4 motifs promoted gross chromosomal rearrangements in a transcription-dependent manner. Our results provide a molecular rationale for the site-specific instability of human TCF3, suggesting that G4 DNA structures contribute to oncogenic DNA breaks and recombination. PMID:26029241
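Scanning a sequence for candidate G4 motifs is often done with a simple pattern of four guanine runs separated by short loops. The sketch below uses one widely quoted pattern (runs of at least three G, loops of one to seven bases); the abstract does not state which motif definition the authors applied, so the pattern, loop limits, and function name are assumptions.

```python
import re

# Assumed motif definition: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+
G4_PATTERN = re.compile(r"G{3,}(?:[ACGT]{1,7}G{3,}){3}")


def find_g4_motifs(seq):
    """Return (start, end, motif) for non-overlapping matches on the given strand.

    Coordinates are 0-based, end-exclusive. G4 motifs on the opposite strand
    would appear here as C-rich runs and are not reported by this sketch.
    """
    return [(m.start(), m.end(), m.group()) for m in G4_PATTERN.finditer(seq.upper())]


# Hypothetical test sequence with one motif embedded between flanking T bases.
hits = find_g4_motifs("TTGGGAGGGAGGGAGGGTT")
```

A pattern match only flags sequence potential; whether a motif actually folds into a stable structure has to be tested experimentally, as the abstract describes.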
Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M.; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H.; Proukakis, Christos
2017-01-01
Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array “waves”, and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance. PMID:28683077
Recognition of the DNA sequence by an inorganic crystal surface
Sampaolese, Beatrice; Bergia, Anna; Scipioni, Anita; Zuccheri, Giampaolo; Savino, Maria; Samorì, Bruno; De Santis, Pasquale
2002-01-01
The sequence-dependent curvature is generally recognized as an important and biologically relevant property of DNA because it is involved in the formation and stability of association complexes with proteins. When a DNA tract, intrinsically curved for the periodical recurrence on the same strand of A-tracts phased with the B-DNA periodicity, is deposited on a flat surface, it exposes to that surface either a T- or an A-rich face. The surface of a freshly cleaved mica crystal recognizes those two faces and preferentially interacts with the former one. Statistical analysis of scanning force microscopy (SFM) images provides evidence of this recognition between an inorganic crystal surface and nanoscale structures of double-stranded DNA. This finding could open the way toward the use of the sequence-dependent adhesion to specific crystal faces for nanotechnological purposes. PMID:12361979
FA-SAT Is an Old Satellite DNA Frozen in Several Bilateria Genomes
Chaves, Raquel; Ferreira, Daniela; Mendes-da-Silva, Ana; Meles, Susana; Adega, Filomena
2017-01-01
In recent years, a growing body of evidence has recognized the tandem repeat sequences, and specifically satellite DNA, as a functional class of sequences in the genomic “dark matter.” Using an original, complementary, and thus an eclectic experimental design, we show that the cat archetypal satellite DNA sequence, FA-SAT, is “frozen” conservatively in several Bilateria genomes. We found different genomic FA-SAT architectures, and the interspersion pattern was conserved. In Carnivora genomes, the FA-SAT-related sequences are also amplified, with the predominance of a specific FA-SAT variant, at the heterochromatic regions. We inspected the cat genome project to locate FA-SAT array flanking regions and revealed an intensive intermingling with transposable elements. Our results also show that FA-SAT-related sequences are transcribed and that the most abundant FA-SAT variant is not always the most transcribed. We thus conclude that the DNA sequences of FA-SAT and their transcripts are “frozen” in these genomes. Future work is needed to disclose any putative function that these sequences may play in these genomes. PMID:29608678
Impact of Lateral Transfers on the Genomes of Lepidoptera
Drezen, Jean-Michel; Josse, Thibaut; Bézier, Annie; Gauthier, Jérémy; Huguet, Elisabeth
2017-01-01
Transfer of DNA sequences between species regardless of their evolutionary distance is very common in bacteria, but evidence that horizontal gene transfer (HGT) also occurs in multicellular organisms has been accumulating in the past few years. The actual extent of this phenomenon is underestimated due to frequent sequence filtering of “alien” DNA before genome assembly. However, recent studies based on genome sequencing have revealed, and experimentally verified, the presence of foreign DNA sequences in the genetic material of several species of Lepidoptera. Large DNA viruses, such as baculoviruses and the symbiotic viruses of parasitic wasps (bracoviruses), have the potential to mediate these transfers in Lepidoptera. In particular, using ultra-deep sequencing, newly integrated transposons have been identified within baculovirus genomes. Bacterial genes have also been acquired by genomes of Lepidoptera, as in other insects and nematodes. In addition, insertions of bracovirus sequences were found in the genomes of certain moth and butterfly lineages; these likely correspond to rearrangements of ancient integrations. The viral genes present in these sequences, sometimes of hymenopteran origin, have been co-opted by lepidopteran species to confer some protection against pathogens. PMID:29120392
Young, J M; Austin, J J; Weyrich, L S
2017-02-01
Analysis of physical evidence is typically a deciding factor in forensic casework by establishing what transpired at a scene or who was involved. Forensic geoscience is an emerging multi-disciplinary science that can offer significant benefits to forensic investigations. Soil is a powerful, nearly 'ideal' contact trace evidence, as it is highly individualistic, easy to characterise, has a high transfer and retention probability, and is often overlooked in attempts to conceal evidence. However, many real-life cases encounter close proximity soil samples or soils with low inorganic content, which cannot be easily discriminated based on current physical and chemical analysis techniques. The capability to improve forensic soil discrimination, and identify key indicator taxa from soil using the organic fraction is currently lacking. The development of new DNA sequencing technologies offers the ability to generate detailed genetic profiles from soils and enhance current forensic soil analyses. Here, we discuss the use of DNA metabarcoding combined with high-throughput sequencing (HTS) technology to distinguish between soils from different locations in a forensic context. Specifically, we provide recommendations for best practice, outline the potential limitations encountered in a forensic context and describe the future directions required to integrate soil DNA analysis into casework.
Beaudet, Denis; Nadimi, Maryam; Iffis, Bachir; Hijri, Mohamed
2013-01-01
Arbuscular mycorrhizal fungi (AMF) are common and important plant symbionts. They have coenocytic hyphae and form multinucleated spores. The nuclear genome of AMF is polymorphic and its organization is not well understood, which makes the development of reliable molecular markers challenging. In stark contrast, their mitochondrial genome (mtDNA) is homogeneous. To assess the intra- and inter-specific mitochondrial variability in closely related Glomus species, we performed 454 sequencing on total genomic DNA of Glomus sp. isolate DAOM-229456 and we compared its mtDNA with two G. irregulare isolates. We found that the mtDNA of Glomus sp. is homogeneous, identical in gene order and, with respect to the sequences of coding regions, almost identical to G. irregulare. However, certain genomic regions vary substantially, due to insertions/deletions of elements such as introns, mitochondrial plasmid-like DNA polymerase genes and mobile open reading frames. We found no evidence of mitochondrial or cytoplasmic plasmids in Glomus species, and mobile ORFs in Glomus are responsible for the formation of four gene hybrids in atp6, atp9, cox2, and nad3, which are most probably the result of horizontal gene transfer and are expressed at the mRNA level. We found evidence for substantial sequence variation in defined regions of mtDNA, even among closely related isolates with otherwise identical coding gene sequences. This variation makes it possible to design reliable intra- and inter-specific markers. PMID:23637766
Roux-Rouquie, M; Marilley, M
2000-09-15
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed.
Chromosome Evolution in Connection with Repetitive Sequences and Epigenetics in Plants.
Li, Shu-Fen; Su, Ting; Cheng, Guang-Qian; Wang, Bing-Xiao; Li, Xu; Deng, Chuan-Liang; Gao, Wu-Jun
2017-10-24
Chromosome evolution is a fundamental aspect of evolutionary biology. The evolution of chromosome size, structure and shape, number, and the change in DNA composition suggest the high plasticity of nuclear genomes at the chromosomal level. Repetitive DNA sequences, which represent a conspicuous fraction of every eukaryotic genome, particularly in plants, are found to be tightly linked with plant chromosome evolution. Different classes of repetitive sequences have distinct distribution patterns on the chromosomes. Mounting evidence shows that repetitive sequences may play multiple generative roles in shaping the chromosome karyotypes in plants. Furthermore, recent developments in our understanding of repetitive sequences and plant chromosome evolution have elucidated the involvement of a spectrum of epigenetic modifications. In this review, we focused on the recent evidence relating to the distribution pattern of repetitive sequences in plant chromosomes and highlighted their potential relevance to chromosome evolution in plants. We also discussed the possible connections between evolution and epigenetic alterations in chromosome structure and repatterning, such as heterochromatin formation, centromere function, and epigenetic-associated transposable element inactivation.
Marine Fungi: Their Ecology and Molecular Diversity
NASA Astrophysics Data System (ADS)
Richards, Thomas A.; Jones, Meredith D. M.; Leonard, Guy; Bass, David
2012-01-01
Fungi appear to be rare in marine environments. There are relatively few marine isolates in culture, and fungal small subunit ribosomal DNA (SSU rDNA) sequences are rarely recovered in marine clone library experiments (i.e., culture-independent sequence surveys of eukaryotic microbial diversity from environmental DNA samples). To explore the diversity of marine fungi, we took a broad selection of SSU rDNA data sets and calculated a summary phylogeny. Bringing these data together identified a diverse collection of marine fungi, including sequences branching close to chytrids (flagellated fungi), filamentous hypha-forming fungi, and multicellular fungi. However, the majority of the sequences branched with ascomycete and basidiomycete yeasts. We discuss evidence for 36 novel marine lineages, the majority and most divergent of which branch with the chytrids. We then investigate what these data mean for the evolutionary history of the Fungi and specifically marine-terrestrial transitions. Finally, we discuss the roles of fungi in marine ecosystems.
Papasotiropoulos, Vasilis; Klossa-Kilia, Elena; Alahiotis, Stamatis N; Kilias, George
2007-08-01
Mitochondrial DNA sequence analysis has been used to explore genetic differentiation and phylogenetic relationships among five species of the Mugilidae family, Mugil cephalus, Chelon labrosus, Liza aurata, Liza ramada, and Liza saliens. DNA was isolated from samples originating from the Messolongi Lagoon in Greece. Three mtDNA segments (12s rRNA, 16s rRNA, and CO I) were PCR amplified and sequenced. Sequencing analysis revealed that the greatest genetic differentiation was observed between M. cephalus and all the other species studied, while C. labrosus and L. aurata were the closest taxa. Dendrograms obtained by the neighbor-joining method and Bayesian inference analysis exhibited the same topology. According to this topology, M. cephalus is the most distinct species and the remaining taxa are clustered together, with C. labrosus and L. aurata forming a single group. The latter result brings into question the monophyletic origin of the genus Liza.
Mitochondrial sequence analysis for forensic identification using pyrosequencing technology.
Andréasson, H; Asp, A; Alderborn, A; Gyllensten, U; Allen, M
2002-01-01
Over recent years, requests for mtDNA analysis in the field of forensic medicine have notably increased, and the results of such analyses have proved to be very useful in forensic cases where nuclear DNA analysis cannot be performed. Traditionally, mtDNA has been analyzed by DNA sequencing of the two hypervariable regions, HVI and HVII, in the D-loop. DNA sequence analysis using conventional Sanger sequencing is very robust but time consuming and labor intensive. By contrast, mtDNA analysis based on the pyrosequencing technology provides fast and accurate results from the human mtDNA present in many types of evidence materials in forensic casework. The assay has been developed to determine polymorphic sites in the mitochondrial D-loop as well as the coding region to further increase the discrimination power of mtDNA analysis. The pyrosequencing technology for analysis of mtDNA polymorphisms has been tested with regard to sensitivity, reproducibility, and success rate when applied to control samples and actual casework materials. The results show that the method is very accurate and sensitive; the results are easily interpreted and provide a high success rate on casework samples. The panel of pyrosequencing reactions for the mtDNA polymorphisms was chosen to result in an optimal discrimination power in relation to the number of bases determined.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie
2009-11-20
RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR) shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.
Zhao, Junhua; Wang, Guliang; Del Mundo, Imee M; McKinney, Jennifer A; Lu, Xiuli; Bacolla, Albino; Boulware, Stephen B; Zhang, Changsheng; Zhang, Haihua; Ren, Pengyu; Freudenreich, Catherine H; Vasquez, Karen M
2018-01-30
Sequences with the capacity to adopt alternative DNA structures have been implicated in cancer etiology; however, the mechanisms are unclear. For example, H-DNA-forming sequences within oncogenes have been shown to stimulate genetic instability in mammals. Here, we report that H-DNA-forming sequences are enriched at translocation breakpoints in human cancer genomes, further implicating them in cancer etiology. H-DNA-induced mutations were suppressed in human cells deficient in the nucleotide excision repair nucleases, ERCC1-XPF and XPG, but were stimulated in cells deficient in FEN1, a replication-related endonuclease. Further, we found that these nucleases cleaved H-DNA conformations, and the interactions of modeled H-DNA with ERCC1-XPF, XPG, and FEN1 proteins were explored at the sub-molecular level. The results suggest mechanisms of genetic instability triggered by H-DNA through distinct structure-specific, cleavage-based replication-independent and replication-dependent pathways, providing critical evidence for a role of the DNA structure itself in the etiology of cancer and other human diseases. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Williams, Warren M; Ellison, Nicholas W; Ansari, Helal A; Verry, Isabelle M; Hussain, S Wajid
2012-04-24
White clover (Trifolium repens) is a ubiquitous weed of the temperate world that through use of improved cultivars has also become the most important legume of grazed pastures world-wide. It has long been suspected to be allotetraploid, but the diploid ancestral species have remained elusive. Putative diploid ancestors were indicated by DNA sequence phylogeny to be T. pallescens and T. occidentale. Here, we use further DNA evidence as well as a combination of molecular cytogenetics (FISH and GISH) and experimental hybridization to test the hypothesis that white clover originated as a hybrid between T. pallescens and T. occidentale. T. pallescens plants were identified with chloroplast trnL intron DNA sequences identical to those of white clover. Similarly, T. occidentale plants with nuclear ITS sequences identical to white clover were also identified. Reciprocal GISH experiments, alternately using labeled genomic DNA probes from each of the putative ancestral species on the same white clover cells, showed that half of the chromosomes hybridized with each probe. F1 hybrids were generated by embryo rescue and these showed strong interspecific chromosome pairing and produced a significant frequency of unreduced gametes, indicating the likely mode of polyploidization. The F1 hybrids are inter-fertile with white clover and function as synthetic white clovers, a valuable new resource for the re-incorporation of ancestral genomes into modern white clover for future plant breeding. Evidence from DNA sequence analyses, molecular cytogenetics, interspecific hybridization and breeding experiments supports the hypothesis that a diploid alpine species (T. pallescens) hybridized with a diploid coastal species (T. occidentale) to generate tetraploid T. repens. The coming together of these two narrowly adapted species (one alpine and the other maritime), along with allotetraploidy, has led to a transgressive hybrid with a broad adaptive range.
Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics.
Straub, Shannon C K; Parks, Matthew; Weitemier, Kevin; Fishbein, Mark; Cronn, Richard C; Liston, Aaron
2012-02-01
Just as Sanger sequencing did more than 20 years ago, next-generation sequencing (NGS) is poised to revolutionize plant systematics. By combining multiplexing approaches with NGS throughput, systematists may no longer need to choose between more taxa or more characters. Here we describe a genome skimming (shallow sequencing) approach for plant systematics. Through simulations, we evaluated optimal sequencing depth and performance of single-end and paired-end short read sequences for assembly of nuclear ribosomal DNA (rDNA) and plastomes and addressed the effect of divergence on reference-guided plastome assembly. We also used simulations to identify potential phylogenetic markers from low-copy nuclear loci at different sequencing depths. We demonstrated the utility of genome skimming through phylogenetic analysis of the Sonoran Desert clade (SDC) of Asclepias (Apocynaceae). Paired-end reads performed better than single-end reads. Minimum sequencing depths for high quality rDNA and plastome assemblies were 40× and 30×, respectively. Divergence from the reference significantly affected plastome assembly, but relatively similar references are available for most seed plants. Deeper rDNA sequencing is necessary to characterize intragenomic polymorphism. The low-copy fraction of the nuclear genome was readily surveyed, even at low sequencing depths. Nearly 160000 bp of sequence from three organelles provided evidence of phylogenetic incongruence in the SDC. Adoption of NGS will facilitate progress in plant systematics, as whole plastome and rDNA cistrons, partial mitochondrial genomes, and low-copy nuclear markers can now be efficiently obtained for molecular phylogenetics studies.
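The depth thresholds quoted in this abstract can be related to read counts with the standard expectation that mean depth equals (number of reads × read length) / target length. The following is only a minimal sketch of that arithmetic in Python. The function names, the 100 bp read length, the 160 kb plastome size, and the `target_fraction` parameter (the share of reads that actually derive from the target in a skimmed library) are illustrative assumptions, not values or methods taken from the study.

```python
import math


def expected_depth(n_reads: int, read_length: int, target_length: int,
                   target_fraction: float = 1.0) -> float:
    """Mean depth over a target, given how many reads map to it.

    target_fraction is the (assumed) share of all reads that come from
    the target, e.g. the plastome share of a total-DNA library.
    """
    return n_reads * target_fraction * read_length / target_length


def reads_for_depth(depth: float, read_length: int, target_length: int,
                    target_fraction: float = 1.0) -> int:
    """Smallest number of reads expected to reach the requested mean depth."""
    return math.ceil(depth * target_length / (read_length * target_fraction))


if __name__ == "__main__":
    # Hypothetical inputs: 100 bp reads, 160 kb plastome, 30x target depth.
    print(reads_for_depth(30, 100, 160_000))
    # If only a quarter of the library were plastid-derived (an assumption):
    print(reads_for_depth(30, 100, 160_000, target_fraction=0.25))
```

Under these assumed inputs, 48,000 plastid-derived reads of 100 bp would give a mean depth of 30× over a 160 kb plastome; the real requirement depends on the organellar fraction of each library and on how evenly reads are distributed.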
Schouten, Henk J; Vande Geest, Henri; Papadimitriou, Sofia; Bemer, Marian; Schaart, Jan G; Smulders, Marinus J M; Perez, Gabino Sanchez; Schijlen, Elio
2017-03-01
Transformation resulted in deletions and translocations at T-DNA inserts, but not in genome-wide small mutations. A tiny T-DNA splinter was detected that probably would remain undetected by conventional techniques. We investigated to what extent Agrobacterium tumefaciens-mediated transformation is mutagenic, on top of inserting T-DNA. To prevent mutations due to in vitro propagation, we applied floral dip transformation of Arabidopsis thaliana. We re-sequenced the genomes of five primary transformants, and compared these to genomic sequences derived from a pool of four wild-type plants. By genome-wide comparisons, we identified ten small mutations in the genomes of the five transgenic plants, not correlated to the positions or number of T-DNA inserts. This mutation frequency is within the range of spontaneous mutations occurring during seed propagation in A. thaliana, as determined earlier. In addition, we detected small as well as large deletions specifically at the T-DNA insert sites. Furthermore, we detected partial T-DNA inserts, one of these a tiny 50-bp fragment originating from a central part of the T-DNA construct used, inserted into the plant genome without other flanking T-DNA. Because of its small size, we named this fragment a T-DNA splinter. As far as we know this is the first report of such a small T-DNA fragment insert in the absence of any T-DNA border sequence. Finally, we found evidence for translocations from other chromosomes, flanking T-DNA inserts. In this study, we showed that next-generation sequencing (NGS) is a highly sensitive approach to detect T-DNA inserts in transgenic plants.
Mechanism of chimera formation during the Multiple Displacement Amplification reaction.
Lasken, Roger S; Stockwell, Timothy B
2007-04-12
Multiple Displacement Amplification (MDA) is a method used for amplifying limiting DNA sources. The high molecular weight amplified DNA is ideal for DNA library construction. While this has enabled genomic sequencing from one or a few cells of unculturable microorganisms, the process is complicated by the tendency of MDA to generate chimeric DNA rearrangements in the amplified DNA. Determining the source of the DNA rearrangements would be an important step towards reducing or eliminating them. Here, we characterize the major types of chimeras formed by carrying out an MDA whole genome amplification from a single E. coli cell and sequencing by the 454 Life Sciences method. Analysis of 475 chimeras revealed the predominant reaction mechanisms that create the DNA rearrangements. The highly branched DNA synthesized in MDA can assume many alternative secondary structures. DNA strands extended on an initial template can be displaced, becoming available to prime on a second template and creating the chimeras. Evidence supports a model in which branch migration can displace 3'-ends, freeing them to prime on the new templates. More than 85% of the resulting DNA rearrangements were inverted sequences with intervening deletions that the model predicts. Intramolecular rearrangements were favored, with displaced 3'-ends reannealing to single stranded 5'-strands contained within the same branched DNA molecule. In over 70% of the chimeric junctions, the 3' termini had initiated priming at complementary sequences of 2-21 nucleotides (nts) in the new templates. Formation of chimeras is an important limitation to the MDA method, particularly for whole genome sequencing. Identification of the mechanism for chimera formation provides new insight into the MDA reaction and suggests methods to reduce chimeras. The 454 sequencing approach used here will provide a rapid method to assess the utility of reaction modifications.
Mechanism of chimera formation during the Multiple Displacement Amplification reaction
Lasken, Roger S; Stockwell, Timothy B
2007-01-01
Background: Multiple Displacement Amplification (MDA) is a method used for amplifying limiting DNA sources. The high molecular weight amplified DNA is ideal for DNA library construction. While this has enabled genomic sequencing from one or a few cells of unculturable microorganisms, the process is complicated by the tendency of MDA to generate chimeric DNA rearrangements in the amplified DNA. Determining the source of the DNA rearrangements would be an important step towards reducing or eliminating them. Results: Here, we characterize the major types of chimeras formed by carrying out an MDA whole genome amplification from a single E. coli cell and sequencing by the 454 Life Sciences method. Analysis of 475 chimeras revealed the predominant reaction mechanisms that create the DNA rearrangements. The highly branched DNA synthesized in MDA can assume many alternative secondary structures. DNA strands extended on an initial template can be displaced, becoming available to prime on a second template and creating the chimeras. Evidence supports a model in which branch migration can displace 3'-ends, freeing them to prime on the new templates. More than 85% of the resulting DNA rearrangements were inverted sequences with intervening deletions that the model predicts. Intramolecular rearrangements were favored, with displaced 3'-ends reannealing to single stranded 5'-strands contained within the same branched DNA molecule. In over 70% of the chimeric junctions, the 3' termini had initiated priming at complementary sequences of 2–21 nucleotides (nts) in the new templates. Conclusion: Formation of chimeras is an important limitation to the MDA method, particularly for whole genome sequencing. Identification of the mechanism for chimera formation provides new insight into the MDA reaction and suggests methods to reduce chimeras. The 454 sequencing approach used here will provide a rapid method to assess the utility of reaction modifications. PMID:17430586
Lee, Shih-Chieh; Wang, Chia-Hsiang; Yen, Cheng-En; Chang, Chieh
2017-04-01
The major aim of made tea identification is to identify the variety and provenance of the tea plant. The present experiment used 113 tea plants [Camellia sinensis (L.) O. Kuntze] housed at the Tea Research and Extension Substation, from which 113 internal transcribed spacer 2 (ITS2) fragments, 104 trnL intron, and 98 trnL-trnF intergenic sequence region DNA sequences were successfully sequenced. The similarity of the ITS2 nucleotide sequences between tea plants housed at the Tea Research and Extension Substation was 0.379-0.994. In this polymerase chain reaction-amplified noncoding region, no varieties possessed identical sequences. Compared with the trnL intron and trnL-trnF intergenic sequence fragments of chloroplast cpDNA, the proportion of ITS2 nucleotide sequence variation was large and is more suitable for establishing a DNA barcode database to identify tea plant varieties. After establishing the database, 30 imported teas and 35 domestic made teas were used in this model system to explore the feasibility of using ITS2 sequences to identify the varieties and provenances of made teas. A phylogenetic tree was constructed using ITS2 sequences with the unweighted pair group method with arithmetic mean, which indicated that the same variety of tea plant is likely to be successfully categorized into one cluster, but contamination from other tea plants was also detected. This result provides molecular evidence that the similarity between important tea varieties in Taiwan remains high. We suggest a direct, wide collection of made tea and original samples of tea plants to establish an ITS2 sequence molecular barcode identification database to identify the varieties and provenances of tea plants. The DNA barcode comparison method can satisfy the need for a rapid, low-cost, frontline differentiation of the large amount of made teas from Taiwan and abroad, and can provide molecular evidence of their varieties and provenances. Copyright © 2016. Published by Elsevier B.V.
Selective DNA demethylation by fusion of TDG with a sequence-specific DNA-binding domain
Gregory, David J.; Mikhaylova, Lyudmila; Fedulov, Alexey V.
2012-01-01
Our ability to selectively manipulate gene expression by epigenetic means is limited, as there is no approach for targeted reactivation of epigenetically silenced genes, in contrast to what is available for selective gene silencing. We aimed to develop a tool for selective transcriptional activation by DNA demethylation. Here we present evidence that direct targeting of thymine-DNA-glycosylase (TDG) to specific sequences in the DNA can result in local DNA demethylation at potential regulatory sequences and lead to enhanced gene induction. When TDG was fused to a well-characterized DNA-binding domain [the Rel-homology domain (RHD) of NFκB], we observed decreased DNA methylation and increased transcriptional response to unrelated stimulus of inducible nitric oxide synthase (NOS2). The effect was not seen for control genes lacking either RHD-binding sites or high levels of methylation, nor in control mock-transduced cells. Specific reactivation of epigenetically silenced genes may thus be achievable by this approach, which provides a broadly useful strategy to further our exploration of biological mechanisms and to improve control over the epigenome. PMID:22419066
Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki
2014-01-01
Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. It was observed that direct sequencing data of the hypervariable segment (HVS) 1 by using primer sets that generate an amplicon of 407 bp (long-primer sets) was different from results obtained by using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, it was unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
DNA sequence+shape kernel enables alignment-free modeling of transcription factor binding.
Ma, Wenxiu; Yang, Lin; Rohs, Remo; Noble, William Stafford
2017-10-01
Transcription factors (TFs) bind to specific DNA sequence motifs. Several lines of evidence suggest that TF-DNA binding is mediated in part by properties of the local DNA shape: the width of the minor groove, the relative orientations of adjacent base pairs, etc. Several methods have been developed to jointly account for DNA sequence and shape properties in predicting TF binding affinity. However, a limitation of these methods is that they typically require a training set of aligned TF binding sites. We describe a sequence + shape kernel that leverages DNA sequence and shape information to better understand protein-DNA binding preference and affinity. This kernel extends an existing class of k-mer based sequence kernels, based on the recently described di-mismatch kernel. Using three in vitro benchmark datasets, derived from universal protein binding microarrays (uPBMs), genomic context PBMs (gcPBMs) and SELEX-seq data, we demonstrate that incorporating DNA shape information improves our ability to predict protein-DNA binding affinity. In particular, we observe that (i) the k-spectrum + shape model performs better than the classical k-spectrum kernel, particularly for small k values; (ii) the di-mismatch kernel performs better than the k-mer kernel, for larger k; and (iii) the di-mismatch + shape kernel performs better than the di-mismatch kernel for intermediate k values. The software is available at https://bitbucket.org/wenxiu/sequence-shape.git. Contact: rohs@usc.edu or william-noble@uw.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
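For readers unfamiliar with the baseline this abstract compares against, the classical k-spectrum kernel scores two sequences by the inner product of their k-mer count vectors, so no alignment of binding sites is needed. The block below is a minimal Python sketch of that baseline only. It does not implement the di-mismatch kernel or the shape-augmented kernels described in the paper, and the function names are hypothetical rather than taken from the authors' software.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping k-mer in seq (case-insensitive)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a: str, b: str, k: int) -> int:
    """Classical k-spectrum kernel: dot product of k-mer count vectors."""
    counts_a, counts_b = kmer_counts(a, k), kmer_counts(b, k)
    # A Counter returns 0 for k-mers it has not seen, so unshared
    # k-mers contribute nothing to the sum.
    return sum(n * counts_b[kmer] for kmer, n in counts_a.items())


def normalized_spectrum_kernel(a: str, b: str, k: int) -> float:
    """Cosine-normalized kernel value in [0, 1]; 0.0 if either is too short."""
    denom = sqrt(spectrum_kernel(a, a, k) * spectrum_kernel(b, b, k))
    return spectrum_kernel(a, b, k) / denom if denom else 0.0


if __name__ == "__main__":
    print(spectrum_kernel("ACGT", "ACGT", 2))
    print(normalized_spectrum_kernel("ACGTACGT", "TTTTACGT", 3))
```

A kernel matrix built this way can be passed to any kernel method (for example a support vector regressor with a precomputed kernel); the choice of k trades specificity against sparsity of the count vectors.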
Ancient DNA Reveals Late Pleistocene Existence of Ostriches in Indian Sub-Continent.
Jain, Sonal; Rai, Niraj; Kumar, Giriraj; Pruthi, Parul Aggarwal; Thangaraj, Kumarasamy; Bajpai, Sunil; Pruthi, Vikas
2017-01-01
Ancient DNA (aDNA) analysis of extinct ratite species is of considerable interest as it provides important insights into their origin, evolution, paleogeographical distribution and vicariant speciation in congruence with continental drift theory. In this study, DNA hotspots were detected in fossilized eggshell fragments of ratites (dated ≥25000 years B.P. by radiocarbon dating) using confocal laser scanning microscopy (CLSM). DNA was isolated from five eggshell fragments and a 43 base pair (bp) sequence of a 16S rRNA mitochondrial-conserved region was successfully amplified and sequenced from one of the samples. Phylogenetic analysis of the DNA sequence revealed a 92% identity of the fossil eggshells to Struthio camelus and their position basal to other palaeognaths, consistent with the vicariant speciation model. Our study provides the first molecular evidence for the presence of ostriches in India, complementing the continental drift theory of biogeographical movement of ostriches in India, and opening up a new window into the evolutionary history of ratites.
Gocayne, J; Robinson, D A; FitzGerald, M G; Chung, F Z; Kerlavage, A R; Lentes, K U; Lai, J; Wang, C D; Fraser, C M; Venter, J C
1987-12-01
Two cDNA clones, lambda RHM-MF and lambda RHB-DAR, encoding the muscarinic cholinergic receptor and the beta-adrenergic receptor, respectively, have been isolated from a rat heart cDNA library. The cDNA clones were characterized by restriction mapping and automated DNA sequence analysis utilizing fluorescent dye primers. The rat heart muscarinic receptor consists of 466 amino acids and has a calculated molecular weight of 51,543. The rat heart beta-adrenergic receptor consists of 418 amino acids and has a calculated molecular weight of 46,890. The two cardiac receptors have substantial amino acid homology (27.2% identity, 50.6% with favored substitutions). The rat cardiac beta receptor has 88.0% homology (92.5% with favored substitutions) with the human brain beta receptor and the rat cardiac muscarinic receptor has 94.6% homology (97.6% with favored substitutions) with the porcine cardiac muscarinic receptor. The muscarinic cholinergic and beta-adrenergic receptors appear to be as conserved as hemoglobin and cytochrome c but less conserved than histones and are clearly members of a multigene family. These data support our hypothesis, based upon biochemical and immunological evidence, that suggests considerable structural homology and evolutionary conservation between adrenergic and muscarinic cholinergic receptors. To our knowledge, this is the first report utilizing automated DNA sequence analysis to determine the structure of a gene.
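Percent-identity figures such as those quoted in this abstract are computed from an alignment by counting matching columns. The sketch below shows one common convention in Python. It assumes the two sequences are already aligned to equal length, uses `-` as the gap symbol, and excludes gapped columns from the denominator. The abstract does not state which convention the authors used, so the function name and these choices are assumptions for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of ungapped alignment columns with identical residues.

    Both inputs must be pre-aligned strings of equal length. Columns where
    either sequence has a gap are skipped (one convention among several).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # gapped column: not counted in the denominator
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy peptide fragments, not receptor sequences from the study.
    print(percent_identity("MNTS-APV", "MNNSTAPL"))
```

Counting gapped columns in the denominator instead would lower the reported value, which is one reason identity percentages from different studies are not always directly comparable.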
Barbosa, Patrícia; de Oliveira, Luiz Antonio; Pucci, Marcela Baer; Santos, Mateus Henrique; Moreira-Filho, Orlando; Vicari, Marcelo Ricardo; Nogaroto, Viviane; de Almeida, Mara Cristina; Artoni, Roberto Ferreira
2015-02-01
Most of the eukaryotic genome is composed of repeated sequences or multiple copies of DNA, which were once considered "junk DNA" and may be associated with heterochromatin. In this study, three populations of Astyanax aff. scabripinnis from the Brazilian rivers of Guaratinguetá and Pindamonhangaba (São Paulo) and a population from Maringá (Paraná) were analyzed concerning the localization of the nucleolar organizer regions (Ag-NORs), the As51 satellite DNA, the 18S ribosomal DNA (rDNA), and the 5S rDNA. Repeated sequences were also isolated and identified by the Cot-1 method, which indicated similarity (90%) with the LINE UnaL2 retrotransposon. Fluorescence in situ hybridization (FISH) showed the retrotransposon dispersed, with more concentrated markers in centromeric and telomeric chromosomal regions. These sequences were co-localized and interspaced with 18S and 5S rDNA and As51, as confirmed by fiber-FISH assay. The B chromosome found in these populations showed conspicuous hybridization with the LINE probe, which is also co-located with As51 sequences. The NORs were active at unique sites of a homologous pair in the three populations. There was no evidence in our analyses that transposable elements and repetitive DNA influenced the transcriptional regulation of ribosomal genes.
Roux-Rouquie, Magali; Marilley, Monique
2000-01-01
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed. PMID:10982860
Phylogenetic analysis of mtDNA lineages in South American mummies.
Monsalve, M V; Cardenas, F; Guhl, F; Delaney, A D; Devine, D V
1996-07-01
Some studies of mtDNA propose that contemporary Amerindians have descended from four haplotype groups, each defined by specific sets of polymorphisms. One recent study also found evidence of other potential founder haplotypes. We wanted to determine whether the four haplotypes in modern populations were also present in ancient South American aboriginals. We subjected mtDNA from Colombian mummies (470 to 1849 AD) to PCR amplification and restriction endonuclease analysis. The mtDNA D-loop region was surveyed for sequence variation by restriction analysis and a segment of this region was sequenced for each mummy to characterize the haplotypes. Our mummies exhibited three of the four major characteristic haplotypes of Amerindian populations defined by four markers. With sequence data obtained in the ancient samples and published data on contemporary Amerindians it was possible to infer the origin of these six mummies.
Yoo, Jejoong; Kim, Hajin; Aksimentiev, Aleksei; Ha, Taekjip
2016-03-22
Although proteins mediate highly ordered DNA organization in vivo, theoretical studies suggest that homologous DNA duplexes can preferentially associate with one another even in the absence of proteins. Here we combine molecular dynamics simulations with single-molecule fluorescence resonance energy transfer experiments to examine the interactions between duplex DNA in the presence of spermine, a biological polycation. We find that AT-rich DNA duplexes associate more strongly than GC-rich duplexes, regardless of the sequence homology. Methyl groups of thymine act as a steric block, relocating spermine from major grooves to interhelical regions, thereby increasing DNA-DNA attraction. Indeed, methylation of cytosines makes attraction between GC-rich DNA as strong as that between AT-rich DNA. Recent genome-wide chromosome organization studies showed that remote contact frequencies are higher for AT-rich and methylated DNA, suggesting that direct DNA-DNA interactions that we report here may play a role in the chromosome organization and gene regulation.
NASA Astrophysics Data System (ADS)
Yoo, Jejoong; Kim, Hajin; Aksimentiev, Aleksei; Ha, Taekjip
2016-03-01
Although proteins mediate highly ordered DNA organization in vivo, theoretical studies suggest that homologous DNA duplexes can preferentially associate with one another even in the absence of proteins. Here we combine molecular dynamics simulations with single-molecule fluorescence resonance energy transfer experiments to examine the interactions between duplex DNA in the presence of spermine, a biological polycation. We find that AT-rich DNA duplexes associate more strongly than GC-rich duplexes, regardless of the sequence homology. Methyl groups of thymine act as a steric block, relocating spermine from major grooves to interhelical regions, thereby increasing DNA-DNA attraction. Indeed, methylation of cytosines makes attraction between GC-rich DNA as strong as that between AT-rich DNA. Recent genome-wide chromosome organization studies showed that remote contact frequencies are higher for AT-rich and methylated DNA, suggesting that direct DNA-DNA interactions that we report here may play a role in the chromosome organization and gene regulation.
Msh2-Msh3 Interferes with Okazaki Fragment Processing to Promote Trinucleotide Repeat Expansions
Kantartzis, Athena; Williams, Gregory M.; Balakrishnan, Lata; Roberts, Rick L.; Surtees, Jennifer A.; Bambara, Robert A.
2012-01-01
Trinucleotide repeat (TNR) expansions are the underlying cause of more than forty neurodegenerative and neuromuscular diseases, including myotonic dystrophy and Huntington’s disease. Although genetic evidence has attributed the cause of these diseases to errors in DNA replication and/or repair, clear molecular mechanisms have not been described. We have focused on the role of the mismatch repair complex Msh2-Msh3 in promoting TNR expansions. We demonstrate that Msh2-Msh3 promotes CTG and CAG repeat expansions in vivo in Saccharomyces cerevisiae. We further provide biochemical evidence that Msh2-Msh3 directly interferes with normal Okazaki fragment processing by flap endonuclease1 (Rad27) and DNA Ligase I (Cdc9) in the presence of TNR sequences, thereby producing small, incremental expansion events. We believe that this is the first mechanistic evidence showing the interplay of replication and repair proteins in the expansion of sequences during lagging strand DNA replication. PMID:22938864
Msh2-Msh3 interferes with Okazaki fragment processing to promote trinucleotide repeat expansions.
Kantartzis, Athena; Williams, Gregory M; Balakrishnan, Lata; Roberts, Rick L; Surtees, Jennifer A; Bambara, Robert A
2012-08-30
Trinucleotide repeat (TNR) expansions are the underlying cause of more than 40 neurodegenerative and neuromuscular diseases, including myotonic dystrophy and Huntington's disease. Although genetic evidence points to errors in DNA replication and/or repair as the cause of these diseases, clear molecular mechanisms have not been described. Here, we focused on the role of the mismatch repair complex Msh2-Msh3 in promoting TNR expansions. We demonstrate that Msh2-Msh3 promotes CTG and CAG repeat expansions in vivo in Saccharomyces cerevisiae. Furthermore, we provide biochemical evidence that Msh2-Msh3 directly interferes with normal Okazaki fragment processing by flap endonuclease1 (Rad27) and DNA ligase I (Cdc9) in the presence of TNR sequences, thereby producing small, incremental expansion events. We believe that this is the first mechanistic evidence showing the interplay of replication and repair proteins in the expansion of sequences during lagging-strand DNA replication. Copyright © 2012 The Authors. Published by Elsevier Inc. All rights reserved.
Scarlatti, G; Leitner, T; Halapi, E; Wahlberg, J; Marchisio, P; Clerici-Schoeller, M A; Wigzell, H; Fenyö, E M; Albert, J; Uhlén, M
1993-01-01
We have compared the variable region 3 sequences from 10 human immunodeficiency virus type 1 (HIV-1)-infected infants to virus sequences from the corresponding mothers. The sequences were derived from DNA of uncultured peripheral blood mononuclear cells (PBMC), DNA of cultured PBMC, and RNA from serum collected at or shortly after delivery. The infected infants, in contrast to the mothers, harbored homogeneous virus populations. Comparison of sequences from the children and clones derived from DNA of the corresponding mothers showed that the transmitted virus represented either a minor or a major virus population of the mother. In contrast to an earlier study, we found no evidence of selection of minor virus variants during transmission. Furthermore, the transmitted virus variant did not show any characteristic molecular features. In some cases the transmitted virus was more related to the virus RNA population of the mother and in other cases it was more related to the virus DNA population. This suggests that either cell-free or cell-associated virus may be transmitted. These data will help AIDS researchers to understand the mechanism of transmission and to plan strategies for prevention of transmission. PMID:8446584
Crainey, James Lee; Marín, Michel Abanto; Silva, Túllio Romão Ribeiro da; de Medeiros, Jansen Fernandes; Pessoa, Felipe Arley Costa; Santos, Yago Vinícius; Vicente, Ana Carolina Paulo; Luz, Sérgio Luiz Bessa
2018-04-18
Despite the broad distribution of M. ozzardi in Latin America and the Caribbean, there is still very little DNA sequence data available to study this neglected parasite's epidemiology. Mitochondrial DNA (mtDNA) sequences, especially the cytochrome oxidase (CO1) gene's barcoding region, have been targeted successfully for filarial diagnostics and for epidemiological, ecological and evolutionary studies. MtDNA-based studies can, however, be compromised by unrecognised mitochondrial pseudogenes, such as Numts. Here, we have used shot-gun Illumina-HiSeq sequencing to recover the first complete Mansonella genus mitogenome and to identify several mitochondrial-origin pseudogenes. Mitogenome phylogenetic analysis placed M. ozzardi in the Onchocercidae "ONC5" clade and suggested that Mansonella parasites are more closely related to Wuchereria and Brugia genera parasites than they are to Loa genus parasites. DNA sequence alignments, BLAST searches and conceptual translations have been used to complement phylogenetic analysis showing that M. ozzardi from the Amazon and Caribbean regions are near-identical and that previously reported Peruvian M. ozzardi CO1 reference sequences are probably of pseudogene origin. In addition to adding a much-needed resource to the Mansonella genus's molecular tool-kit and providing evidence that some M. ozzardi CO1 sequence deposits are pseudogenes, our results suggest that all Neotropical M. ozzardi parasites are closely related.
Chromosome Evolution in Connection with Repetitive Sequences and Epigenetics in Plants
Li, Shu-Fen; Su, Ting; Cheng, Guang-Qian; Wang, Bing-Xiao; Li, Xu; Deng, Chuan-Liang; Gao, Wu-Jun
2017-01-01
Chromosome evolution is a fundamental aspect of evolutionary biology. The evolution of chromosome size, structure and shape, number, and the change in DNA composition suggest the high plasticity of nuclear genomes at the chromosomal level. Repetitive DNA sequences, which represent a conspicuous fraction of every eukaryotic genome, particularly in plants, are found to be tightly linked with plant chromosome evolution. Different classes of repetitive sequences have distinct distribution patterns on the chromosomes. Mounting evidence shows that repetitive sequences may play multiple generative roles in shaping the chromosome karyotypes in plants. Furthermore, recent development in our understanding of the repetitive sequences and plant chromosome evolution has elucidated the involvement of a spectrum of epigenetic modification. In this review, we focused on the recent evidence relating to the distribution pattern of repetitive sequences in plant chromosomes and highlighted their potential relevance to chromosome evolution in plants. We also discussed the possible connections between evolution and epigenetic alterations in chromosome structure and repatterning, such as heterochromatin formation, centromere function, and epigenetic-associated transposable element inactivation. PMID:29064432
Characterization of an In Vivo Z-DNA Detection Probe Based on a Cell Nucleus Accumulating Intrabody.
Gulis, Galina; Silva, Izabel Cristina Rodrigues; Sousa, Herdson Renney; Sousa, Isabel Garcia; Bezerra, Maryani Andressa Gomes; Quilici, Luana Salgado; Maranhao, Andrea Queiroz; Brigido, Marcelo Macedo
2016-09-01
Left-handed Z-DNA is a physiologically unstable DNA conformation, and its existence in vivo can be attributed to localized torsional distress. Despite evidence for the existence of Z-DNA in vivo, its precise role in the control of gene expression is not fully understood. Here, an in vivo probe based on an anti-Z-DNA intrabody is proposed for native Z-DNA detection. The probe was used for chromatin immunoprecipitation of potential Z-DNA-forming sequences in the human genome. One of the isolated putative Z-DNA-forming sequences was cloned upstream of a reporter gene expression cassette under control of the CMV promoter. The reporter gene encoded an antibody fragment fused to GFP. Transient co-transfection of this vector along with the Z-probe coding vector improved reporter gene expression. This improvement was demonstrated by measuring reporter gene mRNA and protein levels and the amount of fluorescence in co-transfected CHO-K1 cells. These results suggest that the presence of the anti-Z-DNA intrabody can interfere with a Z-DNA-containing reporter gene expression. Therefore, this in vivo probe for the detection of Z-DNA could be used for global correlation of Z-DNA-forming sequences and gene expression regulation.
DNA Damage, DNA Repair, Aging, and Neurodegeneration
Maynard, Scott; Fang, Evandro Fei; Scheibye-Knudsen, Morten; Croteau, Deborah L.; Bohr, Vilhelm A.
2015-01-01
Aging in mammals is accompanied by a progressive atrophy of tissues and organs, and stochastic damage accumulation to the macromolecules DNA, RNA, proteins, and lipids. The sequence of the human genome represents our genetic blueprint, and accumulating evidence suggests that loss of genomic maintenance may causally contribute to aging. Distinct evidence for a role of imperfect DNA repair in aging is that several premature aging syndromes have underlying genetic DNA repair defects. Accumulation of DNA damage may be particularly prevalent in the central nervous system owing to the low DNA repair capacity in postmitotic brain tissue. It is generally believed that the cumulative effects of the deleterious changes that occur in aging, mostly after the reproductive phase, contribute to species-specific rates of aging. In addition to nuclear DNA damage contributions to aging, there is also abundant evidence for a causative link between mitochondrial DNA damage and the major phenotypes associated with aging. Understanding the mechanistic basis for the association of DNA damage and DNA repair with aging and age-related diseases, such as neurodegeneration, would give insight into contravening age-related diseases and promoting a healthy life span. PMID:26385091
Barr, Norman; Ruiz-Arce, Raul; Obregón, Oscar; De Leon, Rosita; Foster, Nelson; Reuter, Chris; Boratynski, Theodore; Vacek, Don
2013-02-01
The utility of the cytochrome oxidase I (COI) DNA sequence used for DNA barcoding and a Sequence Characterized Amplified Region for diagnosing boll weevil, Anthonomus grandis Boheman, variants was evaluated. Maximum likelihood analysis of COI DNA sequences from 154 weevils collected from the United States and Mexico supports previous evidence for limited gene flow between weevil populations on wild cotton and commercial cotton in northern Mexico and southern United States. The wild cotton populations represent a variant of the species called the thurberia weevil, which is not regarded as a significant pest. The 31 boll weevil COI haplotypes observed in the study form two distinct haplogroups (A and B) that are supported by five fixed nucleotide differences and a phylogenetic analysis. Although wild and commercial cotton populations are closely associated with specific haplogroups, there is not a fixed difference between the thurberia weevil variant and other populations. The Sequence Characterized Amplified Region marker generated a larger number of inconclusive results than the COI gene but also supported evidence of shared genotypes between wild and commercial cotton weevil populations. These methods provide additional markers that can assist in the identification of pest weevil populations but not definitively diagnose samples.
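The idea of haplogroups "supported by five fixed nucleotide differences" can be illustrated with a minimal sketch. It assumes a fixed difference means an alignment column where each group is monomorphic but the two groups carry different bases; the function name and the toy sequences are hypothetical and are not data from the study.

```python
def fixed_differences(group_a, group_b):
    """Return 0-based alignment columns where each group carries a single
    base and that base differs between the two groups."""
    length = len(group_a[0])
    if any(len(s) != length for s in group_a + group_b):
        raise ValueError("sequences must be aligned to equal length")
    sites = []
    for i in range(length):
        bases_a = {s[i] for s in group_a}
        bases_b = {s[i] for s in group_b}
        # Fixed difference: no variation within either group, but the groups disagree.
        if len(bases_a) == 1 and len(bases_b) == 1 and bases_a != bases_b:
            sites.append(i)
    return sites


# Toy aligned haplotypes (illustrative only).
hap_a = ["ACGTAC", "ACGTAT"]
hap_b = ["ATGTGC", "ATGTGC"]
print(fixed_differences(hap_a, hap_b))
```

Columns that vary within a group (the last column of the toy group A) are skipped, which is why a site can separate haplogroups without being a fixed difference between host-associated populations.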
Nuclear Mitochondrial DNA Activates Replication in Saccharomyces cerevisiae
Chatre, Laurent; Ricchetti, Miria
2011-01-01
The nuclear genome of eukaryotes is colonized by DNA fragments of mitochondrial origin, called NUMTs. These insertions have been associated with a variety of germ-line diseases in humans. The significance of this uptake of potentially dangerous sequences into the nuclear genome is unclear. Here we provide functional evidence that sequences of mitochondrial origin promote nuclear DNA replication in Saccharomyces cerevisiae. We show that NUMTs are rich in key autonomously replicating sequence (ARS) consensus motifs, whose mutation results in the reduction or loss of DNA replication activity. Furthermore, 2D-gel analysis of the mrc1 mutant exposed to hydroxyurea shows that several NUMTs function as late chromosomal origins. We also show that NUMTs located close to or within ARS provide key sequence elements for replication. Thus NUMTs can act as independent origins, when inserted in an appropriate genomic context or affect the efficiency of pre-existing origins. These findings show that migratory mitochondrial DNAs can impact on the replication of the nuclear region they are inserted in. PMID:21408151
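A scan for ARS consensus motifs of the kind described above could be sketched as follows. The abstract does not give the motif itself; this sketch assumes the widely cited 11-bp S. cerevisiae ARS consensus sequence WTTTAYRTTTW and exact matching on both strands. The function names and the example sequence are hypothetical.

```python
import re

# IUPAC codes needed for the assumed consensus (W = A/T, Y = C/T, R = A/G).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "W": "[AT]", "Y": "[CT]", "R": "[AG]"}
ACS = "WTTTAYRTTTW"  # assumed consensus, not taken from the abstract


def revcomp(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_acs(seq):
    """Return sorted (start, strand) pairs for exact consensus matches.
    Starts are 0-based coordinates on the forward strand."""
    # A lookahead lets overlapping matches be reported.
    pattern = re.compile("(?=(" + "".join(IUPAC[c] for c in ACS) + "))")
    seq = seq.upper()
    hits = [(m.start(), "+") for m in pattern.finditer(seq)]
    rc = revcomp(seq)
    hits += [(len(seq) - m.start() - len(ACS), "-") for m in pattern.finditer(rc)]
    return sorted(hits)


example = "GGC" + "ATTTATGTTTA" + "CCG"  # toy sequence with one forward-strand match
print(find_acs(example))
```

Exact matching is the strictest choice; real origin prediction usually tolerates mismatches or uses position weight matrices, which this sketch does not attempt.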
Nuclear mitochondrial DNA activates replication in Saccharomyces cerevisiae.
Chatre, Laurent; Ricchetti, Miria
2011-03-08
The nuclear genome of eukaryotes is colonized by DNA fragments of mitochondrial origin, called NUMTs. These insertions have been associated with a variety of germ-line diseases in humans. The significance of this uptake of potentially dangerous sequences into the nuclear genome is unclear. Here we provide functional evidence that sequences of mitochondrial origin promote nuclear DNA replication in Saccharomyces cerevisiae. We show that NUMTs are rich in key autonomously replicating sequence (ARS) consensus motifs, whose mutation results in the reduction or loss of DNA replication activity. Furthermore, 2D-gel analysis of the mrc1 mutant exposed to hydroxyurea shows that several NUMTs function as late chromosomal origins. We also show that NUMTs located close to or within ARS provide key sequence elements for replication. Thus NUMTs can act as independent origins, when inserted in an appropriate genomic context or affect the efficiency of pre-existing origins. These findings show that migratory mitochondrial DNAs can impact on the replication of the nuclear region they are inserted in.
Trofimova, Irina; Krasikova, Alla
2016-12-01
Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with the biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeat transcription in amphibian and avian model organisms are fragmentary despite their genomes being thoroughly characterized. This review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis, when chromosomes acquire the special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in the interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes, combined with novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, and dynamic observation in life-like conditions, provide amazing opportunities for investigating the mechanisms of satellite DNA transcription.
Krasikova, Alla
2016-01-01
Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with the biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeat transcription in amphibian and avian model organisms are fragmentary despite their genomes being thoroughly characterized. This review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis, when chromosomes acquire the special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in the interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes, combined with novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, and dynamic observation in life-like conditions, provide amazing opportunities for investigating the mechanisms of satellite DNA transcription. PMID:27763817
Ramond, J-B; Makhalanyane, T P; Tuffin, M I; Cowan, D A
2015-04-01
Normalization is a procedure classically employed to detect rare sequences in cellular expression profiles (i.e. cDNA libraries). Here, we present a normalization protocol involving the direct treatment of extracted environmental metagenomic DNA with S1 nuclease, referred to as normalization of metagenomic DNA: NmDNA. We demonstrate that NmDNA, prior to post hoc PCR-based experiments (16S rRNA gene T-RFLP fingerprinting and clone library), increased the diversity of sequences retrieved from environmental microbial communities by detection of rarer sequences. This approach could be used to enhance the resolution of detection of ecologically relevant rare members in environmental microbial assemblages and therefore is promising in enabling a better understanding of ecosystem functioning. This study is the first testing 'normalization' on environmental metagenomic DNA (mDNA). The aim of this procedure was to improve the identification of rare phylotypes in environmental communities. Using hypoliths as model systems, we present evidence that this post-mDNA extraction molecular procedure substantially enhances the detection of less common phylotypes and could even lead to the discovery of novel microbial genotypes within a given environment. © 2014 The Society for Applied Microbiology.
Frye, Mark A; Ryu, Euijung; Nassan, Malik; Jenkins, Gregory D; Andreazza, Ana C; Evans, Jared M; McElroy, Susan L; Oglesbee, Devin; Highsmith, W Edward; Biernacka, Joanna M
2017-01-01
Converging genetic, postmortem gene-expression, cellular, and neuroimaging data implicate mitochondrial dysfunction in bipolar disorder. This study was conducted to investigate whether mitochondrial DNA (mtDNA) haplogroups and single nucleotide variants (SNVs) are associated with sub-phenotypes of bipolar disorder. MtDNA from 224 patients with Bipolar I disorder (BPI) was sequenced, and association of sequence variations with 3 sub-phenotypes (psychosis, rapid cycling, and adolescent illness onset) was evaluated. Gene-level tests were performed to evaluate overall burden of minor alleles for each phenotype. The haplogroup U was associated with a higher risk of psychosis. Secondary analyses of SNVs provided nominal evidence for association of psychosis with variants in the tRNA, ND4 and ND5 genes. The association of psychosis with ND4 (gene that encodes NADH dehydrogenase 4) was further supported by gene-level analysis. Preliminary analysis of mtDNA sequence data suggests a higher risk of psychosis with the U haplogroup and variation in the ND4 gene implicated in electron transport chain energy regulation. Further investigation of the functional consequences of this mtDNA variation is encouraged. Copyright © 2016. Published by Elsevier Ltd.
Contamination of sequence databases with adaptor sequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yoshikawa, Takeo; Sanders, A.R.; Detera-Wadleigh, S.D.
Because of the exponential increase in the amount of DNA sequences being added to the public databases on a daily basis, it has become imperative to identify sources of contamination rapidly. Previously, contaminations of sequence databases have been reported to alert the scientific community to the problem. These contaminations can be divided into two categories. The first category comprises host sequences that have been difficult for submitters to manage or control. Examples include anomalous sequences derived from Escherichia coli, which are inserted into the chromosomes (and plasmids) of the bacterial hosts. Insertion sequences are highly mobile and are capable of transposing themselves into plasmids during cloning manipulation. Another example of the first category is the infection with yeast genomic DNA or with bacterial DNA of some commercially available cDNA libraries from Clontech. The second category of database contamination is due to the inadvertent inclusion of nonhost sequences. This category includes incorporation of cloning-vector sequences and multicloning sites in the database submission. M13-derived artifacts have been common, since M13-based vectors have been widely used for subcloning DNA fragments. Recognizing this problem, the National Center for Biotechnology Information (NCBI) started to screen, in April 1994, all sequences directly submitted to GenBank, against a set of vector data retrieved from GenBank by use of key-word searches, such as "vector." In this report, we present evidence for another sequence artifact that is widespread but that, to our knowledge, has not yet been reported. 11 refs., 1 tab.
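Screening submissions for adaptor contamination, as motivated above, can be sketched very simply. This is not the NCBI screening procedure; it assumes exact matching of a known adaptor (or its reverse complement) within a fixed window at either end of each sequence. The adaptor sequence, record names and window size are made up for illustration.

```python
def revcomp(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def screen_ends(records, adaptors, window=30):
    """Map each record name to the adaptor labels found, in either
    orientation, within `window` bases of its 5' or 3' end."""
    flagged = {}
    for name, seq in records.items():
        seq = seq.upper()
        ends = (seq[:window], seq[-window:])
        hits = sorted(
            label
            for label, adaptor in adaptors.items()
            if any(a in end for end in ends for a in (adaptor, revcomp(adaptor)))
        )
        if hits:
            flagged[name] = hits
    return flagged


# Hypothetical adaptor and toy records (not real library sequences).
adaptors = {"toy_adaptor": "GATTACAGATTACA"}
records = {
    "clone_1": "GATTACAGATTACA" + "ATGGCCAAGT" * 5,   # adaptor at the 5' end
    "clone_2": "ATGGCCAAGT" * 6,                      # clean insert
    "clone_3": "ATGGCCAAGT" * 5 + "TGTAATCTGTAATC",   # reverse complement at the 3' end
}
print(screen_ends(records, adaptors))
```

Restricting the search to sequence ends reflects where ligated adaptors are expected to sit; a production screen would also need to tolerate sequencing errors, for example through local alignment.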
Weighing the mass spectrometric evidence for authentic Tyrannosaurus rex collagen
Buckley, Mike; Walker, Angela; Ho, Simon Y. W.; Yang, Yue; Smith, Colin; Ashton, Peter; Oates, Jane Thomas; Cappellini, Enrico; Koon, Hannah; Penkman, Kirsty; Elsworth, Ben; Ashford, Dave; Solazzo, Caroline; Andrews, Phil; Strahler, John; Shapiro, Beth; Ostrom, Peggy; Gandhi, Hasand; Miller, Webb; Raney, Brian; Zylber, Maria Ines; Gilbert, M. Thomas P.; Prigodich, Richard V.; Ryan, Michael; Rijsdijk, Kenneth F.; Janoo, Anwar; Collins, Matthew J.
2009-01-01
We use authentication tests developed for ancient DNA to evaluate claims by Asara et al. of collagen peptide sequences recovered from mastodon and Tyrannosaurus rex fossils. Although the mastodon passes, absence of amino acid composition data, lack of evidence for peptide deamidation, and association of the α1(I) peptide sequences with amphibians, not birds, suggest that T. rex does not. PMID:18174420
Blanden, R V; Rothenfluh, H S; Zylstra, P; Weiller, G F; Steele, E J
1998-04-01
We present here a unifying hypothesis for the molecular mechanism of somatic hypermutation and somatic gene conversion in IgV genes involving reverse transcription using RNA templates from the V-gene loci to produce cDNA which undergoes homologous recombination with chromosomal V(D)J DNA. Experimental evidence produced over the last 20 years is essentially consistent with this hypothesis. We also review evidence suggesting that somatically generated IgV sequences from B lymphocytes have been fed back to germline DNA over evolutionary time.
Ming-Li Zhang; Stewart C. Sanderson; Yan-Xia Sun; Byalt V. Vyacheslav; Xiao-Li Hao
2014-01-01
Atraphaxis has approximately 25 species and a distribution center in Central Asia. It has been previously used to hypothesize an origin from montane forest. We sampled 18 species covering three sections within the genus and sequenced five cpDNA spacers, atpB-rbcL, psbK-psbI, psbA-trnH, rbcL, and trnL-trnF. BEAST was used to reconstruct phylogenetic relationship and time...
Effective DNA Inhibitors of Cathepsin G by In Vitro Selection
Gatto, Barbara; Vianini, Elena; Lucatello, Lorena; Sissi, Claudia; Moltrasio, Danilo; Pescador, Rodolfo; Porta, Roberto; Palumbo, Manlio
2008-01-01
Cathepsin G (CatG) is a chymotrypsin-like protease released upon degranulation of neutrophils. In several inflammatory and ischaemic diseases the impaired balance between CatG and its physiological inhibitors leads to tissue destruction and platelet aggregation. Inhibitors of CatG are suitable for the treatment of inflammatory diseases and procoagulant conditions. DNA released upon the death of neutrophils at injury sites binds CatG. Moreover, short DNA fragments are more inhibitory than genomic DNA. Defibrotide, a single-stranded polydeoxyribonucleotide with antithrombotic effect, is also a potent CatG inhibitor. Given this experimental evidence, we employed a selection protocol to assess whether DNA inhibition of CatG may be ascribed to specific sequences present in defibrotide DNA. A Selex protocol was applied to identify the single-stranded DNA sequences exhibiting the highest affinity for CatG, the diversity of a combinatorial pool of oligodeoxyribonucleotides being a good representation of the complexity found in defibrotide. Biophysical and biochemical studies confirmed that the selected sequences bind tightly to the target enzyme and also efficiently inhibit its catalytic activity. Sequence analysis carried out to unveil a motif responsible for CatG recognition showed a recurrence of alternating TG repeats in the selected CatG binders, adopting an extended conformation that grants maximal interaction with the highly charged protein surface. This unprecedented finding is validated by our results showing high affinity and inhibition of CatG by specific DNA sequences of variable length designed to maximally reduce pairing/folding interactions. PMID:19325843
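The recurrence of alternating TG repeats in selected binders could be tallied with a short motif search. This sketch assumes that a "repeat" means at least a chosen number of consecutive TG dinucleotides; the threshold, function name and example sequence are hypothetical and do not come from the study.

```python
import re


def tg_runs(seq, min_repeats=3):
    """Return (start, repeat_count) for each run of at least
    `min_repeats` consecutive TG dinucleotides (0-based start)."""
    pattern = re.compile(r"(?:TG){%d,}" % min_repeats)
    return [(m.start(), len(m.group()) // 2) for m in pattern.finditer(seq.upper())]


# Toy oligonucleotide: one run of four TG units, one run of two (below threshold).
print(tg_runs("ACTGTGTGTGCATGTGA"))
```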
Mutations altering the cleavage specificity of a homing endonuclease
Seligman, Lenny M.; Chisholm, Karen M.; Chevalier, Brett S.; Chadsey, Meggen S.; Edwards, Samuel T.; Savage, Jeremiah H.; Veillet, Adeline L.
2002-01-01
The homing endonuclease I-CreI recognizes and cleaves a particular 22 bp DNA sequence. The crystal structure of I-CreI bound to homing site DNA has previously been determined, leading to a number of predictions about specific protein–DNA contacts. We test these predictions by analyzing a set of endonuclease mutants and a complementary set of homing site mutants. We find evidence that all structurally predicted I-CreI/DNA contacts contribute to DNA recognition and show that these contacts differ greatly in terms of their relative importance. We also describe the isolation of a collection of altered specificity I-CreI derivatives. The in vitro DNA-binding and cleavage properties of two such endonucleases demonstrate that our genetic approach is effective in identifying homing endonucleases that recognize and cleave novel target sequences. PMID:12202772
[Molecular identification of medicinal plant genus Uncaria in Guizhou].
Gang, Tao; Liu, Tao; Zhu, Ying; Liu, Zuo-Yi
2008-06-01
To analyze the rDNA ITS regions of medicinal plants of the genus Uncaria in Guizhou and construct their phylogenetic tree, in order to supply molecular evidence for the taxonomy and identification of these medicinal plants at the genetic level. The ITS gene fragments of the 4 medicinal plants were PCR amplified and sequenced. The rDNA ITS regions were analyzed with the software ClustalX, BioEdit and PAUP* 4.0 beta 10. The entire sequences of rDNA ITS1, ITS2, and 5.8S rDNA were obtained. A maximum-parsimony tree of the four ITS regions, together with similar sequences from GenBank, was constructed, with Mitragyna rubrostipulata (AJ492621) and Mitragyna rubrostipulata (AJ605988) designated as the outgroup. The 4 medicinal plants are 4 species of the genus Uncaria and are most similar to Uncaria rhynchophylla.
The DL1 repeats in the genome of Diphyllobothrium latum.
Usmanova, Nadezhda M; Kazakov, Vasiliy I
2010-07-01
Diphyllobothrium latum is a widespread intestinal parasite of great clinical relevance, but no sequences of its nuclear genome are available. In this paper, a repetitive element in the D. latum genome is described for the first time. The adult D. latum was obtained following its expulsion from the intestine of a patient suffering from diphyllobothriasis. Genomic DNA was isolated from several proglottids of this individual. PstI restriction products of D. latum genomic DNA were sequenced. Polymerase chain reaction (PCR) amplification of these products using genomic DNA and selected primers was carried out. A cluster of a repetitive element, called DL1, was thereby discovered. To identify precisely the beginning and end of the repeat, a product of PCR amplification of D. latum genomic DNA with one specific primer was sequenced. In the discussion, several lines of evidence are presented that the DL1 repeat is a member of the SINE family of retroposons.
Mitochondrial DNA of Vitis vinifera and the issue of rampant horizontal gene transfer.
Goremykin, Vadim V; Salamini, Francesco; Velasco, Riccardo; Viola, Roberto
2009-01-01
The mitochondrial genome of grape (Vitis vinifera), the largest organelle genome sequenced so far, is presented. The genome is 773,279 nt long and has the highest coding capacity among known angiosperm mitochondrial DNAs (mtDNAs). The proportion of promiscuous DNA of plastid origin in the genome is also the largest ever reported for an angiosperm mtDNA, both in absolute and relative terms. In all, 42.4% of chloroplast genome of Vitis has been incorporated into its mitochondrial genome. In order to test if horizontal gene transfer (HGT) has also contributed to the gene content of the grape mtDNA, we built phylogenetic trees with the coding sequences of mitochondrial genes of grape and their homologs from plant mitochondrial genomes. Many incongruent gene tree topologies were obtained. However, the extent of incongruence between these gene trees is not significantly greater than that observed among optimal trees for chloroplast genes, the common ancestry of which has never been in doubt. In both cases, we attribute this incongruence to artifacts of tree reconstruction, insufficient numbers of characters, and gene paralogy. This finding leads us to question the recent phylogenetic interpretation of Bergthorsson et al. (2003, 2004) and Richardson and Palmer (2007) that rampant HGT into the mtDNA of Amborella best explains phylogenetic incongruence between mitochondrial gene trees for angiosperms. The only evidence for HGT into the Vitis mtDNA found involves fragments of two coding sequences stemming from two closteroviruses that cause the leaf roll disease of this plant. We also report that analysis of sequences shared by both chloroplast and mitochondrial genomes provides evidence for a previously unknown gene transfer route from the mitochondrion to the chloroplast.
Nakano, Tadao; Okamoto, Munehiro; Ikeda, Yatsukaho; Hasegawa, Hideo
2006-12-01
Sequences of mitochondrial cytochrome c oxidase subunit 1 (CO1) gene, nuclear internal transcribed spacer 2 (ITS2) region of ribosomal DNA (rDNA), and 5S rDNA of Enterobius vermicularis from captive chimpanzees in five zoos/institutions in Japan were analyzed and compared with those of pinworm eggs from humans in Japan. Three major types of variants appearing in both CO1 and ITS2 sequences, but showing no apparent connection, were observed among materials collected from the chimpanzees. Each one of them was also observed in pinworms in humans. Sequences of 5S rDNA were identical in the materials from chimpanzees and humans. Phylogenetic analysis of CO1 gene revealed three clusters with high bootstrap value, suggesting considerable divergence, presumably correlated with human evolution, has occurred in the human pinworms. The synonymy of E. gregorii with E. vermicularis is supported by the molecular evidence.
Chaitanya, Lakshmi; van Oven, Mannis; Brauer, Silke; Zimmermann, Bettina; Huber, Gabriela; Xavier, Catarina; Parson, Walther; de Knijff, Peter; Kayser, Manfred
2016-03-01
The use of mitochondrial DNA (mtDNA) for maternal lineage identification often marks the last resort when investigating forensic and missing-person cases involving highly degraded biological materials. As with all comparative DNA testing, a match between evidence and reference sample requires a statistical interpretation, for which high-quality mtDNA population frequency data are crucial. Here, we determined, under high quality standards, the complete mtDNA control-region sequences of 680 individuals from across the Netherlands sampled at 54 sites, covering the entire country with 10 geographic sub-regions. The complete mtDNA control region (nucleotide positions 16,024-16,569 and 1-576) was amplified with two PCR primers and sequenced with ten different sequencing primers using the EMPOP protocol. Haplotype diversity of the entire sample set was very high at 99.63% and, accordingly, the random-match probability was 0.37%. No population substructure within the Netherlands was detected with our dataset. Phylogenetic analyses were performed to determine mtDNA haplogroups. Inclusion of these high-quality data in the EMPOP database (accession number: EMP00666) will improve its overall data content and geographic coverage in the interest of all EMPOP users worldwide. Moreover, this dataset will serve as (the start of) a national reference database for mtDNA applications in forensic and missing person casework in the Netherlands. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
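The two summary statistics reported above are closely related, which the following minimal sketch illustrates. It assumes the random-match probability is the sum of squared haplotype frequencies and that haplotype diversity is its complement, with an optional n/(n-1) sample-size correction that some authors apply; the function names and the toy haplotype labels are hypothetical and are not taken from the study.

```python
from collections import Counter


def random_match_probability(haplotypes):
    """Probability that two randomly drawn samples share a haplotype.

    Assumed definition: the sum of squared haplotype frequencies.
    """
    n = len(haplotypes)
    counts = Counter(haplotypes)
    return sum((count / n) ** 2 for count in counts.values())


def haplotype_diversity(haplotypes, unbiased=False):
    """Complement of the random-match probability.

    With unbiased=True the n/(n-1) sample-size correction is applied.
    """
    n = len(haplotypes)
    diversity = 1.0 - random_match_probability(haplotypes)
    if unbiased:
        diversity *= n / (n - 1)
    return diversity


# Toy data (hypothetical haplotype labels, not real control-region data).
sample = ["H1", "H1", "H2", "H3"]
rmp = random_match_probability(sample)
hd = haplotype_diversity(sample)
```

Under the uncorrected definition the two values always sum to one, which is consistent with the 99.63% and 0.37% figures quoted in the abstract.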
Gocayne, J; Robinson, D A; FitzGerald, M G; Chung, F Z; Kerlavage, A R; Lentes, K U; Lai, J; Wang, C D; Fraser, C M; Venter, J C
1987-01-01
Two cDNA clones, lambda RHM-MF and lambda RHB-DAR, encoding the muscarinic cholinergic receptor and the beta-adrenergic receptor, respectively, have been isolated from a rat heart cDNA library. The cDNA clones were characterized by restriction mapping and automated DNA sequence analysis utilizing fluorescent dye primers. The rat heart muscarinic receptor consists of 466 amino acids and has a calculated molecular weight of 51,543. The rat heart beta-adrenergic receptor consists of 418 amino acids and has a calculated molecular weight of 46,890. The two cardiac receptors have substantial amino acid homology (27.2% identity, 50.6% with favored substitutions). The rat cardiac beta receptor has 88.0% homology (92.5% with favored substitutions) with the human brain beta receptor and the rat cardiac muscarinic receptor has 94.6% homology (97.6% with favored substitutions) with the porcine cardiac muscarinic receptor. The muscarinic cholinergic and beta-adrenergic receptors appear to be as conserved as hemoglobin and cytochrome c but less conserved than histones and are clearly members of a multigene family. These data support our hypothesis, based upon biochemical and immunological evidence, that suggests considerable structural homology and evolutionary conservation between adrenergic and muscarinic cholinergic receptors. To our knowledge, this is the first report utilizing automated DNA sequence analysis to determine the structure of a gene. PMID:2825184
Is a Genome a Codeword of an Error-Correcting Code?
Kleinschmidt, João H.; Silva-Filho, Márcio C.; Bim, Edson; Herai, Roberto H.; Yamagishi, Michel E. B.; Palazzo, Reginaldo
2012-01-01
Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction. PMID:22649495
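The abstract does not state which alphabet labeling or code parameters the authors used, so the sketch below only illustrates, in generic terms, what it means for a word to be a codeword of a Hamming code. It assumes the binary Hamming(7,4) code in its standard positional parity-check form, in which the syndrome is the XOR of the 1-based positions of all set bits; the function names are hypothetical, and the mapping from nucleotides to code symbols is deliberately left out.

```python
def syndrome(word):
    """Syndrome of a 7-bit word under the binary Hamming(7,4) code.

    Column i of the parity-check matrix is the binary representation
    of i, so the syndrome is the XOR of the positions holding a 1.
    """
    if len(word) != 7:
        raise ValueError("Hamming(7,4) words have exactly 7 bits")
    result = 0
    for position, bit in enumerate(word, start=1):
        if bit:
            result ^= position
    return result


def is_codeword(word):
    """A word is a codeword exactly when its syndrome is zero."""
    return syndrome(word) == 0


valid = [1, 1, 1, 0, 0, 0, 0]      # positions 1, 2, 3: 1 ^ 2 ^ 3 == 0
corrupted = [1, 1, 1, 0, 1, 0, 0]  # same word with position 5 flipped
```

For a single-bit error the nonzero syndrome equals the position of the flipped bit, which is what makes the code error-correcting rather than merely error-detecting.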
The readability of expert reports for non-scientist report-users: reports of DNA analysis.
Howes, Loene M; Julian, Roberta; Kelty, Sally F; Kemp, Nenagh; Kirkbride, K Paul
2014-04-01
DNA evidence can be extremely compelling. With ongoing scientific advances and applications of DNA evidence in the criminal justice system, it is increasingly important that police, lawyers, and judges recognise both the limitations of DNA evidence and the strength of the evidence in particular cases. Because most forensic sciences are formally communicated via expert reports, we analysed the readability of 68 such reports of DNA evidence from 6 of 8 Australian jurisdictions. We conducted content analyses using three categories: content and sequence, language, and format. Categories contained qualitative and quantitative items drawn from theory and past research. Report styles differed by jurisdiction and by main audience - police and the courts. Reports for police were brief and few links were made between sections in these reports. Reports for courts were less brief and used either legal or scientific styles. Common sections in reports for courts included: the scientist's specialised knowledge; laboratory accreditation information; item list; results; and notes on interpretation. Sections were often not in a logical sequence, due to the use of appendices. According to Flesch Reading Ease scores, reports for police had language that was fairly difficult, and reports for courts, difficult. Difficulty was compounded by the use of specialist terms. Reports for police and the appendices of reports for court often used very small font and single line spacing. Many reports for court contained tables that spanned several pages. Suggestions based on theory and past research are provided to assist scientists to enhance the readability of reports for non-scientists. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
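For readers unfamiliar with the readability measure mentioned above, the following sketch shows how a Flesch Reading Ease score is computed from word, sentence, and syllable counts, and how scores map onto the conventional difficulty bands. The counts in the example are invented for illustration and do not come from the reports analysed in the study; the function names are hypothetical.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease from raw counts; higher scores are easier."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_band(score):
    """Map a score onto the conventional Flesch difficulty labels."""
    bands = [
        (90, "very easy"),
        (80, "easy"),
        (70, "fairly easy"),
        (60, "standard"),
        (50, "fairly difficult"),
        (30, "difficult"),
    ]
    for threshold, label in bands:
        if score >= threshold:
            return label
    return "very difficult"


# Invented counts: 100 words in 5 sentences with 150 or 170 syllables.
score_a = flesch_reading_ease(words=100, sentences=5, syllables=150)
score_b = flesch_reading_ease(words=100, sentences=5, syllables=170)
```

With these counts the first passage falls in the "fairly difficult" band and the second in the "difficult" band, the two labels the abstract attaches to reports for police and for courts respectively.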
Specific and non-specific interactions of ParB with DNA: implications for chromosome segregation
Taylor, James A.; Pastrana, Cesar L.; Butterer, Annika; Pernstich, Christian; Gwynn, Emma J.; Sobott, Frank; Moreno-Herrero, Fernando; Dillingham, Mark S.
2015-01-01
The segregation of many bacterial chromosomes is dependent on the interactions of ParB proteins with centromere-like DNA sequences called parS that are located close to the origin of replication. In this work, we have investigated the binding of Bacillus subtilis ParB to DNA in vitro using a variety of biochemical and biophysical techniques. We observe tight and specific binding of a ParB homodimer to the parS sequence. Binding of ParB to non-specific DNA is more complex and displays apparent positive co-operativity that is associated with the formation of larger, poorly defined, nucleoprotein complexes. Experiments with magnetic tweezers demonstrate that non-specific binding leads to DNA condensation that is reversible by protein unbinding or force. The condensed DNA structure is not well ordered and we infer that it is formed by many looping interactions between neighbouring DNA segments. Consistent with this view, ParB is also able to stabilize writhe in single supercoiled DNA molecules and to bridge segments from two different DNA molecules in trans. The experiments provide no evidence for the promotion of non-specific DNA binding and/or condensation events by the presence of parS sequences. The implications of these observations for chromosome segregation are discussed. PMID:25572315
Kane, Nolan; Sveinsson, Saemundur; Dempewolf, Hannes; Yang, Ji Yong; Zhang, Dapeng; Engels, Johannes M M; Cronk, Quentin
2012-02-01
To reliably identify lineages below the species level such as subspecies or varieties, we propose an extension to DNA-barcoding using next-generation sequencing to produce whole organellar genomes and substantial nuclear ribosomal sequence. Because this method uses much longer versions of the traditional DNA-barcoding loci in the plastid and ribosomal DNA, we call our approach ultra-barcoding (UBC). We used high-throughput next-generation sequencing to scan the genome and generate reliable sequence of high copy number regions. Using this method, we examined whole plastid genomes as well as nearly 6000 bases of nuclear ribosomal DNA sequences for nine genotypes of Theobroma cacao and an individual of the related species T. grandiflorum, as well as an additional publicly available whole plastid genome of T. cacao. All individuals of T. cacao examined were uniquely distinguished, and evidence of reticulation and gene flow was observed. Sequence variation was observed in some of the canonical barcoding regions between species, but other regions of the chloroplast were more variable both within species and between species, as were ribosomal spacers. Furthermore, no single region provides the level of data available using the complete plastid genome and rDNA. Our data demonstrate that UBC is a viable, increasingly cost-effective approach for reliably distinguishing varieties and even individual genotypes of T. cacao. This approach shows great promise for applications where very closely related or interbreeding taxa must be distinguished.
Statistical and linguistic features of DNA sequences
NASA Technical Reports Server (NTRS)
Havlin, S.; Buldyrev, S. V.; Goldberger, A. L.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.
1995-01-01
We present evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationary" feature of the sequence of base pairs by applying a new algorithm called Detrended Fluctuation Analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and noncoding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to all eukaryotic DNA sequences (33 301 coding and 29 453 noncoding) in the entire GenBank database. We describe a simple model to account for the presence of long-range power-law correlations which is based upon a generalization of the classic Levy walk. Finally, we describe briefly some recent work showing that the noncoding sequences have certain statistical features in common with natural languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function. We suggest that noncoding regions in plants and invertebrates may display a smaller entropy and larger redundancy than coding regions, further supporting the possibility that noncoding regions of DNA may carry biological information.
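The detrended fluctuation analysis described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes a purine/pyrimidine mapping for the DNA walk (purine = +1, pyrimidine = -1; the sign convention is arbitrary), linear detrending within non-overlapping boxes, and a least-squares fit of log F(n) against log n to estimate the scaling exponent. All function names are hypothetical.

```python
import math


def dna_walk(sequence):
    """Integrated, mean-centred +1/-1 walk (assumed purine/pyrimidine rule)."""
    steps = [1.0 if base in "AG" else -1.0 for base in sequence.upper()]
    mean = sum(steps) / len(steps)
    profile, total = [], 0.0
    for step in steps:
        total += step - mean
        profile.append(total)
    return profile


def fluctuation(profile, n):
    """Root-mean-square residual after a linear fit in each box of size n."""
    if n < 3 or n > len(profile):
        raise ValueError("box size must be between 3 and len(profile)")
    xs = range(n)
    x_mean = (n - 1) / 2
    sxx = sum((x - x_mean) ** 2 for x in xs)
    squared_sum, count = 0.0, 0
    for start in range(0, len(profile) - n + 1, n):
        box = profile[start:start + n]
        y_mean = sum(box) / n
        slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, box)) / sxx
        for x, y in zip(xs, box):
            residual = y - (y_mean + slope * (x - x_mean))
            squared_sum += residual * residual
        count += n
    return math.sqrt(squared_sum / count)


def dfa_exponent(profile, box_sizes):
    """Slope of log F(n) versus log n; F(n) must be nonzero for every n."""
    xs = [math.log(n) for n in box_sizes]
    ys = [math.log(fluctuation(profile, n)) for n in box_sizes]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    numerator = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    return numerator / sum((x - x_mean) ** 2 for x in xs)


# A strictly alternating purine/pyrimidine sequence has no long-range structure.
alpha = dfa_exponent(dna_walk("AC" * 64), [4, 8, 16])
```

An exponent near 0.5 corresponds to an uncorrelated sequence, while values above 0.5 indicate the kind of long-range power-law correlation the abstract reports for noncoding regions; the alternating toy sequence gives a value close to zero because its walk never wanders.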
The DNA-encoded nucleosome organization of a eukaryotic genome.
Kaplan, Noam; Moore, Irene K; Fondufe-Mittendorf, Yvonne; Gossett, Andrea J; Tillo, Desiree; Field, Yair; LeProust, Emily M; Hughes, Timothy R; Lieb, Jason D; Widom, Jonathan; Segal, Eran
2009-03-19
Nucleosome organization is critical for gene regulation. In living cells this organization is determined by multiple factors, including the action of chromatin remodellers, competition with site-specific DNA-binding proteins, and the DNA sequence preferences of the nucleosomes themselves. However, it has been difficult to estimate the relative importance of each of these mechanisms in vivo, because in vivo nucleosome maps reflect the combined action of all influencing factors. Here we determine the importance of nucleosome DNA sequence preferences experimentally by measuring the genome-wide occupancy of nucleosomes assembled on purified yeast genomic DNA. The resulting map, in which nucleosome occupancy is governed only by the intrinsic sequence preferences of nucleosomes, is similar to in vivo nucleosome maps generated in three different growth conditions. In vitro, nucleosome depletion is evident at many transcription factor binding sites and around gene start and end sites, indicating that nucleosome depletion at these sites in vivo is partly encoded in the genome. We confirm these results with a micrococcal nuclease-independent experiment that measures the relative affinity of nucleosomes for approximately 40,000 double-stranded 150-base-pair oligonucleotides. Using our in vitro data, we devise a computational model of nucleosome sequence preferences that is significantly correlated with in vivo nucleosome occupancy in Caenorhabditis elegans. Our results indicate that the intrinsic DNA sequence preferences of nucleosomes have a central role in determining the organization of nucleosomes in vivo.
How good are indirect tests at detecting recombination in human mtDNA?
White, Daniel James; Bryant, David; Gemmell, Neil John
2013-07-08
Empirical proof of human mitochondrial DNA (mtDNA) recombination in somatic tissues was obtained in 2004; however, irrefutable evidence for recombination in human mtDNA at the population level is still lacking. Our inability to demonstrate convincingly a signal of recombination in population data sets of human mtDNA sequence may be due, in part, to the ineffectiveness of current indirect tests. Previously, we tested some well-established indirect tests of recombination (linkage disequilibrium vs. distance using D' and r², Homoplasy Test, Pairwise Homoplasy Index, Neighborhood Similarity Score, and Max χ²) on sequence data derived from the only empirically confirmed case of human mtDNA recombination thus far and demonstrated that some methods were unable to detect recombination. Here, we assess the performance of these six well-established tests and explore what characteristics specific to human mtDNA sequence may affect their efficacy by simulating sequence under various parameters with levels of recombination (ρ) that vary around an empirically derived estimate for human mtDNA (population parameter ρ = 5.492). No test performed infallibly under any of our scenarios, and error rates varied across tests, whereas detection rates increased substantially with ρ values > 5.492. Under a model of evolution that incorporates parameters specific to human mtDNA, including rate heterogeneity, population expansion, and ρ = 5.492, successful detection rates are limited to a range of 7-70% across tests with an acceptable level of false-positive results: the neighborhood similarity score incompatibility test performed best overall under these parameters. Population growth seems to have the greatest impact on recombination detection probabilities across all models tested, likely due to its impact on sequence diversity. The implications of our findings on our current understanding of mtDNA recombination in humans are discussed.
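Two of the indirect tests listed above rest on pairwise linkage disequilibrium statistics. The sketch below shows how D, D' and r² are conventionally computed for a pair of biallelic sites from a list of two-site haplotypes; tests of this kind then ask whether these values decay with the distance between sites. The function name, the haplotype encoding, and the toy data are hypothetical and not taken from the study.

```python
def linkage_disequilibrium(haplotypes, allele_a, allele_b):
    """Return (D, D', r^2) for two biallelic sites.

    haplotypes: sequence of 2-character strings, one character per site.
    allele_a / allele_b: the reference allele at the first / second site.
    """
    n = len(haplotypes)
    p_a = sum(1 for h in haplotypes if h[0] == allele_a) / n
    p_b = sum(1 for h in haplotypes if h[1] == allele_b) / n
    p_ab = sum(1 for h in haplotypes if h[0] == allele_a and h[1] == allele_b) / n
    if p_a in (0.0, 1.0) or p_b in (0.0, 1.0):
        raise ValueError("both sites must be polymorphic")

    d = p_ab - p_a * p_b
    # D' rescales D by its largest attainable magnitude given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max
    r_squared = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r_squared


# Toy haplotypes: complete association versus complete independence.
complete = ["AB"] * 5 + ["ab"] * 5
independent = ["AB", "Ab", "aB", "ab"]
```

In the first toy data set only two of the four possible haplotypes occur, so both D' and r² equal one; in the second all four occur equally often and both statistics are zero.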
How Good Are Indirect Tests at Detecting Recombination in Human mtDNA?
White, Daniel James; Bryant, David; Gemmell, Neil John
2013-01-01
Empirical proof of human mitochondrial DNA (mtDNA) recombination in somatic tissues was obtained in 2004; however, a lack of irrefutable evidence exists for recombination in human mtDNA at the population level. Our inability to demonstrate convincingly a signal of recombination in population data sets of human mtDNA sequence may be due, in part, to the ineffectiveness of current indirect tests. Previously, we tested some well-established indirect tests of recombination (linkage disequilibrium vs. distance using D′ and r², Homoplasy Test, Pairwise Homoplasy Index, Neighborhood Similarity Score, and Max χ²) on sequence data derived from the only empirically confirmed case of human mtDNA recombination thus far and demonstrated that some methods were unable to detect recombination. Here, we assess the performance of these six well-established tests and explore what characteristics specific to human mtDNA sequence may affect their efficacy by simulating sequence under various parameters with levels of recombination (ρ) that vary around an empirically derived estimate for human mtDNA (population parameter ρ = 5.492). No test performed infallibly under any of our scenarios, and error rates varied across tests, whereas detection rates increased substantially with ρ values > 5.492. Under a model of evolution that incorporates parameters specific to human mtDNA, including rate heterogeneity, population expansion, and ρ = 5.492, successful detection rates are limited to a range of 7-70% across tests with an acceptable level of false-positive results: the neighborhood similarity score incompatibility test performed best overall under these parameters. Population growth seems to have the greatest impact on recombination detection probabilities across all models tested, likely due to its impact on sequence diversity. The implications of our findings on our current understanding of mtDNA recombination in humans are discussed. PMID:23665874
Chelomina, Galina N; Rozhkovan, Konstantin V; Voronova, Anastasia N; Burundukova, Olga L; Muzarok, Tamara I; Zhuravlev, Yuri N
2016-04-01
Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plant. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, under artificial cultivation. The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. Slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and the other clones differed by one to six substitutions. Nucleotide polymorphism was more pronounced at positions 440-640 bp and was distributed across variable regions, expansion segments, and conserved elements of the core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. This study identified evidence of intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine.
Chelomina, Galina N.; Rozhkovan, Konstantin V.; Voronova, Anastasia N.; Burundukova, Olga L.; Muzarok, Tamara I.; Zhuravlev, Yuri N.
2015-01-01
Background: Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plant. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, under artificial cultivation. Methods: The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. Slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. Results: In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and the other clones differed by one to six substitutions. Nucleotide polymorphism was more pronounced at positions 440–640 bp and was distributed across variable regions, expansion segments, and conserved elements of the core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. Conclusion: This study identified evidence of intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine. PMID:27158239
NASA Technical Reports Server (NTRS)
Lednicky, John A.; Halvorson, Steven J.; Butel, Janet S.
2002-01-01
A lymphotropic papovavirus (LPV) archetypal regulatory region was amplified from DNA from the blood of an immunocompromised rhesus monkey. We believe this is the first nonserological evidence of LPV infection in rhesus monkeys.
Liu, Ning; Tian, Ru; Loeb, Daniel D
2003-02-18
Synthesis of the relaxed-circular (RC) DNA genome of hepadnaviruses requires two template switches during plus-strand DNA synthesis: primer translocation and circularization. Although primer translocation and circularization use different donor and acceptor sequences, and are distinct temporally, they share the common theme of switching from one end of the minus-strand template to the other end. Studies of duck hepatitis B virus have indicated that, in addition to the donor and acceptor sequences, three other cis-acting sequences, named 3E, M, and 5E, are required for the synthesis of RC DNA by contributing to primer translocation and circularization. The mechanism by which 3E, M, and 5E act was not known. We present evidence that these sequences function by base pairing with each other within the minus-strand template. 3E base-pairs with one portion of M (M3) and 5E base-pairs with an adjacent portion of M (M5). We found that disrupting base pairing between 3E and M3 and between 5E and M5 inhibited primer translocation and circularization. More importantly, restoring base pairing with mutant sequences restored the production of RC DNA. These results are consistent with the model that, within duck hepatitis B virus capsids, the ends of the minus-strand template are juxtaposed via base pairing to facilitate the two template switches during plus-strand DNA synthesis.
Liu, Juan; Qi, Zhe-Chen; Zhao, Yun-Peng; Fu, Cheng-Xin; Jenny Xiang, Qiu-Yun
2012-09-01
The complete nucleotide sequence of the chloroplast genome (cpDNA) of Smilax china L. (Smilacaceae) is reported. It is the first complete cp genome sequence in Liliales. Genomic analyses were conducted to examine the rate and pattern of cpDNA genome evolution in Smilax relative to other major lineages of monocots. The cpDNA genomic sequences were combined with those available for Lilium to evaluate the phylogenetic position of Liliales and to investigate the influence of taxon sampling, gene sampling, gene function, natural selection, and substitution rate on phylogenetic inference in monocots. Phylogenetic analyses using sequence data of gene groups partitioned according to gene function, selection force, and total substitution rate demonstrated evident impacts of these factors on phylogenetic inference of monocots and the placement of Liliales, suggesting potential evolutionary convergence or adaptation of some cpDNA genes in monocots. Our study also demonstrated that reduced taxon sampling reduced the bootstrap support for the placement of Liliales in the cpDNA phylogenomic analysis. Analyses of sequences of 77 protein genes with some missing data and sequences of 81 genes (all protein genes plus the rRNA genes) support a sister relationship of Liliales to the commelinids-Asparagales clade, consistent with the APG III system. Analyses of 63 cpDNA protein genes for 32 taxa with few missing data, however, support a sister relationship of Liliales (represented by Smilax and Lilium) to Dioscoreales-Pandanales. Topology tests indicated that these two alignments do not significantly differ given any of these three cpDNA genomic sequence data sets. Furthermore, we found no saturation effect of the data, suggesting that the cpDNA genomic sequence data used in the study are appropriate for monocot phylogenetic study and that long-branch attraction is unlikely to explain the two well-supported, conflicting placements of Liliales. Further analyses using sufficient nuclear data remain necessary to evaluate these two phylogenetic hypotheses regarding the position of Liliales and to address the causes of signal conflict among genes and partitions. Copyright © 2012 Elsevier Inc. All rights reserved.
Crossovers are associated with mutation and biased gene conversion at recombination hotspots.
Arbeithuber, Barbara; Betancourt, Andrea J; Ebner, Thomas; Tiemann-Boege, Irene
2015-02-17
Meiosis is a potentially important source of germline mutations, as sites of meiotic recombination experience recurrent double-strand breaks (DSBs). However, evidence for a local mutagenic effect of recombination from population sequence data has been equivocal, likely because mutation is only one of several forces shaping sequence variation. By sequencing large numbers of single crossover molecules obtained from human sperm for two recombination hotspots, we find direct evidence that recombination is mutagenic: Crossovers carry more de novo mutations than nonrecombinant DNA molecules analyzed for the same donors and hotspots. The observed mutations were primarily CG to TA transitions, with a higher frequency of transitions at CpG than non-CpG sites. This enrichment of mutations at CpG sites at hotspots could predominate in methylated regions involving frequent single-stranded DNA processing as part of DSB repair. In addition, our data set provides evidence that GC alleles are preferentially transmitted during crossing over, opposing mutation, and shows that GC-biased gene conversion (gBGC) predominates over mutation in the sequence evolution of hotspots. These findings are consistent with the idea that gBGC could be an adaptation to counteract the mutational load of recombination.
Crossovers are associated with mutation and biased gene conversion at recombination hotspots
Arbeithuber, Barbara; Betancourt, Andrea J.; Ebner, Thomas; Tiemann-Boege, Irene
2015-01-01
Meiosis is a potentially important source of germline mutations, as sites of meiotic recombination experience recurrent double-strand breaks (DSBs). However, evidence for a local mutagenic effect of recombination from population sequence data has been equivocal, likely because mutation is only one of several forces shaping sequence variation. By sequencing large numbers of single crossover molecules obtained from human sperm for two recombination hotspots, we find direct evidence that recombination is mutagenic: Crossovers carry more de novo mutations than nonrecombinant DNA molecules analyzed for the same donors and hotspots. The observed mutations were primarily CG to TA transitions, with a higher frequency of transitions at CpG than non-CpG sites. This enrichment of mutations at CpG sites at hotspots could predominate in methylated regions involving frequent single-stranded DNA processing as part of DSB repair. In addition, our data set provides evidence that GC alleles are preferentially transmitted during crossing over, opposing mutation, and shows that GC-biased gene conversion (gBGC) predominates over mutation in the sequence evolution of hotspots. These findings are consistent with the idea that gBGC could be an adaptation to counteract the mutational load of recombination. PMID:25646453
Reading of the non-template DNA by transcription elongation factors.
Svetlov, Vladimir; Nudler, Evgeny
2018-05-14
Unlike transcription initiation and termination, which have easily discernable signals such as promoters and terminators, elongation is regulated through a dynamic network involving RNA/DNA pause signals and states- rather than sequence-specific protein interactions. A report by Nedialkov et al. (in press) provides experimental evidence for sequence-specific recruitment of elongation factor RfaH to transcribing RNA polymerase (RNAP) and outlines the mechanism of gene expression regulation by restraint ("locking") of the DNA non-template strand. According to this model, the elongation complex pauses at the so called "operon polarity sequence" (found in some long bacterial operons coding for virulence genes), when the usually flexible non-template DNA strand adopts a distinct hairpin-loop conformation on the surface of transcribing RNAP. Sequence-specific binding of RfaH to this DNA segment facilitates conversion of RfaH from its inactive closed to its active open conformation. The interaction network formed between RfaH, non-template DNA, and RNAP locks DNA in a conformation that renders the elongation complex resistant to pausing and termination. The effects of such locking on transcript elongation can be mimicked by restraint of the non-template strand due to its shortening. This work advances our understanding of regulation of transcript elongation and has important implications for the action of general transcription factors, such as NusG, which lack apparent sequence-specificity, as well as for the mechanisms of other processes linked to transcription such as transcription-coupled DNA repair. This article is protected by copyright. All rights reserved. © 2018 John Wiley & Sons Ltd.
Alexandrov, Boian S; Fukuyo, Yayoi; Lange, Martin; Horikoshi, Nobuo; Gelev, Vladimir; Rasmussen, Kim Ø; Bishop, Alan R; Usheva, Anny
2012-11-01
The genome-wide mapping of the major gene expression regulators, the transcription factors (TFs) and their DNA binding sites, is of great importance for describing cellular behavior and phenotypic diversity. Presently, the methods for prediction of genomic TF binding produce a large number of false positives, most likely due to insufficient description of the physicochemical mechanisms of protein-DNA binding. Growing evidence suggests that, in the cell, the double-stranded DNA (dsDNA) is subject to local transient strand separations (breathing) that contribute to genomic functions. By using site-specific chromatin immunoprecipitations, gel shifts, BIOBASE data, and our model that accurately describes the melting behavior and breathing dynamics of dsDNA, we report a specific DNA breathing profile found at YY1 binding sites in cells. We find that genomic flanking sequence variations and SNPs may exert long-range effects on DNA dynamics and predetermine YY1 binding. The ubiquitous TF YY1 has a fundamental role in essential biological processes by activating, initiating or repressing transcription depending upon the sequence context it binds. We anticipate that consensus binding sequences together with the related DNA dynamics profile may significantly improve the accuracy of predicting genomic TF binding sites and TF binding-related functional SNPs.
In search of the Boston Strangler: genetic evidence from the exhumation of Mary Sullivan.
Foran, David R; Starrs, James E
2004-01-01
The Boston Strangler was one of the United States' most notorious serial killers, raping and strangling with decorative ligatures thirteen women in Boston during the early 1960s. Albert DeSalvo, never a suspect in the slayings, confessed in prison (where he was later murdered) to being the Boston Strangler, and the investigation largely ended. Mary Sullivan was the last victim of the Boston Strangler, found sexually assaulted and strangled in her Boston apartment in 1964. Recently, a team of forensic scientists undertook the exhumation and subsequent scientific analysis of Mary Sullivan's remains, in hope of finding consistencies or inconsistencies between DeSalvo's confessed description of the murder and any evidence left behind. Included in these analyses was extensive DNA testing of all UV fluorescent material associated with the body. The large majority of results were negative; however, fluorescent material located on the underwear and entwined in her pubic hair generated two human mitochondrial DNA sequences. Neither of these matched the victim or members of the forensic team who worked on the evidence. Most importantly, neither DNA sequence could have originated from Albert DeSalvo.
DNA hypomethylation of individual sequences in aborted cloned bovine fetuses.
Chen, Tao; Jiang, Yan; Zhang, Yan-Ling; Liu, Jing-He; Hou, Yi; Schatten, Heide; Chen, Da-Yuan; Sun, Qing-Yuan
2005-09-01
Cloned bovines have a much higher abortion rate than those derived in vivo. Available evidence indicates that inappropriate epigenetic reprogramming of donor nuclei is the primary cause of cloning failure. To gain a better understanding of the DNA methylation changes associated with the high abortion rate of cloned bovines, we examined the DNA methylation status of a repeated sequence (satellite I) and the promoter regions of two single-copy genes (interleukin 3/cytokeratin) in aborted cloned fetuses, aborted fetuses derived from artificial insemination (AI), cloned adults and AI adults by bisulfite sequencing and restriction enzyme analysis. Two of four aborted cloned fetuses showed very low methylation levels in the two single-copy gene promoter regions. One of the two fetuses also showed undermethylated status in the satellite I sequence. The other two aborted cloned fetuses had methylation levels similar to those of aborted AI fetuses. However, no difference in methylation was observed between cloned adults and AI adults. Our results demonstrate for the first time the undermethylated status of individual sequences in aborted cloned fetuses. These findings suggest that aberrant DNA methylation may contribute to the developmental failure of cloned bovine fetuses.
Postberg, Jan; Jönsson, Franziska; Weil, Patrick Philipp; Bulic, Aneta; Juranek, Stefan Andreas; Lipps, Hans-Joachim
2018-06-12
During sexual reproduction in the unicellular ciliate Stylonychia somatic macronuclei differentiate from germline micronuclei. Thereby, programmed sequence reduction takes place, leading to the elimination of > 95% of germline sequences, which priorly adopt heterochromatin structure via H3K27me3. Simultaneously, 27nt-ncRNAs become synthesized from parental transcripts and are bound by the Argonaute protein PIWI1. These 27nt-ncRNAs cover sequences destined to the developing macronucleus and are thought to protect them from degradation. We provide evidence and propose that RNA/DNA base-pairing guides PIWI1/27nt-RNA complexes to complementary macronucleus-destined DNA target sequences, hence transiently causing locally stalled replication during polytene chromosome formation. This spatiotemporal delay enables the selective deposition of temporarily available histone H3.4K27me3 nucleosomes at all other sequences being continuously replicated, thus dictating their prospective heterochromatin structure before becoming developmentally eliminated. Concomitantly, 27nt-RNA-covered sites remain protected. We introduce the concept of 'RNA-induced DNA replication interference' and explain how the parental functional genome partition could become transmitted to the progeny.
Ancient wolf lineages in India.
Sharma, Dinesh K; Maldonado, Jesus E; Jhala, Yadrendradev V; Fleischer, Robert C
2004-01-01
All previously obtained wolf (Canis lupus) and dog (Canis familiaris) mitochondrial (mt) DNA sequences fall within an intertwined and shallow clade (the 'wolf-dog' clade). We sequenced mtDNA of recent and historical samples from 45 wolves from throughout lowland peninsular India and 23 wolves from the Himalayas and Tibetan Plateau and compared these sequences with all available wolf and dog sequences. All 45 lowland Indian wolves have one of four closely related haplotypes that form a well-supported, divergent sister lineage to the wolf-dog clade. This unique lineage may have been independent for more than 400,000 years. Although seven Himalayan wolves from western and central Kashmir fall within the widespread wolf-dog clade, one from Ladakh in eastern Kashmir, nine from Himachal Pradesh, four from Nepal and two from Tibet form a very different basal clade. This lineage contains five related haplotypes that probably diverged from other canids more than 800,000 years ago, but we find no evidence of current barriers to admixture. Thus, the Indian subcontinent has three divergent, ancient and apparently parapatric mtDNA lineages within the morphologically delineated wolf. No haplotypes of either novel lineage are found within a sample of 37 Indian (or other) dogs. Thus, we find no evidence that these two taxa played a part in the domestication of canids. PMID:15101402
Ancient wolf lineages in India.
Sharma, Dinesh K; Maldonado, Jesus E; Jhala, Yadrendradev V; Fleischer, Robert C
2004-02-07
All previously obtained wolf (Canis lupus) and dog (Canis familiaris) mitochondrial (mt) DNA sequences fall within an intertwined and shallow clade (the 'wolf-dog' clade). We sequenced mtDNA of recent and historical samples from 45 wolves from throughout lowland peninsular India and 23 wolves from the Himalayas and Tibetan Plateau and compared these sequences with all available wolf and dog sequences. All 45 lowland Indian wolves have one of four closely related haplotypes that form a well-supported, divergent sister lineage to the wolf-dog clade. This unique lineage may have been independent for more than 400,000 years. Although seven Himalayan wolves from western and central Kashmir fall within the widespread wolf-dog clade, one from Ladakh in eastern Kashmir, nine from Himachal Pradesh, four from Nepal and two from Tibet form a very different basal clade. This lineage contains five related haplotypes that probably diverged from other canids more than 800,000 years ago, but we find no evidence of current barriers to admixture. Thus, the Indian subcontinent has three divergent, ancient and apparently parapatric mtDNA lineages within the morphologically delineated wolf. No haplotypes of either novel lineage are found within a sample of 37 Indian (or other) dogs. Thus, we find no evidence that these two taxa played a part in the domestication of canids.
Evidence of birth-and-death evolution of 5S rRNA gene in Channa species (Teleostei, Perciformes).
Barman, Anindya Sundar; Singh, Mamta; Singh, Rajeev Kumar; Lal, Kuldeep Kumar
2016-12-01
In higher eukaryotes, the minor rDNA family codes for 5S rRNA; it is arranged in tandem arrays and comprises a highly conserved 120 bp coding sequence with a variable non-transcribed spacer (NTS). The 5S rDNA repeats were initially considered to have evolved by concerted evolution. However, some recent reports, including studies of teleost fishes, suggest that evolution of the 5S rDNA repeats does not fit the concerted evolution model and may instead be explained by a birth-and-death model. To study the mode of evolution of 5S rDNA repeats in Perciformes fish species, the nucleotide sequence and molecular organization of 5S rDNA in five species of the genus Channa were analyzed in the present study. Molecular analyses revealed several variants of 5S rDNA repeats (four types of NTS), and networks created by a neighbor-net algorithm for each type of sequence (I, II, III and IV) did not show clear clustering in a species-specific manner. A stable secondary structure was predicted, and upstream and downstream conserved regulatory elements were characterized. Sequence analyses also showed the presence of two putative pseudogenes in Channa marulius. The present study supports the view that 5S rDNA repeats in the genus Channa evolved under a birth-and-death process.
Iwasaki, H; Shiba, T; Makino, K; Nakata, A; Shinagawa, H
1989-01-01
The ruvA and ruvB genes of Escherichia coli constitute an operon which belongs to the SOS regulon. Genetic evidence suggests that the products of the ruv operon are involved in DNA repair and recombination. To begin biochemical characterization of these proteins, we developed a plasmid system that overproduced RuvB protein to 20% of total cell protein. Starting from the overproducing system, we purified RuvB protein. The purified RuvB protein behaved like a monomer in gel filtration chromatography and had an apparent relative molecular mass of 38 kilodaltons in sodium dodecyl sulfate-polyacrylamide gel electrophoresis, which agrees with the value predicted from the DNA sequence. The amino acid sequence of the amino-terminal region of the purified protein was analyzed, and the sequence agreed with the one deduced from the DNA sequence. Since the deduced sequence of RuvB protein contained the consensus sequence for ATP-binding proteins, we examined the ATP-binding and ATPase activities of the purified RuvB protein. RuvB protein had a stronger affinity to ADP than to ATP and weak ATPase activity. The results suggest that the weak ATPase activity of RuvB protein is at least partly due to end product inhibition by ADP. PMID:2529252
Searching for evidence of selection in avian DNA barcodes.
Kerr, Kevin C R
2011-11-01
The barcode of life project has assembled a tremendous number of mitochondrial cytochrome c oxidase I (COI) sequences. Although these sequences were gathered to develop a DNA-based system for species identification, it has been suggested that further biological inferences may also be derived from this wealth of data. Recurrent selective sweeps have been invoked as an evolutionary mechanism to explain limited intraspecific COI diversity, particularly in birds, but this hypothesis has not been formally tested. In this study, I collated COI sequences from previous barcoding studies on birds and tested them for evidence of selection. Using this expanded data set, I re-examined the relationships between intraspecific diversity and interspecific divergence and sampling effort, respectively. I employed the McDonald-Kreitman test to test for neutrality in sequence evolution between closely related pairs of species. Because amino acid sequences were generally constrained between closely related pairs, I also included broader intra-order comparisons to quantify patterns of protein variation in avian COI sequences. Lastly, using 22 published whole mitochondrial genomes, I compared the evolutionary rate of COI against the other 12 protein-coding mitochondrial genes to assess intragenomic variability. I found no conclusive evidence of selective sweeps. Most evidence pointed to an overall trend of strong purifying selection and functional constraint. The COI protein did vary across the class Aves, but to a very limited extent. COI was the least variable gene in the mitochondrial genome, suggesting that other genes might be more informative for probing factors constraining mitochondrial variation within species. © 2011 Blackwell Publishing Ltd.
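As an illustration of the McDonald-Kreitman test mentioned in the abstract above, the following is a minimal sketch in Python, not the author's analysis pipeline. It assumes the standard 2x2 layout (fixed versus polymorphic, nonsynonymous versus synonymous) and evaluates it with a two-sided Fisher exact test; the function name `mk_test` and the counts in the usage line are hypothetical and are not taken from the study.

```python
from math import comb


def mk_test(dn, ds, pn, ps):
    """McDonald-Kreitman 2x2 table: return (neutrality_index, fisher_p).

    dn, ds: fixed nonsynonymous / synonymous differences between species
    pn, ps: polymorphic nonsynonymous / synonymous sites within species
    """
    # Neutrality index (Pn/Ps) / (Dn/Ds); undefined when a denominator is zero.
    ni = (pn * ds) / (ps * dn) if ps and dn else float("nan")

    row_fixed, row_poly = dn + ds, pn + ps
    col_nonsyn = dn + pn
    total = row_fixed + row_poly

    def prob(a):
        # Hypergeometric probability of `a` fixed nonsynonymous changes
        # given the observed row and column totals.
        return comb(row_fixed, a) * comb(row_poly, col_nonsyn - a) / comb(total, col_nonsyn)

    p_obs = prob(dn)
    lo = max(0, col_nonsyn - row_poly)
    hi = min(row_fixed, col_nonsyn)
    # Two-sided p-value: sum over all tables no more probable than the observed one.
    p = sum(prob(a) for a in range(lo, hi + 1) if prob(a) <= p_obs * (1 + 1e-9))
    return ni, min(p, 1.0)


# Illustrative counts only (hypothetical, not from the study).
ni, p = mk_test(dn=7, ds=17, pn=2, ps=42)
print(f"NI = {ni:.3f}, Fisher p = {p:.4f}")
```

Under this convention, a neutrality index above 1 indicates an excess of nonsynonymous polymorphism, which is the pattern usually read as purifying selection, and an index below 1 indicates an excess of fixed nonsynonymous differences.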
DNA interrogation by the CRISPR RNA-guided endonuclease Cas9.
Sternberg, Samuel H; Redding, Sy; Jinek, Martin; Greene, Eric C; Doudna, Jennifer A
2014-03-06
The clustered regularly interspaced short palindromic repeats (CRISPR)-associated enzyme Cas9 is an RNA-guided endonuclease that uses RNA-DNA base-pairing to target foreign DNA in bacteria. Cas9-guide RNA complexes are also effective genome engineering agents in animals and plants. Here we use single-molecule and bulk biochemical experiments to determine how Cas9-RNA interrogates DNA to find specific cleavage sites. We show that both binding and cleavage of DNA by Cas9-RNA require recognition of a short trinucleotide protospacer adjacent motif (PAM). Non-target DNA binding affinity scales with PAM density, and sequences fully complementary to the guide RNA but lacking a nearby PAM are ignored by Cas9-RNA. Competition assays provide evidence that DNA strand separation and RNA-DNA heteroduplex formation initiate at the PAM and proceed directionally towards the distal end of the target sequence. Furthermore, PAM interactions trigger Cas9 catalytic activity. These results reveal how Cas9 uses PAM recognition to quickly identify potential target sites while scanning large DNA molecules, and to regulate scission of double-stranded DNA.
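The PAM-first search order described in the abstract above can be sketched in a few lines of Python. This is a toy string-matching illustration, not a model of the biochemistry. It assumes an NGG motif immediately 3' of the target on the same strand (the abstract only says "trinucleotide"), and the function name `find_pam_sites` and the sequences are hypothetical.

```python
def find_pam_sites(dna, guide, pam_len=3):
    """Return start indices where `guide` matches `dna` and is followed by NGG.

    Assumption: the PAM is NGG and lies immediately 3' of the target on the
    strand given in `dna`; only that strand is scanned.
    """
    dna, guide = dna.upper(), guide.upper()
    hits = []
    for i in range(len(dna) - len(guide) - pam_len + 1):
        pam = dna[i + len(guide): i + len(guide) + pam_len]
        if pam[1:] != "GG":
            continue  # no PAM here: skip without comparing the guide at all
        if dna[i:i + len(guide)] == guide:
            hits.append(i)
    return hits


# Hypothetical 20-nt guide; the second copy has no adjacent PAM and is ignored.
guide = "ACGTACGTACGTACGTACGT"
dna = "TT" + guide + "TGG" + "AAAA" + guide + "TTT"
print(find_pam_sites(dna, guide))
```

Checking for the PAM before comparing the guide mirrors the finding that fully complementary sequences lacking a nearby PAM are ignored.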
DNA interrogation by the CRISPR RNA-guided endonuclease Cas9
NASA Astrophysics Data System (ADS)
Sternberg, Samuel H.; Redding, Sy; Jinek, Martin; Greene, Eric C.; Doudna, Jennifer A.
2014-03-01
The clustered regularly interspaced short palindromic repeats (CRISPR)-associated enzyme Cas9 is an RNA-guided endonuclease that uses RNA-DNA base-pairing to target foreign DNA in bacteria. Cas9-guide RNA complexes are also effective genome engineering agents in animals and plants. Here we use single-molecule and bulk biochemical experiments to determine how Cas9-RNA interrogates DNA to find specific cleavage sites. We show that both binding and cleavage of DNA by Cas9-RNA require recognition of a short trinucleotide protospacer adjacent motif (PAM). Non-target DNA binding affinity scales with PAM density, and sequences fully complementary to the guide RNA but lacking a nearby PAM are ignored by Cas9-RNA. Competition assays provide evidence that DNA strand separation and RNA-DNA heteroduplex formation initiate at the PAM and proceed directionally towards the distal end of the target sequence. Furthermore, PAM interactions trigger Cas9 catalytic activity. These results reveal how Cas9 uses PAM recognition to quickly identify potential target sites while scanning large DNA molecules, and to regulate scission of double-stranded DNA.
The protective function of noncoding DNA in genome defense of eukaryotic male germ cells.
Qiu, Guo-Hua; Huang, Cuiqin; Zheng, Xintian; Yang, Xiaoyan
2018-04-01
Peripheral and abundant noncoding DNA has been hypothesized to protect the genome and the central protein-coding sequences against DNA damage in somatic genome. In the cytosol, invading exogenous nucleic acids may first be deactivated by small RNAs encoded by noncoding DNA via mechanisms similar to the prokaryotic CRISPR-Cas system. In the nucleus, the radicals generated by radiation in the cytosol, radiation energy and invading exogenous nucleic acids are absorbed, blocked and/or reduced by peripheral heterochromatin, and damaged DNA in heterochromatin is removed and excluded from the nucleus to the cytoplasm through nuclear pore complexes. To further strengthen the hypothesis, this review summarizes the experimental evidence supporting the protective function of noncoding DNA in the genome of male germ cells. Based on these data, this review provides evidence supporting the protective role of noncoding DNA in the genome defense of sperm genome through similar mechanisms to those of the somatic genome.
Iyer, Lakshminarayan M; Abhiman, Saraswathi; Aravind, L
2008-10-04
Using sequence profile methods and structural comparisons we characterize a previously unknown family of nucleic acid polymerases in a group of mobile elements from genomes of diverse bacteria, an algal plastid and certain DNA viruses, including the recently reported Sputnik virus. Using contextual information from domain architectures and gene-neighborhoods we present evidence that they are likely to possess both primase and DNA polymerase activity, comparable to the previously reported prim-pol proteins. These newly identified polymerases help in defining the minimal functional core of superfamily A DNA polymerases and related RNA polymerases. Thus, they provide a framework to understand the emergence of both DNA and RNA polymerization activity in this class of enzymes. They also provide evidence that enigmatic DNA viruses, such as Sputnik, might have emerged from mobile elements coding these polymerases.
Iyer, Lakshminarayan M; Abhiman, Saraswathi; Aravind, L
2008-01-01
Using sequence profile methods and structural comparisons we characterize a previously unknown family of nucleic acid polymerases in a group of mobile elements from genomes of diverse bacteria, an algal plastid and certain DNA viruses, including the recently reported Sputnik virus. Using contextual information from domain architectures and gene-neighborhoods we present evidence that they are likely to possess both primase and DNA polymerase activity, comparable to the previously reported prim-pol proteins. These newly identified polymerases help in defining the minimal functional core of superfamily A DNA polymerases and related RNA polymerases. Thus, they provide a framework to understand the emergence of both DNA and RNA polymerization activity in this class of enzymes. They also provide evidence that enigmatic DNA viruses, such as Sputnik, might have emerged from mobile elements coding these polymerases. This article was reviewed by Eugene Koonin and Mark Ragan. PMID:18834537
Benchmark Dataset for Whole Genome Sequence Compression.
C L, Biji; S Nair, Achuthsankar
2017-01-01
Research in DNA data compression lacks a standard dataset for testing compression tools specific to DNA. This paper argues that the current state of achievement in DNA compression cannot be benchmarked in the absence of such a scientifically compiled whole genome sequence dataset and proposes a benchmark dataset built using a multistage sampling procedure. Considering the genome sequences of organisms available in the National Center for Biotechnology Information (NCBI) as the universe, the proposed dataset selects 1,105 prokaryotes, 200 plasmids, 164 viruses, and 65 eukaryotes. This paper reports the results of using three established tools on the newly compiled dataset and shows that their strengths and weaknesses are evident only with a comparison based on the scientifically compiled benchmark dataset. The sample dataset and the respective links are available at https://sourceforge.net/projects/benchmarkdnacompressiondataset/.
Irwin, Jodi A; Saunier, Jessica L; Strouss, Katharine M; Sturk, Kimberly A; Diegoli, Toni M; Just, Rebecca S; Coble, Michael D; Parson, Walther; Parsons, Thomas J
2007-06-01
In an effort to increase the quantity, breadth and availability of mtDNA databases suitable for forensic comparisons, we have developed a high-throughput process to generate approximately 5000 control region sequences per year from regional US populations, global populations from which the current US population is derived and global populations currently under-represented in available forensic databases. The system utilizes robotic instrumentation for all laboratory steps from pre-extraction through sequence detection, and a rigorous eight-step, multi-laboratory data review process with entirely electronic data transfer. Over the past 3 years, nearly 10,000 control region sequences have been generated using this approach. These data are being made publicly available and should further address the need for consistent, high-quality mtDNA databases for forensic testing.
Giblin-Davis, R M; Williams, D S; Bekal, S; Dickson, D W; Brito, J A; Becker, J O; Preston, J F
2003-01-01
Taxonomically relevant characteristics of a fastidiously Gram-positive, obligately endoparasitic prokaryote (strain S-1) that uses the phytoparasitic sting nematode Belonolaimus longicaudatus as its host are reviewed. 16S rDNA sequence similarity (≥93%) confirms its congeneric ranking with other Pasteuria species and strains from nematodes and cladocerans and corroborates morphological, morphometric and host range evidence suggesting a novel taxon. The 16S rDNA sequence of strain S-1 has greatest similarity (96%) to the 16S rDNA sequences of both Pasteuria penetrans from root-knot nematodes (Meloidogyne species) and the recently reported strain of Pasteuria isolated from the soybean cyst nematode Heterodera glycines. Because the obligately endoparasitic nature of prokaryotes in the genus Pasteuria prevents isolation of definitive type strains, strain S-1 is proposed as 'Candidatus Pasteuria usgae' sp. nov.
Dahl, Christopher R.; Bickham, John W.; Wickliffe, Jeffery K.; Custer, Thomas W.
2001-01-01
DNA sequence analysis of a 215 base-pair region of the mitochondrial cytochrome b gene was used to examine genetic variation and search for evidence of an increased mutation rate in black-crowned night-herons. We examined five populations exposed to environmental contamination (primarily PAHs and PCBs) and one reference population from the eastern U.S. There was no evidence of a high mutation rate even within populations previously shown to exhibit increased variation in DNA content among somatic cells as a result of petroleum exposure. Three haplotypes were observed among 99 individuals. The low level of variability could be evidence for a genetic bottleneck, or that cytochrome b is too conservative for use in population genetic studies of this species. With the exception of one population from Louisiana, pair-wise ΦST estimates were very low, indicative of little population structure and potentially high rates of effective migration among populations.
Wildman, Derek E.; Uddin, Monica; Liu, Guozhen; Grossman, Lawrence I.; Goodman, Morris
2003-01-01
What do functionally important DNA sites, those scrutinized and shaped by natural selection, tell us about the place of humans in evolution? Here we compare ≈90 kb of coding DNA nucleotide sequence from 97 human genes to their sequenced chimpanzee counterparts and to available sequenced gorilla, orangutan, and Old World monkey counterparts, and, on a more limited basis, to mouse. The nonsynonymous changes (functionally important), like synonymous changes (functionally much less important), show chimpanzees and humans to be most closely related, sharing 99.4% identity at nonsynonymous sites and 98.4% at synonymous sites. On a time scale, the coding DNA divergencies separate the human–chimpanzee clade from the gorilla clade at between 6 and 7 million years ago and place the most recent common ancestor of humans and chimpanzees at between 5 and 6 million years ago. The evolutionary rate of coding DNA in the catarrhine clade (Old World monkey and ape, including human) is much slower than in the lineage to mouse. Among the genes examined, 30 show evidence of positive selection during descent of catarrhines. Nonsynonymous substitutions by themselves, in this subset of positively selected genes, group humans and chimpanzees closest to each other and have chimpanzees diverge about as much from the common human–chimpanzee ancestor as humans do. This functional DNA evidence supports two previously offered taxonomic proposals: family Hominidae should include all extant apes; and genus Homo should include three extant species and two subgenera, Homo (Homo) sapiens (humankind), Homo (Pan) troglodytes (common chimpanzee), and Homo (Pan) paniscus (bonobo chimpanzee). PMID:12766228
Wildman, Derek E; Uddin, Monica; Liu, Guozhen; Grossman, Lawrence I; Goodman, Morris
2003-06-10
What do functionally important DNA sites, those scrutinized and shaped by natural selection, tell us about the place of humans in evolution? Here we compare approximately 90 kb of coding DNA nucleotide sequence from 97 human genes to their sequenced chimpanzee counterparts and to available sequenced gorilla, orangutan, and Old World monkey counterparts, and, on a more limited basis, to mouse. The nonsynonymous changes (functionally important), like synonymous changes (functionally much less important), show chimpanzees and humans to be most closely related, sharing 99.4% identity at nonsynonymous sites and 98.4% at synonymous sites. On a time scale, the coding DNA divergencies separate the human-chimpanzee clade from the gorilla clade at between 6 and 7 million years ago and place the most recent common ancestor of humans and chimpanzees at between 5 and 6 million years ago. The evolutionary rate of coding DNA in the catarrhine clade (Old World monkey and ape, including human) is much slower than in the lineage to mouse. Among the genes examined, 30 show evidence of positive selection during descent of catarrhines. Nonsynonymous substitutions by themselves, in this subset of positively selected genes, group humans and chimpanzees closest to each other and have chimpanzees diverge about as much from the common human-chimpanzee ancestor as humans do. This functional DNA evidence supports two previously offered taxonomic proposals: family Hominidae should include all extant apes; and genus Homo should include three extant species and two subgenera, Homo (Homo) sapiens (humankind), Homo (Pan) troglodytes (common chimpanzee), and Homo (Pan) paniscus (bonobo chimpanzee).
Campo, Daniel; García-Vázquez, Eva
2012-01-01
The 5S rDNA is organized in the genome as tandemly repeated copies of a structural unit composed of a coding sequence plus a nontranscribed spacer (NTS). The coding region is highly conserved in evolution, whereas the NTS varies in both length and sequence. It has been proposed that 5S rRNA genes are members of a gene family that has arisen through concerted evolution. In this study, we describe the molecular organization and evolution of the 5S rDNA in the genera Lepidorhombus and Scophthalmus (Scophthalmidae) and compare it with the already known 5S rDNA of the very different genera Merluccius (Merluccidae) and Salmo (Salmoninae), to identify common structural elements or patterns for understanding 5S rDNA evolution in fish. High intra- and interspecific diversity within the 5S rDNA family in all the genera can be explained by a combination of duplications, deletions, and transposition events. Sequence blocks with high similarity in all the 5S rDNA members across species were identified for the four studied genera, with evidence of intense gene conversion within noncoding regions. We propose a model to explain the evolution of the 5S rDNA, in which the evolutionary units are blocks of nucleotides rather than the entire sequences or single nucleotides. This model implies a "two-speed" evolution: slow within blocks (homogenized by recombination) and fast within the gene family (diversified by duplications and deletions).
Chiba, Satoshi
1999-04-01
An endemic land snail genus Mandarina of the oceanic Bonin (Ogasawara) Islands shows exceptionally rapid evolution not only of morphological and ecological traits, but of DNA sequence. A phylogenetic relationship based on mitochondrial DNA (mtDNA) sequences suggests that morphological differences equivalent to the differences between families were produced between Mandarina and its ancestor during the Pleistocene. The inferred phylogeny shows that species with similar morphologies and life habitats appeared repeatedly and independently in different lineages and islands at different times. Sequential adaptive radiations occurred in different islands of the Bonin Islands and species occupying arboreal, semiarboreal, and terrestrial habitats arose independently in each island. Because of a close relationship between shell morphology and life habitat, independent evolution of the same life habitat in different islands created species possessing the same shell morphology in different islands and lineages. This rapid evolution produced some incongruences between phylogenetic relationship and species taxonomy. Levels of sequence divergence of mtDNA among the species of Mandarina are extremely high. The maximum levels of sequence divergence at the 16S and 12S ribosomal RNA sequences within Mandarina are 18.7% and 17.7%, respectively, and this suggests that evolution of mtDNA of Mandarina is extremely rapid, more than 20 times faster than the standard rate in other animals. The present examination reveals that evolution of morphological and ecological traits occurs at extremely high rates in the time of adaptive radiation, especially in fragmented environments. © 1999 The Society for the Study of Evolution.
Chowdari, K V; Northup, A; Pless, L; Wood, J; Joo, Y H; Mirnics, K; Lewis, D A; Levitt, P R; Bacanu, S-A; Nimgaonkar, V L
2007-04-01
Many candidate gene association studies have evaluated incomplete, unrepresentative sets of single nucleotide polymorphisms (SNPs), producing non-significant results that are difficult to interpret. Using a rapid, efficient strategy designed to investigate all common SNPs, we tested associations between schizophrenia and two positional candidate genes: ACSL6 (Acyl-Coenzyme A synthetase long-chain family member 6) and SIRT5 (silent mating type information regulation 2 homologue 5). We initially evaluated the utility of DNA sequencing traces to estimate SNP allele frequencies in pooled DNA samples. The mean variances for the DNA sequencing estimates were acceptable and were comparable to other published methods (mean variance: 0.0008, range 0-0.0119). Using pooled DNA samples from cases with schizophrenia/schizoaffective disorder (Diagnostic and Statistical Manual of Mental Disorders edition IV criteria) and controls (n=200, each group), we next sequenced all exons, introns and flanking upstream/downstream sequences for ACSL6 and SIRT5. Among 69 identified SNPs, case-control allele frequency comparisons revealed nine suggestive associations (P<0.2). Each of these SNPs was next genotyped in the individual samples composing the pools. A suggestive association with rs11743803 at ACSL6 remained (allele-wise P=0.02), with diminished evidence in an extended sample (448 cases, 554 controls, P=0.062). In conclusion, we propose a multi-stage method for comprehensive, rapid, efficient and economical genetic association analysis that enables simultaneous SNP detection and allele frequency estimation in large samples. This strategy may be particularly useful for research groups lacking access to high throughput genotyping facilities. Our analyses did not yield convincing evidence for associations of schizophrenia with ACSL6 or SIRT5.
Epigenomics of Development in Populus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Strauss, Steve; Freitag, Michael; Mockler, Todd
2013-01-10
We conducted research to determine the role of epigenetic modifications during tree development using poplar (Populus trichocarpa), a model woody feedstock species. Using methylated DNA immunoprecipitation (MeDIP) or chromatin immunoprecipitation (ChIP), followed by high-throughput sequencing, we are analyzed DNA and histone methylation patterns in the P. trichocarpa genome in relation to four biological processes: bud dormancy and release, mature organ maintenance, in vitro organogenesis, and methylation suppression. Our project is now completed. We have 1) produced 22 transgenic events for a gene involved in DNA methylation suppression and studied its phenotypic consequences; 2) completed sequencing of methylated DNA from elevenmore » target tissues in wildtype P. trichocarpa; 3) updated our customized poplar genome browser using the open-source software tools (2.13) and (V2.2) of the P. trichocarpa genome; 4) produced summary data for genome methylation in P. trichocarpa, including distribution of methylation across chromosomes and in and around genes; 5) employed bioinformatic and statistical methods to analyze differences in methylation patterns among tissue types; and 6) used bisulfite sequencing of selected target genes to confirm bioinformatics and sequencing results, and gain a higher-resolution view of methylation at selected genes 7) compared methylation patterns to expression using available microarray data. 
Our main findings of biological significance are the identification of extensive regions of the genome that display developmental variation in DNA methylation; highly distinctive gene-associated methylation profiles in reproductive tissues, particularly male catkins; a strong whole genome/all tissue inverse association of methylation at gene bodies and promoters with gene expression; a lack of evidence that tissue specificity of gene expression is associated with gene methylation; and evidence that genome methylation is a significant impediment to tissue dedifferentiation and redifferentiation in vitro.
Park, Ji Hye
2018-01-01
Estimation of postmortem interval (PMI) is paramount in modern forensic investigation. After the disappearance of the early postmortem phenomena conventionally used to estimate PMI, entomologic evidence provides important indicators for PMI estimation. The age of the oldest fly larvae or pupae can be estimated to pinpoint the time of oviposition, which is considered the minimum PMI (PMImin). The development rate of insects is usually temperature dependent and species specific. Therefore, species identification is mandatory for PMImin estimation using entomological evidence. The classical morphological identification method cannot be applied when specimens are damaged or have not yet matured. To overcome this limitation, some investigators employ molecular identification using mitochondrial cytochrome c oxidase subunit I (COI) nucleotide sequences. The molecular identification method commonly uses Sanger's nucleotide sequencing and molecular phylogeny, which are complex and time consuming and constitute another obstacle for forensic investigators. In this study, instead of using conventional Sanger's nucleotide sequencing, single-nucleotide polymorphisms (SNPs) in the COI gene region, which are unique between fly species, were selected and targeted for single-base extension (SBE) technology. These SNPs were genotyped using a SNaPshot® kit. Eleven Calliphoridae and seven Sarcophagidae species were covered. To validate this genotyping, fly DNA samples (103 adults, 84 larvae, and 4 pupae) previously confirmed by DNA barcoding were used. This method worked quickly with minimal DNA, providing a potential alternative to conventional DNA barcoding. Consisting of only a few simple electropherogram peaks, the results were more straightforward compared with those of the conventional DNA barcoding produced by Sanger's nucleotide sequencing. PMID:29682531
Sequence variability of Campylobacter temperate bacteriophages
Clark, Clifford G; Ng, Lai-King
2008-01-01
Background Prophages integrated within the chromosomes of Campylobacter jejuni isolates have been demonstrated very recently. Prior work with Campylobacter temperate bacteriophages, as well as evidence from prophages in other enteric bacteria, suggests these prophages might have a role in the biology and virulence of the organism. However, very little is known about the genetic variability of Campylobacter prophages which, if present, could lead to differential phenotypes in isolates carrying the phages versus those that do not. As a first step in the characterization of C. jejuni prophages, we investigated the distribution of prophage DNA within a C. jejuni population and assessed the DNA and protein sequence variability within a subset of the putative prophages found. Results Southern blotting of C. jejuni DNA using probes from genes within the three putative prophages of the C. jejuni sequenced strain RM 1221 demonstrated the presence of at least one prophage gene in a large proportion (27/35) of isolates tested. Of these, 15 were positive for 5 or more of the 7 Campylobacter Mu-like phage 1 (CMLP 1, also designated Campylobacter jejuni integrated element 1, or CJIE 1) genes tested. Twelve of these putative prophages were chosen for further analysis. DNA sequencing of a 9,000 to 11,000 nucleotide region of each prophage demonstrated a close homology with CMLP 1 in both gene order and nucleotide sequence. Structural and sequence variability, including short insertions, deletions, and allele replacements, were found within the prophage genomes, some of which would alter the protein products of the ORFs involved. No insertions of novel genes were detected within the sequenced regions. The 12 prophages and RM 1221 had a % G+C very similar to C. jejuni sequenced strains, as well as promoter regions characteristic of C. jejuni.
None of the putative prophages were successfully induced and propagated, so it is not known if they were functional or if they represented remnant prophage DNA in the bacterial chromosomes. Conclusion These putative prophages form a family of phages with conserved sequences, and appear to be adapted to Campylobacter. There was evidence for recombination among groups of prophages, suggesting that the prophages had a mosaic structure. In many of these properties, the Mu-like CMLP 1 homologs characterized in this study resemble temperate bacteriophages of enteric bacteria that are responsible for contributions to virulence and host adaptation. PMID:18366706
Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi
2016-02-25
Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found that hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation occurred in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5' and 3' flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed clearly from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome.
Allison, J; Hall, L; MacIntyre, I; Craig, R K
1981-01-01
(1) Total poly(A)-containing RNA isolated from human thyroid medullary carcinoma tissue was shown to direct the synthesis in the wheat germ cell-free system of a major (Mr 21000) and several minor forms of human calcitonin precursor polyproteins. Evidence for processing of these precursor(s) by the wheat germ cell-free system is also presented. (2) A small complementary DNA (cDNA) plasmid library has been constructed in the PstI site of the plasmid pAT153, using total human thyroid medullary carcinoma poly(A)-containing RNA as the starting material. (3) Plasmids containing abundant cDNA sequences were selected by hybridization in situ, and two of these (phT-B3 and phT-B6) were characterized by hybridization-translation and restriction analysis. Each was shown to contain human calcitonin precursor polyprotein cDNA sequences. (4) RNA blotting techniques demonstrate that the human calcitonin precursor polyprotein is encoded within a mRNA containing 1000 bases. (5) The results demonstrate that human calcitonin is synthesized as a precursor polyprotein. PMID:6896146
Willett-Brozick, J E; Savul, S A; Richey, L E; Baysal, B E
2001-08-01
Constitutional chromosomal translocations are relatively common causes of human morbidity, yet the DNA double-strand break (DSB) repair mechanisms that generate them are incompletely understood. We cloned, sequenced and analyzed the breakpoint junctions of a familial constitutional reciprocal translocation t(9;11)(p24;q23). Within the 10-kb region flanking the breakpoints, chromosome 11 had 25% repeat elements, whereas chromosome 9 had 98% repeats, 95% of which were L1-type LINE elements. The breakpoints occurred within an L1-type repeat element at 9p24 and at the 3'-end of an Alu sequence at 11q23. At the breakpoint junction of derivative chromosome 9, we discovered an unusually large 41-bp insertion, which showed 100% identity to 12S mitochondrial DNA (mtDNA) between nucleotides 896 and 936 of the mtDNA sequence. Analysis of the human genome failed to show the preexistence of the inserted sequence at normal chromosomes 9 and 11 breakpoint junctions or elsewhere in the genome, strongly suggesting that the insertion was derived from human mtDNA and captured into the junction during the DSB repair process. To our knowledge, these findings represent the first observation of spontaneous germ line insertion of modern human mtDNA sequences and suggest that DSB repair may play a role in inter-organellar gene transfer in vivo. Our findings also provide evidence for a previously unrecognized insertional mechanism in human, by which non-mobile extra-chromosomal fragments can be inserted into the genome at DSB repair junctions.
Fox, K R; Grigg, G W
1988-01-01
DNA structural changes induced by bleomycin have been investigated using diethylpyrocarbonate and permanganate as probes under conditions in which the antibiotic binds to, but does not cut, the DNA. Diethylpyrocarbonate shows an enhanced reaction with adenines in the presence of the antibiotic in the sequences GTA > GCA > GAA, on the 3' side of the drug cutting site (GPy). Permanganate ions display an enhanced reactivity at the second pyrimidine of the sequence GPyPy. The results are consistent with a model in which bleomycin distorts the structure of the base pair on the 3' side of its binding site. PMID:2451809
Application of a mitochondrial DNA control region frequency database for UK domestic cats.
Ottolini, Barbara; Lall, Gurdeep Matharu; Sacchini, Federico; Jobling, Mark A; Wetton, Jon H
2017-03-01
DNA variation in 402bp of the mitochondrial control region flanked by repeat sequences RS2 and RS3 was evaluated by Sanger sequencing in 152 English domestic cats, in order to determine the significance of matching DNA sequences between hairs found with a victim's body and the suspect's pet cat. Whilst 95% of English cats possessed one of the twelve globally widespread mitotypes, four new variants were observed, the most common of which (2% frequency) was shared with the evidential samples. No significant difference in mitotype frequency was seen between 32 individuals from the locality of the crime and 120 additional cats from the rest of England, suggesting a lack of local population structure. However, significant differences were observed in comparison with frequencies in other countries, including the closely neighbouring Netherlands, highlighting the importance of appropriate genetic databases when determining the evidential significance of mitochondrial DNA evidence. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Beckenbach, Andrew T.
2012-01-01
The complete mitochondrial DNA sequences of eight representatives of lower Diptera, suborder Nematocera, along with nearly complete sequences from two other species, are presented. These taxa represent eight families not previously represented by complete mitochondrial DNA sequences. Most of the sequences retain the ancestral dipteran mitochondrial gene arrangement, while one sequence, that of the midge Arachnocampa flava (family Keroplatidae), has an inversion of the trnE gene. The most unusual result is the extensive rearrangement of the mitochondrial genome of a winter crane fly, Paracladura trichoptera (family Trichocera). The pattern of rearrangement indicates that the mechanism of rearrangement involved a tandem duplication of the entire mitochondrial genome, followed by random and nonrandom loss of one copy of each gene. Another winter crane fly retains the ancestral dipteran gene arrangement. A preliminary mitochondrial phylogeny of the Diptera is also presented. PMID:22155689
Dionisi, Hebe M.; Chewning, Christopher S.; Morgan, Katherine H.; Menn, Fu-Min; Easter, James P.; Sayler, Gary S.
2004-01-01
We designed a real-time PCR assay able to recognize dioxygenase large-subunit gene sequences with more than 90% similarity to the Ralstonia sp. strain U2 nagAc gene (nagAc-like gene sequences) in order to study the importance of organisms carrying these genes in the biodegradation of naphthalene. Sequencing of PCR products indicated that this real-time PCR assay was specific and able to detect a variety of nagAc-like gene sequences. One to 100 ng of contaminated-sediment total DNA in 25-μl reaction mixtures produced an amplification efficiency of 0.97 without evident PCR inhibition. The assay was applied to surficial freshwater sediment samples obtained in or in close proximity to a coal tar-contaminated Superfund site. Naphthalene concentrations in the analyzed samples varied between 0.18 and 106 mg/kg of dry weight sediment. The assay for nagAc-like sequences indicated the presence of (4.1 ± 0.7) × 10^3 to (2.9 ± 0.3) × 10^5 copies of nagAc-like dioxygenase genes per μg of DNA extracted from sediment samples. These values corresponded to (1.2 ± 0.6) × 10^5 to (5.4 ± 0.4) × 10^7 copies of this target per g of dry weight sediment when losses of DNA during extraction were taken into account. There was a positive correlation between naphthalene concentrations and nagAc-like gene copies per microgram of DNA (r = 0.89) and per gram of dry weight sediment (r = 0.77). These results provide evidence of the ecological significance of organisms carrying nagAc-like genes in the biodegradation of naphthalene. PMID:15240274
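The two quantities reported in this abstract, amplification efficiency and gene copies per gram of sediment, can be related with a short sketch. It assumes the conventional standard-curve definition of efficiency, E = 10^(-1/slope) - 1, where the slope is that of Ct against log10 template copies, and a simple recovery-corrected unit conversion. The DNA yield and recovery parameters are hypothetical inputs; the abstract does not say how the authors corrected for extraction losses.

```python
def amplification_efficiency(slope):
    """Efficiency from a qPCR standard-curve slope (Ct vs log10 copies).

    E = 10 ** (-1 / slope) - 1; a slope near -3.32 gives E near 1.0,
    meaning the product doubles every cycle.
    """
    if slope >= 0:
        raise ValueError("a standard-curve slope is expected to be negative")
    return 10 ** (-1.0 / slope) - 1.0


def copies_per_gram(copies_per_ug_dna, ug_dna_per_g_sediment, recovery=1.0):
    """Convert copies per microgram of DNA to copies per gram of dry sediment.

    recovery is the assumed fraction of DNA that survives extraction (0-1];
    dividing by it compensates for losses during extraction.
    """
    if not 0 < recovery <= 1:
        raise ValueError("recovery must be in the interval (0, 1]")
    return copies_per_ug_dna * ug_dna_per_g_sediment / recovery
```

Under this definition, the reported efficiency of 0.97 corresponds to a standard-curve slope of roughly -3.4.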
Mitochondrial Mutations in Subjects with Psychiatric Disorders
Magnan, Christophe; van Oven, Mannis; Baldi, Pierre; Myers, Richard M.; Barchas, Jack D.; Schatzberg, Alan F.; Watson, Stanley J.; Akil, Huda; Bunney, William E.; Vawter, Marquis P.
2015-01-01
A considerable body of evidence supports the role of mitochondrial dysfunction in psychiatric disorders and mitochondrial DNA (mtDNA) mutations are known to alter brain energy metabolism, neurotransmission, and cause neurodegenerative disorders. Genetic studies focusing on common nuclear genome variants associated with these disorders have produced genome wide significant results but those studies have not directly studied mtDNA variants. The purpose of this study is to investigate, using next generation sequencing, the involvement of mtDNA variation in bipolar disorder, schizophrenia, major depressive disorder, and methamphetamine use. MtDNA extracted from multiple brain regions and blood were sequenced (121 mtDNA samples with an average of 8,800x coverage) and compared to an electronic database containing 26,850 mtDNA genomes. We confirmed novel and rare variants, and confirmed next generation sequencing error hotspots by traditional sequencing and genotyping methods. We observed a significant increase of non-synonymous mutations found in individuals with schizophrenia. Novel and rare non-synonymous mutations were found in psychiatric cases in mtDNA genes: ND6, ATP6, CYTB, and ND2. We also observed mtDNA heteroplasmy in brain at a locus previously associated with schizophrenia (T16519C). Large differences in heteroplasmy levels across brain regions within subjects suggest that somatic mutations accumulate differentially in brain regions. Finally, multiplasmy, a heteroplasmic measure of repeat length, was observed in brain from selective cases at a higher frequency than controls. These results offer support for increased rates of mtDNA substitutions in schizophrenia shown in our prior results. The variable levels of heteroplasmic/multiplasmic somatic mutations that occur in brain may be indicators of genetic instability in mtDNA. PMID:26011537
Liu, Ning; Tian, Ru; Loeb, Daniel D.
2003-01-01
Synthesis of the relaxed-circular (RC) DNA genome of hepadnaviruses requires two template switches during plus-strand DNA synthesis: primer translocation and circularization. Although primer translocation and circularization use different donor and acceptor sequences, and are distinct temporally, they share the common theme of switching from one end of the minus-strand template to the other end. Studies of duck hepatitis B virus have indicated that, in addition to the donor and acceptor sequences, three other cis-acting sequences, named 3E, M, and 5E, are required for the synthesis of RC DNA by contributing to primer translocation and circularization. The mechanism by which 3E, M, and 5E act was not known. We present evidence that these sequences function by base pairing with each other within the minus-strand template. 3E base-pairs with one portion of M (M3) and 5E base-pairs with an adjacent portion of M (M5). We found that disrupting base pairing between 3E and M3 and between 5E and M5 inhibited primer translocation and circularization. More importantly, restoring base pairing with mutant sequences restored the production of RC DNA. These results are consistent with the model that, within duck hepatitis B virus capsids, the ends of the minus-strand template are juxtaposed via base pairing to facilitate the two template switches during plus-strand DNA synthesis. PMID:12578983
Herrnstadt, Corinna; Elson, Joanna L; Fahy, Eoin; Preston, Gwen; Turnbull, Douglass M; Anderson, Christen; Ghosh, Soumitra S; Olefsky, Jerrold M; Beal, M Flint; Davis, Robert E; Howell, Neil
2002-05-01
The evolution of the human mitochondrial genome is characterized by the emergence of ethnically distinct lineages or haplogroups. Nine European, seven Asian (including Native American), and three African mitochondrial DNA (mtDNA) haplogroups have been identified previously on the basis of the presence or absence of a relatively small number of restriction-enzyme recognition sites or on the basis of nucleotide sequences of the D-loop region. We have used reduced-median-network approaches to analyze 560 complete European, Asian, and African mtDNA coding-region sequences from unrelated individuals to develop a more complete understanding of sequence diversity both within and between haplogroups. A total of 497 haplogroup-associated polymorphisms were identified, 323 (65%) of which were associated with one haplogroup and 174 (35%) of which were associated with two or more haplogroups. Approximately one-half of these polymorphisms are reported for the first time here. Our results confirm and substantially extend the phylogenetic relationships among mitochondrial genomes described elsewhere from the major human ethnic groups. Another important result is that there were numerous instances both of parallel mutations at the same site and of reversion (i.e., homoplasy). It is likely that homoplasy in the coding region will confound evolutionary analysis of small sequence sets. By a linkage-disequilibrium approach, additional evidence for the absence of human mtDNA recombination is presented here.
Smith, Rick W A; Monroe, Cara; Bolnick, Deborah A
2015-01-01
While cytosine methylation has been widely studied in extant populations, relatively few studies have analyzed methylation in ancient DNA. Most existing studies of epigenetic marks in ancient DNA have inferred patterns of methylation in highly degraded samples using post-mortem damage to cytosines as a proxy for cytosine methylation levels. However, this approach limits the inference of methylation compared with direct bisulfite sequencing, the current gold standard for analyzing cytosine methylation at single nucleotide resolution. In this study, we used direct bisulfite sequencing to assess cytosine methylation in ancient DNA from the skeletal remains of 30 Native Americans ranging in age from approximately 230 to 4500 years before present. Unmethylated cytosines were converted to uracils by treatment with sodium bisulfite, bisulfite products of a CpG-rich retrotransposon were pyrosequenced, and C-to-T ratios were quantified for a single CpG position. We found that cytosine methylation is readily recoverable from most samples, given adequate preservation of endogenous nuclear DNA. In addition, our results indicate that the variability of cytosine methylation estimates is inversely correlated with aDNA preservation, such that samples of low DNA concentration show higher variability in measures of percent methylation than samples of high DNA concentration. In particular, samples in this study with a DNA concentration above 0.015 ng/μL generated the most consistent measures of cytosine methylation. This study presents evidence of cytosine methylation in a large collection of ancient human remains, and indicates that it is possible to analyze epigenetic patterns in ancient populations using direct bisulfite sequencing approaches.
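The C-to-T quantification described in this abstract can be sketched as follows. After bisulfite conversion, methylated cytosines are read as C and unmethylated ones as T, so percent methylation at a CpG is commonly taken as C / (C + T) × 100. The replicate summary below is a hypothetical way to express the consistency the abstract discusses; the function names and input layout are assumptions.

```python
from statistics import mean, stdev


def percent_methylation(c_signal, t_signal):
    """Percent methylation at one CpG from C and T signal intensities."""
    total = c_signal + t_signal
    if total <= 0:
        raise ValueError("C and T signals must sum to a positive value")
    return 100.0 * c_signal / total


def replicate_summary(replicates):
    """Mean and sample standard deviation of percent methylation.

    replicates is a list of (c_signal, t_signal) pairs from repeated
    measurements of the same sample; a larger standard deviation indicates
    a less consistent estimate.
    """
    values = [percent_methylation(c, t) for c, t in replicates]
    spread = stdev(values) if len(values) > 1 else 0.0
    return mean(values), spread
```

Comparing `spread` across samples grouped by DNA concentration would be one simple way to examine the relationship between preservation and measurement consistency.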
Moustafa, Mohamed Abdallah Mohamed; Shimozuru, Michito; Mohamed, Wessam; Taylor, Kyle Rueben; Nakao, Ryo; Sashika, Mariko; Tsubota, Toshio
2017-08-01
Sarcocystis and Hepatozoon species are protozoan parasites that are frequently detected in domestic and wild animals. Rodents are considered common intermediate and paratenic hosts for several Sarcocystis and Hepatozoon species. Here, blood DNA samples from a total of six rodents, including one Myodes rutilus, one Myodes rufocanus, and four Apodemus speciosus, collected from Hokkaido, Japan, were shown by conventional PCR of the 18S ribosomal RNA (rRNA) gene to contain Sarcocystis and Hepatozoon DNA. Sequencing of the DNA detected one Sarcocystis sp. in the M. rufocanus sample and two different Hepatozoon spp. in the M. rutilus and A. speciosus samples. Phylogenetic analysis showed that the detected Sarcocystis sp. sequence grouped with GenBank Sarcocystis sequences from rodents, snakes, and raccoons from Japan and China. The 18S rRNA partial gene sequences of both detected Hepatozoon spp. clustered with GenBank Hepatozoon sequences from snakes, geckos and voles in Europe, Africa, and Asia. This study provides evidence that wild rodents have a role in the maintenance of Sarcocystis and Hepatozoon species on the island of Hokkaido.
Evidence for Interspecies Gene Transfer in the Evolution of 2,4-Dichlorophenoxyacetic Acid Degraders
McGowan, Catherine; Fulthorpe, Roberta; Wright, Alice; Tiedje, J. M.
1998-01-01
Small-subunit ribosomal DNA (SSU rDNA) from 20 phenotypically distinct strains of 2,4-dichlorophenoxyacetic acid (2,4-D)-degrading bacteria was partially sequenced, yielding 18 unique strains belonging to members of the alpha, beta, and gamma subgroups of the class Proteobacteria. To understand the origin of 2,4-D degradation in this diverse collection, the first gene in the 2,4-D pathway, tfdA, was sequenced. The sequences fell into three unique classes found in various members of the beta and gamma subgroups of Proteobacteria. None of the α-Proteobacteria yielded tfdA PCR products. A comparison of the dendrogram of the tfdA genes with that of the SSU rDNA genes demonstrated incongruency in phylogenies, and hence 2,4-D degradation must have originated from gene transfer between species. Only those strains with tfdA sequences highly similar to the tfdA sequence of strain JMP134 (tfdA class I) transferred all the 2,4-D genes and conferred the 2,4-D degradation phenotype to a Burkholderia cepacia recipient. PMID:9758850
More evidence for non-maternal inheritance of mitochondrial DNA?
Bandelt, H-J; Kong, Q-P; Parson, W; Salas, A
2005-12-01
A single case of paternal co-transmission of mitochondrial DNA (mtDNA) in humans has been reported so far. We sought to find potential instances of non-maternal inheritance of mtDNA. Published medical case studies (of single patients) were searched for irregular mtDNA patterns by comparing the given haplotype information for different clones or tissues with the worldwide mtDNA database as known to date, a method that has proved robust and reliable for the detection of flawed mtDNA sequence data. More than 20 studies were found reporting clear-cut instances with mtDNAs of different ancestries in single individuals. As examples, cases are reviewed from recent published reports which, at face value, may be taken as evidence for paternal inheritance of mtDNA or recombination. Multiple types (or recombinant types) of quite dissimilar mitochondrial DNA from different parts of the known mtDNA phylogeny are often reported in single individuals. From re-analyses and corrigenda of forensic mtDNA data, it is apparent that the phenomenon of mixed or mosaic mtDNA can be ascribed solely to contamination and sample mix-up.
Josephs, Eric A.; Kocak, D. Dewran; Fitzgibbon, Christopher J.; McMenemy, Joshua; Gersbach, Charles A.; Marszalek, Piotr E.
2015-01-01
CRISPR-associated endonuclease Cas9 cuts DNA at variable target sites designated by a Cas9-bound RNA molecule. Cas9's ability to be directed by single ‘guide RNA’ molecules to target nearly any sequence has been recently exploited for a number of emerging biological and medical applications. Therefore, understanding the nature of Cas9's off-target activity is of paramount importance for its practical use. Using atomic force microscopy (AFM), we directly resolve individual Cas9 and nuclease-inactive dCas9 proteins as they bind along engineered DNA substrates. High-resolution imaging allows us to determine their relative propensities to bind with different guide RNA variants to targeted or off-target sequences. Mapping the structural properties of Cas9 and dCas9 to their respective binding sites reveals a progressive conformational transformation at DNA sites with increasing sequence similarity to its target. With kinetic Monte Carlo (KMC) simulations, these results provide evidence of a ‘conformational gating’ mechanism driven by the interactions between the guide RNA and the 14th–17th nucleotide region of the targeted DNA, the stabilities of which we find correlate significantly with reported off-target cleavage rates. KMC simulations also reveal potential methodologies to engineer guide RNA sequences with improved specificity by considering the invasion of guide RNAs into targeted DNA duplex. PMID:26384421
Ward, Jodie; Gilmore, Simon R; Robertson, James; Peakall, Rod
2009-11-01
Plant material is frequently encountered in criminal investigations but often overlooked as potential evidence. We designed a DNA-based molecular identification system for 100 Australian grasses that consisted of a series of polymerase chain reaction assays that enabled the progressive identification of grasses to different taxonomic levels. The identification system was based on DNA sequence variation at four chloroplast and two mitochondrial loci. Seventeen informative indels and 68 single-nucleotide polymorphisms were utilized as molecular markers for subfamily to species-level identification. Identifying an unknown sample to subfamily level required a minimum of four markers, whereas species-level identification required nine. The accuracy of the system was confirmed by blind tests. We have demonstrated "proof of concept" of a molecular identification system for trace botanical samples. Our evaluation suggests that the adoption of a system that combines this approach with DNA sequencing could assist the morphological identification of grasses found as forensic evidence.
Molecular Diagnosis of Infantile Mitochondrial Disease with Targeted Next-Generation Sequencing
Calvo, Sarah E.; Compton, Alison G.; Hershman, Steven G.; Lim, Sze Chern; Lieber, Daniel S.; Tucker, Elena J.; Laskowski, Adrienne; Garone, Caterina; Liu, Shangtao; Jaffe, David B.; Christodoulou, John; Fletcher, Janice M.; Bruno, Damien L; Goldblatt, Jack; DiMauro, Salvatore; Thorburn, David R.; Mootha, Vamsi K.
2012-01-01
Advances in next-generation sequencing (NGS) promise to facilitate diagnosis of inherited disorders. While in research settings NGS has pinpointed causal alleles using segregation in large families, the key challenge for clinical diagnosis is application to single individuals. To explore its diagnostic utility, we performed targeted NGS in 42 unrelated infants with clinical and biochemical evidence of mitochondrial oxidative phosphorylation disease, who were refractory to traditional molecular diagnosis. These devastating mitochondrial disorders are characterized by phenotypic and genetic heterogeneity, with over 100 causal genes identified to date. We performed “MitoExome” sequencing of the mitochondrial DNA (mtDNA) and exons of ~1000 nuclear genes encoding mitochondrial proteins and prioritized rare mutations predicted to disrupt function. Since patients and controls harbored a comparable number of such heterozygous alleles, we could not prioritize dominant acting genes. However, patients showed a five-fold enrichment of genes with two such mutations that could underlie recessive disease. In total, 23/42 (55%) patients harbored such recessive genes or pathogenic mtDNA variants. Firm diagnoses were enabled in 10 patients (24%) who had mutations in genes previously linked to disease. 13 patients (31%) had mutations in nuclear genes never linked to disease. The pathogenicity of two such genes, NDUFB3 and AGK, was supported by cDNA complementation and evidence from multiple patients, respectively. The results underscore the immediate potential and challenges of deploying NGS in clinical settings. PMID:22277967
Pilgrim, B L; Perry, R C; Barron, J L; Marshall, H D
2012-09-26
Levels and patterns of mitochondrial DNA (mtDNA) variation were examined to investigate the population structure and possible routes of postglacial recolonization of the world's northernmost native populations of brook trout (Salvelinus fontinalis), which are found in Labrador, Canada. We analyzed the sequence diversity of a 1960-bp portion of the mitochondrial genome (NADH dehydrogenase 1 gene and part of cytochrome oxidase 1) of 126 fish from 32 lakes distributed throughout seven regions of northeastern Canada. These populations were found to have low levels of mtDNA diversity, a characteristic trait of populations at northern extremes, with significant structuring at the level of the watershed. Upon comparison of northeastern brook trout sequences to the publicly available brook trout whole mitochondrial genome (GenBank AF154850), we infer that the GenBank sequence is from a fish whose mtDNA has recombined with that of Arctic charr (S. alpinus). The haplotype distribution provides evidence of two different postglacial founding groups contributing to present-day brook trout populations in the northernmost part of their range; the evolution of the majority of the haplotypes coincides with the timing of glacier retreat from Labrador. Our results exemplify the strong influence that historical processes such as glaciations have had on shaping the current genetic structure of northern species such as the brook trout.
Utro, Filippo; Di Benedetto, Valeria; Corona, Davide F V; Giancarlo, Raffaele
2016-03-15
Thanks to research spanning nearly 30 years, two major models have emerged that account for nucleosome organization in chromatin: statistical and sequence specific. The first is based on elegant, easy to compute, closed-form mathematical formulas that make no assumptions about the physical and chemical properties of the underlying DNA sequence. Moreover, they need no training on the data for their computation. The latter is based on some sequence regularities but, as opposed to the statistical model, it lacks the same type of closed-form formulas that, in this case, should be based on the DNA sequence only. We help close this important methodological gap between the two models by providing three very simple formulas for the sequence specific one. They are all based on well-known formulas in Computer Science and Bioinformatics, and they give different quantifications of how complex a sequence is. In view of how remarkably well they perform, it is very surprising that measures of sequence complexity have not even been considered as candidates to close the mentioned gap. We provide experimental evidence that the intrinsic level of combinatorial organization and information-theoretic content of subsequences within a genome are strongly correlated to the level of DNA encoded nucleosome organization discovered by Kaplan et al. Our results establish an important connection between the intrinsic complexity of subsequences in a genome and the intrinsic, i.e. DNA encoded, nucleosome organization of eukaryotic genomes. It is a first step towards a mathematical characterization of this latter 'encoding'. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved.
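The abstract does not name its three complexity formulas, so the following is only an illustration of the general idea of scoring a DNA subsequence by its information-theoretic content. It computes the Shannon entropy of the k-mer distribution within a sequence, a standard measure chosen here as an assumption and not necessarily one of the authors' formulas.

```python
import math
from collections import Counter


def kmer_entropy(seq, k=2):
    """Shannon entropy (in bits) of the overlapping k-mer distribution of seq.

    Low values indicate repetitive, low-complexity sequence; the maximum for
    DNA is 2 * k bits, reached when all 4 ** k k-mers are equally frequent.
    """
    seq = seq.upper()
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    if not kmers:
        raise ValueError("sequence must be at least k characters long")
    counts = Counter(kmers)
    n = len(kmers)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def windowed_entropy(seq, window, k=2):
    """Entropy of each non-overlapping window, as a simple complexity profile."""
    return [kmer_entropy(seq[i:i + window], k)
            for i in range(0, len(seq) - window + 1, window)]
```

A profile from `windowed_entropy` could then be correlated against a nucleosome occupancy track; the window size and k are free parameters in this sketch.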
Mahelka, Václav; Krak, Karol; Kopecký, David; Fehrer, Judith; Šafář, Jan; Bartoš, Jan; Hobza, Roman; Blavet, Nicolas; Blattner, Frank R
2017-02-14
The movement of nuclear DNA from one vascular plant species to another in the absence of fertilization is thought to be rare. Here, nonnative rRNA gene [ribosomal DNA (rDNA)] copies were identified in a set of 16 diploid barley ( Hordeum ) species; their origin was traceable via their internal transcribed spacer (ITS) sequence to five distinct Panicoideae genera, a lineage that split from the Pooideae about 60 Mya. Phylogenetic, cytogenetic, and genomic analyses implied that the nonnative sequences were acquired between 1 and 5 Mya after a series of multiple events, with the result that some current Hordeum sp. individuals harbor up to five different panicoid rDNA units in addition to the native Hordeum rDNA copies. There was no evidence that any of the nonnative rDNA units were transcribed; some showed indications of having been silenced via pseudogenization. A single copy of a Panicum sp. rDNA unit present in H. bogdanii had been interrupted by a native transposable element and was surrounded by about 70 kbp of mostly noncoding sequence of panicoid origin. The data suggest that horizontal gene transfer between vascular plants is not a rare event, that it is not necessarily restricted to one or a few genes only, and that it can be selectively neutral.
Oliveros, R; Cutillas, C; De Rojas, M; Arias, P
2000-12-01
Adult worms of Trichuris ovis and T. globulosa were collected from Ovis aries (sheep) and Capra hircus (goats). T. suis was isolated from Sus scrofa domestica (swine) and T. leporis was isolated from Lepus europaeus (rabbits) in Spain. Genomic DNA was isolated and a ribosomal internal transcribed spacer (ITS2) was amplified and sequenced using polymerase-chain-reaction (PCR) techniques. The ITS2 of T. ovis and T. globulosa was 407 nucleotides in length and had a GC content of about 62%. Furthermore, the ITS2 of T. suis and T. leporis was 534 and 418 nucleotides in length and had a GC content of about 64.8% and 62.4%, respectively. There was evidence of slight variation in the sequence within individuals of all species analyzed, indicating intraindividual variation in the sequence of different copies of the ribosomal DNA. Furthermore, low-level intraspecific variation was detected. Sequence analyses of ITS2 products of T. ovis and T. globulosa demonstrated no sequence difference between them. Nevertheless, differences were detected between the ITS2 sequences of T. suis, T. leporis, and T. ovis, indicating that Trichuris species can reliably be differentiated by their ITS2 sequences and PCR-linked restriction-fragment-length polymorphism (RFLP).
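The length and GC-content figures reported above are simple per-sequence summaries. A minimal sketch of how such values could be computed is given below; the function name and the toy fragment are hypothetical and are not taken from the study.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Hypothetical toy fragment, NOT a real Trichuris ITS2 sequence.
toy_fragment = "GCGCATGCCGTAGGCCTAGC"
print(len(toy_fragment), round(gc_content(toy_fragment), 1))
```

Applied to each aligned ITS2 amplicon, the same two numbers (length and GC percentage) would give the kind of per-species comparison the abstract describes.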
Plasmodium falciparum malaria in 1st-2nd century CE southern Italy.
Marciniak, Stephanie; Prowse, Tracy L; Herring, D Ann; Klunk, Jennifer; Kuch, Melanie; Duggan, Ana T; Bondioli, Luca; Holmes, Edward C; Poinar, Hendrik N
2016-12-05
The historical record attests to the devastation malaria exacted on ancient civilizations, particularly the Roman Empire [1]. However, evidence for the presence of malaria during the Imperial period in Italy (1st-5th century CE) is based on indirect sources, such as historical, epigraphic, or skeletal evidence. Although these sources are crucial for revealing the context of this disease, they cannot establish the causative species of Plasmodium. Importantly, definitive evidence for the presence of malaria is now possible through the implementation of ancient DNA technology. As malaria is presumed to have been at its zenith during the Imperial period [1], we selected first or second molars from 58 adults from three cemeteries from this time: Isola Sacra (associated with Portus Romae, 1st-3rd century CE), Velia (1st-2nd century CE), and Vagnari (1st-4th century CE). We performed hybridization capture using baits designed from the mitochondrial (mtDNA) genomes of Plasmodium spp. on a prioritized subset of 11 adults (informed by metagenomic sequencing). The mtDNA sequences generated provided compelling phylogenetic evidence for the presence of P. falciparum in two individuals. This is the first genomic data directly implicating P. falciparum in Imperial period southern Italy in adults.
Sastre-Garau, X; Favre, M; Couturier, J; Orth, G
2000-08-01
We previously described two genital carcinomas (IC2, IC4) containing human papillomavirus type 16 (HPV-16)- or HPV-18-related sequences integrated in chromosomal bands containing the c-myc (8q24) or N-myc (2p24) gene, respectively. The c-myc gene was rearranged and amplified in IC2 cells without evidence of overexpression. The N-myc gene was amplified and highly transcribed in IC4 cells. Here, the sequence of an 8039 bp IC4 DNA fragment containing the integrated viral sequences and the cellular junctions is reported. A 3948 bp segment of the genome of HPV-45 encompassing the upstream regulatory region and the E6 and E7 ORFs was integrated into the untranslated part of N-myc exon 3, upstream of the N-myc polyadenylation signal. Both N-myc and HPV-45 sequences were amplified 10- to 20-fold. The 3' ends of the major N-myc transcript were mapped upstream of the 5' junction. A minor N-myc/HPV-45 fusion transcript was also identified, as well as two abundant transcripts from the HPV-45 E6-E7 region. Large amounts of N-myc protein were detected in IC4 cells. A major alteration of c-myc sequences in IC2 cells involved the insertion of a non-coding sequence into the second intron and their co-amplification with the third exon, without any evidence for the integration of HPV-16 sequences within or close to the gene. Different patterns of myc gene alterations may thus be associated with integration of HPV DNA in genital tumours, including the activation of the protooncogene via a mechanism of insertional mutagenesis and/or gene amplification.
Transposable elements and G-quadruplexes.
Kejnovsky, Eduard; Tokan, Viktor; Lexa, Matej
2015-09-01
A significant part of eukaryotic genomes is formed by transposable elements (TEs) containing not only genes but also regulatory sequences. Some of the regulatory sequences located within TEs can form secondary structures like hairpins or three-stranded (triplex DNA) and four-stranded (quadruplex DNA) conformations. This review focuses on recent evidence showing that G-quadruplex-forming sequences in particular are often present in specific parts of TEs in plants and humans. We discuss the potential role of these structures in the TE life cycle as well as the impact of G-quadruplexes on replication, transcription, translation, chromatin status, and recombination. The aim of this review is to emphasize that TEs may serve as vehicles for the genomic spread of G-quadruplexes. These non-canonical DNA structures and their conformational switches may constitute another regulatory system that, together with small and long non-coding RNA molecules and proteins, contribute to the complex cellular network resulting in the large diversity of eukaryotes.
Osmundson, Todd W; Halling, Roy E; den Bakker, Henk C
2007-05-01
This study examines evidence for a particular arbutoid mycorrhizal interaction in páramo, a high-altitude neotropical ecosystem important in hydrological regulation but poorly known in terms of its fungal communities. Comarostaphylis arbutoides Lindley (Ericaceae) often forms dense thickets in Central American páramo habitats. Based on phylogenetic classification, it has been suggested that C. arbutoides forms arbutoid mycorrhizae with diverse Basidiomycetes and Ascomycetes; however, this assumption has not previously been confirmed. Based on field data, we hypothesized an arbutoid mycorrhizal association between C. arbutoides and the recently described bolete Leccinum monticola Halling & G.M. Mueller; in this study, we applied a rigorous approach using anatomical and molecular data to examine evidence for such an association. We examined root samples collected beneath L. monticola basidiomes for mycorrhizal structures, and we also compared rDNA internal transcribed spacer (ITS) sequences between mycorrhizal root tips and leaf or basidiome material of the suspected symbionts. Root cross sections showed a thin hyphal sheath and intracellular hyphal coils typical of arbutoid mycorrhizae. DNA sequence comparisons confirmed the identity of C. arbutoides and L. monticola as the mycorrhizal symbionts. In addition, this paper provides additional evidence for the widespread presence of minisatellite-like inserts in the ITS1 spacer in Leccinum species (including a characterization of the insert in L. monticola) and reports the use of an angiosperm-specific ITS primer pair useful for amplifying plant DNA from mycorrhizal roots without co-amplifying fungal DNA.
Kohno, K; Yasuzawa, K; Hirose, M; Kano, Y; Goshima, N; Tanaka, H; Imamoto, F
1994-06-01
The molecular mechanism of autoregulation of expression of the hupA gene in Escherichia coli was examined. The promoter of the gene contains a palindromic sequence with the potential to form a cruciform DNA structure in which the -35 sequence lies at the base of the stem and the -10 sequence forms a single-stranded loop. An artificial promoter lacking the palindrome, which was constructed by replacing a 10 nucleotide repeat for the predicted cruciform arm by a sequence in the opposite orientation, was not subject to HU-repression. DNA relaxation induced by deleting HU proteins and/or inhibiting DNA gyrase in cells results in increased expression from the hupA promoter. We propose that initiation of transcription of the hupA gene is negatively regulated by steric hindrance of the functional promoter domains for formation of the cruciform configuration, which is facilitated at least in part by negative supercoiling of the hupA promoter DNA region. The promoter region of the hupB gene also contains a palindromic sequence that can assume a cruciform configuration. Negative regulation of this gene by HU proteins may occur by a mechanism similar to that operating for the hupA gene.
DNA sequence templates adjacent nucleosome and ORC sites at gene amplification origins in Drosophila
Liu, Jun; Zimmer, Kurt; Rusch, Douglas B.; Paranjape, Neha; Podicheti, Ram; Tang, Haixu; Calvi, Brian R.
2015-01-01
Eukaryotic origins of DNA replication are bound by the origin recognition complex (ORC), which scaffolds assembly of a pre-replicative complex (pre-RC) that is then activated to initiate replication. Both pre-RC assembly and activation are strongly influenced by developmental changes to the epigenome, but molecular mechanisms remain incompletely defined. We have been examining the activation of origins responsible for developmental gene amplification in Drosophila. At a specific time in oogenesis, somatic follicle cells transition from genomic replication to a locus-specific replication from six amplicon origins. Previous evidence indicated that these amplicon origins are activated by nucleosome acetylation, but how this affects origin chromatin is unknown. Here, we examine nucleosome position in follicle cells using micrococcal nuclease digestion with Illumina sequencing. The results indicate that ORC binding sites and other essential origin sequences are nucleosome-depleted regions (NDRs). Nucleosome position at the amplicons was highly similar among developmental stages during which ORC is or is not bound, indicating that being an NDR is not sufficient to specify ORC binding. Importantly, the data suggest that nucleosomes and ORC have opposite preferences for DNA sequence and structure. We propose that nucleosome hyperacetylation promotes pre-RC assembly onto adjacent DNA sequences that are disfavored by nucleosomes but favored by ORC. PMID:26227968
mtDNA recombination in a natural population.
Saville, B J; Kohli, Y; Anderson, J B
1998-02-03
Variation in mtDNA has been used extensively to draw inferences in phylogenetics and population biology. In the majority of eukaryotes investigated, transmission of mtDNA is uniparental and clonal, with genotypic diversity arising from mutation alone. In other eukaryotes, the transmission of mtDNA is biparental or primarily uniparental with the possibility of "leakage" from the minority parent. In these cases, heteroplasmy carries the potential for recombination between mtDNAs of different descent. In fungi, such mtDNA recombination has long been documented but only in laboratory experiments and only under conditions in which heteroplasmy is ensured. Despite this experimental evidence, mtDNA recombination has not been to our knowledge documented in a natural population. Because evidence from natural populations is prerequisite to understanding the evolutionary impact of mtDNA recombination, we investigated the possibility of mtDNA recombination in an organism with the demonstrated potential for heteroplasmy in laboratory matings. Using nucleotide sequence data, we report here that the genotypic structure of mtDNA in a natural population of the basidiomycete fungus Armillaria gallica is inconsistent with purely clonal mtDNA evolution and is fully consistent with mtDNA recombination.
Isotachophoresis for fractionation and recovery of cytoplasmic RNA and nucleus from single cells.
Kuriyama, Kentaro; Shintaku, Hirofumi; Santiago, Juan G
2015-07-01
There is a substantial need for simultaneous analyses of RNA and DNA from individual single cells. Such analysis provides unique evidence of cell-to-cell differences and the correlation between gene expression and genomic mutation in highly heterogeneous cell populations. We present a novel microfluidic system that leverages isotachophoresis to fractionate and isolate cytoplasmic RNA and genomic DNA (gDNA) from single cells. The system uniquely enables independent, sequence-specific analyses of these critical markers. Our system uses a microfluidic chip with a simple geometry and four end-channel electrodes, and completes the entire process in <5 min, including lysis, purification, fractionation, and delivery to DNA and RNA output reservoirs, each containing high quality and purity aliquots with no measurable cross-contamination of cytoplasmic RNA versus gDNA. We demonstrate our system with simultaneous, sequence-specific quantitation using off-chip RT-qPCR and qPCR for simultaneous cytoplasmic RNA and gDNA analyses, respectively.
Sequence-dependent base pair stepping dynamics in XPD helicase unwinding
Qi, Zhi; Pugh, Robert A; Spies, Maria; Chemla, Yann R
2013-01-01
Helicases couple the chemical energy of ATP hydrolysis to directional translocation along nucleic acids and transient duplex separation. Understanding helicase mechanism requires that the basic physicochemical process of base pair separation be understood. This necessitates monitoring helicase activity directly, at high spatio-temporal resolution. Using optical tweezers with single base pair (bp) resolution, we analyzed DNA unwinding by XPD helicase, a Superfamily 2 (SF2) DNA helicase involved in DNA repair and transcription initiation. We show that monomeric XPD unwinds duplex DNA in 1-bp steps, yet exhibits frequent backsteps and undergoes conformational transitions manifested in 5-bp backward and forward steps. Quantifying the sequence dependence of XPD stepping dynamics with near base pair resolution, we provide the strongest and most direct evidence thus far that forward, single-base pair stepping of a helicase utilizes the spontaneous opening of the duplex. The proposed unwinding mechanism may be a universal feature of DNA helicases that move along DNA phosphodiester backbones. DOI: http://dx.doi.org/10.7554/eLife.00334.001 PMID:23741615
Kawabe, Akira; Charlesworth, Deborah
2007-02-01
We describe patterns of DNA variation among the three centromeric satellite families in Arabidopsis halleri and lyrata. The newly studied subspecies (A. halleri ssp. halleri and A. lyrata ssp. lyrata and petraea), like the previously studied A. halleri ssp. gemmifera and A. lyrata ssp. kawasakiana, have three different centromeric satellite families, the older pAa family (also present in A. arenosa) and two families, pAge1 and pAge2, that probably evolved more recently. Sequence variability is high in all three satellite families, and the pAa sequences do not cluster by their species of origin. Diversity in the pAge2 family is complex, and different from variation among copies of the other two families, showing clear evidence for exchange events among family members, especially in A. halleri ssp. halleri. In A. lyrata ssp. lyrata there is some evidence for recent rapid spread of pAge2 variants, suggesting selection favoring these sequences.
Scaling features of noncoding DNA
NASA Technical Reports Server (NTRS)
Stanley, H. E.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.
1999-01-01
We review evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene, and utilize this fact to build a Coding Sequence Finder Algorithm, which uses statistical ideas to locate the coding regions of an unknown DNA sequence. Finally, we describe briefly some recent work adapting to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function, and reporting that noncoding regions in eukaryotes display a larger redundancy than coding regions. Specifically, we consider the possibility that this result is solely a consequence of nucleotide concentration differences as first noted by Bonhoeffer and his collaborators. We find that cytosine-guanine (CG) concentration does have a strong "background" effect on redundancy. However, we find that for the purine-pyrimidine binary mapping rule, which is not affected by the difference in CG concentration, the Shannon redundancy for the set of analyzed sequences is larger for noncoding regions compared to coding regions.
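The purine-pyrimidine mapping and the Shannon redundancy mentioned above can be illustrated with a short sketch. The abstract does not give the exact redundancy formula, so the definition used here (one minus the n-tuple block entropy divided by its maximum for a binary alphabet) is an assumption, as are the function names.

```python
import math
from collections import Counter

# Purine (A, G) -> "R", pyrimidine (C, T) -> "Y".
PUR_PYR = str.maketrans("AGCT", "RRYY")


def block_entropy(symbols: str, n: int) -> float:
    """Shannon entropy (bits) of overlapping n-tuples in a symbol string."""
    if len(symbols) < n:
        raise ValueError("sequence shorter than block size")
    counts = Counter(symbols[i:i + n] for i in range(len(symbols) - n + 1))
    total = sum(counts.values())
    return -sum(c / total * math.log2(c / total) for c in counts.values())


def redundancy(seq: str, n: int = 2) -> float:
    """Assumed definition: 1 - H_n / H_max under the binary R/Y mapping."""
    binary = seq.upper().translate(PUR_PYR)
    h_max = float(n)  # n * log2(2) bits for a two-letter alphabet
    return 1.0 - block_entropy(binary, n) / h_max
```

Because every base is reduced to R or Y before counting, differences in CG concentration between sequences do not change the result, which is the property the abstract relies on.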
Guo, Yan; Cai, Qiuyin; Samuels, David C; Ye, Fei; Long, Jirong; Li, Chung-I; Winther, Jeanette F; Tawn, E Janet; Stovall, Marilyn; Lähteenmäki, Päivi; Malila, Nea; Levy, Shawn; Shaffer, Christian; Shyr, Yu; Shu, Xiao-Ou; Boice, John D
2012-05-15
The human mitochondrial genome has an exclusively maternal mode of inheritance. Mitochondrial DNA (mtDNA) is particularly vulnerable to environmental insults due in part to an underdeveloped DNA repair system, limited to base excision and homologous recombination repair. Radiation exposure to the ovaries may cause mtDNA mutations in oocytes, which may in turn be transmitted to offspring. We hypothesized that the children of female cancer survivors who received radiation therapy may have an increased rate of mtDNA heteroplasmy mutations, which conceivably could increase their risk of developing cancer and other diseases. We evaluated 44 DNA blood samples from 17 Danish and 1 Finnish families (18 mothers and 26 children). All mothers had been treated for cancer as children and radiation doses to their ovaries were determined based on medical records and computational models. DNA samples were sequenced for the entire mitochondrial genome using the Illumina GAII system. Mother's age at sample collection was positively correlated with mtDNA heteroplasmy mutations. There was evidence of heteroplasmy inheritance in that 9 of the 18 families had at least one child who inherited at least one heteroplasmy site from his or her mother. No significant difference in single nucleotide polymorphisms between mother and offspring, however, was observed. Radiation therapy dose to ovaries also was not significantly associated with the heteroplasmy mutation rate among mothers and children. No evidence was found that radiotherapy for pediatric cancer is associated with the mitochondrial genome mutation rate in female cancer survivors and their children.
Analysis of European mtDNAs for recombination.
Elson, J L; Andrews, R M; Chinnery, P F; Lightowlers, R N; Turnbull, D M; Howell, N
2001-01-01
The standard paradigm postulates that the human mitochondrial genome (mtDNA) is strictly maternally inherited and that, consequently, mtDNA lineages are clonal. As a result of mtDNA clonality, phylogenetic and population genetic analyses should therefore be free of the complexities imposed by biparental recombination. The use of mtDNA in analyses of human molecular evolution is contingent, in fact, on clonality, which is also a condition that is critical both for forensic studies and for understanding the transmission of pathogenic mtDNA mutations within families. This paradigm, however, has been challenged recently by Eyre-Walker and colleagues. Using two different tests, they have concluded that recombination has contributed to the distribution of mtDNA polymorphisms within the human population. We have assembled a database that comprises the complete sequences of 64 European and 2 African mtDNAs. When this set of sequences was analyzed using any of three measures of linkage disequilibrium, one of the tests of Eyre-Walker and colleagues, there was no evidence for mtDNA recombination. When their test for excess homoplasies was applied to our set of sequences, only a slight excess of homoplasies was observed. We discuss possible reasons that our results differ from those of Eyre-Walker and colleagues. When we take the various results together, our conclusion is that mtDNA recombination has not been sufficiently frequent during human evolution to overturn the standard paradigm.
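The abstract does not name the three linkage disequilibrium measures used. As an illustration only, the sketch below computes r^2, one commonly used measure, for a pair of biallelic sites in a set of haplotypes; the function name and input layout are assumptions.

```python
def r_squared(haplotypes, i, j):
    """r^2 between biallelic sites i and j across equal-length haplotype strings.

    The allele carried by the first haplotype at each site is used as the
    reference allele; r^2 does not depend on that choice for biallelic sites.
    """
    n = len(haplotypes)
    ref_i, ref_j = haplotypes[0][i], haplotypes[0][j]
    p_a = sum(h[i] == ref_i for h in haplotypes) / n
    p_b = sum(h[j] == ref_j for h in haplotypes) / n
    p_ab = sum(h[i] == ref_i and h[j] == ref_j for h in haplotypes) / n
    d = p_ab - p_a * p_b  # coefficient of disequilibrium
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("at least one site is monomorphic")
    return d * d / denom
```

In a strictly clonal genome, r^2 between site pairs is not expected to decline with the physical distance between them, so a decline with distance is the kind of signal that has been read as evidence of recombination.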
Variola virus topoisomerase: DNA cleavage specificity and distribution of sites in Poxvirus genomes.
Minkah, Nana; Hwang, Young; Perry, Kay; Van Duyne, Gregory D; Hendrickson, Robert; Lefkowitz, Elliot J; Hannenhalli, Sridhar; Bushman, Frederic D
2007-08-15
Topoisomerase enzymes regulate superhelical tension in DNA resulting from transcription, replication, repair, and other molecular transactions. Poxviruses encode an unusual type IB topoisomerase that acts only at conserved DNA sequences containing the core pentanucleotide 5'-(T/C)CCTT-3'. In X-ray structures of the variola virus topoisomerase bound to DNA, protein-DNA contacts were found to extend beyond the core pentanucleotide, indicating that the full recognition site has not yet been fully defined in functional studies. Here we report quantitation of DNA cleavage rates for an optimized 13 bp site and for all possible single base substitutions (40 total sites), with the goals of understanding the molecular mechanism of recognition and mapping topoisomerase sites in poxvirus genome sequences. The data allow a precise definition of enzyme-DNA interactions and the energetic contributions of each. We then used the resulting "action matrix" to show that favorable topoisomerase sites are distributed all along the length of poxvirus DNA sequences, consistent with a requirement for local release of superhelical tension in constrained topological domains. In orthopox genomes, an additional central cluster of sites was also evident. A negative correlation of predicted topoisomerase sites was seen relative to early terminators, but no correlation was seen with early or late promoters. These data define the full variola virus topoisomerase recognition site and provide a new window on topoisomerase function in vivo.
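Mapping candidate sites starts from the core pentanucleotide 5'-(T/C)CCTT-3' quoted above. The sketch below only locates that core on both strands of a sequence; it does not implement the 13 bp scoring matrix from the study, and the function names are hypothetical.

```python
import re

# Lookahead so that overlapping occurrences are all reported.
CORE = re.compile(r"(?=([TC]CCTT))")
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.upper().translate(COMPLEMENT)[::-1]


def core_sites(seq: str):
    """Return (strand, leftmost 0-based top-strand coordinate, matched core)."""
    seq = seq.upper()
    n = len(seq)
    hits = [("+", m.start(), m.group(1)) for m in CORE.finditer(seq)]
    for m in CORE.finditer(reverse_complement(seq)):
        # Convert the position on the reverse strand to top-strand coordinates.
        hits.append(("-", n - (m.start() + 5), m.group(1)))
    return sorted(hits, key=lambda h: h[1])


print(core_sites("AACCCTTGG"))
```

A full site predictor in the spirit of the abstract would then score the flanking bases of each hit against measured cleavage rates, which is outside the scope of this sketch.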
Hajibabaei, Mehrdad; Shokralla, Shadi; Zhou, Xin; Singer, Gregory A. C.; Baird, Donald J.
2011-01-01
Timely and accurate biodiversity analysis poses an ongoing challenge for the success of biomonitoring programs. Morphology-based identification of bioindicator taxa is time consuming, and rarely supports species-level resolution especially for immature life stages. Much work has been done in the past decade to develop alternative approaches for biodiversity analysis using DNA sequence-based approaches such as molecular phylogenetics and DNA barcoding. On-going assembly of DNA barcode reference libraries will provide the basis for a DNA-based identification system. The use of recently introduced next-generation sequencing (NGS) approaches in biodiversity science has the potential to further extend the application of DNA information for routine biomonitoring applications to an unprecedented scale. Here we demonstrate the feasibility of using 454 massively parallel pyrosequencing for species-level analysis of freshwater benthic macroinvertebrate taxa commonly used for biomonitoring. We designed our experiments in order to directly compare morphology-based, Sanger sequencing DNA barcoding, and next-generation environmental barcoding approaches. Our results show the ability of 454 pyrosequencing of mini-barcodes to accurately identify all species with more than 1% abundance in the pooled mixture. Although the approach failed to identify 6 rare species in the mixture, the presence of sequences from 9 species that were not represented by individuals in the mixture provides evidence that DNA based analysis may yet provide a valuable approach in finding rare species in bulk environmental samples. We further demonstrate the application of the environmental barcoding approach by comparing benthic macroinvertebrates from an urban region to those obtained from a conservation area. 
Although considerable effort will be required to robustly optimize NGS tools to identify species from bulk environmental samples, our results indicate the potential of an environmental barcoding approach for biomonitoring programs. PMID:21533287
Zhu, Zhixuan; Gui, Songtao; Jin, Jing; Yi, Rong; Wu, Zhihua; Qian, Qian; Ding, Yi
2016-09-01
Centromeres on eukaryotic chromosomes consist of large arrays of DNA repeats that undergo very rapid evolution. Nelumbo nucifera Gaertn. (sacred lotus) is a phylogenetic relict and an aquatic perennial basal eudicot. Studies concerning the centromeres of this basal eudicot species could provide ancient evolutionary perspectives. In this study, we characterized the centromeric marker protein NnCenH3 (sacred lotus centromere-specific histone H3 variant), and used a chromatin immunoprecipitation (ChIP)-based technique to recover the NnCenH3 nucleosome-associated sequences of sacred lotus. The properties of the centromere-binding protein and DNA sequences revealed notable divergence between sacred lotus and other flowering plants, including the following factors: (i) an NnCenH3 alternative splicing variant comprising only a partial centromere-targeting domain, (ii) active genes with low transcription levels in the NnCenH3 nucleosomal regions, and (iii) the prevalence of the Ty1/copia class of long terminal repeat (LTR) retrotransposons in the centromeres of sacred lotus chromosomes. In addition, the dynamic natures of the centromeric region showed that some of the centromeric repeat DNA sequences originated from telomeric repeats, and a pair of centromeres on the dicentric chromosome 1 was inactive in the metaphase cells of sacred lotus. Our characterization of the properties of centromeric DNA structure within the sacred lotus genome describes a centromeric profile in ancient basal eudicots and might provide evidence of the origins and evolution of centromeres. Furthermore, the identification of centromeric DNA sequences is of great significance for the assembly of the sacred lotus genome.
Polyfluorophore Labels on DNA: Dramatic Sequence Dependence of Quenching
Teo, Yin Nah; Wilson, James N.
2010-01-01
We describe studies carried out in the DNA context to test how a common fluorescence quencher, dabcyl, interacts with oligodeoxynucleoside fluorophores (ODFs), a system of stacked, electronically interacting fluorophores built on a DNA scaffold. We tested twenty different tetrameric ODF sequences containing varied combinations and orderings of pyrene (Y), benzopyrene (B), perylene (E), dimethylaminostilbene (D), and spacer (S) monomers conjugated to the 3′ end of a DNA oligomer. Hybridization of this probe sequence to a dabcyl-labeled complementary strand resulted in strong quenching of fluorescence in 85% of the twenty ODF sequences. The high efficiency of quenching was also established by their large Stern–Volmer constants (KSV) of between 2.1 × 10^4 and 4.3 × 10^5 M^-1, measured with a free dabcyl quencher. Interestingly, quenching of ODFs displayed strong sequence dependence. This was particularly evident in anagrams of ODF sequences; for example, the sequence BYDS had a KSV that was approximately two orders of magnitude greater than that of BSDY, which has the same dye composition. Other anagrams, for example EDSY and ESYD, also displayed different responses upon quenching by dabcyl. Analysis of spectra showed that apparent excimer and exciplex emission bands were quenched with much greater efficiency compared to monomer emission bands by at least an order of magnitude. This suggests an important role played by delocalized excited states of the π stack of fluorophores in the amplified quenching of fluorescence. PMID:19780115
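To give a feel for the reported Stern–Volmer constants, the sketch below applies the standard Stern–Volmer relation F0/F = 1 + KSV[Q] to the two ends of the reported range. The 10 µM quencher concentration is an assumed illustrative value, not one taken from the study, and the function names are hypothetical.

```python
def stern_volmer_ratio(k_sv: float, quencher_molar: float) -> float:
    """F0/F from the Stern-Volmer relation F0/F = 1 + K_SV * [Q]."""
    return 1.0 + k_sv * quencher_molar


def fraction_quenched(k_sv: float, quencher_molar: float) -> float:
    """Fraction of the unquenched fluorescence that is lost: 1 - F/F0."""
    return 1.0 - 1.0 / stern_volmer_ratio(k_sv, quencher_molar)


ASSUMED_Q = 10e-6  # 10 micromolar free quencher (illustrative assumption)
for k_sv in (2.1e4, 4.3e5):  # range of K_SV values reported in the abstract
    print(k_sv, round(fraction_quenched(k_sv, ASSUMED_Q), 3))
```

At a fixed quencher concentration, a larger KSV translates directly into a larger quenched fraction, which is why the two-orders-of-magnitude difference between anagram sequences is notable.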
Molecular evidence of simian virus 40 infections in children
NASA Technical Reports Server (NTRS)
Butel, J. S.; Arrington, A. S.; Wong, C.; Lednicky, J. A.; Finegold, M. J.
1999-01-01
Recent studies have detected simian virus 40 (SV40) DNA in certain human tumors and normal tissues. The significance of human infections by SV40, which was first discovered as a contaminant of poliovirus vaccines used between 1955 and 1963, remains unknown. The occurrence of SV40 infections in unselected hospitalized children was evaluated. Polymerase chain reaction and DNA sequence analyses were done on archival tissue specimens from patients positive for SV40 neutralizing antibody. SV40 DNA was identified in samples from 4 of 20 children (1 Wilms' tumor, 3 transplanted kidney samples). Sequence variation among SV40 regulatory regions ruled out laboratory contamination of specimens. This study shows the presence of SV40 infections in pediatric patients born after 1982.
Zhi-Bin Wen; Ming-Li Zhang; Ge-Lin Zhu; Stewart C. Sanderson
2010-01-01
To reconstruct phylogeny and verify the monophyly of major subgroups, a total of 52 species representing almost all species of Salsoleae s.l. in China were sampled, with analysis based on three molecular markers (nrDNA ITS, cpDNA psbB-psbH and rbcL), using maximum parsimony, maximum likelihood, and Bayesian inference methods. Our molecular evidence provides strong...
Somatic immunoglobulin hypermutation
Diaz, Marilyn; Casali, Paolo
2015-01-01
Immunoglobulin hypermutation provides the structural correlate for the affinity maturation of the antibody response. Characteristic modalities of this mechanism include a preponderance of point-mutations with prevalence of transitions over transversions, and the mutational hotspot RGYW sequence. Recent evidence suggests a mechanism whereby DNA-breaks induce error-prone DNA synthesis in immunoglobulin V(D)J regions by error-prone DNA polymerases. The nature of the targeting mechanism and the trans-factors effecting such breaks and their repair remain to be determined. PMID:11869898
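The RGYW hotspot named above uses IUPAC codes (R = A or G, Y = C or T, W = A or T). A minimal sketch for locating the motif, together with its reverse complement WRCY, is shown below; searching WRCY is an addition made here to cover the opposite strand, and the function name is hypothetical.

```python
import re

# IUPAC: R = A/G, Y = C/T, W = A/T. Lookaheads report overlapping matches.
RGYW = re.compile(r"(?=([AG]G[CT][AT]))")
WRCY = re.compile(r"(?=([AT][AG]C[CT]))")  # reverse complement of RGYW


def hotspots(seq: str) -> dict:
    """Return 0-based start positions of RGYW and WRCY motifs in seq."""
    seq = seq.upper()
    return {
        "RGYW": [m.start() for m in RGYW.finditer(seq)],
        "WRCY": [m.start() for m in WRCY.finditer(seq)],
    }


print(hotspots("CAGCTGGTA"))
```

Positions returned this way could be compared with observed point mutations in a V(D)J region to check whether mutations cluster at the motif.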
Signatures of Climatic Change in Human Mitochondrial DNA from Europe
NASA Astrophysics Data System (ADS)
Richards, M. B.; Macaulay, V. A.; Torroni, A.; Bandelt, H.-J.
Founder analysis is an approach to analysing non-recombining DNA sequence data, such as variation in the mitochondrial DNA (mtDNA), which aims at identifying and dating migrations into new territory. We applied the approach to about 4,000 human mtDNA sequences from Europe and the Near East, in order to estimate the proportion of modern lineages whose ancestors arrived at various times during the continent's past. We found that the major signal dates to about 15,000 years ago, at the time of rewarming following the Last Glacial Maximum (LGM). There is little or no archaeological evidence for immigration into Europe at this time, and the record indicates that at least parts of southern Europe remained populated during the LGM. Therefore, we interpret this signal as the trace of a bottleneck at the time of the LGM, as a result of the retreat from northern Europe during the peak of the glaciation, followed by a re-expansion from one or more refugial zones. Immigration episodes then figure at the beginning of the Early Upper Palaeolithic, during the Middle Upper Palaeolithic, and with the Neolithic. The impact of the latter on the composition of the European mtDNA pool was evidently rather minor. This result implies that climate is likely to have been a major force shaping human demographic history in Europe.
Molecular Identification of Bacteria from Aseptically Loose Implants
Kobayashi, Naomi; Procop, Gary W.; Krebs, Viktor; Kobayashi, Hideo
2008-01-01
Polymerase chain reaction (PCR) assays have been used to detect bacteria adherent to failed orthopaedic implants, but some PCR assays have had problems with probable false-positive results. We used a combination of a Staphylococcus species-specific PCR and a universal PCR followed by DNA sequencing to identify bacteria on implants retrieved from 52 patients (92 implants) at revision arthroplasty. We addressed two questions in this study: (1) Is this method able to show the existence of bacterial DNA on presumed aseptically loose implants?; and (2) What proportion of presumed aseptic or culture-negative implants was positive for bacterial DNA by PCR? Fourteen implants (15%) were believed infected, whereas 74 implants (85%) were believed aseptic. Each implant was sonicated and the resulting solution was submitted for dual real-time PCR assay and culture. All implants believed aseptically loose were culture-negative, but nine of the 74 (12%) had bacterial DNA by PCR; two (2.7%) were PCR-positive and also showed histologic findings suggestive of infection. Uniquely developed PCR and bacterial sequencing assays showed bacterial DNA on 12% of implants removed for presumed aseptic loosening. Additional studies are needed to determine the clinical importance of bacterial DNA detected by PCR but not by conventional culture. Level of Evidence: Level III, diagnostic study. See the Guidelines for Authors for a complete description of levels of evidence. PMID:18438724
Coyne, Robert S; Thiagarajan, Mathangi; Jones, Kristie M; Wortman, Jennifer R; Tallon, Luke J; Haas, Brian J; Cassidy-Hanley, Donna M; Wiley, Emily A; Smith, Joshua J; Collins, Kathleen; Lee, Suzanne R; Couvillion, Mary T; Liu, Yifan; Garg, Jyoti; Pearlman, Ronald E; Hamilton, Eileen P; Orias, Eduardo; Eisen, Jonathan A; Methé, Barbara A
2008-01-01
Background Tetrahymena thermophila, a widely studied model for cellular and molecular biology, is a binucleated single-celled organism with a germline micronucleus (MIC) and somatic macronucleus (MAC). The recent draft MAC genome assembly revealed low sequence repetitiveness, a result of the epigenetic removal of invasive DNA elements found only in the MIC genome. Such low repetitiveness makes complete closure of the MAC genome a feasible goal, which to achieve would require standard closure methods as well as removal of minor MIC contamination of the MAC genome assembly. Highly accurate preliminary annotation of Tetrahymena's coding potential was hindered by the lack of both comparative genomic sequence information from close relatives and significant amounts of cDNA evidence, thus limiting the value of the genomic information and also leaving unanswered certain questions, such as the frequency of alternative splicing. Results We addressed the problem of MIC contamination using comparative genomic hybridization with purified MIC and MAC DNA probes against a whole genome oligonucleotide microarray, allowing the identification of 763 genome scaffolds likely to contain MIC-limited DNA sequences. We also employed standard genome closure methods to essentially finish over 60% of the MAC genome. For the improvement of annotation, we have sequenced and analyzed over 60,000 verified EST reads from a variety of cellular growth and development conditions. Using this EST evidence, a combination of automated and manual reannotation efforts led to updates that affect 16% of the current protein-coding gene models. By comparing EST abundance, many genes showing apparent differential expression between these conditions were identified. Rare instances of alternative splicing and uses of the non-standard amino acid selenocysteine were also identified. Conclusion We report here significant progress in genome closure and reannotation of Tetrahymena thermophila. 
Our experience to date suggests that complete closure of the MAC genome is attainable. Using the new EST evidence, automated and manual curation has resulted in substantial improvements to the over 24,000 gene models, which will be valuable to researchers studying this model organism as well as for comparative genomics purposes. PMID:19036158
Tharmatha, T; Gajapathy, K; Ramasamy, R; Surendran, S N
2017-02-01
The correct identification of sand fly vectors of leishmaniasis is important for controlling the disease. Genetic data, particularly DNA sequence data, have lately become an important adjunct to the use of morphological criteria for this purpose. A recent DNA sequencing study revealed the presence of two cryptic species in the Sergentomyia bailyi species complex in India. The present study was undertaken to ascertain the presence of cryptic species in the Se. bailyi complex in Sri Lanka using morphological characteristics and DNA sequences from cytochrome c oxidase subunits. Sand flies were collected from leishmaniasis endemic and non-endemic dry zone districts of Sri Lanka. A total of 175 Se. bailyi specimens were initially screened for morphological variations and the identified samples formed two groups, tentatively termed Se. bailyi species A and B, based on the relative length of the sensilla chaeticum and antennal flagellomere. DNA sequences from the mitochondrial cytochrome c oxidase subunit I (COI) and subunit II (COII) genes of morphologically identified Se. bailyi species A and B were subsequently analyzed. The two species showed differences in the COI and COII gene sequences and were placed in two separate clades by phylogenetic analysis. An allele specific polymerase chain reaction assay based on sequence variation in the COI gene accurately differentiated species A and B. The study therefore describes the first morphological and genetic evidence for the presence of two cryptic species within the Se. bailyi complex in Sri Lanka and a DNA-based laboratory technique for differentiating them.
Sequence variation between 462 human individuals fine-tunes functional sites of RNA processing
NASA Astrophysics Data System (ADS)
Ferreira, Pedro G.; Oti, Martin; Barann, Matthias; Wieland, Thomas; Ezquina, Suzana; Friedländer, Marc R.; Rivas, Manuel A.; Esteve-Codina, Anna; Estivill, Xavier; Guigó, Roderic; Dermitzakis, Emmanouil; Antonarakis, Stylianos; Meitinger, Thomas; Strom, Tim M.; Palotie, Aarno; François Deleuze, Jean; Sudbrak, Ralf; Lerach, Hans; Gut, Ivo; Syvänen, Ann-Christine; Gyllensten, Ulf; Schreiber, Stefan; Rosenstiel, Philip; Brunner, Han; Veltman, Joris; Hoen, Peter A. C. T.; Jan van Ommen, Gert; Carracedo, Angel; Brazma, Alvis; Flicek, Paul; Cambon-Thomsen, Anne; Mangion, Jonathan; Bentley, David; Hamosh, Ada; Rosenstiel, Philip; Strom, Tim M.; Lappalainen, Tuuli; Guigó, Roderic; Sammeth, Michael
2016-09-01
Recent advances in the cost-efficiency of sequencing technologies enabled the combined DNA- and RNA-sequencing of human individuals at the population-scale, making genome-wide investigations of the inter-individual genetic impact on gene expression viable. Employing mRNA-sequencing data from the Geuvadis Project and genome sequencing data from the 1000 Genomes Project we show that the computational analysis of DNA sequences around splice sites and poly-A signals is able to explain several observations in the phenotype data. In contrast to widespread assessments of statistically significant associations between DNA polymorphisms and quantitative traits, we developed a computational tool to pinpoint the molecular mechanisms by which genetic markers drive variation in RNA-processing, cataloguing and classifying alleles that change the affinity of core RNA elements to their recognizing factors. The in silico models we employ further suggest RNA editing can moonlight as a splicing-modulator, albeit less frequently than genomic sequence diversity. Beyond existing annotations, we demonstrate that the ultra-high resolution of RNA-Seq combined from 462 individuals also provides evidence for thousands of bona fide novel elements of RNA processing—alternative splice sites, introns, and cleavage sites—which are often rare and lowly expressed but in other characteristics similar to their annotated counterparts.
The study of human Y chromosome variation through ancient DNA.
Kivisild, Toomas
2017-05-01
High throughput sequencing methods have completely transformed the study of human Y chromosome variation by offering a genome-scale view on genetic variation retrieved from ancient human remains in the context of a growing number of high coverage whole Y chromosome sequence data from living populations from across the world. The ancient Y chromosome sequences are providing us the first exciting glimpses into the past variation of the male-specific compartment of the genome and the opportunity to evaluate models based on previously made inferences from patterns of genetic variation in living populations. Analyses of the ancient Y chromosome sequences are challenging not only because of issues generally related to ancient DNA work, such as DNA damage-induced mutations and low content of endogenous DNA in most human remains, but also because of specific properties of the Y chromosome, such as its highly repetitive nature and high homology with the X chromosome. Shotgun sequencing of uniquely mapping regions of the Y chromosomes to sufficiently high coverage is still challenging and costly in poorly preserved samples. To increase the coverage of specific target SNPs, capture-based methods have been developed and used in recent years to generate Y chromosome sequence data from hundreds of prehistoric skeletal remains. Besides the prospect of directly testing how much genetic change in a given time period has accompanied changes in material culture, the sequencing of ancient Y chromosomes also allows us to better understand the rate at which mutations accumulate and get fixed over time. This review considers genome-scale evidence on ancient Y chromosome diversity that has recently started to accumulate in geographic areas favourable to DNA preservation. More specifically the review focuses on examples of regional continuity and change of the Y chromosome haplogroups in North Eurasia and in the New World.
RAD tag sequencing as a source of SNP markers in Cynara cardunculus L
2012-01-01
Background The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus. Results RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mer distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicot species, provided further evidence of the random representation of the C. cardunculus genome generated by RAD sampling. Conclusion The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach generated a large and robust SNP dataset through the adoption of optimized filtering criteria. PMID:22214349
DNA Looping Facilitates Targeting of a Chromatin Remodeling Enzyme
Yadon, Adam N; Singh, Badri Nath; Hampsey, Michael; Tsukiyama, Toshio
2013-01-01
Summary ATP-dependent chromatin remodeling enzymes are highly abundant and play pivotal roles regulating DNA-dependent processes. The mechanisms by which they are targeted to specific loci have not been well understood on a genome-wide scale. Here we present evidence that a major targeting mechanism for the Isw2 chromatin remodeling enzyme to specific genomic loci is through sequence-specific transcription factor (TF)-dependent recruitment. Unexpectedly, Isw2 is recruited in a TF-dependent fashion to a large number of loci without TF binding sites. Using the 3C assay, we show that Isw2 can be targeted by Ume6- and TFIIB-dependent DNA looping. These results identify DNA looping as a previously unknown mechanism for the recruitment of a chromatin remodeling enzyme and define a novel function for DNA looping. We also present evidence suggesting that Ume6-dependent DNA looping is involved in chromatin remodeling and transcriptional repression, revealing a mechanism by which the three-dimensional folding of chromatin affects DNA-dependent processes. PMID:23478442
Product analysis illuminates the final steps of IES deletion in Tetrahymena thermophila
Saveliev, Sergei V.; Cox, Michael M.
2001-01-01
DNA sequences (IES elements) eliminated from the developing macronucleus in the ciliate Tetrahymena thermophila are released as linear fragments, which have now been detected and isolated. A PCR-mediated examination of fragment end structures reveals three types of strand scission events, reflecting three steps in the deletion process. New evidence is provided for two steps proposed previously: an initiating double-stranded cleavage, and strand transfer to create a branched deletion intermediate. The fragment ends provide evidence for a previously uncharacterized third step: the branched DNA strand is cleaved at one of several defined sites located within 15–16 nucleotides of the IES boundary, liberating the deleted DNA in a linear form. PMID:11406601
FANCJ promotes DNA synthesis through G-quadruplex structures
Castillo Bosch, Pau; Segura-Bayona, Sandra; Koole, Wouter; van Heteren, Jane T; Dewar, James M; Tijsterman, Marcel; Knipscheer, Puck
2014-01-01
Our genome contains many G-rich sequences, which have the propensity to fold into stable secondary DNA structures called G4 or G-quadruplex structures. These structures have been implicated in cellular processes such as gene regulation and telomere maintenance. However, G4 sequences are prone to mutations particularly upon replication stress or in the absence of specific helicases. To investigate how G-quadruplex structures are resolved during DNA replication, we developed a model system using ssDNA templates and Xenopus egg extracts that recapitulates eukaryotic G4 replication. Here, we show that G-quadruplex structures form a barrier for DNA replication. Nascent strand synthesis is blocked at one or two nucleotides from the G4. After transient stalling, G-quadruplexes are efficiently unwound and replicated. In contrast, depletion of the FANCJ/BRIP1 helicase causes persistent replication stalling at G-quadruplex structures, demonstrating a vital role for this helicase in resolving these structures. FANCJ performs this function independently of the classical Fanconi anemia pathway. These data provide evidence that the G4 sequence instability in FANCJ−/− cells and Fancj/dog1 deficient C. elegans is caused by replication stalling at G-quadruplexes. PMID:25193968
Human Contamination in Public Genome Assemblies.
Kryukov, Kirill; Imanishi, Tadashi
2016-01-01
Contamination in a genome assembly can lead to wrong or confusing results when that genome is used as a reference in sequence comparison. Although bacterial contamination is well known, the problem of human-originated contamination has received little attention. In this study we surveyed 45,735 available genome assemblies for evidence of human contamination. We used lineage specificity to distinguish between contamination and conservation. We found that 154 genome assemblies contain fragments that, with high confidence, originate as contamination from human DNA. The majority of contaminating human sequences were present in the reference human genome assembly for over a decade. We recommend that existing contaminated genomes be revised to remove contaminating sequence, and that new assemblies be thoroughly checked for the presence of human DNA before they are submitted to public databases.
Wada, H; Satoh, N
1994-01-01
Almost the entire sequences of 18S rDNA were determined for two chaetognaths, five echinoderms, a hemichordate, and two urochordates (a larvacean and a salp). Phylogenetic comparisons of the sequences, together with those of other deuterostomes (an ascidian, a cephalochordate, and vertebrates) and protostomes (an arthropod and a mollusc), suggest the monophyly of the deuterostomes, with the exception of the chaetognaths. Chaetognaths may not be a group of deuterostomes. The deuterostome group closest to vertebrates was the group of cephalochordates. Ascidians, larvaceans, and salps seem to form a discrete group (urochordates), in which the early divergence of larvaceans is evident. These results support the hypothesis that chordates evolved from free-living ancestors. PMID:8127885
Comment on "Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry".
Buckley, Mike; Walker, Angela; Ho, Simon Y W; Yang, Yue; Smith, Colin; Ashton, Peter; Oates, Jane Thomas; Cappellini, Enrico; Koon, Hannah; Penkman, Kirsty; Elsworth, Ben; Ashford, Dave; Solazzo, Caroline; Andrews, Phillip; Strahler, John; Shapiro, Beth; Ostrom, Peggy; Gandhi, Hasand; Miller, Webb; Raney, Brian; Zylber, Maria Ines; Gilbert, M Thomas P; Prigodich, Richard V; Ryan, Michael; Rijsdijk, Kenneth F; Janoo, Anwar; Collins, Matthew J
2008-01-04
We used authentication tests developed for ancient DNA to evaluate claims by Asara et al. (Reports, 13 April 2007, p. 280) of collagen peptide sequences recovered from mastodon and Tyrannosaurus rex fossils. Although the mastodon samples pass these tests, absence of amino acid composition data, lack of evidence for peptide deamidation, and association of alpha1(I) collagen sequences with amphibians rather than birds suggest that T. rex does not.
Exploring the Limits of DNA Size: Naphtho-homologated DNA Bases and Pairs
Lee, Alex H. F.; Kool, Eric T.
2008-01-01
A new design for DNA bases and base pairs is described in which the pyrimidine bases are widened by naphtho-homologation. Two naphtho-homologated deoxyribosides, dyyT (1) and dyyC (2) were synthesized and could be incorporated into oligonucleotides as suitably protected phosphoramidite derivatives. The deoxyribosides were found to be fluorescent, with emission maxima at 446 and 433 nm, respectively. Studies with single substitutions of 1 and 2 in the natural DNA context revealed exceptionally strong base stacking propensity for both. Sequences containing multiple substitutions of 1 and 2 paired opposite adenine and guanine were subsequently mixed and studied by several analytical methods. Data from UV mixing experiments, FRET measurements, fluorescence quenching experiments, and hybridizations on beads suggest that complementary “doublewide DNA” (yyDNA) strands may self-assemble into helical complexes with 1:1 stoichiometry. Data from thermal denaturation plots and CD spectra were less conclusive. Control experiments in one sequence context gave evidence that yyDNA helices, if formed, are preferentially antiparallel and are sequence selective. Hypothesized base pairing schemes are analogous to Watson-Crick pairing, but with glycosidic C1′-C1′ distances widened by over 45%, to ca. 15.2 Å. The possible self-assembly of the double-wide DNA helix establishes a new limit for the size of information-encoding, DNA-like molecules, and the fluorescence of yyDNA bases suggests uses as reporters in monomeric and oligomeric forms. PMID:16834396
Deletions of fetal and adult muscle cDNA in Duchenne and Becker muscular dystrophy patients.
Cross, G S; Speer, A; Rosenthal, A; Forrest, S M; Smith, T J; Edwards, Y; Flint, T; Hill, D; Davies, K E
1987-01-01
We have isolated a cDNA molecule from a human adult muscle cDNA library which is deleted in several Duchenne muscular dystrophy patients. Patient deletions have been used to map the exons across the Xp21 region of the short arm of the X chromosome. We demonstrate that a very mildly affected 61 year old patient is deleted for at least nine exons of the adult cDNA. We find no evidence for differential exon usage between adult and fetal muscle in this region of the gene. There must therefore be less essential domains of the protein structure which can be removed without complete loss of function. The sequence of 2.0 kb of the adult cDNA shows no homology to any previously described protein listed in the data banks although sequence comparison at the amino acid level suggests that the protein has a structure not dissimilar to rod structures of cytoskeletal proteins such as lamin and myosin. There are single nucleotide differences in the DNA sequence between the adult and fetal cDNAs which result in amino acid changes but none that would be predicted to change the structure of the protein dramatically. PMID:3428261
DNA unzipping phase diagram calculated via replica theory.
Roland, C Brian; Hatch, Kristi Adamson; Prentiss, Mara; Shakhnovich, Eugene I
2009-05-01
We show how single-molecule unzipping experiments can provide strong evidence that the zero-force melting transition of long molecules of natural dsDNA should be classified as a phase transition of the higher-order type (continuous). Toward this end, we study a statistical-mechanics model for the fluctuating structure of a long molecule of dsDNA, and compute the equilibrium phase diagram for the experiment in which the molecule is unzipped under applied force. We consider a perfect-matching dsDNA model, in which the loops are volume-excluding chains with arbitrary loop exponent c. We include stacking interactions, hydrogen bonds, and main-chain entropy. We include sequence heterogeneity at the level of random sequences; in particular, there is no correlation in the base-pairing (bp) energy from one sequence position to the next. We present heuristic arguments to demonstrate that the low-temperature macrostate does not exhibit degenerate ergodicity breaking. We use this claim to understand the results of our replica-theoretic calculation of the equilibrium properties of the system. As a function of temperature, we obtain the minimal force at which the molecule separates completely. This critical-force curve is a line in the temperature-force phase diagram that separates the region where the molecule exists primarily as a double helix from the region where the molecule exists as two separate strands. We compare our random-sequence model to magnetic tweezer experiments performed on the 48,502 bp genome of bacteriophage lambda. We find good agreement with the experimental data, which is restricted to temperatures between 24 and 50 degrees C. At higher temperatures, the critical-force curve of our random-sequence model is very different from that of the homogeneous-sequence version of our model. For both sequence models, the critical force falls to zero at the melting temperature T_{c} like |T-T_{c}|^{alpha}.
For the homogeneous-sequence model, alpha=1/2 almost exactly, while for the random-sequence model, alpha is approximately 0.9. Importantly, the shape of the critical-force curve is connected, via our theory, to the manner in which the helix fraction falls to zero at T_{c}. The helix fraction is the property that is used to classify the melting transition as a type of phase transition. In our calculation, the shape of the critical-force curve holds strong evidence that the zero-force melting transition of long natural dsDNA should be classified as a higher-order (continuous) phase transition. Specifically, the order is 3rd or greater.
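The power-law behaviour quoted in the abstract above, with the critical force vanishing like |T-T_{c}|^{alpha} near the melting temperature, can be illustrated numerically. This is only a sketch of the scaling form, not the replica calculation itself: the prefactor `f0`, the reduced temperature units, and the function name `critical_force` are assumptions introduced for illustration.

```python
def critical_force(T, T_c, alpha, f0=1.0):
    """Scaling form f_c(T) = f0 * (T_c - T)**alpha for T < T_c, else 0.

    f0 is a hypothetical prefactor; only the exponent alpha comes from
    the abstract (1/2 for the homogeneous-sequence model, about 0.9 for
    the random-sequence model).
    """
    if T >= T_c:
        return 0.0  # above melting the strands are already separated
    return f0 * (T_c - T) ** alpha


if __name__ == "__main__":
    T_c = 1.0  # reduced units, chosen only for illustration
    for T in (0.75, 0.90, 0.99):
        homogeneous = critical_force(T, T_c, alpha=0.5)
        random_seq = critical_force(T, T_c, alpha=0.9)
        print(f"T={T:.2f}  alpha=1/2: {homogeneous:.4f}  alpha=0.9: {random_seq:.4f}")
```

With equal prefactors, the larger exponent makes the critical force approach zero more gently in slope but at smaller values close to T_{c}, which is the qualitative difference between the two models that the abstract describes.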
Tooley, Paul W; Bandyopadhyay, Ranajit; Carras, Marie M; Pazoutová, Sylvie
2006-04-01
Isolates of Claviceps causing ergot on sorghum in India were analysed by AFLP analysis, and by analysis of DNA sequences of the EF-1alpha gene intron 4 and beta-tubulin gene intron 3 region. Of 89 isolates assayed from six states in India, four were determined to be C. sorghi, and the rest C. africana. A relatively low level of genetic diversity was observed within the Indian C. africana population. No evidence of genetic exchange between C. africana and C. sorghi was observed in either AFLP or DNA sequence analysis. Phylogenetic analysis was conducted using DNA sequences from 14 different Claviceps species. A multigene phylogeny based on the EF-1alpha gene intron 4, the beta-tubulin gene intron 3 region, and rDNA showed that C. sorghi grouped most closely with C. gigantea and C. africana. Although the Claviceps species we analysed were closely related, they colonize hosts that are taxonomically very distinct suggesting that there is no direct coevolution of Claviceps with its hosts.
Footprinting reveals that nogalamycin and actinomycin shuffle between DNA binding sites.
Fox, K R; Waring, M J
1986-01-01
The hypothesis that sequence-selective DNA-binding antibiotics locate their preferred binding sites by a process involving migration from nonspecific sites has been tested by footprinting with DNAase I. Footprinting patterns on the tyrT DNA fragment produced by nogalamycin and actinomycin change with time after mixing the antibiotic with the DNA. Sites of protection as well as enhanced cleavage are seen to develop in a fashion which is both temperature and concentration-dependent. At certain sites cutting is transiently enhanced, then blocked. Limited evidence for slow reaction with echinomycin and mithramycin is presented, but the kinetics of footprinting with daunomycin and distamycin appear instantaneous. The feasibility of adducing direct evidence for shuffling by footprinting seems to be governed by slow dissociation of the antibiotic-DNA complex. It may also be dependent upon the mode of binding, be it intercalative or non-intercalative in character. PMID:2421246
Wallgren, Marcus; Mohammad, Jani B.; Yan, Kok-Phen; Pourbozorgi-Langroudi, Parham; Ebrahimi, Mahsa; Sabouri, Nasim
2016-01-01
Certain guanine-rich sequences have an inherent propensity to form G-quadruplex (G4) structures. G4 structures are e.g. involved in telomere protection and gene regulation. However, they also constitute obstacles during replication if they remain unresolved. To overcome these threats to genome integrity, organisms harbor specialized G4 unwinding helicases. In Schizosaccharomyces pombe, one such candidate helicase is Pfh1, an evolutionarily conserved Pif1 homolog. Here, we addressed whether putative G4 sequences in S. pombe can adopt G4 structures and, if so, whether Pfh1 can resolve them. We tested two G4 sequences, derived from S. pombe ribosomal and telomeric DNA regions, and demonstrated that they form inter- and intramolecular G4 structures, respectively. Also, Pfh1 was enriched in vivo at the ribosomal G4 DNA and telomeric sites. The nuclear isoform of Pfh1 (nPfh1) unwound both types of structure, and although the G4-stabilizing compound Phen-DC3 significantly enhanced their stability, nPfh1 still resolved them efficiently. However, stable G4 structures significantly inhibited adenosine triphosphate hydrolysis by nPfh1. Because ribosomal and telomeric DNA contain putative G4 regions conserved from yeasts to humans, our studies support the important role of G4 structure formation in these regions and provide further evidence for a conserved role for Pif1 helicases in resolving G4 structures. PMID:27185885
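Putative G4 sequences of the kind discussed above are often flagged computationally before any structural test. The sketch below uses a commonly cited heuristic, four runs of at least three guanines separated by loops of 1 to 7 nucleotides; this pattern is an assumption brought in for illustration and is not stated in the abstract, and the name `putative_g4` is hypothetical. A match indicates only sequence potential, not that a G4 structure actually forms.

```python
import re

# Heuristic motif (an assumption, not taken from the abstract):
# G-run, then three repeats of (loop of 1-7 nt followed by a G-run).
G4_MOTIF = re.compile(r"G{3,}(?:[ACGT]{1,7}?G{3,}){3}")


def putative_g4(seq):
    """Return (start, end, matched_sequence) for non-overlapping motif hits.

    Only the given strand is scanned; C-rich matches on the opposite
    strand would need a second pass on the reverse complement.
    """
    seq = seq.upper()
    return [(m.start(), m.end(), m.group()) for m in G4_MOTIF.finditer(seq)]


if __name__ == "__main__":
    for hit in putative_g4("TTGGGAGGGTGGGCGGGTT"):
        print(hit)
```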
Phylogeographic Analysis of Mitochondrial DNA in Northern Asian Populations
Derenko, Miroslava; Malyarchuk, Boris; Grzybowski, Tomasz; Denisova, Galina; Dambueva, Irina; Perkova, Maria; Dorzhu, Choduraa; Luzina, Faina; Lee, Hong Kyu; Vanecek, Tomas; Villems, Richard; Zakharov, Ilia
2007-01-01
To elucidate the human colonization process of northern Asia and human dispersals to the Americas, a diverse subset of 71 mitochondrial DNA (mtDNA) lineages was chosen for complete genome sequencing from the collection of 1,432 control-region sequences sampled from 18 autochthonous populations of northern, central, eastern, and southwestern Asia. On the basis of complete mtDNA sequencing, we have revised the classification of haplogroups A, D2, G1, M7, and I; identified six new subhaplogroups (I4, N1e, G1c, M7d, M7e, and J1b2a); and fully characterized haplogroups N1a and G1b, which were previously described only by the first hypervariable segment (HVS1) sequencing and coding-region restriction-fragment–length polymorphism analysis. Our findings indicate that the southern Siberian mtDNA pool harbors several lineages associated with the Late Upper Paleolithic and/or early Neolithic dispersals from both eastern Asia and southwestern Asia/southern Caucasus. Moreover, the phylogeography of the D2 lineages suggests that southern Siberia is likely to be a geographical source for the last postglacial maximum spread of this subhaplogroup to northern Siberia and that the expansion of the D2b branch occurred in Beringia ∼7,000 years ago. In general, a detailed analysis of mtDNA gene pools of northern Asians provides the additional evidence to rule out the existence of a northern Asian route for the initial human colonization of Asia. PMID:17924343
Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E.; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda
2012-01-01
Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. 
We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated. PMID:23071448
Chen, Pao-Yang; Montanini, Barbara; Liao, Wen-Wei; Morselli, Marco; Jaroszewicz, Artur; Lopez, David; Ottonello, Simone; Pellegrini, Matteo
2014-01-01
Tuber melanosporum, also known in the gastronomic community as "truffle", features one of the largest fungal genomes (125 Mb) with an exceptionally high transposable element (TE) and repetitive DNA content (>58%). The main purpose of DNA methylation in fungi is TE silencing. As obligate outcrossing organisms, truffles are bound to a sexual mode of propagation, which together with TEs is thought to represent a major force driving the evolution of DNA methylation. Thus, it was of interest to examine if and how T. melanosporum exploits DNA methylation to maintain genome integrity. We performed whole-genome DNA bisulfite sequencing and mRNA sequencing on different developmental stages of T. melanosporum; namely, fruitbody ("truffle"), free-living mycelium and ectomycorrhiza. The data revealed a high rate of cytosine methylation (>44%), selectively targeting TEs rather than genes with a strong preference for CpG sites. Whole genome DNA sequencing uncovered multiple TE-enriched, copy number variant regions bearing a significant fraction of hypomethylated and expressed TEs, almost exclusively in free-living mycelium propagated in vitro. Treatment of mycelia with 5-azacytidine partially reduced DNA methylation and increased TE transcription. Our transcriptome assembly also resulted in the identification of a set of novel transcripts from 614 genes. The datasets presented here provide valuable and comprehensive (epi)genomic information that can be of interest for evolutionary genomics studies of multicellular (filamentous) fungi, in particular Ascomycetes belonging to the subphylum, Pezizomycotina. Evidence derived from comparative methylome and transcriptome analyses indicates that a non-exhaustive and partly reversible methylation process operates in truffles. PMID:25392735
Buschmann, Tilo; Zhang, Rong; Brash, Douglas E; Bystrykh, Leonid V
2014-08-07
DNA barcodes are short unique sequences used to label DNA or RNA-derived samples in multiplexed deep sequencing experiments. During the demultiplexing step, barcodes must be detected and their position identified. In some cases (e.g., with PacBio SMRT), the position of the barcode and DNA context is not well defined. Many reads start inside the genomic insert so that adjacent primers might be missed. The matter is further complicated by coincidental similarities between barcode sequences and reference DNA. Therefore, a robust strategy is required in order to detect barcoded reads and avoid a large number of false positives or negatives. For mass inference problems such as this one, false discovery rate (FDR) methods are powerful and balanced solutions. Since existing FDR methods cannot be applied to this particular problem, we present an adapted FDR method that is suitable for the detection of barcoded reads as well as suggest possible improvements. In our analysis, barcode sequences showed high rates of coincidental similarities with the Mus musculus reference DNA. This problem became more acute when the length of the barcode sequence decreased and the number of barcodes in the set increased. The method presented in this paper controls the tail area-based false discovery rate to distinguish between barcoded and unbarcoded reads. This method helps to establish the highest acceptable minimal distance between reads and barcode sequences. In a proof of concept experiment we correctly detected barcodes in 83% of the reads with a precision of 89%. Sensitivity improved to 99% at 99% precision when the adjacent primer sequence was incorporated in the analysis. The analysis was further improved using a paired end strategy. Following an analysis of the data for sequence variants induced in the Atp1a1 gene of C57BL/6 murine melanocytes by ultraviolet light and conferring resistance to ouabain, we found no evidence of cross-contamination of DNA material between samples.
Our method offers a proper quantitative treatment of the problem of detecting barcoded reads in a noisy sequencing environment. It is based on the false discovery rate statistics that allows a proper trade-off between sensitivity and precision to be chosen.
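The thresholding idea in the abstract above can be illustrated with a minimal sketch. This is not the authors' adapted FDR procedure: it assumes that each read has already been reduced to its minimal distance to the barcode set, that a null sample of distances (for example, from unbarcoded reference sequence) is available, and that the null proportion is conservatively taken as 1. The function names `tail_area_fdr` and `largest_acceptable_distance` and the cutoff `alpha` are hypothetical.

```python
from typing import Optional, Sequence


def tail_area_fdr(observed: Sequence[int], null: Sequence[int], t: int) -> float:
    """Estimate the tail-area FDR when every read with distance <= t is called barcoded.

    Assumption: the null proportion is set to 1, so the estimate is conservative.
    """
    if not null:
        raise ValueError("null sample must not be empty")
    called = sum(d <= t for d in observed)
    if called == 0:
        return 0.0  # nothing is called, so there are no false discoveries
    null_tail = sum(d <= t for d in null) / len(null)
    expected_false = null_tail * len(observed)
    return min(1.0, expected_false / called)


def largest_acceptable_distance(
    observed: Sequence[int], null: Sequence[int], alpha: float = 0.05
) -> Optional[int]:
    """Return the largest observed distance cutoff whose estimated FDR is <= alpha."""
    best = None
    for t in sorted(set(observed)):
        if tail_area_fdr(observed, null, t) <= alpha:
            best = t
    return best


# Toy distances, invented purely for illustration.
observed_distances = [0, 0, 0, 1, 1, 1, 1, 4, 5, 5]
null_distances = [3, 4, 4, 5, 5, 5, 6, 6, 4, 5]
cutoff = largest_acceptable_distance(observed_distances, null_distances)
print("largest acceptable distance:", cutoff)
```

Shorter barcodes or larger barcode sets would shift the null distances toward zero in such a sketch, which lowers the acceptable cutoff and matches the trend the abstract reports.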
Phylogenetic relations of humans and African apes from DNA sequences in the Psi eta-globin region
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miyamoto, M.M.; Slightom, J.L.; Goodman, M.
Sequences from the upstream and downstream flanking DNA regions of the Psi eta-globin locus in Pan troglodytes (common chimpanzee), Gorilla gorilla (gorilla), and Pongo pygmaeus (orangutan, the closest living relative to Homo, Pan, and Gorilla) provided further data for evaluating the phylogenetic relations of humans and African apes. These newly sequenced orthologs (an additional 4.9 kilobase pairs (kbp) for each species) were combined with published Psi eta-gene sequences and then compared to the same orthologous stretch (a continuous 7.1-kbp region) available for humans. Phylogenetic analysis of these nucleotide sequences by the parsimony method indicated (i) that human and chimpanzee are more closely related to each other than either is to gorilla and (ii) that the slowdown in the rate of sequence evolution evident in higher primates is especially pronounced in humans. These results indicate that features unique to African apes (but not to humans) are primitive and that even local molecular clocks should be applied with caution.
Apitz, Janina; Weihe, Andreas; Pohlheim, Frank; Börner, Thomas
2013-02-01
While uniparental transmission of mtDNA is widespread and dominating in eukaryotes leaving mutation as the major source of genotypic diversity, recently, biparental inheritance of mitochondrial genes has been demonstrated in reciprocal crosses of Pelargonium zonale and P. inquinans. The thereby arising heteroplasmy carries the potential for recombination between mtDNAs of different descent, i.e. between the parental mitochondrial genomes. We have analyzed these Pelargonium hybrids for mitochondrial intergenomic recombination events by examining differences in DNA blot hybridization patterns of the mitochondrial genes atp1 and cob. Further investigation of these genes and their flanking regions using nucleotide sequence polymorphisms and PCR revealed DNA segments in the progeny, which contained both P. zonale and P. inquinans sequences suggesting an intergenomic recombination in hybrids of Pelargonium. This turns Pelargonium into an interesting subject for studies of recombination and evolutionary dynamics of mitochondrial genomes.
Evidence for a Complex Class of Nonadenylated mRNA in Drosophila
Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.
1980-01-01
The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 × 10^6 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 × 10^7 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 × 10^4, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246
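The arithmetic in this abstract can be checked in a few lines. The sketch below takes the two complexity values and the 1,250-nucleotide average size from the abstract, reading the exponents as 10^6 and 10^7; the variable names are illustrative only and nothing here comes from the original paper's methods.

```python
# Values quoted in the abstract (exponents read as 10^6 and 10^7).
AVERAGE_RNA_SIZE_NT = 1250
poly_a_complexity_nt = 6.73e6   # poly(A+) mRNA coding sequence complexity
total_complexity_nt = 2.18e7    # total polysomal RNA sequence complexity

# Number of distinct average-sized RNA sequences in each class.
poly_a_sequences = poly_a_complexity_nt / AVERAGE_RNA_SIZE_NT
total_sequences = total_complexity_nt / AVERAGE_RNA_SIZE_NT

# How much more complex total polysomal RNA is than the poly(A+) fraction,
# and the share of complexity left for nonadenylated RNA.
fold_difference = total_complexity_nt / poly_a_complexity_nt
nonadenylated_share = 1 - poly_a_complexity_nt / total_complexity_nt

print(f"poly(A+) sequences:   {poly_a_sequences:,.0f}")
print(f"total sequences:      {total_sequences:,.0f}")
print(f"fold difference:      {fold_difference:.1f}")
print(f"nonadenylated share:  {nonadenylated_share:.0%}")
```

The results reproduce the quoted figures of about 5,384 and 17,440 sequences, a 3.2-fold difference, and roughly 69% of complexity attributable to nonadenylated RNA.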
Presence of DNA methyltransferase activity and CpC methylation in Drosophila melanogaster.
Panikar, Chitra S; Rajpathak, Shriram N; Abhyankar, Varada; Deshmukh, Saniya; Deobagkar, Deepti D
2015-12-01
Drosophila melanogaster lacks DNMT1/DNMT3 based methylation machinery. Despite recent reports confirming the presence of low DNA methylation in Drosophila, little is known about the methyltransferase. Therefore, in this study, we have aimed to investigate the possible functioning of DNA methyltransferase in Drosophila. The 14 K oligo microarray slide was incubated with native cell extract from adult Drosophila to check for the presence of methyltransferase activity. After incubation under appropriate conditions, the methylated oligo sequences were identified by the binding of anti 5-methylcytosine monoclonal antibody. The antibody bound to the methylated oligos was detected using Cy3 labeled secondary antibody. Methylation sensitive restriction enzyme mediated PCR was used to assess the methylation at a few selected loci identified on the array. A few of the total oligos were methylated under the assay conditions. Analysis of methylated oligo sequences provides evidence for the presence of de novo methyltransferase activity and allows identification of its sequence specificity in adult Drosophila. With the help of methylation sensitive enzymes we could detect the presence of CpC methylation in the selected genomic regions. This study reports the presence of an active DNA methyltransferase in adult Drosophila, which exhibits sequence specificity confirmed by the presence of asymmetric methylation at corresponding sites in the genomic DNA. It also provides an innovative approach to investigate the methylation specificity of a native methyltransferase.
Double-strand break repair processes drive evolution of the mitochondrial genome in Arabidopsis.
Davila, Jaime I; Arrieta-Montiel, Maria P; Wamboldt, Yashitola; Cao, Jun; Hagmann, Joerg; Shedge, Vikas; Xu, Ying-Zhi; Weigel, Detlef; Mackenzie, Sally A
2011-09-27
The mitochondrial genome of higher plants is unusually dynamic, with recombination and nonhomologous end-joining (NHEJ) activities producing variability in size and organization. Plant mitochondrial DNA also generally displays much lower nucleotide substitution rates than mammalian or yeast systems. Arabidopsis displays these features and expedites characterization of the mitochondrial recombination surveillance gene MSH1 (MutS 1 homolog), lending itself to detailed study of de novo mitochondrial genome activity. In the present study, we investigated the underlying basis for unusual plant features as they contribute to rapid mitochondrial genome evolution. We obtained evidence of double-strand break (DSB) repair, including NHEJ, sequence deletions and mitochondrial asymmetric recombination activity in Arabidopsis wild-type and msh1 mutants on the basis of data generated by Illumina deep sequencing and confirmed by DNA gel blot analysis. On a larger scale, with mitochondrial comparisons across 72 Arabidopsis ecotypes, similar evidence of DSB repair activity differentiated ecotypes. Forty-seven repeat pairs were active in DNA exchange in the msh1 mutant. Recombination sites showed asymmetrical DNA exchange within lengths of 50- to 556-bp sharing sequence identity as low as 85%. De novo asymmetrical recombination involved heteroduplex formation, gene conversion and mismatch repair activities. Substoichiometric shifting by asymmetrical exchange created the appearance of rapid sequence gain and loss in association with particular repeat classes. Extensive mitochondrial genomic variation within a single plant species derives largely from DSB activity and its repair. Observed gene conversion and mismatch repair activity contribute to the low nucleotide substitution rates seen in these genomes. On a phenotypic level, these patterns of rearrangement likely contribute to the reproductive versatility of higher plants.
Nuclear magnetic resonance-based model of a TF1/HmU-DNA complex.
Silva, M V; Pasternack, L B; Kearns, D R
1997-12-15
Transcription factor 1 (TF1), a type II DNA-binding protein encoded by the Bacillus subtilis bacteriophage SPO1, has the capacity for sequence-selective DNA binding and a preference for 5-hydroxymethyl-2'-deoxyuridine (HmU)-containing DNA. In NMR studies of the TF1/HmU-DNA complex, intermolecular NOEs indicate that the flexible beta-ribbon and C-terminal alpha-helix are involved in the DNA-binding site of TF1, placing it in the beta-sheet category of DNA-binding proteins proposed to bind by wrapping two beta-ribbon "arms" around the DNA. Intermolecular and intramolecular NOEs were used to generate an energy-minimized model of the protein-DNA complex in which both DNA bending and protein structure changes are evident.
Design and analysis of linear cascade DNA hybridization chain reactions using DNA hairpins
NASA Astrophysics Data System (ADS)
Bui, Hieu; Garg, Sudhanshu; Miao, Vincent; Song, Tianqi; Mokhtar, Reem; Reif, John
2017-01-01
DNA self-assembly has been employed non-conventionally to construct nanoscale structures and dynamic nanoscale machines. The technique of hybridization chain reactions by triggered self-assembly has been shown to form various interesting nanoscale structures ranging from simple linear DNA oligomers to dendritic DNA structures. Inspired by earlier triggered self-assembly works, we present a system for controlled self-assembly of linear cascade DNA hybridization chain reactions using nine distinct DNA hairpins. NUPACK is employed to assist in designing DNA sequences and Matlab has been used to simulate DNA hairpin interactions. Gel electrophoresis and ensemble fluorescence reaction kinetics data indicate strong evidence of linear cascade DNA hybridization chain reactions. The half-time completion of the proposed linear cascade reactions indicates a linear dependency on the number of hairpins.
Galbany-Casals, M; Carnicero-Campmany, P; Blanco-Moreno, J M; Smissen, R D
2012-09-01
Hybridisation is considered an important evolutionary phenomenon in Gnaphalieae, but contemporary hybridisation has been little explored within the tribe. Here, hybridisation between Helichrysum orientale and Helichrysum stoechas is studied at two different localities in the islands of Crete and Rhodes (Greece). Using three different types of molecular data (AFLP, nrDNA ITS sequences and cpDNA ndhF sequences) and morphological data, the aim is to provide simultaneous and direct comparisons between molecular and morphological variation among the parental species and the studied hybrid populations. AFLP profiles, ITS sequences and morphological data support the existence of hybrids at the two localities studied, shown as morphological and genetic intermediates between the parental species. Chloroplast DNA sequences show that both parental species can act either as pollen donor or as maternal parent. Fertility of hybrids is demonstrated by the viability of seeds produced by hybrids from both localities, and the detection of a backcross specimen to H. orientale. Although there is general congruence of morphological and molecular data, the analysis of morphology and ITS sequences can fail to detect backcross hybrids. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
Brucella Infection in Asian Sea Otters (Enhydra lutris lutris) on Bering Island, Russia.
Burgess, Tristan L; Johnson, Christine Kreuder; Burdin, Alexander; Gill, Verena A; Doroff, Angela M; Tuomi, Pamela; Smith, Woutrina A; Goldstein, Tracey
2017-10-01
Infection with Brucella spp., long known as a cause of abortion, infertility, and reproductive loss in domestic livestock, has increasingly been documented in marine mammals over the past two decades. We report molecular evidence of Brucella infection in Asian sea otters (Enhydra lutris lutris). Brucella DNA was detected in 3 of 78 (4%) rectal swab samples collected between 2004 and 2006 on Bering Island, Russia. These 78 animals had previously been documented to have a Brucella seroprevalence of 28%, markedly higher than the prevalence documented in sea otters (Enhydra lutris) in North America. All of the DNA sequences amplified were identical to one or more previously isolated Brucella spp. including strains from both terrestrial and marine hosts. Phylogenetic analysis of this sequence suggested that one animal was shedding Brucella spp. DNA with a sequence matching a Brucella abortus strain, whereas two animals yielded a sequence matching a group of strains including isolates classified as Brucella pinnipedialis and Brucella melitensis. Our results highlight the diversity of Brucella spp. within a single sea otter population.
On the path to genetic novelties: insights from programmed DNA elimination and RNA splicing.
Catania, Francesco; Schmitz, Jürgen
2015-01-01
Understanding how genetic novelties arise is a central goal of evolutionary biology. To this end, programmed DNA elimination and RNA splicing deserve special consideration. While programmed DNA elimination reshapes genomes by eliminating chromatin during organismal development, RNA splicing rearranges genetic messages by removing intronic regions during transcription. Small RNAs help to mediate this class of sequence reorganization, which is not error-free. It is this imperfection that makes programmed DNA elimination and RNA splicing excellent candidates for generating evolutionary novelties. Leveraging a number of these two processes' mechanistic and evolutionary properties, which have been uncovered over the past years, we present recently proposed models and empirical evidence for how splicing can shape the structure of protein-coding genes in eukaryotes. We also chronicle a number of intriguing similarities between the processes of programmed DNA elimination and RNA splicing, and highlight the role that the variation in the population-genetic environment may play in shaping their target sequences. © 2015 Wiley Periodicals, Inc.
Fetal aneuploidy detection by maternal plasma DNA sequencing: a technology assessment.
Walsh, Judith M E; Goldberg, James D
2013-06-01
The American College of Obstetricians and Gynecologists currently recommends that all pregnant women be offered screening for chromosomal abnormalities, regardless of maternal age. Traditional screening tests have detection rates ranging from 85% to 90% and false-positive rates of 3% to 5%. A woman with an abnormal noninvasive test is offered a diagnostic test, but diagnostic tests are associated with a risk of pregnancy loss. Recently, analysis of cell-free fetal DNA (cffDNA) in maternal blood has been shown to have potential for the accurate detection of some of the common fetal autosomal aneuploidies. As part of a technology assessment for the California Technology Assessment Forum, we critically reviewed the evidence for the use of cffDNA as a prenatal screening test. We evaluated the evidence for its use as either a 'primary' or an 'advanced' screening test and for its use in screening for three different trisomies: 21, 18, and 13. We evaluated whether the use of cffDNA met established technology assessment criteria and established conclusions about evidence-based use of this new technology. © 2013 John Wiley & Sons, Ltd.
Forensics and mitochondrial DNA: applications, debates, and foundations.
Budowle, Bruce; Allard, Marc W; Wilson, Mark R; Chakraborty, Ranajit
2003-01-01
Debate on the validity and reliability of scientific methods often arises in the courtroom. When the government (i.e., the prosecution) is the proponent of evidence, the defense is obliged to challenge its admissibility. Regardless, those who seek to use DNA typing methodologies to analyze forensic biological evidence have a responsibility to understand the technology and its applications so a proper foundation(s) for its use can be laid. Mitochondrial DNA (mtDNA), an extranuclear genome, has certain features that make it desirable for forensics, namely, high copy number, lack of recombination, and matrilineal inheritance. mtDNA typing has become routine in forensic biology and is used to analyze old bones, teeth, hair shafts, and other biological samples where nuclear DNA content is low. To evaluate results obtained by sequencing the two hypervariable regions of the control region of the human mtDNA genome, one must consider the genetically related issues of nomenclature, reference population databases, heteroplasmy, paternal leakage, recombination, and, of course, interpretation of results. We describe the approaches, the impact some issues may have on interpretation of mtDNA analyses, and some issues raised in the courtroom.
2010-01-01
Bombyx mori and Bombyx mandarina are morphologically and physiologically similar. In this study, we compared the nucleotide variations in the complete mitochondrial (mt) genomes between the domesticated silkmoth, B. mori, and its wild ancestors, Chinese B. mandarina (ChBm) and Japanese B. mandarina (JaBm). The sequence divergence and transition mutation ratio between B. mori and ChBm are significantly smaller than those observed between B. mori and JaBm. The preference of transition by DNA strands between B. mori and ChBm is consistent with that between B. mori and JaBm, however, the regional variation in nucleotide substitution rate shows a different feature. These results suggest that the ChBm mt genome is not undergoing the same evolutionary process as JaBm, providing evidence for selection on mtDNA. Moreover, investigation of the nucleotide sequence divergence in the A+T-rich region of Bombyx mt genomes also provides evidence for the assumption that the A+T-rich region might not be the fastest evolving region of the mtDNA of insects. PMID:21637625
Gao, Yuan; Zhang, Yan; Yang, Xin; Qiu, Jian-Hua; Duan, Hong; Xu, Wen-Wen; Chang, Qiao-Cheng; Wang, Chun-Ren
2017-01-01
Equine strongyles, the significant nematode pathogens of horses, are characterized by high quantities and species abundance, but classification of this group of parasitic nematodes is debated. Mitochondrial (mt) genome DNA data are often used to address classification controversies. Thus, the objectives of this study were to determine the complete mt genomes of three Cyathostominae nematode species (Cyathostomum catinatum, Cylicostephanus minutus, and Poteriostomum imparidentatum) of horses and reconstruct the phylogenetic relationship of Strongylidae with other nematodes in Strongyloidea to test the hypothesis that Triodontophorus spp. belong to Cyathostominae using the mt genomes. The mt genomes of Cy. catinatum, Cs. minutus, and P. imparidentatum were 13,838, 13,826, and 13,817 bp in length, respectively. Complete mt nucleotide sequence comparison of all Strongylidae nematodes revealed that sequence identity ranged from 77.8 to 91.6%. The mt genome sequences of Triodontophorus species had relatively high identity with Cyathostominae nematodes, rather than Strongylus species of the same subfamily (Strongylinae). Comparative analyses of mt genome organization for Strongyloidea nematodes sequenced to date revealed that members of this superfamily possess identical gene arrangements. Phylogenetic analyses using mtDNA data indicated that the Triodontophorus species clustered with Cyathostominae species instead of Strongylus species. The present study first determined the complete mt genome sequences of Cy. catinatum, Cs. minutus, and P. imparidentatum, which will provide novel genetic markers for further studies of Strongylidae taxonomy, population genetics, and systematics. Importantly, sequence comparison and phylogenetic analyses based on mtDNA sequences supported the hypothesis that Triodontophorus belongs to Cyathostominae. PMID:28824575
Xiao, Yong; Yang, Zhao-hui; Zeng, Guang-ming; Ma, Yan-he; Liu, You-sheng; Wang, Rong-juan; Xu, Zheng-yong
2007-05-01
To study the bacterial diversity and the mechanism of denitrification in a sequencing batch biofilm reactor (SBBR) treating landfill leachate, and to provide microbial evidence for technique improvements, total microbial DNA was extracted from samples collected from natural landfill leachate and from the biofilm of an SBBR that could efficiently remove high concentrations of NH4+-N and COD. 16S rDNA fragments were successfully amplified from the total DNA using a pair of universal bacterial 16S rDNA primers, GC341F and 907R, and were then used for denaturing gradient gel electrophoresis (DGGE) analysis. The bands in the gel were analyzed by statistical methods and excised from the gel for sequencing; the sequences were used for homology analysis, and two phylogenetic trees were then constructed using DNAStar software. Results indicated that the bacterial diversity of the biofilm in the SBBR and of the landfill leachate was abundant, and that no obvious change in community structure occurred in the biofilm during operation, with most bacteria originating from the landfill leachate. There may be three different modes of denitrification in the reactor because several different nitrifying bacteria, denitrifying bacteria and anaerobic ammonia oxidation bacteria coexisted in it. The results provide some valuable references for studying the microbiological mechanism of denitrification in SBBR.
Modahl, Cassandra M.; Mackessy, Stephen P.
2016-01-01
Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. 
This approach, requiring only venom, provides access to cDNA sequences in the absence of living specimens, even from commercial venom sources, to evaluate important regional differences in venom composition and to study snake venom protein evolution. PMID:27280639
Identification of presumed ancestral DNA sequences of phaseolin in Phaseolus vulgaris.
Kami, J; Velásquez, V B; Debouck, D G; Gepts, P
1995-01-01
Common bean (Phaseolus vulgaris) consists of two major geographic gene pools, one distributed in Mexico, Central America, and Colombia and the other in the southern Andes (southern Peru, Bolivia, and Argentina). Amplification and sequencing of members of the multigene family coding for phaseolin, the major seed storage protein of the common bean, provide evidence for accumulation of tandem direct repeats in both introns and exons during evolution of the multigene family in this species. The presumed ancestral phaseolin sequences, without tandem repeats, were found in recently discovered but nearly extinct wild common bean populations of Ecuador and northern Peru that are intermediate between the two major gene pools of the species based on geographical and molecular arguments. Our results illustrate the usefulness of tandem direct repeats in establishing the polarity of DNA sequence divergence and therefore in proposing phylogenies. Images Fig. 1 Fig. 3 PMID:7862642
DOE Office of Scientific and Technical Information (OSTI.GOV)
McGrath, B.C.; Dunn, J.J.; France, L.L.
1995-12-31
Lyme borreliosis, caused by the spirochete Borrelia burgdorferi, is the most common vector-borne disease in North America and Western Europe. A better understanding of the major outer surface lipoproteins OspA and OspB, which elicit the major delayed immune response in humans, is of much interest. These proteins have been shown to exhibit three distinct phylogenetic genotypes based on their DNA sequences. This paper describes the cloning of genomic DNA for each variant and its amplification by PCR. DNA sequence data were used for computer-driven phylogenetic analysis and to deduce amino acid sequences. Overproduction of variant OspAs was carried out in E. coli using a T7-based expression system. Circular dichroism and fluorescence studies were carried out on the recombinant B31 PspA, yielding evidence supporting a B31 protein containing 11% alpha-helix, 34% antiparallel beta-sheet, and 12% parallel beta-sheet.
NASA Astrophysics Data System (ADS)
Amin, Muhammad Hilman Fu'adil; Pidada, Ida Bagus Rai; Sugiharto, Widyatmoko, Johan Nuari; Irawan, Bambang
2016-03-01
Species identification and taxonomy of sea cucumbers remain a challenging problem in some taxa. Sea cucumbers of the family Caudinidae are commercialized in Surabaya, where they are used to make sea cucumber chips. Caudinid sea cucumbers have similar morphology, so it is hard to identify them from morphological appearance alone. DNA barcoding is a useful method to overcome this problem. The aim of this study was to identify a Caudinid sea cucumber specimen from East Java using morphological and molecular approaches. The sample was collected from the east coast of Surabaya and preserved in absolute ethanol. After DNA isolation, the Cytochrome Oxidase I (COI) gene was amplified using an Echinoderm universal primer, and the PCR product was sequenced. The sequencing result was analyzed and identified against the NCBI database using BLAST. Results showed that the Caudinid specimen is closely related to the Acaudina molpadioides sequence in GenBank, with 86% identity. Morphological data, especially those based on ossicles, also showed that the specimen is Acaudina molpadioides.
Roschewski, Mark; Dunleavy, Kieron; Pittaluga, Stefania; Moorhead, Martin; Pepin, Francois; Kong, Katherine; Shovlin, Margaret; Jaffe, Elaine S; Staudt, Louis M; Lai, Catherine; Steinberg, Seth M; Chen, Clara C; Zheng, Jianbiao; Willis, Thomas D; Faham, Malek; Wilson, Wyndham H
2015-05-01
Diffuse large-B-cell lymphoma is curable, but when treatment fails, outcome is poor. Although imaging scans can help to identify patients at risk of treatment failure, they are often imprecise, and radiation exposure is a potential health risk. We aimed to assess whether circulating tumour DNA encoding the clonal immunoglobulin gene sequence could be detected in the serum of patients with diffuse large-B-cell lymphoma and used to predict clinical disease recurrence after frontline treatment. We used next-generation DNA sequencing to retrospectively analyse cell-free circulating tumour DNA in patients assigned to one of three treatment protocols between May 8, 1993, and June 6, 2013. Eligible patients had diffuse large-B-cell lymphoma, no evidence of indolent lymphoma, and were previously untreated. We obtained serial serum samples and concurrent CT scans at specified times during most treatment cycles and up to 5 years of follow-up. VDJ gene segments of the rearranged immunoglobulin receptor genes were amplified and sequenced from pretreatment specimens and serum circulating tumour DNA encoding the VDJ rearrangements was quantitated. Tumour clonotypes were identified in pretreatment specimens from 126 patients who were followed up for a median of 11 years (IQR 6·8-14·2). Interim monitoring of circulating tumour DNA at the end of two treatment cycles in 108 patients showed a 5-year time to progression of 41·7% (95% CI 22·2-60·1) in patients with detectable circulating tumour DNA and 80·2% (69·6-87·3) in those without detectable circulating tumour DNA (p<0·0001). Detectable interim circulating tumour DNA had a positive predictive value of 62·5% (95% CI 40·6-81·2) and a negative predictive value of 79·8% (69·6-87·8). Surveillance monitoring of circulating tumour DNA was done in 107 patients who achieved complete remission. 
A Cox proportional hazards model showed that the hazard ratio for clinical disease progression was 228 (95% CI 51-1022) for patients who developed detectable circulating tumour DNA during surveillance compared with patients with undetectable circulating tumour DNA (p<0·0001). Surveillance circulating tumour DNA had a positive predictive value of 88·2% (95% CI 63·6-98·5) and a negative predictive value of 97·8% (92·2-99·7) and identified risk of recurrence at a median of 3·5 months (range 0-200) before evidence of clinical disease. Surveillance circulating tumour DNA identifies patients at risk of recurrence before clinical evidence of disease in most patients and results in a reduced disease burden at relapse. Interim circulating tumour DNA is a promising biomarker to identify patients at high risk of treatment failure. National Cancer Institute and Adaptive Biotechnologies. Copyright © 2015 Elsevier Ltd. All rights reserved.
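The positive and negative predictive values reported above follow from a two-by-two table of test result against outcome. A minimal sketch of that arithmetic is given here. The counts in the example are hypothetical and are not taken from the study, and the function name is an assumption made for illustration.

```python
def predictive_values(tp: int, fp: int, tn: int, fn: int) -> tuple[float, float]:
    """Return (PPV, NPV) as fractions from a 2x2 confusion table.

    tp: test positive, disease progressed
    fp: test positive, no progression
    tn: test negative, no progression
    fn: test negative, disease progressed
    """
    if tp + fp == 0 or tn + fn == 0:
        raise ValueError("need at least one positive and one negative test result")
    ppv = tp / (tp + fp)  # fraction of positive tests that were correct
    npv = tn / (tn + fn)  # fraction of negative tests that were correct
    return ppv, npv


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    ppv, npv = predictive_values(tp=30, fp=10, tn=50, fn=10)
    print(f"PPV = {ppv:.1%}, NPV = {npv:.1%}")
```

Both values depend on how common progression is in the cohort, so they do not transfer directly to populations with a different baseline risk.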
Molecular Characterization of a Non–Babesia divergens Organism Causing Zoonotic Babesiosis in Europe
Cacciò, Simone; Gherlinzoni, Filippo; Aspöck, Horst; Slemenda, Susan B.; Piccaluga, PierPaolo; Martinelli, Giovanni; Edelhofer, Renate; Hollenstein, Ursula; Poletti, Giovanni; Pampiglione, Silvio; Löschenberger, Karin; Tura, Sante; Pieniazek, Norman J.
2003-01-01
In Europe, most reported human cases of babesiosis have been attributed, without strong molecular evidence, to infection with the bovine parasite Babesia divergens. We investigated the first known human cases of babesiosis in Italy and Austria, which occurred in two asplenic men. The complete 18S ribosomal RNA (18S rRNA) gene was amplified from specimens of their whole blood by polymerase chain reaction (PCR). With phylogenetic analysis, we compared the DNA sequences of the PCR products with those for other Babesia spp. The DNA sequences were identical for the organism from the two patients. In phylogenetic analysis, the organism clusters with B. odocoilei, a parasite of white-tailed deer; these two organisms form a sister group with B. divergens. This evidence indicates the patients were not infected with B. divergens but with an organism with previously unreported molecular characteristics for the 18S rRNA gene. PMID:12967491
PATENTS IN GENOMICS AND HUMAN GENETICS
Cook-Deegan, Robert; Heaney, Christopher
2010-01-01
Genomics and human genetics are scientifically fundamental and commercially valuable. These fields grew to prominence in an era of growth in government and nonprofit research funding, and of even greater growth of privately funded research and development in biotechnology and pharmaceuticals. Patents on DNA technologies are a central feature of this story, illustrating how patent law adapts---and sometimes fails to adapt---to emerging genomic technologies. In instrumentation and for therapeutic proteins, patents have largely played their traditional role of inducing investment in engineering and product development, including expensive postdiscovery clinical research to prove safety and efficacy. Patents on methods and DNA sequences relevant to clinical genetic testing show less evidence of benefits and more evidence of problems and impediments, largely attributable to university exclusive licensing practices. Whole-genome sequencing will confront uncertainty about infringing granted patents but jurisprudence trends away from upholding the broadest and potentially most troublesome patent claims. PMID:20590431
[Tonoplast transport and salt tolerance in plants]
DOE Office of Scientific and Technical Information (OSTI.GOV)
Taiz, L.
1993-01-01
We have shown that the tonoplast V-ATPase could be specifically inhibited by antisense DNA to the catalytic (A) subunit and that cell expansion was inhibited in carrot transformants deficient in the enzyme, and we have provided evidence for at least two different isoforms of the A subunit which are Golgi- and tonoplast-specific. These findings prompted a search for sequences of the isoforms of the A subunit in carrot. We have cloned and sequenced 1.0--1.5 kb fragments of three different genes for the catalytic subunit; the fragments differ greatly in their introns, but have nearly identical exons. We are using PCR to amplify and subclone carrot seedling cDNA. Thus far two bands have been amplified and are currently being subcloned for sequencing.
[Tonoplast transport and salt tolerance in plants]. Progress report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Taiz, L.
1993-04-01
We have shown that the tonoplast V-ATPase could be specifically inhibited by antisense DNA to the catalytic (A) subunit and that cell expansion was inhibited in carrot transformants deficient in the enzyme, and we have provided evidence for at least two different isoforms of the A subunit which are Golgi- and tonoplast-specific. These findings prompted a search for sequences of the isoforms of the A subunit in carrot. We have cloned and sequenced 1.0--1.5 kb fragments of three different genes for the catalytic subunit; the fragments differ greatly in their introns, but have nearly identical exons. We are using PCR to amplify and subclone carrot seedling cDNA. Thus far two bands have been amplified and are currently being subcloned for sequencing.
Detection of Human Papillomavirus Type 2 Related Sequence in Oral Papilloma
Yamaguchi, Taihei; Shindoh, Masanobu; Amemiya, Akira; Inoue, Nobuo; Kawamura, Masaaki; Sakaoka, Hiroshi; Inoue, Masakazu; Fujinaga, Kei
1998-01-01
Oral papilloma is a benign tumourous lesion. Part of this lesion is associated with human papillomavirus (HPV) infection. We analysed the genetical and histopathological evidence for HPV type 2 infection in three oral papillomas. Southern blot hybridization showed HPV 2a sequence in one lesion. Cells of the positive specimen appeared to contain high copy numbers of the viral DNA in an episomal state. In situ staining demonstrated virus capsid antigen in koilocytotic cells and surrounding cells in the hyperplastic epithelial layer. Two other specimens contained no HPV sequences by labeled probe of full length linear HPVs 2a, 6b, 11, 16, 18, 31 and 33 DNA under low stringency hybridization conditions. These results showed the possibility that HPV 2 plays a role in oral papilloma. PMID:9699941
DOE Office of Scientific and Technical Information (OSTI.GOV)
Geraghty, M.T.; Stetten, G.; Kearns, W.
1994-09-01
X-linked adrenoleukodystrophy (ALD) is a disorder of peroxisomal β-oxidation of very long chain fatty acids. It presents either as progressive dementia in childhood or as progressive paraparesis in later years. Adrenal insufficiency occurs in both phenotypes. The gene of the ALD protein has been mapped to Xq28 and has recently been cloned and characterized. The ALD protein has significant homology to the peroxisomal membrane protein, PMP70 and belongs to the ATP binding cassette superfamily of transporters. We screened a human genomic library with an ALDP cDNA and isolated 5 different but highly similar clones containing sequences corresponding to the 3′ end of the ALDP gene. Comparison of the sequences over the region corresponding to exon 9 through the 3′ end of the ALDP gene reveals ~96% nucleotide identity in both exonic and intronic regions. Splice sites and open reading frames are maintained. Using both FISH and human-rodent DNA mapping panels, we positively assign these ALDP-related sequences to chromosomes 2, 16 and 22, and provisionally to 1 and 20. Southern blot of primate DNA probed with a partial ALDP cDNA (exon 2-10) shows that expansion of ALDP-related sequences occurred in higher primates (chimp, gorilla and human). Although Northern blots show multiple ALDP-hybridizing transcripts in certain tissues, we have no evidence to date for expression of these ALDP-related sequences. In conclusion, our data show there has been an unusual and recent dispersal to multiple chromosomes of structural gene sequences related to the ALDP gene. The functional significance of these sequences remains to be determined but their existence complicates PCR and mutation analysis of the ALDP gene.
Grace, Christy R.; Ferreira, Antonio M.; Waddell, M. Brett; Ridout, Granger; Naeve, Deanna; Leuze, Michael; LoCascio, Philip F.; Panetta, John C.; Wilkinson, Mark R.; Pui, Ching-Hon; Naeve, Clayton W.; Uberbacher, Edward C.; Bonten, Erik J.; Evans, William E.
2016-01-01
MicroRNAs are important regulators of gene expression, acting primarily by binding to sequence-specific locations on already transcribed messenger RNAs (mRNA) and typically down-regulating their stability or translation. Recent studies indicate that microRNAs may also play a role in up-regulating mRNA transcription levels, although a definitive mechanism has not been established. Double-helical DNA is capable of forming triple-helical structures through Hoogsteen and reverse Hoogsteen interactions in the major groove of the duplex, and we show physical evidence (i.e., NMR, FRET, SPR) that purine or pyrimidine-rich microRNAs of appropriate length and sequence form triple-helical structures with purine-rich sequences of duplex DNA, and identify microRNA sequences that favor triplex formation. We developed an algorithm (Trident) to search genome-wide for potential triplex-forming sites and show that several mammalian and non-mammalian genomes are enriched for strong microRNA triplex binding sites. We show that those genes containing sequences favoring microRNA triplex formation are markedly enriched (3.3 fold, p<2.2 × 10−16) for genes whose expression is positively correlated with expression of microRNAs targeting triplex binding sequences. This work has thus revealed a new mechanism by which microRNAs could interact with gene promoter regions to modify gene transcription. PMID:26844769
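The abstract above describes a genome-wide search for purine-rich duplex sequences that could support triplex formation. The sketch below is not the Trident algorithm, whose scoring is not described here. It only shows the simplest related idea, finding uninterrupted purine (A/G) runs on one strand. The function name and the default minimum length of 10 are assumptions made for illustration.

```python
import re


def purine_rich_runs(seq: str, min_len: int = 10) -> list[tuple[int, int, str]]:
    """Return (start, end, run) for each uninterrupted A/G run of at least min_len.

    Coordinates are 0-based and half-open. Only the given strand is scanned,
    so a caller would scan the reverse complement separately.
    """
    if min_len < 1:
        raise ValueError("min_len must be positive")
    pattern = re.compile(r"[AG]{%d,}" % min_len)
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(seq.upper())]


if __name__ == "__main__":
    # Toy sequence, not a real promoter.
    toy = "CTCT" + "AGGAAGGAGA" + "TTC" + "AGA"
    for start, end, run in purine_rich_runs(toy):
        print(start, end, run)
```

A realistic search would tolerate interruptions and score candidate sites against specific microRNA sequences. This sketch does neither.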
Packaging of Dinoroseobacter shibae DNA into Gene Transfer Agent Particles Is Not Random.
Tomasch, Jürgen; Wang, Hui; Hall, April T K; Patzelt, Diana; Preusse, Matthias; Petersen, Jörn; Brinkmann, Henner; Bunk, Boyke; Bhuju, Sabin; Jarek, Michael; Geffers, Robert; Lang, Andrew S; Wagner-Döbler, Irene
2018-01-01
Gene transfer agents (GTAs) are phage-like particles which contain a fragment of genomic DNA of the bacterial or archaeal producer and deliver this to a recipient cell. GTA gene clusters are present in the genomes of almost all marine Rhodobacteraceae (Roseobacters) and might be important contributors to horizontal gene transfer in the world's oceans. For all organisms studied so far, no obvious evidence of sequence specificity or other nonrandom process responsible for packaging genomic DNA into GTAs has been found. Here, we show that knock-out of an autoinducer synthase gene of Dinoroseobacter shibae resulted in overproduction and release of functional GTA particles (DsGTA). Next-generation sequencing of the 4.2-kb DNA fragments isolated from DsGTAs revealed that packaging was not random. DNA from low-GC conjugative plasmids but not from high-GC chromids was excluded from packaging. Seven chromosomal regions were strongly overrepresented in DNA isolated from DsGTA. These packaging peaks lacked identifiable conserved sequence motifs that might represent recognition sites for the GTA terminase complex. Low-GC regions of the chromosome, including the origin and terminus of replication, were underrepresented in DNA isolated from DsGTAs. DNA methylation reduced packaging frequency while the level of gene expression had no influence. Chromosomal regions found to be over- and underrepresented in DsGTA-DNA were regularly spaced. We propose that a "headful" type of packaging is initiated at the sites of coverage peaks and, after linearization of the chromosomal DNA, proceeds in both directions from the initiation site. GC-content, DNA-modifications, and chromatin structure might influence at which sides GTA packaging can be initiated. © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
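Because the packaging bias described above is related to regional GC content, a windowed GC profile is a natural first look at such data. The sketch below computes the GC fraction in fixed-size windows along a sequence. The function name, window size and step are illustrative assumptions and do not reproduce the analysis pipeline of the study.

```python
def gc_content_windows(seq: str, window: int, step: int) -> list[tuple[int, float]]:
    """Return (window_start, GC fraction) for each full window along seq.

    Trailing bases that do not fill a whole window are ignored.
    """
    if window < 1 or step < 1:
        raise ValueError("window and step must be positive")
    seq = seq.upper()
    profile = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        gc = sum(1 for base in chunk if base in "GC")
        profile.append((start, gc / window))
    return profile


if __name__ == "__main__":
    # Toy sequence with a GC-rich half followed by an AT-rich half.
    print(gc_content_windows("GGCCAATT", window=4, step=4))
```

Such a profile could then be compared against read coverage in the same windows to look for under- or overrepresented regions.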
Packaging of Dinoroseobacter shibae DNA into Gene Transfer Agent Particles Is Not Random
Wang, Hui; Hall, April T K; Patzelt, Diana; Preusse, Matthias; Petersen, Jörn; Brinkmann, Henner; Bunk, Boyke; Bhuju, Sabin; Jarek, Michael; Geffers, Robert; Lang, Andrew S; Wagner-Döbler, Irene
2018-01-01
Gene transfer agents (GTAs) are phage-like particles which contain a fragment of genomic DNA of the bacterial or archaeal producer and deliver this to a recipient cell. GTA gene clusters are present in the genomes of almost all marine Rhodobacteraceae (Roseobacters) and might be important contributors to horizontal gene transfer in the world’s oceans. For all organisms studied so far, no obvious evidence of sequence specificity or other nonrandom process responsible for packaging genomic DNA into GTAs has been found. Here, we show that knock-out of an autoinducer synthase gene of Dinoroseobacter shibae resulted in overproduction and release of functional GTA particles (DsGTA). Next-generation sequencing of the 4.2-kb DNA fragments isolated from DsGTAs revealed that packaging was not random. DNA from low-GC conjugative plasmids but not from high-GC chromids was excluded from packaging. Seven chromosomal regions were strongly overrepresented in DNA isolated from DsGTA. These packaging peaks lacked identifiable conserved sequence motifs that might represent recognition sites for the GTA terminase complex. Low-GC regions of the chromosome, including the origin and terminus of replication, were underrepresented in DNA isolated from DsGTAs. DNA methylation reduced packaging frequency while the level of gene expression had no influence. Chromosomal regions found to be over- and underrepresented in DsGTA-DNA were regularly spaced. We propose that a “headful” type of packaging is initiated at the sites of coverage peaks and, after linearization of the chromosomal DNA, proceeds in both directions from the initiation site. GC-content, DNA-modifications, and chromatin structure might influence at which sides GTA packaging can be initiated. PMID:29325123
Qiu, Guo-Hua
2016-01-01
In this review, the protective function of the abundant non-coding DNA in the eukaryotic genome is discussed from the perspective of genome defense against exogenous nucleic acids. Peripheral non-coding DNA has been proposed to act as a bodyguard that protects the genome and the central protein-coding sequences from ionizing radiation-induced DNA damage. In the proposed mechanism of protection, the radicals generated by water radiolysis in the cytosol and IR energy are absorbed, blocked and/or reduced by peripheral heterochromatin; then, the DNA damage sites in the heterochromatin are removed and expelled from the nucleus to the cytoplasm through nuclear pore complexes, most likely through the formation of extrachromosomal circular DNA. To strengthen this hypothesis, this review summarizes the experimental evidence supporting the protective function of non-coding DNA against exogenous nucleic acids. Based on these data, I hypothesize herein about the presence of an additional line of defense formed by small RNAs in the cytosol in addition to their bodyguard protection mechanism in the nucleus. Therefore, exogenous nucleic acids may be initially inactivated in the cytosol by small RNAs generated from non-coding DNA via mechanisms similar to the prokaryotic CRISPR-Cas system. Exogenous nucleic acids may enter the nucleus, where some are absorbed and/or blocked by heterochromatin and others integrate into chromosomes. The integrated fragments and the sites of DNA damage are removed by repetitive non-coding DNA elements in the heterochromatin and excluded from the nucleus. Therefore, the normal eukaryotic genome and the central protein-coding sequences are triply protected by non-coding DNA against invasion by exogenous nucleic acids. This review provides evidence supporting the protective role of non-coding DNA in genome defense. Copyright © 2016 Elsevier B.V. All rights reserved.
Effects of the Ion PGM™ Hi-Q™ sequencing chemistry on sequence data quality.
Churchill, Jennifer D; King, Jonathan L; Chakraborty, Ranajit; Budowle, Bruce
2016-09-01
Massively parallel sequencing (MPS) offers substantial improvements over current forensic DNA typing methodologies such as increased resolution, scalability, and throughput. The Ion PGM™ is a promising MPS platform for analysis of forensic biological evidence. The system employs a sequencing-by-synthesis chemistry on a semiconductor chip that measures a pH change due to the release of hydrogen ions as nucleotides are incorporated into the growing DNA strands. However, implementation of MPS into forensic laboratories requires a robust chemistry. Ion Torrent's Hi-Q™ Sequencing Chemistry was evaluated to determine if it could improve on the quality of the generated sequence data in association with selected genetic marker targets. The whole mitochondrial genome and the HID-Ion STR 10-plex panel were sequenced on the Ion PGM™ system with the Ion PGM™ Sequencing 400 Kit and the Ion PGM™ Hi-Q™ Sequencing Kit. Concordance, coverage, strand balance, noise, and deletion ratios were assessed in evaluating the performance of the Ion PGM™ Hi-Q™ Sequencing Kit. The results indicate that reliable, accurate data are generated and that sequencing through homopolymeric regions can be improved with the use of Ion Torrent's Hi-Q™ Sequencing Chemistry. Overall, the quality of the generated sequencing data supports the potential for use of the Ion PGM™ in forensic genetic laboratories.
Scar-less multi-part DNA assembly design automation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hillson, Nathan J.
The present invention provides a method of designing an implementation of a DNA assembly. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding flanking homology sequences to each of the DNA oligos. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding optimized overhang sequences to each of the DNA oligos.
Marr, Melissa M; Brace, Selina; Schreve, Danielle C; Barnes, Ian
2018-02-09
Establishing true phylogenetic relationships between populations is a critical consideration when sourcing individuals for translocation. This presents huge difficulties with threatened and endangered species that have become extirpated from large areas of their former range. We utilise ancient DNA (aDNA) to reconstruct the phylogenetic relationships of a keystone species which has become extinct in Britain, the Eurasian beaver Castor fiber. We sequenced seventeen 492 bp partial tRNAPro and control region sequences from Late Pleistocene and Holocene age beavers and included these in network, demographic and genealogy analyses. The mode of postglacial population expansion from refugia was investigated by employing tests of neutrality and a pairwise mismatch distribution analysis. We found evidence of a pre-Late Glacial Maximum ancestor for the Western C. fiber clade which experienced a rapid demographic expansion during the terminal Pleistocene to early Holocene period. Ancient British beavers were found to originate from the Western phylogroup but showed no phylogenetic affinity to any one modern relict population over another. Instead, we find that they formed part of a large, continuous, pan-Western European clade that harbored little internal substructure. Our study highlights the utility of aDNA in reconstructing population histories of extirpated species which has real-world implications for conservation planning.
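The pairwise mismatch distribution mentioned above is a histogram of the number of differing sites over all pairs of sequences. A minimal sketch follows, assuming equal-length aligned sequences and counting every differing column as one mismatch. The function name and the toy sequences are assumptions made for illustration and are not data from the study.

```python
from collections import Counter
from itertools import combinations


def mismatch_distribution(seqs: list[str]) -> dict[int, int]:
    """Map each pairwise difference count to the number of sequence pairs showing it."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    counts = Counter()
    for a, b in combinations(seqs, 2):
        differences = sum(1 for x, y in zip(a, b) if x != y)
        counts[differences] += 1
    return dict(sorted(counts.items()))


if __name__ == "__main__":
    # Toy haplotypes for illustration only.
    print(mismatch_distribution(["AAAA", "AAAT", "AATT"]))
```

A smooth, unimodal distribution is commonly read as a signature of past demographic expansion, although formal inference requires fitting an expansion model rather than inspecting the histogram alone.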
Qin, QinBo; Wang, Juan; Wang, YuDe; Liu, Yun; Liu, ShaoJun
2015-03-13
Offspring with 100 chromosomes (abbreviated as GRCC) have been obtained in the first generation of Carassius auratus red var. (abbreviated as RCC, 2n = 100) (♀) × Megalobrama amblycephala (abbreviated as BSB, 2n = 48) (♂), and both females and unexpected males are found among them. Chromosomal and karyotypic analyses of GRCC have been reported and a gynogenetic origin has been suggested, but genetic evidence has been lacking. Fluorescence in situ hybridization with species-specific centromere probes directly proves that GRCC possess two sets of RCC-derived chromosomes. Sequence analysis of the coding region (5S) and adjacent nontranscribed spacer (abbreviated as NTS) reveals that the three 5S rDNA classes (class I, class II and class III) in GRCC are completely inherited from their female parent (RCC) and show obvious base variations and insertions-deletions. Fluorescence in situ hybridization with the entire 5S rDNA probe reveals obvious variation in chromosomal loci (class I and class II) in GRCC. This paper provides direct genetic evidence that GRCC are of gynogenetic origin. In addition, our results reveal that gynogenesis induced by distant hybridization can lead to obvious variation in the sequence and in some chromosomal loci of the 5S rDNA gene.
Silva Jr., Wilson A.; Bonatto, Sandro L.; Holanda, Adriano J.; Ribeiro-dos-Santos, Andrea K.; Paixão, Beatriz M.; Goldman, Gustavo H.; Abe-Sandes, Kiyoko; Rodriguez-Delfin, Luis; Barbosa, Marcela; Paçó-Larson, Maria Luiza; Petzl-Erler, Maria Luiza; Valente, Valeria; Santos, Sidney E. B.; Zago, Marco A.
2002-01-01
There is general agreement that the Native American founder populations migrated from Asia into America through Beringia sometime during the Pleistocene, but the hypotheses concerning the ages and the number of these migrations and the size of the ancestral populations are surrounded by controversy. DNA sequence variations of several regions of the genome of Native Americans, especially in the mitochondrial DNA (mtDNA) control region, have been studied as a tool to help answer these questions. However, the small number of nucleotides studied and the nonclocklike rate of mtDNA control-region evolution impose several limitations to these results. Here we provide the sequence analysis of a continuous region of 8.8 kb of the mtDNA outside the D-loop for 40 individuals, 30 of whom are Native Americans whose mtDNA belongs to the four founder haplogroups. Haplogroups A, B, and C form monophyletic clades, but the five haplogroup D sequences have unstable positions and usually do not group together. The high degree of similarity in the nucleotide diversity and time of differentiation (i.e., ∼21,000 years before present) of these four haplogroups support a common origin for these sequences and suggest that the populations who harbor them may also have a common history. Additional evidence supports the idea that this age of differentiation coincides with the process of colonization of the New World and supports the hypothesis of a single and early entry of the ancestral Asian population into the Americas. PMID:12022039
On the roles of repetitive DNA elements in the context of a unified genomic-epigenetic system.
von Sternberg, Richard
2002-12-01
Repetitive DNA sequences comprise a substantial portion of most eukaryotic and some prokaryotic chromosomes. Despite nearly forty years of research, the functions of various sequence families as a whole and their monomer units remain largely unknown. The inability to map specific functional roles onto many repetitive DNA elements (REs), coupled with the taxon-specificity of sequence families, have led many to speculate that these genomic components are "selfish" replicators generating genomic "junk." The purpose of this paper is to critically examine the selfishness, evolutionary effects, and functionality of REs. First, a brief overview of the range of ideas pertaining to RE function is presented. Second, the argument is presented that the selfish DNA "hypothesis" is actually a narrative scheme, that it serves to protect neo-Darwinian assumptions from criticism, and that this story is untestable and therefore not a hypothesis. Third, attempts to synthesize the selfish DNA concept with complex systems models of the genome and RE functionality are critiqued. Fourth, the supposed connection between RE-induced mutations and macroevolutionary events are stated to be at variance with empirical evidence and theoretical considerations. Hypotheses that base phylogenetic transitions in repetitive sequence changes thus remain speculative. Fifth and finally, the case is made for viewing REs as integrally functional components of chromosomes, genomes, and cells. It is argued throughout that a new conceptual framework is needed for understanding the roles of repetitive DNA in genomic/epigenetic systems, and that neo-Darwinian "narratives" have been the primary obstacle to elucidating the effects of these enigmatic components of chromosomes.
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection
NASA Astrophysics Data System (ADS)
Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T.; Carr, Christopher E.
2017-08-01
Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars.
mtDNA variation in the Yanomami: evidence for additional New World founding lineages.
Easton, R D; Merriwether, D A; Crews, D E; Ferrell, R E
1996-07-01
Native Americans have been classified into four founding haplogroups with as many as seven founding lineages based on mtDNA RFLPs and DNA sequence data. mtDNA analysis was completed for 83 Yanomami from eight villages in the Surucucu and Catrimani Plateau regions of Roraima in northwestern Brazil. Samples were typed for 15 polymorphic mtDNA sites (14 RFLP sites and 1 deletion site), and a subset was sequenced for both hypervariable regions of the mitochondrial D-loop. Substantial mitochondrial diversity was detected among the Yanomami; five of seven accepted founding haplotypes and three others were observed. Of the 83 samples, 4 (4.8%) were lineage B1, 1 (1.2%) was lineage B2, 31 (37.4%) were lineage C1, 29 (34.9%) were lineage C2, 2 (2.4%) were lineage D1, 6 (7.2%) were lineage D2, 7 (8.4%) were a haplotype we designated "X6," and 3 (3.6%) were a haplotype we designated "X7." Sequence analysis found 43 haplotypes in 50 samples. B2, X6, and X7 are previously unrecognized mitochondrial founding lineage types of Native Americans. The widespread distribution of these haplotypes in the New World and Asia provides support for declaring these lineages to be New World founding types.
mtDNA variation in the Yanomami: evidence for additional New World founding lineages.
Easton, R. D.; Merriwether, D. A.; Crews, D. E.; Ferrell, R. E.
1996-01-01
Native Americans have been classified into four founding haplogroups with as many as seven founding lineages based on mtDNA RFLPs and DNA sequence data. mtDNA analysis was completed for 83 Yanomami from eight villages in the Surucucu and Catrimani Plateau regions of Roraima in northwestern Brazil. Samples were typed for 15 polymorphic mtDNA sites (14 RFLP sites and 1 deletion site), and a subset was sequenced for both hypervariable regions of the mitochondrial D-loop. Substantial mitochondrial diversity was detected among the Yanomami; five of seven accepted founding haplotypes and three others were observed. Of the 83 samples, 4 (4.8%) were lineage B1, 1 (1.2%) was lineage B2, 31 (37.4%) were lineage C1, 29 (34.9%) were lineage C2, 2 (2.4%) were lineage D1, 6 (7.2%) were lineage D2, 7 (8.4%) were a haplotype we designated "X6," and 3 (3.6%) were a haplotype we designated "X7." Sequence analysis found 43 haplotypes in 50 samples. B2, X6, and X7 are previously unrecognized mitochondrial founding lineage types of Native Americans. The widespread distribution of these haplotypes in the New World and Asia provides support for declaring these lineages to be New World founding types. PMID:8659527
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection.
Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T; Carr, Christopher E
2017-08-01
Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments-Nucleic acids-Mars-Panspermia. Astrobiology 17, 747-760.
Cryptic diversity in European bats.
Mayer, F.; von Helversen, O.
2001-01-01
Different species of bat can be morphologically very similar. In order to estimate the amount of cryptic diversity among European bats we screened the intra- and interspecific genetic variation in 26 European vespertilionid bat species. We sequenced the DNA of subunit 1 of the mitochondrial protein NADH dehydrogenase (ND1) from several individuals of a species, which were sampled in a variety of geographical regions. A phylogeny based on the mitochondrial (mt) DNA data is in good agreement with the current classification in the family. Highly divergent mitochondrial lineages were found in two taxa, which differed in at least 11% of their ND1 sequence. The two mtDNA lineages in Plecotus austriacus correlated with the two subspecies Plecotus austriacus austriacus and Plecotus austriacus kolombatovici. The two mtDNA lineages in Myotis mystacinus were partitioned among two morphotypes. The evidence for two new bat species within Europe is discussed. Convergent adaptive evolution might have contributed to the morphological similarity among distantly related species if they occupy similar ecological niches. Closely related species may differ in their ecology but not necessarily in their morphology. On the other hand, two morphologically clearly different species (Eptesicus serotinus and Eptesicus nilssonii) were found to be genetically very similar. Neither morphological nor mitochondrial DNA sequence analysis alone can be guaranteed to identify species. PMID:11522202
Toward a mtDNA locus-specific mutation database using the LOVD platform.
Elson, Joanna L; Sweeney, Mary G; Procaccio, Vincent; Yarham, John W; Salas, Antonio; Kong, Qing-Peng; van der Westhuizen, Francois H; Pitceathly, Robert D S; Thorburn, David R; Lott, Marie T; Wallace, Douglas C; Taylor, Robert W; McFarland, Robert
2012-09-01
The Human Variome Project (HVP) is a global effort to collect and curate all human genetic variation affecting health. Mutations of mitochondrial DNA (mtDNA) are an important cause of neurogenetic disease in humans; however, identification of the pathogenic mutations responsible can be problematic. In this article, we provide explanations as to why and suggest how such difficulties might be overcome. We put forward a case in support of a new Locus Specific Mutation Database (LSDB) implemented using the Leiden Open-source Variation Database (LOVD) system that will not only list primary mutations, but also present the evidence supporting their role in disease. Critically, we feel that this new database should have the capacity to store information on the observed phenotypes alongside the genetic variation, thereby facilitating our understanding of the complex and variable presentation of mtDNA disease. LOVD supports fast queries of both seen and hidden data and allows storage of sequence variants from high-throughput sequence analysis. The LOVD platform will allow construction of a secure mtDNA database; one that can fully utilize currently available data, as well as that being generated by high-throughput sequencing, to link genotype with phenotype enhancing our understanding of mitochondrial disease, with a view to providing better prognostic information. © 2012 Wiley Periodicals, Inc.
Toward a mtDNA Locus-Specific Mutation Database Using the LOVD Platform
Elson, Joanna L.; Sweeney, Mary G.; Procaccio, Vincent; Yarham, John W.; Salas, Antonio; Kong, Qing-Peng; van der Westhuizen, Francois H.; Pitceathly, Robert D.S.; Thorburn, David R.; Lott, Marie T.; Wallace, Douglas C.; Taylor, Robert W.; McFarland, Robert
2015-01-01
The Human Variome Project (HVP) is a global effort to collect and curate all human genetic variation affecting health. Mutations of mitochondrial DNA (mtDNA) are an important cause of neurogenetic disease in humans; however, identification of the pathogenic mutations responsible can be problematic. In this article, we provide explanations as to why and suggest how such difficulties might be overcome. We put forward a case in support of a new Locus Specific Mutation Database (LSDB) implemented using the Leiden Open-source Variation Database (LOVD) system that will not only list primary mutations, but also present the evidence supporting their role in disease. Critically, we feel that this new database should have the capacity to store information on the observed phenotypes alongside the genetic variation, thereby facilitating our understanding of the complex and variable presentation of mtDNA disease. LOVD supports fast queries of both seen and hidden data and allows storage of sequence variants from high-throughput sequence analysis. The LOVD platform will allow construction of a secure mtDNA database; one that can fully utilize currently available data, as well as that being generated by high-throughput sequencing, to link genotype with phenotype enhancing our understanding of mitochondrial disease, with a view to providing better prognostic information. PMID:22581690
DNA methylation dynamics during early plant life.
Bouyer, Daniel; Kramdi, Amira; Kassam, Mohamed; Heese, Maren; Schnittger, Arp; Roudier, François; Colot, Vincent
2017-09-25
Cytosine methylation is crucial for gene regulation and silencing of transposable elements in mammals and plants. While this epigenetic mark is extensively reprogrammed in the germline and early embryos of mammals, the extent to which DNA methylation is reset between generations in plants remains largely unknown. Using Arabidopsis as a model, we uncovered distinct DNA methylation dynamics over transposable element sequences during the early stages of plant development. Specifically, transposable elements and their relics show invariably high methylation at CG sites but increasing methylation at CHG and CHH sites. This non-CG methylation culminates in mature embryos, where it reaches saturation for a large fraction of methylated CHH sites, compared to the typical 10-20% methylation level observed in seedlings or adult plants. Moreover, the increase in CHH methylation during embryogenesis matches the hypomethylated state in the early endosperm. Finally, we show that interfering with the embryo-to-seedling transition results in the persistence of high CHH methylation levels after germination, specifically over sequences that are targeted by the RNA-directed DNA methylation (RdDM) machinery. Our findings indicate the absence of extensive resetting of DNA methylation patterns during early plant life and point instead to an important role of RdDM in reinforcing DNA methylation of transposable element sequences in every cell of the mature embryo. Furthermore, we provide evidence that this elevated RdDM activity is a specific property of embryogenesis.
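The three sequence contexts named in this abstract (CG, CHG, CHH) follow the standard convention in which H stands for A, C, or T. As a minimal sketch, not the authors' pipeline, the function below classifies a forward-strand cytosine by its two downstream bases. The function name and the handling of ambiguous bases and sequence ends are assumptions made for illustration.

```python
from typing import Optional


def cytosine_context(seq: str, i: int) -> Optional[str]:
    """Classify the forward-strand cytosine at index i as 'CG', 'CHG' or 'CHH'.

    Returns None when the base is not C, when the downstream bases are
    ambiguous, or when too little downstream sequence is available.
    (Illustrative helper; reverse-strand cytosines are not handled.)
    """
    seq = seq.upper()
    if seq[i] != "C":
        return None
    window = seq[i + 1 : i + 3]  # up to two downstream bases
    if any(base not in "ACGT" for base in window):
        return None  # ambiguous base such as N
    if window[:1] == "G":
        return "CG"
    if len(window) < 2:
        return None  # cannot distinguish CHG from CHH at the sequence end
    return "CHG" if window[1] == "G" else "CHH"


if __name__ == "__main__":
    for s in ("CGA", "CAG", "CTT", "CCG"):
        print(s, cytosine_context(s, 0))
```

Per-context methylation levels, such as the CHH values discussed above, would then be computed by grouping read-level methylation calls by this label.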
Pochon, Xavier; Zaiko, Anastasija; Fletcher, Lauren M.; Laroche, Olivier; Wood, Susanna A.
2017-01-01
High-throughput sequencing metabarcoding studies in marine biosecurity have largely focused on targeting environmental DNA (eDNA). DNA can persist extracellularly in the environment, making discrimination of living organisms difficult. In this study, bilge water samples (i.e., water accumulating on-board a vessel during transit) were collected from 15 small recreational and commercial vessels. eDNA and eRNA molecules were co-extracted and the V4 region of the 18S ribosomal RNA gene targeted for metabarcoding. In total, 62.7% of the Operational Taxonomic Units (OTUs) were identified at least once in the corresponding eDNA and eRNA reads, with 19.5% unique to eDNA and 17.7% to eRNA. There were substantial differences in diversity between molecular compartments; 57% of sequences from eDNA-only OTUs belonged to fungi, likely originating from legacy DNA. In contrast, there was a higher percentage of metazoan (50.2%) and ciliate (31.7%) sequences in the eRNA-only OTUs. Our data suggest that the presence of eRNA-only OTUs could be due to increased cellular activities of some rare taxa that were not identified in the eDNA datasets, unusually high numbers of rRNA transcripts in ciliates, and/or artefacts produced during the reverse transcriptase, PCR and sequencing steps. The proportions of eDNA/eRNA shared and unshared OTUs were highly heterogeneous within individual bilge water samples. Multiple factors including boat type and the activities performed on-board, such as washing of scientific equipment, may play a major role in contributing to this variability. For some marine biosecurity applications, eDNA-only data may be sufficient; however, there are an increasing number of instances where distinguishing the living portion of a community is essential. For these circumstances, we suggest only including OTUs that are present in both eDNA and eRNA data. OTUs found only in the eRNA data need to be interpreted with caution until further research provides conclusive evidence for their origin. PMID:29095959
Pochon, Xavier; Zaiko, Anastasija; Fletcher, Lauren M; Laroche, Olivier; Wood, Susanna A
2017-01-01
High-throughput sequencing metabarcoding studies in marine biosecurity have largely focused on targeting environmental DNA (eDNA). DNA can persist extracellularly in the environment, making discrimination of living organisms difficult. In this study, bilge water samples (i.e., water accumulating on-board a vessel during transit) were collected from 15 small recreational and commercial vessels. eDNA and eRNA molecules were co-extracted and the V4 region of the 18S ribosomal RNA gene targeted for metabarcoding. In total, 62.7% of the Operational Taxonomic Units (OTUs) were identified at least once in the corresponding eDNA and eRNA reads, with 19.5% unique to eDNA and 17.7% to eRNA. There were substantial differences in diversity between molecular compartments; 57% of sequences from eDNA-only OTUs belonged to fungi, likely originating from legacy DNA. In contrast, there was a higher percentage of metazoan (50.2%) and ciliate (31.7%) sequences in the eRNA-only OTUs. Our data suggest that the presence of eRNA-only OTUs could be due to increased cellular activities of some rare taxa that were not identified in the eDNA datasets, unusually high numbers of rRNA transcripts in ciliates, and/or artefacts produced during the reverse transcriptase, PCR and sequencing steps. The proportions of eDNA/eRNA shared and unshared OTUs were highly heterogeneous within individual bilge water samples. Multiple factors including boat type and the activities performed on-board, such as washing of scientific equipment, may play a major role in contributing to this variability. For some marine biosecurity applications, eDNA-only data may be sufficient; however, there are an increasing number of instances where distinguishing the living portion of a community is essential. For these circumstances, we suggest only including OTUs that are present in both eDNA and eRNA data. OTUs found only in the eRNA data need to be interpreted with caution until further research provides conclusive evidence for their origin.
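The recommendation to retain only OTUs present in both eDNA and eRNA data amounts to a set intersection. The sketch below is a hedged illustration of that partition, not the authors' workflow. The function name and the OTU identifiers are hypothetical, and real data would be per-sample OTU tables rather than bare ID lists.

```python
def partition_otus(edna_otus, erna_otus):
    """Split OTU identifiers into shared, eDNA-only and eRNA-only sets.

    Illustrative helper: inputs are any iterables of OTU identifiers.
    """
    edna, erna = set(edna_otus), set(erna_otus)
    return {
        "shared": edna & erna,     # candidates for the living community
        "edna_only": edna - erna,  # possibly legacy (extracellular) DNA
        "erna_only": erna - edna,  # to be interpreted with caution
    }


if __name__ == "__main__":
    # Hypothetical OTU identifiers for one bilge water sample.
    parts = partition_otus(
        ["OTU1", "OTU2", "OTU3", "OTU4"],
        ["OTU2", "OTU3", "OTU5"],
    )
    total = sum(len(ids) for ids in parts.values())
    for name, ids in parts.items():
        print(f"{name}: {sorted(ids)} ({100 * len(ids) / total:.1f}%)")
```

Percentages such as the shared and unique fractions reported in the abstract correspond to the sizes of these three sets relative to their union.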
Mishamandani, Sara; Gutierrez, Tony; Aitken, Michael D.
2014-01-01
Marine hydrocarbon-degrading bacteria perform a fundamental role in the oxidation and ultimate removal of crude oil and its petrochemical derivatives in coastal and open ocean environments. Those with an almost exclusive ability to utilize hydrocarbons as a sole carbon and energy source have been found confined to just a few genera. Here we used stable isotope probing (SIP), a valuable tool to link the phylogeny and function of targeted microbial groups, to investigate hydrocarbon-degrading bacteria in coastal North Carolina sea water (Beaufort Inlet, USA) with uniformly labeled [13C]n-hexadecane. The dominant sequences in clone libraries constructed from 13C-enriched bacterial DNA (from n-hexadecane enrichments) were identified to belong to the genus Alcanivorax, with ≤98% sequence identity to the closest type strain—thus representing a putative novel phylogenetic taxon within this genus. Unexpectedly, we also identified 13C-enriched sequences in heavy DNA fractions that were affiliated to the genus Methylophaga. This is a contentious group since, though some of its members have been proposed to degrade hydrocarbons, substantive evidence has not previously confirmed this. We used quantitative PCR primers targeting the 16S rRNA gene of the SIP-identified Alcanivorax and Methylophaga to determine their abundance in incubations amended with unlabeled n-hexadecane. Both showed substantial increases in gene copy number during the experiments. Subsequently, we isolated a strain representing the SIP-identified Methylophaga sequences (99.9% 16S rRNA gene sequence identity) and used it to show, for the first time, direct evidence of hydrocarbon degradation by a cultured Methylophaga sp. This study demonstrates the value of coupling SIP with cultivation methods to identify and expand on the known diversity of hydrocarbon-degrading bacteria in the marine environment. PMID:24578702
Picardi, Ernesto; Quagliariello, Carla
2008-03-26
In plant mitochondria, the post-transcriptional RNA editing process converts C to U at a number of specific sites of the mRNA sequence and usually restores phylogenetically conserved codons and the encoded amino acid residues. Sites undergoing RNA editing evolve at a higher rate than sites not modified by the process. As a result, editing sites strongly affect the evolution of plant mitochondrial genomes, representing an important source of sequence variability and potentially informative characters. To date no clear and convincing evidence has established whether or not editing sites really affect the topology of reconstructed phylogenetic trees. For this reason, we investigated the effect of RNA editing on the tree building process for twenty different plant mitochondrial gene sequences and by means of computer simulations. Based on our simulation study we suggest that the editing 'noise' in tree topology inference is mainly manifested at the cDNA level. In particular, editing sites tend to confuse tree topologies when artificial genomic and cDNA sequences are shorter than 500 bp and have an editing percentage higher than 5.0%. Similar results were also obtained with genuine plant mitochondrial genes. In this latter instance, the topology incongruence increases when the editing percentage goes up from about 3.0 to 14.0%. However, when the average gene length is higher than 1,000 bp (rps3, matR and atp1), no differences between inferred genomic and cDNA topologies could be detected. Our findings from the in silico and in vivo systems reported here strongly suggest that editing sites contribute to the generation of misleading phylogenetic trees if the analyzed mitochondrial gene sequence is highly edited (higher than 3.0%) and reduced in length (shorter than 500 bp). In the current absence of direct experimental evidence, the results presented here thus encourage the use of genomic mitochondrial rather than cDNA sequences for reconstructing phylogenetic events in land plants.
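The thresholds quoted in this abstract can be turned into a small screening check. The sketch below is an assumption-laden illustration: it takes "editing percentage" to mean edited sites as a share of sequence length, which the abstract does not define, and all function names are hypothetical. C-to-U editing is written as C-to-T, as it would appear in a cDNA sequence.

```python
def apply_editing(genomic: str, sites) -> str:
    """Return the cDNA-like sequence after C-to-U (written T) edits at 0-based sites."""
    bases = list(genomic.upper())
    for pos in sites:
        if bases[pos] != "C":
            raise ValueError(f"site {pos} is {bases[pos]}, not C")
        bases[pos] = "T"
    return "".join(bases)


def editing_percentage(length_bp: int, n_sites: int) -> float:
    """Edited sites as a percentage of sequence length (assumed definition)."""
    return 100.0 * n_sites / length_bp


def cdna_may_mislead(length_bp: int, n_sites: int,
                     max_pct: float = 3.0, min_len: int = 500) -> bool:
    """Flag short, highly edited genes, using the cut-offs quoted in the abstract."""
    return length_bp < min_len and editing_percentage(length_bp, n_sites) > max_pct


if __name__ == "__main__":
    print(apply_editing("ACGC", [1]))  # edits the C at index 1
    print(cdna_may_mislead(400, 20))   # short and 5% edited
    print(cdna_may_mislead(1200, 60))  # long gene, same editing percentage
```

A flagged gene would, following the abstract's recommendation, be analyzed from its genomic rather than its cDNA sequence.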
TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocot orders
Tarr, D Ellen K; Alexander, Helen M
2009-01-01
Background: Plant resistance (R) gene products recognize pathogen effector molecules. Many R genes code for proteins containing nucleotide binding site (NBS) and C-terminal leucine-rich repeat (LRR) domains. NBS-LRR proteins can be divided into two groups, TIR-NBS-LRR and non-TIR-NBS-LRR, based on the structure of the N-terminal domain. Although both classes are clearly present in gymnosperms and eudicots, only non-TIR sequences have been found consistently in monocots. Since most studies in monocots have been limited to agriculturally important grasses, it is difficult to draw conclusions. The purpose of our study was to look for evidence of these sequences in additional monocot orders. Findings: Using degenerate PCR, we amplified NBS sequences from four monocot species (C. blanda, D. marginata, S. trifasciata, and Spathiphyllum sp.), a gymnosperm (C. revoluta) and a eudicot (C. canephora). We successfully amplified TIR-NBS-LRR sequences from dicot and gymnosperm DNA, but not from monocot DNA. Using databases, we obtained NBS sequences from additional monocots, magnoliids and basal angiosperms. TIR-type sequences were not present in monocot or magnoliid sequences, but were present in the basal angiosperms. Phylogenetic analysis supported a single TIR clade and multiple non-TIR clades. Conclusion: We were unable to find monocot TIR-NBS-LRR sequences by PCR amplification or database searches. In contrast to previous studies, our results represent five monocot orders (Poales, Zingiberales, Arecales, Asparagales, and Alismatales). Our results establish the presence of TIR-NBS-LRR sequences in basal angiosperms and suggest that although these sequences were present in early land plants, they have been reduced significantly in monocots and magnoliids. PMID:19785756
Mechanisms and dynamics of nuclear lamina-genome interactions.
Amendola, Mario; van Steensel, Bas
2014-06-01
The nuclear lamina (NL) interacts with the genomic DNA and is thought to influence chromosome organization and gene expression. Both DNA sequences and histone modifications are important for NL tethering of the genomic DNA. These interactions are dynamic in individual cells and can change during differentiation and development. Evidence is accumulating that the NL contributes to the repression of transcription. Advances in mapping, genome-editing and microscopy techniques are increasing our understanding of the molecular mechanisms involved in NL-genome interactions. Copyright © 2014 Elsevier Ltd. All rights reserved.
Verginelli, Fabio; Capelli, Cristian; Coia, Valentina; Musiani, Marco; Falchetti, Mario; Ottini, Laura; Palmirotta, Raffaele; Tagliacozzo, Antonio; De Grossi Mazzorin, Iacopo; Mariani-Costantini, Renato
2005-12-01
The question of the origins of the dog has been much debated. The dog is descended from the wolf that at the end of the last glaciation (the archaeologically hypothesized period of dog domestication) was one of the most widespread among Holarctic mammals. Scenarios provided by genetic studies range from multiple dog-founding events to a single origin in East Asia. The earliest fossil dogs, dated approximately 17,000-12,000 radiocarbon (14C) years ago (YA), were found in Europe and in the Middle East. Ancient DNA (a-DNA) evidence could contribute to the identification of dog-founder wolf populations. To gain insight into the relationships between ancient European wolves and dogs we analyzed a 262-bp mitochondrial DNA control region fragment retrieved from five prehistoric Italian canids ranging in age from approximately 15,000 to approximately 3,000 14C YA. These canids were compared to a worldwide sample of 547 purebred dogs and 341 wolves. The ancient sequences were highly diverse and joined the three major clades of extant dog sequences. Phylogenetic investigations highlighted relationships between the ancient sequences and geographically widespread extant dog matrilines and between the ancient sequences and extant wolf matrilines of mainly East European origin. The results provide a-DNA support for the involvement of European wolves in the origins of the three major dog clades. Genetic data also suggest multiple independent domestication events. East European wolves may still reflect the genetic variation of ancient dog-founder populations.
The effects of DNA supercoiling on G-quadruplex formation.
Sekibo, Doreen A T; Fox, Keith R
2017-12-01
Guanine-rich DNAs can fold into four-stranded structures that contain stacks of G-quartets. Bioinformatics studies have revealed that G-rich sequences with the potential to adopt these structures are unevenly distributed throughout genomes, and are especially found in gene promoter regions. With the exception of the single-stranded telomeric DNA, all genomic G-rich sequences will always be present along with their C-rich complements, and quadruplex formation will be in competition with the corresponding Watson-Crick duplex. Quadruplex formation must therefore first require local dissociation (melting) of the duplex strands. Since negative supercoiling is known to facilitate the formation of alternative DNA structures, we have investigated G-quadruplex formation within negatively supercoiled DNA plasmids. Plasmids containing multiple copies of (G3T)n and (G3T4)n repeats, were probed with dimethylsulphate, potassium permanganate and S1 nuclease. While dimethylsulphate footprinting revealed some evidence for G-quadruplex formation in (G3T)n sequences, this was not affected by supercoiling, and permanganate failed to detect exposed thymines in the loop regions. (G3T4)n sequences were not protected from DMS and showed no reaction with permanganate. Similarly, both S1 nuclease and 2D gel electrophoresis of DNA topoisomers did not detect any supercoil-dependent structural transitions. These results suggest that negative supercoiling alone is not sufficient to drive G-quadruplex formation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
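Sequences such as the (G3T)n and (G3T4)n repeats named above can be screened in silico with a commonly used sequence-level rule for putative quadruplex-forming motifs: four runs of at least three guanines separated by loops of 1 to 7 nucleotides. That rule is a general bioinformatic heuristic and is not taken from this abstract. The choice of n = 4 and the function name are assumptions for illustration, and a match says nothing about whether a quadruplex actually forms in supercoiled plasmid DNA.

```python
import re

# Heuristic motif: four G-runs (>= 3 G) separated by three loops of 1-7 nt.
G4_MOTIF = re.compile(r"(?:G{3,}[ACGT]{1,7}){3}G{3,}")


def has_putative_g4(seq: str) -> bool:
    """Return True if seq contains a putative quadruplex-forming motif."""
    return G4_MOTIF.search(seq.upper()) is not None


if __name__ == "__main__":
    g3t = "GGGT" * 4      # (G3T)n with n = 4 (n is an assumption)
    g3t4 = "GGGTTTT" * 4  # (G3T4)n with n = 4
    for name, seq in (("(G3T)4", g3t), ("(G3T4)4", g3t4), ("control", "ACGT" * 10)):
        print(name, has_putative_g4(seq))
```

Because both repeat designs satisfy this rule, the experimental finding that supercoiling did not promote quadruplex formation cannot be predicted from the sequence motif alone.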
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health
Martin, William F.
2017-01-01
Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. PMID:28444372
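The overestimation problem described here, where one insertion later split by mobile elements yields several BLAST hits, is often handled by merging nearby hits before counting. The sketch below illustrates that idea only; it is not the procedure used in the study. The `max_gap` threshold, the tuple layout and the function name are hypothetical, and a real analysis would likely also compare organelle coordinates and strand.

```python
def merge_hits(hits, max_gap=1000):
    """Merge hits on the same chromosome separated by at most max_gap bp.

    hits: iterable of (chrom, start, end) tuples in nuclear coordinates.
    Returns merged intervals, each treated as one candidate insertion event.
    """
    merged = []
    for chrom, start, end in sorted(hits):
        if merged and merged[-1][0] == chrom and start - merged[-1][2] <= max_gap:
            prev_chrom, prev_start, prev_end = merged[-1]
            merged[-1] = (prev_chrom, prev_start, max(prev_end, end))
        else:
            merged.append((chrom, start, end))
    return merged


if __name__ == "__main__":
    # Hypothetical hits: the first two may be one insertion split by a mobile element.
    hits = [("chr1", 100, 200), ("chr1", 500, 700),
            ("chr1", 5000, 5100), ("chr2", 150, 300)]
    events = merge_hits(hits)
    print(len(hits), "raw hits ->", len(events), "candidate insertions")
    print(events)
```

The count is sensitive to `max_gap`: too small a value overcounts fragmented insertions, and too large a value can fuse independent ones.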
Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L
2013-01-30
Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
2013-01-01
Background: Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results: Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions: While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705
Gobert, V; Moja, S; Taberlet, P; Wink, M
2006-07-01
Phylogenetic reconstructions with molecular tools are now widely used, thanks to advances in PCR and sequencing technologies. The choice of the molecular target still remains a problem because too few comparative data are available. This is particularly true for hybrid taxa, where differential introgression of genome parts leads to incongruity between data sets. We have studied the potential of three data partitions to reconstruct the phylogeny of mints related to M. x piperita. These included nuclear DNA (ITS), chloroplast DNA (non-coding regions trnL intron, intergenic spacers trnL-trnF, and psbA-trnH), and AFLP and ISSR markers. The taxonomic sampling was composed of hybrids, diploid and polyploid genomes. Since the genealogy of cultivated mint hybrids is known, they represent a model group to compare the usefulness of various molecular markers for phylogeny inference. Incongruities between ITS, chloroplast DNA, and AFLP-ISSR phylogenetic trees were recorded, although DNA fingerprinting data were congruent with morphological classification. Evidence of chloroplast capture events was obtained for M. x piperita. Direct sequencing of ITS led to biased results because of the existence of pseudogenes. Sequencing of cloned ITS further failed to provide evidence of the existence of the two parental copy types for M. x piperita, a sterile hybrid that has had no opportunity for concerted evolution of ITS copies. AFLP-ISSR data clustered M. x piperita with the parent that had the largest genome. This study sheds light on the differential introgression of different genome regions in mint hybrids.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2002-01-01
The land plants and their immediate green algal ancestors, the charophytes, form the Streptophyta. There is evidence that both the chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) underwent substantial changes in their architecture (intron insertions, gene losses, scrambling in gene order, and genome expansion in the case of mtDNA) during the evolution of streptophytes; however, because no charophyte organelle DNAs have been sequenced completely thus far, the suite of events that shaped streptophyte organelle genomes remains largely unknown. Here, we have determined the complete cpDNA (131,183 bp) and mtDNA (56,574 bp) sequences of the charophyte Chaetosphaeridium globosum (Coleochaetales). At the levels of gene content (124 genes), intron composition (18 introns), and gene order, Chaetosphaeridium cpDNA is remarkably similar to land-plant cpDNAs, implying that most of the features characteristic of land-plant lineages were gained during the evolution of charophytes. Although the gene content of Chaetosphaeridium mtDNA (67 genes) closely resembles that of the bryophyte Marchantia polymorpha (69 genes), this charophyte mtDNA differs substantially from its land-plant relatives at the levels of size, intron composition (11 introns), and gene order. Our finding that it shares only one intron with its land-plant counterparts supports the idea that the vast majority of mitochondrial introns in land plants appeared after the emergence of these organisms. Our results also suggest that the events accounting for the spacious intergenic spacers found in land-plant mtDNAs took place late during the evolution of charophytes or coincided with the transition from charophytes to land plants. PMID:12161560
A broad survey of recombination in animal mitochondria.
Piganeau, Gwenaël; Gardner, Michael; Eyre-Walker, Adam
2004-12-01
Recombination in mitochondrial DNA (mtDNA) remains a controversial topic. Here we present a survey of 279 animal mtDNA data sets, of which 12 were from asexual species. Using four separate tests, we show that there is widespread evidence of recombination; for one test as many as 14.2% of the data sets reject a model of clonal inheritance and in several data sets, including primates, the recombinants can be identified visually. We show that none of the tests give significant results for obligate clonal species (apomictic pathogens) and that the sexual species show significantly greater evidence of recombination than asexual species. For some data sets, such as Macaca nemestrina, additional data sets suggest that the recombinants are not artifacts. For others, it cannot be determined whether the recombinants are real or produced by laboratory error. Either way, the results have important implications for how mtDNA is sequenced and used.
Fayle, Tom M; Scholtz, Olivia; Dumbrell, Alex J; Russell, Stephen; Segar, Simon T; Eggleton, Paul
2015-01-01
Termites and ants contribute more to animal biomass in tropical rain forests than any other single group and perform vital ecosystem functions. Although ants prey on termites, at the community level the linkage between these groups is poorly understood. Thus, assessing the distribution and specificity of ant termitophagy is of considerable interest. We describe an approach for quantifying ant-termite food webs by sequencing termite DNA (cytochrome c oxidase subunit II, COII) from ant guts and apply this to a soil-dwelling ant community from tropical rain forest in Gabon. We extracted DNA from 215 ants from 15 species. Of these, 17.2 % of individuals had termite DNA in their guts, with BLAST analysis confirming the identity of 34.1 % of these termites to family level or better. Although ant species varied in detection of termite DNA, ranging from 63 % (5/7; Camponotus sp. 1) to 0 % (0/7; Ponera sp. 1), there was no evidence (with small sample sizes) for heterogeneity in termite consumption across ant taxa, and no evidence for species-specific ant-termite predation. In all three ant species with identifiable termite DNA in multiple individuals, multiple termite species were represented. Furthermore, the two termite species that were detected on multiple occasions in ant guts were in both cases found in multiple ant species, suggesting that ant-termite food webs are not strongly compartmentalised. However, two ant species were found to consume only Anoplotermes-group termites, indicating possible predatory specialisation at a higher taxonomic level. Using a laboratory feeding test, we were able to detect termite COII sequences in ant guts up to 2 h after feeding, indicating that our method only detects recent feeding events. Our data provide tentative support for the hypothesis that unspecialised termite predation by ants is widespread and highlight the use of molecular approaches for future studies of ant-termite food webs.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions
Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S
2013-06-25
A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions
Gardner, Shea N [San Leandro, CA]; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA]; Young, Jennifer A [Berkeley, CA]; Clague, David S [Livermore, CA]
2011-01-18
A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
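The preselection step described above, in which a target sequence is divided into segments of length n that share a user-specified hybridizing overlap, can be illustrated with a minimal sketch. This is not the patented protocol: it ignores strand alternation, melting temperatures, and the temporal ordering of additions, and the function names `tile_segments` and `reassemble` are hypothetical, introduced here only for illustration.

```python
def tile_segments(target: str, n: int, overlap: int) -> list[str]:
    """Split `target` into segments of length n (the last may be shorter).

    Consecutive segments share exactly `overlap` bases, standing in for the
    user-specified hybridizing overlap. Illustrative assumption only.
    """
    if not 0 < overlap < n:
        raise ValueError("overlap must be between 1 and n - 1")
    step = n - overlap
    segments = []
    start = 0
    while True:
        segments.append(target[start:start + n])
        if start + n >= len(target):
            break
        start += step
    return segments


def reassemble(segments: list[str], overlap: int) -> str:
    """Join segments by checking each shared overlap, then appending the rest."""
    out = segments[0]
    for seg in segments[1:]:
        if not out.endswith(seg[:overlap]):
            raise ValueError("segments do not share the expected overlap")
        out += seg[overlap:]
    return out


target = "ATGCGTACGTTAGCCTAGGA"   # arbitrary 20-base example sequence
segments = tile_segments(target, n=8, overlap=3)
rebuilt = reassemble(segments, overlap=3)
```

With n = 8 and an overlap of 3, each new segment starts 5 bases after the previous one, so the 20-base example yields four segments whose overlaps are sufficient to rebuild the original sequence.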
Paugh, Steven W.; Coss, David R.; Bao, Ju; ...
2016-02-04
MicroRNAs are important regulators of gene expression, acting primarily by binding to sequence-specific locations on already transcribed messenger RNAs (mRNA). Recent studies indicate that microRNAs may also play a role in up-regulating mRNA transcription levels, although a definitive mechanism has not been established. Double-helical DNA is capable of forming triple-helical structures through Hoogsteen and reverse Hoogsteen interactions in the major groove of the duplex, and we show physical evidence that microRNAs form triple-helical structures with duplex DNA, and identify microRNA sequences that favor triplex formation. We developed an algorithm (Trident) to search genome-wide for potential triplex-forming sites and show that several mammalian and non-mammalian genomes are enriched for strong microRNA triplex binding sites. We show that those genes containing sequences favoring microRNA triplex formation are markedly enriched (3.3 fold, p < 2.2 × 10^-16) for genes whose expression is positively correlated with expression of microRNAs targeting triplex binding sequences. As a result, this work has thus revealed a new mechanism by which microRNAs can interact with gene promoter regions to modify gene transcription.
Identification of the sequence variations of 15 autosomal STR loci in a Chinese population.
Chen, Wenjing; Cheng, Jianding; Ou, Xueling; Chen, Yong; Tong, Dayue; Sun, Hongyu
2014-01-01
DNA sequence variation including base(s) changes and insertion or deletion in the primer binding region may cause a null allele and, if this changes the length of the amplified fragment out of the allelic ladder, off-ladder (OL) alleles may be detected. In order to provide accurate and reliable DNA evidence for forensic DNA analysis, it is essential to clarify sequence variations in prevalently used STR loci. Suspected null alleles and OL alleles of the PowerPlex® 16 System from 21,934 unrelated Chinese individuals were verified by alternative systems and sequenced. A total of 17 cases with null alleles were identified, including 12 kinds of point mutations in 16 cases and a 19-base deletion in one case. The total frequency of null alleles was 7.751 × 10^-4. Eight hundred and forty-four OL alleles classified as being of 97 different kinds were observed at 15 STR loci of the PowerPlex® 16 System except vWA. All the frequencies of OL alleles were under 0.01. Null alleles should be confirmed by alternative primers and OL alleles should be named appropriately. Particular attention should be paid to sequence variation, since incorrect designation could lead to false conclusions.
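The reported total null-allele frequency can be checked with a short calculation. The sketch below assumes the denominator is the number of individuals typed (21,934), which reproduces the published figure; the function name `null_allele_frequency` is hypothetical.

```python
def null_allele_frequency(cases: int, individuals: int) -> float:
    """Frequency of null-allele cases per individual typed (assumed denominator)."""
    return cases / individuals


# Figures taken from the abstract above.
freq = null_allele_frequency(cases=17, individuals=21_934)
print(f"{freq:.3e}")  # 7.751e-04
```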
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paugh, Steven W.; Coss, David R.; Bao, Ju
MicroRNAs are important regulators of gene expression, acting primarily by binding to sequence-specific locations on already transcribed messenger RNAs (mRNA). Recent studies indicate that microRNAs may also play a role in up-regulating mRNA transcription levels, although a definitive mechanism has not been established. Double-helical DNA is capable of forming triple-helical structures through Hoogsteen and reverse Hoogsteen interactions in the major groove of the duplex, and we show physical evidence that microRNAs form triple-helical structures with duplex DNA, and identify microRNA sequences that favor triplex formation. We developed an algorithm (Trident) to search genome-wide for potential triplex-forming sites and show that several mammalian and non-mammalian genomes are enriched for strong microRNA triplex binding sites. We show that those genes containing sequences favoring microRNA triplex formation are markedly enriched (3.3 fold, p < 2.2 × 10^-16) for genes whose expression is positively correlated with expression of microRNAs targeting triplex binding sequences. As a result, this work has thus revealed a new mechanism by which microRNAs can interact with gene promoter regions to modify gene transcription.
Motriuk-Smith, Dagmara; Seville, R Scott; Quealy, Leah; Oliver, Clinton E.
2011-01-01
The taxonomy of the coccidia has historically been morphologically based. The purpose of this study was to establish if conspecificity of isolates of Eimeria callospermophili from 4 ground-dwelling squirrel hosts (Rodentia: Sciuridae) is supported by comparison of rDNA sequence data and to examine how this species relates to eimerian species from other sciurid hosts. Eimeria callospermophili was isolated from 4 wild-caught hosts, i.e., Urocitellus elegans, Cynomys leucurus, Marmota flaviventris, and Cynomys ludovicianus. The ITS1 and ITS2 genomic rDNA sequences were PCR generated, sequenced, and analyzed. The highest intraspecific pairwise distance values of 6.0% in ITS1 and 7.1% in ITS2 were observed in C. leucurus. Interspecific pairwise distance values greater than 5% do not support E. callospermophili conspecificity. Generated E. callospermophili sequences were compared to Eimeria lancasterensis from Sciurus niger and Sciurus niger cinereus, and Eimeria ontarioensis from S. niger. A single well-supported clade was formed by E. callospermophili amplicons in Neighbor Joining and Maximum Parsimony analyses. However, within the clade there was little evidence of host or geographic structuring of the species. PMID:21506777
Alguacil, Maria del Mar; Torrecillas, Emma; Lozano, Zenaida; Roldán, Antonio
2011-01-01
Arbuscular mycorrhizal fungi (AMF) play important roles as plant protection agents, reducing or suppressing nematode colonization. However, it has never been investigated whether the galls produced in roots by nematode infection are colonized by AMF. This study tested whether galls produced by Meloidogyne incognita infection in Prunus persica roots are colonized by AMF. We also determined the changes in AMF composition and biodiversity mediated by infection with this root-knot nematode. DNA from galls and roots of plants infected by M. incognita and from roots of noninfected plants was extracted, amplified, cloned, and sequenced using AMF-specific primers. Phylogenetic analysis using the small-subunit (SSU) ribosomal DNA (rDNA) data set revealed 22 different AMF sequence types (17 Glomus sequence types, 3 Paraglomus sequence types, 1 Scutellospora sequence type, and 1 Acaulospora sequence type). The highest AMF diversity was found in uninfected roots, followed by infected roots and galls. This study indicates that the galls produced in P. persica roots due to infection with M. incognita were colonized extensively by a community of AMF, belonging to the families Paraglomeraceae and Glomeraceae, that was different from the community detected in roots. Although the function of the AMF in the galls is still unknown, we hypothesize that they act as protection agents against opportunistic pathogens. PMID:21984233
Alguacil, Maria del Mar; Torrecillas, Emma; Lozano, Zenaida; Roldán, Antonio
2011-12-01
Arbuscular mycorrhizal fungi (AMF) play important roles as plant protection agents, reducing or suppressing nematode colonization. However, it has never been investigated whether the galls produced in roots by nematode infection are colonized by AMF. This study tested whether galls produced by Meloidogyne incognita infection in Prunus persica roots are colonized by AMF. We also determined the changes in AMF composition and biodiversity mediated by infection with this root-knot nematode. DNA from galls and roots of plants infected by M. incognita and from roots of noninfected plants was extracted, amplified, cloned, and sequenced using AMF-specific primers. Phylogenetic analysis using the small-subunit (SSU) ribosomal DNA (rDNA) data set revealed 22 different AMF sequence types (17 Glomus sequence types, 3 Paraglomus sequence types, 1 Scutellospora sequence type, and 1 Acaulospora sequence type). The highest AMF diversity was found in uninfected roots, followed by infected roots and galls. This study indicates that the galls produced in P. persica roots due to infection with M. incognita were colonized extensively by a community of AMF, belonging to the families Paraglomeraceae and Glomeraceae, that was different from the community detected in roots. Although the function of the AMF in the galls is still unknown, we hypothesize that they act as protection agents against opportunistic pathogens.
Turina, Massimo; Ghignone, Stefano; Astolfi, Nausicaa; Silvestri, Alessandro; Bonfante, Paola; Lanfranco, Luisa
2018-02-02
Arbuscular Mycorrhizal Fungi (AMF) are key components of the plant microbiota. AMF genetic complexity is increased by the presence of endobacteria, which live inside many species. A further component of such complexity is the virome associated with AMF, about which knowledge is still very limited. Here, by exploiting transcriptomic data we describe the virome of Gigaspora margarita. A BLAST search for viral RNA-dependent RNA polymerase sequences allowed the identification of four mitoviruses, one Ourmia-like narnavirus, one Giardia-like virus, and two sequences related to Fusarium graminearum mycoviruses. Northern blot and RT-PCR confirmed the authenticity of all the sequences with the exception of the F. graminearum-related ones. All the mitoviruses are replicative and functional since both positive strand and negative strand RNA are present. The abundance of the viral RNA molecules is not regulated by the presence or absence of Candidatus Glomeribacter gigasporarum, the endobacterium hosted by G. margarita, with the exception of the Ourmia-like sequence which is absent in bacteria-cured spores. In addition, we report, for the first time, DNA fragments corresponding to mitovirus sequences associated with the presence of viral RNA. These sequences are not integrated in the mitochondrial DNA and preliminary evidence seems to exclude integration in the nuclear genome. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.
Orlando, Ludovic; Pagés, Marie; Calvignac, Sébastien; Hughes, Sandrine; Hänni, Catherine
2007-02-22
Pigmy elephants inhabited islands of the Mediterranean region during the Pleistocene period but became extinct in the course of the Holocene. Despite striking distinctive anatomical characteristics related to insularity, some similarities with the lineage of extant Asian elephants have suggested that pigmy elephants could most probably be seen as members of the genus Elephas. Poulakakis et al. (2006) have recently challenged this view by recovering a short mtDNA sequence from an 800 000 year old fossil of the Cretan pigmy elephant (Elephas creticus). According to the authors of this study, a deep taxonomic revision of Cretan dwarf elephants would be needed, as the sequence exhibits clear affinities with woolly mammoth haplotypes. However, we point out here many aspects that seriously weaken the strength of the ancient DNA evidence reported.
DNA Recognition by a σ 54 Transcriptional Activator from Aquifex aeolicus
Vidangos, Natasha K.; Heideker, Johanna; Lyubimov, Artem; ...
2014-08-23
Transcription initiation by bacterial σ 54-polymerase requires the action of a transcriptional activator protein. Activators bind sequence-specifically upstream of the transcription initiation site via a DNA-binding domain. The structurally characterized DNA-binding domains from activators all belong to the Factor for Inversion Stimulation (Fis) family of helix-turn-helix DNA-binding proteins. We report here structures of the free and DNA-bound forms of the DNA-binding domain of NtrC4 (4DBD) from Aquifex aeolicus, a member of the NtrC family of σ 54 activators. Two NtrC4 binding sites were identified upstream (-145 and -85 base pairs) from the start of the lpxC gene, which is responsible for the first committed step in Lipid A biosynthesis. This is the first experimental evidence for σ 54 regulation in lpxC expression. 4DBD was crystallized both without DNA and in complex with the -145 binding site. The structures, together with biochemical data, indicate that NtrC4 binds to DNA in a manner that is similar to that of its close homologue, Fis. Ultimately, the greater sequence specificity for the binding of 4DBD relative to Fis seems to arise from a larger number of base specific contacts contributing to affinity than for Fis.
Slon, Viviane; Viola, Bence; Renaud, Gabriel; Gansauge, Marie-Theres; Benazzi, Stefano; Sawyer, Susanna; Hublin, Jean-Jacques; Shunkov, Michael V.; Derevianko, Anatoly P.; Kelso, Janet; Prüfer, Kay; Meyer, Matthias; Pääbo, Svante
2017-01-01
The presence of Neandertals in Europe and Western Eurasia before the arrival of anatomically modern humans is well supported by archaeological and paleontological data. In contrast, fossil evidence for Denisovans, a sister group of Neandertals recently identified on the basis of DNA sequences, is limited to three specimens, all of which originate from Denisova Cave in the Altai Mountains (Siberia, Russia). We report the retrieval of DNA from a deciduous lower second molar (Denisova 2), discovered in a deep stratigraphic layer in Denisova Cave, and show that this tooth comes from a female Denisovan individual. On the basis of the number of “missing substitutions” in the mitochondrial DNA determined from the specimen, we find that Denisova 2 is substantially older than two of the other Denisovans, reinforcing the view that Denisovans were likely to have been present in the vicinity of Denisova Cave over an extended time period. We show that the level of nuclear DNA sequence diversity found among Denisovans is within the lower range of that of present-day human populations. PMID:28695206
Genetic evidence from mitochondrial DNA corroborates the origin of Tibetan chickens.
Zhang, Long; Zhang, Pu; Li, Qingqing; Gaur, Uma; Liu, Yiping; Zhu, Qing; Zhao, Xiaoling; Wang, Yan; Yin, Huadong; Hu, Yaodong; Liu, Aiping; Li, Diyan
2017-01-01
Chicken is the most common poultry species and is important to human societies. Tibetan chicken (Gallus gallus domesticus) is a breed endemic to China that is distributed mainly on the Qinghai-Tibet Plateau. However, its origin has not been well characterized. In the present study, we sequenced partial mitochondrial DNA (mtDNA) control region of 239 and 283 samples from Tibetan and Sichuan indigenous chickens, respectively. Incorporating 1091 published sequences, we constructed the matrilineal genealogy of Tibetan chickens to further document their domestication history. We found that the genetic structure of the mtDNA haplotypes of Tibetan chickens is dominated by seven major haplogroups (A-G). In addition, phylogenetic and network analyses showed that Tibetan chickens are not distinguishable from the indigenous chickens in surrounding areas. Furthermore, some clades of Tibetan chickens may have originated from game fowls. In summary, our results collectively indicated that Tibetan chickens may have diverged from indigenous chickens in the adjacent regions and hybridized with various chickens.
Genetic evidence from mitochondrial DNA corroborates the origin of Tibetan chickens
Zhu, Qing; Zhao, Xiaoling; Wang, Yan; Yin, Huadong; Hu, Yaodong; Liu, Aiping; Li, Diyan
2017-01-01
Chicken is the most common poultry species and is important to human societies. Tibetan chicken (Gallus gallus domesticus) is a breed endemic to China that is distributed mainly on the Qinghai-Tibet Plateau. However, its origin has not been well characterized. In the present study, we sequenced partial mitochondrial DNA (mtDNA) control region of 239 and 283 samples from Tibetan and Sichuan indigenous chickens, respectively. Incorporating 1091 published sequences, we constructed the matrilineal genealogy of Tibetan chickens to further document their domestication history. We found that the genetic structure of the mtDNA haplotypes of Tibetan chickens is dominated by seven major haplogroups (A-G). In addition, phylogenetic and network analyses showed that Tibetan chickens are not distinguishable from the indigenous chickens in surrounding areas. Furthermore, some clades of Tibetan chickens may have originated from game fowls. In summary, our results collectively indicated that Tibetan chickens may have diverged from indigenous chickens in the adjacent regions and hybridized with various chickens. PMID:28241078
A novel peptide from the ACEI/BPP-CNP precursor in the venom of Crotalus durissus collilineatus.
Higuchi, Shigesada; Murayama, Nobuhiro; Saguchi, Ken-ichi; Ohi, Hiroaki; Fujita, Yoshiaki; da Silva, Nelson Jorge; de Siqueira, Rodrigo José Bezerra; Lahlou, Saad; Aird, Steven D
2006-10-01
In crotaline venoms, angiotensin-converting enzyme inhibitors [ACEIs, also known as bradykinin potentiating peptides (BPPs)], are products of a gene coding for an ACEI/BPP-C-type natriuretic peptide (CNP) precursor. In the genes from Bothrops jararaca and Gloydius blomhoffii, ACEI/BPP sequences are repeated. Sequencing of a cDNA clone from venom glands of Crotalus durissus collilineatus showed that two ACEIs/BPPs are located together at the N-terminus, but without repeats. An additional sequence for CNP was unexpectedly found at the C-terminus. Homologous genes for the ACEI/BPP-CNP precursor suggest that most crotaline venoms contain both ACEIs/BPPs and CNP. The sequence of ACEIs/BPPs is separated from the CNP sequence by a long spacer sequence. Previously, there was no evidence that this spacer actually coded any expressed peptides. Aird and Kaiser (1986, unpublished) previously isolated and sequenced a peptide of 11 residues (TPPAGPDVGPR) from Crotalus viridis viridis venom. In the present study, analysis of the cDNA clone from C. d. collilineatus revealed a nearly identical sequence in the ACEI/BPP-CNP spacer. Fractionation of the crude venom by reverse phase HPLC (C(18)), and analysis of the fractions by mass spectrometry (MS) indicated a component of 1020.5 Da. Amino acid sequencing by MS/MS confirmed that C. d. collilineatus venom contains the peptide TPPAGPDGGPR. Its high proline content and paired proline residues are typical of venom hypotensive peptides, although it lacks the usual N-terminal pyroglutamate. It has no demonstrable hypotensive activity when injected intravenously in rats; however, its occurrence in the venoms of dissimilar species suggests that its presence is not accidental. Evidence suggests that these novel toxins probably activate anaphylatoxin C3a receptors.
Molecular Evidence of Chlamydia-Like Organisms in the Feces of Myotis daubentonii Bats.
Hokynar, K; Vesterinen, E J; Lilley, T M; Pulliainen, A T; Korhonen, S J; Paavonen, J; Puolakkainen, M
2017-01-15
Chlamydia-like organisms (CLOs) are recently identified members of the Chlamydiales order. CLOs share intracellular lifestyles and biphasic developmental cycles, and they have been detected in environmental samples as well as in various hosts such as amoebae and arthropods. In this study, we screened bat feces for the presence of CLOs by molecular analysis. Using pan-Chlamydiales PCR targeting the 16S rRNA gene, Chlamydiales DNA was detected in 54% of the specimens. PCR amplification, sequencing, and phylogenetic analysis of the 16S rRNA and 23S rRNA genes were used to classify positive specimens and infer their phylogenetic relationships. Most sequences matched best with Rhabdochlamydia species or uncultured Chlamydia sequences identified in ticks. Another set of sequences matched best with sequences of the Chlamydia genus or uncultured Chlamydiales from snakes. To gain evidence of whether CLOs in bat feces are merely diet borne, we analyzed insects trapped from the same location where the bats foraged. Interestingly, the CLO sequences resembling Rhabdochlamydia spp. were detected in insect material as well, but the other set of CLO sequences was not, suggesting that this set might not originate from prey. Thus, bats represent another potential host for Chlamydiales and could harbor novel, previously unidentified members of this order. Several pathogenic viruses are known to colonize bats, and recent analyses indicate that bats are also reservoir hosts for bacterial genera. Chlamydia-like organisms (CLOs) have been detected in several animal species. CLOs have high 16S rRNA sequence similarity to Chlamydiaceae and exhibit similar intracellular lifestyles and biphasic developmental cycles. Our study describes the frequent occurrence of CLO DNA in bat feces, suggesting an expanding host species spectrum for the Chlamydiales. As bats can acquire various infectious agents through their diet, prey insects were also studied. We identified CLO sequences in bats that matched best with sequences in prey insects but also CLO sequences not detected in prey insects. This suggests that a portion of CLO DNA present in bat feces is not prey borne. Furthermore, some sequences from bat droppings not originating from their diet might well represent novel, previously unidentified members of the Chlamydiales order. Copyright © 2016 American Society for Microbiology.
Radioresistance of GGG Sequences to Prompt Strand Break Formation from Direct-Type Radiation Damage
Black, Paul J.; Miller, Adam S.; Hayes, Jeffrey J.
2016-01-01
Purpose: As humans, we are constantly exposed to ionizing radiation from natural, man-made and cosmic sources which can damage DNA, leading to deleterious effects including cancer incidence. In this work we introduce a method to monitor strand breaks resulting from damage due to the direct effect of ionizing radiation and provide evidence for sequence-dependent effects leading to strand breaks. Materials and methods: To analyze only DNA strand breaks caused by radiation damage due to the direct effect of ionizing radiation, we combined an established technique to generate dehydrated DNA samples with a technique to analyze single strand breaks on short oligonucleotide sequences via denaturing gel electrophoresis. Results: We find that direct damage primarily results in a reduced number of strand breaks in guanine triplet regions (GGG) when compared to isolated guanine (G) bases with identical flanking base context. In addition, we observe strand break behavior possibly indicative of protection of guanine bases when flanked by pyrimidines, and sensitization of guanine to strand break when flanked by adenine (A) bases in both isolated G and GGG cases. Conclusions: These observations provide insight into the strand break behavior in GGG regions damaged via the direct effect of ionizing radiation. In addition, this could be indicative of DNA sequences that are naturally more susceptible to strand break due to the direct effect of ionizing radiation. PMID:27349757
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tully, D.B.; Hillman, D.; Herbert, E.
1986-05-01
Glucocorticoids negatively regulate expression of the human proopiomelanocortin (POMC) gene. It has been postulated that this effect may be modulated by a direct interaction of the glucocorticoid receptor (GR) with DNA in the vicinity of the POMC promoter. In order to investigate interactions of GR with POMC DNA, DNA-cellulose competitive binding assays have been performed using isolated fragments of cloned POMC DNA to compete with calf thymus DNA-cellulose for binding of triamcinolone acetonide affinity-labelled GR prepared from HeLa S3 cells. In these assays, two fragments isolated from the 5' flanking sequences of POMC DNA (Fragment 3, -1765 to -677, and Fragment 4, -676 to +125 with respect to the mRNA cap site) have competed favorably, with Fragment 3 consistently competing more strongly than Fragment 4. Additional studies have been conducted utilizing a newly developed South-western Blot procedure in which specific 32P-labelled DNA fragments are allowed to bind to dexamethasone mesylate labelled GR immobilized on nitrocellulose filters. Results from these studies have also shown preferential binding by POMC DNA fragments 3 and 4. DNA footprinting and gene transfer experiments are now being conducted to further characterize the nature of GR interaction with POMC DNA.
Zock, C; Iselt, A; Doerfler, W
1993-01-01
Human adenovirus type 12 (Ad12) cannot replicate in hamster cells, whereas human cells are permissive for Ad12. Ad12 DNA replication and late-gene and virus-associated RNA expression are blocked in hamster cells. Early Ad12 genes are transcribed, and the viral DNA can be integrated into the host genome. Ad12 DNA replication and late-gene transcription can be complemented in hamster cells by E1 functions of Ad2 or Ad5, for which hamster cells are fully permissive (for a review, see W. Doerfler, Adv. Virus Res. 39:89-128, 1991). We have previously demonstrated that a 33-nucleotide mitigator sequence, which is located in the downstream region of the major late promoter (MLP) of Ad12 DNA, is responsible for the inactivity of the Ad12 MLP in hamster cells (C. Zock and W. Doerfler, EMBO J. 9:1615-1623, 1990). A similar negative regulator has not been found in the MLP of Ad2 DNA. We have now studied the mechanism of action of this mitigator element. The results of nuclear run-on experiments document the absence of MLP transcripts in the nuclei of Ad12-infected BHK21 hamster cells. Surprisingly, the mitigator element cannot elicit its function in in vitro transcription experiments with nuclear extracts from both hamster BHK21 and human HeLa cells. Intact nuclear topology and/or tightly bound nuclear elements that cannot be eluted in nuclear extracts are somehow required for recognition of the Ad12 mitigator. Electrophoretic mobility shift assays have not revealed significant differences in the binding of proteins from human HeLa or hamster BHK21 cells to the mitigator sequence in the MLP of Ad12 DNA or to the corresponding sequence in Ad2 DNA. We have converted the sequence of the mitigator in the MLP of Ad12 DNA to the equivalent sequence in the MLP of Ad2 DNA by site-directed mutagenesis. This construct was not active in hamster cells. When the Ad12 mitigator, on the other hand, was inserted into the Ad2 MLP, the latter's function in hamster cells was not compromised. 
Deletions in the 5' upstream region of the Ad12 MLP have provided evidence for the existence of additional sequences that codetermine the deficiency of the Ad12 MLP in hamster cells. The amphifunctional YY1 protein from HeLa cells can bind specifically to the mitigator and to upstream elements of the MLP of Ad12 DNA. (ABSTRACT TRUNCATED AT 400 WORDS) PMID:8419643
NASA Astrophysics Data System (ADS)
Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.
2017-07-01
A DNA sequence can be defined as a succession of letters representing the order of nucleotides within DNA, built from the four DNA base codes adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of a sequence is determined using DNA sequencing methods and technologies, which have been under development since the 1970s and have now matured into advanced, high-throughput sequencing technologies. DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases sequencing produces ambiguous results, making it difficult to determine whether a given code is A, T, G, or C. To address this problem, we introduce an alternative representation of DNA codes, the quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, and PC are the probabilities of the bases A, T, G, and C appearing in Q, and PA + PT + PG + PC = 1. Using the quaternion representation, we construct an improved scoring matrix for global sequence alignment by applying a dot product method. This scoring matrix produces better, higher-quality match and mismatch scores between two DNA base codes. For the implementation, we applied the Needleman-Wunsch global sequence alignment algorithm in Octave to analyze a target sequence containing some ambiguous sequence data. The subject sequences are DNA sequences of the Streptococcus pneumoniae family obtained from GenBank, while the target DNA sequence was received from our collaborator's database. We found that the quaternion representation improves the quality of the sequence alignment score, and we conclude that the target DNA sequence has maximum similarity with Streptococcus pneumoniae.
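The quaternion scoring idea in the abstract above can be illustrated with a minimal Python sketch (the study itself used Octave). Everything specific here is an assumption made for illustration: the fully ambiguous code "N", the `pair_score` mapping from dot product to an expected match/mismatch score, and the linear gap penalty are hypothetical choices, since the abstract does not state how the authors' scoring matrix is built.

```python
from typing import Dict, List, Tuple

Quat = Tuple[float, float, float, float]  # (PA, PT, PG, PC), components sum to 1

# Unambiguous bases are one-hot vectors; "N" is a hypothetical fully ambiguous call.
BASES: Dict[str, Quat] = {
    "A": (1.0, 0.0, 0.0, 0.0),
    "T": (0.0, 1.0, 0.0, 0.0),
    "G": (0.0, 0.0, 1.0, 0.0),
    "C": (0.0, 0.0, 0.0, 1.0),
    "N": (0.25, 0.25, 0.25, 0.25),
}


def encode(seq: str) -> List[Quat]:
    """Turn a string of base codes into a list of probability vectors."""
    return [BASES[b] for b in seq]


def dot(p: Quat, q: Quat) -> float:
    """Dot product; for independent positions this is P(both hold the same base)."""
    return sum(a * b for a, b in zip(p, q))


def pair_score(p: Quat, q: Quat, match: float = 1.0, mismatch: float = -1.0) -> float:
    """Expected substitution score (assumed mapping, not taken from the abstract)."""
    same = dot(p, q)
    return match * same + mismatch * (1.0 - same)


def needleman_wunsch_score(x: List[Quat], y: List[Quat], gap: float = -1.0) -> float:
    """Optimal global alignment score with a linear gap penalty, using two DP rows."""
    prev = [j * gap for j in range(len(y) + 1)]
    for i in range(1, len(x) + 1):
        cur = [i * gap] + [0.0] * len(y)
        for j in range(1, len(y) + 1):
            cur[j] = max(
                prev[j - 1] + pair_score(x[i - 1], y[j - 1]),  # align x[i-1] with y[j-1]
                prev[j] + gap,                                 # gap in y
                cur[j - 1] + gap,                              # gap in x
            )
        prev = cur
    return prev[-1]
```

With these assumed parameters, an ambiguous position contributes a score between the match and mismatch values instead of being forced to one or the other, which is the behaviour the quaternion representation is meant to capture.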
Torrell, Helena; Salas, Antonio; Abasolo, Nerea; Morén, Constanza; Garrabou, Glòria; Valero, Joaquín; Alonso, Yolanda; Vilella, Elisabet; Costas, Javier; Martorell, Lourdes
2014-10-01
It has been reported that certain genetic factors involved in schizophrenia could be located in the mitochondrial DNA (mtDNA). Therefore, we hypothesized that mtDNA mutations and/or variants would be present in schizophrenia patients and may be related to schizophrenia characteristics and mitochondrial function. This study was performed in three steps: (1) identification of pathogenic mutations and variants in 14 schizophrenia patients with an apparent maternal inheritance of the disease by sequencing the entire mtDNA; (2) case-control association study of 23 variants identified in step 1 (16 missense, 3 rRNA, and 4 tRNA variants) in 495 patients and 615 controls, and (3) analyses of the associated variants according to the clinical, psychopathological, and neuropsychological characteristics and according to the oxidative and enzymatic activities of the mitochondrial respiratory chain. We did not identify pathogenic mtDNA mutations in the 14 sequenced patients. Two known variants were nominally associated with schizophrenia and were further studied. The MT-RNR2 1811A > G variant likely does not play a major role in schizophrenia, as it was not associated with clinical, psychopathological, or neuropsychological variables, and the MT-ATP6 9110T > C p.Ile195Thr variant did not result in differences in the oxidative and enzymatic functions of the mitochondrial respiratory chain. The patients with apparent maternal inheritance of schizophrenia did not exhibit any mutations in their mtDNA. The variants nominally associated with schizophrenia in the present study were not related either to phenotypic characteristics or to mitochondrial function. We did not find evidence pointing to a role for mtDNA sequence variation in schizophrenia. © 2014 Wiley Periodicals, Inc.
Radiocarbon and DNA evidence for a pre-Columbian introduction of Polynesian chickens to Chile
Storey, Alice A.; Ramírez, José Miguel; Quiroz, Daniel; Burley, David V.; Addison, David J.; Walter, Richard; Anderson, Atholl J.; Hunt, Terry L.; Athens, J. Stephen; Huynen, Leon; Matisoo-Smith, Elizabeth A.
2007-01-01
Two issues long debated among Pacific and American prehistorians are (i) whether there was a pre-Columbian introduction of chicken (Gallus gallus) to the Americas and (ii) whether Polynesian contact with South America might be identified archaeologically, through the recovery of remains of unquestionable Polynesian origin. We present a radiocarbon date and an ancient DNA sequence from a single chicken bone recovered from the archaeological site of El Arenal-1, on the Arauco Peninsula, Chile. These results not only provide firm evidence for the pre-Columbian introduction of chickens to the Americas, but strongly suggest that it was a Polynesian introduction. PMID:17556540
A Brave New World for an Old World Pest: Helicoverpa armigera (Lepidoptera: Noctuidae) in Brazil
Walsh, Thomas; Thomazoni, Danielle; Silvie, Pierre; Behere, Gajanan T.; Anderson, Craig; Downes, Sharon
2013-01-01
The highly polyphagous Old World cotton bollworm Helicoverpa armigera is a quarantine agricultural pest for the American continents. Historically H. armigera is thought to have colonised the American continents around 1.5 to 2 million years ago, leading to the current H. zea populations on the American continents. The relatively recent species divergence history is evident in mating compatibility between H. zea and H. armigera under laboratory conditions. Despite periodic interceptions of H. armigera into North America, this pest species is not believed to have successfully established significant populations on either continent. In this study, we provide molecular evidence via mitochondrial DNA (mtDNA) cytochrome oxidase I (COI) and cytochrome b (Cyt b) partial gene sequences for the successful recent incursion of H. armigera into the New World, with individuals being detected at two sites (Primavera do Leste, Pedra Preta) within the State of Mato Grosso in Brazil. The mtDNA COI and Cyt b haplotypes detected in the Brazilian H. armigera individuals are common throughout the Old World, thus precluding identification of the founder populations. Combining the two partial mtDNA gene sequences showed that at least two matrilines are present in Brazil, while the inclusion of three nuclear DNA Exon-Primed Intron-Crossing (EPIC) markers identified a further two possible matrilines in our samples. The economic, biosecurity, resistance management, ecological and evolutionary implications of this incursion are discussed in relation to the current agricultural practices in the Americas. PMID:24260345
Large-Scale Concatenation cDNA Sequencing
Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.
1997-01-01
A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
What Information is Stored in DNA: Does it Contain Digital Error Correcting Codes?
NASA Astrophysics Data System (ADS)
Liebovitch, Larry
1998-03-01
The longest term correlations in living systems are the information stored in DNA which reflects the evolutionary history of an organism. The 4 bases (A,T,G,C) encode sequences of amino acids as well as locations of binding sites for proteins that regulate DNA. The fidelity of this important information is maintained by ANALOG error check mechanisms. When a single strand of DNA is replicated the complementary base is inserted in the new strand. Sometimes a wrong base is inserted that sticks out, disrupting the phosphate backbone. The new base is not yet methylated, so repair enzymes that slide along the DNA can tear out the wrong base and replace it with the right one. The bases in DNA form a sequence of 4 different symbols and so the information is encoded in a DIGITAL form. All the digital codes in our society (ISBN book numbers, UPC product codes, bank account numbers, airline ticket numbers) use error checking codes, where some digits are functions of other digits to maintain the fidelity of transmitted information. Does DNA also utilize a DIGITAL error checking code to maintain the fidelity of its information and increase the accuracy of replication? That is, are some bases in DNA functions of other bases upstream or downstream? This raises the interesting mathematical problem: How does one determine whether some symbols in a sequence of symbols are a function of other symbols? It also bears on the issue of determining algorithmic complexity: What is the function that generates the shortest algorithm for reproducing the symbol sequence? The error checking codes most used in our technology are linear block codes. We developed an efficient method to test for the presence of such codes in DNA. We coded the 4 bases as (0,1,2,3) and used Gaussian elimination, modified for modulus 4, to test if some bases are linear combinations of other bases. We used this method to analyze the base sequence in the genes from the lac operon and cytochrome C. 
We did not find evidence for such error correcting codes in these genes. However, we analyzed only a small amount of DNA and if digital error correcting schemes are present in DNA, they may be more subtle than such simple linear block codes. The basic issue we raise here is how information is stored in DNA, together with an appreciation that digital symbol sequences, such as DNA, admit of interesting schemes to store and protect the fidelity of their information content. Liebovitch, Tao, Todorov, Levine. 1996. Biophys. J. 71:1539-1544. Supported by NIH grant EY6234.
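The question of whether one base in a block is a linear function of the others modulo 4 can be sketched in a few lines of Python. This is not the authors' modified Gaussian elimination: it is a brute-force search over all coefficient choices, practical only for short blocks, and the particular letter-to-digit mapping in `CODE` and the function names are assumptions (the abstract says only that the bases were coded as 0, 1, 2, 3).

```python
from itertools import product
from typing import List, Optional, Tuple

# Hypothetical numeric coding of the four bases.
CODE = {"A": 0, "T": 1, "G": 2, "C": 3}


def to_blocks(seq: str, n: int) -> List[List[int]]:
    """Cut the sequence into consecutive blocks of n coded bases, dropping any remainder."""
    digits = [CODE[b] for b in seq]
    return [digits[i:i + n] for i in range(0, len(digits) - n + 1, n)]


def find_check_relation(blocks: List[List[int]], target: int) -> Optional[Tuple[int, ...]]:
    """Search for coefficients c such that, in every block,
    block[target] == sum(c[k] * block[k] for k != target) (mod 4).

    Returns the first coefficient tuple found (ordered by the non-target
    positions), or None if no linear relation holds across all blocks.
    """
    n = len(blocks[0])
    others = [k for k in range(n) if k != target]
    for coeffs in product(range(4), repeat=len(others)):
        if all(
            sum(c * blk[k] for c, k in zip(coeffs, others)) % 4 == blk[target]
            for blk in blocks
        ):
            return coeffs
    return None
```

For example, `find_check_relation(to_blocks("ATGTTCGCACGC", 3), target=2)` examines four blocks of three bases and asks whether the third base of each block is determined by the first two. With only a handful of blocks a relation can hold by chance, so any hit on real data would need many blocks before it meant anything.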
Mechanism of DNA binding enhancement by hepatitis B virus protein pX.
Palmer, C R; Gegnas, L D; Schepartz, A
1997-12-09
At least three hundred million people worldwide are infected with the hepatitis B virus (HBV), and epidemiological studies show a clear correlation between chronic HBV infection and the development of hepatocellular carcinoma. HBV encodes a protein, pX, which abducts the cellular transcriptional machinery in several ways including direct interactions with bZIP transcription factors. These interactions increase the DNA affinities of target bZIP proteins in a DNA sequence-dependent manner. Here we use a series of bZIP peptide models to explore the mechanism by which pX interacts with bZIP proteins. Our results suggest that pX increases bZIP.DNA stability by increasing the stability of the bZIP dimer as well as the affinity of the dimer for DNA. Additional experiments provide evidence for a mechanism in which pX recognizes the composite structure of the peptide.DNA complex, not simply the primary peptide sequence. These experiments provide a framework for understanding how pX alters the patterns of transcription within the nucleus. The similarities between the mechanism proposed for pX and the mechanism previously proposed for the human T-cell leukemia virus protein Tax are discussed.
Follett, Shelby E; Ingersoll, Azure D; Murray, Sally A; Reilly, Teresa M; Lehmann, Teresa E
2017-10-01
Bleomycins are a group of glycopeptide antibiotics synthesized by Streptomyces verticillus that are widely used for the treatment of various neoplastic diseases. These antibiotics have the ability to chelate a metal center, mainly Fe(II), and cause site-specific DNA cleavage. Bleomycins are differentiated by their C-terminal regions. Although this antibiotic family is a successful course of treatment for some types of cancers, it is known to cause pulmonary fibrosis. Previous studies have identified that bleomycin-related pulmonary toxicity is linked to the C-terminal region of these drugs. This region has been shown to closely interact with DNA. We examined the binding of Zn(II)peplomycin and Zn(II)bleomycin-A2 to a DNA hairpin of sequence 5'-CCAGTATTTTTACTGG-3', containing the binding site 5'-GT-3', and compared the results with those obtained from our studies of the same MBLMs bound to a DNA hairpin containing the binding site 5'-GC-3'. We provide evidence that the DNA base sequence has a strong impact on the final structure of the drug-target complex.
Mariella, Jr., Raymond P.
2008-11-18
A method of synthesizing a desired double-stranded DNA of a predetermined length and of a predetermined sequence. Preselected sequence segments that will complete the desired double-stranded DNA are determined. Preselected segment sequences of DNA that will be used to complete the desired double-stranded DNA are provided. The preselected segment sequences of DNA are assembled to produce the desired double-stranded DNA.
Tian, Wenzhi; Chua, Kevin; Strober, Warren; Chu, Charles C.
2002-01-01
BACKGROUND: Identification of differentially expressed genes between normal and diseased states is an area of intense current medical research that can lead to the discovery of new therapeutic targets. However, isolation of differentially expressed genes by subtraction often suffers from unreported contamination of the resulting subtraction library with clones containing DNA sequences not from the original RNA samples. MATERIALS AND METHODS: Subtraction using cDNA representational difference analysis (RDA) was performed on human B cells from normal or common variable immunodeficiency patients. The material remaining after the subtraction was cloned and individual clones were sequenced. The sequence of one clone with similarity to integrases (ILG1, integrase-like gene-1) was used to obtain the full length cDNA sequence and as a probe for the presence of this sequence in RNA or genomic DNA samples. RESULTS: After five rounds of cDNA RDA, 23.3% of the clones from the resulting subtraction library contained Escherichia coli DNA. In addition, three clones contained the sequence of a new integrase, ILG1. The full length cDNA sequence of ILG1 exhibits prokaryotic, but not eukaryotic, features. At the DNA level, ILG1 is not similar to any known gene. At the protein level, ILG1 has 58% similarity to integrases from the cryptic P4 bacteriophage family (S clade). The catalytic domain of ILG1 contains the conserved features found in site-specific recombinases. The critical residues that form the catalytic active site pocket are conserved, including the highly conserved R-H-R-Y hallmark of these recombinases. Interestingly, ILG1 was not present in the original B cell populations. By probing genomic DNA, ILG1 could only be detected in the E. coli TOP10F' strain used in our laboratory for molecular cloning, but not in any of its precursor strains, including TOP10. 
Furthermore, bacteria cultured from the mouth of the laboratory worker who performed cDNA RDA were also positive for ILG1. CONCLUSIONS: In the course of our studies using cDNA RDA, we have isolated and identified ILG1, a likely active site-specific recombinase and new member of the bacteriophage P4 family of integrases. This family of integrases is implicated in the horizontal DNA transfer of pathogenic genes between bacterial species, such as those found in pathogenic strains of E. coli, Shigella, Yersinia, and Vibrio cholerae. Using ILG1 as a marker of our laboratory E. coli strain TOP10F', our evidence suggests that contaminating bacterial DNA in our subtraction experiment is due to this laboratory bacterial strain, which colonized exposed surfaces of the laboratory worker. Thus, identification of differentially expressed genes between normal and diseased states could be dramatically improved by using extra precaution to prevent bacterial contamination of samples. PMID:12393938
Nanopore Technology: A Simple, Inexpensive, Futuristic Technology for DNA Sequencing.
Gupta, P D
2016-10-01
In health care, the importance of DNA sequencing is well established. Sanger capillary electrophoresis DNA sequencing is time-consuming and cumbersome, and hence expensive. Because of its versatility, DNA sequencing has lately become a household name, and there is therefore an urgent need for a simple, fast, inexpensive DNA sequencing technology. Efforts toward this began early in this century and led to the development of nanopore DNA sequencing; although it is still in its infancy, it is the futuristic technology.
Molecular detection and characterization of Anaplasma platys in dogs and ticks in Cuba.
Silva, Claudia Bezerra da; Santos, Huarrisson Azevedo; Navarrete, Maylín González; Ribeiro, Carla Carolina Dias Uzedo; Gonzalez, Belkis Corona; Zaldivar, Maykelin Fuentes; Pires, Marcus Sandes; Peckle, Maristela; Costa, Renata Lins da; Vitari, Gabriela Lopes Vivas; Massard, Carlos Luiz
2016-07-01
Canine cyclic thrombocytopenia, an infectious disease caused by Anaplasma platys, is a worldwide dog health problem. This study aimed to detect and characterize A. platys deoxyribonucleic acid (DNA) in dogs and ticks from Cuba using molecular methods. The study was conducted in four cities of Cuba (Habana del Este, Boyeros, Cotorro and San José de las Lajas). Blood samples were collected from 100 dogs in these cities. The animals were inspected for tick infestation and specimens were collected. Genomic DNA was extracted from dog blood and ticks using a commercial kit. Genomic DNA samples from blood and ticks were tested by a nested polymerase chain reaction (nPCR) to amplify 678 base pairs (bp) from the 16S ribosomal DNA (rDNA) of A. platys. Positive samples in nPCR were also subjected to PCR to amplify a fragment of 580 bp from the citrate synthase (gltA) gene and the products were sequenced. Only Rhipicephalus sanguineus sensu lato (s.l.) was found on dogs, and 10.20% (n=5/49) of these ticks and 16.0% (n=16/100) of dogs were considered positive for A. platys by nPCR targeting the 16S rDNA gene. All analyzed gltA and 16S rDNA sequences showed 99-100% identity with sequences of A. platys reported around the world. Phylogenetic analysis showed two defined clusters for the 16S rDNA gene and three defined clusters for the gltA gene. Based on the gltA gene, the deduced amino acid sequence showed two mutations at positions 88 and 168 compared with the sequence DQ525687 (GenBank ID from Italian sample), used as a reference in the alignment. A preliminary study on the epidemiological aspects associated with infection by A. platys showed no statistical association with the variables studied (p>0.05). This is the first evidence of the presence of A. platys in dogs and ticks in Cuba. Further studies are needed to evaluate the epidemiological aspects of A. platys infection in Cuban dogs. Copyright © 2016 Elsevier GmbH. All rights reserved.
The genome-wide DNA sequence specificity of the anti-tumour drug bleomycin in human cells.
Murray, Vincent; Chen, Jon K; Tanaka, Mark M
2016-07-01
The cancer chemotherapeutic agent, bleomycin, cleaves DNA at specific sites. For the first time, the genome-wide DNA sequence specificity of bleomycin breakage was determined in human cells. Utilising Illumina next-generation DNA sequencing techniques, over 200 million bleomycin cleavage sites were examined to elucidate the bleomycin genome-wide DNA selectivity. The genome-wide bleomycin cleavage data were analysed by four different methods to determine the cellular DNA sequence specificity of bleomycin strand breakage. For the most highly cleaved DNA sequences, the preferred site of bleomycin breakage was at 5'-GT* dinucleotide sequences (where the asterisk indicates the bleomycin cleavage site), with lesser cleavage at 5'-GC* dinucleotides. This investigation also determined longer bleomycin cleavage sequences, with preferred cleavage at 5'-GT*A and 5'-TGT* trinucleotide sequences, and 5'-TGT*A tetranucleotides. For cellular DNA, the hexanucleotide DNA sequence 5'-RTGT*AY (where R is a purine and Y is a pyrimidine) was the most highly cleaved DNA sequence. It was striking that alternating purine-pyrimidine sequences were highly cleaved by bleomycin. The highest intensity cleavage sites in cellular and purified DNA were very similar although there were some minor differences. Statistical nucleotide frequency analysis indicated a G nucleotide was present at the -3 position (relative to the cleavage site) in cellular DNA but was absent in purified DNA.
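As a small illustration of the hexanucleotide preference reported above, the following Python sketch locates occurrences of 5'-RTGT*AY in a sequence and reports the position of the T marked by the asterisk. It is only a motif scan on the strand given, not the authors' genome-wide analysis; the function name and the choice of 0-based coordinates are assumptions made here.

```python
import re
from typing import List

# IUPAC codes: R = purine (A or G), Y = pyrimidine (C or T).
# The lookahead lets overlapping occurrences be reported as well.
MOTIF = re.compile(r"(?=([AG]TGTA[CT]))")


def preferred_sites(seq: str) -> List[int]:
    """Return 0-based indices of the asterisk-marked T in each 5'-RTGT*AY match."""
    # The marked T is the fourth base of the motif, i.e. offset 3 from the match start.
    return [m.start() + 3 for m in MOTIF.finditer(seq.upper())]
```

Calling `preferred_sites` on the reverse complement of the same sequence would be needed to cover the opposite strand.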
Roy, Anirban; Spoorthi, P; Panwar, G; Bag, Manas Kumar; Prasad, T V; Kumar, Gunjeet; Gangopadhyay, K K; Dutta, M
2013-06-01
An evaluation of 70 accessions of ash gourd germplasm grown at National Bureau of Plant Genetic Resources, New Delhi, India during Kharif season (2010) showed natural occurrence of a yellow stunt disease in three accessions (IC554690, IC036330 and Pusa Ujjwal). A set of begomovirus specific primers used in PCR gave expected amplicon from all the symptomatic plants; however no betasatellite was detected. Complete genome of the begomovirus (DNA-A and DNA-B), amplified through rolling circle amplification, was cloned and sequenced. The begomovirus under study shared high sequence identities to different isolates of Tomato leaf curl New Delhi virus (ToLCNDV) and clustered with them. Among those isolates, the DNA-A and DNA-B of the present begomovirus isolate showed highest 99.6 and 96.8 % sequence identities, respectively with an isolate reported on pumpkin from India (DNA-A: AM286433, DNA-B: AM286435). Based on the sequence analysis, the begomovirus obtained from ash gourd was considered as an isolate of ToLCNDV. Thus, the present findings constitute the first report of occurrence of a new yellow stunt disease in ash gourd from India and demonstrated the association of ToLCNDV with the symptomatic samples. Occurrence of ToLCNDV in ash gourd germplasm not only adds up a new cucurbitaceous host of this virus but also raises the concern about the perpetuation of this virus in absence of its main host tomato and thus has an epidemiological relevance for understanding the rapid spread of this virus in tomato and other hosts in Indian sub-continent.
Repatriation and Identification of Finnish World War II Soldiers
Palo, Jukka U.; Hedman, Minttu; Söderholm, Niklas; Sajantila, Antti
2007-01-01
Aim To present a summary of the organization, field search, repatriation, forensic anthropological examination, and DNA analysis for the purpose of identification of Finnish soldiers with unresolved fate in World War II. Methods Field searches were organized, executed, and financed by the Ministry of Education and the Association for Cherishing the Memory of the Dead of the War. Anthropological examination conducted on human remains retrieved in the field searches was used to establish the minimum number of individuals and description of the skeletal diseases, treatment, anomalies, or injuries. DNA tests were performed by extracting DNA from powdered bones and blood samples from relatives. Mitochondrial DNA (mtDNA) sequence comparisons, together with circumstantial evidence, were used to connect the remains to the putative family members. Results At present, the skeletal remains of about a thousand soldiers have been found and repatriated. In forensic anthropological examination, several injuries related to death were documented. For the total of 181 bone samples, mtDNA HVR-1 and HVR-2 sequences were successfully obtained for 167 (92.3%) and 148 (81.8%) of the samples, respectively. Five samples yielded no reliable sequence data. Our data suggests that mtDNA preserves at least for 60 years in the boreal acidic soil. The quality of the obtained mtDNA sequence data varied depending on the sample bone type, with long compact bones (femur, tibia and humerus) having significantly better (90.0%) success rate than other bones (51.2%). Conclusion Although more than 60 years have passed since the World War II, our experience is that resolving the fate of soldiers missing in action is still of uttermost importance for people having lost their relatives in the war. Although cultural and individual differences may exist, our experience presented here gives a good perspective on the importance of individual identification performed by forensic professionals. PMID:17696308
Song, Xiaomin; Wang, Jing; Wu, Fang; Li, Xu; Teng, Maikun; Gong, Weimin
2005-01-01
SPE10 is an antifungal protein isolated from the seeds of Pachyrrhizus erosus. cDNA encoding a 47 amino acid peptide was cloned by RT-PCR, and the gene sequence showed SPE10 to be a new member of the plant defensin family. The synthetic cDNA with codons preferred in yeast was cloned into the pPIC9 plasmid directly in-frame with the secretion signal alpha-mating factor, and highly expressed in methylotrophic Pichia pastoris. Activity assays showed that recombinant SPE10, like native SPE10, specifically inhibited the growth of several pathogenic fungi. Circular dichroism and fluorescence spectroscopy analysis indicated that the native and recombinant proteins have the same fold, even though the sequence contains eight cysteine residues. Several lines of evidence suggest that SPE10 is the first dimeric plant defensin reported so far.
DNA Barcode Identification of Freshwater Snails in the Family Bithyniidae from Thailand
Kulsantiwong, Jutharat; Prasopdee, Sattrachai; Ruangsittichai, Jiraporn; Ruangjirachuporn, Wipaporn; Boonmars, Thidarut; Viyanant, Vithoon; Pierossi, Paola; Hebert, Paul D. N.; Tesana, Smarn
2013-01-01
Freshwater snails in the family Bithyniidae are the first intermediate host for Southeast Asian liver fluke (Opisthorchis viverrini), the causative agent of opisthorchiasis. Unfortunately, the subtle morphological characters that differentiate species in this group are not easily discerned by non-specialists. This is a serious matter because the identification of bithyniid species is a fundamental prerequisite for better understanding of the epidemiology of this disease. Because DNA barcoding, the analysis of sequence diversity in the 5’ region of the mitochondrial COI gene, has shown strong performance in other taxonomic groups, we decided to test its capacity to resolve 10 species/ subspecies of bithyniids from Thailand. Our analysis of 217 specimens indicated that COI sequences delivered species-level identification for 9 of 10 currently recognized species. The mean intraspecific divergence of COI was 2.3% (range 0-9.2 %), whereas sequence divergences between congeneric species averaged 8.7% (range 0-22.2 %). Although our results indicate that DNA barcoding can differentiate species of these medically-important snails, we also detected evidence for the presence of one overlooked species and one possible case of synonymy. PMID:24223896
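The intraspecific and interspecific divergence figures quoted above are summaries of pairwise sequence distances. The Python sketch below shows the general shape of such a calculation using the uncorrected p-distance on pre-aligned sequences; this is a simplifying assumption, since the abstract does not say which distance model was used, and the function names and input layout are hypothetical.

```python
from itertools import combinations
from statistics import mean
from typing import List, Tuple


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def mean_divergences(specimens: List[Tuple[str, str]]) -> Tuple[float, float]:
    """Return (mean within-species, mean between-species) p-distance.

    `specimens` is a list of (species_label, aligned_sequence) pairs and must
    contain at least one within-species and one between-species pair.
    """
    within: List[float] = []
    between: List[float] = []
    for (sp1, s1), (sp2, s2) in combinations(specimens, 2):
        (within if sp1 == sp2 else between).append(p_distance(s1, s2))
    return mean(within), mean(between)
```

A clear gap between the two means is what allows barcodes to separate species; overlapping ranges, as reported for one of the ten taxa, are where identification fails.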
Courtney, Joshua W.; Dryden, Richard L.; Montgomery, Jill; Schneider, Bradley S.; Smith, Gary; Massung, Robert F.
2003-01-01
Ixodes scapularis ticks were collected in 2000 and 2001 from two areas in Pennsylvania and tested for the presence of Anaplasma phagocytophilum and Borrelia burgdorferi by PCR and DNA sequencing. Of the ticks collected from northwestern and southeastern Pennsylvania, 162 of 263 (61.6%) and 25 of 191 (13.1%), respectively, were found to be positive for B. burgdorferi. DNA sequencing showed >99% identity with B. burgdorferi strains B31 and JD1. PCR testing for A. phagocytophilum revealed that 5 of 263 (1.9%) from northwestern Pennsylvania and 76 of 191 (39.8%) from southeastern Pennsylvania were positive. DNA sequencing revealed two genotypes of A. phagocytophilum, the human granulocytic ehrlichiosis (HGE) agent and a variant (AP-Variant 1) that has not been associated with human infection. Although only the HGE agent was present in northwestern Pennsylvania, both genotypes were found in southeastern Pennsylvania. These data add to a growing body of evidence showing that AP-Variant 1 is the predominant agent in areas where both genotypes coexist. PMID:12682147
Zhang, Tao; Talbert, Paul B; Zhang, Wenli; Wu, Yufeng; Yang, Zujun; Henikoff, Jorja G; Henikoff, Steven; Jiang, Jiming
2013-12-10
Plant and animal centromeres comprise megabases of highly repeated satellite sequences, yet centromere function can be specified epigenetically on single-copy DNA by the presence of nucleosomes containing a centromere-specific variant of histone H3 (cenH3). We determined the positions of cenH3 nucleosomes in rice (Oryza sativa), which has centromeres composed of both the 155-bp CentO satellite repeat and single-copy non-CentO sequences. We find that cenH3 nucleosomes protect 90-100 bp of DNA from micrococcal nuclease digestion, sufficient for only a single wrap of DNA around the cenH3 nucleosome core. cenH3 nucleosomes are translationally phased with 155-bp periodicity on CentO repeats, but not on non-CentO sequences. CentO repeats have an ∼10-bp periodicity in WW dinucleotides and in micrococcal nuclease cleavage, providing evidence for rotational phasing of cenH3 nucleosomes on CentO and suggesting that satellites evolve for translational and rotational stabilization of centromeric nucleosomes.
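The ∼10-bp periodicity in WW dinucleotides described above can be illustrated by counting how often two WW steps (W = A or T) occur a given distance apart. The following is a minimal sketch, not the authors' analysis pipeline; the function names and the synthetic test sequence are assumptions made for illustration only.

```python
def ww_positions(seq):
    """Return start indices of WW dinucleotides (W = A or T)."""
    seq = seq.upper()
    return [i for i in range(len(seq) - 1)
            if seq[i] in "AT" and seq[i + 1] in "AT"]


def ww_autocorrelation(seq, max_lag):
    """Count pairs of WW dinucleotides separated by each lag from 1 to max_lag."""
    positions = set(ww_positions(seq))
    return {lag: sum(1 for p in positions if p + lag in positions)
            for lag in range(1, max_lag + 1)}


# Hypothetical synthetic sequence: one AA step every 10 bp, GC filler elsewhere.
synthetic = ("AA" + "GCGCGCGC") * 10
counts = ww_autocorrelation(synthetic, 20)

# The lag with the most WW pairs is the dominant period of the toy sequence.
peak_lag = max(counts, key=counts.get)
print(peak_lag, counts[peak_lag])
```

On a real satellite repeat the signal would be noisier, so one would look for a local maximum near lag 10 rather than a single clean peak.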
Ribeyre, Cyril; Lopes, Judith; Boulé, Jean-Baptiste; Piazza, Aurèle; Guédin, Aurore; Zakian, Virginia A; Mergny, Jean-Louis; Nicolas, Alain
2009-05-01
In budding yeast, the Pif1 DNA helicase is involved in the maintenance of both nuclear and mitochondrial genomes, but its role in these processes is still poorly understood. Here, we provide evidence for a new Pif1 function by demonstrating that its absence promotes genetic instability of alleles of the G-rich human minisatellite CEB1 inserted in the Saccharomyces cerevisiae genome, but not of other tandem repeats. Inactivation of other DNA helicases, including Sgs1, had no effect on CEB1 stability. In vitro, we show that CEB1 repeats form stable G-quadruplex (G4) secondary structures and that the Pif1 protein unwinds these structures more efficiently than regular B-DNA. Finally, synthetic CEB1 arrays in which we mutated the potential G4-forming sequences were no longer destabilized in pif1Δ cells. Hence, we conclude that CEB1 instability in pif1Δ cells depends on the potential to form G-quadruplex structures, suggesting that Pif1 could play a role in the metabolism of G4-forming sequences.
Evidence for louse-transmitted diseases in soldiers of Napoleon's Grand Army in Vilnius.
Raoult, Didier; Dutour, Olivier; Houhamdi, Linda; Jankauskas, Rimantas; Fournier, Pierre-Edouard; Ardagna, Yann; Drancourt, Michel; Signoli, Michel; La, Vu Dang; Macia, Yves; Aboudharam, Gerard
2006-01-01
Many soldiers in Napoleon's Grand Army died of infectious diseases during its retreat from Russia. Because soldiers were commonly infested with body lice, it has been speculated that louse-borne infectious diseases, such as epidemic typhus (caused by Rickettsia prowazekii), were common. We investigated this possibility during recent excavations of a mass grave of Napoleon's soldiers in Vilnius, Lithuania. Segments of 5 body lice, identified morphologically and by polymerase chain reaction (PCR) amplification and sequencing, were found in earth from the grave that also contained fragments of soldiers' uniforms. DNA of Bartonella quintana (the agent of trench fever) was identified by PCR and sequencing in 3 of the lice. Similarly, PCR and sequencing of dental pulp from the remains of 35 soldiers revealed DNA of B. quintana in 7 soldiers and DNA of R. prowazekii in 3 other soldiers. Our results show that louse-borne infectious diseases affected nearly one-third of Napoleon's soldiers buried in Vilnius and indicate that these diseases might have been a major factor in the French retreat from Russia.
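The "nearly one-third" figure follows from the dental pulp counts given in the abstract. The short check below is only an arithmetic sketch; the variable names are illustrative, and it assumes, as the wording "3 other soldiers" implies, that no soldier was positive for both pathogens.

```python
# Counts taken from the abstract (dental pulp of 35 soldiers).
soldiers_tested = 35
b_quintana_positive = 7
r_prowazekii_positive = 3  # "3 other soldiers", so assumed not to overlap

louse_borne_positive = b_quintana_positive + r_prowazekii_positive
percent = round(100 * louse_borne_positive / soldiers_tested, 1)

print(f"{louse_borne_positive}/{soldiers_tested} = {percent}%")
```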
DNA barcoding insect–host plant associations
Jurado-Rivera, José A.; Vogler, Alfried P.; Reid, Chris A.M.; Petitpierre, Eduard; Gómez-Zurita, Jesús
2008-01-01
Short-sequence fragments (‘DNA barcodes’) used widely for plant identification and inventorying remain to be applied to complex biological problems. Host–herbivore interactions are fundamental to coevolutionary relationships of a large proportion of species on the Earth, but their study is frequently hampered by limited or unreliable host records. Here we demonstrate that DNA barcodes can greatly improve this situation as they (i) provide a secure identification of host plant species and (ii) establish the authenticity of the trophic association. Host plants of leaf beetles (subfamily Chrysomelinae) from Australia were identified using the chloroplast trnL(UAA) intron as barcode amplified from beetle DNA extracts. Sequence similarity and phylogenetic analyses provided precise identifications of each host species at tribal, generic and specific levels, depending on the available database coverage in various plant lineages. The 76 species of Chrysomelinae included—more than 10 per cent of the known Australian fauna—feed on 13 plant families, with preference for Australian radiations of Myrtaceae (eucalypts) and Fabaceae (acacias). Phylogenetic analysis of beetles shows general conservation of host association but with rare host shifts between distant plant lineages, including a few cases where barcodes supported two phylogenetically distant host plants. The study demonstrates that plant barcoding is already feasible with the current publicly available data. By sequencing plant barcodes directly from DNA extractions made from herbivorous beetles, strong physical evidence for the host association is provided. Thus, molecular identification using short DNA fragments brings together the detection of species and the analysis of their interactions. PMID:19004756
De Jonckheere, Johan F; Gryseels, Sophie; Eddyani, Miriam
2012-08-01
We have isolated several free-living amoeba strains from the environment in Ghana whose internal transcribed spacer sequences, including the 5.8S rDNA, are similar to sequences attributed to Vahlkampfiidae (Heterolobosea) in databases. However, morphological examination shows that the isolates belong to the Hartmannellidae (Amoebozoa). We provide evidence that the sequences in the databases are wrongly classified as belonging to a genus or species of the Vahlkampfiidae and instead belong to strains of the genus Hartmannella. Copyright © 2012 Elsevier GmbH. All rights reserved.
Gambley, C F; Geering, A D W; Steele, V; Thomas, J E
2008-01-01
A previously published partial sequence of pineapple bacilliform virus was shown to be from a retrotransposon (family Metaviridae) and not from a badnavirus as previously thought. Two newly discovered sequence groups isolated from pineapple were associated with bacilliform virions and were transmitted by mealybugs. Phylogenetic analyses indicated that they were members of new badnavirus species. A third caulimovirid sequence was also amplified from pineapple, but available evidence suggests that this DNA is not encapsidated, but more likely derived from an endogenous virus.
Sequence and Structure Dependent DNA-DNA Interactions
NASA Astrophysics Data System (ADS)
Kopchick, Benjamin; Qiu, Xiangyun
Molecular forces between dsDNA strands are largely dominated by electrostatics and have been extensively studied. Quantitative knowledge has been accumulated on how DNA-DNA interactions are modulated by varied biological constituents such as ions, cationic ligands, and proteins. Despite its central role in biology, the sequence of DNA has not received substantial attention, and "random" DNA sequences are typically used in biophysical studies. However, ~50% of the human genome is composed of non-random-sequence DNAs, particularly repetitive sequences. Furthermore, covalent modifications of DNA such as methylation play key roles in gene functions. Such DNAs with specific sequences or modifications often take on structures other than the canonical B-form. Here we present a series of quantitative measurements of DNA-DNA forces with the osmotic stress method on different DNA sequences, from short repeats to the most frequent sequences in the genome, and on modifications such as bromination and methylation. We observe peculiar behaviors that appear to be strongly correlated with the incurred structural changes. We speculate on the causes in terms of differences in hydration shell and DNA surface structures.
DNA secondary structures: stability and function of G-quadruplex structures
Bochman, Matthew L.; Paeschke, Katrin; Zakian, Virginia A.
2013-01-01
In addition to the canonical double helix, DNA can fold into various other inter- and intramolecular secondary structures. Although many such structures were long thought to be in vitro artefacts, bioinformatics demonstrates that DNA sequences capable of forming these structures are conserved throughout evolution, suggesting the existence of non-B-form DNA in vivo. In addition, genes whose products promote formation or resolution of these structures are found in diverse organisms, and a growing body of work suggests that the resolution of DNA secondary structures is critical for genome integrity. This Review focuses on emerging evidence relating to the characteristics of G-quadruplex structures and the possible influence of such structures on genomic stability and cellular processes, such as transcription. PMID:23032257
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection
Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T.
2017-01-01
Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry–dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a “universal” nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments; Nucleic acids; Mars; Panspermia. Astrobiology 17, 747–760. PMID:28704064
The presence of ancient human T-cell lymphotropic virus type I provirus DNA in an Andean mummy.
Li, H C; Fujiyoshi, T; Lou, H; Yashiki, S; Sonoda, S; Cartier, L; Nunez, L; Munoz, I; Horai, S; Tajima, K
1999-12-01
The worldwide geographic and ethnic clustering of patients with diseases related to human T-cell lymphotropic virus type I (HTLV-I) may be explained by the natural history of HTLV-I infection. The genetic characteristics of indigenous people in the Andes are similar to those of the Japanese, and HTLV-I is generally detected in both groups. To clarify the common origin of HTLV-I in Asia and the Andes, we analyzed HTLV-I provirus DNA from Andean mummies about 1,500 years old. Two of 104 mummy bone marrow specimens yielded a band of human beta-globin gene DNA 110 base pairs in length, and one of these two produced bands of HTLV-I-pX (open reading frame encoding p40x, p27x) and HTLV-I-LTR (long terminal repeat) gene DNA 159 base pairs and 157 base pairs in length, respectively. The nucleotide sequences of ancient HTLV-I-pX and HTLV-I-LTR clones isolated from mummy bone marrow were similar to those in contemporary Andeans and Japanese, although there was microheterogeneity in the sequences of some mummy DNA clones. This result provides evidence that HTLV-I was carried with ancient Mongoloids to the Andes before the Colonial era. Analysis of ancient HTLV-I sequences could be a useful tool for studying the history of human retroviral infection as well as human prehistoric migration.
Ancient HTLV type 1 provirus DNA of Andean mummy.
Sonoda, S; Li, H C; Cartier, L; Nunez, L; Tajima, K
2000-11-01
The worldwide geographic and ethnic clustering of patients with diseases related to human T cell lymphotropic virus type 1 (HTLV-1) may be explained by the natural history of HTLV-1 infection. The genetic characteristics of indigenous people in the Andes are similar to those of the Japanese, and HTLV-1 is generally detected in both groups. To clarify the common origin of HTLV-1 in Asia and the Andes, we analyzed HTLV-1 provirus DNA from Andean mummies about 1500 years old. Two of 104 mummy bone marrow specimens yielded a band of human beta-globin gene DNA 110 base pairs in length, and one of these two produced bands of HTLV-1-pX (open reading frame encoding p(40x), p(27x)) and HTLV-1-LTR (long terminal repeat) gene DNA 159 base pairs and 157 base pairs in length, respectively. The nucleotide sequences of ancient HTLV-1-pX and HTLV-1-LTR clones isolated from mummy bone marrow were similar to those in contemporary Andeans and Japanese, although there was microheterogeneity in the sequences of some mummy DNA clones. This result provides evidence that HTLV-1 was carried with ancient Mongoloids to the Andes before the Colonial era. Analysis of ancient HTLV-1 sequences could be a useful tool for studying the history of human retroviral infection as well as human prehistoric migration.
2013-01-01
Background: The etiology of fibromyalgia and chronic fatigue syndrome (FM/CFS) is currently unknown. A recurrent viral infection is an attractive hypothesis repeatedly found in the literature since it would explain the persistent pain and tiredness these patients suffer from. The initial striking link of two distinct orphan retroviruses, the gamma retrovirus murine leukemia virus (MLV)-related virus and the delta retrovirus T-lymphotropic virus type 2 (HTLV-2), to chronic fatigue has not been confirmed to date. Results: Genomic DNA (gDNA) from 75 fibromyalgia patients suffering from chronic fatigue and 79 age-matched local healthy controls was screened for the presence of MLV-related and HTLV-2 related proviral sequences. The XMRV env gene was amplified in 20% of samples tested (24% patients/15% healthy controls). Unexpectedly, no PCR amplifications from independent gDNA preparations of the same individuals were obtained. None of the positive samples showed the presence of contaminating murine sequences previously reported by other investigators, nor did they contain additional regions of the virus, leading us to conclude that the initial env amplification came from spurious air-driven amplicon contaminants. No specific HTLV-2 sequences were obtained at any time from any of the 154 quality-controlled gDNA preparations screened. Conclusions: Previous associations between MLV-related or HTLV-2 retrovirus infection and chronic fatigue must be discarded. Thus, studies showing positive amplification of HTLV-2 sequences from chronic fatigue participants should be revised for possible undetected technical problems. To avoid false positives of viral infection, not only should extreme precautions be taken when nested-PCR reactions are prepared and exhaustive foreign DNA contamination controls performed, but consistent amplification of diverse regions of the virus in independent preparations from the same individual must also be demanded.
The fact that our cohort of patients did not present evidence of either of the two types of retroviral infection formerly associated with chronic fatigue does not rule out the possibility that other viruses are involved in inciting or maintaining fibromyalgia and/or chronic fatigue conditions. PMID:24216038
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.
Hazkani-Covo, Einat; Martin, William F
2017-05-01
Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Wang, Hao-Ching; Ko, Tzu-Ping; Wu, Mao-Lun; Ku, Shan-Chi; Wu, Hsing-Ju; Wang, Andrew H.-J.
2012-01-01
DNA mimic proteins occupy the DNA binding sites of DNA-binding proteins, and prevent these sites from being accessed by DNA. We show here that the Neisseria conserved hypothetical protein DMP19 acts as a DNA mimic. The crystal structure of DMP19 shows a dsDNA-like negative charge distribution on the surface, suggesting that this protein should be added to the short list of known DNA mimic proteins. The crystal structure of another related protein, NHTF (Neisseria hypothetical transcription factor), provides evidence that it is a member of the xenobiotic-response element (XRE) family of transcriptional factors. NHTF binds to a palindromic DNA sequence containing a 5′-TGTNAN11TNACA-3′ recognition box that controls the expression of an NHTF-related operon in which the conserved nitrogen-response protein [i.e. (Protein-PII) uridylyltransferase] is encoded. The complementary surface charges between DMP19 and NHTF suggest specific charge–charge interaction. In a DNA-binding assay, we found that DMP19 can prevent NHTF from binding to its DNA-binding sites. Finally, we used an in situ gene regulation assay to provide evidence that NHTF is a repressor of its down-stream genes and that DMP19 can neutralize this effect. We therefore conclude that the interaction of DMP19 and NHTF provides a novel gene regulation mechanism in Neisseria spps. PMID:22373915
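The recognition box 5′-TGTNAN11TNACA-3′ can be read as a pattern: TGT, any base, A, eleven unspecified bases, then T, any base, ACA. As a minimal sketch of how such a box could be located in a sequence (this is not the authors' method; the function names and the example site are hypothetical), a regular expression is enough:

```python
import re

# TGTNA + 11 unspecified bases + TNACA, where N is any of A, C, G, T.
NHTF_BOX = re.compile(r"TGT[ACGT]A[ACGT]{11}T[ACGT]ACA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_boxes(seq):
    """Return (start, match) pairs for non-overlapping box matches."""
    return [(m.start(), m.group()) for m in NHTF_BOX.finditer(seq.upper())]


# Hypothetical 21-bp site built to fit the box; not a sequence from the paper.
example_site = "TGTCA" + "ACGTACGTACG" + "TGACA"
hits = find_boxes("GG" + example_site + "CC")
print(hits)
```

Because the two half-sites (TGTNA and TNACA) are reverse complements of each other, the reverse complement of any matching site also fits the pattern, which is what makes the box palindromic.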
Unraveling the Sex Chromosome Heteromorphism of the Paradoxical Frog Pseudis tocantins
Gatto, Kaleb Pretto; Busin, Carmen Silvia; Lourenço, Luciana Bolsoni
2016-01-01
The paradoxical frog Pseudis tocantins is the only species in the Hylidae family with known heteromorphic Z and W sex chromosomes. The Z chromosome is metacentric and presents an interstitial nucleolar organizer region (NOR) on the long arm that is adjacent to a pericentromeric heterochromatic band. In contrast, the submetacentric W chromosome carries a pericentromeric NOR on the long arm, which is adjacent to a clearly evident heterochromatic band that is larger than the band found on the Z chromosome and justifies the size difference observed between these chromosomes. Here, we provide evidence that the non-centromeric heterochromatic bands in Zq and Wq differ not only in size and location but also in composition, based on comparative genomic hybridization (CGH) and an analysis of the anuran PcP190 satellite DNA. The finding of PcP190 sequences in P. tocantins extends the presence of this satellite DNA, which was previously detected among Leptodactylidae and Hylodidae, suggesting that this family of repetitive DNA is even older than it was formerly considered. Seven groups of PcP190 sequences were recognized in the genome of P. tocantins. PcP190 probes mapped to the heterochromatic band in Wq, and a Southern blot analysis indicated the accumulation of PcP190 in the female genome of P. tocantins, which suggests the involvement of this satellite DNA in the evolution of the sex chromosomes of this species. PMID:27214234
Haag, Taiana; Santos, Anelisie S; De Angelo, Carlos; Srbek-Araujo, Ana Carolina; Sana, Dênis A; Morato, Ronaldo G; Salzano, Francisco M; Eizirik, Eduardo
2009-07-01
The elusive nature and endangered status of most carnivore species imply that efficient approaches for their non-invasive sampling are required to allow for genetic and ecological studies. Faecal samples are a major potential source of information, and reliable approaches are needed to foster their application in this field, particularly in areas where few studies have been conducted. A major obstacle to the reliable use of faecal samples is their uncertain species-level identification in the field, an issue that can be addressed with DNA-based assays. In this study we describe a sequence-based approach that efficiently distinguishes jaguar versus puma scats, and that presents several desirable properties: (1) considerably high amplification and sequencing rates; (2) multiple diagnostic sites reliably differentiating the two focal species; (3) high information content that allows for future application in other carnivores; (4) no evidence of amplification of prey DNA; and (5) no evidence of amplification of a nuclear mitochondrial DNA insertion known to occur in the jaguar. We demonstrate the reliability and usefulness of this approach by evaluating 55 field-collected samples from four locations in the highly fragmented Atlantic Forest biome of Brazil and Argentina, and document the presence of one or both of these endangered felids in each of these areas.
Evolutionary genomics of miniature inverted-repeat transposable elements (MITEs) in Brassica.
Nouroz, Faisal; Noreen, Shumaila; Heslop-Harrison, J S
2015-12-01
Miniature inverted-repeat transposable elements (MITEs) are truncated derivatives of autonomous DNA transposons, and are dispersed abundantly in most eukaryotic genomes. We aimed to characterize various MITE families in Brassica in terms of their presence, sequence characteristics and evolutionary activity. Dot plot analyses involving comparison of homoeologous bacterial artificial chromosome (BAC) sequences allowed identification of 15 novel families of mobile MITEs. Of these, 5 were Stowaway-like with TA Target Site Duplications (TSDs), 4 Tourist-like with TAA/TTA TSDs, 5 Mutator-like with 9-10 bp TSDs and 1 novel MITE (BoXMITE1) flanked by 3 bp TSDs. Our data suggested that there are about 30,000 MITE-related sequences in Brassica rapa and B. oleracea genomes. In situ hybridization showed one abundant family was dispersed in the A-genome, while another was located near 45S rDNA sites. PCR analysis using primers flanking sequences of MITE elements detected MITE insertion polymorphisms between and within the three Brassica (AA, BB, CC) genomes, with many insertions being specific to single genomes and others showing evidence of more recent evolutionary insertions. Our BAC sequence comparison strategy enables identification of evolutionarily active MITEs with no prior knowledge of MITE sequences. The details of MITE families reported in Brassica enable their identification, characterization and annotation. Insertion polymorphisms of MITEs and their transposition activity indicate an important mechanism of genome evolution and diversification. MITE families derived from known Mariner, Harbinger and Mutator DNA transposons were discovered, as well as some novel structures. The identification of Brassica MITEs will have broad applications in Brassica genomics, breeding, hybridization and phylogeny through their use as DNA markers.
Ma, Hongying; Wu, Yajiang; Xiang, Hai; Yang, Yunzhou; Wang, Min; Zhao, Chunjiang; Wu, Changxin
2018-01-01
There are large populations of indigenous horses (Equus caballus) in China and some other parts of East Asia. However, their matrilineal genetic diversity and origin remain poorly understood. Using a combination of mitochondrial DNA (mtDNA) and hypervariable region (HVR-1) sequences, we aim to investigate the origin of matrilineal inheritance in these domestic horses. To investigate patterns of matrilineal inheritance in domestic horses, we conducted a phylogenetic study using 31 de novo mtDNA genomes together with 317 others from GenBank. In terms of the updated phylogeny, a total of 5,180 horse mitochondrial HVR-1 sequences were analyzed. Eighteen haplogroups (Aw-Rw) were uncovered from the analysis of the whole mitochondrial genomes. Most of these have a divergence time before the earliest domestication of wild horses (about 5,800 years ago) and during the Upper Paleolithic (35-10 KYA). The distribution of some haplogroups shows geographic patterns. The Lw haplogroup contained a significantly higher proportion of European horses than of horses from other regions, while haplogroups Jw, Rw, and some maternal lineages of Cw have a higher frequency in horses from East Asia. The 5,180 sequences of horse mitochondrial HVR-1 form nine major haplogroups (A-I). We revealed a corresponding relationship between the haplotypes of HVR-1 and those of whole mitochondrial DNA sequences. The HVR-1 sequence data also suggest that Jw, Rw, and some haplotypes of Cw may have originated in East Asia while Lw probably formed in Europe. Our study supports the hypothesis of multiple origins of the maternal lineages of domestic horses and suggests that some maternal lineages of domestic horses may have originated in East Asia.
Wang, Yiqin; Picard, Martin; Gu, Zhenglong
2016-10-01
Increasing clinical and biochemical evidence implicates mitochondrial dysfunction in the pathophysiology of Autism Spectrum Disorder (ASD), but little is known about the biological basis for this connection. A possible cause of ASD is genetic variation in the mitochondrial DNA (mtDNA) sequence, which has yet to be thoroughly investigated in large genomic studies of ASD. Here we evaluated mtDNA variation, including the mixture of different mtDNA molecules in the same individual (i.e., heteroplasmy), using whole-exome sequencing data from mother-proband-sibling trios from simplex families (n = 903) where only one child is affected by ASD. We found that heteroplasmic mutations in autistic probands were enriched at non-polymorphic mtDNA sites (P = 0.0015), which were more likely to confer deleterious effects than heteroplasmies at polymorphic mtDNA sites. Accordingly, we observed a ~1.5-fold enrichment of nonsynonymous mutations (P = 0.0028) as well as a ~2.2-fold enrichment of predicted pathogenic mutations (P = 0.0016) in autistic probands compared to their non-autistic siblings. Both nonsynonymous and predicted pathogenic mutations private to probands conferred increased risk of ASD (Odds Ratio, OR[95% CI] = 1.87[1.14-3.11] and 2.55[1.26-5.51], respectively), and their influence on ASD was most pronounced in families with probands showing diminished IQ and/or impaired social behavior compared to their non-autistic siblings. We also showed that the genetic transmission pattern of mtDNA heteroplasmies with high pathogenic potential differed between mother-autistic proband pairs and mother-sibling pairs, implicating developmental and possibly in utero contributions. Taken together, our genetic findings substantiate pathogenic mtDNA mutations as a potential cause for ASD and synergize with recent work calling attention to their unique metabolic phenotypes for diagnosis and treatment of children with ASD.
Ferri, Gianmarco; Alù, Milena; Corradini, Beatrice; Beduschi, Giovanni
2009-09-01
Forensic botany can provide significant supporting evidence during criminal investigations. However, it is still an underutilized field of investigation with its most common application limited to identifying specific as well as suspected illegal plants. The ubiquitous presence of plant species can be useful in forensics, but the absence of an accurate identification system remains the major obstacle to routinely and correctly identifying trace botanical evidence. Many plant materials cannot be identified and differentiated to the species level by traditional morphological characteristics when botanical specimens are degraded and lack physical features. By taking advantage of a universal barcode system, DNA sequencing, and other biomolecular techniques used routinely in forensic investigations, two chloroplast DNA regions were evaluated for their use as "barcoding" markers for plant identification in the field of forensics. We therefore investigated the forensic use of two non-coding plastid regions, psbA-trnH and trnL-trnF, to create a multimarker system for species identification that could be useful throughout the plant kingdom. The sequences from 63 plants belonging to our local flora were submitted and registered on the GenBank database. Sequence comparison to set up the level of identification (species, genus, or family) through Blast algorithms allowed us to assess the suitability of this method. The results confirmed the effectiveness of our botanic universal multimarker assay in forensic investigations.
Evidence for a Pneumocystis carinii Flo8-like transcription factor: insights into organism adhesion.
Kottom, Theodore J; Limper, Andrew H
2016-02-01
Pneumocystis carinii (Pc) adhesion to alveolar epithelial cells is well established and is thought to be a prerequisite for the initiation of Pneumocystis pneumonia. Pc binding events occur in part through the major Pc surface glycoprotein Msg, as well as an integrin-like molecule termed PcInt1. Recent data from the Pc sequencing project also demonstrate DNA sequences homologous to other genes important in Candida spp. binding to mammalian host cells, as well as organism binding to polystyrene surfaces and in biofilm formation. One of these genes, flo8, a transcription factor needed for downstream cAMP/PKA-pathway-mediated activation of the major adhesin/flocculin Flo11 in yeast, was cloned from a Pc cDNA library utilizing a partial sequence available in the Pc genome database. A CHEF blot of Pc genomic DNA yielded a single band providing evidence this gene is present in the organism. BLASTP analysis of the predicted protein demonstrated 41% homology to the Saccharomyces cerevisiae Flo8. Northern blotting demonstrated greatest expression at pH 6.0-8.0, pH comparable to reported fungal biofilm milieu. Western blot and immunoprecipitation assays of PcFlo8 protein in isolated cyst and tropic life forms confirmed the presence of the cognate protein in these Pc life forms. Heterologous expression of Pcflo8 cDNA in flo8Δ-deficient yeast strains demonstrated that the Pcflo8 was able to restore yeast binding to polystyrene and invasive growth of yeast flo8Δ cells. Furthermore, Pcflo8 promoted yeast binding to HEK293 human epithelial cells, strengthening its functional classification as a Flo8 transcription factor. Taken together, these data suggest that PcFlo8 is expressed by Pc and may exert activity in organism adhesion and biofilm formation.
Evidence for a Pneumocystis carinii Flo8-like Transcription Factor: Insights into Organism Adhesion
Kottom, Theodore J.; Limper, Andrew H.
2015-01-01
Pneumocystis carinii (Pc) adhesion to alveolar epithelial cells is well established and is thought to be a prerequisite for initiation of Pneumocystis pneumonia. Pc binding events occur in part through the major Pc surface glycoprotein Msg, as well as an integrin-like molecule termed PcInt1. Recent data from the Pc sequencing project also demonstrate DNA sequences homologous to other genes important in Candida spp. binding to mammalian host cells, as well as organism binding to polystyrene surfaces and in biofilm formation. One of these genes, flo8, a transcription factor needed for downstream cAMP/PKA-pathway-mediated activation of the major adhesin/flocculin Flo11 in yeast, was cloned from a Pc cDNA library utilizing a partial sequence available in the Pc genome database. A CHEF blot of Pc genomic DNA yielded a single band providing evidence this gene is present in the organism. BLASTP analysis of the predicted protein demonstrated 41% homology to the Saccharomyces cerevisiae Flo8. Northern blotting demonstrated greatest expression at pH 6.0–8.0, pH comparable to reported fungal biofilm milieu. Western blot and immunoprecipitation assays of PcFlo8 protein in isolated cyst and tropic life forms confirmed the presence of the cognate protein in these Pc life forms. Heterologous expression of Pcflo8 cDNA in flo8Δ (deficient) yeast strains demonstrated that Pcflo8 was able to restore yeast binding to polystyrene and invasive growth of yeast flo8Δ cells. Furthermore, Pcflo8 promoted yeast binding to HEK293 human epithelial cells, strengthening its functional classification as a Flo8 transcription factor. Taken together, these data suggest that PcFlo8 is expressed by Pc and may exert activity in organism adhesion and biofilm formation. PMID:26215665
NASA Technical Reports Server (NTRS)
La Duc, Myron T.; Sumner, Randall; Pierson, Duane; Venkat, Parth; Venkateswaran, Kasthuri
2004-01-01
Molecular analyses were carried out on four preflight and six postflight International Space Station (ISS)-associated potable water samples at various stages of purification, storage, and transport, to ascertain their associated microbial diversities and overall microbial burdens. Following DNA extraction, PCR amplification, and molecular cloning procedures, rDNA sequences closely related to pathogenic species of Acidovorax, Afipia, Brevundimonas, Propionibacterium, Serratia, and others were recovered in varying abundance. Retrieval of sequences arising from the iodine (biocide)-reducing Delftia acidovorans in postflight waters is also of concern. Total microbial burdens of ISS potable waters were derived from data generated by an ATP-based enumeration procedure, with results ranging from 0 to 4.9 × 10^4 cells/ml. Regardless of innate biases in sample collection and analysis, such circumstantial evidence for the presence of viable, intact pathogenic cells should not be taken lightly. Implementation of new cultivation approaches and/or viability-based assays is requisite to confirm such an occurrence.
Molecular evidence for a uniform microbial community in sponges from different oceans.
Hentschel, Ute; Hopke, Jörn; Horn, Matthias; Friedrich, Anja B; Wagner, Michael; Hacker, Jörg; Moore, Bradley S
2002-09-01
Sponges (class Porifera) are evolutionarily ancient metazoans that populate the tropical oceans in great abundances but also occur in temperate regions and even in freshwater. Sponges contain large numbers of bacteria that are embedded within the animal matrix. The phylogeny of these bacteria and the evolutionary age of the interaction are virtually unknown. In order to provide insights into the species richness of the microbial community of sponges, we performed a comprehensive diversity survey based on 190 sponge-derived 16S ribosomal DNA (rDNA) sequences. The sponges Aplysina aerophoba and Theonella swinhoei were chosen for construction of the bacterial 16S rDNA library because they are taxonomically distantly related and they populate nonoverlapping geographic regions. In both sponges, a uniform microbial community was discovered whose phylogenetic signature is distinctly different from that of marine plankton or marine sediments. Altogether 14 monophyletic, sponge-specific sequence clusters were identified that belong to at least seven different bacterial divisions. By definition, the sequences of each cluster are more closely related to each other than to a sequence from nonsponge sources. These monophyletic clusters comprise 70% of all publicly available sponge-derived 16S rDNA sequences, reflecting the generality of the observed phenomenon. This shared microbial fraction represents the smallest common denominator of the sponges investigated in this study. Bacteria that are exclusively found in certain host species or that occur only transiently would have been missed. A picture emerges where sponges can be viewed as highly concentrated reservoirs of so far uncultured and elusive marine microorganisms.
Iwanowicz, L; Densmore, C; Hahn, C; McAllister, P; Odenkirk, J
2013-09-01
The Northern Snakehead Channa argus is an introduced species that now inhabits the Chesapeake Bay. During a preliminary survey for introduced pathogens possibly harbored by these fish in Virginia waters, a filterable agent was isolated from five specimens that produced cytopathic effects in BF-2 cells. Based on PCR amplification and partial sequencing of the major capsid protein (MCP), DNA polymerase (DNApol), and DNA methyltransferase (Mtase) genes, the isolates were identified as Largemouth Bass virus (LMBV). Nucleotide sequences of the MCP (492 bp) and DNApol (419 bp) genes were 100% identical to those of LMBV. The nucleotide sequence of the Mtase (206 bp) gene was 99.5% identical to that of LMBV, and the single nucleotide substitution did not lead to a predicted amino acid coding change. This is the first report of LMBV from the Northern Snakehead, and provides evidence that noncentrarchid fishes may be susceptible to this virus.
Iwanowicz, Luke R.; Densmore, Christine L.; Hahn, Cassidy M.; McAllister, Phillip; Odenkirk, John
2013-01-01
The Northern Snakehead Channa argus is an introduced species that now inhabits the Chesapeake Bay. During a preliminary survey for introduced pathogens possibly harbored by these fish in Virginia waters, a filterable agent was isolated from five specimens that produced cytopathic effects in BF-2 cells. Based on PCR amplification and partial sequencing of the major capsid protein (MCP), DNA polymerase (DNApol), and DNA methyltransferase (Mtase) genes, the isolates were identified as Largemouth Bass virus (LMBV). Nucleotide sequences of the MCP (492 bp) and DNApol (419 bp) genes were 100% identical to those of LMBV. The nucleotide sequence of the Mtase (206 bp) gene was 99.5% identical to that of LMBV, and the single nucleotide substitution did not lead to a predicted amino acid coding change. This is the first report of LMBV from the Northern Snakehead, and provides evidence that noncentrarchid fishes may be susceptible to this virus.
A High-Throughput Process for the Solid-Phase Purification of Synthetic DNA Sequences
Grajkowski, Andrzej; Cieślak, Jacek; Beaucage, Serge L.
2017-01-01
An efficient process for the purification of synthetic phosphorothioate and native DNA sequences is presented. The process is based on the use of an aminopropylated silica gel support functionalized with aminooxyalkyl functions to enable capture of DNA sequences through an oximation reaction with the keto function of a linker conjugated to the 5′-terminus of DNA sequences. Deoxyribonucleoside phosphoramidites carrying this linker, as a 5′-hydroxyl protecting group, have been synthesized for incorporation into DNA sequences during the last coupling step of a standard solid-phase synthesis protocol executed on a controlled pore glass (CPG) support. Solid-phase capture of the nucleobase- and phosphate-deprotected DNA sequences released from the CPG support is demonstrated to proceed near quantitatively. Shorter than full-length DNA sequences are first washed away from the capture support; the solid-phase purified DNA sequences are then released from this support upon reaction with tetra-n-butylammonium fluoride in dry dimethylsulfoxide (DMSO) and precipitated in tetrahydrofuran (THF). The purity of solid-phase-purified DNA sequences exceeds 98%. The simulated high-throughput and scalability features of the solid-phase purification process are demonstrated without sacrificing purity of the DNA sequences. PMID:28628204
Zhou, X.; Robinson, J.L.; Geraci, C.J.; Parker, C.R.; Flint, O.S.; Etnier, D.A.; Ruiter, D.; DeWalt, R.E.; Jacobus, L.M.; Hebert, P.D.N.
2011-01-01
Deoxyribonucleic acid (DNA) barcoding is an effective tool for species identification and lifestage association in a wide range of animal taxa. We developed a strategy for rapid construction of a regional DNA-barcode reference library and used the caddisflies (Trichoptera) of the Great Smoky Mountains National Park (GSMNP) as a model. Nearly 1000 cytochrome c oxidase subunit I (COI) sequences, representing 209 caddisfly species previously recorded from GSMNP, were obtained from the global Trichoptera Barcode of Life campaign. Most of these sequences were collected from outside the GSMNP area. Another 645 COI sequences, representing 80 species, were obtained from specimens collected in a 3-d bioblitz (short-term, intense sampling program) in GSMNP. The joint collections provided barcode coverage for 212 species, 91% of the GSMNP fauna. Inclusion of samples from other localities greatly expedited construction of the regional DNA-barcode reference library. This strategy increased intraspecific divergence and decreased average distances to nearest neighboring species, but the DNA-barcode library was able to differentiate 93% of the GSMNP Trichoptera species examined. Global barcoding projects will aid construction of regional DNA-barcode libraries, but local surveys make crucial contributions to progress by contributing rare or endemic species and full-length barcodes generated from high-quality DNA. DNA taxonomy is not a goal of our present work, but the investigation of COI divergence patterns in caddisflies is providing new insights into broader biodiversity patterns in this group and has directed attention to various issues, ranging from the need to re-evaluate species taxonomy with integrated morphological and molecular evidence to the necessity of an appropriate interpretation of barcode analyses and its implications in understanding species diversity (in contrast to a simple claim for barcoding failure).
An improved model for whole genome phylogenetic analysis by Fourier transform.
Yin, Changchuan; Yau, Stephen S-T
2015-10-07
DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences have undergone rearrangements, as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into the frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor, and the 2D mapping reduces the nucleotide composition bias in the distance measure, thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space; the Euclidean distances of the full Fourier power spectra of the DNA sequences are then used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. 
The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
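The pipeline this abstract describes (numerical mapping, DFT power spectrum, scaling spectra to a common length, Euclidean distance) can be sketched roughly as below. This is an illustration under stated assumptions, not the authors' implementation: the nucleotide-to-plane values in `MAPPING` are assumed, `stretch` uses plain linear interpolation as a stand-in for the even scaling algorithm, and all function names are hypothetical. The naive O(n²) DFT is only suitable for short, non-empty sequences.

```python
import cmath
import math

# Assumed 2D mapping: each nucleotide becomes a point in the plane,
# stored as a complex number. The abstract does not give the values.
MAPPING = {"A": 0 - 1j, "T": 0 + 1j, "C": -1 + 0j, "G": 1 + 0j}


def power_spectrum(seq):
    """Return the DFT power spectrum |X[k]|^2 of a mapped DNA sequence."""
    x = [MAPPING[base] for base in seq.upper()]
    n = len(x)
    spectrum = []
    for k in range(n):
        xk = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(abs(xk) ** 2)
    return spectrum


def stretch(spectrum, m):
    """Stretch a spectrum to length m (m >= len(spectrum)) by linear interpolation.

    Hypothetical stand-in for the even scaling step named in the abstract.
    """
    n = len(spectrum)
    if n == m:
        return list(spectrum)
    out = []
    for k in range(m):
        q = k * (n - 1) / (m - 1)  # fractional index into the shorter spectrum
        lo = int(math.floor(q))
        hi = min(lo + 1, n - 1)
        out.append(spectrum[lo] + (q - lo) * (spectrum[hi] - spectrum[lo]))
    return out


def dft_distance(seq_a, seq_b):
    """Euclidean distance between length-matched power spectra."""
    pa, pb = power_spectrum(seq_a), power_spectrum(seq_b)
    m = max(len(pa), len(pb))
    pa, pb = stretch(pa, m), stretch(pb, m)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(pa, pb)))
```

A matrix of `dft_distance` values over a set of sequences could then be passed to any hierarchical clustering routine, which is how the abstract says the measure was assessed.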
Replication Protein A-1 Has a Preference for the Telomeric G-rich Sequence in Trypanosoma cruzi.
Pavani, Raphael Souza; Vitarelli, Marcela O; Fernandes, Carlos A H; Mattioli, Fabio F; Morone, Mariana; Menezes, Milene C; Fontes, Marcos R M; Cano, Maria Isabel N; Elias, Maria Carolina
2018-05-01
Replication protein A (RPA), the major eukaryotic single-stranded binding protein, is a heterotrimeric complex formed by RPA-1, RPA-2, and RPA-3. RPA is a fundamental player in replication, repair, recombination, and checkpoint signaling. In addition, increasing evidence has been adding functions to RPA in telomere maintenance, such as interaction with telomerase to facilitate its activity and also involvement in telomere capping in some conditions. Trypanosoma cruzi, the etiological agent of Chagas disease, is a protozoan parasite that appears early in the evolution of eukaryotes. Recently, we have shown that T. cruzi RPA presents canonical functions, being involved with DNA replication and DNA damage response. Here, we found by FISH/IF assays that T. cruzi RPA localizes at telomeres even outside replication (S) phase. In vitro analysis showed that one telomeric repeat is sufficient to bind RPA-1. Telomeric DNA induces different secondary structural modifications on RPA-1 in comparison with other types of DNA. In addition, RPA-1 presents a higher affinity for telomeric sequence compared to random sequence, suggesting that RPA may play specific roles in the T. cruzi telomeric region. © 2017 The Author(s) Journal of Eukaryotic Microbiology © 2017 International Society of Protistologists.
A Critical Reassessment of the Role of Mitochondria in Tumorigenesis
Salas, Antonio; Yao, Yong-Gang; Macaulay, Vincent; Vega, Ana; Carracedo, Ángel; Bandelt, Hans-Jürgen
2005-01-01
Background Mitochondrial DNA (mtDNA) is being analyzed by an increasing number of laboratories in order to investigate its potential role as an active marker of tumorigenesis in various types of cancer. Here we question the conclusions drawn in most of these investigations, especially those published in high-rank cancer research journals, under the evidence that a significant number of these medical mtDNA studies are based on obviously flawed sequencing results. Methods and Findings In our analyses, we take a phylogenetic approach and employ thorough database searches, which together have proven successful for detecting erroneous sequences in the fields of human population genetics and forensics. Apart from conceptual problems concerning the interpretation of mtDNA variation in tumorigenesis, in most cases, blocks of seemingly somatic mutations clearly point to contamination or sample mix-up and, therefore, have nothing to do with tumorigenesis. Conclusion The role of mitochondria in tumorigenesis remains unclarified. Our findings of laboratory errors in many contributions would represent only the tip of the iceberg since most published studies do not provide the raw sequence data for inspection, thus hindering a posteriori evaluation of the results. There is no precedent for such a concatenation of errors and misconceptions affecting a whole subfield of medical research. PMID:16187796
Yin, Changchuan
2015-04-01
To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulated annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by the Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
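The 3-periodicity SNR that the optimization targets can be illustrated with the 4D binary (indicator) representation the abstract uses as its baseline. The sketch below assumes the SNR is the power at frequency N/3 divided by the mean power over the non-zero frequencies; the abstract does not state the exact definition, and the function names are hypothetical. It does not implement the GCC mapping or the simulated annealing step.

```python
import cmath
import math


def indicator_power(seq):
    """Summed DFT power of the four binary indicator sequences (4D binary mapping)."""
    seq = seq.upper()
    n = len(seq)
    power = [0.0] * n
    for base in "ACGT":
        # Indicator sequence: 1 where this base occurs, 0 elsewhere.
        u = [1.0 if s == base else 0.0 for s in seq]
        for k in range(n):
            xk = sum(u[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            power[k] += abs(xk) ** 2
    return power


def snr_3_periodicity(seq):
    """Power at k = N/3 relative to the mean power over k = 1..N-1 (assumed definition)."""
    n = len(seq)
    if n < 3 or n % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")
    power = indicator_power(seq)
    mean_power = sum(power[1:]) / (n - 1)  # skip k = 0, the DC component
    if mean_power < 1e-12:  # e.g. a homopolymer: no signal to compare against
        return 0.0
    return power[n // 3] / mean_power
```

A perfectly codon-periodic toy sequence such as `"ATG" * 10` concentrates its power at N/3 and therefore scores far higher than a sequence without period-3 structure, which is the property exon predictors of this kind exploit.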
Single-cell genomic sequencing using Multiple Displacement Amplification.
Lasken, Roger S
2007-10-01
Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
A novel paired domain DNA recognition motif can mediate Pax2 repression of gene transcription.
Håvik, B; Ragnhildstveit, E; Lorens, J B; Saelemyr, K; Fauske, O; Knudsen, L K; Fjose, A
1999-12-20
The paired domain (PD) is an evolutionarily conserved DNA-binding domain encoded by the Pax gene family of developmental regulators. The Pax proteins are transcription factors and are involved in a variety of processes such as brain development, patterning of the central nervous system (CNS), and B-cell development. In this report we demonstrate that the zebrafish Pax2 PD can interact with a novel type of DNA sequences in vitro, the triple-A motif, consisting of a heptameric nucleotide sequence G/CAAACA/TC with an invariant core of three adjacent adenosines. This recognition sequence was found to be conserved in known natural Pax5 repressor elements involved in controlling the expression of the p53 and J-chain genes. By identifying similar high affinity binding sites in potential target genes of the Pax2 protein, including the pax2 gene itself, we obtained further evidence that the triple-A sites are biologically significant. The putative natural target sites also provide a basis for defining an extended consensus recognition sequence. In addition, we observed in transformation assays a direct correlation between Pax2 repressor activity and the presence of triple-A sites. The results suggest that a transcriptional regulatory function of Pax proteins can be modulated by PD binding to different categories of target sequences. Copyright 1999 Academic Press.
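As a small illustration of the recognition sequence described above, the heptamer G/CAAACA/TC can be written as a regular expression and scanned for in a DNA string. The sketch assumes the slashes denote alternatives at a single position (G or C, then AAAC, then A or T, then C); the function name and the example sequences are hypothetical, and only the given strand is searched.

```python
import re

# Assumed reading of "G/CAAACA/TC": [G or C] A A A C [A or T] C.
# The lookahead lets overlapping occurrences be reported as well.
TRIPLE_A = re.compile(r"(?=([GC]AAAC[AT]C))")


def find_triple_a_sites(seq):
    """Return (0-based start, matched heptamer) for each motif occurrence."""
    return [(m.start(), m.group(1)) for m in TRIPLE_A.finditer(seq.upper())]
```

To cover both strands, the same function could be applied to the reverse complement of the input; that step is left out here to keep the sketch minimal.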
Lo, Yu-Sheng; Tseng, Wen-Hsuan; Chuang, Chien-Ying; Hou, Ming-Hon
2013-01-01
The potent anticancer drug actinomycin D (ActD) functions by intercalating into DNA at GpC sites, thereby interrupting essential biological processes including replication and transcription. Certain neurological diseases are correlated with the expansion of (CGG)n trinucleotide sequences, which contain many contiguous GpC sites separated by a single G:G mispair. To characterize the binding of ActD to CGG triplet repeat sequences, the structural basis for the strong binding of ActD to neighbouring GpC sites flanking a G:G mismatch has been determined based on the crystal structure of ActD bound to ATGCGGCAT, which contains a CGG triplet sequence. The binding of ActD molecules to GCGGC causes many unexpected conformational changes including nucleotide flipping out, a sharp bend and a left-handed twist in the DNA helix via a two site-binding model. Heat denaturation, circular dichroism and surface plasmon resonance analyses showed that adjacent GpC sequences flanking a G:G mismatch are preferred ActD-binding sites. In addition, ActD was shown to bind the hairpin conformation of (CGG)16 in a pairwise combination and with greater stability than that of other DNA intercalators. Our results provide evidence of a possible biological consequence of ActD binding to CGG triplet repeat sequences. PMID:23408860
The female urinary microbiome in urgency urinary incontinence.
Pearce, Meghan M; Zilliox, Michael J; Rosenfeld, Amy B; Thomas-White, Krystal J; Richter, Holly E; Nager, Charles W; Visco, Anthony G; Nygaard, Ingrid E; Barber, Matthew D; Schaffer, Joseph; Moalli, Pamela; Sung, Vivian W; Smith, Ariana L; Rogers, Rebecca; Nolen, Tracy L; Wallace, Dennis; Meikle, Susan F; Gai, Xiaowu; Wolfe, Alan J; Brubaker, Linda
2015-09-01
The purpose of this study was to characterize the urinary microbiota in women who are planning treatment for urgency urinary incontinence and to describe clinical associations with urinary symptoms, urinary tract infection, and treatment outcomes. Catheterized urine samples were collected from multisite randomized trial participants who had no clinical evidence of urinary tract infection; 16S ribosomal RNA gene sequencing was used to dichotomize participants as either DNA sequence-positive or sequence-negative. Associations with demographics, urinary symptoms, urinary tract infection risk, and treatment outcomes were determined. In sequence-positive samples, microbiotas were characterized on the basis of their dominant microorganisms. More than one-half (51.1%; 93/182) of the participants' urine samples were sequence-positive. Sequence-positive participants were younger (55.8 vs 61.3 years old; P = .0007), had a higher body mass index (33.7 vs 30.1 kg/m²; P = .0009), had a higher mean number of baseline daily urgency urinary incontinence episodes (5.7 vs 4.2 episodes; P < .0001), responded better to treatment (decrease in urgency urinary incontinence episodes, -4.4 vs -3.3; P = .0013), and were less likely to experience urinary tract infection (9% vs 27%; P = .0011). In sequence-positive samples, 8 major bacterial clusters were identified; 7 clusters were each dominated by a single genus, most commonly Lactobacillus (45%) or Gardnerella (17%), but also other taxa (25%). The remaining cluster had no dominant genus (13%). DNA sequencing confirmed urinary bacterial DNA in many women with urgency urinary incontinence who had no signs of infection. Sequence status was associated with baseline urgency urinary incontinence episodes, treatment response, and posttreatment urinary tract infection risk. Copyright © 2015 Elsevier Inc. All rights reserved.
Do, Hoang Dang Khoa; Kim, Joo-Hwan
2017-01-01
Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of the rps16 gene in cpDNA, we report for the first time potential molecular markers for recognizing two sections (Veratrum and Fuscoveratrum) of Veratrum. Melanthiaceae exhibits a significant change in the junction between large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3. Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Further observation of this region shows that small conserved sequences exist and flank tandem repeats across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate new, distinct types of repeats. Occasionally, prior to the SSM process, point mutation events and double strand break repair occurred and induced the formation of initial repeat units, which are indispensable in the SSM process. SSM may have occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). 
Collectively, these findings add new evidence of dynamic results from SSM in chloroplast genomes which can be useful for further evolutionary studies in angiosperms. Additionally, genomics events in cpDNA are potential resources for mining molecular markers in Liliales.
Villacreses, Javier; Rojas-Herrera, Marcelo; Sánchez, Carolina; Hewstone, Nicole; Undurraga, Soledad F.; Alzate, Juan F.; Manque, Patricio; Maracaja-Coutinho, Vinicius; Polanco, Victor
2015-01-01
Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence Aristotelia chilensis Virus 1 (AcV1). High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs): ORFs 1 and 2 share 66%–73% amino acid similarity with members of the Caulimoviridae virus family, especially the Petunia vein clearing virus (PVCV), Petuvirus genus. ORF1 encodes a movement protein (MP); ORF2 a Reverse Transcriptase (RT) and a Ribonuclease H (RNase H) domain; and ORF3 showed no amino acid sequence similarity with any other known virus proteins. Analogous to other known endogenous pararetrovirus sequences (EPRVs), AcV1 is integrated in the genome of Maqui Berry and showed low viral transcriptional activity, which was detected by deep sequencing technology (DNA and RNA-seq). Phylogenetic analysis of AcV1 and other pararetroviruses revealed a closer resemblance with Petuvirus. Overall, our data suggest that AcV1 could be a new member of the Caulimoviridae family, genus Petuvirus, and the first evidence of this kind of virus in a fruit plant. PMID:25855242
Parson, W; Gusmão, L; Hares, D R; Irwin, J A; Mayr, W R; Morling, N; Pokorak, E; Prinz, M; Salas, A; Schneider, P M; Parsons, T J
2014-11-01
The DNA Commission of the International Society of Forensic Genetics (ISFG) regularly publishes guidelines and recommendations concerning the application of DNA polymorphisms to the question of human identification. Previous recommendations published in 2000 addressed the analysis and interpretation of mitochondrial DNA (mtDNA) in forensic casework. While the foundations set forth in the earlier recommendations still apply, new approaches to the quality control, alignment and nomenclature of mitochondrial sequences, as well as the establishment of mtDNA reference population databases, have been developed. Here, we describe these developments and discuss their application to both mtDNA casework and mtDNA reference population databasing applications. While the generation of mtDNA for forensic casework has always been guided by specific standards, it is now well-established that data of the same quality are required for the mtDNA reference population data used to assess the statistical weight of the evidence. As a result, we introduce guidelines regarding sequence generation, as well as quality control measures based on the known worldwide mtDNA phylogeny, that can be applied to ensure the highest quality population data possible. For both casework and reference population databasing applications, the alignment and nomenclature of haplotypes is revised here and the phylogenetic alignment proffered as acceptable standard. In addition, the interpretation of heteroplasmy in the forensic context is updated, and the utility of alignment-free database searches for unbiased probability estimates is highlighted. Finally, we discuss statistical issues and define minimal standards for mtDNA database searches. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Feliciello, Isidoro; Parazajder, Josip; Akrap, Ivana; Ugarković, Durđica
2013-05-01
DNA methylation has been studied in many eukaryotic organisms, in particular vertebrates, and was implicated in developmental and phenotypic variations. Little is known about the role of DNA methylation in invertebrates, although insects are considered as excellent models for studying the evolution of DNA methylation. In the red flour beetle, Tribolium castaneum (Tenebrionidae, Coleoptera), no evidence of DNA methylation has been found till now. In this paper, a cytosine methylation in Tribolium castaneum embryos was detected by methylation sensitive restriction endonucleases and immuno-dot blot assay. DNA methylation in embryos is followed by a global demethylation in larvae, pupae and adults. DNA demethylation seems to proceed actively through 5-hydroxymethylcytosine, most probably by the action of TET enzyme. Bisulfite sequencing of a highly abundant satellite DNA located in pericentromeric heterochromatin revealed similar profile of cytosine methylation in adults and embryos. Cytosine methylation was not only restricted to CpG sites but was found at CpA, CpT and CpC sites. In addition, complete cytosine demethylation of heterochromatic satellite DNA was induced by heat stress. The results reveal existence of DNA methylation cycling in T. castaneum ranging from strong overall cytosine methylation in embryos to a weak DNA methylation in other developmental stages. Nevertheless, DNA methylation is preserved within heterochromatin during development, indicating its role in heterochromatin formation and maintenance. It is, however, strongly affected by heat stress, suggesting a role for DNA methylation in heterochromatin structure modulation during heat stress response.
Corella, Alfons; Bert, Francesc; Pérez-Pérez, Alejandro; Gené, Manel; Turbón, Daniel
2007-01-01
Chimane, Moseten, Aymara and Quechua are Amerindian populations living in the Bolivian Piedmont, a characteristic ecoregion between the eastern slope of the Andean mountains and the Amazonian Llanos de Moxos. In both neighbouring areas, dense and complex societies have developed over the centuries. The Piedmont area is especially interesting from a human peopling perspective since there is no clear evidence regarding the genetic influence and peculiarities of these populations. This land has been used extensively as a territory of economic and cultural exchange between the Andes and Amazonia; however, Chimane and Moseten populations have been sufficiently isolated from their neighbour groups to be recognized as distinct populations. Genetic information suggests that evolutionary processes, such as genetic drift, natural selection and genetic admixture, have formed the history of the Piedmont populations. The objective of this study is to characterize the genetic diversity of the Piedmont populations, analysing the sequence variability of the HVR-I control region in the mitochondrial DNA (mtDNA). Haplogroup mtDNA data available from the whole of Central and South America were utilized to determine the relationship of the Piedmont populations with other Amerindian populations. Hair pulls were obtained in situ, and DNA from non-related individuals was extracted using a standard Chelex 100 method. A 401 bp DNA fragment of the HVR-I region was amplified using standard procedures. Two independent 401 and 328 bp DNA fragments were sequenced separately for each sample. The sequence analyses included mismatch distribution and mean pairwise differences, median network analyses, AMOVA and principal component analyses. The genetic diversity of DNA sequences was measured and compared with other South Amerindian populations. 
The genetic diversity of 401 nucleotide mtDNA sequences, in the hypervariable Control Region, from positions 16 000-16 400, was characterized in a sample of 46 Amerindians living in the Piedmont area in the Beni Department of Bolivia. The results obtained indicate that the genetic diversity in the area is higher than that observed in other American groups living in much larger areas and despite the reduced size of the studied area the human groups analysed show high levels of inter-group variability. In addition, results show that Amerindian populations living in the Piedmont are genetically more related to those in the Andean than in the Amazonian populations.
USDA-ARS's Scientific Manuscript database
Maternal obesity (OB) and excessive gestational weight gain (GWG) are strong independent contributors that augment obesity risk in offspring. However, direct evidence of epigenetic changes associated with maternal habitus remains sparse. We utilized Bisulfite Amplicon Sequencing (BSAS) to conduct t...
Acquisition of New DNA Sequences After Infection of Chicken Cells with Avian Myeloblastosis Virus
Shoyab, M.; Baluda, M. A.; Evans, R.
1974-01-01
DNA-RNA hybridization studies between 70S RNA from avian myeloblastosis virus (AMV) and an excess of DNA from (i) AMV-induced leukemic chicken myeloblasts or (ii) a mixture of normal and of congenitally infected K-137 chicken embryos producing avian leukosis viruses revealed the presence of fast- and slow-hybridizing virus-specific DNA sequences. However, the leukemic cells contained twice the level of AMV-specific DNA sequences observed in normal chicken embryonic cells. The fast-reacting sequences were two to three times more numerous in leukemic DNA than in DNA from the mixed embryos. The slow-reacting sequences had a reiteration frequency of approximately 9 and 6, in the two respective systems. Both the fast- and the slow-reacting DNA sequences in leukemic cells exhibited a higher Tm (2 C) than the respective DNA sequences in normal cells. In normal and leukemic cells the slow hybrid sequences appeared to have a Tm which was 2 C higher than that of the fast hybrid sequences. Individual non-virus-producing chicken embryos, either group-specific antigen positive or negative, contained 40 to 100 copies of the fast sequences and 2 to 6 copies of the slowly hybridizing sequences per cell genome. Normal rat cells did not contain DNA that hybridized with AMV RNA, whereas non-virus-producing rat cells transformed by B-77 avian sarcoma virus contained only the slowly reacting sequences. The results demonstrate that leukemic cells transformed by AMV contain new AMV-specific DNA sequences which were not present before infection. PMID:16789139
Absence of Ancient DNA in Sub-Fossil Insect Inclusions Preserved in ‘Anthropocene’ Colombian Copal
Penney, David; Wadsworth, Caroline; Fox, Graeme; Kennedy, Sandra L.; Preziosi, Richard F.; Brown, Terence A.
2013-01-01
Insects preserved in copal, the sub-fossilized resin precursor of amber, have potential value in molecular ecological studies of recently-extinct species and of extant species that have never been collected as living specimens. The objective of the work reported in this paper was therefore to determine if ancient DNA is present in insects preserved in copal. We prepared DNA libraries from two stingless bees (Apidae: Meliponini: Trigonisca ameliae) preserved in ‘Anthropocene’ Colombian copal, dated to ‘post-Bomb’ and 10,612±62 cal yr BP, respectively, and obtained sequence reads using the GS Junior 454 System. Read numbers were low, but were significantly higher for DNA extracts prepared from crushed insects compared with extracts obtained by a non-destructive method. The younger specimen yielded sequence reads up to 535 nucleotides in length, but searches of these sequences against the nucleotide database revealed very few significant matches. None of these hits was to stingless bees though one read of 97 nucleotides aligned with two non-contiguous segments of the mitochondrial cytochrome oxidase subunit I gene of the East Asia bumblebee Bombus hypocrita. The most significant hit was for 452 nucleotides of a 470-nucleotide read that aligned with part of the genome of the root-nodulating bacterium Bradyrhizobium japonicum. The other significant hits were to proteobacteria and an actinomycete. Searches directed specifically at Apidae nucleotide sequences only gave short and insignificant alignments. All of the reads from the older specimen appeared to be artefacts. We were therefore unable to obtain any convincing evidence for the preservation of ancient DNA in either of the two copal inclusions that we studied, and conclude that DNA is not preserved in this type of material. Our results raise further doubts about claims of DNA extraction from fossil insects in amber, many millions of years older than copal. PMID:24039876
Zhang, Yanhong; Pham, Nancy Kim; Zhang, Huixian; Lin, Junda; Lin, Qiang
2014-01-01
The population genetics of seahorses is influenced by their species-specific ecological requirements and life-history traits. In the present study, partial sequences of mitochondrial cytochrome b (cytb) and control region (CR) were obtained from 50 Hippocampus mohnikei and 92 H. trimaculatus from four zoogeographical zones. A total of 780 base pairs of the cytb gene were sequenced to characterize mitochondrial DNA (mtDNA) diversity. The mtDNA marker revealed high haplotype diversity, low nucleotide diversity, and a lack of population structure across both populations of H. mohnikei and H. trimaculatus. A neighbour-joining (NJ) tree of cytb gene sequences showed that H. mohnikei haplotypes formed one cluster. A maximum likelihood (ML) tree of cytb gene sequences showed that H. trimaculatus belonged to one lineage. The star-like pattern of the median-joining network of cytb and CR markers indicated a previous demographic expansion of H. mohnikei and H. trimaculatus. The cytb and CR data sets exhibited a unimodal mismatch distribution, which may have resulted from population expansion. Mismatch analysis suggested that the expansion was initiated about 276,000 years ago for H. mohnikei and about 230,000 years ago for H. trimaculatus during the middle Pleistocene period. This study indicates a possible signature of genetic variation and population expansion in two seahorses under complex marine environments.
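As an illustration of the mismatch analysis mentioned in the abstract above, the following is a minimal sketch of how a mismatch distribution (the histogram of pairwise nucleotide differences) can be tabulated. The function names and the toy sequences are assumptions made for this example and are not taken from the study; real analyses would use aligned haplotype data and dedicated population-genetics software.

```python
from collections import Counter
from itertools import combinations


def pairwise_differences(a: str, b: str) -> int:
    """Count sites at which two aligned sequences of equal length differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def mismatch_distribution(seqs: list[str]) -> dict[int, int]:
    """Map each number of pairwise differences to the number of sequence pairs showing it."""
    counts = Counter(pairwise_differences(a, b) for a, b in combinations(seqs, 2))
    return dict(sorted(counts.items()))


# Hypothetical toy alignment: one central haplotype plus three one-step
# derivatives, i.e. a small star-like pattern.
toy = ["ACGTACGT", "ACGTACGA", "ACGTTCGT", "ACGAACGT"]
distribution = mismatch_distribution(toy)
print(distribution)
```

In this toy case every pair differs at one or two sites, giving a single narrow peak; a unimodal shape of this kind is what the abstract interprets as a possible sign of population expansion.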
Geiss, K T; Abbas, G M; Makaroff, C A
1994-04-01
The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.
Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC
2006-01-01
Background At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, and these differ at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. Cloning, sequencing and comparison of the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France demonstrated nucleotide sequence differences at six loci in the 1,469 nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935
Goldfarb, Ilona Telefus; Adeli, Sharareh; Berk, Tucker; Phillippe, Mark
2018-05-01
While there is evidence for a relationship between cell-free fetal DNA (cffDNA) and parturition, questions remain regarding whether cffDNA could trigger a pro-inflammatory response on the pathway to parturition. We hypothesized that placental and/or fetal DNA stimulates toll-like receptor 9 (TLR9) leading to secretion of pro-inflammatory cytokines by macrophage cells. Four in vitro DNA stimulation studies were performed using RAW 264.7 mouse peritoneal macrophage cells incubated in media containing the following DNA particles: an oligodeoxynucleotide (ODN2395), intact genomic DNA (from mouse placentas, fetuses and adult liver), mouse DNA complexed with DOTAP (a cationic liposome forming compound), and telomere-depleted mouse DNA. Interleukin 6 (IL6) secretion was measured in the media by enzyme-linked immunosorbent assay; and the cell pellet was homogenized for protein content (picograms IL6/mg protein). Robust IL6 secretion was observed in response to ODN2395 (a CpG-rich TLR9 agonist), mouse DNA-DOTAP complexes, and telomere-depleted mouse DNA in concentrations of 5 to 15 μg/mL. In contrast, ODN A151 (containing telomere sequence motifs), intact genomic mouse DNA, and restriction enzyme-digested DNA had no effect on IL6 secretion. The IL6 response was significantly inhibited by chloroquine (10 μg/mL), thereby confirming the important role for TLR9 in the response by macrophage cells. DNA derived from mouse placentas and fetuses, and depleted of telomeric sequences, stimulates a robust pro-inflammatory response by macrophage cells, thereby supporting the hypothesis that cffDNA is able to stimulate an innate immune response that could trigger the onset of parturition. These findings are of clinical importance, as we search for effective treatment/prevention of preterm parturition.
Smith, Oliver; Momber, Garry; Bates, Richard; Garwood, Paul; Fitch, Simon; Pallen, Mark; Gaffney, Vincent; Allaby, Robin G
2015-02-27
The Mesolithic-to-Neolithic transition marked the time when a hunter-gatherer economy gave way to agriculture, coinciding with rising sea levels. Bouldnor Cliff is a submarine archaeological site off the Isle of Wight in the United Kingdom that has a well-preserved Mesolithic paleosol dated to 8000 years before the present. We analyzed a core obtained from sealed sediments, combining evidence from microgeomorphology and microfossils with sedimentary ancient DNA (sedaDNA) analyses to reconstruct floral and faunal changes during the occupation of this site, before it was submerged. In agreement with palynological analyses, the sedaDNA sequences suggest a mixed habitat of oak forest and herbaceous plants. However, they also provide evidence of wheat 2000 years earlier than mainland Britain and 400 years earlier than proximate European sites. These results suggest that sophisticated social networks linked the Neolithic front in southern Europe to the Mesolithic peoples of northern Europe. Copyright © 2015, American Association for the Advancement of Science.
G-quadruplex in animal development: Contribution to gene expression and genomic heterogeneity.
Armas, Pablo; Calcaterra, Nora Beatriz
2018-05-18
During animal development, gene expression is orchestrated by specific and highly evolutionarily conserved mechanisms that take place accurately, both at spatial and temporal levels. The last decades have provided compelling evidence showing that chromatin state plays essential roles in orchestrating most of the stages of development. The DNA molecule can adopt alternative structures different from the helical duplex architecture. G-rich DNA sequences can fold as intrastrand quadruple helix structures called G-quadruplexes or G4-DNA. G4 can also be formed in RNA molecules, such as mRNA, lncRNA and pre-miRNA. Emerging evidence suggests that G4s have crucial roles in a variety of biological processes, including transcription, recombination, replication, translation and chromosome stability. In this review, we have collected recent information gathered by various laboratories showing the important role of G4 DNA and RNA structures in several steps of animal development. Copyright © 2018 Elsevier B.V. All rights reserved.
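To make the notion of a G-rich sequence that may fold into a G4 more concrete, here is a minimal sketch that scans one DNA strand for a widely used heuristic pattern: four runs of at least three guanines separated by loops of 1 to 7 nucleotides. The pattern, the function name and the toy sequence are assumptions chosen for illustration; the review above does not prescribe any particular search rule, and a match only flags a candidate, not a demonstrated structure.

```python
import re

# Heuristic putative-quadruplex pattern (an assumption for this sketch):
# four G-runs of >= 3 guanines, separated by three loops of 1-7 nucleotides.
PUTATIVE_G4 = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")


def find_putative_g4(seq: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for non-overlapping heuristic G4 candidates."""
    return [(m.start(), m.group(0)) for m in PUTATIVE_G4.finditer(seq.upper())]


# Hypothetical toy sequence containing one candidate motif.
toy_sequence = "TTGGGAGGGTGGGAGGGTT"
candidates = find_putative_g4(toy_sequence)
print(candidates)
```

Only the given strand is scanned here; a fuller search would also examine the reverse complement (equivalently, C-rich runs on the same strand).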
McCutchen-Maloney, Sandra L.
2002-01-01
DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dipsticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such as TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding, which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera, which gives a decrease in signal.
Kalsbeek, Anton M F; Chan, Eva F K; Grogan, Judith; Petersen, Desiree C; Jaratlerdsiri, Weerachai; Gupta, Ruta; Lyons, Ruth J; Haynes, Anne-Maree; Horvath, Lisa G; Kench, James G; Stricker, Phillip D; Hayes, Vanessa M
2016-10-05
Prostate cancer management is complicated by extreme disease heterogeneity, which is further limited by availability of prognostic biomarkers. Recognition of prostate cancer as a genetic disease has prompted a focus on the nuclear genome for biomarker discovery, with little attention given to the mitochondrial genome. While it is evident that mitochondrial DNA (mtDNA) mutations are acquired during prostate tumorigenesis, no study has evaluated the prognostic value of mtDNA variation. Here we used next-generation sequencing to interrogate the mitochondrial genomes from prostate tissue biopsies and matched blood of 115 men having undergone a radical prostatectomy for which there was a mean of 107 months clinical follow-up. We identified 74 unique prostate cancer specific somatic mtDNA variants in 50 patients, providing significant expansion to the growing catalog of prostate cancer mtDNA mutations. While no single variant or variant cluster showed recurrence across multiple patients, we observe a significant positive correlation between the total burden of acquired mtDNA variation and elevated Gleason Score at diagnosis and biochemical relapse. We add to accumulating evidence that total acquired genomic burden, rather than specific mtDNA mutations, has diagnostic value. This is the first study to demonstrate the prognostic potential of mtDNA mutational burden in prostate cancer.
Whole genome sequencing distinguishes between relapse and reinfection in recurrent leprosy cases
Bührer-Sékula, Samira; Benjak, Andrej; Loiseau, Chloé; Singh, Pushpendra; Pontes, Maria A. A.; Gonçalves, Heitor S.; Hungria, Emerith M.; Busso, Philippe; Piton, Jérémie; Silveira, Maria I. S.; Cruz, Rossilene; Schetinni, Antônio; Costa, Maurício B.; Virmond, Marcos C. L.; Diorio, Suzana M.; Dias-Baptista, Ida M. F.; Rosa, Patricia S.; Matsuoka, Masanori; Penna, Maria L. F.; Cole, Stewart T.; Penna, Gerson O.
2017-01-01
Background Since leprosy is both treated and controlled by multidrug therapy (MDT) it is important to monitor recurrent cases for drug resistance and to distinguish between relapse and reinfection as a means of assessing therapeutic efficacy. All three objectives can be reached with single nucleotide resolution using next generation sequencing and bioinformatics analysis of Mycobacterium leprae DNA present in human skin. Methodology DNA was isolated by means of optimized extraction and enrichment methods from samples from three recurrent cases in leprosy patients participating in an open-label, randomized, controlled clinical trial of uniform MDT in Brazil (U-MDT/CT-BR). Genome-wide sequencing of M. leprae was performed and the resultant sequence assemblies analyzed in silico. Principal findings In all three cases, no mutations responsible for resistance to rifampicin, dapsone and ofloxacin were found, thus eliminating drug resistance as a possible cause of disease recurrence. However, sequence differences were detected between the strains from the first and second disease episodes in all three patients. In one case, clear evidence was obtained for reinfection with an unrelated strain whereas in the other two cases, relapse appeared more probable. Conclusions/Significance This is the first report of using M. leprae whole genome sequencing to reveal that treated and cured leprosy patients who remain in endemic areas can be reinfected by another strain. Next generation sequencing can be applied reliably to M. leprae DNA extracted from biopsies to discriminate between cases of relapse and reinfection, thereby providing a powerful tool for evaluating different outcomes of therapeutic regimens and for following disease transmission. PMID:28617800
Woodcock, Clayton B; Yakubov, Aziz B; Reich, Norbert O
2017-08-01
Caulobacter crescentus relies on DNA methylation by the cell cycle-regulated methyltransferase (CcrM) in addition to key transcription factors to control the cell cycle and direct cellular differentiation. CcrM is shown here to efficiently methylate its cognate recognition site 5'-GANTC-3' in single-stranded and hemimethylated double-stranded DNA. We report the Km, kcat, kmethylation, and Kd for single-stranded and hemimethylated substrates, revealing discrimination of 10^7-fold for noncognate sequences. The enzyme also shows a similar discrimination against single-stranded RNA. Two independent assays clearly show that CcrM is highly processive with single-stranded and hemimethylated DNA. Collectively, the data provide evidence that CcrM and other DNA-modifying enzymes may use a new mechanism to recognize DNA in a key epigenetic process.
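The recognition site 5'-GANTC-3' named in the abstract above, with N standing for any base, can be located in a sequence with a short pattern search. The sketch below is only an illustration: the function name and the toy sequence are assumptions, and it says nothing about methylation state or enzyme kinetics.

```python
import re

# 5'-GANTC-3' with N = any of A, C, G, T. The lookahead lets the scan
# report every start position, including overlapping ones if they occur.
GANTC = re.compile(r"(?=(GA[ACGT]TC))")


def find_gantc_sites(seq: str) -> list[tuple[int, str]]:
    """Return (0-based start, site) for each GANTC occurrence on the given strand."""
    return [(m.start(), m.group(1)) for m in GANTC.finditer(seq.upper())]


# Hypothetical toy sequence with three sites.
toy_sequence = "TTGAATCCGGACTCAAGATTC"
sites = find_gantc_sites(toy_sequence)
print(sites)
```

Because the reverse complement of GANTC is again GANTC, every site found on one strand marks a site at the same position on the complementary strand.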
Methylation patterns of repetitive DNA sequences in germ cells of Mus musculus.
Sanford, J; Forrester, L; Chapman, V; Chandley, A; Hastie, N
1984-03-26
The major and the minor satellite sequences of Mus musculus were undermethylated in both sperm and oocyte DNAs relative to the amount of undermethylation observed in adult somatic tissue DNA. This hypomethylation was specific for satellite sequences in sperm DNA. Dispersed repetitive and low copy sequences show a high degree of methylation in sperm DNA; however, a dispersed repetitive sequence was undermethylated in oocyte DNA. This finding suggests a difference in the amount of total genomic DNA methylation between sperm and oocyte DNA. The methylation levels of the minor satellite sequences did not change during spermiogenesis, and were not associated with the onset of meiosis or a specific stage in sperm development.
Process of labeling specific chromosomes using recombinant repetitive DNA
Moyzis, R.K.; Meyne, J.
1988-02-12
Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
Jacobson, D R; Gorevic, P D; Buxbaum, J N
1990-01-01
Senile systemic amyloidosis (SSA) is a late-onset disease characterized by deposition of amyloid fibrils containing transthyretin (TTR). Amino acid sequencing of protein isolated from the amyloid fibrils of a patient with SSA identified TTR containing a position-122 isoleucine-for-valine substitution. This change led to the prediction of a genomic G-to-A transition, destroying an MaeIII restriction site. We confirmed the presence of the variant DNA fragment both by Southern blotting and by visualization of MaeIII digests of DNA amplified around codon 122, by using the polymerase chain reaction. The patient's DNA was entirely resistant to MaeIII cleavage; therefore, only the mutant sequence was present. DNA from none of either 24 controls or six other SSA patients contained the variant. Quantitative Southern blotting demonstrated that the patient's DNA contained two copies of the TTR gene per genome; the mutation was therefore homozygous rather than hemizygous. In the present case, the homozygous mutation TTR (122 Val→Ile) is associated with SSA, a finding which is consistent with autosomal recessive inheritance of this condition. PMID:2349941
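The reasoning in the abstract above, that a single G-to-A transition removes a restriction site and so makes the DNA resistant to cleavage, can be sketched in a few lines. This example assumes the recognition sequence GTNAC commonly listed for MaeIII, and the two short fragments are toy sequences written for illustration; they are not presented as the actual TTR sequence or as data from the study.

```python
import re

# Assumed MaeIII recognition sequence: GTNAC (N = any base).
MAEIII_SITE = re.compile(r"(?=GT[ACGT]AC)")


def maeiii_sites(seq: str) -> list[int]:
    """Return 0-based start positions of GTNAC sites on the given strand."""
    return [m.start() for m in MAEIII_SITE.finditer(seq.upper())]


# Hypothetical fragments differing by one G-to-A change at index 6,
# which turns a GTC (valine) codon into ATC (isoleucine).
wild_type = "GCCGTCGTCACCAAT"
variant = "GCCGTCATCACCAAT"

print(maeiii_sites(wild_type))  # one site spanning the changed base
print(maeiii_sites(variant))    # site lost
```

In a digest, the wild-type fragment would be cut while the variant would remain intact, which is the pattern the abstract describes for a sample carrying only the mutant sequence.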
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.
Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook
2014-11-01
As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. 
To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
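The accuracy, F-measure and Matthews correlation coefficient (MCC) quoted in the abstract above are all derived from the counts of a binary confusion matrix. The sketch below shows the standard formulas; the function name is an assumption, and the counts are a hypothetical balanced example chosen only to show how such values relate to one another, not the confusion matrix of the study.

```python
import math


def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    """Compute accuracy, F-measure and MCC from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f_measure = (
        2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    )
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"accuracy": accuracy, "f_measure": f_measure, "mcc": mcc}


# Hypothetical counts: 100 binding and 100 non-binding nucleotides,
# each class predicted correctly 67 times.
metrics = binary_metrics(tp=67, fp=33, tn=67, fn=33)
print(metrics)
```

With these invented counts the accuracy and F-measure are both 0.67 while the MCC is only 0.34, which illustrates why an MCC in the 0.3 to 0.4 range can accompany accuracies in the high 60s on balanced data.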
RAP80, ubiquitin and SUMO in the DNA damage response.
Lombardi, Patrick M; Matunis, Michael J; Wolberger, Cynthia
2017-08-01
A decade has passed since the first reported connection between RAP80 and BRCA1 in DNA double-strand break repair. Despite the initial identification of RAP80 as a factor localizing BRCA1 to DNA double-strand breaks and potentially promoting homologous recombination, there is increasing evidence that RAP80 instead suppresses homologous recombination to fine-tune the balance of competing DNA repair processes during the S/G2 phase of the cell cycle. RAP80 opposes homologous recombination by inhibiting DNA end-resection and sequestering BRCA1 into the BRCA1-A complex. Ubiquitin and SUMO modifications of chromatin at DNA double-strand breaks recruit RAP80, which contains distinct sequence motifs that recognize ubiquitin and SUMO. Here, we review RAP80's role in repressing homologous recombination at DNA double-strand breaks and how this role is facilitated by its ability to bind ubiquitin and SUMO modifications.
Mitochondrial DNA recombination in a free-ranging Australian lizard.
Ujvari, Beata; Dowton, Mark; Madsen, Thomas
2007-04-22
Mitochondrial DNA (mtDNA) is the traditional workhorse for reconstructing evolutionary events. The frequent use of mtDNA in such analyses derives from the apparent simplicity of its inheritance: maternal and lacking bi-parental recombination. However, in hybrid zones, the reproductive barriers are often not completely developed, resulting in the breakdown of male mitochondrial elimination mechanisms, leading to leakage of paternal mitochondria and transient heteroplasmy, resulting in an increased possibility of recombination. Despite the widespread occurrence of heteroplasmy and the presence of the molecular machinery necessary for recombination, we know of no documented example of recombination of mtDNA in any terrestrial wild vertebrate population. By sequencing the entire mitochondrial genome (16,761 bp), we present evidence for mitochondrial recombination in the hybrid zone of two mitochondrial haplotypes in the Australian frillneck lizard (Chlamydosaurus kingii).
Enlightenment of Yeast Mitochondrial Homoplasmy: Diversified Roles of Gene Conversion
Ling, Feng; Mikawa, Tsutomu; Shibata, Takehiko
2011-01-01
Mitochondria have their own genomic DNA. Unlike the nuclear genome, each cell contains hundreds to thousands of copies of mitochondrial DNA (mtDNA). The copies of mtDNA tend to have heterogeneous sequences, due to the high frequency of mutagenesis, but are quickly homogenized within a cell (“homoplasmy”) during vegetative cell growth or through a few sexual generations. Heteroplasmy is strongly associated with mitochondrial diseases, diabetes and aging. Recent studies revealed that the yeast cell has the machinery to homogenize mtDNA, using a common DNA processing pathway with gene conversion; i.e., both genetic events are initiated by a double-stranded break, which is processed into 3′ single-stranded tails. One of the tails is base-paired with the complementary sequence of the recipient double-stranded DNA to form a D-loop (homologous pairing), in which repair DNA synthesis is initiated to restore the sequence lost by the breakage. Gene conversion generates sequence diversity, depending on the divergence between the donor and recipient sequences, especially when it occurs among a number of copies of a DNA sequence family with some sequence variations, such as in immunoglobulin diversification in chicken. MtDNA can be regarded as a sequence family, in which the members tend to be diversified by a high frequency of spontaneous mutagenesis. Thus, it would be interesting to determine why and how double-stranded breakage and D-loop formation induce sequence homogenization in mitochondria and sequence diversification in nuclear DNA. We will review the mechanisms and roles of mtDNA homoplasmy, in contrast to nuclear gene conversion, which diversifies gene and genome sequences, to provide clues toward understanding how the common DNA processing pathway results in such divergent outcomes. PMID:24710143
"First generation" automated DNA sequencing technology.
Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M
2011-10-01
Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
Baumann, G; Geisse, S; Sullivan, M
1991-03-01
The structurally unrelated immunosuppressive drugs cyclosporin A (Sandimmun) and FK-506 both interfere with the process of T-cell proliferation by blocking the transcription of the T-cell growth factor interleukin-2 (IL-2). Here we demonstrate that the transcriptional activation of this gene requires the binding of regulatory nuclear proteins to a promoter element with sequence similarity to the consensus binding site for NF-kappa B-related transcription factors. We present evidence that the binding by regulatory nuclear proteins to the kappa B element of the IL-2 promoter is affected negatively by cyclosporin A and FK-506 at concentrations paralleling their immunosuppressive activity in vivo. The decrease in DNA-protein complex formation induced by the immunosuppressive drugs correlates with a decrease in IL-2 production. FK-506 is 10 to 100 times more potent than cyclosporin A in its ability to inhibit sequence-specific DNA binding and IL-2 production. Our findings suggest that the actions of both drugs converge at the level of DNA-protein interaction.
Tabib, M; Zolgharnein, H; Mohammadi, M; Salari-Aliabadi, M A; Qasemi, A; Roshani, S; Rajabi-Maham, H; Frootan, F
2011-01-01
Genetic diversity of the hawksbill sea turtle was studied by sequencing the mitochondrial DNA (mtDNA) control region (D-loop). Thirty dead embryos were collected from the Kish and Qeshm Islands in the Persian Gulf. Analysis of sequence variation over 890 bp of the mtDNA control region revealed five haplotypes among the 30 individuals. This is the first time that Iranian haplotypes have been recorded. Haplotype and nucleotide diversity were 0.77 and 0.001 for Qeshm Island and 0.64 and 0.002 for Kish Island, respectively. Total haplotype diversity was calculated as 0.69, which demonstrates low genetic diversity in this area. The data also indicated very high rates of migration between the populations of these two islands. A comparison of our data with data from previous studies downloaded from a gene bank showed that turtles of the Persian Gulf migrated from the Pacific and the Sea of Oman into this area. In contrast, no evidence of migration from populations to the west was found.
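For readers unfamiliar with the haplotype diversity statistic quoted in the abstract above, the following is a minimal Python sketch of Nei's estimator, h = n/(n-1) × (1 − Σ p_i²), where p_i is the frequency of haplotype i among n sampled individuals. The abstract does not say which estimator or software the authors used, so this is an assumption; the function name and the haplotype counts are hypothetical and are not taken from the study.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels.

    Illustrative helper (hypothetical name); one label per sampled individual.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled individuals")
    counts = Counter(haplotypes)
    # Sum of squared haplotype frequencies.
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Invented example: 30 individuals carrying five haplotypes.
sample = ["H1"] * 14 + ["H2"] * 8 + ["H3"] * 4 + ["H4"] * 3 + ["H5"] * 1
print(round(haplotype_diversity(sample), 3))
```

The statistic is 0 when every individual shares one haplotype and 1 when every individual carries a different one, which is why values for the two islands can be compared directly.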
Yurkov, Andrey; Guerreiro, Marco A; Sharma, Lav; Carvalho, Cláudia; Fonseca, Álvaro
2015-01-01
Cryptococcus flavescens and C. terrestris are phenotypically indistinguishable sister species that belong to the order Tremellales (Tremellomycetes, Basidiomycota) and which may be mistaken for C. laurentii based on phenotype. Phylogenetic separation between C. flavescens and C. terrestris was based on rDNA sequence analyses, but very little is known on their intraspecific genetic variability or propensity for sexual reproduction. We studied 59 strains from different substrates and geographic locations, and used a multilocus sequencing (MLS) approach complemented with the sequencing of mating type (MAT) genes to assess genetic variation and reexamine the boundaries of the two species, as well as their sexual status. The following five loci were chosen for MLS: the rDNA ITS-LSU region, the rDNA IGS1 spacer, and fragments of the genes encoding the largest subunit of RNA polymerase II (RPB1), the translation elongation factor 1 alpha (TEF1) and the p21-activated protein kinase (STE20). Phylogenetic network analyses confirmed the genetic separation of the two species and revealed two additional cryptic species, for which the names Cryptococcus baii and C. ruineniae are proposed. Further analyses of the data revealed a high degree of genetic heterogeneity within C. flavescens as well as evidence for recombination between lineages detected for this species. Strains of C. terrestris displayed higher levels of similarity in all analysed genes and appear to make up a single recombining group. The two MAT genes (STE3 and SXI1/SXI2) sequenced for C. flavescens strains confirmed the potential for sexual reproduction and suggest the presence of a tetrapolar mating system with a biallelic pheromone/receptor locus and a multiallelic HD locus. In C. terrestris we could only sequence STE3, which revealed a biallelic P/R locus. 
In spite of the strong evidence for sexual recombination in the two species, attempts at mating compatible strains of both species on culture media were unsuccessful. PMID:25811603
Influence of DNA sequence on the structure of minicircles under torsional stress
Wang, Qian; Irobalieva, Rossitza N.; Chiu, Wah; Schmid, Michael F.; Fogg, Jonathan M.; Zechiedrich, Lynn
2017-01-01
The sequence dependence of the conformational distribution of DNA under various levels of torsional stress is an important unsolved problem. Combining theory and coarse-grained simulations shows that the DNA sequence and a structural correlation due to topology constraints of a circle are the main factors that dictate the 3D structure of a 336 bp DNA minicircle under torsional stress. We found that DNA minicircle topoisomers can have multiple bend locations under high torsional stress and that the positions of these sharp bends are determined by the sequence, and by a positive mechanical correlation along the sequence. We showed that simulations and theory are able to provide sequence-specific information about individual DNA minicircles observed by cryo-electron tomography (cryo-ET). We provided a sequence-specific cryo-ET tomogram fitting of DNA minicircles, registering the sequence within the geometric features. Our results indicate that the conformational distribution of minicircles under torsional stress can be designed, which has important implications for using minicircle DNA for gene therapy. PMID:28609782
Bloom, Kristie; Ely, Abdullah; Mussolino, Claudio; Cathomen, Toni; Arbuthnot, Patrick
2013-01-01
Chronic hepatitis B virus (HBV) infection remains an important global health problem. Stability of the episomal covalently closed circular HBV DNA (cccDNA) is largely responsible for the modest curative efficacy of available therapy. Since licensed anti-HBV drugs have a post-transcriptional mechanism of action, disabling cccDNA is potentially of therapeutic benefit. To develop this approach, we engineered mutagenic transcription activator-like effector nucleases (TALENs) that target four HBV-specific sites within the viral genome. TALENs with cognate sequences in the S or C open-reading frames (ORFs) efficiently disrupted sequences at the intended sites and suppressed markers of viral replication. Following triple transfection of cultured HepG2.2.15 cells under mildly hypothermic conditions, the S TALEN caused targeted mutation in ~35% of cccDNA molecules. Markers of viral replication were also inhibited in vivo in a murine hydrodynamic injection model of HBV replication. HBV target sites within S and C ORFs of the injected HBV DNA were mutated without evidence of toxicity. These findings are the first to demonstrate a targeted nuclease-mediated disruption of HBV cccDNA. Efficacy in vivo also indicates that these engineered nucleases have potential for use in treatment of chronic HBV infection. PMID:23883864
Evidence implicating Thamnostylum lucknowense as an etiological agent of Rhino-orbital Mucormycosis
USDA-ARS's Scientific Manuscript database
In this report, we present a case of rhino-orbital mucormycosis in a 57-year-old female with poorly controlled diabetes mellitus. The causal agent was cultured from a specimen of the nasal crust and identified phenotypically and independently using nuclear ribosomal DNA sequence data as Thamnostylum...
Analysis of DNA Sequences by an Optical Time-Integrating Correlator: Proof-of-Concept Experiments.
1992-05-01
Contents (excerpt): DNA Analysis Strategy; 2.1 Representation of DNA Bases; 2.2 DNA Analysis Strategy; 3.0 Custom Generators for DNA Sequences; 3.1 Hardware Design ... of the DNA bases where each base is represented by a 7-bits long pseudorandom sequence. Figure 4: Coarse analysis of a DNA sequence. Figure 5: Fine ... a 20-bases long database. List of Tables: Table 1: Short representations of the DNA bases where each base is represented by 7-bits long
Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W
1998-08-01
The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.
Fife, K H; Boggs, D
1986-01-01
Clinical observations indicate that herpes simplex virus type 1 (HSV-1) is significantly less likely than herpes simplex virus type 2 (HSV-2) to establish latency in (or reactivate from) sacral ganglionic tissue. In an effort to identify viral functions associated with latency, we analyzed HSV-1 isolates from three patients with established recurrent genital herpes and sought evidence of DNA sequences and proteins similar to those found in HSV-2. By restriction endonuclease cleavage patterns and by DNA hybridization analysis using either whole HSV-2 DNA or several cloned segments of HSV-2 DNA as probes, we found that the three HSV-1 isolates from patients with recurrent genital herpes showed no unusual homology to HSV-2 as compared with other HSV-1 isolates. Similarly, the proteins of these isolates could not be distinguished from those of other HSV-1 isolates and were distinct from those of HSV-2. At this level of resolution, there was no evidence to suggest that these recurrent genital HSV-1 isolates were intertypic recombinants, nor did they show any other unusual similarity to HSV-2.
Malysheva, D N; Tokarskaya, Olga N; Petrosyan, Varos G; Danielyan, Felix D; Darevsky, Iliya S; Ryskov, Alexei P
2007-01-01
Microsatellites, or short tandem repeats, are abundant across the genomes of most organisms. The most straightforward and conclusive way of studying mutations in microsatellite-containing loci is to use clonally transmitted genomes or DNA sequences inherited in multigeneration pedigrees. At present, little is known about the origin of genetic variation in species that lack effective genetic recombination. DNA fingerprinting in 43 families of the parthenogenetic lizard species Darevskia armeniaca (131 siblings), using (GACA)4, (GGCA)4, (GATA)4, and (CAC)5 probes, revealed mutant fingerprints in siblings that differed from their mothers in several restriction DNA fragments. In some cases, the mutant fingerprints detected in siblings were also found in population samples. The mutation rate for new restriction fragment lengths, estimated using multilocus probes, varied from 0.8 × 10^-2 to 4.9 × 10^-2 per band per sibling. Most of the variation detected as restriction fragment length polymorphism probably has a germ-line origin, but somatic changes of (CAC)n fingerprints in adult lizards were also observed. These results provide new evidence of unstable regions in the genomes of parthenogenetic vertebrate animals, which provide genetic variation in unisexual populations.
Queen, Rachel A.; Steyn, Jannetta S.; Lord, Phillip
2017-01-01
Mitochondrial DNA (mtDNA) mutations are well recognized as an important cause of inherited disease. Diseases caused by mtDNA mutations exhibit a high degree of clinical heterogeneity with a complex genotype-phenotype relationship, with many such mutations exhibiting incomplete penetrance. There is evidence that the spectrum of mutations causing mitochondrial disease might differ between different mitochondrial lineages (haplogroups) seen in different global populations. This would point to the importance of sequence context in the expression of mutations. To explore this possibility, we looked for mutations which are known to cause disease in humans, in animals of other species unaffected by mtDNA disease. The mt-tRNA genes are the location of many pathogenic mutations, with the m.3243A>G mutation on the mt-tRNA-Leu(UUR) being the most frequently seen mutation in humans. This study looked for the presence of m.3243A>G in 2784 sequences from 33 species, as well as any of the other mutations reported in association with disease located on mt-tRNA-Leu(UUR). We report a number of disease associated variations found on mt-tRNA-Leu(UUR) in other chordates, as the major population variant, with m.3243A>G being seen in 6 species. In these, we also found a number of mutations which appear compensatory and which could prevent the pathogenicity associated with this change in humans. This work has important implications for the discovery and diagnosis of mtDNA mutations in non-European populations. In addition, it might provide a partial explanation for the conflicting results in the literature that examines the role of mtDNA variants in complex traits. PMID:29161289
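The kind of cross-species screen described in the abstract above can be illustrated with a small Python sketch. It assumes each sequence has already been aligned to a reference, maps a 1-based reference position to an alignment column, and lists the sequences that carry the alternative base there. The abstract does not describe the authors' actual pipeline, so the approach, the function names, and the toy alignment are all hypothetical.

```python
def column_for_ref_position(aligned_ref, pos):
    """Return the 0-based alignment column holding 1-based reference position `pos`.

    Gap characters ('-') in the aligned reference do not count as positions.
    """
    seen = 0
    for col, base in enumerate(aligned_ref):
        if base != "-":
            seen += 1
            if seen == pos:
                return col
    raise ValueError("position lies beyond the end of the reference")


def carriers(aligned_ref, aligned_seqs, pos, ref_base, alt_base):
    """Names of aligned sequences that carry `alt_base` at reference position `pos`."""
    col = column_for_ref_position(aligned_ref, pos)
    if aligned_ref[col].upper() != ref_base:
        raise ValueError("reference base does not match the variant description")
    return [name for name, seq in aligned_seqs.items()
            if seq[col].upper() == alt_base]


# Toy alignment (invented): look for an A>G change at reference position 4.
ref = "AC-GATT"
seqs = {
    "species_1": "ACTGGTT",
    "species_2": "AC-GATT",
    "species_3": "ACAGGTA",
}
print(carriers(ref, seqs, 4, "A", "G"))
```

Checking the reference base before scanning guards against off-by-one errors in position numbering, which are easy to introduce when gaps are present in the alignment.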
mtDNA and the Origin of the Icelanders: Deciphering Signals of Recent Population History
Helgason, Agnar; Sigurðardóttir, Sigrún; Gulcher, Jeffrey R.; Ward, Ryk; Stefánsson, Kári
2000-01-01
Previous attempts to investigate the origin of the Icelanders have provided estimates of ancestry ranging from a 98% British Isles contribution to an 86% Scandinavian contribution. We generated mitochondrial sequence data for 401 Icelandic individuals and compared these data with >2,500 other European sequences from published sources, to determine the probable origins of women who contributed to Iceland’s settlement. Although the mean number of base-pair differences is high in the Icelandic sequences and they are widely distributed in the overall European mtDNA phylogeny, we find a smaller number of distinct mitochondrial lineages, compared with most other European populations. The frequencies of a number of mtDNA lineages in the Icelanders deviate noticeably from those in neighboring populations, suggesting that founder effects and genetic drift may have had a considerable influence on the Icelandic gene pool. This is in accordance with available demographic evidence about Icelandic population history. A comparison with published mtDNA lineages from European populations indicates that, whereas most founding females probably originated from Scandinavia and the British Isles, lesser contributions from other populations may also have taken place. We present a highly resolved phylogenetic network for the Icelandic data, identifying a number of previously unreported mtDNA lineage clusters and providing a detailed depiction of the evolutionary relationships between European mtDNA clusters. Our findings indicate that European populations contain a large number of closely related mitochondrial lineages, many of which have not yet been sampled in the current comparative data set. Consequently, substantial increases in sample sizes that use mtDNA data will be needed to obtain valid estimates of the diverse ancestral mixtures that ultimately gave rise to contemporary populations. PMID:10712214
Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting
NASA Astrophysics Data System (ADS)
Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.
1997-05-01
Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate that laser mass spectrometry can possibly be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic diseases can be a very efficient step toward reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.
Barreta, J; Gutiérrez-Gil, B; Iñiguez, V; Saavedra, V; Chiri, R; Latorre, E; Arranz, J J
2013-04-01
The objectives of this work were to assess the mtDNA diversity of Bolivian South American camelid (SAC) populations and to shed light on the evolutionary relationships between the Bolivian camelids and other populations of SACs. We have analysed two different mtDNA regions: the complete coding region of the MT-CYB gene and 513 bp of the D-loop region. The populations sampled included Bolivian llamas, alpacas and vicunas, and Chilean guanacos. High levels of genetic diversity were observed in the studied populations. In general, MT-CYB was more variable than D-loop. On a species level, the vicunas showed the lowest genetic variability, followed by the guanacos, alpacas and llamas. Phylogenetic analyses performed by including additional available mtDNA sequences from the studied species confirmed the existence of the two monophyletic clades previously described by other authors for guanacos (G) and vicunas (V). Significant levels of mtDNA hybridization were found in the domestic species. Our sequence analyses revealed significant sequence divergence within clade G, and some of the Bolivian llamas grouped with the majority of the southern guanacos. This finding supports the existence of more than the one llama domestication centre in South America previously suggested on the basis of archaeozoological evidence. Additionally, analysis of D-loop sequences revealed two new matrilineal lineages that are distinct from the previously reported G and V clades. The results presented here represent the first report on the population structure and genetic variability of Bolivian camelids and may help to elucidate the complex and dynamic domestication process of SAC populations. © 2012 The Authors, Animal Genetics © 2012 Stichting International Foundation for Animal Genetics.
Weinert, T A; Hartwell, L H
1990-12-01
In eucaryotic cells, incompletely replicated or damaged chromosomes induce cell cycle arrest in G2 before mitosis, and in the yeast Saccharomyces cerevisiae the RAD9 gene is essential for the cell cycle arrest (T.A. Weinert and L. H. Hartwell, Science 241:317-322, 1988). In this report, we extend the analysis of RAD9-dependent cell cycle control. We found that both induction of RAD9-dependent arrest in G2 and recovery from arrest could occur in the presence of the protein synthesis inhibitor cycloheximide, showing that the mechanism of RAD9-dependent control involves a posttranslational mechanism(s). We have isolated and determined the DNA sequence of the RAD9 gene, confirming the DNA sequence reported previously (R. H. Schiestl, P. Reynolds, S. Prakash, and L. Prakash, Mol. Cell. Biol. 9:1882-1886, 1989). The predicted protein sequence for the Rad9 protein bears no similarity to sequences of known proteins. We also found that synthesis of the RAD9 transcript in the cell cycle was constitutive and not induced by X-irradiation. We constructed yeast cells containing a complete deletion of the RAD9 gene; the rad9 null mutants were viable, sensitive to X- and UV irradiation, and defective for cell cycle arrest after DNA damage. Although Rad+ and rad9Δ cells had similar growth rates and cell cycle kinetics in unirradiated cells, the spontaneous rate of chromosome loss (in unirradiated cells) was elevated 7- to 21-fold in rad9Δ cells. These studies show that in the presence of induced or endogenous DNA damage, RAD9 is a negative regulator that inhibits progression from G2 in order to preserve cell viability and to maintain the fidelity of chromosome transmission.
Colombo, M M; Swanton, M T; Donini, P; Prescott, D M
1984-01-01
Oxytricha nova is a hypotrichous ciliate with micronuclei and macronuclei. Micronuclei, which contain large, chromosomal-sized DNA, are genetically inert but undergo meiosis and exchange during cell mating. Macronuclei, which contain only small, gene-sized DNA molecules, provide all of the nuclear RNA needed to run the cell. After cell mating the macronucleus is derived from a micronucleus, a derivation that includes excision of the genes from chromosomes and elimination of the remaining DNA. The eliminated DNA includes all of the repetitious sequences and approximately 95% of the unique sequences. We cloned large restriction fragments from the micronucleus that confer replication ability on a replication-deficient plasmid in Saccharomyces cerevisiae. Sequences that confer replication ability are called autonomously replicating sequences. The frequency and effectiveness of autonomously replicating sequences in micronuclear DNA are similar to those reported for DNAs of other organisms introduced into yeast cells. Of the 12 micronuclear fragments with autonomously replicating sequence activity, 9 also showed homology to macronuclear DNA, indicating that they contain a macronuclear gene sequence. We conclude from this that autonomously replicating sequence activity is nonrandomly distributed throughout micronuclear DNA and is preferentially associated with those regions of micronuclear DNA that contain genes. PMID:6092934
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation
Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob
2014-01-01
As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Heritable Epigenomic Changes to the Maize Methylome Resulting from Tissue Culture.
Han, Zhaoxue; Crisp, Peter A; Stelpflug, Scott; Kaeppler, Shawn M; Li, Qing; Springer, Nathan M
2018-05-30
DNA methylation can contribute to the maintenance of genome integrity and regulation of gene expression. In most situations, DNA methylation patterns are inherited quite stably. However, changes in DNA methylation can occur at some loci as a result of tissue culture, resulting in somaclonal variation. To investigate heritable epigenetic changes as a consequence of tissue culture, a sequence-capture bisulfite sequencing approach was implemented to monitor context-specific DNA methylation patterns in ∼15 Mb of the maize genome for a population of plants that had been regenerated from tissue culture. Plants that have been regenerated from tissue culture exhibit gains and losses of DNA methylation at a subset of genomic regions. There was evidence for a high rate of homozygous changes to DNA methylation levels that occur consistently in multiple independent tissue culture lines, suggesting that some loci are either targeted or hotspots for epigenetic variation. The consistent changes inherited following tissue culture include both gains and losses of DNA methylation and can affect CG, CHG or both contexts within a region. Only a subset of the tissue culture changes observed in callus plants are observed in the primary regenerants, but the majority of DNA methylation changes present in primary regenerants are passed on to offspring. This study provides insights into the susceptibility of some loci and potential mechanisms that could contribute to altered DNA methylation and epigenetic state that occur during tissue culture in plant species. Copyright © 2018, Genetics.
El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R
2013-07-01
Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
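The identity and divergence figures in the abstract above are two views of the same quantity: a mean identity of 97.4% corresponds to a divergence of roughly 2.6%. As a minimal Python sketch, pairwise percent identity between two pre-aligned sequences can be computed as below. Skipping columns that contain a gap is an assumption made here for illustration; the abstract does not state how the authors treated gaps, and the function name and toy sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent identity between two aligned sequences of equal length.

    Columns where either sequence has a gap ('-') are ignored (an assumption).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned fragments (invented, not Eimeria data).
identity = percent_identity("ACGTACGTAC", "ACGTACGTTC")
divergence = 100.0 - identity
print(identity, divergence)
```

Comparing the within-species divergence computed this way against between-species values is the reasoning behind the abstract's caution about using 18S rDNA alone for species identification.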
Watanobe, Takuma; Ishiguro, Naotaka; Nakano, Masuo; Takamiya, Hiroto; Matsui, Akira; Hongo, Hitomi
2002-08-01
Ancient DNA from Sus scrofa specimens excavated from archaeological sites on the Okinawa islands was examined to clarify the genetic relationships among prehistoric Sus scrofa, modern wild boars and domestic pigs inhabiting the Ryukyu archipelago, the Japanese islands, and the Asian continent. We extracted residual DNA from 161 bone specimens excavated from 12 archaeological sites on the Okinawa islands and successfully amplified mitochondrial DNA control region fragments from 33 of the 161 specimens. Pairwise differences between prehistoric and modern S. scrofa nucleotide sequences showed that haplotypes of the East Asian domestic pig lineage were found in archaeological specimens together with Ryukyu wild boars native to the Ryukyu archipelago. Phylogenetic analysis of 14 ancient sequences (11 haplotypes; 574 bp) indicated that S. scrofa specimens from two Yayoi-Heian sites (Kitahara and Ara shellmiddens) and two Recent Times sites (Wakuta Kiln and Kiyuna sites) are grouped with modern East Asian domestic pigs. Sus scrofa specimens from Shimizu shellmidden (Yayoi-Heian Period) were very closely related to modern Sus scrofa riukiuanus but had a unique nucleotide insertion, indicating that the population is genetically distinct from the lineage of modern Ryukyu wild boars. This genetic evidence suggests that domestic pigs from the Asian continent were introduced to the Okinawa islands in the early Yayoi-Heian period (1700-2000 BP), or earlier.
Knight, Cameron G; Munday, John S; Rosa, Brielle V; Kiupel, Matti
2011-12-01
A 9-year-old gelding presented with approximately 100 papillomas that covered about 75% of the distal penis. Biopsy was performed, and histology showed evidence of viral cytopathic change and koilocytosis. Polymerase chain reaction using DNA extracted from biopsied tissue amplified equine papillomavirus type 2 (EcPV-2) DNA sequences. Sixteen months later, the horse was re-examined and the appearance of the papillomas was unchanged. Equine papillomavirus type 2 DNA sequences were again amplified from both biopsied tissue and swabs of the penis. Papillomavirus was localized to the lesions by immunohistochemistry and in situ hybridization. An examination 2 years after the initial presentation revealed no detectable change in the appearance of the penis. The large number of papillomas and their failure to regress over an extended period support a clinical classification of papillomatosis. To the authors' knowledge, this is the first report of papillomatosis of the equine penis. This novel clinical manifestation suggests that persistent EcPV-2 infection is possible in horses. As there is evidence that EcPV-2 may promote development of equine penile squamous cell carcinoma, understanding the natural history of EcPV-2 infections may be important in preventing equine penile neoplasia. © 2011 The Authors. Veterinary Dermatology. © 2011 ESVD and ACVD.
Gandini, C. L.; Sanchez-Puerta, M. V.
2017-01-01
Angiosperm mitochondrial genomes (mtDNA) exhibit variable quantities of alien sequences. Many of these sequences are acquired by intracellular gene transfer (IGT) from the plastid. In addition, frequent events of horizontal gene transfer (HGT) between mitochondria of different species also contribute to their expanded genomes. In contrast, alien sequences are rarely found in plastid genomes. Most of the plant-to-plant HGT events involve mitochondrion-to-mitochondrion transfers. Occasionally, foreign sequences in mtDNAs are plastid-derived (MTPT), raising questions about their origin, frequency, and mechanism of transfer. The rising number of complete mtDNAs allowed us to address these questions. We identified 15 new foreign MTPTs, increasing significantly the number of those previously reported. One out of five of the angiosperm species analyzed contained at least one foreign MTPT, suggesting a remarkable frequency of HGT among plants. By analyzing the flanking regions of the foreign MTPTs, we found strong evidence for mt-to-mt transfers in 65% of the cases. We hypothesize that plastid sequences were initially acquired by the native mtDNA via IGT and then transferred to a distantly-related plant via mitochondrial HGT, rather than directly from a foreign plastid to the mitochondrial genome. Finally, we describe three novel putative cases of mitochondrial-derived sequences among angiosperm plastomes. PMID:28262720
Taming the Past: Ancient DNA and the Study of Animal Domestication.
MacHugh, David E; Larson, Greger; Orlando, Ludovic
2017-02-08
During the last decade, ancient DNA research has been revolutionized by the availability of increasingly powerful DNA sequencing and ancillary genomics technologies, giving rise to the new field of paleogenomics. In this review, we show how our understanding of the genetic basis of animal domestication and the origins and dispersal of livestock and companion animals during the Upper Paleolithic and Neolithic periods is being rapidly transformed through new scientific knowledge generated with paleogenomic methods. These techniques have been particularly informative in revealing high-resolution patterns of artificial and natural selection and evidence for significant admixture between early domestic animal populations and their wild congeners.
Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won
2011-09-01
Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in Genbank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At the position of 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence for the existence of three forms of Fasciola isolated from native cattle in Korea by genetic approach using ITS-2 sequence.
Molecular Approach to the Identification of Fish in the South China Sea
Zhang, Junbin; Hanner, Robert
2012-01-01
Background: DNA barcoding is one means of establishing a rapid, accurate, and cost-effective system for the identification of species. It involves the use of short, standard gene targets to create sequence profiles of known species against which sequences of unknowns can be matched and subsequently identified. The Fish Barcode of Life (FISH-BOL) campaign has the primary goal of gathering DNA barcode records for all the world's fish species. As a contribution to FISH-BOL, we examined the degree to which DNA barcoding can discriminate marine fishes from the South China Sea. Methodology/Principal Findings: DNA barcodes of cytochrome oxidase subunit I (COI) were characterized using 1336 specimens that belong to 242 fish species from the South China Sea. All specimen provenance data (including digital specimen images and geospatial coordinates of collection localities) and collateral sequence information were assembled using the Barcode of Life Data System (BOLD; www.barcodinglife.org). Small intraspecific and large interspecific differences create distinct genetic boundaries among most species. In addition, the efficiency of two mitochondrial genes, 16S rRNA (16S) and cytochrome b (cytb), and one nuclear ribosomal gene, 18S rRNA (18S), was also evaluated for a few select groups of species. Conclusions/Significance: The present study provides evidence for the effectiveness of DNA barcoding as a tool for monitoring marine biodiversity. Open access data of fishes from the South China Sea can benefit related applications in ecology and taxonomy. PMID:22363454
Shah, Kushani; Thomas, Shelby; Stein, Arnold
2013-01-01
In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs17822931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.
Kröber, Magdalena; Bekel, Thomas; Diaz, Naryttza N; Goesmann, Alexander; Jaenicke, Sebastian; Krause, Lutz; Miller, Dimitri; Runte, Kai J; Viehöver, Prisca; Pühler, Alfred; Schlüter, Andreas
2009-06-01
The phylogenetic structure of the microbial community residing in a fermentation sample from a production-scale biogas plant fed with maize silage, green rye and liquid manure was analysed by an integrated approach using clone library sequences and metagenome sequence data obtained by 454-pyrosequencing. Sequencing of 109 clones from a bacterial and an archaeal 16S-rDNA amplicon library revealed that the obtained nucleotide sequences are similar but not identical to 16S-rDNA database sequences derived from different anaerobic environments including digestors and bioreactors. Most of the bacterial 16S-rDNA sequences could be assigned to the phylum Firmicutes with the most abundant class Clostridia and to the class Bacteroidetes, whereas most archaeal 16S-rDNA sequences cluster close to the methanogen Methanoculleus bourgensis. Further sequences of the archaeal library most probably represent so far non-characterised species within the genus Methanoculleus. A similar result derived from phylogenetic analysis of mcrA clone sequences. The mcrA gene product encodes the alpha-subunit of methyl-coenzyme-M reductase involved in the final step of methanogenesis. BLASTn analysis applying stringent settings resulted in assignment of 16S-rDNA metagenome sequence reads to 62 16S-rDNA amplicon sequences thus enabling frequency of abundance estimations for 16S-rDNA clone library sequences. Ribosomal Database Project (RDP) Classifier processing of metagenome 16S-rDNA reads revealed abundance of the phyla Firmicutes, Bacteroidetes and Euryarchaeota and the orders Clostridiales, Bacteroidales and Methanomicrobiales. Moreover, a large fraction of 16S-rDNA metagenome reads could not be assigned to lower taxonomic ranks, demonstrating that numerous microorganisms in the analysed fermentation sample of the biogas plant are still unclassified or unknown.
Prado, Blanca R.; Pozo, Carmen; Valdez-Moreno, Martha; Hebert, Paul D. N.
2011-01-01
Background: Recent studies have demonstrated the utility of DNA barcoding in the discovery of overlooked species and in the connection of immature and adult stages. In this study, we use DNA barcoding to examine diversity patterns in 121 species of Nymphalidae from the Yucatan Peninsula in Mexico. Our results suggest the presence of cryptic species in 8 of these 121 taxa. As well, the reference database derived from the analysis of adult specimens allowed the identification of nymphalid caterpillars providing new details on host plant use. Methodology/Principal Findings: We gathered DNA barcode sequences from 857 adult Nymphalidae representing 121 different species. This total includes four species (Adelpha iphiclus, Adelpha malea, Hamadryas iphtime and Taygetis laches) that were initially overlooked because of their close morphological similarity to other species. The barcode results showed that each of the 121 species possessed a diagnostic array of barcode sequences. In addition, there was evidence of cryptic taxa; seven species included two barcode clusters showing more than 2% sequence divergence while one species included three clusters. All 71 nymphalid caterpillars were identified to a species level by their sequence congruence to adult sequences. These caterpillars represented 16 species, and included Hamadryas julitta, an endemic species from the Yucatan Peninsula whose larval stages and host plant (Dalechampia schottii, also endemic to the Yucatan Peninsula) were previously unknown. Conclusions/Significance: This investigation has revealed overlooked species in a well-studied museum collection of nymphalid butterflies and suggests that there is a substantial incidence of cryptic species that await full characterization. The utility of barcoding in the rapid identification of caterpillars also promises to accelerate the assembly of information on life histories, a particularly important advance for hyperdiverse tropical insect assemblages. PMID:22132140
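The 2% divergence criterion mentioned in the abstract above can be illustrated with a small sketch. It assumes pre-aligned barcode sequences of equal length and uses the uncorrected p-distance, which is a simplification chosen here for illustration; the abstract does not state which distance model the authors used, and the function names and specimen labels are hypothetical.

```python
from itertools import combinations


def p_distance(a, b):
    """Uncorrected proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap or ambiguity code are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper())
             if x in "ACGT" and y in "ACGT"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def divergent_pairs(barcodes, threshold=0.02):
    """Return (name_a, name_b, distance) for pairs diverging by more than threshold.

    `barcodes` maps a specimen label (hypothetical) to its aligned sequence.
    """
    flagged = []
    for name_a, name_b in combinations(sorted(barcodes), 2):
        d = p_distance(barcodes[name_a], barcodes[name_b])
        if d > threshold:
            flagged.append((name_a, name_b, d))
    return flagged


if __name__ == "__main__":
    toy = {
        "specimen_x": "A" * 100,
        "specimen_y": "A" * 99 + "T",
        "specimen_z": "A" * 95 + "TTTTT",
    }
    for a, b, d in divergent_pairs(toy):
        print(f"{a} vs {b}: {d:.2%}")
```

Pairs flagged this way would only be candidates for further inspection; clustering specimens within a species into barcode groups requires more than a pairwise threshold.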
Leon, Lydia J; Doyle, Ronan; Diez-Benavente, Ernest; Clark, Taane G; Klein, Nigel; Stanier, Philip; Moore, Gudrun E
2018-05-18
In this study differences in the placental microbiota of term and preterm deliveries from a large UK pregnancy cohort were studied using 16S targeted amplicon sequencing. The impact of contamination from DNA extraction, PCR reagents, as well as those from delivery itself were also examined. A total of 400 placental samples from 256 singleton pregnancies were analysed and differences investigated between spontaneous preterm, non-spontaneous preterm, and term delivered placenta. DNA from recently delivered placenta was extracted, and screening for bacterial DNA was carried out using targeted sequencing of the 16S rRNA gene on the Illumina MiSeq platform. Sequenced reads were analysed for presence of contaminating operational taxonomic units (OTUs) identified via sequencing of negative extraction and PCR blank samples. Differential abundance and between sample (beta) diversity metrics were then compared. A large proportion of the reads sequenced from the extracted placental samples mapped to OTUs that were also found in negative extractions. Striking differences in the composition of samples were also observed, according to whether the placenta was delivered abdominally or vaginally, providing strong circumstantial evidence for delivery contamination as an important contributor to observed microbial profiles. When OTU and genus level abundances were compared between the groups of interest, a number of organisms were enriched in the spontaneous preterm cohort, including organisms that have been previously associated with adverse pregnancy outcomes, specifically Mycoplasma spp. and Ureaplasma spp. However, analyses of overall community structure did not reveal convincing evidence for the existence of a reproducible 'preterm placental microbiome'. IMPORTANCE: Preterm birth is associated with both psychological and physical disabilities and is the leading cause of infant morbidity and mortality worldwide.
Infection is known to be an important cause of spontaneous preterm birth, and recent research has implicated variation in the 'placental microbiome' with preterm birth risk. Consistent with previous studies, the abundance of certain clinically relevant species differed between spontaneous preterm and non-spontaneous preterm or term delivered placenta. These results support the view that a proportion of spontaneous preterm births have an intra-uterine infection component. However, an additional observation from this study was that a substantial proportion of reads sequenced were contaminating reads, rather than DNA from endogenous, clinically relevant species. This observation warrants caution in the interpretation of sequencing output from such low biomass samples as the placenta. Copyright © 2018 Leon et al.
2013-01-01
Background: Mitochondrial DNA (mtDNA) typing can be a useful aid for identifying people from compromised samples when nuclear DNA is too damaged, degraded or below detection thresholds for routine short tandem repeat (STR)-based analysis. Standard mtDNA typing, focused on PCR amplicon sequencing of the control region (HVS I and HVS II), is limited by the resolving power of this short sequence, which misses up to 70% of the variation present in the mtDNA genome. Methods: We used in-solution hybridisation-based DNA capture (using DNA capture probes prepared from modern human mtDNA) to recover mtDNA from post-mortem human remains in which the majority of DNA is both highly fragmented (<100 base pairs in length) and chemically damaged. The method ‘immortalises’ the finite quantities of DNA in valuable extracts as DNA libraries, which is followed by the targeted enrichment of endogenous mtDNA sequences and characterisation by next-generation sequencing (NGS). Results: We sequenced whole mitochondrial genomes for human identification from samples where standard nuclear STR typing produced only partial profiles or demonstrably failed and/or where standard mtDNA hypervariable region sequences lacked resolving power. Multiple rounds of enrichment can substantially improve coverage and sequencing depth of mtDNA genomes from highly degraded samples. The application of this method has led to the reliable mitochondrial sequencing of human skeletal remains from unidentified World War Two (WWII) casualties approximately 70 years old and from archaeological remains (up to 2,500 years old). Conclusions: This approach has potential applications in forensic science, historical human identification cases, archived medical samples, kinship analysis and population studies. In particular the methodology can be applied to any case, involving human or non-human species, where whole mitochondrial genome sequences are required to provide the highest level of maternal lineage discrimination.
Multiple rounds of in-solution hybridisation-based DNA capture can retrieve whole mitochondrial genome sequences from even the most challenging samples. PMID:24289217
Analysis of JC virus DNA replication using a quantitative and high-throughput assay
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shin, Jong; Phelan, Paul J.; Chhum, Panharith
2014-11-15
Progressive Multifocal Leukoencephalopathy (PML) is caused by lytic replication of JC virus (JCV) in specific cells of the central nervous system. Like other polyomaviruses, JCV encodes a large T-antigen helicase needed for replication of the viral DNA. Here, we report the development of a luciferase-based, quantitative and high-throughput assay of JCV DNA replication in C33A cells, which, unlike the glial cell lines Hs 683 and U87, accumulate high levels of nuclear T-ag needed for robust replication. Using this assay, we investigated the requirement for different domains of T-ag, and for specific sequences within and flanking the viral origin, in JCV DNA replication. Beyond providing validation of the assay, these studies revealed an important stimulatory role of the transcription factor NF1 in JCV DNA replication. Finally, we show that the assay can be used for inhibitor testing, highlighting its value for the identification of antiviral drugs targeting JCV DNA replication. - Highlights: • Development of a high-throughput screening assay for JCV DNA replication using C33A cells. • Evidence that T-ag fails to accumulate in the nuclei of established glioma cell lines. • Evidence that NF-1 directly promotes JCV DNA replication in C33A cells. • Proof-of-concept that the HTS assay can be used to identify pharmacological inhibitors of JCV DNA replication.
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.
Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab
2012-01-01
RDNAnalyzer is an innovative computer-based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate a DNA sequence, or users can upload sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various applications in sequence analysis. It predicts DNA secondary structure and base pairings. It also provides tools for sequence analyses routinely performed by biological scientists, such as DNA replication, reverse complement generation, transcription, translation, sequence-specific information (total number of nucleotide bases and A, T, G, C base contents with their respective percentages) and a sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides a user-friendly environment for sequence analysis. It is freely available at http://www.cemb.edu.pk/sw.html. Abbreviations: RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.
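The kinds of operations listed in the abstract above can be sketched in a few lines. This is not RDNAnalyzer's code (the tool is written in C# and extends the algorithm in ways the abstract does not describe); it is a minimal Python illustration of the textbook Nussinov recurrence restricted to Watson-Crick A-T and G-C pairs, plus reverse complement and base composition. The function names and the `min_loop` parameter are assumptions made for this example.

```python
from collections import Counter

# Watson-Crick partners for DNA (assumption: no non-canonical pairs).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def base_composition(seq):
    """Map each base to (count, percentage of sequence length)."""
    seq = seq.upper()
    counts = Counter(seq)
    total = len(seq)
    return {b: (counts[b], 100.0 * counts[b] / total if total else 0.0)
            for b in "ATGC"}


def nussinov_max_pairs(seq, min_loop=3):
    """Maximum number of nested base pairs (basic Nussinov recurrence).

    dp[i][j] holds the best pair count for seq[i..j]; at least `min_loop`
    unpaired bases must separate the two bases of a pair.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        return 0
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            for k in range(i, j - min_loop):  # case 2: base k pairs with j
                if COMPLEMENT.get(seq[k]) == seq[j]:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


if __name__ == "__main__":
    s = "GGGAAATCC"
    print(reverse_complement(s))
    print(base_composition(s))
    print(nussinov_max_pairs(s))
```

The recurrence only counts pairs; recovering the actual pairing would need a traceback over the same table, which is omitted here to keep the sketch short.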
Direct Detection and Sequencing of Damaged DNA Bases
2011-01-01
Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications. PMID:22185597
Direct detection and sequencing of damaged DNA bases.
Clark, Tyson A; Spittle, Kristi E; Turner, Stephen W; Korlach, Jonas
2011-12-20
Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications.
A comprehensive list of cloned human DNA sequences
Schmidtke, Jörg; Cooper, David N.
1987-01-01
A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3575113
A comprehensive list of cloned human DNA sequences
Schmidtke, Jörg; Cooper, David N.
1990-01-01
A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2333227
A comprehensive list of cloned human DNA sequences
Schmidtke, Jörg; Cooper, David N.
1988-01-01
A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3368330
A comprehensive list of cloned human DNA sequences
Schmidtke, Jörg; Cooper, David N.
1989-01-01
A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2654889
Kilo-sequencing: an ordered strategy for rapid DNA sequence data acquisition.
Barnes, W M; Bevan, M
1983-01-01
A strategy for rapid DNA sequence acquisition in an ordered, nonrandom manner, while retaining all of the conveniences of the dideoxy method with M13 transducing phage DNA template, is described. Target DNA 3 to 14 kb in size can be stably carried by our M13 vectors. Suitable targets are stretches of DNA which lack an enzyme recognition site which is unique on our cloning vectors and adjacent to the sequencing primer; current sites that are so useful when lacking are Pst, Xba, HindIII, BglII, EcoRI. By an in vitro procedure, we cut RF DNA once randomly and once specifically, to create thousands of deletions which start at the unique restriction site adjacent to the dideoxy sequencing primer and extend various distances across the target DNA. Phage carrying a desired size of deletions, whose DNA as template will give rise to DNA sequence data in a desired location along the target DNA, may be purified by electrophoresis alive on agarose gels. Phage running in the same location on the agarose gel thus conveniently give rise to nucleotide sequence data from the same kilobase of target DNA. PMID:6298723
Williams, R R; Hassan-Walker, A F; Lavender, F L; Morgan, M; Faik, P; Ragoussis, J
2001-05-16
Minisatellites are tandemly repeated DNA sequences found throughout the genomes of all eukaryotes. They are regions often prone to instability and hence hypervariability; thus repeat unit sequence is generally not conserved beyond closely related species. We have studied the minisatellite located in intron 9 of the human glucose phosphate isomerase (GPI) gene (also known as neuroleukin, autocrine motility factor, maturation and differentiation factor) and have found, by Zoo blotting coupled with PCR amplification and DNA sequencing, that similar repeat units are present in seven other species of mammal. There is also evidence for the presence of the minisatellite in chicken. The repeat unit does not appear to be present at any other locus in these genomes. Minisatellite DNA has been reported to be involved in recombination activity, control of gene expression of nearby gene(s) (both transcriptional and translational), whilst others form protein coding regions. The high level of conservation exhibited by the GPI minisatellite, coupled with the unique location, strongly suggests a functional role. Our results from transient and stable transfections using luciferase reporter constructs have shown that the GPI minisatellite region can act to increase transcription from the SV40 promoter, CMV promoter and the human GPI promoter.
Evidence of protein-free homology recognition in magnetic bead force-extension experiments
NASA Astrophysics Data System (ADS)
O'Lee, D. J.; Danilowicz, C.; Rochester, C.; Kornyshev, A. A.; Prentiss, M.
2016-07-01
Earlier theoretical studies have proposed that the homology-dependent pairing of large tracts of dsDNA may be due to physical interactions between homologous regions. Such interactions could contribute to the sequence-dependent pairing of chromosome regions that may occur in the presence or the absence of double-strand breaks. Several experiments have indicated the recognition of homologous sequences in pure electrolytic solutions without proteins. Here, we report single-molecule force experiments with a designed 60 kb long dsDNA construct; one end attached to a solid surface and the other end to a magnetic bead. The 60 kb constructs contain two 10 kb long homologous tracts oriented head to head, so that their sequences match if the two tracts fold on each other. The distance between the bead and the surface is measured as a function of the force applied to the bead. At low forces, the construct molecules extend substantially less than normal, control dsDNA, indicating the existence of preferential interaction between the homologous regions. The force increase causes no abrupt but continuous unfolding of the paired homologous regions. Simple semi-phenomenological models of the unfolding mechanics are proposed, and their predictions are compared with the data.
Silicene nanoribbon as a new DNA sequencing device
NASA Astrophysics Data System (ADS)
Alesheikh, Sara; Shahtahmassebi, Nasser; Roknabadi, Mahmood Rezaee; Pilevar Shahri, Raheleh
2018-02-01
The importance of DNA sequencing in many fields drives the search for fast and inexpensive methods. Nanotechnology supports this development by providing nanostructures that can be used for DNA sequencing. In this work, we study the interaction between a zigzag silicene nanoribbon and DNA nucleobases using DFT and the non-equilibrium Green's function approach, to investigate the possibility of using zigzag silicene nanoribbons as biosensors for DNA sequencing.
Isolation and characterization of target sequences of the chicken CdxA homeobox gene.
Margalit, Y; Yarus, S; Shapira, E; Gruenbaum, Y; Fainsod, A
1993-01-01
The DNA binding specificity of the chicken homeodomain protein CDXA was studied. Using a CDXA-glutathione-S-transferase fusion protein, DNA fragments containing the binding site for this protein were isolated. The sources of DNA were oligonucleotides with random sequence and chicken genomic DNA. The DNA fragments isolated were sequenced and tested in DNA binding assays. Sequencing revealed that most DNA fragments are AT rich which is a common feature of homeodomain binding sites. By electrophoretic mobility shift assays it was shown that the different target sequences isolated bind to the CDXA protein with different affinities. The specific sequences bound by the CDXA protein in the genomic fragments isolated, were determined by DNase I footprinting. From the footprinted sequences, the CDXA consensus binding site was determined. The CDXA protein binds the consensus sequence A, A/T, T, A/T, A, T, A/G. The CAUDAL binding site in the ftz promoter is also included in this consensus sequence. When tested, some of the genomic target sequences were capable of enhancing the transcriptional activity of reporter plasmids when introduced into CDXA expressing cells. This study determined the DNA sequence specificity of the CDXA protein and it also shows that this protein can further activate transcription in cells in culture. PMID:7909943
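The consensus reported in the abstract above (A, A/T, T, A/T, A, T, A/G) can be written as a simple pattern for scanning a sequence. The sketch below is only an illustration of exact consensus matching on one strand; it says nothing about binding affinity, and the names `CONSENSUS` and `find_consensus_sites` are invented for this example.

```python
import re

# One entry per consensus position; each string lists the allowed bases.
CONSENSUS = ["A", "AT", "T", "AT", "A", "T", "AG"]

# A lookahead is used so that overlapping matches are also reported.
PATTERN = re.compile("(?=(" + "".join(f"[{p}]" for p in CONSENSUS) + "))")


def find_consensus_sites(seq):
    """Return (0-based start, matched 7-mer) for every consensus match."""
    return [(m.start(), m.group(1)) for m in PATTERN.finditer(seq.upper())]


if __name__ == "__main__":
    # Toy sequence, not taken from the paper.
    for start, site in find_consensus_sites("GGATTTATGCC"):
        print(start, site)
```

Scanning the opposite strand would require running the same search on the reverse complement of the input.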
Sequence periodicity in nucleosomal DNA and intrinsic curvature.
Nair, T Murlidharan
2010-05-17
Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy the more rigid the DNA helix, thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understand their sequence-dependent structures. Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequence from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. The higher order structures associated with nucleosomes of C. elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results point to the possibility of context dependent curvature of varying degrees to be associated with nucleosomal DNA.
Ocan, Moses; Bwanga, Freddie; Okeng, Alfred; Katabazi, Fred; Kigozi, Edgar; Kyobe, Samuel; Ogwal-Okeng, Jasper; Obua, Celestino
2016-08-19
In the absence of an effective vaccine, malaria treatment and eradication is still a challenge in most endemic areas globally. This is especially the case with the currently reported emergence of resistance to artemisinin agents in Southeast Asia. This study therefore explored the prevalence of K13-propeller gene polymorphisms among Plasmodium falciparum parasites in northern Uganda. Adult patients (≥18 years) presenting to the out-patient departments of Lira and Gulu regional referral hospitals in northern Uganda were randomly recruited. Laboratory investigation for the presence of Plasmodium infection among patients was done using a Plasmodium falciparum exclusive rapid diagnostic test, histidine rich protein-2 (HRP2) (Pf). Finger prick capillary blood from patients with a positive malaria test was spotted on Whatman no. 903 filter paper. The parasite DNA was extracted using the chelex resin method and sequenced for mutations in the K13-propeller gene using Sanger sequencing. PCR DNA sequence products were analyzed using DnaSP 5.10.01 software, and data were further processed in an Excel 2007 spreadsheet. A total of 60 parasite DNA samples were sequenced. Polymorphisms in the K13-propeller gene were detected in four (4) of the 60 parasite DNA samples sequenced. A non-synonymous polymorphism at codon 533 previously detected in Cambodia was found in the parasite DNA samples analyzed. Polymorphisms at codon 522 (non-synonymous) and codon 509 (synonymous) were also found in the samples analyzed. The study found evidence of positive selection in the Plasmodium falciparum population in northern Uganda (Tajima's D = -1.83205; Fu and Li's D = -1.82458). A polymorphism in the K13-propeller gene previously reported in Cambodia has been found in the Ugandan Plasmodium falciparum parasites. There is a need for continuous surveillance for artemisinin resistance gene markers in the country.
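The Tajima's D statistic quoted in the abstract above compares two estimates of diversity: the mean number of pairwise differences and the number of segregating sites scaled by a sample-size constant. The sketch below follows the standard published formula (Tajima 1989) and is not the DnaSP implementation; it assumes a gap-free alignment of at least four sequences, and the function names are chosen for this example. Negative values indicate an excess of rare variants, which can reflect selection but also demographic effects such as population expansion.

```python
from itertools import combinations
from math import sqrt


def segregating_sites(seqs):
    """Number of alignment columns with more than one distinct base."""
    return sum(1 for column in zip(*seqs) if len(set(column)) > 1)


def mean_pairwise_differences(seqs):
    """Average number of differing sites over all pairs of sequences."""
    pairs = list(combinations(seqs, 2))
    total = sum(sum(x != y for x, y in zip(a, b)) for a, b in pairs)
    return total / len(pairs)


def tajimas_d(seqs):
    """Tajima's D for a gap-free alignment (assumption: n >= 4, S > 0)."""
    n = len(seqs)
    if n < 4:
        raise ValueError("need at least four sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must have equal length")
    s = segregating_sites(seqs)
    if s == 0:
        raise ValueError("no segregating sites; D is undefined")
    pi = mean_pairwise_differences(seqs)

    # Sample-size constants from the standard formula.
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)

    # Difference of the two diversity estimates over its standard deviation.
    return (pi - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


if __name__ == "__main__":
    # Toy alignment with only singleton variants (not data from the study).
    toy = ["AAAA", "TAAA", "ATAA", "AATA"]
    print(tajimas_d(toy))
```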
Bishop, R.P.; Hemmink, J.D.; Morrison, W.I.; Weir, W.; Toye, P.G.; Sitt, T.; Spooner, P.R.; Musoke, A.J.; Skilton, R.A.; Odongo, D.O.
2015-01-01
African Cape buffalo (Syncerus caffer) is the wildlife reservoir of multiple species within the apicomplexan protozoan genus Theileria, including Theileria parva, which causes East Coast fever in cattle. A parasite that has not yet been formally named, known as Theileria sp. (buffalo), has been recognized since 1993 as a potentially distinct species based on rDNA sequence. We demonstrate, using reverse line blot (RLB) and sequencing of 18S rDNA genes, that in an area where buffalo and cattle co-graze and there is a heavy tick challenge, T. sp. (buffalo) can frequently be isolated in culture from cattle leukocytes. We also show that T. sp. (buffalo), which is genetically very closely related to T. parva according to 18S rDNA sequence, has a conserved orthologue of the polymorphic immunodominant molecule (PIM) that forms the basis of the diagnostic ELISA used for T. parva serological detection. Closely related orthologues of several CD8 T cell target antigen genes are also shared with T. parva. By contrast, orthologues of the T. parva p104 and p67 sporozoite surface antigens could not be amplified by PCR from T. sp. (buffalo) using conserved primers designed from the corresponding T. parva sequences. Collectively, the data re-emphasise doubts regarding the value of rDNA sequence data alone for defining apicomplexan species in the absence of additional data. ‘Deep 454 pyrosequencing’ of DNA from two Theileria sporozoite stabilates prepared from Rhipicephalus appendiculatus ticks fed on buffalo failed to detect T. sp. (buffalo). This strongly suggests that R. appendiculatus may not be a vector for T. sp. (buffalo). Collectively, the data provide further evidence that T. sp. (buffalo) is a distinct species from T. parva. PMID:26543804
Assessing the Fidelity of Ancient DNA Sequences Amplified From Nuclear Genes
Binladen, Jonas; Wiuf, Carsten; Gilbert, M. Thomas P.; Bunce, Michael; Barnett, Ross; Larson, Greger; Greenwood, Alex D.; Haile, James; Ho, Simon Y. W.; Hansen, Anders J.; Willerslev, Eske
2006-01-01
To date, the field of ancient DNA has relied almost exclusively on mitochondrial DNA (mtDNA) sequences. However, a number of recent studies have reported the successful recovery of ancient nuclear DNA (nuDNA) sequences, thereby allowing the characterization of genetic loci directly involved in phenotypic traits of extinct taxa. It is well documented that postmortem damage in ancient mtDNA can lead to the generation of artifactual sequences. However, as yet no one has thoroughly investigated the damage spectrum in ancient nuDNA. By comparing clone sequences from 23 fossil specimens, recovered from environments ranging from permafrost to desert, we demonstrate the presence of miscoding lesion damage in both the mtDNA and nuDNA, resulting in insertion of erroneous bases during amplification. Interestingly, no significant differences in the frequency of miscoding lesion damage are recorded between mtDNA and nuDNA despite great differences in cellular copy numbers. For both mtDNA and nuDNA, we find significant positive correlations between total sequence heterogeneity and the rates of type 1 transitions (adenine → guanine and thymine → cytosine) and type 2 transitions (cytosine → thymine and guanine → adenine), respectively. Type 2 transitions are by far the most dominant and increase relative to those of type 1 with damage load. The results suggest that the deamination of cytosine (and 5-methyl cytosine) to uracil (and thymine) is the main cause of miscoding lesions in both ancient mtDNA and nuDNA sequences. We argue that the problems presented by postmortem damage, as well as problems with contamination from exogenous sources of conserved nuclear genes, allelic variation, and the reliance on single nucleotide polymorphisms, call for great caution in studies relying on ancient nuDNA sequences. PMID:16299392
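The type 1 / type 2 transition classes defined in this abstract can be tallied with a short sketch. It assumes clone sequences already aligned to a reference of the same length; the function names and the toy data are hypothetical and do not reproduce the authors' analysis pipeline.

```python
from collections import Counter

# Transition classes as defined in the abstract (reference base, clone base).
TYPE1 = {("A", "G"), ("T", "C")}
TYPE2 = {("C", "T"), ("G", "A")}


def classify(ref_base, clone_base):
    """Label a single reference/clone base pair."""
    if ref_base == clone_base:
        return "match"
    pair = (ref_base, clone_base)
    if pair in TYPE1:
        return "type1"
    if pair in TYPE2:
        return "type2"
    return "other"  # transversions and anything else


def damage_spectrum(reference, clones):
    """Count mismatch classes across clones aligned to the reference."""
    counts = Counter()
    for clone in clones:
        for ref_base, clone_base in zip(reference, clone):
            label = classify(ref_base, clone_base)
            if label != "match":
                counts[label] += 1
    return counts


# Toy data (hypothetical): one A->G, one C->T, one G->A, one T->A.
spectrum = damage_spectrum("ACGT", ["GCGT", "ATGT", "ACAT", "ACGA"])
```

A predominance of the `type2` count over `type1` would be the pattern the abstract attributes to cytosine deamination.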
[Current applications of high-throughput DNA sequencing technology in antibody drug research].
Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong
2012-03-01
Since the publication in 2005 of a high-throughput DNA sequencing technology based on PCR carried out in oil emulsions, high-throughput DNA sequencing platforms have evolved into a robust technology for sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation for discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we review current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach to the discovery and development of antibody drugs.
DNA fingerprinting, DNA barcoding, and next generation sequencing technology in plants.
Sucher, Nikolaus J; Hennell, James R; Carles, Maria C
2012-01-01
DNA fingerprinting of plants has become an invaluable tool in forensic, scientific, and industrial laboratories all over the world. PCR has become part of virtually every variation of the plethora of approaches used for DNA fingerprinting today. DNA sequencing is increasingly used either in combination with or as a replacement for traditional DNA fingerprinting techniques. A prime example is the use of short, standardized regions of the genome as taxon barcodes for biological identification of plants. Rapid advances in "next generation sequencing" (NGS) technology are driving down the cost of sequencing and bringing large-scale sequencing projects into the reach of individual investigators. We present an overview of recent publications that demonstrate the use of "NGS" technology for DNA fingerprinting and DNA barcoding applications.
Mammalian DNA enriched for replication origins is enriched for snap-back sequences.
Zannis-Hadjopoulos, M; Kaufmann, G; Martin, R G
1984-11-15
Using the instability of replication loops as a method for the isolation of double-stranded nascent DNA, extruded DNA enriched for replication origins was obtained and denatured. Snap-back DNA, single-stranded DNA with inverted repeats (palindromic sequences), reassociates rapidly into stem-loop structures with zero-order kinetics when conditions are changed from denaturing to renaturing, and can be assayed by chromatography on hydroxyapatite. Origin-enriched nascent DNA strands from mouse, rat and monkey cells growing either synchronously or asynchronously were purified and assayed for the presence of snap-back sequences. The results show that origin-enriched DNA is also enriched for snap-back sequences, implying that some origins for mammalian DNA replication contain or lie near palindromic sequences.
Trans-Homolog Interactions Facilitating Paramutation in Maize
2015-01-01
Paramutations represent locus-specific trans-homolog interactions affecting the heritable silencing properties of endogenous alleles. Although examples of paramutation are well studied in maize (Zea mays), the responsible mechanisms remain unclear. Genetic analyses indicate roles for plant-specific DNA-dependent RNA polymerases that generate small RNAs, and current working models hypothesize that these small RNAs direct heritable changes at sequences often acting as transcriptional enhancers. Several studies have defined specific sequences that mediate paramutation behaviors, and recent results identify a diversity of DNA-dependent RNA polymerase complexes operating in maize. Other reports ascribe broader roles for some of these complexes in normal genome function. This review highlights recent research to understand the molecular mechanisms of paramutation and examines evidence relevant to small RNA-based modes of transgenerational epigenetic inheritance. PMID:26149572
Baglivo, Ilaria; Esposito, Sabrina; De Cesare, Lucia; Sparago, Angela; Anvar, Zahra; Riso, Vincenzo; Cammisa, Marco; Fattorusso, Roberto; Grimaldi, Giovanna; Riccio, Andrea; Pedone, Paolo V.
2013-01-01
In the mouse, ZFP57 contains three classical Cys2His2 zinc finger domains (ZF) and recognizes the methylated TGCmetCGC target sequence using the first and the second ZFs. In this study, we demonstrate that the human ZFP57 (hZFP57), which contains six Cys2His2 ZFs, binds the same methylated sequence through the third and the fourth ZFs, and we identify the amino acids critical for DNA interaction. In addition, we present evidence indicating that hZFP57 mutations and hypomethylation of the TNDM1 ICR, both associated with Transient Neonatal Diabetes Mellitus type 1, result in loss of hZFP57 binding to the TNDM1 locus, likely causing PLAGL1 activation. PMID:23499433
Kshirsagar, Rucha; Khan, Krishnendu; Joshi, Mamata V; Hosur, Ramakrishna V; Muniyappa, K
2017-05-23
A plethora of evidence suggests that different types of DNA quadruplexes are widely present in the genome of all organisms. The existence of a growing number of proteins that selectively bind and/or process these structures underscores their biological relevance. Moreover, G-quadruplex DNA has been implicated in the alignment of four sister chromatids by forming parallel guanine quadruplexes during meiosis; however, the underlying mechanism is not well defined. Here we show that a G/C-rich motif associated with a meiosis-specific DNA double-strand break (DSB) in Saccharomyces cerevisiae folds into G-quadruplex, and the C-rich sequence complementary to the G-rich sequence forms an i-motif. The presence of G-quadruplex or i-motif structures upstream of the green fluorescent protein-coding sequence markedly reduces the levels of gfp mRNA expression in S. cerevisiae cells, with a concomitant decrease in green fluorescent protein abundance, and blocks primer extension by DNA polymerase, thereby demonstrating the functional significance of these structures. Surprisingly, although S. cerevisiae Hop1, a component of synaptonemal complex axial/lateral elements, exhibits strong affinity to G-quadruplex DNA, it displays a much weaker affinity for the i-motif structure. However, the Hop1 C-terminal but not the N-terminal domain possesses strong i-motif binding activity, implying that the C-terminal domain has a distinct substrate specificity. Additionally, we found that Hop1 promotes intermolecular pairing between G/C-rich DNA segments associated with a meiosis-specific DSB site. Our results support the idea that the G/C-rich motifs associated with meiosis-specific DSBs fold into intramolecular G-quadruplex and i-motif structures, both in vitro and in vivo, thus revealing an important link between non-B form DNA structures and Hop1 in meiotic chromosome synapsis and recombination. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Implications of the dependence of the elastic properties of DNA on nucleotide sequence.
Olson, Wilma K; Swigon, David; Coleman, Bernard D
2004-07-15
Recent advances in structural biochemistry have provided evidence that not only the geometric properties but also the elastic moduli of duplex DNA are strongly dependent on nucleotide sequence in a way that is not accounted for by classical rod models of the Kirchhoff type. A theory of sequence-dependent DNA elasticity is employed here to calculate the dependence of the equilibrium configurations of circular DNA on the binding of ligands that can induce changes in intrinsic twist at a single base-pair step. Calculations are presented of the influence on configurations of the assumed values and distribution along the DNA of intrinsic roll and twist and a modulus coupling roll to twist. Among the results obtained are the following. For minicircles formed from intrinsically straight DNA, the distribution of roll-twist coupling strongly affects the dependence of the total elastic energy Psi on the amount alpha of imposed untwisting, and that dependence can be far from quadratic. (In fact, for a periodic distribution of roll-twist coupling with a period equal to the intrinsic helical repeat length, Psi can be essentially independent of alpha for -90 degrees < alpha < 90 degrees.) When the minicircle is homogeneous and without roll-twist coupling, but with uniform positive intrinsic roll, the point at which Psi attains its minimum value shifts towards negative values of alpha. It is remarked that there are cases in which one can relate graphs of Psi versus alpha to the 'effective values' of bending and twisting moduli and helical repeat length obtained from measurements of equilibrium distributions of topoisomers and probabilities of ring closure. For a minicircle formed from DNA that has an 'S' shape when stress-free, the graphs of Psi versus alpha have maxima at alpha = 0. As the binding of a twisting agent to such a minicircle results in a net decrease in Psi, the affinity of the twisting agent for binding to the minicircle is greater than its affinity for binding to unconstrained DNA with the same sequence.
Gene Unprediction with Spurio: A tool to identify spurious protein sequences.
Höps, Wolfram; Jeffryes, Matt; Bateman, Alex
2018-01-01
We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental evidence supporting their translation. Despite the increasing accuracy of gene prediction tools, there are likely a large number of spurious protein predictions in the sequence databases. We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes. Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code are available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.
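The feature this abstract calls most informative, stop codons inside the presumed translation of a homologous DNA sequence, can be illustrated with a small sketch. This is not Spurio's code: the function name is hypothetical, the standard stop codons TAA, TAG and TGA are assumed, and the tblastn search and Gaussian process scoring are not reproduced.

```python
# Standard stop codons (assumed; also used by the bacterial genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def internal_stop_count(dna, frame=0):
    """Count stop codons in the given reading frame, ignoring the final codon.

    `dna` is the nucleotide sequence of a homologous region; `frame` is the
    0-based offset (0, 1 or 2) at which codon reading starts.
    """
    dna = dna.upper()
    # Split into complete codons only; a trailing partial codon is dropped.
    codons = [dna[i:i + 3] for i in range(frame, len(dna) - 2, 3)]
    # The last codon may be a legitimate terminator, so it is excluded.
    return sum(1 for codon in codons[:-1] if codon in STOP_CODONS)


# Toy sequence (hypothetical): ATG TAA GGG TGA has one internal stop (TAA).
n_stops = internal_stop_count("ATGTAAGGGTGA")
```

A homolog whose aligned region accumulates many internal stops would be unlikely to encode a real protein, which is the intuition behind using this count as a feature.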
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio; ...
2016-03-09
The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequences in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.
Molecular Evidence for a Natural Primary Triple Hybrid in Plants Revealed from Direct Sequencing
Kaplan, Zdenek; Fehrer, Judith
2007-01-01
Background and Aims Molecular evidence for natural primary hybrids composed of three different plant species is very rarely reported. An investigation was therefore carried out into the origin and a possible scenario for the rise of a sterile plant clone showing a combination of diagnostic morphological features of three separate, well-defined Potamogeton species. Methods The combination of sequences from maternally inherited cytoplasmic (rpl20-rps12) and biparentally inherited nuclear ribosomal DNA (ITS) was used to identify the exact identity of the putative triple hybrid. Key Results Direct sequencing showed ITS variants of three parental taxa, P. gramineus, P. lucens and P. perfoliatus, whereas chloroplast DNA identified P. perfoliatus as the female parent. A scenario for the rise of the triple hybrid through a fertile binary hybrid P. gramineus × P. lucens crossed with P. perfoliatus is described. Conclusions Even though the triple hybrid is sterile, it possesses an efficient strategy for its existence and became locally successful even in the parental environment, perhaps as a result of heterosis. The population investigated is the only one known of this hybrid, P. × torssanderi, worldwide. Isozyme analysis indicated the colony to be genetically uniform. The plants studied represented a single clone that seems to have persisted at this site for a long time. PMID:17478544
New insights into the promoterless transcription of DNA coligo templates by RNA polymerase III.
Lama, Lodoe; Seidl, Christine I; Ryan, Kevin
2014-01-01
Chemically synthesized DNA can carry small RNA sequence information but converting that information into small RNA is generally thought to require large double-stranded promoters in the context of plasmids, viruses and genes. We previously found evidence that circularized oligodeoxynucleotides (coligos) containing certain sequences and secondary structures can template the synthesis of small RNA by RNA polymerase III in vitro and in human cells. By using immunoprecipitated RNA polymerase III we now report corroborating evidence that this enzyme is the sole polymerase responsible for coligo transcription. The immobilized polymerase enabled experiments showing that coligo transcripts can be formed through transcription termination without subsequent 3' end trimming. To better define the determinants of productive transcription, a structure-activity relationship study was performed using over 20 new coligos. The results show that unpaired nucleotides in the coligo stem facilitate circumtranscription, but also that internal loops and bulges should be kept small to avoid secondary transcription initiation sites. A polymerase termination sequence embedded in the double-stranded region of a hairpin-encoding coligo stem can antagonize transcription. Using lessons learned from new and old coligos, we demonstrate how to convert poorly transcribed coligos into productive templates. Our findings support the possibility that coligos may prove useful as chemically synthesized vectors for the ectopic expression of small RNA in human cells.
Specific minor groove solvation is a crucial determinant of DNA binding site recognition
Harris, Lydia-Ann; Williams, Loren Dean; Koudelka, Gerald B.
2014-01-01
The DNA sequence preferences of nearly all sequence specific DNA binding proteins are influenced by the identities of bases that are not directly contacted by protein. Discrimination between non-contacted base sequences is commonly based on the differential abilities of DNA sequences to allow narrowing of the DNA minor groove. However, the factors that govern the propensity of minor groove narrowing are not completely understood. Here we show that the differential abilities of various DNA sequences to support formation of a highly ordered and stable minor groove solvation network are a key determinant of non-contacted base recognition by a sequence-specific binding protein. In addition, disrupting the solvent network in the non-contacted region of the binding site alters the protein's ability to recognize contacted base sequences at positions 5–6 bases away. This observation suggests that DNA solvent interactions link contacted and non-contacted base recognition by the protein. PMID:25429976
A Method for Preparing DNA Sequencing Templates Using a DNA-Binding Microplate
Yang, Yu; Hebron, Haroun R.; Hang, Jun
2009-01-01
A DNA-binding matrix was immobilized on the surface of a 96-well microplate and used for plasmid DNA preparation for DNA sequencing. The same DNA-binding plate was used for bacterial growth, cell lysis, DNA purification, and storage. In a single step using one buffer, bacterial cells were lysed by enzymes, and released DNA was captured on the plate simultaneously. After two wash steps, DNA was eluted and stored in the same plate. Inclusion of phosphates in the culture medium was found to enhance the yield of plasmid significantly. Purified DNA samples were used successfully in DNA sequencing with high consistency and reproducibility. Eleven vectors and nine libraries were tested using this method. In 10 μl sequencing reactions using 3 μl sample and 0.25 μl BigDye Terminator v3.1, the results from a 3730xl sequencer gave a success rate of 90–95% and read-lengths of 700 bases or more. The method is fully automatable and convenient for manual operation as well. It enables reproducible, high-throughput, rapid production of DNA with purity and yields sufficient for high-quality DNA sequencing at a substantially reduced cost. PMID:19568455
Dendritic Cell-Based Immunotherapy of Breast Cancer: Modulation by CpG DNA
2005-09-01
tumor-associated antigens and bacterial DNA oligodeoxynucleotides containing unmethylated CpG sequences (CpG DNA) further augment the immune priming...associated antigens by cytotoxic T lymphocytes, and bacterial DNA oligodeoxynucleotides containing unmethylated CpG sequences (CpG DNA) can further...further amplify their immunostimulatory capacity and bacterial DNA oligodeoxynucleotides (ODN) containing unmethylated CpG sequences (CpG DNA) provide such
Evidence for a recent origin of penguins
Subramanian, Sankar; Beans-Picón, Gabrielle; Swaminathan, Siva K.; Millar, Craig D.; Lambert, David M.
2013-01-01
Penguins are a remarkable group of birds, with the 18 extant species living in diverse climatic zones from the tropics to Antarctica. The timing of the origin of these extant penguins remains controversial. Previous studies based on DNA sequences and fossil records have suggested widely differing times for the origin of the group. This has given rise to widely differing biogeographic narratives about their evolution. To resolve this problem, we sequenced five introns from 11 species representing all genera of living penguins. Using these data and other available DNA sequences, together with the ages of multiple penguin fossils to calibrate the molecular clock, we estimated the age of the most recent common ancestor of extant penguins to be 20.4 Myr (17.0–23.8 Myr). This time is half of the previous estimates based on molecular sequence data. Our results suggest that most of the major groups of extant penguins diverged 11–16 Ma. This overlaps with the sharp decline in Antarctic temperatures that began approximately 12 Ma, suggesting a possible relationship between climate change and penguin evolution. PMID:24227045
Silaghi, C; Scheuerle, M C; Friche Passos, L M; Thiel, C; Pfister, K
2011-02-01
Central Switzerland is a highly endemic region for tick-borne fever (TBF) in cattle; however, little is known about A. phagocytophilum in goats. In the present study, 72 animals from six goat flocks (373 EDTA blood samples) in Central Switzerland were analysed for A. phagocytophilum DNA. A real-time PCR targeting the msp2 gene of A. phagocytophilum was performed, and in positive samples the partial 16S rRNA, groEL and msp4 genes were amplified for sequence analysis. Four DNA extracts were positive. Different sequence types were found on the basis of the amplified genes. For comparison, sequences of A. phagocytophilum from 12 cattle (originating from Switzerland and Southern Germany) were analysed. The 16S rRNA gene sequences from cattle were all identical to each other, but the groEL and msp4 genes differed depending on the origin of the cattle samples and differed from the variants from goats. This study provides molecular evidence for the presence of different types of A. phagocytophilum in goat flocks in Switzerland, a fact which deserves more thorough attention in clinical studies.
Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide
2011-09-01
Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.
Biological sequence compression algorithms.
Matsumoto, T; Sadakane, K; Imai, H
2000-01-01
Today, more and more DNA sequences are becoming available. The information about DNA sequences is stored in molecular biology databases. The size and importance of these databases will keep growing, so this information must be stored and communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. Standard compression algorithms such as gzip or compress cannot compress DNA sequences, but only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences to less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known. One is called palindromes or reverse complements and the other is approximate repeats. Several specific algorithms for DNA sequences that use these structures can compress them to less than two bits per symbol. In this paper, we improve CTW so that it exploits the characteristic structures of DNA sequences. Before encoding the next symbol, the algorithm searches for an approximate repeat or palindrome using hashing and dynamic programming. If there is a palindrome or an approximate repeat of sufficient length, our algorithm represents it by its length and distance. With this preprocessing, the new program achieves a slightly higher compression ratio than existing DNA-oriented compression algorithms. We also describe a new compression algorithm for protein sequences.
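Two ideas from this abstract can be made concrete with a small sketch: the two-bits-per-symbol baseline that any DNA compressor has to beat, and the reverse-complement "palindrome" structure. The sketch assumes sequences over the alphabet A, C, G, T only; the function names are hypothetical, and it does not implement CTW or the authors' repeat search.

```python
# Fixed 2-bit code per base: the baseline of two bits per symbol.
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def pack_2bit(seq):
    """Pack a DNA string into bytes, four bases per byte (zero-padded)."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        # Left-align a short final chunk so padding sits in the low bits.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def is_reverse_palindrome(seq):
    """True if the sequence equals its own reverse complement."""
    return seq == reverse_complement(seq)


packed = pack_2bit("ACGTACGT")            # 8 bases stored in 2 bytes
palindromic = is_reverse_palindrome("GAATTC")
```

A compressor of the kind described gains over this baseline only where it can replace a stretch of bases with a shorter (length, distance) reference to an earlier repeat or reverse complement.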
Arrieta-Montiel, Maria P; Shedge, Vikas; Davila, Jaime; Christensen, Alan C; Mackenzie, Sally A
2009-12-01
The plant mitochondrial genome is recombinogenic, with DNA exchange activity controlled to a large extent by nuclear gene products. One nuclear gene, MSH1, appears to participate in suppressing recombination in Arabidopsis at every repeated sequence ranging in size from 108 to 556 bp. Present in a wide range of plant species, these mitochondrial repeats display evidence of successful asymmetric DNA exchange in Arabidopsis when MSH1 is disrupted. Recombination frequency appears to be influenced by repeat sequence homology and size, with larger size repeats corresponding to increased DNA exchange activity. The extensive mitochondrial genomic reorganization of the msh1 mutant produced altered mitochondrial transcription patterns. Comparison of mitochondrial genomes from the Arabidopsis ecotypes C24, Col-0, and Ler suggests that MSH1 activity accounts for most or all of the polymorphisms distinguishing these genomes, producing ecotype-specific stoichiometric changes in each line. Our observations suggest that MSH1 participates in mitochondrial genome evolution by influencing the lineage-specific pattern of mitochondrial genetic variation in higher plants.
Association between mitochondrial DNA variations and Alzheimer's Disease in the ADNI cohort
Lakatos, Anita; Derbeneva, Olga; Younes, Danny; Keator, David; Bakken, Trygve; Lvova, Maria; Brandon, Marty; Guffanti, Guia; Reglodi, Dora; Saykin, Andrew; Weiner, Michael; Macciardi, Fabio; Schork, Nicholas; Wallace, Douglas C.; Potkin, Steven G.
2010-01-01
Despite the central role of amyloid deposition in the development of Alzheimer's disease (AD), the pathogenesis of AD still remains elusive at the molecular level. Increasing evidence suggests that compromised mitochondrial function contributes to the aging process and thus may increase the risk of AD. Dysfunctional mitochondria contribute to reactive oxygen species (ROS) which can lead to extensive macromolecule oxidative damage and the progression of amyloid pathology. Oxidative stress and amyloid toxicity leave neurons chemically vulnerable. Because the brain relies on aerobic metabolism, it is apparent that mitochondria are critical for the cerebral function. Mitochondrial DNA sequence-changes could shift cell dynamics and facilitate neuronal vulnerability. Therefore we postulated that mitochondrial DNA sequence polymorphisms may increase the risk of AD. We evaluated the role of mitochondrial haplogroups derived from 138 mitochondrial polymorphisms in 358 Caucasian ADNI subjects. Our results indicate that the mitochondrial haplogroup UK may confer genetic susceptibility to AD independently of the APOE4 allele. PMID:20538375
Detection of DNA Methylation by Whole-Genome Bisulfite Sequencing.
Li, Qing; Hermanson, Peter J; Springer, Nathan M
2018-01-01
DNA methylation plays an important role in the regulation of the expression of transposons and genes. Various methods have been developed to assay DNA methylation levels. Bisulfite sequencing is considered to be the "gold standard" for single-base resolution measurement of DNA methylation levels. Coupled with next-generation sequencing, whole-genome bisulfite sequencing (WGBS) allows DNA methylation to be evaluated at a genome-wide scale. Here, we describe a protocol for WGBS in plant species with large genomes. This protocol has been successfully applied to assay genome-wide DNA methylation levels in maize and barley. It has also been successfully coupled with sequence capture technology to assay DNA methylation levels in a targeted set of genomic regions.
Evaluating bacterial pathogen DNA preservation in museum osteological collections
Barnes, Ian; Thomas, Mark G
2005-01-01
Reports of bacterial pathogen DNA sequences obtained from archaeological bone specimens raise the possibility of greatly improving our understanding of the history of infectious diseases. However, the survival of pathogen DNA over long time periods is poorly characterized, and scepticism remains about the reliability of these data. In order to explore the survival of bacterial pathogen DNA in bone specimens, we analysed samples from 59 eighteenth and twentieth century individuals known to have been infected with either Mycobacterium tuberculosis or Treponema pallidum. No reproducible evidence of surviving pathogen DNA was obtained, despite the use of extraction and PCR-amplification methods determined to be highly sensitive. These data suggest that previous studies need to be interpreted with caution, and we propose that a much greater emphasis is placed on understanding how pathogen DNA survives in archaeological material, and how its presence can be properly verified and used. PMID:16608682
Single-Molecule Electrical Random Resequencing of DNA and RNA
NASA Astrophysics Data System (ADS)
Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji
2012-07-01
Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
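The idea of building a composite sequence from overlapping fragment reads can be illustrated with a small greedy merge. This is only a sketch under stated assumptions: the function names `overlap` and `assemble` and the three example fragments are hypothetical, chosen so that they tile the 5'-UGAGGUA-3' sequence quoted in the abstract, and the greedy strategy is a generic textbook approach rather than the authors' method.

```python
def overlap(a, b):
    """Length of the longest suffix of a that is also a prefix of b."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def assemble(fragments):
    """Greedily merge fragments by their largest suffix/prefix overlap.

    Returns the list of contigs left when no overlapping pair remains.
    """
    # Drop exact duplicates (keeping order) and fragments contained in another.
    frags = list(dict.fromkeys(fragments))
    frags = [f for f in frags if not any(f != g and f in g for g in frags)]

    while len(frags) > 1:
        best_k, best_i, best_j = 0, None, None
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    k = overlap(a, b)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            break  # no remaining pair overlaps
        merged = frags[best_i] + frags[best_j][best_k:]
        frags = [f for idx, f in enumerate(frags) if idx not in (best_i, best_j)]
        frags.append(merged)
    return frags


# Hypothetical fragment reads (not taken from the paper).
print(assemble(["GGUA", "UGAG", "GAGG"]))
```

A greedy merge can mis-assemble repetitive sequences, so it is suitable only as an illustration for short, non-repetitive oligomers like this one.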
Corruption of genomic databases with anomalous sequence.
Lamperti, E D; Kittelberger, J M; Smith, T F; Villa-Komaroff, L
1992-06-11
We describe evidence that DNA sequences from vectors used for cloning and sequencing have been incorporated accidentally into eukaryotic entries in the GenBank database. These incorporations were not restricted to one type of vector or to a single mechanism. Many minor instances may have been the result of simple editing errors, but some entries contained large blocks of vector sequence that had been incorporated by contamination or other accidents during cloning. Some cases involved unusual rearrangements and areas of vector distant from the normal insertion sites. Matches to vector were found in 0.23% of 20,000 sequences analyzed in GenBank Release 63. Although the possibility of anomalous sequence incorporation has been recognized since the inception of GenBank and should be easy to avoid, recent evidence suggests that this problem is increasing more quickly than the database itself. The presence of anomalous sequence may have serious consequences for the interpretation and use of database entries, and will have an impact on issues of database management. The incorporated vector fragments described here may also be useful for a crude estimate of the fidelity of sequence information in the database. In alignments with well-defined ends, the matching sequences showed 96.8% identity to vector; when poorer matches with arbitrary limits were included, the aggregate identity to vector sequence was 94.8%.
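As a rough illustration of what screening a database entry for vector sequence involves, the sketch below flags positions in an entry whose k-mers also occur in a vector, and computes percent identity over an ungapped alignment. It is a toy under stated assumptions: the functions `kmers`, `vector_hits` and `percent_identity`, the value of `k`, and both example sequences are hypothetical, and exact k-mer matching is a stand-in for the alignment-based searches a real screen would use.

```python
def kmers(seq, k):
    """Set of all length-k substrings of seq (upper-cased)."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def vector_hits(entry, vector, k=12):
    """Start positions in entry whose k-mer occurs exactly in vector."""
    vector_kmers = kmers(vector, k)
    entry = entry.upper()
    return [i for i in range(len(entry) - k + 1) if entry[i:i + k] in vector_kmers]


def percent_identity(a, b):
    """Percent identical positions between two equal-length, ungapped sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


# Toy sequences for illustration only (not real database entries).
toy_vector = "GAATTCGAGCTCGGTACCCGGGGATCC"
toy_entry = "ATGAAACCC" + toy_vector[5:20] + "TTTGGG"

print(vector_hits(toy_entry, toy_vector))
print(percent_identity("ACGT", "ACGA"))
```

Exact k-mer matching would miss the imperfect matches the abstract reports (identities near 95 to 97%), which is why an alignment-based search is the more realistic choice.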
Iftikhar, Romana; Ashfaq, Muhammad; Rasool, Akhtar; Hebert, Paul D N
2016-01-01
Although thrips are globally important crop pests and vectors of viral disease, species identifications are difficult because of their small size and inconspicuous morphological differences. Sequence variation in the mitochondrial COI-5' (DNA barcode) region has proven effective for the identification of species in many groups of insect pests. We analyzed barcode sequence variation among 471 thrips from various plant hosts in north-central Pakistan. The Barcode Index Number (BIN) system assigned these sequences to 55 BINs, while the Automatic Barcode Gap Discovery detected 56 partitions, a count that coincided with the number of monophyletic lineages recognized by Neighbor-Joining analysis and Bayesian inference. Congeneric species showed an average of 19% sequence divergence (range = 5.6% - 27%) at COI, while intraspecific distances averaged 0.6% (range = 0.0% - 7.6%). BIN analysis suggested that all intraspecific divergence >3.0% actually involved a species complex. In fact, sequences for three major pest species (Haplothrips reuteri, Thrips palmi, Thrips tabaci), and one predatory thrips (Aeolothrips intermedius) showed deep intraspecific divergences, providing evidence that each is a cryptic species complex. The study compiles the first barcode reference library for the thrips of Pakistan, and examines global haplotype diversity in four important pest thrips.
Powell, J. Elijah; Ratnayeke, Nalin; Moran, Nancy A.
2017-01-01
High throughput rRNA amplicon surveys of bacterial communities provide a rapid snapshot of taxonomic composition. But strains with nearly identical rRNA sequences often differ in gene repertoires and metabolic capabilities. To assess strain-level variation within Snodgrassella alvi, a gut symbiont of corbiculate bees, we performed deep sequencing on amplicons of a single copy coding gene (minD) as well as the 16S rDNA V4 region. We surveyed honey bees (Apis mellifera) sampled globally and 12 bumble bee species (Bombus) sampled from two regions of the USA. The minD analyses reveal that S. alvi contains far more strain diversity than is evident from 16S rDNA analysis. Many taxa inferred on the basis of 16S rDNA are shared between A. mellifera and Bombus species, but taxa inferred on the basis of minD are never shared and often are restricted to particular Bombus species. Clustering based on minD revealed that gut communities often reflect host species and geographic location. Both minD and 16S rDNA analyses indicate that strain diversity is higher in A. mellifera than in Bombus species. The minD locus flanks a 16S gene, enabling development of strain-specific 16S fluorescent probes to illuminate the spatial relationship of strains within the bee gut. PMID:27482856
Saberivand, Adel; Ahsan, Sima
2016-01-01
Simple and precise methods for sex determination in animals are a prerequisite for a number of applications in animal production and forensics. Some of the existing methods depend only on the detection of Y-chromosome specific sequences. However, detecting both Y- and X-chromosome specific sequences is advantageous. In the present study, the accuracy of sex determination by SRY (sex-determining region Y) and AMEL (Amelogenin) gene detection was assessed using polymerase chain reaction (PCR) on DNA extracted from free fetal cells in maternal blood, which is noninvasive for the fetus and easier to collect. PCR amplification with SRY primers produced a single band of 171 bp from ewes bearing a male fetus, whereas no band was amplified from the DNA extracted from ewes pregnant with a female fetus. Moreover, the AMEL gene primers produced two bands of 182 and 242 bp in male fetuses and a single band of 242 bp in female fetuses. Using this technique, 100% of samples were successfully sexed, excluding twins. In conclusion, we demonstrated that sex determination using DNA of free fetal cells in maternal plasma is efficient using both SRY and AMEL gene sequences. It is also evident that this method is not suitable for sex determination of twin pregnancies. Copyright © 2015 Elsevier B.V. All rights reserved.
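The band pattern described in this abstract amounts to a small decision rule, sketched below. The band sizes (171, 182 and 242 bp) come from the abstract. Everything else is an assumption made for illustration: the function names, the ±5 bp sizing tolerance, and the use of the 242 bp AMEL band, which the abstract reports in both sexes, as an amplification control.

```python
# Band sizes in bp, as reported in the abstract.
SRY_BAND = 171          # seen only with a male fetus
AMEL_MALE_BAND = 182    # seen only with a male fetus
AMEL_SHARED_BAND = 242  # seen with both male and female fetuses


def has_band(bands, expected, tol=5):
    """True if any observed band is within tol bp of the expected size.

    The tolerance is a hypothetical allowance for gel sizing error.
    """
    return any(abs(b - expected) <= tol for b in bands)


def call_sex(sry_bands, amel_bands, tol=5):
    """Classify a singleton pregnancy from observed SRY and AMEL band sizes."""
    sry = has_band(sry_bands, SRY_BAND, tol)
    amel_male = has_band(amel_bands, AMEL_MALE_BAND, tol)
    amel_shared = has_band(amel_bands, AMEL_SHARED_BAND, tol)

    if not amel_shared:
        return "failed"        # assumed control band missing
    if sry and amel_male:
        return "male"
    if not sry and not amel_male:
        return "female"
    return "inconclusive"      # the two markers disagree


print(call_sex([171], [182, 242]))
print(call_sex([], [242]))
```

Requiring the two markers to agree reflects the abstract's point that detecting both Y- and X-linked sequences is more informative than relying on a Y-specific band alone.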
Sloan, Daniel B; Müller, Karel; McCauley, David E; Taylor, Douglas R; Storchová, Helena
2012-12-01
In angiosperms, mitochondrial-encoded genes can cause cytoplasmic male sterility (CMS), resulting in the coexistence of female and hermaphroditic individuals (gynodioecy). We compared four complete mitochondrial genomes from the gynodioecious species Silene vulgaris and found unprecedented amounts of intraspecific diversity for plant mitochondrial DNA (mtDNA). Remarkably, only about half of overall sequence content is shared between any pair of genomes. The four mtDNAs range in size from 361 to 429 kb and differ in gene complement, with rpl5 and rps13 being intact in some genomes but absent or pseudogenized in others. The genomes exhibit essentially no conservation of synteny and are highly repetitive, with evidence of reciprocal recombination occurring even across short repeats (< 250 bp). Some mitochondrial genes exhibit atypically high degrees of nucleotide polymorphism, while others are invariant. The genomes also contain a variable number of small autonomously mapping chromosomes, which have only recently been identified in angiosperm mtDNA. Southern blot analysis of one of these chromosomes indicated a complex in vivo structure consisting of both monomeric circles and multimeric forms. We conclude that S. vulgaris harbors an unusually large degree of variation in mtDNA sequence and structure and discuss the extent to which this variation might be related to CMS. © 2012 The Authors. New Phytologist © 2012 New Phytologist Trust.
The genome of Eimeria spp., with special reference to Eimeria tenella--a coccidium from the chicken.
Shirley, M W
2000-04-10
Eimeria spp. contain at least four genomes. The nuclear genome is best studied in the avian species Eimeria tenella and comprises about 60 Mbp DNA contained within ca. 14 chromosomes; other avian and lapine species appear to possess a nuclear genome of similar size. In addition, sequence data and hybridisation studies have provided direct evidence for extrachromosomal mitochondrial and plastid DNA genomes, and double-stranded RNA segments have also been described. The unique phenotype of "precocious" development that characterises some selected lines of Eimeria spp. not only provides the basis for the first generation of live attenuated vaccines, but offers a significant entrée into studies on the regulation of an apicomplexan life-cycle. With a view to identifying loci implicated in the trait of precocious development, a genetic linkage map of the genome of E. tenella is being constructed in this laboratory from analyses of the inheritance of over 400 polymorphic DNA markers in the progeny of a cross between complementary drug-resistant and precocious parents. Other projects that impinge directly or indirectly on the genome and/or genetics of Eimeria spp. are currently in progress in several laboratories, and include the derivation of expressed sequence tag data and the development of ancillary technologies such as transfection techniques. No large-scale genomic DNA sequencing projects have been reported.
Hua, Jimeng; Li, Ming; Dong, Pengzhi; Cui, Ying; Xie, Qiang; Bu, Wenjun
2009-01-01
Background The true water bugs are grouped in infraorder Nepomorpha (Insecta: Hemiptera: Heteroptera) and are of great economic importance. The phylogenetic relationships within Nepomorpha and the taxonomic hierarchies of Pleoidea and Aphelocheiroidea are uncertain. Most of the previous studies were based on morphological characters without algorithmic assessment. In the latest study, the molecular markers employed in phylogenetic analyses were partial sequences of 16S rDNA and 18S rDNA with a total length about 1 kb. Up to now, no mitochondrial genome of the true water bugs has been sequenced, which is one of the largest data sets that could be compared across animal taxa. In this study we analyzed the unresolved problems in Nepomorpha using evidence from mitochondrial genomes. Results Nine mitochondrial genomes of Nepomorpha and five of other hemipterans were sequenced. These mitochondrial genomes contain the commonly found 37 genes without gene rearrangements. Based on the nucleotide sequences of mt-genomes, Pleoidea is not a member of the Nepomorpha and Aphelocheiroidea should be grouped back into Naucoroidea. Phylogenetic relationships among the superfamilies of Nepomorpha were resolved robustly. Conclusion The mt-genome is an effective data source for resolving intraordinal phylogenetic problems at the superfamily level within Heteroptera. The mitochondrial genomes of the true water bugs are typical insect mt-genomes. Based on the nucleotide sequences of the mt-genomes, we propose the Pleoidea to be a separate heteropteran infraorder. The infraorder Nepomorpha consists of five superfamilies with the relationships (Corixoidea + ((Naucoroidea + Notonectoidea) + (Ochteroidea + Nepoidea))). PMID:19523246
DNA/RNA hybrid substrates modulate the catalytic activity of purified AID.
Abdouni, Hala S; King, Justin J; Ghorbani, Atefeh; Fifield, Heather; Berghuis, Lesley; Larijani, Mani
2018-01-01
Activation-induced cytidine deaminase (AID) converts cytidine to uridine at Immunoglobulin (Ig) loci, initiating somatic hypermutation and class switching of antibodies. In vitro, AID acts on single stranded DNA (ssDNA), but neither double-stranded DNA (dsDNA) oligonucleotides nor RNA, and it is believed that transcription is the in vivo generator of ssDNA targeted by AID. It is also known that the Ig loci, particularly the switch (S) regions targeted by AID are rich in transcription-generated DNA/RNA hybrids. Here, we examined the binding and catalytic behavior of purified AID on DNA/RNA hybrid substrates bearing either random sequences or GC-rich sequences simulating Ig S regions. If substrates were made up of a random sequence, AID preferred substrates composed entirely of DNA over DNA/RNA hybrids. In contrast, if substrates were composed of S region sequences, AID preferred to mutate DNA/RNA hybrids over substrates composed entirely of DNA. Accordingly, AID exhibited a significantly higher affinity for binding DNA/RNA hybrid substrates composed specifically of S region sequences, than any other substrates composed of DNA. Thus, in the absence of any other cellular processes or factors, AID itself favors binding and mutating DNA/RNA hybrids composed of S region sequences. AID:DNA/RNA complex formation and supporting mutational analyses suggest that recognition of DNA/RNA hybrids is an inherent structural property of AID. Copyright © 2017 Elsevier Ltd. All rights reserved.
Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.
Schnitzler, P; Darai, G
1989-09-01
The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.
Beyond DNA: integrating inclusive inheritance into an extended theory of evolution.
Danchin, Étienne; Charmantier, Anne; Champagne, Frances A; Mesoudi, Alex; Pujol, Benoit; Blanchet, Simon
2011-06-17
Many biologists are calling for an 'extended evolutionary synthesis' that would 'modernize the modern synthesis' of evolution. Biological information is typically considered as being transmitted across generations by the DNA sequence alone, but accumulating evidence indicates that both genetic and non-genetic inheritance, and the interactions between them, have important effects on evolutionary outcomes. We review the evidence for such effects of epigenetic, ecological and cultural inheritance and parental effects, and outline methods that quantify the relative contributions of genetic and non-genetic heritability to the transmission of phenotypic variation across generations. These issues have implications for diverse areas, from the question of missing heritability in human complex-trait genetics to the basis of major evolutionary transitions.
Long-range correlations and charge transport properties of DNA sequences
NASA Astrophysics Data System (ADS)
Liu, Xiao-liang; Ren, Yi; Xie, Qiong-tao; Deng, Chao-sheng; Xu, Hui
2010-04-01
By using Hurst's analysis and transfer approach, the rescaled range functions and Hurst exponents of human chromosome 22 and enterobacteria phage lambda DNA sequences are investigated and the transmission coefficients, Landauer resistances and Lyapunov coefficients of finite segments based on above genomic DNA sequences are calculated. In a comparison with quasiperiodic and random artificial DNA sequences, we find that λ-DNA exhibits anticorrelation behavior characterized by a Hurst exponent 0.5
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].
Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y
2017-08-01
To sequence the whole human mitochondrial DNA (mtDNA) genome on the Ion Torrent PGM™ platform and to study differences in mtDNA sequence among tissues. Samples of chest blood, hair, costal cartilage, nail, skeletal muscle and oral epithelium were collected from 6 unrelated individuals at forensic postmortem examination. The whole mtDNA genome was amplified with 4 primer pairs. Libraries were constructed with the Ion Shear™ Plus Reagents kit and the Ion Plus Fragment Library kit, and whole-genome sequencing of mtDNA was performed on the Ion Torrent PGM™ platform. Sanger sequencing was used to confirm the heteroplasmy positions and the mutation positions in the HVⅠ region. The whole mtDNA genome was amplified successfully from all samples. The six unrelated individuals belonged to 6 different haplotypes. Different tissues from the same individual differed in heteroplasmy. The heteroplasmy positions and the mutation positions in the HVⅠ region were verified by Sanger sequencing. A consistency check with the Kappa method showed that the mtDNA sequence results were highly consistent across tissues. The method used in the present study to sequence the whole human mtDNA genome can detect heteroplasmy differences between tissues while giving highly consistent results. The results provide guidance for further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine
Rojas-Cartagena, Carmencita; Ortíz-Pineda, Pablo; Ramírez-Gómez, Francisco; Suárez-Castillo, Edna C.; Matos-Cruz, Vanessa; Rodríguez, Carlos; Ortíz-Zuazaga, Humberto; García-Arrarás, José E.
2010-01-01
Repair and regeneration are key processes for tissue maintenance, and their disruption may lead to disease states. Little is known about the molecular mechanisms that underlie the repair and regeneration of the digestive tract. The sea cucumber Holothuria glaberrima represents an excellent model to dissect and characterize the molecular events during intestinal regeneration. To study the gene expression profile, cDNA libraries were constructed from normal, 3-day, and 7-day regenerating intestines of H. glaberrima. Clones were randomly sequenced and queried against the nonredundant protein database at the National Center for Biotechnology Information. RT-PCR analyses were made of several genes to determine their expression profile during intestinal regeneration. A total of 5,173 sequences from three cDNA libraries were obtained. About 46.2, 35.6, and 26.2% of the sequences for the normal, 3-day, and 7-day cDNA libraries, respectively, shared significant similarity with known sequences in the protein database of GenBank but showed only 10% similarity among them. Analysis of the libraries in terms of functional processes, protein domains, and most common sequences suggests that a differential expression profile is taking place during the regeneration process. Further examination of the expressed sequence tag dataset revealed that 12 putative genes are differentially expressed at a significant level (R > 6). Experimental validation by RT-PCR analysis reveals that at least three genes (unknown C-4677-1, melanotransferrin, and centaurin) show differential expression during regeneration. These findings strongly suggest that the gene expression profile varies among regeneration stages and provide evidence for the existence of differential gene expression. PMID:17579180
Wong, S W; Schaffer, P A
1991-05-01
Like other DNA-containing viruses, the three origins of herpes simplex virus type 1 (HSV-1) DNA replication are flanked by sequences containing transcriptional regulatory elements. In a transient plasmid replication assay, deletion of sequences comprising the transcriptional regulatory elements of ICP4 and ICP22/47, which flank oriS, resulted in a greater than 80-fold decrease in origin function compared with a plasmid, pOS-822, which retains these sequences. In an effort to identify specific cis-acting elements responsible for this effect, we conducted systematic deletion analysis of the flanking region with plasmid pOS-822 and tested the resulting mutant plasmids for origin function. Stimulation by cis-acting elements was shown to be both distance and orientation dependent, as changes in either parameter resulted in a decrease in oriS function. Additional evidence for the stimulatory effect of flanking sequences on origin function was demonstrated by replacement of these sequences with the cytomegalovirus immediate-early promoter, resulting in nearly wild-type levels of oriS function. In competition experiments, cotransfection of cells with the test plasmid, pOS-822, and increasing molar concentrations of a competitor plasmid which contained the ICP4 and ICP22/47 transcriptional regulatory regions but lacked core origin sequences resulted in a significant reduction in the replication efficiency of pOS-822, demonstrating that factors which bind specifically to the oriS-flanking sequences are likely involved as auxiliary proteins in oriS function. Together, these studies demonstrate that trans-acting factors and the sites to which they bind play a critical role in the efficiency of HSV-1 DNA replication from oriS in transient-replication assays.
Nicosia, Aldo; Maggio, Teresa; Mazzola, Salvatore; Cuttitta, Angela
2013-10-30
Anemonia viridis is a widespread and extensively studied Mediterranean species of sea anemone from which a large number of polypeptide toxins, such as blood depressing substances (BDS) peptides, have been isolated. The first members of this class, BDS-1 and BDS-2, are polypeptides belonging to the β-defensin fold family and were initially described for their antihypertensive and antiviral activities. BDS-1 and BDS-2 are 43 amino acid peptides characterised by three disulfide bonds that act as neurotoxins affecting Kv3.1, Kv3.2 and Kv3.4 channel gating kinetics. In addition, BDS-1 inactivates the Nav1.7 and Nav1.3 channels. The development of a large dataset of A. viridis expressed sequence tags (ESTs) and the identification of 13 putative BDS-like cDNA sequences has attracted interest, especially as scientific and diagnostic tools. A comparison of BDS cDNA sequences showed that the untranslated regions are more conserved than the protein-coding regions. Moreover, the KA/KS ratios calculated for all pairwise comparisons showed values greater than 1, suggesting mechanisms of accelerated evolution. The structures of the BDS homologs were predicted by molecular modelling. All toxins possess similar 3D structures that consist of a triple-stranded antiparallel β-sheet and an additional small antiparallel β-sheet located downstream of the cleavage/maturation site; however, the orientation of the triple-stranded β-sheet appears to differ among the toxins. To characterise the spatial expression profile of the putative BDS cDNA sequences, tissue-specific cDNA libraries, enriched for BDS transcripts, were constructed. In addition, the proper amplification of ectodermal or endodermal markers ensured the tissue specificity of each library. 
Sequencing randomly selected clones from each library revealed ectodermal-specific expression of ten BDS transcripts, while transcripts of BDS-8, BDS-13, BDS-14 and BDS-15 failed to be retrieved, likely due to under-representation in our cDNA libraries. The calculation of the relative abundance of BDS transcripts in the cDNA libraries revealed that BDS-1, BDS-3, BDS-4, BDS-5 and BDS-6 are the most represented transcripts.
Comte, Alexia; Gräfenhan, Tom; Links, Matthew G; Hemmingsen, Sean M; Dumonceaux, Tim J
2017-01-01
We examined the epiphytic microbiome of cereal grain using the universal barcode chaperonin-60 (cpn60). Microbial community profiling of seed washes containing DNA extracts prepared from field-grown cereal grain detected sequences from a fungus identified only to Class Sordariomycetes. To identify the fungal sequence and to improve the reference database, we determined cpn60 sequences from field-collected and reference strains of the ergot fungus, Claviceps purpurea. These data allowed us to identify this fungal sequence as deriving from C. purpurea, and suggested that C. purpurea DNA is readily detectable on agricultural commodities, including those for which ergot was not identified as a grading factor. To get a sense of the prevalence and level of C. purpurea DNA in cereal grains, we developed a quantitative PCR assay based on the fungal internal transcribed spacer (ITS) and applied it to 137 samples from the 2014 crop year. The amount of Claviceps DNA quantified correlated strongly with the proportion of ergot sclerotia identified in each grain lot, although there was evidence that non-target organisms were responsible for some false positives with the ITS-based assay. We therefore developed a cpn60-targeted loop-mediated isothermal amplification assay and applied it to the same grain wash samples. The time to positive displayed a significant, inverse correlation to ergot levels determined by visual ratings. These results indicate that both laboratory-based and field-adaptable molecular diagnostic assays can be used to detect and quantify pathogen load in bulk commodities using cereal grain washes.
Comte, Alexia; Gräfenhan, Tom; Links, Matthew G.; Hemmingsen, Sean M.
2017-01-01
We examined the epiphytic microbiome of cereal grain using the universal barcode chaperonin-60 (cpn60). Microbial community profiling of seed washes containing DNA extracts prepared from field-grown cereal grain detected sequences from a fungus identified only to Class Sordariomycetes. To identify the fungal sequence and to improve the reference database, we determined cpn60 sequences from field-collected and reference strains of the ergot fungus, Claviceps purpurea. These data allowed us to identify this fungal sequence as deriving from C. purpurea, and suggested that C. purpurea DNA is readily detectable on agricultural commodities, including those for which ergot was not identified as a grading factor. To get a sense of the prevalence and level of C. purpurea DNA in cereal grains, we developed a quantitative PCR assay based on the fungal internal transcribed spacer (ITS) and applied it to 137 samples from the 2014 crop year. The amount of Claviceps DNA quantified correlated strongly with the proportion of ergot sclerotia identified in each grain lot, although there was evidence that non-target organisms were responsible for some false positives with the ITS-based assay. We therefore developed a cpn60-targeted loop-mediated isothermal amplification assay and applied it to the same grain wash samples. The time to positive displayed a significant, inverse correlation to ergot levels determined by visual ratings. These results indicate that both laboratory-based and field-adaptable molecular diagnostic assays can be used to detect and quantify pathogen load in bulk commodities using cereal grain washes. PMID:28257512
Sequence periodicity in nucleosomal DNA and intrinsic curvature
2010-01-01
Background Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy, the more rigid the DNA helix; thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understanding their sequence-dependent structures. Results Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequences from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. Conclusions The higher order structures associated with nucleosomes of C. elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results point to the possibility that context-dependent curvature of varying degrees is associated with nucleosomal DNA. PMID:20487515
Murray, V
1999-01-01
This article reviews the literature concerning the sequence specificity of DNA-damaging agents. DNA-damaging agents are widely used in cancer chemotherapy. It is important to understand fully the determinants of DNA sequence specificity so that more effective DNA-damaging agents can be developed as antitumor drugs. There are five main methods of DNA sequence specificity analysis: cleavage of end-labeled fragments, linear amplification with Taq DNA polymerase, ligation-mediated polymerase chain reaction (PCR), single-strand ligation PCR, and footprinting. The DNA sequence specificity in purified DNA and in intact mammalian cells is reviewed for several classes of DNA-damaging agent. These include agents that form covalent adducts with DNA, free radical generators, topoisomerase inhibitors, intercalators and minor groove binders, enzymes, and electromagnetic radiation. The main sites of adduct formation are at the N-7 of guanine in the major groove of DNA and the N-3 of adenine in the minor groove, whereas free radical generators abstract hydrogen from the deoxyribose sugar and topoisomerase inhibitors cause enzyme-DNA cross-links to form. Several issues involved in the determination of the DNA sequence specificity are discussed. The future directions of the field, with respect to cancer chemotherapy, are also examined.
Origin, evolution, and biogeography of Juglans: a phylogenetic perspective
USDA-ARS's Scientific Manuscript database
The eastern Asian and eastern North American disjunction in Juglans offers an opportunity to estimate the time since divergence of the Eurasian and American lineages and to compare it with paleobotanical evidence. Five chloroplast DNA non-coding spacer (NCS) sequences: trnT-trnF, psbA-trnH, atpB-r...
Epigenetics: A Fascinating Field with Profound Research, Clinical, & Public Health Implications
ERIC Educational Resources Information Center
Stein, Richard A.; Davis, Devra Lee
2012-01-01
Epigenetics is emerging as one of the most dynamic and vibrant biomedical areas. Multiple lines of evidence confirm that inherited genetic changes alone cannot fully explain all phenotypic characteristics of live organisms, and additional factors, which are not encoded in the DNA sequence, are involved. The contribution of non-genetic factors is…
2011-01-01
Background Bituminaria bituminosa is a perennial legume species from the Canary Islands and Mediterranean region that has potential as a drought-tolerant pasture species and as a source of pharmaceutical compounds. Three botanical varieties have previously been identified in this species: albomarginata, bituminosa and crassiuscula. B. bituminosa can be considered a genomic 'orphan' species with very few genomic resources available. New DNA sequencing technologies provide an opportunity to develop high quality molecular markers for such orphan species. Results 432,306 mRNA molecules were sampled from a leaf transcriptome of a single B. bituminosa plant using Roche 454 pyrosequencing, resulting in an average read length of 345 bp (149.1 Mbp in total). Sequences were assembled into 3,838 isotigs/contigs representing putatively unique gene transcripts. Gene ontology descriptors were identified for 3,419 sequences. Raw sequence reads containing simple sequence repeat (SSR) motifs were identified, and 240 primer pairs flanking these motifs were designed. Of 87 primer pairs developed this way, 75 (86.2%) successfully amplified primarily single fragments by PCR. Fragment analysis using 20 primer pairs in 79 accessions of B. bituminosa detected 130 alleles at 21 SSR loci. Genetic diversity analyses confirmed that variation at these SSR loci accurately reflected known taxonomic relationships in original collections of B. bituminosa and provided additional evidence that a division of the botanical variety bituminosa into two according to geographical origin (Mediterranean region and Canary Islands) may be appropriate. Evidence of cross-pollination was also found between botanical varieties within a B. bituminosa breeding programme. Conclusions B. bituminosa can no longer be considered a genomic orphan species, having now a large (albeit incomplete) repertoire of expressed gene sequences that can serve as a resource for future genetic studies. 
This experimental approach was effective in developing codominant and polymorphic SSR markers for application in diverse genetic studies. These markers have already given new insight into genetic variation in B. bituminosa, providing evidence that a division of the botanical variety bituminosa may be appropriate. This approach is commended to those seeking to develop useful markers for genomic orphan species. PMID:22171578
Xu, Jianping; Yan, Zhun; Guo, Hong
2009-06-01
The inheritance of mitochondrial genes and genomes is uniparental in most sexual eukaryotes. This pattern of inheritance makes mitochondrial genomes in natural populations effectively clonal. Here, we examined the mitochondrial population genetics of the emerging human pathogenic fungus Cryptococcus gattii. The DNA sequences for five mitochondrial DNA fragments were obtained from each of 50 isolates belonging to two evolutionarily divergent lineages, VGI and VGII. Our analyses revealed greater sequence diversity within VGI than within VGII, consistent with observations of the nuclear genes. The combined analyses of all five gene fragments indicated significant divergence between VGI and VGII. However, the five individual genealogies showed different relationships among the isolates, consistent with recent hybridization and mitochondrial gene transfer between the two lineages. Population genetic analyses of the multilocus data identified evidence for predominantly clonal mitochondrial population structures within both lineages. Interestingly, there were clear signatures of recombination among mitochondrial genes within the VGII lineage. Our analyses suggest historical mitochondrial genome divergence within C. gattii, but there is evidence for recent hybridization and recombination in the mitochondrial genome of this important human yeast pathogen.
Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing
Chandran, Anandhakumar; Syed, Junetha; Taylor, Rhys D.; Kashiwazaki, Gengo; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi
2016-01-01
Chemically engineered small molecules targeting specific genomic sequences play an important role in drug development research. Pyrrole-imidazole polyamides (PIPs) are a group of molecules that can bind to the DNA minor-groove and can be engineered to target specific sequences. Their biological effects rely primarily on their selective DNA binding. However, the binding mechanism of PIPs at the chromatinized genome level is poorly understood. Herein, we report a method using high-throughput sequencing to identify the DNA-alkylating sites of PIP-indole-seco-CBI conjugates. High-throughput sequencing analysis of conjugate 2 showed highly similar DNA-alkylating sites on synthetic oligos (histone-free DNA) and on human genomes (chromatinized DNA context). To our knowledge, this is the first report identifying alkylation sites across genomic DNA by alkylating PIP conjugates using high-throughput sequencing. PMID:27098039
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.
Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches, and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall, and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly, which included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Extensive paternal mtDNA leakage in natural populations of Drosophila melanogaster.
Nunes, Maria D S; Dolezal, Marlies; Schlötterer, Christian
2013-04-01
Strict maternal inheritance is considered a hallmark of animal mtDNA. Although recent reports suggest that paternal leakage occurs in a broad range of species, it is still considered an exceptionally rare event. To evaluate the impact of paternal leakage on the evolution of mtDNA, it is essential to reliably estimate the frequency of paternal leakage in natural populations. Using allele-specific real-time quantitative PCR (RT-qPCR), we show that heteroplasmy is common in natural populations with at least 14% of the individuals carrying multiple mitochondrial haplotypes. However, the average frequency of the minor mtDNA haplotype is low (0.8%), which suggests that this pervasive heteroplasmy has not been noticed before due to a lack of power in sequencing surveys. Based on the distribution of mtDNA haplotypes in the offspring of heteroplasmic mothers, we found no evidence for strong selection against one of the haplotypes. We estimated that the rate of paternal leakage is 6% and that at least 100 generations are required for complete sorting of mtDNA haplotypes. Despite the high proportion of heteroplasmic individuals in natural populations, we found no evidence for recombination between mtDNA molecules, suggesting that either recombination is rare or recombinant haplotypes are counter-selected. Our results indicate that evolutionary studies using mtDNA as a marker might be biased by paternal leakage in this species. © 2013 Blackwell Publishing Ltd.
Zhang, Yanhong; Pham, Nancy Kim; Zhang, Huixian; Lin, Junda; Lin, Qiang
2014-01-01
The population genetics of seahorses is certainly influenced by their species-specific ecological requirements and life-history traits. In the present study, partial sequences of mitochondrial cytochrome b (cytb) and control region (CR) were obtained from 50 Hippocampus mohnikei and 92 H. trimaculatus from four zoogeographical zones. A total of 780 base pairs of the cytb gene were sequenced to characterize mitochondrial DNA (mtDNA) diversity. The mtDNA marker revealed high haplotype diversity, low nucleotide diversity, and a lack of population structure across both populations of H. mohnikei and H. trimaculatus. A neighbour-joining (NJ) tree of cytb gene sequences showed that H. mohnikei haplotypes formed one cluster. A maximum likelihood (ML) tree of cytb gene sequences showed that H. trimaculatus belonged to one lineage. The star-like pattern of the median-joining network of cytb and CR markers indicated a previous demographic expansion of H. mohnikei and H. trimaculatus. The cytb and CR data sets exhibited a unimodal mismatch distribution, which may have resulted from population expansion. Mismatch analysis suggested that the expansion was initiated about 276,000 years ago for H. mohnikei and about 230,000 years ago for H. trimaculatus during the middle Pleistocene period. This study indicates a possible signature of genetic variation and population expansion in two seahorses in complex marine environments. PMID:25144384
Land use type significantly affects microbial gene transcription in soil.
Nacke, Heiko; Fischer, Christiane; Thürmer, Andrea; Meinicke, Peter; Daniel, Rolf
2014-05-01
Soil microorganisms play an essential role in sustaining biogeochemical processes and cycling of nutrients across different land use types. To gain insights into microbial gene transcription in forest and grassland soil, we isolated mRNA from 32 sampling sites. After sequencing of generated complementary DNA (cDNA), a total of 5,824,229 sequences could be further analyzed. We were able to assign nonribosomal cDNA sequences to all three domains of life. A dominance of bacterial sequences, which were affiliated to 25 different phyla, was found. Bacterial groups capable of aromatic compound degradation such as Phenylobacterium and Burkholderia were detected in significantly higher relative abundance in forest soil than in grassland soil. Accordingly, KEGG pathway categories related to degradation of aromatic ring-containing molecules (e.g., benzoate degradation) were identified in high abundance within forest soil-derived metatranscriptomic datasets. The impact of land use type forest on community composition and activity is evidently to a high degree caused by the presence of wood breakdown products. Correspondingly, bacterial groups known to be involved in lignin degradation and containing ligninolytic genes such as Burkholderia, Bradyrhizobium, and Azospirillum exhibited increased transcriptional activity in forest soil. Higher solar radiation in grassland presumably induced increased transcription of photosynthesis-related genes within this land use type. This is in accordance with high abundance of photosynthetic organisms and plant-infecting viruses in grassland.
The consequences of sequence erosion in the evolution of recombination hotspots.
Tiemann-Boege, Irene; Schwarz, Theresa; Striedner, Yasmin; Heissl, Angelika
2017-12-19
Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans-acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro. Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'. © 2017 The Authors.
The consequences of sequence erosion in the evolution of recombination hotspots
Schwarz, Theresa; Heissl, Angelika
2017-01-01
Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans-acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro. Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’. PMID:29109225
Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors.
Adalsteinsson, Viktor A; Ha, Gavin; Freeman, Samuel S; Choudhury, Atish D; Stover, Daniel G; Parsons, Heather A; Gydush, Gregory; Reed, Sarah C; Rotem, Denisse; Rhoades, Justin; Loginov, Denis; Livitz, Dimitri; Rosebrock, Daniel; Leshchiner, Ignaty; Kim, Jaegil; Stewart, Chip; Rosenberg, Mara; Francis, Joshua M; Zhang, Cheng-Zhong; Cohen, Ofir; Oh, Coyin; Ding, Huiming; Polak, Paz; Lloyd, Max; Mahmud, Sairah; Helvie, Karla; Merrill, Margaret S; Santiago, Rebecca A; O'Connor, Edward P; Jeong, Seong H; Leeson, Rachel; Barry, Rachel M; Kramkowski, Joseph F; Zhang, Zhenwei; Polacek, Laura; Lohr, Jens G; Schleicher, Molly; Lipscomb, Emily; Saltzman, Andrea; Oliver, Nelly M; Marini, Lori; Waks, Adrienne G; Harshman, Lauren C; Tolaney, Sara M; Van Allen, Eliezer M; Winer, Eric P; Lin, Nancy U; Nakabayashi, Mari; Taplin, Mary-Ellen; Johannessen, Cory M; Garraway, Levi A; Golub, Todd R; Boehm, Jesse S; Wagle, Nikhil; Getz, Gad; Love, J Christopher; Meyerson, Matthew
2017-11-06
Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.
Genotyping of ancient Mycobacterium tuberculosis strains reveals historic genetic diversity.
Müller, Romy; Roberts, Charlotte A; Brown, Terence A
2014-04-22
The evolutionary history of the Mycobacterium tuberculosis complex (MTBC) has previously been studied by analysis of sequence diversity in extant strains, but not addressed by direct examination of strain genotypes in archaeological remains. Here, we use ancient DNA sequencing to type 11 single nucleotide polymorphisms and two large sequence polymorphisms in the MTBC strains present in 10 archaeological samples from skeletons from Britain and Europe dating to the second-nineteenth centuries AD. The results enable us to assign the strains to groupings and lineages recognized in the extant MTBC. We show that at least during the eighteenth-nineteenth centuries AD, strains of M. tuberculosis belonging to different genetic groups were present in Britain at the same time, possibly even at a single location, and we present evidence for a mixed infection in at least one individual. Our study shows that ancient DNA typing applied to multiple samples can provide sufficiently detailed information to contribute to both archaeological and evolutionary knowledge of the history of tuberculosis.
Analysis of protein-coding genetic variation in 60,706 humans.
Lek, Monkol; Karczewski, Konrad J; Minikel, Eric V; Samocha, Kaitlin E; Banks, Eric; Fennell, Timothy; O'Donnell-Luria, Anne H; Ware, James S; Hill, Andrew J; Cummings, Beryl B; Tukiainen, Taru; Birnbaum, Daniel P; Kosmicki, Jack A; Duncan, Laramie E; Estrada, Karol; Zhao, Fengmei; Zou, James; Pierce-Hoffman, Emma; Berghout, Joanne; Cooper, David N; Deflaux, Nicole; DePristo, Mark; Do, Ron; Flannick, Jason; Fromer, Menachem; Gauthier, Laura; Goldstein, Jackie; Gupta, Namrata; Howrigan, Daniel; Kiezun, Adam; Kurki, Mitja I; Moonshine, Ami Levy; Natarajan, Pradeep; Orozco, Lorena; Peloso, Gina M; Poplin, Ryan; Rivas, Manuel A; Ruano-Rubio, Valentin; Rose, Samuel A; Ruderfer, Douglas M; Shakir, Khalid; Stenson, Peter D; Stevens, Christine; Thomas, Brett P; Tiao, Grace; Tusie-Luna, Maria T; Weisburd, Ben; Won, Hong-Hee; Yu, Dongmei; Altshuler, David M; Ardissino, Diego; Boehnke, Michael; Danesh, John; Donnelly, Stacey; Elosua, Roberto; Florez, Jose C; Gabriel, Stacey B; Getz, Gad; Glatt, Stephen J; Hultman, Christina M; Kathiresan, Sekar; Laakso, Markku; McCarroll, Steven; McCarthy, Mark I; McGovern, Dermot; McPherson, Ruth; Neale, Benjamin M; Palotie, Aarno; Purcell, Shaun M; Saleheen, Danish; Scharf, Jeremiah M; Sklar, Pamela; Sullivan, Patrick F; Tuomilehto, Jaakko; Tsuang, Ming T; Watkins, Hugh C; Wilson, James G; Daly, Mark J; MacArthur, Daniel G
2016-08-18
Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.
Unusual DNA Structures Associated With Germline Genetic Activity in Caenorhabditis elegans
Fire, Andrew; Alcazar, Rosa; Tan, Frederick
2006-01-01
We describe a surprising long-range periodicity that underlies a substantial fraction of C. elegans genomic sequence. Extended segments (up to several hundred nucleotides) of the C. elegans genome show a strong bias toward occurrence of AA/TT dinucleotides along one face of the helix while little or no such constraint is evident on the opposite helical face. Segments with this characteristic periodicity are highly overrepresented in intron sequences and are associated with a large fraction of genes with known germline expression in C. elegans. In addition to altering the path and flexibility of DNA in vitro, sequences of this character have been shown by others to constrain DNA∷nucleosome interactions, potentially producing a structure that could resist the assembly of highly ordered (phased) nucleosome arrays that have been proposed as a precursor to heterochromatin. We propose a number of ways that the periodic occurrence of An/Tn clusters could reflect evolution and function of genes that express in the germ cell lineage of C. elegans. PMID:16648589
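The helical-face bias described in this abstract can be illustrated with a small counting exercise. The following is only a sketch under stated assumptions: it uses a whole-number period of 10 bases as a stand-in for the helical repeat (which is closer to 10.5 bp in solution), and the function name `aa_tt_phase_counts` and the toy sequence are hypothetical, not taken from the study.

```python
def aa_tt_phase_counts(seq, period=10):
    """Count AA/TT dinucleotide start positions at each offset modulo `period`.

    A strong peak at one or two offsets suggests that AA/TT steps recur
    on the same face of the helix (assumption: integer period of 10 bases).
    """
    seq = seq.upper()
    counts = [0] * period
    for i in range(len(seq) - 1):
        if seq[i:i + 2] in ("AA", "TT"):
            counts[i % period] += 1
    return counts


# Toy sequence (hypothetical): one AA step every 10 bases, padded with GC.
toy = ("AA" + "GCGCGCGC") * 5
print(aa_tt_phase_counts(toy))
```

In the toy sequence all AA steps fall at offset 0, so the counts concentrate in a single phase bin; a sequence without periodicity would spread its counts across the bins.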
Standage, Daniel S; Berens, Ali J; Glastad, Karl M; Severin, Andrew J; Brendel, Volker P; Toth, Amy L
2016-04-01
Comparative genomics of social insects has been intensely pursued in recent years with the goal of providing insights into the evolution of social behaviour and its underlying genomic and epigenomic basis. However, the comparative approach has been hampered by a paucity of data on some of the most informative social forms (e.g. incipiently and primitively social) and taxa (especially members of the wasp family Vespidae) for studying social evolution. Here, we provide a draft genome of the primitively eusocial model insect Polistes dominula, accompanied by analysis of caste-related transcriptome and methylome sequence data for adult queens and workers. Polistes dominula possesses a fairly typical hymenopteran genome, but shows very low genomewide GC content and some evidence of reduced genome size. We found numerous caste-related differences in gene expression, with evidence that both conserved and novel genes are related to caste differences. Most strikingly, these -omics data reveal a major reduction in one of the major epigenetic mechanisms that has been previously suggested to be important for caste differences in social insects: DNA methylation. Along with a conspicuous loss of a key gene associated with environmentally responsive DNA methylation (the de novo DNA methyltransferase Dnmt3), these wasps have greatly reduced genomewide methylation to almost zero. In addition to providing a valuable resource for comparative analysis of social insect evolution, our integrative -omics data for this important behavioural and evolutionary model system call into question the general importance of DNA methylation in caste differences and evolution in social insects. © 2016 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Clark, Kerry L; Leydet, Brian F; Threlkeld, Clifford
2014-05-01
The present study investigated the cause of illness in human patients primarily in the southern USA with suspected Lyme disease based on erythema migrans-like skin lesions and/or symptoms consistent with early localized or late disseminated Lyme borreliosis. The study also included some patients from other states throughout the USA. Several PCR assays specific for either members of the genus Borrelia or only for Lyme group Borrelia spp. (Borrelia burgdorferi sensu lato), and DNA sequence analysis, were used to identify Borrelia spp. DNA in blood and skin biopsy samples from human patients. B. burgdorferi sensu lato DNA was found in both blood and skin biopsy samples from patients residing in the southern states and elsewhere in the USA, but no evidence of DNA from other Borrelia spp. was detected. Based on phylogenetic analysis of partial flagellin (flaB) gene sequences, strains that clustered separately with B. burgdorferi sensu stricto, Borrelia americana or Borrelia andersonii were associated with Lyme disease-like signs and symptoms in patients from the southern states, as well as from some other areas of the country. Strains most similar to B. burgdorferi sensu stricto and B. americana were found most commonly and appeared to be widely distributed among patients residing throughout the USA. The study findings suggest that human cases of Lyme disease in the southern USA may be more common than previously recognized and may also be caused by more than one species of B. burgdorferi sensu lato. This study provides further evidence that B. burgdorferi sensu stricto is not the only species associated with signs and/or symptoms consistent with Lyme borreliosis in the USA.
An evolution based biosensor receptor DNA sequence generation algorithm.
Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng
2010-01-01
A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.
Interactions of neanderthals and modern humans: what can be inferred from mitochondrial DNA?
Cyran, Krzysztof A; Kimmel, Marek
2005-07-01
This paper reviews the state-of-the-art knowledge concerning the relationship between Neanderthals and Upper Paleolithic modern humans. The branching-process method is applied to infer the upper limit of hypothetical Neanderthal admixture, consistent with the evidence based on mitochondrial DNA sequences of contemporary modern humans, as well as Neanderthal and early modern European H. sapiens fossils. As a result, a maximum value of 15% admixture is obtained. This estimate is discussed in the context of its consequences for the two competing theories of modern human origin.
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis
Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab
2012-01-01
RDNAnalyzer is an innovative computer-based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate a DNA sequence, or the user can upload sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various applications for sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides tools for sequence analyses routinely performed by biological scientists, such as DNA replication, reverse complement generation, transcription, translation, sequence-specific information such as the total number of nucleotide bases and ATGC base contents along with their respective percentages, and a sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation, and it provides a user-friendly environment for sequence analysis. It is freely available. Availability http://www.cemb.edu.pk/sw.html Abbreviations RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language. PMID:23055611
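For readers unfamiliar with the Nussinov algorithm named in this abstract, the textbook form of the recursion can be sketched as follows. This is not RDNAnalyzer's code (the tool is written in C# and extends the algorithm in ways the abstract does not describe); the function names, the restriction to Watson-Crick A-T and G-C pairs, and the `min_loop` parameter are assumptions made for illustration.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    comp = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(comp[b] for b in reversed(seq.upper()))


def nussinov_max_pairs(seq, min_loop=3):
    """Maximum number of nested base pairs, by the basic Nussinov recursion.

    Assumptions: only A-T and G-C pairs count, and at least `min_loop`
    unpaired bases must separate the two partners of any pair.
    """
    seq = seq.upper()
    n = len(seq)
    pairs = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}
    # dp[i][j] holds the best pair count for the subsequence seq[i..j].
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            for k in range(i, j - min_loop):  # case 2: base j pairs with k
                if (seq[k], seq[j]) in pairs:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1] if n else 0


print(reverse_complement("ATGC"))
print(nussinov_max_pairs("GGGAAACCC"))
```

The recursion only maximizes the number of pairs; it does not model stacking energies, so its output is a rough structural hint rather than a thermodynamic prediction.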
Structural and Thermodynamic Signatures of DNA Recognition by Mycobacterium tuberculosis DnaA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsodikov, Oleg V.; Biswas, Tapan
An essential protein, DnaA, binds to 9-bp DNA sites within the origin of replication oriC. These binding events are prerequisite to forming an enigmatic nucleoprotein scaffold that initiates replication. The number, sequences, positions, and orientations of these short DNA sites, or DnaA boxes, within the oriCs of different bacteria vary considerably. To investigate features of DnaA boxes that are important for binding Mycobacterium tuberculosis DnaA (MtDnaA), we have determined the crystal structures of the DNA binding domain (DBD) of MtDnaA bound to a cognate MtDnaA-box (at 2.0 Å resolution) and to a consensus Escherichia coli DnaA-box (at 2.3 Å). These structures, complemented by calorimetric equilibrium binding studies of MtDnaA DBD in a series of DnaA-box variants, reveal the main determinants of DNA recognition and establish the [T/C][T/A][G/A]TCCACA sequence as a high-affinity MtDnaA-box. Bioinformatic and calorimetric analyses indicate that DnaA-box sequences in mycobacterial oriCs generally differ from the optimal binding sequence. This sequence variation occurs commonly at the first 2 bp, making an in vivo mycobacterial DnaA-box effectively a 7-mer and not a 9-mer. We demonstrate that the decrease in the affinity of these MtDnaA-box variants for MtDnaA DBD relative to that of the highest-affinity box TTGTCCACA is less than 10-fold. The understanding of DnaA-box recognition by MtDnaA and E. coli DnaA enables one to map DnaA-box sequences in the genomes of M. tuberculosis and other eubacteria.
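The high-affinity consensus quoted in this abstract, [T/C][T/A][G/A]TCCACA, translates directly into a pattern search. The sketch below is an illustration only: the function name `find_mtdnaa_boxes` and the test sequence are hypothetical, only the given strand is scanned (a real mapping would also scan the reverse complement), and an exact consensus match says nothing about measured affinity.

```python
import re

# Consensus from the abstract, written as a regular expression.
# The lookahead lets overlapping matches be reported as well.
MTDNAA_BOX = re.compile(r"(?=([TC][TA][GA]TCCACA))")


def find_mtdnaa_boxes(seq):
    """Return (start, site) tuples for consensus matches on the given strand."""
    return [(m.start(), m.group(1)) for m in MTDNAA_BOX.finditer(seq.upper())]


# Hypothetical test sequence containing two consensus matches.
example = "GGTTGTCCACAGGCAATCCACATT"
print(find_mtdnaa_boxes(example))
```

Start positions are 0-based offsets into the input string.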
Crainey, James L; Mattos-Glória, Aline; Hamada, Neusa; Luz, Sérgio L B
2014-03-01
Following the success of the Onchocerciasis Elimination Programme for the Americas (OEPA), there is now just one Latin American onchocerciasis focus where onchocerciasis transmission is described as 'on-going:' the Amazonia Onchocerciasis focus. In the hyperendemic highland areas of the Amazonia focus, Simulium guianense s.l. Wise are the most important vectors of the disease. Populations of S. guianense s.l. are, however, known to vary in their cytogenetics and in a range of behaviours, including in their biting habits. In the hypoendemic lowland areas of the Amazonia focus, for example, S. guianense s.l. are generally regarded as zoophilic and consequently unimportant to disease transmission. Robust tools, to discriminate among various populations of S. guianense s.l. have, however, not yet been developed. In the work reported here, we have assessed the utility of a ribosomal DNA sequence fragment spanning the nuclear ribosomal ITS-1, ITS-2 and 5.8S sequence regions and a ∼850 nucleotide portion of the mitochondrial cytochrome oxidase gene (CO1) for species-level identification and for resolving the within species substructuring. We report here how we have generated 78 CO1 sequences from a rich set of both zoophilic and anthropophilic populations of S. guianense s.l. that were collected from eight sites that are broadly distributed across Brazil. Consistent with previous findings, our analysis supports the genetic isolation of Simulium litobranchium from S. guianense s.l. In contrast with previous findings, however, our results did not provide support for the divergence of the two species prior to the radiation of S. guianense s.l. In our analysis of the S. guianense s.l. ribosomal DNA sequence trace files we generated, we provide clear evidence of multiple within-specimen single nucleotide polymorphisms and indels suggesting that S. guianense s.l. ribosomal DNA is not a good target for conventional DNA barcoding. This is the first report of S. guianense s.l. 
within-individual ribosomal DNA variation and thus the first evidence that the species is not subject to the normal effects of concerted evolution. Collectively, these data illustrate the need for diverse sampling in the development of robust molecular tools for vector identification and suggest that ribosomal DNA might be able to assist with resolving S. guianense s.l. species substructuring that CO1 barcoding has hitherto failed to resolve. Copyright © 2013 Elsevier B.V. All rights reserved.
DNA barcode goes two-dimensions: DNA QR code web server.
Liu, Chang; Shi, Linchun; Xu, Xiaolan; Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin
2012-01-01
The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, "DNA barcode" actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interests, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications.
TaxI: a software tool for DNA barcoding using distance methods
Steinke, Dirk; Vences, Miguel; Salzburger, Walter; Meyer, Axel
2005-01-01
DNA barcoding is a promising approach to the diagnosis of biological diversity in which DNA sequences serve as the primary key for information retrieval. Most existing software for evolutionary analysis of DNA sequences was designed for phylogenetic analyses and, hence, those algorithms do not offer appropriate solutions for the rapid, but precise analyses needed for DNA barcoding, and are also unable to process the often large comparative datasets. We developed a flexible software tool for DNA taxonomy, named TaxI. This program calculates sequence divergences between a query sequence (taxon to be barcoded) and each sequence of a dataset of reference sequences defined by the user. Because the analysis is based on separate pairwise alignments this software is also able to work with sequences characterized by multiple insertions and deletions that are difficult to align in large sequence sets (i.e. thousands of sequences) by multiple alignment algorithms because of computational restrictions. Here, we demonstrate the utility of this approach with two datasets of fish larvae and juveniles from Lake Constance and juvenile land snails under different models of sequence evolution. Sets of ribosomal 16S rRNA sequences, characterized by multiple indels, performed as good as or better than cox1 sequence sets in assigning sequences to species, demonstrating the suitability of rRNA genes for DNA barcoding. PMID:16214755
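The query-versus-reference comparison described in the TaxI abstract above can be sketched as follows. This is only a minimal illustration, not TaxI's actual implementation: the function names, the scoring values and the use of an uncorrected p-distance are all assumptions made for the example. Each reference is aligned to the query separately (a plain Needleman-Wunsch global alignment), so no multiple alignment is needed, and gapped columns are ignored when the divergence is counted.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment; scoring values are illustrative assumptions."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            score[i][j] = max(diag, score[i - 1][j] + gap, score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + (
            match if a[i - 1] == b[j - 1] else mismatch
        ):
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def p_distance(aligned_a, aligned_b):
    """Fraction of mismatches over aligned columns that contain no gap."""
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]
    if not pairs:
        return 1.0
    return sum(x != y for x, y in pairs) / len(pairs)


def rank_references(query, references):
    """Return (name, distance) pairs sorted by increasing divergence from the query."""
    hits = [(name, p_distance(*global_align(query, seq))) for name, seq in references.items()]
    return sorted(hits, key=lambda hit: (hit[1], hit[0]))


# Hypothetical reference set, used only to show the call.
refs = {"sp_A": "ACGTACGT", "sp_B": "ACGTTCGT", "sp_C": "ACGACGT"}
ranking = rank_references("ACGTACGT", refs)
```

Because every comparison is an independent pairwise alignment, sequences with many indels (such as 16S rRNA) can be handled without first building a multiple alignment, which is the property the abstract emphasizes.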
Tabor, Stanley; Richardson, Charles C.
1995-04-25
A method for sequencing a strand of DNA, including the steps of: providing the strand of DNA; annealing the strand with a primer able to hybridize to the strand to give an annealed mixture; incubating the mixture with four deoxyribonucleoside triphosphates, a DNA polymerase, and at least three deoxyribonucleoside triphosphates in different amounts, under conditions favoring primer extension to form nucleic acid fragments complementary to the DNA to be sequenced; labelling the nucleic acid fragments; separating them and determining the position of the deoxyribonucleoside triphosphates by differences in the intensity of the labels, thereby to determine the DNA sequence.
Kukita, Yoji; Matoba, Ryo; Uchida, Junji; Hamakawa, Takuya; Doki, Yuichiro; Imamura, Fumio; Kato, Kikuya
2015-08-01
Circulating tumour DNA (ctDNA) is an emerging field of cancer research. However, current ctDNA analysis is usually restricted to one or a few mutation sites due to technical limitations. In the case of massively parallel DNA sequencers, the number of false positives caused by a high read error rate is a major problem. In addition, the final sequence reads do not represent the original DNA population due to the global amplification step during the template preparation. We established a high-fidelity target sequencing system of individual molecules identified in plasma cell-free DNA using barcode sequences; this system consists of the following two steps. (i) A novel target sequencing method that adds barcode sequences by adaptor ligation. This method uses linear amplification to eliminate the errors introduced during the early cycles of polymerase chain reaction. (ii) The monitoring and removal of erroneous barcode tags. This process involves the identification of individual molecules that have been sequenced and for which the number of mutations has been quantitated absolutely. Using plasma cell-free DNA from patients with gastric or lung cancer, we demonstrated that the system achieved near complete elimination of false positives and enabled de novo detection and absolute quantitation of mutations in plasma cell-free DNA. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
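The general idea behind barcode-based error suppression, as outlined in the abstract above, can be illustrated with a short sketch. It is not the authors' pipeline: the function name, the `min_family_size` threshold and the simple per-position majority vote are assumptions chosen for illustration, and reads sharing a barcode are assumed to be of equal length and already aligned to each other. Reads carrying the same barcode are treated as copies of one original molecule, so an error present in only a minority of copies is voted out.

```python
from collections import Counter, defaultdict


def consensus_by_barcode(tagged_reads, min_family_size=2):
    """Group (barcode, sequence) pairs by barcode and call a majority consensus.

    Families with fewer than `min_family_size` reads are dropped, because a
    single read offers no way to distinguish a true variant from a read error.
    """
    families = defaultdict(list)
    for barcode, seq in tagged_reads:
        families[barcode].append(seq)

    consensus = {}
    for barcode, seqs in families.items():
        if len(seqs) < min_family_size:
            continue
        # zip(*seqs) walks the reads column by column (equal lengths assumed).
        consensus[barcode] = "".join(
            Counter(column).most_common(1)[0][0] for column in zip(*seqs)
        )
    return consensus


# Hypothetical reads: one read in the "AAT" family carries an error (G -> T).
reads = [("AAT", "ACGT"), ("AAT", "ACGT"), ("AAT", "ACTT"), ("GGC", "ACGA")]
molecules = consensus_by_barcode(reads)
```

Counting the surviving barcode families, rather than raw reads, is what makes quantitation per original molecule possible in this kind of scheme.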
Brown and polar bear Y chromosomes reveal extensive male-biased gene flow within brother lineages.
Bidon, Tobias; Janke, Axel; Fain, Steven R; Eiken, Hans Geir; Hagen, Snorre B; Saarma, Urmas; Hallström, Björn M; Lecomte, Nicolas; Hailer, Frank
2014-06-01
Brown and polar bears have become prominent examples in phylogeography, but previous phylogeographic studies relied largely on maternally inherited mitochondrial DNA (mtDNA) or were geographically restricted. The male-specific Y chromosome, a natural counterpart to mtDNA, has remained underexplored. Although this paternally inherited chromosome is indispensable for comprehensive analyses of phylogeographic patterns, technical difficulties and low variability have hampered its application in most mammals. We developed 13 novel Y-chromosomal sequence and microsatellite markers from the polar bear genome and screened these in a broad geographic sample of 130 brown and polar bears. We also analyzed a 390-kb-long Y-chromosomal scaffold using sequencing data from published male ursine genomes. Y chromosome evidence supports the emerging understanding that brown and polar bears started to diverge no later than the Middle Pleistocene. Contrary to mtDNA patterns, we found 1) brown and polar bears to be reciprocally monophyletic sister (or rather brother) lineages, without signals of introgression, 2) male-biased gene flow across continents and on phylogeographic time scales, and 3) male dispersal that links the Alaskan ABC islands population to mainland brown bears. Due to female philopatry, mtDNA provides a highly structured estimate of population differentiation, while male-biased gene flow is a homogenizing force for nuclear genetic variation. Our findings highlight the importance of analyzing both maternally and paternally inherited loci for a comprehensive view of phylogeographic history, and that mtDNA-based phylogeographic studies of many mammals should be reevaluated. Recent advances in sequencing technology render the analysis of Y-chromosomal variation feasible, even in nonmodel organisms. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved.
Smith, Adam C.; Suzuki, Masako; Thompson, Reid; Choufani, Sanaa; Higgins, Michael J.; Chiu, Idy W.; Squire, Jeremy A.; Greally, John M.; Weksberg, Rosanna
2015-01-01
Beckwith-Wiedemann syndrome (BWS) is an overgrowth syndrome associated with genetic or epigenetic alterations in one of two imprinted domains on chromosome 11p15.5. Rarely, chromosomal translocations or inversions of chromosome 11p15.5 are associated with BWS but the molecular pathophysiology in such cases is not understood. In our series of 3 translocation and 2 inversion patients with BWS, the chromosome 11p15.5 breakpoints map within the centromeric imprinted domain (domain 2). We hypothesized that microdeletions/microduplications adjacent to the breakpoints could disrupt genomic sequences important for imprinted gene regulation. An alternate hypothesis was that epigenetic alterations of as yet unknown regulatory DNA sequences result in the BWS phenotype. A high resolution Nimblegen custom microarray was designed representing all non-repetitive sequences in the telomeric 33 Mb of the short arm of human chromosome 11. For the BWS-associated chromosome 11p15.5 translocations and inversions, we found no evidence of microdeletions/microduplications. DNA methylation was also tested on this microarray using the HpaII tiny fragment enrichment by ligation-mediated PCR (HELP) assay. This high-resolution DNA methylation microarray analysis revealed a gain of DNA methylation in the translocation/inversion patients affecting the p-ter segment of chromosome 11p15, including both imprinted domains. BWS patients that inherited a maternal translocation or inversion also demonstrated reduced expression of the growth suppressing imprinted gene, CDKN1C in Domain 2. In summary, our data demonstrate that translocations and inversions involving imprinted domain 2 on chromosome 11p15.5 alter regional DNA methylation patterns and imprinted gene expression in cis, suggesting that these epigenetic alterations are generated by an alteration in “chromatin context”. PMID:22079941
Epigenomics and bolting tolerance in sugar beet genotypes.
Hébrard, Claire; Peterson, Daniel G; Willems, Glenda; Delaunay, Alain; Jesson, Béline; Lefèbvre, Marc; Barnes, Steve; Maury, Stéphane
2016-01-01
In sugar beet (Beta vulgaris altissima), bolting tolerance is an essential agronomic trait reflecting the bolting response of genotypes after vernalization. Genes involved in induction of sugar beet bolting have now been identified, and evidence suggests that epigenetic factors are involved in their control. Indeed, the time course and amplitude of DNA methylation variations in the shoot apical meristem have been shown to be critical in inducing sugar beet bolting, and a few functional targets of DNA methylation during vernalization have been identified. However, molecular mechanisms controlling bolting tolerance levels among genotypes are still poorly understood. Here, gene expression and DNA methylation profiles were compared in shoot apical meristems of three bolting-resistant and three bolting-sensitive genotypes after vernalization. Using Cot fractionation followed by 454 sequencing of the isolated low-copy DNA, 6231 contigs were obtained that were used along with public sugar beet DNA sequences to design custom Agilent microarrays for expression (56k) and methylation (244k) analyses. A total of 169 differentially expressed genes and 111 differentially methylated regions were identified between resistant and sensitive vernalized genotypes. Fourteen sequences were both differentially expressed and differentially methylated, with a negative correlation between their methylation and expression levels. Genes involved in cold perception, phytohormone signalling, and flowering induction were over-represented and collectively represent an integrative gene network from environmental perception to bolting induction. Altogether, the data suggest that the genotype-dependent control of DNA methylation and expression of an integrative gene network participate in bolting tolerance in sugar beet, opening up perspectives for crop improvement. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-09-19
To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G. planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G. planctonica and 262,888-bp G. sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G. sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-01-01
To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G. planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G. planctonica and 262,888-bp G. sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G. sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. PMID:27503298
Specific Inhibition of the transcription factor Ci by a Cobalt(III)-Schiff base-DNA conjugate
Hurtado, Ryan R.; Harney, Allison S.; Heffern, Marie C.; Holbrook, Robert J.; Holmgren, Robert A.; Meade, Thomas J.
2012-01-01
We describe the use of Co(III) Schiff base-DNA conjugates, a versatile class of research tools that target C2H2 transcription factors, to inhibit the Hedgehog (Hh) pathway. In developing mammalian embryos, Hh signaling is critical for the formation and development of many tissues and organs. Inappropriate activation of the Hedgehog (Hh) pathway has been implicated in a variety of cancers including medulloblastomas and basal cell carcinomas. It is well known that Hh regulates the activity of the Gli family of C2H2 zinc finger transcription factors in mammals. In Drosophila the function of the Gli proteins is performed by a single transcription factor with an identical DNA binding consensus sequence, Cubitus Interruptus (Ci). We have demonstrated previously that conjugation of a specific 17 base-pair oligonucleotide to a Co(III) Schiff base complex results in a targeted inhibitor of the Snail family C2H2 zinc finger transcription factors. Modification of the oligonucleotide sequence in the Co(III) Schiff base-DNA conjugate to that of Ci’s consensus sequence (Co(III)-Ci) generates an equally selective inhibitor of Ci. Co(III)-Ci irreversibly binds the Ci zinc finger domain and prevents it from binding DNA in vitro. In a Ci responsive tissue culture reporter gene assay, Co(III)-Ci reduces the transcriptional activity of Ci in a concentration dependent manner. In addition, injection of wild-type Drosophila embryos with Co(III)-Ci phenocopies a Ci loss of function phenotype, demonstrating effectiveness in vivo. This study provides evidence that Co(III) Schiff base-DNA conjugates are a versatile class of specific and potent tools for studying zinc finger domain proteins and have potential applications as customizable anti-cancer therapeutics. PMID:22214326
Aguilar, William; Paz, Manuel M; Vargas, Anayatzinc; Clement, Cristina C; Cheng, Shu-Yuan; Champeil, Elise
2018-04-20
Mitomycin C (MC), a potent antitumor drug, and decarbamoylmitomycin C (DMC), a derivative lacking the carbamoyl group, form highly cytotoxic DNA interstrand crosslinks. The major interstrand crosslink formed by DMC is the C1'' epimer of the major crosslink formed by MC. The molecular basis for the stereochemical configuration exhibited by DMC was investigated using biomimetic synthesis. The formation of DNA-DNA crosslinks by DMC is diastereospecific and diastereodivergent: Only the 1''S-diastereomer of the initially formed monoadduct can form crosslinks at GpC sequences, and only the 1''R-diastereomer of the monoadduct can form crosslinks at CpG sequences. We also show that CpG and GpC sequences react with divergent diastereoselectivity in the first alkylation step: 1"S stereochemistry is favored at GpC sequences and 1''R stereochemistry is favored at CpG sequences. Therefore, the first alkylation step results, at each sequence, in the selective formation of the diastereomer able to generate an interstrand DNA-DNA crosslink after the "second arm" alkylation. Examination of the known DNA adduct pattern obtained after treatment of cancer cell cultures with DMC indicates that the GpC sequence is the major target for the formation of DNA-DNA crosslinks in vivo by this drug. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sproul, John S; Maddison, David R
2017-11-01
Despite advances that allow DNA sequencing of old museum specimens, sequencing small-bodied, historical specimens can be challenging and unreliable as many contain only small amounts of fragmented DNA. Dependable methods to sequence such specimens are especially critical if the specimens are unique. We attempt to sequence small-bodied (3-6 mm) historical specimens (including nomenclatural types) of beetles that have been housed, dried, in museums for 58-159 years, and for which few or no suitable replacement specimens exist. To better understand ideal approaches of sample preparation and produce preparation guidelines, we compared different library preparation protocols using low amounts of input DNA (1-10 ng). We also explored low-cost optimizations designed to improve library preparation efficiency and sequencing success of historical specimens with minimal DNA, such as enzymatic repair of DNA. We report successful sample preparation and sequencing for all historical specimens despite our low-input DNA approach. We provide a list of guidelines related to DNA repair, bead handling, reducing adapter dimers and library amplification. We present these guidelines to facilitate more economical use of valuable DNA and enable more consistent results in projects that aim to sequence challenging, irreplaceable historical specimens. © 2017 John Wiley & Sons Ltd.
Mohammed, Monzoorul Haque; Ghosh, Tarini Shankar; Chadaram, Sudha; Mande, Sharmila S
2011-11-30
Obtaining accurate estimates of microbial diversity using rDNA profiling is the first step in most metagenomics projects. Consequently, most metagenomic projects spend considerable amounts of time, money and manpower for experimentally cloning, amplifying and sequencing the rDNA content in a metagenomic sample. In the second step, the entire genomic content of the metagenome is extracted, sequenced and analyzed. Since DNA sequences obtained in this second step also contain rDNA fragments, rapid in silico identification of these rDNA fragments would drastically reduce the cost, time and effort of current metagenomic projects by entirely bypassing the experimental steps of primer based rDNA amplification, cloning and sequencing. In this study, we present an algorithm called i-rDNA that can facilitate the rapid detection of 16S rDNA fragments from amongst millions of sequences in metagenomic data sets with high detection sensitivity. Performance evaluation with data sets/database variants simulating typical metagenomic scenarios indicates the significantly high detection sensitivity of i-rDNA. Moreover, i-rDNA can process a million sequences in less than an hour on a simple desktop with modest hardware specifications. In addition to the speed of execution, high sensitivity and low false positive rate, the utility of the algorithmic approach discussed in this paper is immense given that it would help in bypassing the entire experimental step of primer-based rDNA amplification, cloning and sequencing. Application of this algorithmic approach would thus drastically reduce the cost, time and human efforts invested in all metagenomic projects. A web-server for the i-rDNA algorithm is available at http://metagenomics.atc.tcs.com/i-rDNA/
Biosensors for DNA sequence detection
NASA Technical Reports Server (NTRS)
Vercoutere, Wenonah; Akeson, Mark
2002-01-01
DNA biosensors are being developed as alternatives to conventional DNA microarrays. These devices couple signal transduction directly to sequence recognition. Some of the most sensitive and functional technologies use fibre optics or electrochemical sensors in combination with DNA hybridization. In a shift from sequence recognition by hybridization, two emerging single-molecule techniques read sequence composition using zero-mode waveguides or electrical impedance in nanoscale pores.
Thomas, W. Kelley; Vida, J. T.; Frisse, Linda M.; Mundo, Manuel; Baldwin, James G.
1997-01-01
To effectively integrate DNA sequence analysis and classical nematode taxonomy, we must be able to obtain DNA sequences from formalin-fixed specimens. Microdissected sections of nematodes were removed from specimens fixed in formalin, using standard protocols and without destroying morphological features. The fixed sections provided sufficient template for multiple polymerase chain reaction-based DNA sequence analyses. PMID:19274156
Star, Bastiaan; Nederbragt, Alexander J.; Hansen, Marianne H. S.; Skage, Morten; Gilfillan, Gregor D.; Bradbury, Ian R.; Pampoulie, Christophe; Stenseth, Nils Chr; Jakobsen, Kjetill S.; Jentoft, Sissel
2014-01-01
Degradation-specific processes and variation in laboratory protocols can bias the DNA sequence composition from samples of ancient or historic origin. Here, we identify a novel artifact in sequences from historic samples of Atlantic cod (Gadus morhua), which forms interrupted palindromes consisting of reverse complementary sequence at the 5′ and 3′-ends of sequencing reads. The palindromic sequences themselves have specific properties – the bases at the 5′-end align well to the reference genome, whereas extensive misalignments exist among the bases at the terminal 3′-end. The terminal 3′ bases are artificial extensions likely caused by the occurrence of hairpin loops in single stranded DNA (ssDNA), which can be ligated and amplified in particular library creation protocols. We propose that such hairpin loops allow the inclusion of erroneous nucleotides, specifically at the 3′-end of DNA strands, with the 5′-end of the same strand providing the template. We also find these palindromes in previously published ancient DNA (aDNA) datasets, albeit at varying and substantially lower frequencies. This artifact can negatively affect the yield of endogenous DNA in these types of samples and introduces sequence bias. PMID:24608104
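The interrupted-palindrome pattern described in the abstract above (a 3′ end that is the reverse complement of the 5′ end of the same read) can be screened for with a short check. This is a sketch under stated assumptions, not the authors' detection method: the function names and the `min_len` cut-off are hypothetical, and exact matching is used for simplicity, whereas real reads would probably need some tolerance for sequencing errors.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return seq.translate(COMPLEMENT)[::-1]


def terminal_palindrome_length(read, min_len=4):
    """Length of the longest 3' tail that reverse-complements the 5' head.

    Only arms of at least `min_len` bases are reported, since very short
    matches are expected by chance. Returns 0 when no such arm is found.
    """
    for k in range(len(read) // 2, min_len - 1, -1):
        if reverse_complement(read[:k]) == read[-k:]:
            return k
    return 0


# Hypothetical read: 6-base head, 7-base spacer, then the head's reverse complement.
artifact_read = "ACGTTG" + "AAAAAAA" + reverse_complement("ACGTTG")
arm = terminal_palindrome_length(artifact_read)
```

A read flagged this way could then have its 3′ arm trimmed before mapping, since the abstract attributes the misalignments to that end of the read.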
Yamada, Kazuhiko; Nishida-Umehara, Chizuko; Matsuda, Yoichi
2004-03-01
We isolated a new family of satellite DNA sequences from HaeIII- and EcoRI-digested genomic DNA of the Blakiston's fish owl (Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved in the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species.
The (not so) immortal strand hypothesis.
Tomasetti, Cristian; Bozic, Ivana
2015-03-01
Non-random segregation of DNA strands during stem cell replication has been proposed as a mechanism to minimize accumulated genetic errors in stem cells of rapidly dividing tissues. According to this hypothesis, an "immortal" DNA strand is passed to the stem cell daughter and not the more differentiated cell, keeping the stem cell lineage replication error-free. After it was introduced, experimental evidence both in favor of and against the hypothesis has been presented. Using a novel methodology that utilizes cancer sequencing data we are able to estimate the rate of accumulation of mutations in healthy stem cells of the colon, blood and head and neck tissues. We find that in these tissues mutations in stem cells accumulate at rates strikingly similar to those expected without the protection from the immortal strand mechanism. Utilizing an approach that is fundamentally different from previous efforts to confirm or refute the immortal strand hypothesis, we provide evidence against non-random segregation of DNA during stem cell replication. Our results strongly suggest that parental DNA is passed randomly to stem cell daughters and provide new insight into the mechanism of DNA replication in stem cells. Copyright © 2015. Published by Elsevier B.V.
Evidence for horizontal transfer of mitochondrial DNA to the plastid genome in a bamboo genus.
Ma, Peng-Fei; Zhang, Yu-Xiao; Guo, Zhen-Hua; Li, De-Zhu
2015-06-23
In flowering plants, three genomes (nuclear, mitochondrial, and plastid) coexist and intracellular horizontal transfer of DNA is prevalent, especially from the plastid to the mitochondrion genome. However, the plastid genomes are generally conserved in evolution and have long been considered immune to foreign DNA. Recently, the opposite direction of DNA transfer from the mitochondrial to the plastid genome has been reported in two eudicot lineages. Here we sequenced 6 plastid genomes of bamboos, three of which are neotropical woody species and three are herbaceous ones. Several unusual features were found, including the duplication of trnT-GGU and loss of one copy of rps19 due to contraction of inverted repeats (IRs). The most intriguing was the ~2.7 kb insertion in the plastid IR regions in the three herbaceous bamboos. Furthermore, the insertion was documented to be horizontally transferred from the mitochondrial to the plastid genome. Our study provided evidence of the mitochondrial-to-plastid DNA transfer in the monocots, demonstrating again that this rare event does occur in other angiosperm lineages. However, the mechanism underlying the transfer remains obscure, and more studies in other plants may elucidate it in the future.
The (not so) Immortal Strand Hypothesis
Tomasetti, Cristian; Bozic, Ivana
2015-01-01
Background: Non-random segregation of DNA strands during stem cell replication has been proposed as a mechanism to minimize accumulated genetic errors in stem cells of rapidly dividing tissues. According to this hypothesis, an “immortal” DNA strand is passed to the stem cell daughter and not the more differentiated cell, keeping the stem cell lineage replication error-free. After it was introduced, experimental evidence both in favor of and against the hypothesis has been presented. Principal Findings: Using a novel methodology that utilizes cancer sequencing data we are able to estimate the rate of accumulation of mutations in healthy stem cells of the colon, blood and head and neck tissues. We find that in these tissues mutations in stem cells accumulate at rates strikingly similar to those expected without the protection from the immortal strand mechanism. Significance: Utilizing an approach that is fundamentally different from previous efforts to confirm or refute the immortal strand hypothesis, we provide strong evidence against non-random segregation of DNA during stem cell replication. Our results strongly suggest that parental DNA is passed randomly to stem cell daughters and provide new insight into the mechanism of DNA replication in stem cells. PMID:25700960
Barker, F Keith; Barrowclough, George F; Groth, Jeff G
2002-01-01
Passerine birds comprise over half of avian diversity, but have proved difficult to classify. Despite a long history of work on this group, no comprehensive hypothesis of passerine family-level relationships was available until recent analyses of DNA-DNA hybridization data. Unfortunately, given the value of such a hypothesis in comparative studies of passerine ecology and behaviour, the DNA-hybridization results have not been well tested using independent data and analytical approaches. Therefore, we analysed nucleotide sequence variation at the nuclear RAG-1 and c-mos genes from 69 passerine taxa, including representatives of most currently recognized families. In contradiction to previous DNA-hybridization studies, our analyses suggest paraphyly of suboscine passerines because the suboscine New Zealand wren Acanthisitta was found to be sister to all other passerines. Additionally, we reconstructed the parvorder Corvida as a basal paraphyletic grade within the oscine passerines. Finally, we found strong evidence that several family-level taxa are misplaced in the hybridization results, including the Alaudidae, Irenidae, and Melanocharitidae. The hypothesis of relationships we present here suggests that the oscine passerines arose on the Australian continental plate while it was isolated by oceanic barriers and that a major northern radiation of oscines (i.e. the parvorder Passerida) originated subsequent to dispersal from the south. PMID:11839199
Alpha3, a transposable element that promotes host sexual reproduction.
Barsoum, Emad; Martinez, Paula; Aström, Stefan U
2010-01-01
Theoretical models predict that selfish DNA elements require host sex to persist in a population. Therefore, a transposon that induces sex would strongly favor its own spread. We demonstrate that a protein homologous to transposases, called alpha3, was essential for mating type switch in Kluyveromyces lactis. Mutational analysis showed that amino acids conserved among transposases were essential for its function. During switching, sequences in the 5' and 3' flanking regions of the alpha3 gene were joined, forming a DNA circle, showing that alpha3 mobilized from the genome. The sequences encompassing the alpha3 gene circle junctions in the mating type alpha (MATalpha) locus were essential for switching from MATalpha to MATa, suggesting that alpha3 mobilization was a coupled event. Switching also required a DNA-binding protein, Mating type switch 1 (Mts1), whose binding sites in MATalpha were important. Expression of Mts1 was repressed in MATa/MATalpha diploids and by nutrients, limiting switching to haploids in low-nutrient conditions. A hairpin-capped DNA double-strand break (DSB) was observed in the MATa locus in mre11 mutant strains, indicating that mating type switch was induced by MAT-specific DSBs. This study provides empirical evidence for selfish DNA promoting host sexual reproduction by mediating mating type switch.