dna sequence composition: Topics by Science.gov

Sample records for dna sequence composition

Biosensors for DNA sequence detection

NASA Technical Reports Server (NTRS)

Vercoutere, Wenonah; Akeson, Mark

2002-01-01

DNA biosensors are being developed as alternatives to conventional DNA microarrays. These devices couple signal transduction directly to sequence recognition. Some of the most sensitive and functional technologies use fibre optics or electrochemical sensors in combination with DNA hybridization. In a shift from sequence recognition by hybridization, two emerging single-molecule techniques read sequence composition using zero-mode waveguides or electrical impedance in nanoscale pores.
Nucleotide sequence composition and method for detection of neisseria gonorrhoeae

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lo, A.; Yang, H.L.

1990-02-13

This patent describes a composition of matter that is specific for {ital Neisseria gonorrhoeae}. It comprises: at least one nucleotide sequence for which the ratio of the amount of the sequence which hybridizes to chromosomal DNA of {ital Neisseria gonorrhoeae} to the amount of the sequence which hybridizes to chromosomal DNA of {ital Neisseria meningitidis} is greater than about five. The ratio being obtained by a method described.
Quantitative DNA fiber mapping

DOEpatents

Gray, Joe W.; Weier, Heinz-Ulrich G.

1998-01-01

The present invention relates generally to the DNA mapping and sequencing technologies. In particular, the present invention provides enhanced methods and compositions for the physical mapping and positional cloning of genomic DNA. The present invention also provides a useful analytical technique to directly map cloned DNA sequences onto individual stretched DNA molecules.
An improved model for whole genome phylogenetic analysis by Fourier transform.

PubMed

Yin, Changchuan; Yau, Stephen S-T

2015-10-07

DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences undergone rearrangements as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, and thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
Comparison of base composition analysis and Sanger sequencing of mitochondrial DNA for four U.S. population groups.

PubMed

Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M

2014-01-01

A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays. Published by Elsevier Ireland Ltd.
Simulations Using Random-Generated DNA and RNA Sequences

ERIC Educational Resources Information Center

Bryce, C. F. A.

1977-01-01

Using a very simple computer program written in BASIC, a very large number of random-generated DNA or RNA sequences are obtained. Students use these sequences to predict complementary sequences and translational products, evaluate base compositions, determine frequencies of particular triplet codons, and suggest possible secondary structures.…
[Replication of Streptomyces plasmids: the DNA nucleotide sequence of plasmid pSB 24.2].

PubMed

Bolotin, A P; Sorokin, A V; Aleksandrov, N N; Danilenko, V N; Kozlov, Iu I

1985-11-01

The nucleotide sequence of DNA in plasmid pSB 24.2, a natural deletion derivative of plasmid pSB 24.1 isolated from S. cyanogenus was studied. The plasmid amounted by its size to 3706 nucleotide pairs. The G-C composition was equal to 73 per cent. The analysis of the DNA structure in plasmid pSB 24.2 revealed the protein-encoding sequence of DNA, the continuity of which was significant for replication of the plasmid containing more than 1300 nucleotide pairs. The analysis also revealed two A-T-rich areas of DNA, the G-C composition of which was less than 55 per cent and a DNA area with a branched pin structure. The results may be of value in investigation of plasmid replication in actinomycetes and experimental cloning of DNA with this plasmid as a vector.
repDNA: a Python package to generate various modes of feature vectors for DNA sequences by incorporating user-defined physicochemical properties and sequence-order effects.

PubMed

Liu, Bin; Liu, Fule; Fang, Longyun; Wang, Xiaolong; Chou, Kuo-Chen

2015-04-15

In order to develop powerful computational predictors for identifying the biological features or attributes of DNAs, one of the most challenging problems is to find a suitable approach to effectively represent the DNA sequences. To facilitate the studies of DNAs and nucleotides, we developed a Python package called representations of DNAs (repDNA) for generating the widely used features reflecting the physicochemical properties and sequence-order effects of DNAs and nucleotides. There are three feature groups composed of 15 features. The first group calculates three nucleic acid composition features describing the local sequence information by means of kmers; the second group calculates six autocorrelation features describing the level of correlation between two oligonucleotides along a DNA sequence in terms of their specific physicochemical properties; the third group calculates six pseudo nucleotide composition features, which can be used to represent a DNA sequence with a discrete model or vector yet still keep considerable sequence-order information via the physicochemical properties of its constituent oligonucleotides. In addition, these features can be easily calculated based on both the built-in and user-defined properties via using repDNA. The repDNA Python package is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repDNA/. bliu@insun.hit.edu.cn or kcchou@gordonlifescience.org Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Functionalized gold nanoparticles as additive to form polymer/metal composite matrix for improved DNA sequencing by capillary electrophoresis.

PubMed

Zhou, Dan; Yang, Liping; Yang, Runmiao; Song, Weihua; Peng, Shuhua; Wang, Yanmei

2009-11-15

A new matrix additive, poly (N,N-dimethylacrylamide)-functionalized gold nanoparticle (GNP-PDMA), was prepared by "grafting-to" approach, and then incorporated into quasi-interpenetrating network (quasi-IPN) composed of linear polyacrylamide (LPA, 3.3 MDa) and PDMA to form novel polymer/metal composite sieving matrix (quasi-IPN/GNP-PDMA) for DNA sequencing by capillary electrophoresis. Without complete optimization, quasi-IPN/GNP-PDMA yielded a readlength of 801 bases at 98% accuracy in about 64 min by using the ABI 310 Genetic Analyzer at 50 degrees C and 150 V/cm. Compared with previous quasi-IPN/GNPs, quasi-IPN/GNP-PDMA can further improve DNA sequencing performances. This is because the presence of GNP-PDMA can improve the compatibility of GNPs with the whole sequencing system, enhance the entanglement degree of networks, and increase the GNP concentration in system, which consequently lead to higher restriction and stability, higher apparent molecular weight (MW), and smaller pore size of the total sieving networks. Furthermore, the composite matrix was also compared with quasi-IPN containing higher-MW LPA and commercial POP-6. The results indicate that the composite matrix is a promising one for DNA sequencing to achieve full automation due to the separation provided with high resolution, speediness, excellent reproducibility, and easy loading in the presence of GNP-PDMA.
Composition and immuno-stimulatory properties of extracellular DNA from mouse gut flora.

PubMed

Qi, Ce; Li, Ya; Yu, Ren-Qiang; Zhou, Sheng-Li; Wang, Xing-Guo; Le, Guo-Wei; Jin, Qing-Zhe; Xiao, Hang; Sun, Jin

2017-11-28

To demonstrate that specific bacteria might release bacterial extracellular DNA (eDNA) to exert immunomodulatory functions in the mouse small intestine. Extracellular DNA was extracted using phosphate buffered saline with 0.5 mmol/L dithiothreitol combined with two phenol extractions. TOTO-1 iodide, a cell-impermeant and high-affinity nucleic acid stain, was used to confirm the existence of eDNA in the mucus layers of the small intestine and colon in healthy Male C57BL/6 mice. Composition difference of eDNA and intracellular DNA (iDNA) of the small intestinal mucus was studied by Illumina sequencing and terminal restriction fragment length polymorphism (T-RFLP). Stimulation of cytokine production by eDNA was studied in RAW264.7 cells in vitro . TOTO-1 iodide staining confirmed existence of eDNA in loose mucus layer of the mouse colon and thin surface mucus layer of the small intestine. Illumina sequencing analysis and T-RFLP revealed that the composition of the eDNA in the small intestinal mucus was significantly different from that of the iDNA of the small intestinal mucus bacteria. Illumina Miseq sequencing showed that the eDNA sequences came mainly from Gram-negative bacteria of Bacteroidales S24-7. By contrast, predominant bacteria of the small intestinal flora comprised Gram-positive bacteria. Both eDNA and iDNA were added to native or lipopolysaccharide-stimulated Raw267.4 macrophages, respectively. The eDNA induced significantly lower tumor necrosis factor-α/interleukin-10 (IL-10) and IL-6/IL-10 ratios than iDNA, suggesting the predominance for maintaining immune homeostasis of the gut. Our results indicated that degraded bacterial genomic DNA was mainly released by Gram-negative bacteria, especially Bacteroidales-S24-7 and Stenotrophomonas genus in gut mucus of mice. They decreased pro-inflammatory activity compared to total gut flora genomic DNA.
Single-Molecule Electrical Random Resequencing of DNA and RNA

NASA Astrophysics Data System (ADS)

Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji

2012-07-01

Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
Mechanisms generating long range correlation in nucleotide composition of the Borrelia Burgdorferi genome

NASA Astrophysics Data System (ADS)

Mackiewicz, P.; Gierlik, A.; Kowalczuk, M.; Szczepanik, D.; Dudek, M. R.; Cebrat, S.

1999-12-01

We have analysed protein coding and intergenic sequences in the Borrelia burgdorferi (the Lyme disease bacterium) genome using different kinds of DNA walks. Genes occupying the leading strand of DNA have significantly different nucleotide composition from genes occupying the lagging strand. Nucleotide compositional bias of the two DNA strands reflects the aminoacid composition of proteins. 96% of genes coding for ribosomal proteins lie on the leading DNA strand, which suggests that the positions of these as well as other genes are non-random. In the B. burgdorferi genome, the asymmetry in intergenic DNA sequences is lower than the asymmetry in the third positions in codons. All these characters of the B. burgdorferi genome suggest that both replication-associated mutational pressure and recombination mechanisms have established the specific structure of the genome and now any recombination leading to inversion of a gene in respect to the direction of replication is forbidden. This property of the genome allows us to assume that it is in a steady state, which enables us to fix some parameters for simulations of DNA evolution.
Methods and compositions for chromosome-specific staining

DOEpatents

Gray, Joe W.; Pinkel, Daniel

2003-07-22

Methods and compositions for chromosome-specific staining are provided. Compositions comprise heterogenous mixtures of labeled nucleic acid fragments having substantially complementary base sequences to unique sequence regions of the chromosomal DNA for which their associated staining reagent is specific. Methods include methods for making the chromosome-specific staining compositions of the invention, and methods for applying the staining compositions to chromosomes.
Organizational heterogeneity of vertebrate genomes.

PubMed

Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham

2012-01-01

Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
Enzymatic DNA molecules

NASA Technical Reports Server (NTRS)

Joyce, Gerald F. (Inventor); Breaker, Ronald R. (Inventor)

1998-01-01

The present invention discloses deoxyribonucleic acid enzymes--catalytic or enzymatic DNA molecules--capable of cleaving nucleic acid sequences or molecules, particularly RNA, in a site-specific manner, as well as compositions including same. Methods of making and using the disclosed enzymes and compositions are also disclosed.
Electrochemical direct immobilization of DNA sequences for label-free herpes virus detection

NASA Astrophysics Data System (ADS)

Tam, Phuong Dinh; Trung, Tran; Tuan, Mai Anh; Chien, Nguyen Duc

2009-09-01

DNA sequences/bio-macromolecules of herpes virus (5'-AT CAC CGA CCC GGA GAG GGA C-3') were directly immobilized into polypyrrole matrix by using the cyclic voltammetry method, and grafted onto arrays of interdigitated platinum microelectrodes. The morphology surface of the obtained PPy/DNA of herpes virus composite films was investigated by a FESEM Hitachi-S 4800. Fourier transform infrared spectroscopy (FTIR) was used to characterize the PPy/DNA film and to study the specific interactions that may exist between DNA biomacromolecules and PPy chains. Attempts are made to use these PPy/DNA composite films for label-free herpes virus detection revealed a response time of 60 s in solutions containing as low as 2 nM DNA concentration, and self life of six months when immerged in double distilled water and kept refrigerated.
Methods for chromosome-specific staining

DOEpatents

Gray, Joe W.; Pinkel, Daniel

1995-01-01

Methods and compositions for chromosome-specific staining are provided. Compositions comprise heterogenous mixtures of labeled nucleic acid fragments having substantially complementary base sequences to unique sequence regions of the chromosomal DNA for which their associated staining reagent is specific. Methods include methods for making the chromosome-specific staining compositions of the invention, and methods for applying the staining compositions to chromosomes.
Bacterial community composition in different sediments from the Eastern Mediterranean Sea: a comparison of four 16S ribosomal DNA clone libraries.

PubMed

Polymenakou, Paraskevi N; Bertilsson, Stefan; Tselepides, Anastasios; Stephanou, Euripides G

2005-10-01

The regional variability of sediment bacterial community composition and diversity was studied by comparative analysis of four large 16S ribosomal DNA (rDNA) clone libraries from sediments in different regions of the Eastern Mediterranean Sea (Thermaikos Gulf, Cretan Sea, and South lonian Sea). Amplified rDNA restriction analysis of 664 clones from the libraries indicate that the rDNA richness and evenness was high: for example, a near-1:1 relationship among screened clones and number of unique restriction patterns when up to 190 clones were screened for each library. Phylogenetic analysis of 207 bacterial 16S rDNA sequences from the sediment libraries demonstrated that Gamma-, Delta-, and Alphaproteobacteria, Holophaga/Acidobacteria, Planctomycetales, Actinobacteria, Bacteroidetes, and Verrucomicrobia were represented in all four libraries. A few clones also grouped with the Betaproteobacteria, Nitrospirae, Spirochaetales, Chlamydiae, Firmicutes, and candidate division OPl 1. The abundance of sequences affiliated with Gammaproteobacteria was higher in libraries from shallow sediments in the Thermaikos Gulf (30 m) and the Cretan Sea (100 m) compared to the deeper South Ionian station (2790 m). Most sequences in the four sediment libraries clustered with uncultured 16S rDNA phylotypes from marine habitats, and many of the closest matches were clones from hydrocarbon seeps, benzene-mineralizing consortia, sulfate reducers, sulk oxidizers, and ammonia oxidizers. LIBSHUFF statistics of 16S rDNA gene sequences from the four libraries revealed major differences, indicating either a very high richness in the sediment bacterial communities or considerable variability in bacterial community composition among regions, or both.
Differential repetitive DNA composition in the centromeric region of chromosomes of Amazonian lizard species in the family Teiidae

PubMed Central

Carvalho, Natalia D. M.; Carmo, Edson; Neves, Rogerio O.; Schneider, Carlos Henrique; Gross, Maria Claudia

2016-01-01

Abstract Differences in heterochromatin distribution patterns and its composition were observed in Amazonian teiid species. Studies have shown repetitive DNA harbors heterochromatic blocks which are located in centromeric and telomeric regions in Ameiva ameiva (Linnaeus, 1758), Kentropyx calcarata (Spix, 1825), Kentropyx pelviceps (Cope, 1868), and Tupinambis teguixin (Linnaeus, 1758). In Cnemidophorus sp.1, repetitive DNA has multiple signals along all chromosomes. The aim of this study was to characterize moderately and highly repetitive DNA sequences by Cot1-DNA from Ameiva ameiva and Cnemidophorus sp.1 genomes through cloning and DNA sequencing, as well as mapping them chromosomally to better understand its organization and genome dynamics. The results of sequencing of DNA libraries obtained by Cot1-DNA showed that different microsatellites, transposons, retrotransposons, and some gene families also comprise the fraction of repetitive DNA in the teiid species. FISH using Cot1-DNA probes isolated from both Ameiva ameiva and Cnemidophorus sp.1 showed these sequences mainly located in heterochromatic centromeric, and telomeric regions in Ameiva ameiva, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin chromosomes, indicating they play structural and functional roles in the genome of these species. In Cnemidophorus sp.1, Cot1-DNA probe isolated from Ameiva ameiva had multiple interstitial signals on chromosomes, whereas mapping of Cot1-DNA isolated from the Ameiva ameiva and Cnemidophorus sp.1 highlighted centromeric regions of some chromosomes. Thus, the data obtained showed that many repetitive DNA classes are part of the genome of Ameiva ameiva, Cnemidophorus sp.1, Kentroyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin, and these sequences are shared among the analyzed teiid species, but they were not always allocated at the same chromosome position. PMID:27551343
Differential repetitive DNA composition in the centromeric region of chromosomes of Amazonian lizard species in the family Teiidae.

PubMed

Carvalho, Natalia D M; Carmo, Edson; Neves, Rogerio O; Schneider, Carlos Henrique; Gross, Maria Claudia

2016-01-01

Differences in heterochromatin distribution patterns and its composition were observed in Amazonian teiid species. Studies have shown repetitive DNA harbors heterochromatic blocks which are located in centromeric and telomeric regions in Ameiva ameiva (Linnaeus, 1758), Kentropyx calcarata (Spix, 1825), Kentropyx pelviceps (Cope, 1868), and Tupinambis teguixin (Linnaeus, 1758). In Cnemidophorus sp.1, repetitive DNA has multiple signals along all chromosomes. The aim of this study was to characterize moderately and highly repetitive DNA sequences by C ot1-DNA from Ameiva ameiva and Cnemidophorus sp.1 genomes through cloning and DNA sequencing, as well as mapping them chromosomally to better understand its organization and genome dynamics. The results of sequencing of DNA libraries obtained by C ot1-DNA showed that different microsatellites, transposons, retrotransposons, and some gene families also comprise the fraction of repetitive DNA in the teiid species. FISH using C ot1-DNA probes isolated from both Ameiva ameiva and Cnemidophorus sp.1 showed these sequences mainly located in heterochromatic centromeric, and telomeric regions in Ameiva ameiva, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin chromosomes, indicating they play structural and functional roles in the genome of these species. In Cnemidophorus sp.1, C ot1-DNA probe isolated from Ameiva ameiva had multiple interstitial signals on chromosomes, whereas mapping of C ot1-DNA isolated from the Ameiva ameiva and Cnemidophorus sp.1 highlighted centromeric regions of some chromosomes. Thus, the data obtained showed that many repetitive DNA classes are part of the genome of Ameiva ameiva, Cnemidophorus sp.1, Kentroyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin, and these sequences are shared among the analyzed teiid species, but they were not always allocated at the same chromosome position.

Methods for chromosome-specific staining

DOEpatents

Gray, J.W.; Pinkel, D.

1995-09-05

Methods and compositions for chromosome-specific staining are provided. Compositions comprise heterogeneous mixtures of labeled nucleic acid fragments having substantially complementary base sequences to unique sequence regions of the chromosomal DNA for which their associated staining reagent is specific. Methods include ways for making the chromosome-specific staining compositions of the invention, and methods for applying the staining compositions to chromosomes. 3 figs.
Optical properties and electronic transitions of DNA oligonucleotides as a function of composition and stacking sequence.

PubMed

Schimelman, Jacob B; Dryden, Daniel M; Poudel, Lokendra; Krawiec, Katherine E; Ma, Yingfang; Podgornik, Rudolf; Parsegian, V Adrian; Denoyer, Linda K; Ching, Wai-Yim; Steinmetz, Nicole F; French, Roger H

2015-02-14

The role of base pair composition and stacking sequence in the optical properties and electronic transitions of DNA is of fundamental interest. We present and compare the optical properties of DNA oligonucleotides (AT)10, (AT)5(GC)5, and (AT-GC)5 using both ab initio methods and UV-vis molar absorbance measurements. Our data indicate a strong dependence of both the position and intensity of UV absorbance features on oligonucleotide composition and stacking sequence. The partial densities of states for each oligonucleotide indicate that the valence band edge arises from a feature associated with the PO4(3-) complex anion, and the conduction band edge arises from anti-bonding states in DNA base pairs. The results show a strong correspondence between the ab initio and experimentally determined optical properties. These results highlight the benefit of full spectral analysis of DNA, as opposed to reductive methods that consider only the 260 nm absorbance (A260) or simple purity ratios, such as A260/A230 or A260/A280, and suggest that the slope of the absorption edge onset may provide a useful metric for the degree of base pair stacking in DNA. These insights may prove useful for applications in biology, bioelectronics, and mesoscale self-assembly.
Palindromic Sequence Artifacts Generated during Next Generation Sequencing Library Preparation from Historic and Ancient DNA

PubMed Central

Star, Bastiaan; Nederbragt, Alexander J.; Hansen, Marianne H. S.; Skage, Morten; Gilfillan, Gregor D.; Bradbury, Ian R.; Pampoulie, Christophe; Stenseth, Nils Chr; Jakobsen, Kjetill S.; Jentoft, Sissel

2014-01-01

Degradation-specific processes and variation in laboratory protocols can bias the DNA sequence composition from samples of ancient or historic origin. Here, we identify a novel artifact in sequences from historic samples of Atlantic cod (Gadus morhua), which forms interrupted palindromes consisting of reverse complementary sequence at the 5′ and 3′-ends of sequencing reads. The palindromic sequences themselves have specific properties – the bases at the 5′-end align well to the reference genome, whereas extensive misalignments exists among the bases at the terminal 3′-end. The terminal 3′ bases are artificial extensions likely caused by the occurrence of hairpin loops in single stranded DNA (ssDNA), which can be ligated and amplified in particular library creation protocols. We propose that such hairpin loops allow the inclusion of erroneous nucleotides, specifically at the 3′-end of DNA strands, with the 5′-end of the same strand providing the template. We also find these palindromes in previously published ancient DNA (aDNA) datasets, albeit at varying and substantially lower frequencies. This artifact can negatively affect the yield of endogenous DNA in these types of samples and introduces sequence bias. PMID:24608104
Comparison of large-insert, small-insert and pyrosequencing libraries for metagenomic analysis.

PubMed

Danhorn, Thomas; Young, Curtis R; DeLong, Edward F

2012-11-01

The development of DNA sequencing methods for characterizing microbial communities has evolved rapidly over the past decades. To evaluate more traditional, as well as newer methodologies for DNA library preparation and sequencing, we compared fosmid, short-insert shotgun and 454 pyrosequencing libraries prepared from the same metagenomic DNA samples. GC content was elevated in all fosmid libraries, compared with shotgun and 454 libraries. Taxonomic composition of the different libraries suggested that this was caused by a relative underrepresentation of dominant taxonomic groups with low GC content, notably Prochlorales and the SAR11 cluster, in fosmid libraries. While these abundant taxa had a large impact on library representation, we also observed a positive correlation between taxon GC content and fosmid library representation in other low-GC taxa, suggesting a general trend. Analysis of gene category representation in different libraries indicated that the functional composition of a library was largely a reflection of its taxonomic composition, and no additional systematic biases against particular functional categories were detected at the level of sequencing depth in our samples. Another important but less predictable factor influencing the apparent taxonomic and functional library composition was the read length afforded by the different sequencing technologies. Our comparisons and analyses provide a detailed perspective on the influence of library type on the recovery of microbial taxa in metagenomic libraries and underscore the different uses and utilities of more traditional, as well as contemporary 'next-generation' DNA library construction and sequencing technologies for exploring the genomics of the natural microbial world.
DNA extraction protocols cause differences in 16S rRNA amplicon sequencing efficiency but not in community profile composition or structure

DOE PAGES

None

2014-12-01

The recent development of methods applying next-generation sequencing to microbial community characterization has led to the proliferation of these studies in a wide variety of sample types. Yet, variation in the physical properties of environmental samples demands that optimal DNA extraction techniques be explored for each new environment. The microbiota associated with many species of insects offer an extraction challenge as they are frequently surrounded by an armored exoskeleton, inhibiting disruption of the tissues within. In this study, we examine the efficacy of several commonly used protocols for extracting bacterial DNA from ants. While bacterial community composition recovered using Illuminamore » 16S rRNA amplicon sequencing was not detectably biased by any method, the quantity of bacterial DNA varied drastically, reducing the number of samples that could be amplified and sequenced. These results indicate that the concentration necessary for dependable sequencing is around 10,000 copies of target DNA per microliter. Exoskeletal pulverization and tissue digestion increased the reliability of extractions, suggesting that these steps should be included in any study of insect-associated microorganisms that relies on obtaining microbial DNA from intact body segments. Although laboratory and analysis techniques should be standardized across diverse sample types as much as possible, minimal modifications such as these will increase the number of environments in which bacterial communities can be successfully studied.« less
Predicting DNA binding proteins using support vector machine with hybrid fractal features.

PubMed

Niu, Xiao-Hui; Hu, Xue-Hai; Shi, Feng; Xia, Jing-Bo

2014-02-21

DNA-binding proteins play a vitally important role in many biological processes. Prediction of DNA-binding proteins from amino acid sequence is a significant but not fairly resolved scientific problem. Chaos game representation (CGR) investigates the patterns hidden in protein sequences, and visually reveals previously unknown structure. Fractal dimensions (FD) are good tools to measure sizes of complex, highly irregular geometric objects. In order to extract the intrinsic correlation with DNA-binding property from protein sequences, CGR algorithm, fractal dimension and amino acid composition are applied to formulate the numerical features of protein samples in this paper. Seven groups of features are extracted, which can be computed directly from the primary sequence, and each group is evaluated by the 10-fold cross-validation test and Jackknife test. Comparing the results of numerical experiments, the group of amino acid composition and fractal dimension (21-dimension vector) gets the best result, the average accuracy is 81.82% and average Matthew's correlation coefficient (MCC) is 0.6017. This resulting predictor is also compared with existing method DNA-Prot and shows better performances. © 2013 The Authors. Published by Elsevier Ltd All rights reserved.
Full-Length Venom Protein cDNA Sequences from Venom-Derived mRNA: Exploring Compositional Variation and Adaptive Multigene Evolution

PubMed Central

Modahl, Cassandra M.; Mackessy, Stephen P.

2016-01-01

Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. This approach, requiring only venom, provides access to cDNA sequences in the absence of living specimens, even from commercial venom sources, to evaluate important regional differences in venom composition and to study snake venom protein evolution. PMID:27280639
DNA-PK assay

DOEpatents

Anderson, Carl W.; Connelly, Margery A.

2004-10-12

The present invention provides a method for detecting DNA-activated protein kinase (DNA-PK) activity in a biological sample. The method includes contacting a biological sample with a detectably-labeled phosphate donor and a synthetic peptide substrate defined by the following features to provide specific recognition and phosphorylation by DNA-PK: (1) a phosphate-accepting amino acid pair which may include serine-glutamine (Ser-Gln) (SQ), threonine-glutamine (Thr-Gln) (TQ), glutamine-serine (Gln-Ser) (QS), or glutamine-threonine (Gln-Thr) (QT); (2) enhancer amino acids which may include glutamic acid or glutamine immediately adjacent at the amino- or carboxyl- side of the amino acid pair and forming an amino acid pair-enhancer unit; (3) a first spacer sequence at the amino terminus of the amino acid pair-enhancer unit; (4) a second spacer sequence at the carboxyl terminus of the amino acid pair-enhancer unit, which spacer sequences may include any combination of amino acids that does not provide a phosphorylation site consensus sequence motif; and, (5) a tag moiety, which may be an amino acid sequence or another chemical entity that permits separating the synthetic peptide from the phosphate donor. A compostion and a kit for the detection of DNA-PK activity are also provided. Methods for detecting DNA, protein phosphatases and substances that alter the activity of DNA-PK are also provided. The present invention also provides a method of monitoring protein kinase and DNA-PK activity in living cells. -A composition and a kit for monitoring protein kinase activity in vitro and a composition and a kit for monitoring DNA-PK activities in living cells are also provided. A method for identifying agents that alter protein kinase activity in vitro and a method for identifying agents that alter DNA-PK activity in living cells are also provided.
Cytophotometric and biochemical analyses of DNA in pentaploid and diploid Agave species.

PubMed

Cavallini, A; Natali, L; Cionini, G; Castorena-Sanchez, I

1996-04-01

Nuclear DNA content, chromatin structure, and DNA composition were investigated in four Agave species: two diploid, Agave tequilana Weber and Agave angustifolia Haworth var. marginata Hort., and two pentaploid, Agave fourcroydes Lemaire and Agave sisalana Perrine. It was determined that the genome size of pentaploid species is nearly 2.5 times that of diploid ones. Cytophotometric analyses of chromatin structure were performed following Feulgen or DAPI staining to determine optical density profiles of interphase nuclei. Pentaploid species showed higher frequencies of condensed chromatin (heterochromatin) than diploid species. On the other hand, a lower frequency of A-T rich (DAPI stained) heterochromatin was found in pentaploid species than in diploid ones, indicating that heterochromatin in pentaploid species is made up of sequences with base compositions different from those of diploid species. Since thermal denaturation profiles of extracted DNA showed minor variations in the base composition of the genomes of the four species, it is supposed that, in pentaploid species, the large heterochromatin content is not due to an overrepresentation of G-C repetitive sequences but rather to the condensation of nonrepetitive sequences, such as, for example, redundant gene copies switched off in the polyploid complement. It is suggested that speciation in the genus Agave occurs through point mutations and minor DNA rearrangements, as is also indicated by the relative stability of the karyotype of this genus. Key words : Agave, DNA cytophotometry, DNA melting profiles, chromatin structure, genome size.
Nucleosome Positioning and Epigenetics

NASA Astrophysics Data System (ADS)

Schwab, David; Bruinsma, Robijn

2008-03-01

The role of chromatin structure in gene regulation has recently taken center stage in the field of epigenetics, phenomena that change the phenotype without changing the DNA sequence. Recent work has also shown that nucleosomes, a complex of DNA wrapped around a histone octamer, experience a sequence dependent energy landscape due to the variation in DNA bend stiffness with sequence composition. In this talk, we consider the role nucleosome positioning might play in the formation of heterochromatin, a compact form of DNA generically responsible for gene silencing. In particular, we discuss how different patterns of nucleosome positions, periodic or random, could either facilitate or suppress heterochromatin stability and formation.
Bioaerosol DNA Extraction Technique from Air Filters Collected from Marine and Freshwater Locations

NASA Astrophysics Data System (ADS)

Beckwith, M.; Crandall, S. G.; Barnes, A.; Paytan, A.

2015-12-01

Bioaerosols are composed of microorganisms suspended in air. Among these organisms include bacteria, fungi, virus, and protists. Microbes introduced into the atmosphere can drift, primarily by wind, into natural environments different from their point of origin. Although bioaerosols can impact atmospheric dynamics as well as the ecology and biogeochemistry of terrestrial systems, very little is known about the composition of bioaerosols collected from marine and freshwater environments. The first step to determine composition of airborne microbes is to successfully extract environmental DNA from air filters. We asked 1) can DNA be extracted from quartz (SiO2) air filters? and 2) how can we optimize the DNA yield for downstream metagenomic sequencing? Aerosol filters were collected and archived on a weekly basis from aquatic sites (USA, Bermuda, Israel) over the course of 10 years. We successfully extracted DNA from a subsample of ~ 20 filters. We modified a DNA extraction protocol (Qiagen) by adding a beadbeating step to mechanically shear cell walls in order to optimize our DNA product. We quantified our DNA yield using a spectrophotometer (Nanodrop 1000). Results indicate that DNA can indeed be extracted from quartz filters. The additional beadbeating step helped increase our yield - up to twice as much DNA product was obtained compared to when this step was omitted. Moreover, bioaerosol DNA content does vary across time. For instance, the DNA extracted from filters from Lake Tahoe, USA collected near the end of June decreased from 9.9 ng/μL in 2007 to 3.8 ng/μL in 2008. Further next-generation sequencing analysis of our extracted DNA will be performed to determine the composition of these microbes. We will also model the meteorological and chemical factors that are good predictors for microbial composition for our samples over time and space.
A PDDA/poly(2,6-pyridinedicarboxylic acid)-CNTs composite film DNA electrochemical sensor and its application for the detection of specific sequences related to PAT gene and NOS gene.

PubMed

Yang, Tao; Zhang, Wei; Du, Meng; Jiao, Kui

2008-05-30

2,6-Pyridinedicarboxylic acid (PDC) was electropolymerized on the glassy carbon electrode (GCE) surface combined with carboxylic group-functionalized single-walled carbon nanotubes (SWNTs) by cyclic voltammetry (CV) to form PDC-SWNTs composite film, which was rich in negatively charged carboxylic group. Then, poly(diallyldimethyl ammonium chloride) (PDDA), a linear cationic polyelectrolyte, was electrostatically adsorbed on the PDC-SWNTs/GCE surface. DNA probes with negatively charged phosphate group at the 5' end were immobilized on the PDDA/PDC-SWNTs/GCE due to the strong electrostatic attraction between PDDA and phosphate group of DNA. It has been found that modification of the electrode with PDC-SWNTs film has enhanced the effective electrode surface area and electron-transfer ability, in addition to providing negatively charged groups for the electrostatic assembly of cationic polyelectrolyte. PDDA plays a key role in the attachment of DNA probes to the PDC-SWNTs composite film and acts as a bridge to connect DNA with PDC-SWNTs film. The cathodic peak current of methylene blue (MB), an electroactive label, decreased obviously after the hybridization of DNA probe (ssDNA) with the complementary DNA (cDNA). This peak current change was used to monitor the recognition of the specific sequences related to PAT gene in the transgenic corn and the polymerase chain reaction (PCR) amplification of NOS gene from the sample of transgenic soybean with satisfactory results. Under optimal conditions, the dynamic detection range of the sensor to PAT gene target sequence was from 1.0x10(-11) to 1.0x10(-6) mol/L with the detection limit of 2.6x10(-12) mol/L.
An extended sequence specificity for UV-induced DNA damage.

PubMed

Chung, Long H; Murray, Vincent

2018-01-01

The sequence specificity of UV-induced DNA damage was determined with a higher precision and accuracy than previously reported. UV light induces two major damage adducts: cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). Employing capillary electrophoresis with laser-induced fluorescence and taking advantages of the distinct properties of the CPDs and 6-4PPs, we studied the sequence specificity of UV-induced DNA damage in a purified DNA sequence using two approaches: end-labelling and a polymerase stop/linear amplification assay. A mitochondrial DNA sequence that contained a random nucleotide composition was employed as the target DNA sequence. With previous methodology, the UV sequence specificity was determined at a dinucleotide or trinucleotide level; however, in this paper, we have extended the UV sequence specificity to a hexanucleotide level. With the end-labelling technique (for 6-4PPs), the consensus sequence was found to be 5'-GCTC*AC (where C* is the breakage site); while with the linear amplification procedure, it was 5'-TCTT*AC. With end-labelling, the dinucleotide frequency of occurrence was highest for 5'-TC*, 5'-TT* and 5'-CC*; whereas it was 5'-TT* for linear amplification. The influence of neighbouring nucleotides on the degree of UV-induced DNA damage was also examined. The core sequences consisted of pyrimidine nucleotides 5'-CTC* and 5'-CTT* while an A at position "1" and C at position "2" enhanced UV-induced DNA damage. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

PubMed

Bergman, C M; Kreitman, M

2001-08-01

Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
A computer aided thermodynamic approach for predicting the formation of Z-DNA in naturally occurring sequences

NASA Technical Reports Server (NTRS)

Ho, P. S.; Ellison, M. J.; Quigley, G. J.; Rich, A.

1986-01-01

The ease with which a particular DNA segment adopts the left-handed Z-conformation depends largely on the sequence and on the degree of negative supercoiling to which it is subjected. We describe a computer program (Z-hunt) that is designed to search long sequences of naturally occurring DNA and retrieve those nucleotide combinations of up to 24 bp in length which show a strong propensity for Z-DNA formation. Incorporated into Z-hunt is a statistical mechanical model based on empirically determined energetic parameters for the B to Z transition accumulated to date. The Z-forming potential of a sequence is assessed by ranking its behavior as a function of negative superhelicity relative to the behavior of similar sized randomly generated nucleotide sequences assembled from over 80,000 combinations. The program makes it possible to compare directly the Z-forming potential of sequences with different base compositions and different sequence lengths. Using Z-hunt, we have analyzed the DNA sequences of the bacteriophage phi X174, plasmid pBR322, the animal virus SV40 and the replicative form of the eukaryotic adenovirus-2. The results are compared with those previously obtained by others from experiments designed to locate Z-DNA forming regions in these sequences using probes which show specificity for the left-handed DNA conformation.
Quantitative high-throughput profiling of snake venom gland transcriptomes and proteomes (Ovophis okinavensis and Protobothrops flavoviridis)

PubMed Central

2013-01-01

Background Advances in DNA sequencing and proteomics have facilitated quantitative comparisons of snake venom composition. Most studies have employed one approach or the other. Here, both Illumina cDNA sequencing and LC/MS were used to compare the transcriptomes and proteomes of two pit vipers, Protobothrops flavoviridis and Ovophis okinavensis, which differ greatly in their biology. Results Sequencing of venom gland cDNA produced 104,830 transcripts. The Protobothrops transcriptome contained transcripts for 103 venom-related proteins, while the Ovophis transcriptome contained 95. In both, transcript abundances spanned six orders of magnitude. Mass spectrometry identified peptides from 100% of transcripts that occurred at higher than contaminant (e.g. human keratin) levels, including a number of proteins never before sequenced from snakes. These transcriptomes reveal fundamentally different envenomation strategies. Adult Protobothrops venom promotes hemorrhage, hypotension, incoagulable blood, and prey digestion, consistent with mammalian predation. Ovophis venom composition is less readily interpreted, owing to insufficient pharmacological data for venom serine and metalloproteases, which comprise more than 97.3% of Ovophis transcripts, but only 38.0% of Protobothrops transcripts. Ovophis venom apparently represents a hybrid strategy optimized for frogs and small mammals. Conclusions This study illustrates the power of cDNA sequencing combined with MS profiling. The former quantifies transcript composition, allowing detection of novel proteins, but cannot indicate which proteins are actually secreted, as does MS. We show, for the first time, that transcript and peptide abundances are correlated. This means that MS can be used for quantitative, non-invasive venom profiling, which will be beneficial for studies of endangered species. PMID:24224955
Sequence-dependent modelling of local DNA bending phenomena: curvature prediction and vibrational analysis.

PubMed

Vlahovicek, K; Munteanu, M G; Pongor, S

1999-01-01

Bending is a local conformational micropolymorphism of DNA in which the original B-DNA structure is only distorted but not extensively modified. Bending can be predicted by simple static geometry models as well as by a recently developed elastic model that incorporate sequence dependent anisotropic bendability (SDAB). The SDAB model qualitatively explains phenomena including affinity of protein binding, kinking, as well as sequence-dependent vibrational properties of DNA. The vibrational properties of DNA segments can be studied by finite element analysis of a model subjected to an initial bending moment. The frequency spectrum is obtained by applying Fourier analysis to the displacement values in the time domain. This analysis shows that the spectrum of the bending vibrations quite sensitively depends on the sequence, for example the spectrum of a curved sequence is characteristically different from the spectrum of straight sequence motifs of identical basepair composition. Curvature distributions are genome-specific, and pronounced differences are found between protein-coding and regulatory regions, respectively, that is, sites of extreme curvature and/or bendability are less frequent in protein-coding regions. A WWW server is set up for the prediction of curvature and generation of 3D models from DNA sequences (http:@www.icgeb.trieste.it/dna).
Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes

PubMed Central

Huang, Yongjie; Mrázek, Jan

2014-01-01

Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877
Factors That Affect Large Subunit Ribosomal DNA Amplicon Sequencing Studies of Fungal Communities: Classification Method, Primer Choice, and Error

PubMed Central

Porter, Teresita M.; Golding, G. Brian

2012-01-01

Nuclear large subunit ribosomal DNA is widely used in fungal phylogenetics and to an increasing extent also amplicon-based environmental sequencing. The relatively short reads produced by next-generation sequencing, however, makes primer choice and sequence error important variables for obtaining accurate taxonomic classifications. In this simulation study we tested the performance of three classification methods: 1) a similarity-based method (BLAST + Metagenomic Analyzer, MEGAN); 2) a composition-based method (Ribosomal Database Project naïve Bayesian classifier, NBC); and, 3) a phylogeny-based method (Statistical Assignment Package, SAP). We also tested the effects of sequence length, primer choice, and sequence error on classification accuracy and perceived community composition. Using a leave-one-out cross validation approach, results for classifications to the genus rank were as follows: BLAST + MEGAN had the lowest error rate and was particularly robust to sequence error; SAP accuracy was highest when long LSU query sequences were classified; and, NBC runs significantly faster than the other tested methods. All methods performed poorly with the shortest 50–100 bp sequences. Increasing simulated sequence error reduced classification accuracy. Community shifts were detected due to sequence error and primer selection even though there was no change in the underlying community composition. Short read datasets from individual primers, as well as pooled datasets, appear to only approximate the true community composition. We hope this work informs investigators of some of the factors that affect the quality and interpretation of their environmental gene surveys. PMID:22558215
Transcription blockage by homopurine DNA sequences: role of sequence composition and single-strand breaks

PubMed Central

Belotserkovskii, Boris P.; Neil, Alexander J.; Saleh, Syed Shayon; Shin, Jane Hae Soo; Mirkin, Sergei M.; Hanawalt, Philip C.

2013-01-01

The ability of DNA to adopt non-canonical structures can affect transcription and has broad implications for genome functioning. We have recently reported that guanine-rich (G-rich) homopurine-homopyrimidine sequences cause significant blockage of transcription in vitro in a strictly orientation-dependent manner: when the G-rich strand serves as the non-template strand [Belotserkovskii et al. (2010) Mechanisms and implications of transcription blockage by guanine-rich DNA sequences., Proc. Natl Acad. Sci. USA, 107, 12816–12821]. We have now systematically studied the effect of the sequence composition and single-stranded breaks on this blockage. Although substitution of guanine by any other base reduced the blockage, cytosine and thymine reduced the blockage more significantly than adenine substitutions, affirming the importance of both G-richness and the homopurine-homopyrimidine character of the sequence for this effect. A single-strand break in the non-template strand adjacent to the G-rich stretch dramatically increased the blockage. Breaks in the non-template strand result in much weaker blockage signals extending downstream from the break even in the absence of the G-rich stretch. Our combined data support the notion that transcription blockage at homopurine-homopyrimidine sequences is caused by R-loop formation. PMID:23275544

Estimation of a Killer Whale (Orcinus orca) Population’s Diet Using Sequencing Analysis of DNA from Feces

PubMed Central

Ford, Michael J.; Hempelmann, Jennifer; Hanson, M. Bradley; Ayres, Katherine L.; Baird, Robin W.; Emmons, Candice K.; Lundin, Jessica I.; Schorr, Gregory S.; Wasser, Samuel K.; Park, Linda K.

2016-01-01

Estimating diet composition is important for understanding interactions between predators and prey and thus illuminating ecosystem function. The diet of many species, however, is difficult to observe directly. Genetic analysis of fecal material collected in the field is therefore a useful tool for gaining insight into wild animal diets. In this study, we used high-throughput DNA sequencing to quantitatively estimate the diet composition of an endangered population of wild killer whales (Orcinus orca) in their summer range in the Salish Sea. We combined 175 fecal samples collected between May and September from five years between 2006 and 2011 into 13 sample groups. Two known DNA composition control groups were also created. Each group was sequenced at a ~330bp segment of the 16s gene in the mitochondrial genome using an Illumina MiSeq sequencing system. After several quality controls steps, 4,987,107 individual sequences were aligned to a custom sequence database containing 19 potential fish prey species and the most likely species of each fecal-derived sequence was determined. Based on these alignments, salmonids made up >98.6% of the total sequences and thus of the inferred diet. Of the six salmonid species, Chinook salmon made up 79.5% of the sequences, followed by coho salmon (15%). Over all years, a clear pattern emerged with Chinook salmon dominating the estimated diet early in the summer, and coho salmon contributing an average of >40% of the diet in late summer. Sockeye salmon appeared to be occasionally important, at >18% in some sample groups. Non-salmonids were rarely observed. Our results are consistent with earlier results based on surface prey remains, and confirm the importance of Chinook salmon in this population’s summer diet. PMID:26735849
Estimation of a Killer Whale (Orcinus orca) Population's Diet Using Sequencing Analysis of DNA from Feces.

PubMed

Ford, Michael J; Hempelmann, Jennifer; Hanson, M Bradley; Ayres, Katherine L; Baird, Robin W; Emmons, Candice K; Lundin, Jessica I; Schorr, Gregory S; Wasser, Samuel K; Park, Linda K

2016-01-01

Estimating diet composition is important for understanding interactions between predators and prey and thus illuminating ecosystem function. The diet of many species, however, is difficult to observe directly. Genetic analysis of fecal material collected in the field is therefore a useful tool for gaining insight into wild animal diets. In this study, we used high-throughput DNA sequencing to quantitatively estimate the diet composition of an endangered population of wild killer whales (Orcinus orca) in their summer range in the Salish Sea. We combined 175 fecal samples collected between May and September from five years between 2006 and 2011 into 13 sample groups. Two known DNA composition control groups were also created. Each group was sequenced at a ~330bp segment of the 16s gene in the mitochondrial genome using an Illumina MiSeq sequencing system. After several quality controls steps, 4,987,107 individual sequences were aligned to a custom sequence database containing 19 potential fish prey species and the most likely species of each fecal-derived sequence was determined. Based on these alignments, salmonids made up >98.6% of the total sequences and thus of the inferred diet. Of the six salmonid species, Chinook salmon made up 79.5% of the sequences, followed by coho salmon (15%). Over all years, a clear pattern emerged with Chinook salmon dominating the estimated diet early in the summer, and coho salmon contributing an average of >40% of the diet in late summer. Sockeye salmon appeared to be occasionally important, at >18% in some sample groups. Non-salmonids were rarely observed. Our results are consistent with earlier results based on surface prey remains, and confirm the importance of Chinook salmon in this population's summer diet.
Validation and application of quantitative PCR assays using host-specific Bacteroidales genetic markers for swine fecal pollution tracking.

PubMed

Fan, Lihua; Shuai, Jiangbing; Zeng, Ruoxue; Mo, Hongfei; Wang, Suhua; Zhang, Xiaofeng; He, Yongqiang

2017-12-01

Genome fragment enrichment (GFE) method was applied to identify host-specific bacterial genetic markers that differ among different fecal metagenomes. To enrich for swine-specific DNA fragments, swine fecal DNA composite (n = 34) was challenged against a DNA composite consisting of cow, human, goat, sheep, chicken, duck and goose fecal DNA extracts (n = 83). Bioinformatic analyses of 384 non-redundant swine enriched metagenomic sequences indicated a preponderance of Bacteroidales-like regions predicted to encode metabolism-associated, cellular processes and information storage and processing. After challenged against fecal DNA extracted from different animal sources, four sequences from the clone libraries targeting two Bacteroidales- (genes 1-38 and 3-53), a Clostridia- (gene 2-109) as well as a Bacilli-like sequence (gene 2-95), respectively, showed high specificity to swine feces based on PCR analysis. Host-specificity and host-sensitivity analysis confirmed that oligonucleotide primers and probes capable of annealing to select Bacteroidales-like sequences (1-38 and 3-53) exhibited high specificity (>90%) in quantitative PCR assays with 71 fecal DNAs from non-target animal sources. The two assays also demonstrated broad distributions of corresponding genetic markers (>94% positive) among 72 swine feces. After evaluation with environmental water samples from different areas, swine-targeted assays based on two Bacteroidales-like GFE sequences appear to be suitable quantitative tracing tools for swine fecal pollution. Copyright © 2017 Elsevier Ltd. All rights reserved.
Previously unknown and highly divergent ssDNA viruses populate the oceans.

PubMed

Labonté, Jessica M; Suttle, Curtis A

2013-11-01

Single-stranded DNA (ssDNA) viruses are economically important pathogens of plants and animals, and are widespread in oceans; yet, the diversity and evolutionary relationships among marine ssDNA viruses remain largely unknown. Here we present the results from a metagenomic study of composite samples from temperate (Saanich Inlet, 11 samples; Strait of Georgia, 85 samples) and subtropical (46 samples, Gulf of Mexico) seawater. Most sequences (84%) had no evident similarity to sequenced viruses. In total, 608 putative complete genomes of ssDNA viruses were assembled, almost doubling the number of ssDNA viral genomes in databases. These comprised 129 genetically distinct groups, each represented by at least one complete genome that had no recognizable similarity to each other or to other virus sequences. Given that the seven recognized families of ssDNA viruses have considerable sequence homology within them, this suggests that many of these genetic groups may represent new viral families. Moreover, nearly 70% of the sequences were similar to one of these genomes, indicating that most of the sequences could be assigned to a genetically distinct group. Most sequences fell within 11 well-defined gene groups, each sharing a common gene. Some of these encoded putative replication and coat proteins that had similarity to sequences from viruses infecting eukaryotes, suggesting that these were likely from viruses infecting eukaryotic phytoplankton and zooplankton.
Ultrasensitive determination of DNA sequences by flow injection chemiluminescence using silver ions as labels.

PubMed

Zheng, Lichun; Liu, Xiuhui; Zhou, Min; Ma, Yongjun; Wu, Guofan; Lu, Xiaoquan

2014-10-27

We presented a new strategy for ultrasensitive detection of DNA sequences based on the novel detection probe which was labeled with Ag(+) using metallothionein (MT) as a bridge. The assay relied on a sandwich-type DNA hybridization in which the DNA targets were first hybridized to the captured oligonucleotide probes immobilized on Fe3O4@Au composite magnetic nanoparticles (MNPs), and then the Ag(+)-modified detection probes were used to monitor the presence of the specific DNA targets. After being anchored on the hybrids, Ag(+) was released down through acidic treatment and sensitively determined by a coupling flow injection-chemiluminescent reaction system (Ag(+)-Mn(2+)-K2S2O8-H3PO4-luminol) (FI-CL). The experiment results showed that the CL intensities increased linearly with the concentrations of DNA targets in the range from 10 to 500 pmol L(-1) with a detection limit of 3.3 pmol L(-1). The high sensitivity in this work may be ascribed to the high molar ratio of Ag(+)-MT, the sensitive determination of Ag(+) by the coupling FI-CL reaction system and the perfect magnetic separation based on Fe3O4@Au composite MNPs. Moreover, the proposed strategy exhibited excellent selectivity against the mismatched DNA sequences and could be applied to real samples analysis. Copyright © 2014 Elsevier B.V. All rights reserved.
Molecular mechanisms of adaptation emerging from the physics and evolution of nucleic acids and proteins.

PubMed

Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N

2014-03-01

DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea.
Molecular mechanisms of adaptation emerging from the physics and evolution of nucleic acids and proteins

PubMed Central

Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N.

2014-01-01

DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea. PMID:24371267
Identification of DNA-binding proteins by combining auto-cross covariance transformation and ensemble learning.

PubMed

Liu, Bin; Wang, Shanyi; Dong, Qiwen; Li, Shumin; Liu, Xuan

2016-04-20

DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. With the rapid development of next generation of sequencing technique, the number of protein sequences is unprecedentedly increasing. Thus it is necessary to develop computational methods to identify the DNA-binding proteins only based on the protein sequence information. In this study, a novel method called iDNA-KACC is presented, which combines the Support Vector Machine (SVM) and the auto-cross covariance transformation. The protein sequences are first converted into profile-based protein representation, and then converted into a series of fixed-length vectors by the auto-cross covariance transformation with Kmer composition. The sequence order effect can be effectively captured by this scheme. These vectors are then fed into Support Vector Machine (SVM) to discriminate the DNA-binding proteins from the non DNA-binding ones. iDNA-KACC achieves an overall accuracy of 75.16% and Matthew correlation coefficient of 0.5 by a rigorous jackknife test. Its performance is further improved by employing an ensemble learning approach, and the improved predictor is called iDNA-KACC-EL. Experimental results on an independent dataset shows that iDNA-KACC-EL outperforms all the other state-of-the-art predictors, indicating that it would be a useful computational tool for DNA binding protein identification. .
[Influence of PCR cycle number on microbial diversity analysis through next generation sequencing].

PubMed

An, Yunhe; Gao, Lijuan; Li, Junbo; Tian, Yanjie; Wang, Jinlong; Zheng, Xuejuan; Wu, Huijuan

2016-08-25

Using of high throughput sequencing technology to study the microbial diversity in complex samples has become one of the hottest issues in the field of microbial diversity research. In this study, the soil and sheep rumen chyme samples were used to extract DNA, respectively. Then the 25 ng total DNA was used to amplify the 16S rRNA V3 region with 20, 25, 30 PCR cycles, and the final sequencing library was constructed by mixing equal amounts of purified PCR products. Finally, the operational taxonomic unit (OUT) amount, rarefaction curve, microbial number and species were compared through data analysis. It was found that at the same amount of DNA template, the proportion of the community composition was not the best with more numbers of PCR cycle, although the species number was much more. In all, when the PCR cycle number is 25, the number of species and proportion of the community composition were the most optimal both in soil or chyme samples.
On the Sequence-Directed Nature of Human Gene Mutation: The Role of Genomic Architecture and the Local DNA Sequence Environment in Mediating Gene Mutations Underlying Human Inherited Disease

PubMed Central

Cooper, David N.; Bacolla, Albino; Férec, Claude; Vasquez, Karen M.; Kehrer-Sawatzki, Hildegard; Chen, Jian-Min

2011-01-01

Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher-order features of the genomic architecture. The human genome is now recognized to contain ‘pervasive architectural flaws’ in that certain DNA sequences are inherently mutation-prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of non-canonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair, and may serve to increase mutation frequencies in generalized fashion (i.e. both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease. PMID:21853507
Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error

PubMed Central

Liu, Xiaoming; Fu, Yun-Xin; Maxwell, Taylor J.; Boerwinkle, Eric

2010-01-01

It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood of the observed SNP configurations to infer population mutation rate θ = 4Neμ, population exponential growth rate R, and error rate ɛ, simultaneously. Using simulation, we show the combined effects of the parameters, θ, n, ɛ, and R on the accuracy of parameter estimation. We compared our maximum composite likelihood estimator (MCLE) of θ with other θ estimators that take into account the error. The results show the MCLE performs well when the sample size is large or the error rate is high. Using parametric bootstrap, composite likelihood can also be used as a statistic for testing the model goodness-of-fit of the observed DNA sequences. The MCLE method is applied to sequence data on the ANGPTL4 gene in 1832 African American and 1045 European American individuals. PMID:19952140
The full mitochondrial genome sequence of Raillietina tetragona from chicken (Cestoda: Davaineidae).

PubMed

Liang, Jian-Ying; Lin, Rui-Qing

2016-11-01

In the present study, the complete mitochondrial DNA (mtDNA) sequence of Raillietina tetragona was sequenced and its gene contents and genome organizations was compared with that of other tapeworm. The complete mt genome sequence of R. tetragona is 14,444 bp in length. It contains 12 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and two non-coding region. All genes are transcribed in the same direction and have a nucleotide composition high in A and T. The contents of A + T of the complete mt genome are 71.4% for R. tetragona. The R. tetragona mt genome sequence provides novel mtDNA marker for studying the molecular epidemiology and population genetics of Raillietina and has implications for the molecular diagnosis of chicken cestodosis caused by Raillietina.
Species composition of the genus Saprolegnia in fin fish aquaculture environments, as determined by nucleotide sequence analysis of the nuclear rDNA ITS regions.

PubMed

de la Bastide, Paul Y; Leung, Wai Lam; Hintz, William E

2015-01-01

The ITS region of the rDNA gene was compared for Saprolegnia spp. in order to improve our understanding of nucleotide sequence variability within and between species of this genus, determine species composition in Canadian fin fish aquaculture facilities, and to assess the utility of ITS sequence variability in genetic marker development. From a collection of more than 400 field isolates, ITS region nucleotide sequences were studied and it was determined that there was sufficient consistent inter-specific variation to support the designation of species identity based on ITS sequence data. This non-subjective approach to species identification does not rely upon transient morphological features. Phylogenetic analyses comparing our ITS sequences and species designations with data from previous studies generally supported the clade scheme of Diéguez-Uribeondo et al. (2007) and found agreement with the molecular taxonomic cluster system of Sandoval-Sierra et al. (2014). Our Canadian ITS sequence collection will thus contribute to the public database and assist the clarification of Saprolegnia spp. taxonomy. The analysis of ITS region sequence variability facilitated genus- and species-level identification of unknown samples from aquaculture facilities and provided useful information on species composition. A unique ITS-RFLP for the identification of S. parasitica was also described. Copyright © 2014 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Compositional segmentation and complexity measurement in stock indices

NASA Astrophysics Data System (ADS)

Wang, Haifeng; Shang, Pengjian; Xia, Jianan

2016-01-01

In this paper, we introduce a complexity measure based on the entropic segmentation called sequence compositional complexity (SCC) into the analysis of financial time series. SCC was first used to deal directly with the complex heterogeneity in nonstationary DNA sequences. We already know that SCC was found to be higher in sequences with long-range correlation than those with low long-range correlation, especially in the DNA sequences. Now, we introduce this method into financial index data, subsequently, we find that the values of SCC of some mature stock indices, such as S & P 500 (simplified with S & P in the following) and HSI, are likely to be lower than the SCC value of Chinese index data (such as SSE). What is more, we find that, if we classify the indices with the method of SCC, the financial market of Hong Kong has more similarities with mature foreign markets than Chinese ones. So we believe that a good correspondence is found between the SCC of the index sequence and the complexity of the market involved.
Mitochondrial DNA control region sequences from Nairobi (Kenya): inferring phylogenetic parameters for the establishment of a forensic database.

PubMed

Brandstätter, Anita; Peterson, Christine T; Irwin, Jodi A; Mpoke, Solomon; Koech, Davy K; Parson, Walther; Parsons, Thomas J

2004-10-01

Large forensic mtDNA databases which adhere to strict guidelines for generation and maintenance, are not available for many populations outside of the United States and western Europe. We have established a high quality mtDNA control region sequence database for urban Nairobi as both a reference database for forensic investigations, and as a tool to examine the genetic variation of Kenyan sequences in the context of known African variation. The Nairobi sequences exhibited high variation and a low random match probability, indicating utility for forensic testing. Haplogroup identification and frequencies were compared with those reported from other published studies on African, or African-origin populations from Mozambique, Sierra Leone, and the United States, and suggest significant differences in the mtDNA compositions of the various populations. The quality of the sequence data in our study was investigated and supported using phylogenetic measures. Our data demonstrate the diversity and distinctiveness of African populations, and underline the importance of establishing additional forensic mtDNA databases of indigenous African populations.
Mapping the Space of Genomic Signatures

PubMed Central

Kari, Lila; Hill, Kathleen A.; Sayem, Abu S.; Karamichalis, Rallis; Bryans, Nathaniel; Davis, Katelyn; Dattani, Nikesh S.

2015-01-01

We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity between the corresponding sequences. The graphical representation of DNA sequences utilized, Chaos Game Representation (CGR), is genome- and species-specific and can thus act as a genomic signature. Consequently, Molecular Distance Maps could inform species identification, taxonomic classifications and, to a certain extent, evolutionary history. The image distance employed, Structural Dissimilarity Index (DSSIM), implicitly compares the occurrences of oligomers of length up to k (herein k = 9) in DNA sequences. We computed DSSIM distances for more than 5 million pairs of complete mitochondrial genomes, and used Multi-Dimensional Scaling (MDS) to obtain Molecular Distance Maps that visually display the sequence relatedness in various subsets, at different taxonomic levels. This general-purpose method does not require DNA sequence alignment and can thus be used to compare similar or vastly different DNA sequences, genomic or computer-generated, of the same or different lengths. We illustrate potential uses of this approach by applying it to several taxonomic subsets: phylum Vertebrata, (super)kingdom Protista, classes Amphibia-Insecta-Mammalia, class Amphibia, and order Primates. This analysis of an extensive dataset confirms that the oligomer composition of full mtDNA sequences can be a source of taxonomic information. This method also correctly finds the mtDNA sequences most closely related to that of the anatomically modern human (the Neanderthal, the Denisovan, and the chimp), and that the sequence most different from it in this dataset belongs to a cucumber. PMID:26000734
16S rRNA Gene Sequence Analysis of Drinking Water Using RNA and DNA Extracts as Targets for Clone Library Development

EPA Science Inventory

The bacterial composition of chlorinated drinking water was analyzed using 16S rRNA gene clone libraries derived from DNA extracts of 12 samples and compared to clone libraries previously generated using RNA extracts from the same samples. Phylogenetic analysis of 761 DNA-based ...
Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.

PubMed

Kohany, Oleksiy; Gentles, Andrew J; Hankus, Lukasz; Jurka, Jerzy

2006-10-25

Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at http://www.girinst.org/repbase/submission.html (RepbaseSubmitter) and http://www.girinst.org/censor/index.php (Censor).
Effects of nucleoside analog incorporation on DNA binding to the DNA binding domain of the GATA-1 erythroid transcription factor.

PubMed

Foti, M; Omichinski, J G; Stahl, S; Maloney, D; West, J; Schweitzer, B I

1999-02-05

We investigate here the effects of the incorporation of the nucleoside analogs araC (1-beta-D-arabinofuranosylcytosine) and ganciclovir (9-[(1,3-dihydroxy-2-propoxy)methyl] guanine) into the DNA binding recognition sequence for the GATA-1 erythroid transcription factor. A 10-fold decrease in binding affinity was observed for the ganciclovir-substituted DNA complex in comparison to an unmodified DNA of the same sequence composition. AraC substitution did not result in any changes in binding affinity. 1H-15N HSQC and NOESY NMR experiments revealed a number of chemical shift changes in both DNA and protein in the ganciclovir-modified DNA-protein complex when compared to the unmodified DNA-protein complex. These changes in chemical shift and binding affinity suggest a change in the binding mode of the complex when ganciclovir is incorporated into the GATA DNA binding site.
A Tandemly Arranged Pattern of Two 5S rDNA Arrays in Amolops mantzorum (Anura, Ranidae).

PubMed

Liu, Ting; Song, Menghuan; Xia, Yun; Zeng, Xiaomao

2017-01-01

In an attempt to extend the knowledge of the 5S rDNA organization in anurans, the 5S rDNA sequences of Amolops mantzorum were isolated, characterized, and mapped by FISH. Two forms of 5S rDNA, type I (209 bp) and type II (about 870 bp), were found in specimens investigated from various populations. Both of them contained a 118-bp coding sequence, readily differentiated by their non-transcribed spacer (NTS) sizes and compositions. Four probes (the 5S rDNA coding sequences, the type I NTS, the type II NTS, and the entire type II 5S rDNA sequences) were respectively labeled with TAMRA or digoxigenin to hybridize with mitotic chromosomes for samples of all localities. It turned out that all probes showed the same signals that appeared in every centromeric region and in the telomeric regions of chromosome 5, without differences within or between populations. Obviously, both type I and type II of the 5S rDNA arrays arranged in tandem, which was contrasting with other frogs or fishes recorded to date. More interestingly, all the probes detected centromeric regions in all karyotypes, suggesting the presence of a satellite DNA family derived from 5S rDNA. © 2017 S. Karger AG, Basel.

The chloroplast and mitochondrial genome sequences of the charophyte Chaetosphaeridium globosum: Insights into the timing of the events that restructured organelle DNAs within the green algal lineage that led to land plants

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2002-01-01

The land plants and their immediate green algal ancestors, the charophytes, form the Streptophyta. There is evidence that both the chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) underwent substantial changes in their architecture (intron insertions, gene losses, scrambling in gene order, and genome expansion in the case of mtDNA) during the evolution of streptophytes; however, because no charophyte organelle DNAs have been sequenced completely thus far, the suite of events that shaped streptophyte organelle genomes remains largely unknown. Here, we have determined the complete cpDNA (131,183 bp) and mtDNA (56,574 bp) sequences of the charophyte Chaetosphaeridium globosum (Coleochaetales). At the levels of gene content (124 genes), intron composition (18 introns), and gene order, Chaetosphaeridium cpDNA is remarkably similar to land-plant cpDNAs, implying that most of the features characteristic of land-plant lineages were gained during the evolution of charophytes. Although the gene content of Chaetosphaeridium mtDNA (67 genes) closely resembles that of the bryophyte Marchantia polymorpha (69 genes), this charophyte mtDNA differs substantially from its land-plant relatives at the levels of size, intron composition (11 introns), and gene order. Our finding that it shares only one intron with its land-plant counterparts supports the idea that the vast majority of mitochondrial introns in land plants appeared after the emergence of these organisms. Our results also suggest that the events accounting for the spacious intergenic spacers found in land-plant mtDNAs took place late during the evolution of charophytes or coincided with the transition from charophytes to land plants. PMID:12161560
The impact of different DNA extraction kits and laboratories upon the assessment of human gut microbiota composition by 16S rRNA gene sequencing.

PubMed

Kennedy, Nicholas A; Walker, Alan W; Berry, Susan H; Duncan, Sylvia H; Farquarson, Freda M; Louis, Petra; Thomson, John M; Satsangi, Jack; Flint, Harry J; Parkhill, Julian; Lees, Charlie W; Hold, Georgina L

2014-01-01

Determining bacterial community structure in fecal samples through DNA sequencing is an important facet of intestinal health research. The impact of different commercially available DNA extraction kits upon bacterial community structures has received relatively little attention. The aim of this study was to analyze bacterial communities in volunteer and inflammatory bowel disease (IBD) patient fecal samples extracted using widely used DNA extraction kits in established gastrointestinal research laboratories. Fecal samples from two healthy volunteers (H3 and H4) and two relapsing IBD patients (I1 and I2) were investigated. DNA extraction was undertaken using MoBio Powersoil and MP Biomedicals FastDNA SPIN Kit for Soil DNA extraction kits. PCR amplification for pyrosequencing of bacterial 16S rRNA genes was performed in both laboratories on all samples. Hierarchical clustering of sequencing data was done using the Yue and Clayton similarity coefficient. DNA extracted using the FastDNA kit and the MoBio kit gave median DNA concentrations of 475 (interquartile range 228-561) and 22 (IQR 9-36) ng/µL respectively (p<0.0001). Hierarchical clustering of sequence data by Yue and Clayton coefficient revealed four clusters. Samples from individuals H3 and I2 clustered by patient; however, samples from patient I1 extracted with the MoBio kit clustered with samples from patient H4 rather than the other I1 samples. Linear modelling on relative abundance of common bacterial families revealed significant differences between kits; samples extracted with MoBio Powersoil showed significantly increased Bacteroidaceae, Ruminococcaceae and Porphyromonadaceae, and lower Enterobacteriaceae, Lachnospiraceae, Clostridiaceae, and Erysipelotrichaceae (p<0.05). This study demonstrates significant differences in DNA yield and bacterial DNA composition when comparing DNA extracted from the same fecal sample with different extraction kits. This highlights the importance of ensuring that samples in a study are prepared with the same method, and the need for caution when cross-comparing studies that use different methods.
Informational structure of genetic sequences and nature of gene splicing

NASA Astrophysics Data System (ADS)

Trifonov, E. N.

1991-10-01

Only about 1/20 of DNA of higher organisms codes for proteins, by means of classical triplet code. The rest of DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence ``talks'' to various bimolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus, being a composite structure in which one had the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of the Gnomic, language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their ``ontogenetic'' way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications is to resolve such conflicts. New data are presented strongly indicating that the gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.
Biological nanopore MspA for DNA sequencing

NASA Astrophysics Data System (ADS)

Manrao, Elizabeth A.

Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.
Self-assembly of multiferroic core-shell particulate nanocomposites through DNA-DNA hybridization and magnetic field directed assembly of superstructures

NASA Astrophysics Data System (ADS)

Sreenivasulu, Gollapudi; Lochbiler, Thomas A.; Panda, Manashi; Srinivasan, Gopalan; Chavez, Ferman A.

2016-04-01

Multiferroic composites of ferromagnetic and ferroelectric phases are of importance for studies on mechanical strain mediated coupling between the magnetic and electric subsystems. This work is on DNA-assisted self-assembly of superstructures of such composites with nanometer periodicity. The synthesis involved oligomeric DNA-functionalized ferroelectric and ferromagnetic nanoparticles, 600 nm BaTiO3 (BTO) and 200 nm NiFe2O4 (NFO), respectively. Mixing BTO and NFO particles, possessing complementary DNA sequences, resulted in the formation of ordered core-shell heteronanocomposites held together by DNA hybridization. The composites were imaged by scanning electron microscopy and scanning microwave microscopy. The presence of heteroassemblies along with core-shell architecture is clearly observed. The reversible nature of the DNA hybridization allows for restructuring the composites into mm-long linear chains and 2D-arrays in the presence of a static magnetic field and ring-like structures in a rotating-magnetic field. Strong magneto-electric (ME) coupling in as-assembled composites is evident from static magnetic field H induced polarization and low-frequency magnetoelectric voltage coefficient measurements. Upon annealing the nanocomposites at high temperatures, evidence for the formation of bulk composites with excellent cross-coupling between the electric and magnetic subsystems is obtained by H-induced polarization and low-frequency ME voltage coefficient. The ME coupling strength in the self-assembled composites is measured to be much stronger than in bulk composites with randomly distributed NFO and BTO prepared by direct mixing and sintering.
Combined Use of 16S Ribosomal DNA and 16S rRNA To Study the Bacterial Community of Polychlorinated Biphenyl-Polluted Soil

PubMed Central

Nogales, Balbina; Moore, Edward R. B.; Llobet-Brossa, Enrique; Rossello-Mora, Ramon; Amann, Rudolf; Timmis, Kenneth N.

2001-01-01

The bacterial diversity assessed from clone libraries prepared from rRNA (two libraries) and ribosomal DNA (rDNA) (one library) from polychlorinated biphenyl (PCB)-polluted soil has been analyzed. A good correspondence of the community composition found in the two types of library was observed. Nearly 29% of the cloned sequences in the rDNA library were identical to sequences in the rRNA libraries. More than 60% of the total cloned sequence types analyzed were grouped in phylogenetic groups (a clone group with sequence similarity higher than 97% [98% for Burkholderia and Pseudomonas-type clones]) represented in both types of libraries. Some of those phylogenetic groups, mostly represented by a single (or pair) of cloned sequence type(s), were observed in only one of the types of library. An important difference between the libraries was the lack of clones representative of the Actinobacteria in the rDNA library. The PCB-polluted soil exhibited a high bacterial diversity which included representatives of two novel lineages. The apparent abundance of bacteria affiliated to the beta-subclass of the Proteobacteria, and to the genus Burkholderia in particular, was confirmed by fluorescence in situ hybridization analysis. The possible influence on apparent diversity of low template concentrations was assessed by dilution of the RNA template prior to amplification by reverse transcription-PCR. Although differences in the composition of the two rRNA libraries obtained from high and low RNA concentrations were observed, the main components of the bacterial community were represented in both libraries, and therefore their detection was not compromised by the lower concentrations of template used in this study. PMID:11282645
A fungal mock community control for amplicon sequencing experiments

USDA-ARS?s Scientific Manuscript database

The field of microbial ecology has been profoundly advanced by the ability to profile the composition of complex microbial communities by means of high throughput amplicon sequencing of marker genes amplified directly from environmental genomic DNA extracts. However, it has become increasingly clear...
Detection of Bacterial Pathogens from Broncho-Alveolar Lavage by Next-Generation Sequencing.

PubMed

Leo, Stefano; Gaïa, Nadia; Ruppé, Etienne; Emonet, Stephane; Girard, Myriam; Lazarevic, Vladimir; Schrenzel, Jacques

2017-09-20

The applications of whole-metagenome shotgun sequencing (WMGS) in routine clinical analysis are still limited. A combination of a DNA extraction procedure, sequencing, and bioinformatics tools is essential for the removal of human DNA and for improving bacterial species identification in a timely manner. We tackled these issues with a broncho-alveolar lavage (BAL) sample from an immunocompromised patient who had developed severe chronic pneumonia. We extracted DNA from the BAL sample with protocols based either on sequential lysis of human and bacterial cells or on the mechanical disruption of all cells. Metagenomic libraries were sequenced on Illumina HiSeq platforms. Microbial community composition was determined by k-mer analysis or by mapping to taxonomic markers. Results were compared to those obtained by conventional clinical culture and molecular methods. Compared to mechanical cell disruption, a sequential lysis protocol resulted in a significantly increased proportion of bacterial DNA over human DNA and higher sequence coverage of Mycobacterium abscessus , Corynebacterium jeikeium and Rothia dentocariosa , the bacteria reported by clinical microbiology tests. In addition, we identified anaerobic bacteria not searched for by the clinical laboratory. Our results further support the implementation of WMGS in clinical routine diagnosis for bacterial identification.
DNA barcoding reveals seasonal shifts in diet and consumption of deep-sea fishes in wedge-tailed shearwaters

PubMed Central

Ando, Haruko; Horikoshi, Kazuo; Suzuki, Hajime; Isagi, Yuji

2018-01-01

The foraging ecology of pelagic seabirds is difficult to characterize because of their large foraging areas. In the face of this difficulty, DNA metabarcoding may be a useful approach to analyze diet compositions and foraging behaviors. Using this approach, we investigated the diet composition and its seasonal variation of a common seabird species on the Ogasawara Islands, Japan: the wedge-tailed shearwater Ardenna pacifica. We collected fecal samples during the prebreeding (N = 73) and rearing (N = 96) periods. The diet composition of wedge-tailed shearwater was analyzed by Ion Torrent sequencing using two universal polymerase chain reaction primers for the 12S and 16S mitochondrial DNA regions that targeted vertebrates and mollusks, respectively. The results of a BLAST search of obtained sequences detected 31 and 1 vertebrate and mollusk taxa, respectively. The results of the diet composition analysis showed that wedge-tailed shearwaters frequently consumed deep-sea fishes throughout the sampling season, indicating the importance of these fishes as a stable food resource. However, there was a marked seasonal shift in diet, which may reflect seasonal changes in food resource availability and wedge-tailed shearwater foraging behavior. The collected data regarding the shearwater diet may be useful for in situ conservation efforts. Future research that combines DNA metabarcoding with other tools, such as data logging, may provide further insight into the foraging ecology of pelagic seabirds. PMID:29630670
Estimation of the Relative Abundance of Different Bacteroides and Prevotella Ribotypes in Gut Samples by Restriction Enzyme Profiling of PCR-Amplified 16S rRNA Gene Sequences

PubMed Central

Wood, Jacqueline; Scott, Karen P.; Avguštin, Gorazd; Newbold, C. James; Flint, Harry J.

1998-01-01

We describe an approach for determining the genetic composition of Bacteroides and Prevotella populations in gut contents based on selective amplification of 16S rRNA gene sequences (rDNA) followed by cleavage of the amplified material with restriction enzymes. The relative contributions of different ribotypes to total Bacteroides and Prevotella 16S rDNA are estimated after end labelling of one of the PCR primers, and the contribution of Bacteroides and Prevotella sequences to total eubacterial 16S rDNA is estimated by measuring the binding of oligonucleotide probes to amplified DNA. Bacteroides and Prevotella 16S rDNA accounted for between 12 and 62% of total eubacterial 16S rDNA in samples of ruminal contents from six sheep and a cow. Ribotypes 4, 5, 6, and 7, which include most cultivated rumen Prevotella strains, together accounted for between 20 and 86% of the total amplified Bacteroides and Prevotella rDNA in these samples. The most abundant Bacteroides or Prevotella ribotype in four animals, however, was ribotype 8, for which there is only one known cultured isolate, while ribotypes 1 and 2, which include many colonic Bacteroides spp., were the most abundant in two animals. This indicates that some abundant Bacteroides and Prevotella groups in the rumen are underrepresented among cultured rumen Prevotella isolates. The approach described here provides a rapid, convenient, and widely applicable method for comparing the genotypic composition of bacterial populations in gut samples. PMID:9758785
Effect of preservation method on spider monkey (Ateles geoffroyi) fecal microbiota over 8 weeks.

PubMed

Hale, Vanessa L; Tan, Chia L; Knight, Rob; Amato, Katherine R

2015-06-01

Studies of the gut microbiome have become increasingly common with recent technological advances. Gut microbes play an important role in human and animal health, and gut microbiome analysis holds great potential for evaluating health in wildlife, as microbiota can be assessed from non-invasively collected fecal samples. However, many common fecal preservation protocols (e.g. freezing at -80 °C) are not suitable for field conditions, or have not been tested for long-term (greater than 2 weeks) storage. In this study, we collected fresh fecal samples from captive spider monkeys (Ateles geoffroyi) at the Columbian Park Zoo (Lafayette, IN, USA). The samples were pooled, homogenized, and preserved for up to 8 weeks prior to DNA extraction and sequencing. Preservation methods included: freezing at -20 °C, freezing at -80 °C, immersion in 100% ethanol, application to FTA cards, and immersion in RNAlater. At 0 (fresh), 1, 2, 4, and 8 weeks from fecal collection, DNA was extracted and microbial DNA was amplified and sequenced. DNA concentration, purity, microbial diversity, and microbial composition were compared across all methods and time points. DNA concentration and purity did not correlate with microbial diversity or composition. Microbial composition of frozen and ethanol samples were most similar to fresh samples. FTA card and RNAlater-preserved samples had the least similar microbial composition and abundance compared to fresh samples. Microbial composition and diversity were relatively stable over time within each preservation method. Based on these results, if freezers are not available, we recommend preserving fecal samples in ethanol (for up to 8weeks) prior to microbial extraction and analysis. Copyright © 2015 Elsevier B.V. All rights reserved.
Molecular analysis of microbiota along the digestive tract of juvenile Atlantic salmon (Salmo salar L.).

PubMed

Navarrete, P; Espejo, R T; Romero, J

2009-04-01

Dominant bacterial microbiota of the gut of juvenile farmed Atlantic salmon was investigated using a combination of molecular approaches. Bacterial community composition from the stomach, the pyloric caeca, and the intestine was assessed by extracting DNA directly from each gut compartment. Temporal temperature gradient gel electrophoresis (TTGE) analysis of 16S ribosomal DNA (rDNA) amplicons showed very similar bacterial compositions throughout the digestive tract. Band sequencing revealed a narrow diversity of species with a dominance of Pseudomonas in the three compartments. However, cloning revealed more diversity among the Pseudomonas sequences. To confirm these results, we analyzed the bacterial community by amplifying the variable 16S-23S rDNA intergenic spacer region (ITS). Similar ITS profiles were observed among gastrointestinal compartments of salmon, confirming the TTGE results. Moreover, the dominant ITS band at 650 bp, identified as Pseudomonas, was observed in the ITS profile from fish collected in two seasons (July 2003 and 2004). In contrast, aerobic culture analysis revealed Shewanella spp. as the most prevalent isolate. This discrepancy was resolved by evaluating 16S rDNA and ITS polymerase chain reaction amplification efficiency from both Shewanella and Pseudomonas isolates. Very similar efficiencies were observed in the two bacteria. Hence, this discrepancy may be explained by preferential cultivation of Shewanella spp. under the experimental conditions. Also, we included analyses of pelleted feed and the water influent to explore environmental influences on the bacterial composition of the gut microbiota. Overall, these results indicate a homogeneous composition of the bacterial community composition along the gastrointestinal tract of reared juvenile salmon. This community is mainly composed of Pseudomonas spp., which could be derived from water influent and may be selectively associated with salmon in this hatchery.
Estimating Diversity of Florida Keys Zooplankton Using New Environmental DNA Methods

NASA Astrophysics Data System (ADS)

Djurhuus, A.; Goldsmith, D. B.; Sawaya, N. A.; Breitbart, M.

2016-02-01

Zooplankton are of great importance in marine food webs, where they serve to link the phytoplankton and bacteria with higher trophic levels. Zooplankton are a diverse group containing molluscs, crustaceans, fish larvae and many other taxa. The sheer number of species and often minor morphological distinctions between species makes it challenging and exceptionally time consuming to identify the species composition of marine zooplankton samples. As a part of the Marine Biodiversity Observation Network (MBON) project, we have developed and groundtruthed an alternative, relatively time-efficient method for zooplankton identification using environmental DNA (eDNA). Samples were collected from Molasses reef, Looe Key, and Western Sambo along the Florida Keys from five bi-monthly cruises on board the RV Walton Smith. Samples were collected for environmental DNA (eDNA) by filtering 1 L of water on to a 0.22 µm filter and zooplankton samples were collected using nets with three mesh sizes (64μm, 200μm, and 500μm) to catch different size fractions. Half of zooplankton samples were fixed in 70% ethanol and half in 10% formalin, for DNA extraction and morphological identification, respectively. Individuals representing visually abundant taxa were picked into individual wells for PCR with universal 18S rRNA gene primers and subsequent sequencing to build a reference barcode database for zooplankton species commonly found in the study region. PCR and Illumina MiSeq next generation sequencing was applied to the eDNA extracted from the 0.22 μm filters and sequences were be compared to our local custom database as well as publicly available databases to determine zooplankton community composition. Finally, composition and diversity analyses were performed to compare results obtained with the new eDNA approach to standard morphological classification of zooplankton communities. Results show that the eDNA approach can enable the determination of zooplankton diversity through collection of a single water sample, which, when combined with bacterial and archaeal diversity analyses, will help us understand the coupling between different trophic levels and the drivers of plankton dynamics in the sub-tropical Florida Keys.
The Organization of Repetitive DNA in the Genomes of Amazonian Lizard Species in the Family Teiidae.

PubMed

Carvalho, Natalia D M; Pinheiro, Vanessa S S; Carmo, Edson J; Goll, Leonardo G; Schneider, Carlos H; Gross, Maria C

2015-01-01

Repetitive DNA is the largest fraction of the eukaryote genome and comprises tandem and dispersed sequences. It presents variations in relation to its composition, number of copies, distribution, dynamics, and genome organization, and participates in the evolutionary diversification of different vertebrate species. Repetitive sequences are usually located in the heterochromatin of centromeric and telomeric regions of chromosomes, contributing to chromosomal structures. Therefore, the aim of this study was to physically map repetitive DNA sequences (5S rDNA, telomeric sequences, tropomyosin gene 1, and retroelements Rex1 and SINE) of mitotic chromosomes of Amazonian species of teiids (Ameiva ameiva, Cnemidophorus sp. 1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin) to understand their genome organization and karyotype evolution. The mapping of repetitive sequences revealed a distinct pattern in Cnemidophorus sp. 1, whereas the other species showed all sequences interspersed in the heterochromatic region. Physical mapping of the tropomyosin 1 gene was performed for the first time in lizards and showed that in addition to being functional, this gene has a structural function similar to the mapped repetitive elements as it is located preferentially in centromeric regions and termini of chromosomes. © 2016 S. Karger AG, Basel.
DNA sequences and composition from 12 BAC clones-derived MUSB SSR markers mapped to cotton (Gossypium Hirsutum L. x G. Barbadense L.)chromosomes 11 and 21

USDA-ARS?s Scientific Manuscript database

To discover resistance (R) and/or pathogen-induced (PR) genes involved in disease response, 12 bacterial artificial chromosome (BAC) clones from cv. Acala Maxxa (G. hirsutum) were sequenced at the Clemson University, Genomics Institute, Clemson, SC. These BACs derived MUSB single sequence repeat (SS...
Design of stapled DNA-minor-groove-binding molecules with a mutable atom simulated annealing method

NASA Astrophysics Data System (ADS)

Walker, Wynn L.; Kopka, Mary L.; Dickerson, Richard E.; Goodsell, David S.

1997-11-01

We report the design of optimal linker geometries for the synthesis of stapledDNA-minor-groove-binding molecules. Netropsin, distamycin, and lexitropsinsbind side-by-side to mixed-sequence DNA and offer an opportunity for thedesign of sequence-reading molecules. Stapled molecules, with two moleculescovalently linked side-by-side, provide entropic gains and restrain theposition of one molecule relative to its neighbor. Using a free-atom simulatedannealing technique combined with a discrete mutable atom definition, optimallengths and atomic composition for covalent linkages are determined, and anovel hydrogen bond `zipper' is proposed to phase two molecules accuratelyside-by-side.
Plant genotyping using fluorescently tagged inter-simple sequence repeats (ISSRs): basic principles and methodology.

PubMed

Prince, Linda M

2015-01-01

Inter-simple sequence repeat PCR (ISSR-PCR) is a fast, inexpensive genotyping technique based on length variation in the regions between microsatellites. The method requires no species-specific prior knowledge of microsatellite location or composition. Very small amounts of DNA are required, making this method ideal for organisms of conservation concern, or where the quantity of DNA is extremely limited due to organism size. ISSR-PCR can be highly reproducible but requires careful attention to detail. Optimization of DNA extraction, fragment amplification, and normalization of fragment peak heights during fluorescent detection are critical steps to minimizing the downstream time spent verifying and scoring the data.
The punctilious RNA polymerase II core promoter

PubMed Central

Vo ngoc, Long; Wang, Yuan-Liang; Kassavetis, George A.; Kadonaga, James T.

2017-01-01

The signals that direct the initiation of transcription ultimately converge at the core promoter, which is the gateway to transcription. Here we provide an overview of the RNA polymerase II core promoter in bilateria (bilaterally symmetric animals). The core promoter is diverse in terms of its composition and function yet is also punctilious, as it acts with strict rules and precision. We additionally describe an expanded view of the core promoter that comprises the classical DNA sequence motifs, sequence-specific DNA-binding transcription factors, chromatin signals, and DNA structure. This model may eventually lead to a more unified conceptual understanding of the core promoter. PMID:28808065
Self-assembly of multiferroic core-shell particulate nanocomposites through DNA-DNA hybridization and magnetic field directed assembly of superstructures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sreenivasulu, Gollapudi; Srinivasan, Gopalan, E-mail: srinivas@oakland.edu, E-mail: chavez@oakland.edu; Lochbiler, Thomas A.

Multiferroic composites of ferromagnetic and ferroelectric phases are of importance for studies on mechanical strain mediated coupling between the magnetic and electric subsystems. This work is on DNA-assisted self-assembly of superstructures of such composites with nanometer periodicity. The synthesis involved oligomeric DNA-functionalized ferroelectric and ferromagnetic nanoparticles, 600 nm BaTiO{sub 3} (BTO) and 200 nm NiFe{sub 2}O{sub 4} (NFO), respectively. Mixing BTO and NFO particles, possessing complementary DNA sequences, resulted in the formation of ordered core-shell heteronanocomposites held together by DNA hybridization. The composites were imaged by scanning electron microscopy and scanning microwave microscopy. The presence of heteroassemblies along with core-shellmore » architecture is clearly observed. The reversible nature of the DNA hybridization allows for restructuring the composites into mm-long linear chains and 2D-arrays in the presence of a static magnetic field and ring-like structures in a rotating-magnetic field. Strong magneto-electric (ME) coupling in as-assembled composites is evident from static magnetic field H induced polarization and low-frequency magnetoelectric voltage coefficient measurements. Upon annealing the nanocomposites at high temperatures, evidence for the formation of bulk composites with excellent cross-coupling between the electric and magnetic subsystems is obtained by H-induced polarization and low-frequency ME voltage coefficient. The ME coupling strength in the self-assembled composites is measured to be much stronger than in bulk composites with randomly distributed NFO and BTO prepared by direct mixing and sintering.« less
Identifying active foraminifera in the Sea of Japan using metatranscriptomic approach

NASA Astrophysics Data System (ADS)

Lejzerowicz, Franck; Voltsky, Ivan; Pawlowski, Jan

2013-02-01

Metagenetics represents an efficient and rapid tool to describe environmental diversity patterns of microbial eukaryotes based on ribosomal DNA sequences. However, the results of metagenetic studies are often biased by the presence of extracellular DNA molecules that are persistent in the environment, especially in deep-sea sediment. As an alternative, short-lived RNA molecules constitute a good proxy for the detection of active species. Here, we used a metatranscriptomic approach based on RNA-derived (cDNA) sequences to study the diversity of the deep-sea benthic foraminifera and compared it to the metagenetic approach. We analyzed 257 ribosomal DNA and cDNA sequences obtained from seven sediments samples collected in the Sea of Japan at depths ranging from 486 to 3665 m. The DNA and RNA-based approaches gave a similar view of the taxonomic composition of foraminiferal assemblage, but differed in some important points. First, the cDNA dataset was dominated by sequences of rotaliids and robertiniids, suggesting that these calcareous species, some of which have been observed in Rose Bengal stained samples, are the most active component of foraminiferal community. Second, the richness of monothalamous (single-chambered) foraminifera was particularly high in DNA extracts from the deepest samples, confirming that this group of foraminifera is abundant but not necessarily very active in the deep-sea sediments. Finally, the high divergence of undetermined sequences in cDNA dataset indicate the limits of our database and lack of knowledge about some active but possibly rare species. Our study demonstrates the capability of the metatranscriptomic approach to detect active foraminiferal species and prompt its use in future high-throughput sequencing-based environmental surveys.

A comparative study of ancient environmental DNA to pollen and macrofossils from lake sediments reveals taxonomic overlap and additional plant taxa

NASA Astrophysics Data System (ADS)

Pedersen, Mikkel Winther; Ginolhac, Aurélien; Orlando, Ludovic; Olsen, Jesper; Andersen, Kenneth; Holm, Jakob; Funder, Svend; Willerslev, Eske; Kjær, Kurt H.

2013-09-01

We use 2nd generation sequencing technology on sedimentary ancient DNA (sedaDNA) from a lake in South Greenland to reconstruct the local floristic history around a low-arctic lake and compare the results with those previously obtained from pollen and macrofossils in the same lake. Thirty-eight of thirty-nine samples from the core yielded putative DNA sequences. Using a multiple assignment strategy on the trnL g-h DNA barcode, consisting of two different phylogenetic and one sequence similarity assignment approaches, thirteen families of plants were identified, of which two (Scrophulariaceae and Asparagaceae) are absent from the pollen and macrofossil records. An age model for the sediment based on twelve radiocarbon dates establishes a chronology and shows that the lake record dates back to 10,650 cal yr BP. Our results suggest that sedaDNA analysis from lake sediments, although taxonomically less detailed than pollen and macrofossil analyses can be a complementary tool for establishing the composition of both terrestrial and aquatic local plant communities and a method for identifying additional taxa.
Divergence, differential methylation and interspersion of melon satellite DNA sequences.

PubMed Central

Shmookler Reis, R; Timmis, J N; Ingle, J

1981-01-01

Melon (Cucumis melo) satellite DNA consists of two components, Q and S, each with a buoyant density in CsCl of 1.707 g/ml, but differing by 9 degrees C in "melting" temperature. These physical properties appear to be in contradiction, since both depend on G + C content. In order to resolve this anomaly, base compositions were directly determined for isolated fractions. the low-"melting" component S contains 41.8% G + C, with 6% of C present as 5-methylcytosine, whereas Q DNA contains 54% G + C, with 41% of C methylated. Analyses of restriction site loss agreed well with the direct determinations of methylation and divergence, and indicated some clustering of methylated sites in Q DNA. Analysis of restricted main-band DNA by hydridization with RNA complementary to Q satellite DNA ("Southern transfer") showed satellite Q tandem arrays interspersed in DNA of main-band density. Sequence divergence and extent of methylation did not appear to depend on whether a repeat array was present as satellite or interspersed in main-band DNA. Hydridization in situ indicated considerable heterogeneity in the genomic proportion of the Q-DNA sequences in melon fruit nuclei, implying over- and under-representation consistent with extensive unequal recombination in satellite Q tandem arrays. The cucumber, Cucumis sativus, contains less than 8% as much Q-homologous DNA per genome as the melon, suggesting rapid evolutionary gain or loss of these tandem repeat sequences. Images Fig. 2. PLATE 1 Fig. 4. Fig. 10. PMID:6172117
Impact of cultivation on characterisation of species composition of soil bacterial communities.

PubMed

McCaig, A E.; Grayston, S J.; Prosser, J I.; Glover, L A.

2001-03-01

The species composition of culturable bacteria in Scottish grassland soils was investigated using a combination of Biolog and 16S rDNA analysis for characterisation of isolates. The inclusion of a molecular approach allowed direct comparison of sequences from culturable bacteria with sequences obtained during analysis of DNA extracted directly from the same soil samples. Bacterial strains were isolated on Pseudomonas isolation agar (PIA), a selective medium, and on tryptone soya agar (TSA), a general laboratory medium. In total, 12 and 21 morphologically different bacterial cultures were isolated on PIA and TSA, respectively. Biolog and sequencing placed PIA isolates in the same taxonomic groups, the majority of cultures belonging to the Pseudomonas (sensu stricto) group. However, analysis of 16S rDNA sequences proved more efficient than Biolog for characterising TSA isolates due to limitations of the Microlog database for identifying environmental bacteria. In general, 16S rDNA sequences from TSA isolates showed high similarities to cultured species represented in sequence databases, although TSA-8 showed only 92.5% similarity to the nearest relative, Bacillus insolitus. In general, there was very little overlap between the culturable and uncultured bacterial communities, although two sequences, PIA-2 and TSA-13, showed >99% similarity to soil clones. A cloning step was included prior to sequence analysis of two isolates, TSA-5 and TSA-14, and analysis of several clones confirmed that these cultures comprised at least four and three sequence types, respectively. All isolate clones were most closely related to uncultured bacteria, with clone TSA-5.1 showing 99.8% similarity to a sequence amplified directly from the same soil sample. Interestingly, one clone, TSA-5.4, clustered within a novel group comprising only uncultured sequences. This group, which is associated with the novel, deep-branching Acidobacterium capsulatum lineage, also included clones isolated during direct analysis of the same soil and from a wide range of other sample types studied elsewhere. The study demonstrates the value of fine-scale molecular analysis for identification of laboratory isolates and indicates the culturability of approximately 1% of the total population but under a restricted range of media and cultivation conditions.
Evaluation of carbon nanotube based copper nanoparticle composite for the efficient detection of agroviruses

USDA-ARS?s Scientific Manuscript database

Nanomaterials based sensors offer sensitivity and selectivity for the detection of a specific analyte-of-the-interest. Described here is a novel assay for the detection of a DNA sequence based on nanostructured carbon nanotubes/copper nanoparticles composite. This assay was modeled on strong electro...
Identification and characterization of large DNA deletions affecting oil quality traits in soybean seeds through transcriptome sequencing analysis

USDA-ARS?s Scientific Manuscript database

Understanding the molecular and genetic mechanisms underlying variation in seed composition and contents among different genotypes is important for soybean oil quality improvement. We designed a bioinformatics approach to compare seed transcriptomes of 9 soybean genotypes varying in oil composition ...
PseKNC: a flexible web server for generating pseudo K-tuple nucleotide composition.

PubMed

Chen, Wei; Lei, Tian-Yu; Jin, Dian-Chuan; Lin, Hao; Chou, Kuo-Chen

2014-07-01

The pseudo oligonucleotide composition, or pseudo K-tuple nucleotide composition (PseKNC), can be used to represent a DNA or RNA sequence with a discrete model or vector yet still keep considerable sequence order information, particularly the global or long-range sequence order information, via the physicochemical properties of its constituent oligonucleotides. Therefore, the PseKNC approach may hold very high potential for enhancing the power in dealing with many problems in computational genomics and genome sequence analysis. However, dealing with different DNA or RNA problems may need different kinds of PseKNC. Here, we present a flexible and user-friendly web server for PseKNC (at http://lin.uestc.edu.cn/pseknc/default.aspx) by which users can easily generate many different modes of PseKNC according to their need by selecting various parameters and physicochemical properties. Furthermore, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the current web server to generate their desired PseKNC without the need to follow the complicated mathematical equations, which are presented in this article just for the integrity of PseKNC formulation and its development. It is anticipated that the PseKNC web server will become a very useful tool in computational genomics and genome sequence analysis. Copyright © 2014 Elsevier Inc. All rights reserved.
Detection of Different DNA Animal Species in Commercial Candy Products.

PubMed

Muñoz-Colmenero, Marta; Martínez, Jose Luis; Roca, Agustín; Garcia-Vazquez, Eva

2016-03-01

Candy products are consumed all across the world, but there is not much information about their composition. In this study we have used a DNA-based approach for determining the animal species occurring in 40 commercial candies of different types. We extracted DNA and performed PCR amplification, cloning and sequencing for obtaining species-informative DNA sequences. Eight species were identified including fish (hake and anchovy) in 22% of the products analyzed. Bovine and porcine were the most abundant appearing in 27 samples each one. Most products contained a mixture of species. Marshmallows (7), jelly-types, and gummies (20) contained a significantly higher number of species than hard candies (9). We demonstrated the presence of DNA animal species in candy product which allow consumers to make choices and prevent allergic reaction. © 2016 Institute of Food Technologists®
Stretching chimeric DNA: A test for the putative S-form

NASA Astrophysics Data System (ADS)

Whitelam, Stephen; Pronk, Sander; Geissler, Phillip L.

2008-11-01

Double-stranded DNA "overstretches" at a pulling force of about 65 pN, increasing in length by a factor of 1.7. The nature of the overstretched state is unknown, despite its considerable importance for DNA's biological function and technological application. Overstretching is thought by some to be a force-induced denaturation and by others to consist of a transition to an elongated, hybridized state called S-DNA. Within a statistical mechanical model, we consider the effect upon overstretching of extreme sequence heterogeneity. "Chimeric" sequences possessing halves of markedly different AT composition elongate under fixed external conditions via distinct, spatially segregated transitions. The corresponding force-extension data vary with pulling rate in a manner that depends qualitatively and strikingly upon whether the hybridized S-form is accessible. This observation implies a test for S-DNA that could be performed in experiment.
Context based computational analysis and characterization of ARS consensus sequences (ACS) of Saccharomyces cerevisiae genome.

PubMed

Singh, Vinod Kumar; Krishnamachari, Annangarachari

2016-09-01

Genome-wide experimental studies in Saccharomyces cerevisiae reveal that autonomous replicating sequence (ARS) requires an essential consensus sequence (ACS) for replication activity. Computational studies identified thousands of ACS like patterns in the genome. However, only a few hundreds of these sites act as replicating sites and the rest are considered as dormant or evolving sites. In a bid to understand the sequence makeup of replication sites, a content and context-based analysis was performed on a set of replicating ACS sequences that binds to origin-recognition complex (ORC) denoted as ORC-ACS and non-replicating ACS sequences (nrACS), that are not bound by ORC. In this study, DNA properties such as base composition, correlation, sequence dependent thermodynamic and DNA structural profiles, and their positions have been considered for characterizing ORC-ACS and nrACS. Analysis reveals that ORC-ACS depict marked differences in nucleotide composition and context features in its vicinity compared to nrACS. Interestingly, an A-rich motif was also discovered in ORC-ACS sequences within its nucleosome-free region. Profound changes in the conformational features, such as DNA helical twist, inclination angle and stacking energy between ORC-ACS and nrACS were observed. Distribution of ACS motifs in the non-coding segments points to the locations of ORC-ACS which are found far away from the adjacent gene start position compared to nrACS thereby enabling an accessible environment for ORC-proteins. Our attempt is novel in considering the contextual view of ACS and its flanking region along with nucleosome positioning in the S. cerevisiae genome and may be useful for any computational prediction scheme.
Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes.

PubMed

Oyola, Samuel O; Otto, Thomas D; Gu, Yong; Maslen, Gareth; Manske, Magnus; Campino, Susana; Turner, Daniel J; Macinnis, Bronwyn; Kwiatkowski, Dominic P; Swerdlow, Harold P; Quail, Michael A

2012-01-03

Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences. We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates. We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material.
Isolation of a cDNA Encoding a Granule-Bound 152-Kilodalton Starch-Branching Enzyme in Wheat1

PubMed Central

Båga, Monica; Nair, Ramesh B.; Repellin, Anne; Scoles, Graham J.; Chibbar, Ravindra N.

2000-01-01

Screening of a wheat (Triticum aestivum) cDNA library for starch-branching enzyme I (SBEI) genes combined with 5′-rapid amplification of cDNA ends resulted in isolation of a 4,563-bp composite cDNA, Sbe1c. Based on sequence alignment to characterized SBEI cDNA clones isolated from plants, the SBEIc predicted from the cDNA sequence was produced with a transit peptide directing the polypeptide into plastids. Furthermore, the predicted mature form of SBEIc was much larger (152 kD) than previously characterized plant SBEI (80–100 kD) and contained a partial duplication of SBEI sequences. The first SBEI domain showed high amino acid similarity to a 74-kD wheat SBEI-like protein that is inactive as a branching enzyme when expressed in Escherichia coli. The second SBEI domain on SBEIc was identical in sequence to a functional 87-kD SBEI produced in the wheat endosperm. Immunoblot analysis of proteins produced in developing wheat kernels demonstrated that the 152-kD SBEIc was, in contrast to the 87- to 88-kD SBEI, preferentially associated with the starch granules. Proteins similar in size and recognized by wheat SBEI antibodies were also present in Triticum monococcum, Triticum tauschii, and Triticum turgidum subsp. durum. PMID:10982440
A rapid, generally applicable method to engineer zinc fingers illustrated by targeting the HIV-1 promoter.

PubMed

Isalan, M; Klug, A; Choo, Y

2001-07-01

DNA-binding domains with predetermined sequence specificity are engineered by selection of zinc finger modules using phage display, allowing the construction of customized transcription factors. Despite remarkable progress in this field, the available protein-engineering methods are deficient in many respects, thus hampering the applicability of the technique. Here we present a rapid and convenient method that can be used to design zinc finger proteins against a variety of DNA-binding sites. This is based on a pair of pre-made zinc finger phage-display libraries, which are used in parallel to select two DNA-binding domains each of which recognizes given 5 base pair sequences, and whose products are recombined to produce a single protein that recognizes a composite (9 base pair) site of predefined sequence. Engineering using this system can be completed in less than two weeks and yields proteins that bind sequence-specifically to DNA with Kd values in the nanomolar range. To illustrate the technique, we have selected seven different proteins to bind various regions of the human immunodeficiency virus 1 (HIV-1) promoter.
B-chromosome systems in the greater glider, Petauroides volans (Marsupialia: Pseudocheiridae). II. Investigation of B-chromosome DNA sequences isolated by micromanipulation and PCR.

PubMed

McQuade, L R; Hill, R J; Francis, D

1994-01-01

B chromosomes, despite their common occurrence throughout the animal and plant kingdoms, have not been investigated extensively at the molecular level. While the majority of B chromosomes occurring in animals have been described as heterochromatic, only a few researchers have examined the DNA of these chromosomes beyond this gross cytological level. This is the case in the largest of the gliding marsupial possums, the greater glider, Petauroides volans. To examine the molecular composition and localization of B-chromosome DNA sequences in P. volans, a combination of micromanipulation and the polymerase chain reaction was used in this study to isolate and then amplify the DNA of the B chromosomes. Localization of the isolated B-chromosome sequences to metaphase chromosomes was investigated using fluorescence in situ hybridization. The B chromosomes in this species are shown to be composed of a heterogeneous mixture of sequences, some of which are unique to the B chromosomes, while others exhibit homology to the centromeric regions of the autosomal complement.
Oligonucleotide fingerprinting of rRNA genes for analysis of fungal community composition.

PubMed

Valinsky, Lea; Della Vedova, Gianluca; Jiang, Tao; Borneman, James

2002-12-01

Thorough assessments of fungal diversity are currently hindered by technological limitations. Here we describe a new method for identifying fungi, oligonucleotide fingerprinting of rRNA genes (OFRG). ORFG sorts arrayed rRNA gene (ribosomal DNA [rDNA]) clones into taxonomic clusters through a series of hybridization experiments, each using a single oligonucleotide probe. A simulated annealing algorithm was used to design an OFRG probe set for fungal rDNA. Analysis of 1,536 fungal rDNA clones derived from soil generated 455 clusters. A pairwise sequence analysis showed that clones with average sequence identities of 99.2% were grouped into the same cluster. To examine the accuracy of the taxonomic identities produced by this OFRG experiment, we determined the nucleotide sequences for 117 clones distributed throughout the tree. For all but two of these clones, the taxonomic identities generated by this OFRG experiment were consistent with those generated by a nucleotide sequence analysis. Eighty-eight percent of the clones were affiliated with Ascomycota, while 12% belonged to BASIDIOMYCOTA: A large fraction of the clones were affiliated with the genera Fusarium (404 clones) and Raciborskiomyces (176 clones). Smaller assemblages of clones had high sequence identities to the Alternaria, Ascobolus, Chaetomium, Cryptococcus, and Rhizoctonia clades.
DNA nanotechnology-based composite-type gold nanoparticle-immunostimulatory DNA hydrogel for tumor photothermal immunotherapy.

PubMed

Yata, Tomoya; Takahashi, Yuki; Tan, Mengmeng; Nakatsuji, Hirotaka; Ohtsuki, Shozo; Murakami, Tatsuya; Imahori, Hiroshi; Umeki, Yuka; Shiomi, Tomoki; Takakura, Yoshinobu; Nishikawa, Makiya

2017-11-01

Success of tumor photothermal immunotherapy requires a system that induces heat stress in cancer cells and enhances strong anti-tumor immune responses. Here, we designed a composite-type immunostimulatory DNA hydrogel consisting of a hexapod-like structured DNA (hexapodna) with CpG sequences and gold nanoparticles. Mixing of the properly designed hexapodna and oligodeoxynucleotide-modified gold nanoparticles resulted in the formation of composite-type gold nanoparticle-DNA hydrogels. Laser irradiation of the hydrogel resulted in the release of hexapodna, which efficiently stimulated immune cells to release proinflammatory cytokines. Then, EG7-OVA tumor-bearing mice received an intratumoral injection of a gold nanoparticle-DNA hydrogel, followed by laser irradiation at 780 nm. This treatment increased the local temperature and the mRNA expression of heat shock protein 70 in the tumor tissue, increased tumor-associated antigen-specific IgG levels in the serum, and induced tumor-associated antigen-specific interferon-γ production from splenocytes. Moreover, the treatment significantly retarded the tumor growth and extended the survival of the tumor-bearing mice. Copyright © 2017 Elsevier Ltd. All rights reserved.
DNA encoding for plant digalactosyldiacylglycerol galactosyltransferase and methods of use

DOEpatents

Benning, Christoph; Doermann, Peter

2003-11-04

The cDNA encoding digalactosyldiacylglycerol galactosyltransferase (DGD1) is provided. The deduced amino acid sequence is also provided. Methods of making and using DGD1 to screen for new herbicides and alter a plant's leaf lipid composition are also provided, as well as expression vectors, transgenic plants or other organisms transfected with said vectors.
16S rRNA Gene Sequence Analysis of Drinking Water Using RNA and DNA Extracts as Targets for Clone Library Development

EPA Science Inventory

We examined the bacterial composition of chlorinated drinking water using 16S rRNA gene clone libraries derived from RNA and DNA extracted from twelve water samples collected in three different months (June, August, and September of 2007). Phylogenetic analysis of 1234 and 1117 ...
16S rRNA Gene Sequence Analysis of Drinking Water Using RNA and DNA Extracts as Targets for Clone Library Development - Poster

EPA Science Inventory

We examined the bacterial composition of chlorinated drinking water using 16S rRNA gene clone libraries derived from RNA and DNA extracted from twelve water samples collected in three different months (June, August, and September of 2007). Phylogenetic analysis of 1234 and 1117 ...
Microbial composition analyses by 16S rRNA sequencing: A proof of concept approach to provenance determination of archaeological ochre.

PubMed

Lenehan, Claire E; Tobe, Shanan S; Smith, Renee J; Popelka-Filcoff, Rachel S

2017-01-01

Many archaeological science studies use the concept of "provenance", where the origins of cultural material can be determined through physical or chemical properties that relate back to the origins of the material. Recent studies using DNA profiling of bacteria have been used for the forensic determination of soils, towards determination of geographic origin. This manuscript presents a novel approach to the provenance of archaeological minerals and related materials through the use of 16S rRNA sequencing analysis of microbial DNA. Through the microbial DNA characterization from ochre and multivariate statistics, we have demonstrated the clear discrimination between four distinct Australian cultural ochre sites.
Sequence-dependent response of DNA to torsional stress: a potential biological regulation mechanism.

PubMed

Reymer, Anna; Zakrzewska, Krystyna; Lavery, Richard

2018-02-28

Torsional restraints on DNA change in time and space during the life of the cell and are an integral part of processes such as gene expression, DNA repair and packaging. The mechanical behavior of DNA under torsional stress has been studied on a mesoscopic scale, but little is known concerning its response at the level of individual base pairs and the effects of base pair composition. To answer this question, we have developed a geometrical restraint that can accurately control the total twist of a DNA segment during all-atom molecular dynamics simulations. By applying this restraint to four different DNA oligomers, we are able to show that DNA responds to both under- and overtwisting in a very heterogeneous manner. Certain base pair steps, in specific sequence environments, are able to absorb most of the torsional stress, leaving other steps close to their relaxed conformation. This heterogeneity also affects the local torsional modulus of DNA. These findings suggest that modifying torsional stress on DNA could act as a modulator for protein binding via the heterogeneous changes in local DNA structure.

Sequence-dependent response of DNA to torsional stress: a potential biological regulation mechanism

PubMed Central

Reymer, Anna; Zakrzewska, Krystyna; Lavery, Richard

2018-01-01

Abstract Torsional restraints on DNA change in time and space during the life of the cell and are an integral part of processes such as gene expression, DNA repair and packaging. The mechanical behavior of DNA under torsional stress has been studied on a mesoscopic scale, but little is known concerning its response at the level of individual base pairs and the effects of base pair composition. To answer this question, we have developed a geometrical restraint that can accurately control the total twist of a DNA segment during all-atom molecular dynamics simulations. By applying this restraint to four different DNA oligomers, we are able to show that DNA responds to both under- and overtwisting in a very heterogeneous manner. Certain base pair steps, in specific sequence environments, are able to absorb most of the torsional stress, leaving other steps close to their relaxed conformation. This heterogeneity also affects the local torsional modulus of DNA. These findings suggest that modifying torsional stress on DNA could act as a modulator for protein binding via the heterogeneous changes in local DNA structure. PMID:29267977
Herpes simplex virus DNA packaging sequences adopt novel structures that are specifically recognized by a component of the cleavage and packaging machinery.

PubMed

Adelman, K; Salmon, B; Baines, J D

2001-03-13

The product of the herpes simplex virus type 1 U(L)28 gene is essential for cleavage of concatemeric viral DNA into genome-length units and packaging of this DNA into viral procapsids. To address the role of U(L)28 in this process, purified U(L)28 protein was assayed for the ability to recognize conserved herpesvirus DNA packaging sequences. We report that DNA fragments containing the pac1 DNA packaging motif can be induced by heat treatment to adopt novel DNA conformations that migrate faster than the corresponding duplex in nondenaturing gels. Surprisingly, these novel DNA structures are high-affinity substrates for U(L)28 protein binding, whereas double-stranded DNA of identical sequence composition is not recognized by U(L)28 protein. We demonstrate that only one strand of the pac1 motif is responsible for the formation of novel DNA structures that are bound tightly and specifically by U(L)28 protein. To determine the relevance of the observed U(L)28 protein-pac1 interaction to the cleavage and packaging process, we have analyzed the binding affinity of U(L)28 protein for pac1 mutants previously shown to be deficient in cleavage and packaging in vivo. Each of the pac1 mutants exhibited a decrease in DNA binding by U(L)28 protein that correlated directly with the reported reduction in cleavage and packaging efficiency, thereby supporting a role for the U(L)28 protein-pac1 interaction in vivo. These data therefore suggest that the formation of novel DNA structures by the pac1 motif confers added specificity on recognition of DNA packaging sequences by the U(L)28-encoded component of the herpesvirus cleavage and packaging machinery.
Ancestral sequence reconstruction in primate mitochondrial DNA: compositional bias and effect on functional inference.

PubMed

Krishnan, Neeraja M; Seligmann, Hervé; Stewart, Caro-Beth; De Koning, A P Jason; Pollock, David D

2004-10-01

Reconstruction of ancestral DNA and amino acid sequences is an important means of inferring information about past evolutionary events. Such reconstructions suggest changes in molecular function and evolutionary processes over the course of evolution and are used to infer adaptation and convergence. Maximum likelihood (ML) is generally thought to provide relatively accurate reconstructed sequences compared to parsimony, but both methods lead to the inference of multiple directional changes in nucleotide frequencies in primate mitochondrial DNA (mtDNA). To better understand this surprising result, as well as to better understand how parsimony and ML differ, we constructed a series of computationally simple "conditional pathway" methods that differed in the number of substitutions allowed per site along each branch, and we also evaluated the entire Bayesian posterior frequency distribution of reconstructed ancestral states. We analyzed primate mitochondrial cytochrome b (Cyt-b) and cytochrome oxidase subunit I (COI) genes and found that ML reconstructs ancestral frequencies that are often more different from tip sequences than are parsimony reconstructions. In contrast, frequency reconstructions based on the posterior ensemble more closely resemble extant nucleotide frequencies. Simulations indicate that these differences in ancestral sequence inference are probably due to deterministic bias caused by high uncertainty in the optimization-based ancestral reconstruction methods (parsimony, ML, Bayesian maximum a posteriori). In contrast, ancestral nucleotide frequencies based on an average of the Bayesian set of credible ancestral sequences are much less biased. The methods involving simpler conditional pathway calculations have slightly reduced likelihood values compared to full likelihood calculations, but they can provide fairly unbiased nucleotide reconstructions and may be useful in more complex phylogenetic analyses than considered here due to their speed and flexibility. To determine whether biased reconstructions using optimization methods might affect inferences of functional properties, ancestral primate mitochondrial tRNA sequences were inferred and helix-forming propensities for conserved pairs were evaluated in silico. For ambiguously reconstructed nucleotides at sites with high base composition variability, ancestral tRNA sequences from Bayesian analyses were more compatible with canonical base pairing than were those inferred by other methods. Thus, nucleotide bias in reconstructed sequences apparently can lead to serious bias and inaccuracies in functional predictions.
SSR_pipeline: a bioinformatic infrastructure for identifying microsatellites from paired-end Illumina high-throughput DNA sequencing data

USGS Publications Warehouse

Miller, Mark P.; Knaus, Brian J.; Mullins, Thomas D.; Haig, Susan M.

2013-01-01

SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (e.g., microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains 3 analysis modules along with a fourth control module that can automate analyses of large volumes of data. The modules are used to 1) identify the subset of paired-end sequences that pass Illumina quality standards, 2) align paired-end reads into a single composite DNA sequence, and 3) identify sequences that possess microsatellites (both simple and compound) conforming to user-specified parameters. The microsatellite search algorithm is extremely efficient, and we have used it to identify repeats with motifs from 2 to 25bp in length. Each of the 3 analysis modules can also be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc.). We demonstrate use of the program with data from the brine fly Ephydra packardi (Diptera: Ephydridae) and provide empirical timing benchmarks to illustrate program performance on a common desktop computer environment. We further show that the Illumina platform is capable of identifying large numbers of microsatellites, even when using unenriched sample libraries and a very small percentage of the sequencing capacity from a single DNA sequencing run. All modules from SSR_pipeline are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, and Windows).
SSR_pipeline: a bioinformatic infrastructure for identifying microsatellites from paired-end Illumina high-throughput DNA sequencing data.

PubMed

Miller, Mark P; Knaus, Brian J; Mullins, Thomas D; Haig, Susan M

2013-01-01

SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (e.g., microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains 3 analysis modules along with a fourth control module that can automate analyses of large volumes of data. The modules are used to 1) identify the subset of paired-end sequences that pass Illumina quality standards, 2) align paired-end reads into a single composite DNA sequence, and 3) identify sequences that possess microsatellites (both simple and compound) conforming to user-specified parameters. The microsatellite search algorithm is extremely efficient, and we have used it to identify repeats with motifs from 2 to 25 bp in length. Each of the 3 analysis modules can also be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc.). We demonstrate use of the program with data from the brine fly Ephydra packardi (Diptera: Ephydridae) and provide empirical timing benchmarks to illustrate program performance on a common desktop computer environment. We further show that the Illumina platform is capable of identifying large numbers of microsatellites, even when using unenriched sample libraries and a very small percentage of the sequencing capacity from a single DNA sequencing run. All modules from SSR_pipeline are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, and Windows).
PCR Conditions for 16S Primers for Analysis of Microbes in the Colon of Rats.

PubMed

Guillen, I A; Camacho, H; Tuero, A D; Bacardí, D; Palenzuela, D O; Aguilera, A; Silva, J A; Estrada, R; Gell, O; Suárez, J; Ancizar, J; Brown, E; Colarte, A B; Castro, J; Novoa, L I

2016-09-01

The study of the composition of the intestinal flora is important to the health of the host, playing a key role in maintaining intestinal homeostasis and the evolution of the immune system. For these studies, various universal primers of the 16S rDNA gene are used in microbial taxonomy. Here, we report an evaluation of 5 universal primers to explore the presence of microbial DNA in colon biopsies preserved in RNAlater solution. The DNA extracted was used for the amplification of PCR products containing the variable (V) regions of the microbial 16S rDNA gene. The PCR products were studied by restriction fragment length polymorphism (RFLP) analysis and DNA sequence, whose percent of homology with microbial sequences reported in GenBank was verified using bioinformatics tools. The presence of microbes in the colon of rats was quantified by the quantitative PCR (qPCR) technique. We obtained microbial DNA from rat, useful for PCR analysis with the universal primers for the bacteria 16S rDNA. The sequences of PCR products obtained from a colon biopsy of the animal showed homology with the classes bacilli (Lactobacillus spp) and proteobacteria, normally represented in the colon of rats. The proposed methodology allowed the attainment of DNA of bacteria with the quality and integrity for use in qPCR, sequencing, and PCR-RFLP analysis. The selected universal primers provided knowledge of the abundance of microorganisms and the formation of a preliminary test of bacterial diversity in rat colon biopsies.
Vertebrate Genome Evolution in the Light of Fish Cytogenomics and rDNAomics

PubMed Central

Howell, W. Mike

2018-01-01

To understand the cytogenomic evolution of vertebrates, we must first unravel the complex genomes of fishes, which were the first vertebrates to evolve and were ancestors to all other vertebrates. We must not forget the immense time span during which the fish genomes had to evolve. Fish cytogenomics is endowed with unique features which offer irreplaceable insights into the evolution of the vertebrate genome. Due to the general DNA base compositional homogeneity of fish genomes, fish cytogenomics is largely based on mapping DNA repeats that still represent serious obstacles in genome sequencing and assembling, even in model species. Localization of repeats on chromosomes of hundreds of fish species and populations originating from diversified environments have revealed the biological importance of this genomic fraction. Ribosomal genes (rDNA) belong to the most informative repeats and in fish, they are subject to a more relaxed regulation than in higher vertebrates. This can result in formation of a literal ‘rDNAome’ consisting of more than 20,000 copies with their high proportion employed in extra-coding functions. Because rDNA has high rates of transcription and recombination, it contributes to genome diversification and can form reproductive barrier. Our overall knowledge of fish cytogenomics grows rapidly by a continuously increasing number of fish genomes sequenced and by use of novel sequencing methods improving genome assembly. The recently revealed exceptional compositional heterogeneity in an ancient fish lineage (gars) sheds new light on the compositional genome evolution in vertebrates generally. We highlight the power of synergy of cytogenetics and genomics in fish cytogenomics, its potential to understand the complexity of genome evolution in vertebrates, which is also linked to clinical applications and the chromosomal backgrounds of speciation. We also summarize the current knowledge on fish cytogenomics and outline its main future avenues. PMID:29443947
Fragmentation of contaminant and endogenous DNA in ancient samples determined by shotgun sequencing; prospects for human palaeogenomics.

PubMed

García-Garcerà, Marc; Gigli, Elena; Sanchez-Quinto, Federico; Ramirez, Oscar; Calafell, Francesc; Civit, Sergi; Lalueza-Fox, Carles

2011-01-01

Despite the successful retrieval of genomes from past remains, the prospects for human palaeogenomics remain unclear because of the difficulty of distinguishing contaminant from endogenous DNA sequences. Previous sequence data generated on high-throughput sequencing platforms indicate that fragmentation of ancient DNA sequences is a characteristic trait primarily arising due to depurination processes that create abasic sites leading to DNA breaks. METHODOLOGY/PRINCIPALS FINDINGS: To investigate whether this pattern is present in ancient remains from a temperate environment, we have 454-FLX pyrosequenced different samples dated between 5,500 and 49,000 years ago: a bone from an extinct goat (Myotragus balearicus) that was treated with a depurinating agent (bleach), an Iberian lynx bone not subjected to any treatment, a human Neolithic sample from Barcelona (Spain), and a Neandertal sample from the El Sidrón site (Asturias, Spain). The efficiency of retrieval of endogenous sequences is below 1% in all cases. We have used the non-human samples to identify human sequences (0.35 and 1.4%, respectively), that we positively know are contaminants. We observed that bleach treatment appears to create a depurination-associated fragmentation pattern in resulting contaminant sequences that is indistinguishable from previously described endogenous sequences. Furthermore, the nucleotide composition pattern observed in 5' and 3' ends of contaminant sequences is much more complex than the flat pattern previously described in some Neandertal contaminants. Although much research on samples with known contaminant histories is needed, our results suggest that endogenous and contaminant sequences cannot be distinguished by the fragmentation pattern alone.
Bacterial and fungal composition profiling of microbial based cleaning products.

PubMed

Subasinghe, R M; Samarajeewa, A D; Meier, M; Coleman, G; Clouthier, H; Crosthwait, J; Tayabali, A F; Scroggins, R; Shwed, P S; Beaudette, L A

2018-06-01

Microbial based cleaning products (MBCPs) are a new generation of cleaning products that are gaining greater use in household, institutional, and industrial settings. Little is known about the exact microbial composition of these products because they are not identified in detail on product labels and formulations are often proprietary. To gain a better understanding of their microbial and fungal composition towards risk assessment, the cultivable microorganisms and rDNA was surveyed for microbial content in five different MBCPs manufactured and sold in North America. Individual bacterial and fungal colonies were identified by ribosequencing and fatty acid methyl ester (FAME) gas chromatography. Metagenomic DNA (mDNA) corresponding to each of the products was subjected to amplification and short read sequencing of seven of the variable regions of the bacterial 16S ribosomal DNA. Taken together, the cultivable microorganism and rDNA survey analyses showed that three of the products were simple mixtures of Bacillus species. The two other products featured a mixture of cultivable fungi with Bacilli, and by rDNA survey analysis, they featured greater microbial complexity. This study improves our understanding of the microbial composition of several MBCPs towards a more comprehensive risk assessment. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.
Plant centromere compositions

DOEpatents

Mach, Jennifer M [Chicago, IL; Zieler, Helge [Del Mar, CA; Jin, RongGuan [Chesterfield, MO; Keith, Kevin [Three Forks, MT; Copenhaver, Gregory P [Chapel Hill, NC; Preuss, Daphne [Chicago, IL

2011-08-02

The present invention provides for the nucleic acid sequences of plant centromeres. This will permit construction of stably inherited recombinant DNA constructs and minichromosomes which can serve as vectors for the construction of transgenic plant and animal cells.
Plant centromere compositions

DOEpatents

Mach,; Jennifer M. , Zieler; Helge, Jin [Del Mar, CA; RongGuan, Keith [Chesterfield, MO; Kevin, Copenhaver [Three Forks, MT; Gregory P. , Preuss; Daphne, [Chicago, IL

2011-11-22

The present invention provides for the nucleic acid sequences of plant centromeres. This will permit construction of stably inherited recombinant DNA constructs and minichromosomes which can serve as vectors for the construction of transgenic plant and animal cells.
Plant centromere compositions

DOEpatents

Keith, Kevin; Copenhaver, Gregory; Preuss, Daphne

2006-10-10

The present invention provides for the nucleic acid sequences of plant centromeres. This will permit construction of stably inherited recombinant DNA constructs and minichromosomes which can serve as vectors for the construction of transgenic plant and animal cells.
Plant centromere compositions

DOEpatents

Mach, Jennifer [Chicago, IL; Zieler, Helge [Chicago, IL; Jin, James [Chicago, IL; Keith, Kevin [Chicago, IL; Copenhaver, Gregory [Chapel Hill, NC; Preuss, Daphne [Chicago, IL

2006-06-26

The present invention provides for the nucleic acid sequences of plant centromeres. This will permit construction of stably inherited recombinant DNA constructs and minichromosomes which can serve as vectors for the construction of transgenic plant and animal cells.
Plant centromere compositions

DOEpatents

Mach, Jennifer [Chicago, IL; Zieler, Helge [Chicago, IL; Jin, RongGuan [Chicago, IL; Keith, Kevin [Chicago, IL; Copenhaver, Gregory [Chapel Hill, NC; Preuss, Daphne [Chicago, IL

2007-06-05

The present invention provides for the nucleic acid sequences of plant centromeres. This will permit construction of stably inherited recombinant DNA constructs and minichromosomes which can serve as vectors for the construction of transgenic plant and animal cells.
A Review on the Applications of Next Generation Sequencing Technologies as Applied to Food-Related Microbiome Studies

PubMed Central

Cao, Yu; Fanning, Séamus; Proos, Sinéad; Jordan, Kieran; Srikumar, Shabarinath

2017-01-01

The development of next generation sequencing (NGS) techniques has enabled researchers to study and understand the world of microorganisms from broader and deeper perspectives. The contemporary advances in DNA sequencing technologies have not only enabled finer characterization of bacterial genomes but also provided deeper taxonomic identification of complex microbiomes which in its genomic essence is the combined genetic material of the microorganisms inhabiting an environment, whether the environment be a particular body econiche (e.g., human intestinal contents) or a food manufacturing facility econiche (e.g., floor drain). To date, 16S rDNA sequencing, metagenomics and metatranscriptomics are the three basic sequencing strategies used in the taxonomic identification and characterization of food-related microbiomes. These sequencing strategies have used different NGS platforms for DNA and RNA sequence identification. Traditionally, 16S rDNA sequencing has played a key role in understanding the taxonomic composition of a food-related microbiome. Recently, metagenomic approaches have resulted in improved understanding of a microbiome by providing a species-level/strain-level characterization. Further, metatranscriptomic approaches have contributed to the functional characterization of the complex interactions between different microbial communities within a single microbiome. Many studies have highlighted the use of NGS techniques in investigating the microbiome of fermented foods. However, the utilization of NGS techniques in studying the microbiome of non-fermented foods are limited. This review provides a brief overview of the advances in DNA sequencing chemistries as the technology progressed from first, next and third generations and highlights how NGS provided a deeper understanding of food-related microbiomes with special focus on non-fermented foods. PMID:29033905
Novel division level bacterial diversity in a Yellowstone hot spring.

PubMed

Hugenholtz, P; Pitulle, C; Hershberger, K L; Pace, N R

1998-01-01

A culture-independent molecular phylogenetic survey was carried out for the bacterial community in Obsidian Pool (OP), a Yellowstone National Park hot spring previously shown to contain remarkable archaeal diversity (S. M. Barns, R. E. Fundyga, M. W. Jeffries, and N. R. Page, Proc. Natl. Acad. Sci. USA 91:1609-1613, 1994). Small-subunit rRNA genes (rDNA) were amplified directly from OP sediment DNA by PCR with universally conserved or Bacteria-specific rDNA primers and cloned. Unique rDNA types among > 300 clones were identified by restriction fragment length polymorphism, and 122 representative rDNA sequences were determined. These were found to represent 54 distinct bacterial sequence types or clusters (> or = 98% identity) of sequences. A majority (70%) of the sequence types were affiliated with 14 previously recognized bacterial divisions (main phyla; kingdoms); 30% were unaffiliated with recognized bacterial divisions. The unaffiliated sequence types (represented by 38 sequences) nominally comprise 12 novel, division level lineages termed candidate divisions. Several OP sequences were nearly identical to those of cultivated chemolithotrophic thermophiles, including the hydrogen-oxidizing Calderobacterium and the sulfate reducers Thermodesulfovibrio and Thermodesulfobacterium, or belonged to monophyletic assemblages recognized for a particular type of metabolism, such as the hydrogen-oxidizing Aquificales and the sulfate-reducing delta-Proteobacteria. The occurrence of such organisms is consistent with the chemical composition of OP (high in reduced iron and sulfur) and suggests a lithotrophic base for primary productivity in this hot spring, through hydrogen oxidation and sulfate reduction. Unexpectedly, no archaeal sequences were encountered in OP clone libraries made with universal primers. Hybridization analysis of amplified OP DNA with domain-specific probes confirmed that the analyzed community rDNA from OP sediment was predominantly bacterial. These results expand substantially our knowledge of the extent of bacterial diversity and call into question the commonly held notion that Archaea dominate hydrothermal environments. Finally, the currently known extent of division level bacterial phylogenetic diversity is collated and summarized.
Diversity of Bacteria at Healthy Human Conjunctiva

PubMed Central

Dong, Qunfeng; Brulc, Jennifer M.; Iovieno, Alfonso; Bates, Brandon; Garoutte, Aaron; Miller, Darlene; Revanna, Kashi V.; Gao, Xiang; Antonopoulos, Dionysios A.; Slepak, Vladlen Z.

2011-01-01

Purpose. Ocular surface (OS) microbiota contributes to infectious and autoimmune diseases of the eye. Comprehensive analysis of microbial diversity at the OS has been impossible because of the limitations of conventional cultivation techniques. This pilot study aimed to explore true diversity of human OS microbiota using DNA sequencing-based detection and identification of bacteria. Methods. Composition of the bacterial community was characterized using deep sequencing of the 16S rRNA gene amplicon libraries generated from total conjunctival swab DNA. The DNA sequences were classified and the diversity parameters measured using bioinformatics software ESPRIT and MOTHUR and tools available through the Ribosomal Database Project-II (RDP-II). Results. Deep sequencing of conjunctival rDNA from four subjects yielded a total of 115,003 quality DNA reads, corresponding to 221 species-level phylotypes per subject. The combined bacterial community classified into 5 phyla and 59 distinct genera. However, 31% of all DNA reads belonged to unclassified or novel bacteria. The intersubject variability of individual OS microbiomes was very significant. Regardless, 12 genera—Pseudomonas, Propionibacterium, Bradyrhizobium, Corynebacterium, Acinetobacter, Brevundimonas, Staphylococci, Aquabacterium, Sphingomonas, Streptococcus, Streptophyta, and Methylobacterium—were ubiquitous among the analyzed cohort and represented the putative “core” of conjunctival microbiota. The other 47 genera accounted for <4% of the classified portion of this microbiome. Unexpectedly, healthy conjunctiva contained many genera that are commonly identified as ocular surface pathogens. Conclusions. The first DNA sequencing-based survey of bacterial population at the conjunctiva have revealed an unexpectedly diverse microbial community. All analyzed samples contained ubiquitous (core) genera that included commensal, environmental, and opportunistic pathogenic bacteria. PMID:21571682
VIP Barcoding: composition vector-based software for rapid species identification based on DNA barcoding.

PubMed

Fan, Long; Hui, Jerome H L; Yu, Zu Guo; Chu, Ka Hou

2014-07-01

Species identification based on short sequences of DNA markers, that is, DNA barcoding, has emerged as an integral part of modern taxonomy. However, software for the analysis of large and multilocus barcoding data sets is scarce. The Basic Local Alignment Search Tool (BLAST) is currently the fastest tool capable of handling large databases (e.g. >5000 sequences), but its accuracy is a concern and has been criticized for its local optimization. However, current more accurate software requires sequence alignment or complex calculations, which are time-consuming when dealing with large data sets during data preprocessing or during the search stage. Therefore, it is imperative to develop a practical program for both accurate and scalable species identification for DNA barcoding. In this context, we present VIP Barcoding: a user-friendly software in graphical user interface for rapid DNA barcoding. It adopts a hybrid, two-stage algorithm. First, an alignment-free composition vector (CV) method is utilized to reduce searching space by screening a reference database. The alignment-based K2P distance nearest-neighbour method is then employed to analyse the smaller data set generated in the first stage. In comparison with other software, we demonstrate that VIP Barcoding has (i) higher accuracy than Blastn and several alignment-free methods and (ii) higher scalability than alignment-based distance methods and character-based methods. These results suggest that this platform is able to deal with both large-scale and multilocus barcoding data with accuracy and can contribute to DNA barcoding for modern taxonomy. VIP Barcoding is free and available at http://msl.sls.cuhk.edu.hk/vipbarcoding/. © 2014 John Wiley & Sons Ltd.
Intra-specific variation in genome size in maize: cytological and phenotypic correlates

PubMed Central

Realini, María Florencia; Poggio, Lidia; Cámara-Hernández, Julián; González, Graciela Esther

2016-01-01

Genome size variation accompanies the diversification and evolution of many plant species. Relationships between DNA amount and phenotypic and cytological characteristics form the basis of most hypotheses that ascribe a biological role to genome size. The goal of the present research was to investigate the intra-specific variation in the DNA content in maize populations from Northeastern Argentina and further explore the relationship between genome size and the phenotypic traits seed weight and length of the vegetative cycle. Moreover, cytological parameters such as the percentage of heterochromatin as well as the number, position and sequence composition of knobs were analysed and their relationships with 2C DNA values were explored. The populations analysed presented significant differences in 2C DNA amount, from 4.62 to 6.29 pg, representing 36.15 % of the inter-populational variation. Moreover, intra-populational genome size variation was found, varying from 1.08 to 1.63-fold. The variation in the percentage of knob heterochromatin as well as in the number, chromosome position and sequence composition of the knobs was detected among and within the populations. Although a positive relationship between genome size and the percentage of heterochromatin was observed, a significant correlation was not found. This confirms that other non-coding repetitive DNA sequences are contributing to the genome size variation. A positive relationship between DNA amount and the seed weight has been reported in a large number of species, this relationship was not found in the populations studied here. The length of the vegetative cycle showed a positive correlation with the percentage of heterochromatin. This result allowed attributing an adaptive effect to heterochromatin since the length of this cycle would be optimized via selection for an appropriate percentage of heterochromatin. PMID:26644343
2-Way k-Means as a Model for Microbiome Samples.

PubMed

Jackson, Weston J; Agarwal, Ipsita; Pe'er, Itsik

2017-01-01

Motivation . Microbiome sequencing allows defining clusters of samples with shared composition. However, this paradigm poorly accounts for samples whose composition is a mixture of cluster-characterizing ones and which therefore lie in between them in the cluster space. This paper addresses unsupervised learning of 2-way clusters. It defines a mixture model that allows 2-way cluster assignment and describes a variant of generalized k -means for learning such a model. We demonstrate applicability to microbial 16S rDNA sequencing data from the Human Vaginal Microbiome Project.

2-Way k-Means as a Model for Microbiome Samples

PubMed Central

2017-01-01

Motivation. Microbiome sequencing allows defining clusters of samples with shared composition. However, this paradigm poorly accounts for samples whose composition is a mixture of cluster-characterizing ones and which therefore lie in between them in the cluster space. This paper addresses unsupervised learning of 2-way clusters. It defines a mixture model that allows 2-way cluster assignment and describes a variant of generalized k-means for learning such a model. We demonstrate applicability to microbial 16S rDNA sequencing data from the Human Vaginal Microbiome Project. PMID:29177026
Origin and composition of cell-free DNA in spent medium from human embryo culture during preimplantation development.

PubMed

Vera-Rodriguez, M; Diez-Juan, A; Jimenez-Almazan, J; Martinez, S; Navarro, R; Peinado, V; Mercader, A; Meseguer, M; Blesa, D; Moreno, I; Valbuena, D; Rubio, C; Simon, C

2018-04-01

What is the origin and composition of cell-free DNA in human embryo spent culture media? Cell-free DNA from human embryo spent culture media represents a mix of maternal and embryonic DNA, and the mixture can be more complex for mosaic embryos. In 2016, ~300 000 human embryos were chromosomally and/or genetically analyzed using preimplantation genetic testing for aneuploidies (PGT-A) or monogenic disorders (PGT-M) before transfer into the uterus. While progress in genetic techniques has enabled analysis of the full karyotype in a single cell with high sensitivity and specificity, these approaches still require an embryo biopsy. Thus, non-invasive techniques are sought as an alternative. This study was based on a total of 113 human embryos undergoing trophectoderm biopsy as part of PGT-A analysis. For each embryo, the spent culture media used between Day 3 and Day 5 of development were collected for cell-free DNA analysis. In addition to the 113 spent culture media samples, 28 media drops without embryo contact were cultured in parallel under the same conditions to use as controls. In total, 141 media samples were collected and divided into two groups: one for direct DNA quantification (53 spent culture media and 17 controls), the other for whole-genome amplification (60 spent culture media and 11 controls) and subsequent quantification. Some samples with amplified DNA (N = 56) were used for aneuploidy testing by next-generation sequencing; of those, 35 samples underwent single-nucleotide polymorphism (SNP) sequencing to detect maternal contamination. Finally, from the 35 spent culture media analyzed by SNP sequencing, 12 whole blastocysts were analyzed by fluorescence in situ hybridization (FISH) to determine the level of mosaicism in each embryo, as a possible origin for discordance between sample types. Trophectoderm biopsies and culture media samples (20 μl) underwent whole-genome amplification, then libraries were generated and sequenced for an aneuploidy study. For SNP sequencing, triads including trophectoderm DNA, cell-free DNA, and follicular fluid DNA were analyzed. In total, 124 SNPs were included with 90 SNPs distributed among all autosomes and 34 SNPs located on chromosome Y. Finally, 12 whole blastocysts were fixed and individual cells were analyzed by FISH using telomeric/centromeric probes for the affected chromosomes. We found a higher quantity of cell-free DNA in spent culture media co-cultured with embryos versus control media samples (P ≤ 0.001). The presence of cell-free DNA in the spent culture media enabled a chromosomal diagnosis, although results differed from those of trophectoderm biopsy analysis in most cases (67%). Discordant results were mainly attributable to a high percentage of maternal DNA in the spent culture media, with a median percentage of embryonic DNA estimated at 8%. Finally, from the discordant cases, 91.7% of whole blastocysts analyzed by FISH were mosaic and 75% of the analyzed chromosomes were concordant with the trophectoderm DNA diagnosis instead of the cell-free DNA result. This study was limited by the sample size and the number of cells analyzed by FISH. This is the first study to combine chromosomal analysis of cell-free DNA, SNP sequencing to identify maternal contamination, and whole-blastocyst analysis for detecting mosaicism. Our results provide a better understanding of the origin of cell-free DNA in spent culture media, offering an important step toward developing future non-invasive karyotyping that must rely on the specific identification of DNA released from human embryos. This work was funded by Igenomix S.L. There are no competing interests.
Detecting the borders between coding and non-coding DNA regions in prokaryotes based on recursive segmentation and nucleotide doublets statistics

PubMed Central

2012-01-01

Background Detecting the borders between coding and non-coding regions is an essential step in the genome annotation. And information entropy measures are useful for describing the signals in genome sequence. However, the accuracies of previous methods of finding borders based on entropy segmentation method still need to be improved. Methods In this study, we first applied a new recursive entropic segmentation method on DNA sequences to get preliminary significant cuts. A 22-symbol alphabet is used to capture the differential composition of nucleotide doublets and stop codon patterns along three phases in both DNA strands. This process requires no prior training datasets. Results Comparing with the previous segmentation methods, the experimental results on three bacteria genomes, Rickettsia prowazekii, Borrelia burgdorferi and E.coli, show that our approach improves the accuracy for finding the borders between coding and non-coding regions in DNA sequences. Conclusions This paper presents a new segmentation method in prokaryotes based on Jensen-Rényi divergence with a 22-symbol alphabet. For three bacteria genomes, comparing to A12_JR method, our method raised the accuracy of finding the borders between protein coding and non-coding regions in DNA sequences. PMID:23282225
Comment on "Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry".

PubMed

Buckley, Mike; Walker, Angela; Ho, Simon Y W; Yang, Yue; Smith, Colin; Ashton, Peter; Oates, Jane Thomas; Cappellini, Enrico; Koon, Hannah; Penkman, Kirsty; Elsworth, Ben; Ashford, Dave; Solazzo, Caroline; Andrews, Phillip; Strahler, John; Shapiro, Beth; Ostrom, Peggy; Gandhi, Hasand; Miller, Webb; Raney, Brian; Zylber, Maria Ines; Gilbert, M Thomas P; Prigodich, Richard V; Ryan, Michael; Rijsdijk, Kenneth F; Janoo, Anwar; Collins, Matthew J

2008-01-04

We used authentication tests developed for ancient DNA to evaluate claims by Asara et al. (Reports, 13 April 2007, p. 280) of collagen peptide sequences recovered from mastodon and Tyrannosaurus rex fossils. Although the mastodon samples pass these tests, absence of amino acid composition data, lack of evidence for peptide deamidation, and association of alpha1(I) collagen sequences with amphibians rather than birds suggest that T. rex does not.
A genome-wide BAC-end sequence survey provides first insights into sweetpotato (Ipomoea batatas (L.) Lam.) genome composition.

PubMed

Si, Zengzhi; Du, Bing; Huo, Jinxi; He, Shaozhen; Liu, Qingchang; Zhai, Hong

2016-11-21

Sweetpotato, Ipomoea batatas (L.) Lam., is an important food crop widely grown in the world. However, little is known about the genome of this species because it is a highly heterozygous hexaploid. Gaining a more in-depth knowledge of sweetpotato genome is therefore necessary and imperative. In this study, the first bacterial artificial chromosome (BAC) library of sweetpotato was constructed. Clones from the BAC library were end-sequenced and analyzed to provide genome-wide information about this species. The BAC library contained 240,384 clones with an average insert size of 101 kb and had a 7.93-10.82 × coverage of the genome, and the probability of isolating any single-copy DNA sequence from the library was more than 99%. Both ends of 8310 BAC clones randomly selected from the library were sequenced to generate 11,542 high-quality BAC-end sequences (BESs), with an accumulative length of 7,595,261 bp and an average length of 658 bp. Analysis of the BESs revealed that 12.17% of the sweetpotato genome were known repetitive DNA, including 7.37% long terminal repeat (LTR) retrotransposons, 1.15% Non-LTR retrotransposons and 1.42% Class II DNA transposons etc., 18.31% of the genome were identified as sweetpotato-unique repetitive DNA and 10.00% of the genome were predicted to be coding regions. In total, 3,846 simple sequences repeats (SSRs) were identified, with a density of one SSR per 1.93 kb, from which 288 SSRs primers were designed and tested for length polymorphism using 20 sweetpotato accessions, 173 (60.07%) of them produced polymorphic bands. Sweetpotato BESs had significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum than those of Vitis vinifera, Theobroma cacao and Arabidopsis thaliana. The first BAC library for sweetpotato has been successfully constructed. The high quality BESs provide first insights into sweetpotato genome composition, and have significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum. These resources as a robust platform will be used in high-resolution mapping, gene cloning, assembly of genome sequences, comparative genomics and evolution for sweetpotato.
Methodology challenges in studying human gut microbiota - effects of collection, storage, DNA extraction and next generation sequencing technologies.

PubMed

Panek, Marina; Čipčić Paljetak, Hana; Barešić, Anja; Perić, Mihaela; Matijašić, Mario; Lojkić, Ivana; Vranešić Bender, Darija; Krznarić, Željko; Verbanac, Donatella

2018-03-23

The information on microbiota composition in the human gastrointestinal tract predominantly originates from the analyses of human faeces by application of next generation sequencing (NGS). However, the detected composition of the faecal bacterial community can be affected by various factors including experimental design and procedures. This study evaluated the performance of different protocols for collection and storage of faecal samples (native and OMNIgene.GUT system) and bacterial DNA extraction (MP Biomedicals, QIAGEN and MO BIO kits), using two NGS platforms for 16S rRNA gene sequencing (Ilumina MiSeq and Ion Torrent PGM). OMNIgene.GUT proved as a reliable and convenient system for collection and storage of faecal samples although favouring Sutterella genus. MP provided superior DNA yield and quality, MO BIO depleted Gram positive organisms while using QIAGEN with OMNIgene.GUT resulted in greatest variability compared to other two kits. MiSeq and IT platforms in their supplier recommended setups provided comparable reproducibility of donor faecal microbiota. The differences included higher diversity observed with MiSeq and increased capacity of MiSeq to detect Akkermansia muciniphila, [Odoribacteraceae], Erysipelotrichaceae and Ruminococcaceae (primarily Faecalibacterium prausnitzii). The results of our study could assist the investigators using NGS technologies to make informed decisions on appropriate tools for their experimental pipelines.
Ligation Bias in Illumina Next-Generation DNA Libraries: Implications for Sequencing Ancient Genomes

PubMed Central

Seguin-Orlando, Andaine; Schubert, Mikkel; Clary, Joel; Stagegaard, Julia; Alberdi, Maria T.; Prado, José Luis; Prieto, Alfredo; Willerslev, Eske; Orlando, Ludovic

2013-01-01

Ancient DNA extracts consist of a mixture of endogenous molecules and contaminant DNA templates, often originating from environmental microbes. These two populations of templates exhibit different chemical characteristics, with the former showing depurination and cytosine deamination by-products, resulting from post-mortem DNA damage. Such chemical modifications can interfere with the molecular tools used for building second-generation DNA libraries, and limit our ability to fully characterize the true complexity of ancient DNA extracts. In this study, we first use fresh DNA extracts to demonstrate that library preparation based on adapter ligation at AT-overhangs are biased against DNA templates starting with thymine residues, contrarily to blunt-end adapter ligation. We observe the same bias on fresh DNA extracts sheared on Bioruptor, Covaris and nebulizers. This contradicts previous reports suggesting that this bias could originate from the methods used for shearing DNA. This also suggests that AT-overhang adapter ligation efficiency is affected in a sequence-dependent manner and results in an uneven representation of different genomic contexts. We then show how this bias could affect the base composition of ancient DNA libraries prepared following AT-overhang ligation, mainly by limiting the ability to ligate DNA templates starting with thymines and therefore deaminated cytosines. This results in particular nucleotide misincorporation damage patterns, deviating from the signature generally expected for authenticating ancient sequence data. Consequently, we show that models adequate for estimating post-mortem DNA damage levels must be robust to the molecular tools used for building ancient DNA libraries. PMID:24205269
The cDNA sequence of a neutral horseradish peroxidase.

PubMed

Bartonek-Roxå, E; Eriksson, H; Mattiasson, B

1991-02-16

A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail and the deduced protein contains 327 amino acids which includes a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C) and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence. Compared to the earlier described HRP-C this is three glycosylation sites less. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight of several thousands less than the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, not earlier described, horseradish peroxidase group.
Improving accuracy of DNA diet estimates using food tissue control materials and an evaluation of proxies for digestion bias.

PubMed

Thomas, Austen C; Jarman, Simon N; Haman, Katherine H; Trites, Andrew W; Deagle, Bruce E

2014-08-01

Ecologists are increasingly interested in quantifying consumer diets based on food DNA in dietary samples and high-throughput sequencing of marker genes. It is tempting to assume that food DNA sequence proportions recovered from diet samples are representative of consumer's diet proportions, despite the fact that captive feeding studies do not support that assumption. Here, we examine the idea of sequencing control materials of known composition along with dietary samples in order to correct for technical biases introduced during amplicon sequencing and biological biases such as variable gene copy number. Using the Ion Torrent PGM(©) , we sequenced prey DNA amplified from scats of captive harbour seals (Phoca vitulina) fed a constant diet including three fish species in known proportions. Alongside, we sequenced a prey tissue mix matching the seals' diet to generate tissue correction factors (TCFs). TCFs improved the diet estimates (based on sequence proportions) for all species and reduced the average estimate error from 28 ± 15% (uncorrected) to 14 ± 9% (TCF-corrected). The experimental design also allowed us to infer the magnitude of prey-specific digestion biases and calculate digestion correction factors (DCFs). The DCFs were compared with possible proxies for differential digestion (e.g. fish protein%, fish lipid%) revealing a strong relationship between the DCFs and percent lipid of the fish prey, suggesting prey-specific corrections based on lipid content would produce accurate diet estimates in this study system. These findings demonstrate the value of parallel sequencing of food tissue mixtures in diet studies and offer new directions for future research in quantitative DNA diet analysis. © 2013 John Wiley & Sons Ltd.
Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

PubMed

Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi

2014-01-01

A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.
Simple sequence repeats in Escherichia coli: abundance, distribution, composition, and polymorphism.

PubMed

Gur-Arie, R; Cohen, C J; Eitan, Y; Shelef, L; Hallerman, E M; Kashi, Y

2000-01-01

Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.
Diversity of indoor fungi as revealed by DNA metabarcoding.

PubMed

Korpelainen, Helena; Pietiläinen, Maria

2017-01-01

In the present study, we conducted DNA metabarcoding (the nuclear ITS2 region) for indoor fungal samples originating from two nursery schools with a suspected mould problem (sampling before and after renovation), from two university buildings, and from an old farmhouse. Good-quality sequences were obtained, and the results showed that DNA metabarcoding provides high resolution in fungal identification. The pooled proportions of sequences representing filamentous ascomycetes, filamentous basidiomycetes, yeasts, and other fungi equalled 62.3%, 8.0%, 28.3%, and 1.4%, respectively, and the total number of fungal genera found during the study was 585. When comparing fungal diversities and taxonomic composition between different types of buildings, no obvious pattern was detected. The average pairwise values of Sørensen Chao indices that were used to compare similarities for taxon composition between samples among the samples from the two university buildings, two nurseries, and farmhouse equaled 0.693, 0.736, 0.852, 0.928, and 0.981, respectively, while the mean similarity index for all samples was 0.864. We discovered that making explicit conclusions on the relationship between the indoor air quality and mycoflora is complicated by the lack of appropriate indicators for air quality and by the occurrence of wide spatial and temporal changes in diversity and compositions among samples.
Superstatistical model of bacterial DNA architecture

NASA Astrophysics Data System (ADS)

Bogachev, Mikhail I.; Markelov, Oleg A.; Kayumov, Airat R.; Bunde, Armin

2017-02-01

Understanding the physical principles that govern the complex DNA structural organization as well as its mechanical and thermodynamical properties is essential for the advancement in both life sciences and genetic engineering. Recently we have discovered that the complex DNA organization is explicitly reflected in the arrangement of nucleotides depicted by the universal power law tailed internucleotide interval distribution that is valid for complete genomes of various prokaryotic and eukaryotic organisms. Here we suggest a superstatistical model that represents a long DNA molecule by a series of consecutive ~150 bp DNA segments with the alternation of the local nucleotide composition between segments exhibiting long-range correlations. We show that the superstatistical model and the corresponding DNA generation algorithm explicitly reproduce the laws governing the empirical nucleotide arrangement properties of the DNA sequences for various global GC contents and optimal living temperatures. Finally, we discuss the relevance of our model in terms of the DNA mechanical properties. As an outlook, we focus on finding the DNA sequences that encode a given protein while simultaneously reproducing the nucleotide arrangement laws observed from empirical genomes, that may be of interest in the optimization of genetic engineering of long DNA molecules.
A Sequence-Specific Nicking Endonuclease from Streptomyces: Purification, Physical and Catalytic Properties

PubMed Central

Somyoonsap, Peechapack; Kitpreechavanich, Vichein

2013-01-01

A sequence-specific nicking endonuclease from Streptomyces designated as DC13 was purified to near homogeneity. Starting with 30 grams of wet cells, the enzyme was purified by ammonium sulfate fractionation, DEAE cellulose, and phenyl-Sepharose chromatography. The purified protein had a specific activity 1000 units/mg and migrated on SDS-PAGE gel with an estimated molecular weight of 71 kDa. Determination of subunit composition by gel filtration chromatography indicated that the native enzyme is a monomer. When incubated with different DNA substrates including pBluescript II KS, pUC118, pET-15b, and pET-26b, the enzyme converted these supercoiled plasmids to a mixture of open circular and linear DNA products, with the open circular DNA as the major cleavage product. Analysis of the kinetic of DNA cleavage showed that the enzyme appeared to cleave super-coiled plasmid in two distinct steps: a rapid cleavage of super-coiled plasmid to an open circular DNA followed a much slower step to linear DNA. The DNA cleavage reaction of the enzyme required Mg2+ as a cofactor. Based on the monomeric nature of the enzyme, the kinetics of DNA cleavage exhibited by the enzyme, and cofactor requirement, it is suggested here that the purified enzyme is a sequence-specific nicking endonuclease that is similar to type IIS restriction endonuclease. PMID:25937959
Environmental DNA sequencing primers for eutardigrades and bdelloid rotifers

PubMed Central

2009-01-01

Background The time it takes to isolate individuals from environmental samples and then extract DNA from each individual is one of the problems with generating molecular data from meiofauna such as eutardigrades and bdelloid rotifers. The lack of consistent morphological information and the extreme abundance of these classes makes morphological identification of rare, or even common cryptic taxa a large and unwieldy task. This limits the ability to perform large-scale surveys of the diversity of these organisms. Here we demonstrate a culture-independent molecular survey approach that enables the generation of large amounts of eutardigrade and bdelloid rotifer sequence data directly from soil. Our PCR primers, specific to the 18s small-subunit rRNA gene, were developed for both eutardigrades and bdelloid rotifers. Results The developed primers successfully amplified DNA of their target organism from various soil DNA extracts. This was confirmed by both the BLAST similarity searches and phylogenetic analyses. Tardigrades showed much better phylogenetic resolution than bdelloids. Both groups of organisms exhibited varying levels of endemism. Conclusion The development of clade-specific primers for characterizing eutardigrades and bdelloid rotifers from environmental samples should greatly increase our ability to characterize the composition of these taxa in environmental samples. Environmental sequencing as shown here differs from other molecular survey methods in that there is no need to pre-isolate the organisms of interest from soil in order to amplify their DNA. The DNA sequences obtained from methods that do not require culturing can be identified post-hoc and placed phylogenetically as additional closely related sequences are obtained from morphologically identified conspecifics. Our non-cultured environmental sequence based approach will be able to provide a rapid and large-scale screening of the presence, absence and diversity of Bdelloidea and Eutardigrada in a variety of soils. PMID:20003362
Polyfluorophore Labels on DNA: Dramatic Sequence Dependence of Quenching

PubMed Central

Teo, Yin Nah; Wilson, James N.

2010-01-01

We describe studies carried out in the DNA context to test how a common fluorescence quencher, dabcyl, interacts with oligodeoxynu-cleoside fluorophores (ODFs)—a system of stacked, electronically interacting fluorophores built on a DNA scaffold. We tested twenty different tetrameric ODF sequences containing varied combinations and orderings of pyrene (Y), benzopyrene (B), perylene (E), dimethylaminostilbene (D), and spacer (S) monomers conjugated to the 3′ end of a DNA oligomer. Hybridization of this probe sequence to a dabcyl-labeled complementary strand resulted in strong quenching of fluorescence in 85% of the twenty ODF sequences. The high efficiency of quenching was also established by their large Stern–Volmer constants (KSV) of between 2.1 × 104 and 4.3 × 105M−1, measured with a free dabcyl quencher. Interestingly, quenching of ODFs displayed strong sequence dependence. This was particularly evident in anagrams of ODF sequences; for example, the sequence BYDS had a KSV that was approximately two orders of magnitude greater than that of BSDY, which has the same dye composition. Other anagrams, for example EDSY and ESYD, also displayed different responses upon quenching by dabcyl. Analysis of spectra showed that apparent excimer and exciplex emission bands were quenched with much greater efficiency compared to monomer emission bands by at least an order of magnitude. This suggests an important role played by delocalized excited states of the π stack of fluorophores in the amplified quenching of fluorescence. PMID:19780115
SPRi-based biosensing platforms for detection of specific DNA sequences using thiolate and dithiocarbamate assemblies

NASA Astrophysics Data System (ADS)

Drozd, Marcin; Pietrzak, Mariusz D.; Malinowska, Elżbieta

2018-05-01

The framework of presented study covers the development and examination of the analytical performance of surface plasmon resonance-based (SPR) DNA biosensors dedicated for a detection of model target oligonucleotide sequence. For this aim, various strategies of immobilization of DNA probes on gold transducers were tested. Besides the typical approaches: chemisorption of thiolated ssDNA (DNA-thiol) and physisorption of non-functionalized oligonucleotides, relatively new method based on chemisorption of dithiocarbamate-functionalized ssDNA (DNA-DTC) was applied for the first time for preparation of DNA-based SPR biosensor. The special emphasis was put on the correlation between the method of DNA immobilization and the composition of obtained receptor layer. The carried out studies focused on the examination of the capability of developed receptors layers to interact with both target DNA and DNA-functionalized AuNPs. It was found, that the detection limit of target DNA sequence (27 nb length) depends on the strategy of probe immobilization and backfilling method, and in the best case it amounted to 0,66 nM. Moreover, the application of ssDNA-functionalized gold nanoparticles (AuNPs) as plasmonic labels for secondary enhancement of SPR response is presented. The influence of spatial organization and surface density of a receptor layer on the ability to interact with DNA-functionalized AuNPs is discussed. Due to the best compatibility of receptors immobilized via DTC chemisorption: 1.47 ± 0.4 ·1012 molecules • cm-2 (with the calculated area occupied by single nanoparticle label of 132.7 nm2), DNA chemisorption based on DTCs is pointed as especially promising for DNA biosensors utilizing indirect detection in competitive assays.
SPRi-Based Biosensing Platforms for Detection of Specific DNA Sequences Using Thiolate and Dithiocarbamate Assemblies.

PubMed

Drozd, Marcin; Pietrzak, Mariusz D; Malinowska, Elżbieta

2018-01-01

The framework of presented study covers the development and examination of the analytical performance of surface plasmon resonance-based (SPR) DNA biosensors dedicated for a detection of model target oligonucleotide sequence. For this aim, various strategies of immobilization of DNA probes on gold transducers were tested. Besides the typical approaches: chemisorption of thiolated ssDNA (DNA-thiol) and physisorption of non-functionalized oligonucleotides, relatively new method based on chemisorption of dithiocarbamate-functionalized ssDNA (DNA-DTC) was applied for the first time for preparation of DNA-based SPR biosensor. The special emphasis was put on the correlation between the method of DNA immobilization and the composition of obtained receptor layer. The carried out studies focused on the examination of the capability of developed receptors layers to interact with both target DNA and DNA-functionalized AuNPs. It was found, that the detection limit of target DNA sequence (27 nb length) depends on the strategy of probe immobilization and backfilling method, and in the best case it amounted to 0.66 nM. Moreover, the application of ssDNA-functionalized gold nanoparticles (AuNPs) as plasmonic labels for secondary enhancement of SPR response is presented. The influence of spatial organization and surface density of a receptor layer on the ability to interact with DNA-functionalized AuNPs is discussed. Due to the best compatibility of receptors immobilized via DTC chemisorption: 1.47 ± 0.4 · 10 12 molecules · cm -2 (with the calculated area occupied by single nanoparticle label of ~132.7 nm 2 ), DNA chemisorption based on DTCs is pointed as especially promising for DNA biosensors utilizing indirect detection in competitive assays.
Shotgun Bisulfite Sequencing of the Betula platyphylla Genome Reveals the Tree’s DNA Methylation Patterning

PubMed Central

Su, Chang; Wang, Chao; He, Lin; Yang, Chuanping; Wang, Yucheng

2014-01-01

DNA methylation plays a critical role in the regulation of gene expression. Most studies of DNA methylation have been performed in herbaceous plants, and little is known about the methylation patterns in tree genomes. In the present study, we generated a map of methylated cytosines at single base pair resolution for Betula platyphylla (white birch) by bisulfite sequencing combined with transcriptomics to analyze DNA methylation and its effects on gene expression. We obtained a detailed view of the function of DNA methylation sequence composition and distribution in the genome of B. platyphylla. There are 34,460 genes in the whole genome of birch, and 31,297 genes are methylated. Conservatively, we estimated that 14.29% of genomic cytosines are methylcytosines in birch. Among the methylation sites, the CHH context accounts for 48.86%, and is the largest proportion. Combined transcriptome and methylation analysis showed that the genes with moderate methylation levels had higher expression levels than genes with high and low methylation. In addition, methylated genes are highly enriched for the GO subcategories of binding activities, catalytic activities, cellular processes, response to stimulus and cell death, suggesting that methylation mediates these pathways in birch trees. PMID:25514241
Localization of Action of the Is50-Encoded Transposase Protein

PubMed Central

Phadnis, Suhas H.; Sasakawa, Chihiro; Berg, Douglas E.

1986-01-01

The movement of the bacterial insertion sequence IS50 and of composite elements containing direct terminal repeats of IS50 involves the two ends of IS50, designated O (outside) and I (inside), which are weakly matched in DNA sequence, and an IS50 encoded protein, transposase, which recognizes the O and I ends and acts preferentially in cis. Previous data had suggested that, initially, transposase interacts preferentially with the O end sequence and then, in a second step, with either an O or an I end. To better understand the cis action of transposase and how IS50 ends are selected, we generated a series of composite transposons which contain direct repeats of IS50 elements. In each transposon, one IS50 element encoded transposase (tnp +), and the other contained a null (tnp-) allele. In each of the five sets of composite transposons studied, the transposon for which the tnp+ IS50 element contained its O end was more active than a complementary transposon for which the tnp - IS50 element contained its O end. This pattern of O end use suggests models in which the cis action of transposase and its choice of ends is determined by protein tracking along DNA molecules. PMID:3007274

Mitochondrial DNA repairs double-strand breaks in yeast chromosomes.

PubMed

Ricchetti, M; Fairhead, C; Dujon, B

1999-11-04

The endosymbiotic theory for the origin of eukaryotic cells proposes that genetic information can be transferred from mitochondria to the nucleus of a cell, and genes that are probably of mitochondrial origin have been found in nuclear chromosomes. Occasionally, short or rearranged sequences homologous to mitochondrial DNA are seen in the chromosomes of different organisms including yeast, plants and humans. Here we report a mechanism by which fragments of mitochondrial DNA, in single or tandem array, are transferred to yeast chromosomes under natural conditions during the repair of double-strand breaks in haploid mitotic cells. These repair insertions originate from noncontiguous regions of the mitochondrial genome. Our analysis of the Saccharomyces cerevisiae mitochondrial genome indicates that the yeast nuclear genome does indeed contain several short sequences of mitochondrial origin which are similar in size and composition to those that repair double-strand breaks. These sequences are located predominantly in non-coding regions of the chromosomes, frequently in the vicinity of retrotransposon long terminal repeats, and appear as recent integration events. Thus, colonization of the yeast genome by mitochondrial DNA is an ongoing process.
Characterization of Satellite DNA Sequences from the Commercially Important Marine Rotifers Brachionus rotundiformis and Brachionus plicatilis.

PubMed

Boehm; Gibson; Lubzens

2000-01-01

This study was initiated to search for species-specific and strain-specific satellite DNA sequences for which oligonucleotide primers could be designed to differentiate between various commercially important strains of the marine monogonont rotifers Brachionus rotundiformis and Brachionus plicatilis. Two unrelated, highly reiterated satellite sequences were cloned and characterized. The eight sequenced monomers from B. rotundiformis and six from B. plicatilis had low intrarepeat variability and were similar in their overall lengths, A + T compositions, and high degrees of repeated motif substructure. However, hybridizations to 19 representative strains, sequence characterizations, and GenBank searches indicated that these two satellites are morphotype-specific and population-specific, respectively, and share little homology to each other or to other characterized sequences in the database. Primer pairs designed for the B. rotundiformis satellite confirmed hybridization specificities on polymerase chain reaction and could serve as a useful molecular diagnostic tool to identify strains belonging to the SS morphotype, which are gaining widespread usage as first feeds for marine fish in commercial production.
Chromosome Evolution in Connection with Repetitive Sequences and Epigenetics in Plants.

PubMed

Li, Shu-Fen; Su, Ting; Cheng, Guang-Qian; Wang, Bing-Xiao; Li, Xu; Deng, Chuan-Liang; Gao, Wu-Jun

2017-10-24

Chromosome evolution is a fundamental aspect of evolutionary biology. The evolution of chromosome size, structure and shape, number, and the change in DNA composition suggest the high plasticity of nuclear genomes at the chromosomal level. Repetitive DNA sequences, which represent a conspicuous fraction of every eukaryotic genome, particularly in plants, are found to be tightly linked with plant chromosome evolution. Different classes of repetitive sequences have distinct distribution patterns on the chromosomes. Mounting evidence shows that repetitive sequences may play multiple generative roles in shaping the chromosome karyotypes in plants. Furthermore, recent development in our understanding of the repetitive sequences and plant chromosome evolution has elucidated the involvement of a spectrum of epigenetic modification. In this review, we focused on the recent evidence relating to the distribution pattern of repetitive sequences in plant chromosomes and highlighted their potential relevance to chromosome evolution in plants. We also discussed the possible connections between evolution and epigenetic alterations in chromosome structure and repatterning, such as heterochromatin formation, centromere function, and epigenetic-associated transposable element inactivation.
DNA Data Visualization (DDV): Software for Generating Web-Based Interfaces Supporting Navigation and Analysis of DNA Sequence Data of Entire Genomes.

PubMed

Neugebauer, Tomasz; Bordeleau, Eric; Burrus, Vincent; Brzezinski, Ryszard

2015-01-01

Data visualization methods are necessary during the exploration and analysis activities of an increasingly data-intensive scientific process. There are few existing visualization methods for raw nucleotide sequences of a whole genome or chromosome. Software for data visualization should allow the researchers to create accessible data visualization interfaces that can be exported and shared with others on the web. Herein, novel software developed for generating DNA data visualization interfaces is described. The software converts DNA data sets into images that are further processed as multi-scale images to be accessed through a web-based interface that supports zooming, panning and sequence fragment selection. Nucleotide composition frequencies and GC skew of a selected sequence segment can be obtained through the interface. The software was used to generate DNA data visualization of human and bacterial chromosomes. Examples of visually detectable features such as short and long direct repeats, long terminal repeats, mobile genetic elements, heterochromatic segments in microbial and human chromosomes, are presented. The software and its source code are available for download and further development. The visualization interfaces generated with the software allow for the immediate identification and observation of several types of sequence patterns in genomes of various sizes and origins. The visualization interfaces generated with the software are readily accessible through a web browser. This software is a useful research and teaching tool for genetics and structural genomics.
The blood DNA virome in 8,000 humans.

PubMed

Moustafa, Ahmed; Xie, Chao; Kirkness, Ewen; Biggs, William; Wong, Emily; Turpaz, Yaron; Bloom, Kenneth; Delwart, Eric; Nelson, Karen E; Venter, J Craig; Telenti, Amalio

2017-03-01

The characterization of the blood virome is important for the safety of blood-derived transfusion products, and for the identification of emerging pathogens. We explored non-human sequence data from whole-genome sequencing of blood from 8,240 individuals, none of whom were ascertained for any infectious disease. Viral sequences were extracted from the pool of sequence reads that did not map to the human reference genome. Analyses sifted through close to 1 Petabyte of sequence data and performed 0.5 trillion similarity searches. With a lower bound for identification of 2 viral genomes/100,000 cells, we mapped sequences to 94 different viruses, including sequences from 19 human DNA viruses, proviruses and RNA viruses (herpesviruses, anelloviruses, papillomaviruses, three polyomaviruses, adenovirus, HIV, HTLV, hepatitis B, hepatitis C, parvovirus B19, and influenza virus) in 42% of the study participants. Of possible relevance to transfusion medicine, we identified Merkel cell polyomavirus in 49 individuals, papillomavirus in blood of 13 individuals, parvovirus B19 in 6 individuals, and the presence of herpesvirus 8 in 3 individuals. The presence of DNA sequences from two RNA viruses was unexpected: Hepatitis C virus is revealing of an integration event, while the influenza virus sequence resulted from immunization with a DNA vaccine. Age, sex and ancestry contributed significantly to the prevalence of infection. The remaining 75 viruses mostly reflect extensive contamination of commercial reagents and from the environment. These technical problems represent a major challenge for the identification of novel human pathogens. Increasing availability of human whole-genome sequences will contribute substantial amounts of data on the composition of the normal and pathogenic human blood virome. Distinguishing contaminants from real human viruses is challenging.
Simulation studies of DNA at the nanoscale: Interactions with proteins, polycations, and surfaces

NASA Astrophysics Data System (ADS)

Elder, Robert M.

Understanding the nanoscale interactions of DNA, a multifunctional biopolymer with sequence-dependent properties, with other biological and synthetic substrates and molecules is essential to advancing these technologies. This doctoral thesis research is aimed at understanding the thermodynamics and molecular-level structure when DNA interacts with proteins, polycations, and functionalized surfaces. First, we investigate the ability of a DNA damage recognition protein (HMGB1a) to bind to anti-cancer drug-induced DNA damage, seeking to explain how HMGB1a differentiates between the drugs in vivo. Using atomistic molecular dynamics simulations, we show that the structure of the drug-DNA molecule exhibits drug- and base sequence-dependence that explains some of the experimentally observed differential recognition of the drugs in various sequence contexts. Then, we show how steric hindrance from the drug decreases the deformability of the drug-DNA molecule, which decreases recognition by the protein, a concept that can be applied to rational drug design. Second, we study how polycation architecture and chemistry affect polycation-DNA binding so as to design optimal polycations for high efficiency gene (DNA) delivery. Using a multiscale computational approach involving atomistic and coarse-grained simulations, we examine how rearranging polylysine from a linear to a grafted architecture, and several aspects of the grafted architecture, affect polycation-DNA binding and the structure of polycation-DNA complexes. Next, going beyond lysine we examine how oligopeptide chemistry and sequence in the grafted architecture affects polycation-DNA binding and find that strategic placement of hydrophobic peptides might be used to tailor binding strength. Third, we study the adsorption and conformations of single-stranded DNA (an amphiphilic biopolymer) on model hydrophilic and hydrophobic surfaces. Short ssDNA oligomers adsorb to both surfaces with similar strength, with the strength of adsorption to the hydrophobic surface depending on the composition of the DNA strands, i.e. purine or pyrimidine bases. Additionally, DNA-surface and DNA-water interactions near the surfaces govern the adsorption. For longer ssDNA oligomers, the effects of surface chemistry and temperature on ssDNA conformations are rather small, but either the hydrophilic surface or increased temperature favor slightly more compact conformations due to energetic and entropic effects, respectively.
Weighing the mass spectrometric evidence for authentic Tyrannosaurus rex collagen

PubMed Central

Buckley, Mike; Walker, Angela; Ho, Simon Y. W.; Yang, Yue; Smith, Colin; Ashton, Peter; Oates, Jane Thomas; Cappellini, Enrico; Koon, Hannah; Penkman, Kirsty; Elsworth, Ben; Ashford, Dave; Solazzo, Caroline; Andrews, Phil; Strahler, John; Shapiro, Beth; Ostrom, Peggy; Gandhi, Hasand; Miller, Webb; Raney, Brian; Zylber, Maria Ines; Gilbert, M. Thomas P.; Prigodich, Richard V.; Ryan, Michael; Rijsdijk, Kenneth F.; Janoo, Anwar; Collins, Matthew J.

2009-01-01

We use authentication tests developed for ancient DNA to evaluate claims by Asara et al. of collagen peptide sequences recovered from mastodon and Tyrannosaurus rex fossils. Although the mastodon passes, absence of amino acid composition data, lack of evidence for peptide deamidation, and association of the α1(I) peptide sequences with amphibians not birds, suggests that T. rex does not. PMID:18174420
DNA Microarray Profiling of a Diverse Collection of Nosocomial Methicillin-Resistant Staphylococcus aureus Isolates Assigns the Majority to the Correct Sequence Type and Staphylococcal Cassette Chromosome mec (SCCmec) Type and Results in the Subsequent Identification and Characterization of Novel SCCmec-SCCM1 Composite Islands

PubMed Central

Brennan, Orla M.; Deasy, Emily C.; Rossney, Angela S.; Kinnevey, Peter M.; Ehricht, Ralf; Monecke, Stefan; Coleman, David C.

2012-01-01

One hundred seventy-five isolates representative of methicillin-resistant Staphylococcus aureus (MRSA) clones that predominated in Irish hospitals between 1971 and 2004 and that previously underwent multilocus sequence typing (MLST) and staphylococcal cassette chromosome mec (SCCmec) typing were characterized by spa typing (175 isolates) and DNA microarray profiling (107 isolates). The isolates belonged to 26 sequence type (ST)-SCCmec types and subtypes and 35 spa types. The array assigned all isolates to the correct MLST clonal complex (CC), and 94% (100/107) were assigned an ST, with 98% (98/100) correlating with MLST. The array assigned all isolates to the correct SCCmec type, but subtyping of only some SCCmec elements was possible. Additional SCCmec/SCC genes or DNA sequence variation not detected by SCCmec typing was detected by array profiling, including the SCC-fusidic acid resistance determinant Q6GD50/fusC. Novel SCCmec/SCC composite islands (CIs) were detected among CC8 isolates and comprised SCCmec IIA-IIE, IVE, IVF, or IVg and a ccrAB4-SCC element with 99% DNA sequence identity to SCCM1 from ST8/t024-MRSA, SCCmec VIII, and SCC-CI in Staphylococcus epidermidis. The array showed that the majority of isolates harbored one or more superantigen (94%; 100/107) and immune evasion cluster (91%; 97/107) genes. Apart from fusidic acid and trimethoprim resistance, the correlation between isolate antimicrobial resistance phenotype and the presence of specific resistance genes was ≥97%. Array profiling allowed high-throughput, accurate assignment of MRSA to CCs/STs and SCCmec types and provided further evidence of the diversity of SCCmec/SCC. In most cases, array profiling can accurately predict the resistance phenotype of an isolate. PMID:22869569
A general method to eliminate laboratory induced recombinants during massive, parallel sequencing of cDNA library.

PubMed

Waugh, Caryll; Cromer, Deborah; Grimm, Andrew; Chopra, Abha; Mallal, Simon; Davenport, Miles; Mak, Johnson

2015-04-09

Massive, parallel sequencing is a potent tool for dissecting the regulation of biological processes by revealing the dynamics of the cellular RNA profile under different conditions. Similarly, massive, parallel sequencing can be used to reveal the complexity of viral quasispecies that are often found in the RNA virus infected host. However, the production of cDNA libraries for next-generation sequencing (NGS) necessitates the reverse transcription of RNA into cDNA and the amplification of the cDNA template using PCR, which may introduce artefact in the form of phantom nucleic acids species that can bias the composition and interpretation of original RNA profiles. Using HIV as a model we have characterised the major sources of error during the conversion of viral RNA to cDNA, namely excess RNA template and the RNaseH activity of the polymerase enzyme, reverse transcriptase. In addition we have analysed the effect of PCR cycle on detection of recombinants and assessed the contribution of transfection of highly similar plasmid DNA to the formation of recombinant species during the production of our control viruses. We have identified RNA template concentrations, RNaseH activity of reverse transcriptase, and PCR conditions as key parameters that must be carefully optimised to minimise chimeric artefacts. Using our optimised RT-PCR conditions, in combination with our modified PCR amplification procedure, we have developed a reliable technique for accurate determination of RNA species using NGS technology.
iDNA-Prot: Identification of DNA Binding Proteins Using Random Forest with Grey Model

PubMed Central

Lin, Wei-Zhong; Fang, Jian-An; Xiao, Xuan; Chou, Kuo-Chen

2011-01-01

DNA-binding proteins play crucial roles in various cellular processes. Developing high throughput tools for rapidly and effectively identifying DNA-binding proteins is one of the major challenges in the field of genome annotation. Although many efforts have been made in this regard, further effort is needed to enhance the prediction power. By incorporating the features into the general form of pseudo amino acid composition that were extracted from protein sequences via the “grey model” and by adopting the random forest operation engine, we proposed a new predictor, called iDNA-Prot, for identifying uncharacterized proteins as DNA-binding proteins or non-DNA binding proteins based on their amino acid sequences information alone. The overall success rate by iDNA-Prot was 83.96% that was obtained via jackknife tests on a newly constructed stringent benchmark dataset in which none of the proteins included has pairwise sequence identity to any other in a same subset. In addition to achieving high success rate, the computational time for iDNA-Prot is remarkably shorter in comparison with the relevant existing predictors. Hence it is anticipated that iDNA-Prot may become a useful high throughput tool for large-scale analysis of DNA-binding proteins. As a user-friendly web-server, iDNA-Prot is freely accessible to the public at the web-site on http://icpr.jci.edu.cn/bioinfo/iDNA-Prot or http://www.jci-bioinfo.cn/iDNA-Prot. Moreover, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results. PMID:21935457
Single-cell paired-end genome sequencing reveals structural variation per cell cycle

PubMed Central

Voet, Thierry; Kumar, Parveen; Van Loo, Peter; Cooke, Susanna L.; Marshall, John; Lin, Meng-Lay; Zamani Esteki, Masoud; Van der Aa, Niels; Mateiu, Ligia; McBride, David J.; Bignell, Graham R.; McLaren, Stuart; Teague, Jon; Butler, Adam; Raine, Keiran; Stebbings, Lucy A.; Quail, Michael A.; D’Hooghe, Thomas; Moreau, Yves; Futreal, P. Andrew; Stratton, Michael R.; Vermeesch, Joris R.; Campbell, Peter J.

2013-01-01

The nature and pace of genome mutation is largely unknown. Because standard methods sequence DNA from populations of cells, the genetic composition of individual cells is lost, de novo mutations in cells are concealed within the bulk signal and per cell cycle mutation rates and mechanisms remain elusive. Although single-cell genome analyses could resolve these problems, such analyses are error-prone because of whole-genome amplification (WGA) artefacts and are limited in the types of DNA mutation that can be discerned. We developed methods for paired-end sequence analysis of single-cell WGA products that enable (i) detecting multiple classes of DNA mutation, (ii) distinguishing DNA copy number changes from allelic WGA-amplification artefacts by the discovery of matching aberrantly mapping read pairs among the surfeit of paired-end WGA and mapping artefacts and (iii) delineating the break points and architecture of structural variants. By applying the methods, we capture DNA copy number changes acquired over one cell cycle in breast cancer cells and in blastomeres derived from a human zygote after in vitro fertilization. Furthermore, we were able to discover and fine-map a heritable inter-chromosomal rearrangement t(1;16)(p36;p12) by sequencing a single blastomere. The methods will expedite applications in basic genome research and provide a stepping stone to novel approaches for clinical genetic diagnosis. PMID:23630320
Molecular analysis of meso- and thermophilic microbiota associated with anaerobic biowaste degradation

PubMed Central

2012-01-01

Background Microbial anaerobic digestion (AD) is used as a waste treatment process to degrade complex organic compounds into methane. The archaeal and bacterial taxa involved in AD are well known, whereas composition of the fungal community in the process has been less studied. The present study aimed to reveal the composition of archaeal, bacterial and fungal communities in response to increasing organic loading in mesophilic and thermophilic AD processes by applying 454 amplicon sequencing technology. Furthermore, a DNA microarray method was evaluated in order to develop a tool for monitoring the microbiological status of AD. Results The 454 sequencing showed that the diversity and number of bacterial taxa decreased with increasing organic load, while archaeal i.e. methanogenic taxa remained more constant. The number and diversity of fungal taxa increased during the process and varied less in composition with process temperature than bacterial and archaeal taxa, even though the fungal diversity increased with temperature as well. Evaluation of the microarray using AD sample DNA showed correlation of signal intensities with sequence read numbers of corresponding target groups. The sensitivity of the test was found to be about 1%. Conclusions The fungal community survives in anoxic conditions and grows with increasing organic loading, suggesting that Fungi may contribute to the digestion by metabolising organic nutrients for bacterial and methanogenic groups. The microarray proof of principle tests suggest that the method has the potential for semiquantitative detection of target microbial groups given that comprehensive sequence data is available for probe design. PMID:22727142
Complexity: an internet resource for analysis of DNA sequence complexity

PubMed Central

Orlov, Y. L.; Potapov, V. N.

2004-01-01

The search for DNA regions with low complexity is one of the pivotal tasks of modern structural analysis of complete genomes. The low complexity may be preconditioned by strong inequality in nucleotide content (biased composition), by tandem or dispersed repeats or by palindrome-hairpin structures, as well as by a combination of all these factors. Several numerical measures of textual complexity, including combinatorial and linguistic ones, together with complexity estimation using a modified Lempel–Ziv algorithm, have been implemented in a software tool called ‘Complexity’ (http://wwwmgs.bionet.nsc.ru/mgs/programs/low_complexity/). The software enables a user to search for low-complexity regions in long sequences, e.g. complete bacterial genomes or eukaryotic chromosomes. In addition, it estimates the complexity of groups of aligned sequences. PMID:15215465
DNA polymorphism sensitive impedimetric detection on gold-nanoislands modified electrodes.

PubMed

Bonanni, Alessandra; Pividori, Maria Isabel; del Valle, Manel

2015-05-01

Nanocomposite materials are being increasingly used in biosensing applications as they can significantly improve biosensor performance. Here we report the use of a novel impedimetric genosensor based on gold nanoparticles graphite-epoxy nanocomposite (nanoAu-GEC) for the detection of triple base mutation deletion in a cystic-fibrosis (CF) related human DNA sequence. The developed platform consists of chemisorbing gold nano-islands surrounded by rigid, non-chemisorbing, and conducting graphite-epoxy composite. The ratio of the gold nanoparticles in the composite was carefully optimized by electrochemical and microscopy studies. Such platform allows the very fast and stable thiol immobilization of DNA probes on the gold islands, thus minimizing the steric and electrostatic repulsion among the DNA probes and improving the detection of DNA polymorphism down to 2.25fmol by using electrochemical impedance spectroscopy. These findings are very important in order to develop new and renewable platforms to be used in point-of-care devices for the detection of biomolecules. Copyright © 2015 Elsevier B.V. All rights reserved.
Accurate phylogenetic classification of DNA fragments based onsequence composition

DOE Office of Scientific and Technical Information (OSTI.GOV)

McHardy, Alice C.; Garcia Martin, Hector; Tsirigos, Aristotelis

2006-05-01

Metagenome studies have retrieved vast amounts of sequenceout of a variety of environments, leading to novel discoveries and greatinsights into the uncultured microbial world. Except for very simplecommunities, diversity makes sequence assembly and analysis a verychallenging problem. To understand the structure a 5 nd function ofmicrobial communities, a taxonomic characterization of the obtainedsequence fragments is highly desirable, yet currently limited mostly tothose sequences that contain phylogenetic marker genes. We show that forclades at the rank of domain down to genus, sequence composition allowsthe very accurate phylogenetic 10 characterization of genomic sequence.We developed a composition-based classifier, PhyloPythia, for de novophylogenetic sequencemore » characterization and have trained it on adata setof 340 genomes. By extensive evaluation experiments we show that themethodis accurate across all taxonomic ranks considered, even forsequences that originate fromnovel organisms and are as short as 1kb.Application to two metagenome datasets 15 obtained from samples ofphosphorus-removing sludge showed that the method allows the accurateclassification at genus level of most sequence fragments from thedominant populations, while at the same time correctly characterizingeven larger parts of the samples at higher taxonomic levels.« less
Universality of long-range correlations in expansion randomization systems

NASA Astrophysics Data System (ADS)

Messer, P. W.; Lässig, M.; Arndt, P. F.

2005-10-01

We study the stochastic dynamics of sequences evolving by single-site mutations, segmental duplications, deletions, and random insertions. These processes are relevant for the evolution of genomic DNA. They define a universality class of non-equilibrium 1D expansion-randomization systems with generic stationary long-range correlations in a regime of growing sequence length. We obtain explicitly the two-point correlation function of the sequence composition and the distribution function of the composition bias in sequences of finite length. The characteristic exponent χ of these quantities is determined by the ratio of two effective rates, which are explicitly calculated for several specific sequence evolution dynamics of the universality class. Depending on the value of χ, we find two different scaling regimes, which are distinguished by the detectability of the initial composition bias. All analytic results are accurately verified by numerical simulations. We also discuss the non-stationary build-up and decay of correlations, as well as more complex evolutionary scenarios, where the rates of the processes vary in time. Our findings provide a possible example for the emergence of universality in molecular biology.
Molecular Survey of Concrete Sewer Biofilm Microbial Communities

EPA Science Inventory

Although bacteria are implicated in deteriorating concrete structures, there is very little information on the composition of concrete microbial communities. To this end, we studied different concrete biofilms by performing sequence analysis of 16S rDNA concrete clone libraries. ...
Intercalation of XR5944 with the estrogen response element is modulated by the tri-nucleotide spacer sequence between half-sites

PubMed Central

Sidell, Neil; Mathad, Raveendra I.; Shu, Feng-jue; Zhang, Zhenjiang; Kallen, Caleb B.; Yang, Danzhou

2011-01-01

DNA-intercalating molecules can impair DNA replication, DNA repair, and gene transcription. We previously demonstrated that XR5944, a DNA bis-intercalator, specifically blocks binding of estrogen receptor-α (ERα) to the consensus estrogen response element (ERE). The consensus ERE sequence is AGGTCAnnnTGACCT, where nnn is known as the tri-nucleotide spacer. Recent work has shown that the tri-nucleotide spacer can modulate ERα-ERE binding affinity and ligand-mediated transcriptional responses. To further understand the mechanism by which XR5944 inhibits ERα-ERE binding, we tested its ability to interact with consensus EREs with variable tri-nucleotide spacer sequences and with natural but non-consensus ERE sequences using one dimensional nuclear magnetic resonance (1D 1H NMR) titration studies. We found that the tri-nucleotide spacer sequence significantly modulates the binding of XR5944 to EREs. Of the sequences that were tested, EREs with CGG and AGG spacers showed the best binding specificity with XR5944, while those spaced with TTT demonstrated the least specific binding. The binding stoichiometry of XR5944 with EREs was 2:1, which can explain why the spacer influences the drug-DNA interaction; each XR5944 spans four nucleotides (including portions of the spacer) when intercalating with DNA. To validate our NMR results, we conducted functional studies using reporter constructs containing consensus EREs with tri-nucleotide spacers CGG, CTG, and TTT. Results of reporter assays in MCF-7 cells indicated that XR5944 was significantly more potent in inhibiting the activity of CGG- than TTT-spaced EREs, consistent with our NMR results. Taken together, these findings predict that the anti-estrogenic effects of XR5944 will depend not only on ERE half-site composition but also on the tri-nucleotide spacer sequence of EREs located in the promoters of estrogen-responsive genes. PMID:21333738
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.

PubMed

Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav

2010-09-16

Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection.
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing

PubMed Central

2010-01-01

Background Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. Results In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. Conclusion A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection. PMID:20846365

Soil drying procedure affects the DNA quantification of Lactarius vinosus but does not change the fungal community composition.

PubMed

Castaño, Carles; Parladé, Javier; Pera, Joan; Martínez de Aragón, Juan; Alday, Josu G; Bonet, José Antonio

2016-11-01

Drying soil samples before DNA extraction is commonly used for specific fungal DNA quantification and metabarcoding studies, but the impact of different drying procedures on both the specific fungal DNA quantity and the fungal community composition has not been analyzed. We tested three different drying procedures (freeze-drying, oven-drying, and room temperature) on 12 different soil samples to determine (a) the soil mycelium biomass of the ectomycorrhizal species Lactarius vinosus using qPCR with a specifically designed TaqMan® probe and (b) the fungal community composition and diversity using the PacBio® RS II sequencing platform. Mycelium biomass of L. vinosus was significantly greater in the freeze-dried soil samples than in samples dried at oven and room temperature. However, drying procedures had no effect on fungal community composition or on fungal diversity. In addition, there were no significant differences in the proportions of fungi according to their functional roles (moulds vs. mycorrhizal species) in response to drying procedures. Only six out of 1139 operational taxonomic units (OTUs) had increased their relative proportions after soil drying at room temperature, with five of these OTUs classified as mould or yeast species. However, the magnitude of these changes was small, with an overall increase in relative abundance of these OTUs of approximately 2 %. These results suggest that DNA degradation may occur especially after drying soil samples at room temperature, but affecting equally nearly all fungi and therefore causing no significant differences in diversity and community composition. Despite the minimal effects caused by the drying procedures at the fungal community composition, freeze-drying resulted in higher concentrations of L. vinosus DNA and prevented potential colonization from opportunistic species.
Improved serial analysis of V1 ribosomal sequence tags (SARST-V1) provides a rapid, comprehensive, sequence-based characterization of bacterial diversity and community composition.

PubMed

Yu, Zhongtang; Yu, Marie; Morrison, Mark

2006-04-01

Serial analysis of ribosomal sequence tags (SARST) is a recently developed technology that can generate large 16S rRNA gene (rrs) sequence data sets from microbiomes, but there are numerous enzymatic and purification steps required to construct the ribosomal sequence tag (RST) clone libraries. We report here an improved SARST method, which still targets the V1 hypervariable region of rrs genes, but reduces the number of enzymes, oligonucleotides, reagents, and technical steps needed to produce the RST clone libraries. The new method, hereafter referred to as SARST-V1, was used to examine the eubacterial diversity present in community DNA recovered from the microbiome resident in the ovine rumen. The 190 sequenced clones contained 1055 RSTs and no less than 236 unique phylotypes (based on > or = 95% sequence identity) that were assigned to eight different eubacterial phyla. Rarefaction and monomolecular curve analyses predicted that the complete RST clone library contains 99% of the 353 unique phylotypes predicted to exist in this microbiome. When compared with ribosomal intergenic spacer analysis (RISA) of the same community DNA sample, as well as a compilation of nine previously published conventional rrs clone libraries prepared from the same type of samples, the RST clone library provided a more comprehensive characterization of the eubacterial diversity present in rumen microbiomes. As such, SARST-V1 should be a useful tool applicable to comprehensive examination of diversity and composition in microbiomes and offers an affordable, sequence-based method for diversity analysis.
Simple Sequence Repeats in Escherichia coli: Abundance, Distribution, Composition, and Polymorphism

PubMed Central

Gur-Arie, Riva; Cohen, Cyril J.; Eitan, Yuval; Shelef, Leora; Hallerman, Eric M.; Kashi, Yechezkel

2000-01-01

Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.[The sequence data described in this paper have been submitted to the GenBank data library under accession numbers AF209020–209030 and AF209508–209518.] PMID:10645951
Molecular characterization of phototrophic microorganisms in the forefield of a receding glacier in the Swiss Alps

NASA Astrophysics Data System (ADS)

Frey, Beat; Bühler, Lukas; Schmutz, Stefan; Zumsteg, Anita; Furrer, Gerhard

2013-03-01

Recently deglaciated areas are ideal environments to study soil formation and primary microbial succession where phototrophic microorganisms may play a role as primary producers. The aim of our study was to investigate the cyanobacterial and green algal community composition in three different successional stages of the Damma glacier forefield in the Swiss Alps using 16S rDNA and ITS rDNA clone libraries. Cyanobacterial target sequences varied along the glacier forefield, with the highest cyanobacterial 16S rRNA gene copies found in sparsely vegetated soils. Sequence analysis revealed that the phototrophic communities were distinct in each of the three soil environments. The majority of the cyanobacterial sequences retrieved from barren soils were related to the Oscillatoriales. The diversity in sparsely vegetated soils was low, and sequences closely related to Nostoc sp. dominated. The majority of the algal phylotypes are related to members of the Trebouxiophyceae known to live as symbiotic partners in lichens. We conclude that the community composition appears to shift markedly along the chronosequence, indicating that each soil environment selects for its phototrophic community. When cyanobacteria occur together with eukaryotic microalgae, they form a rich source of organic matter and may be important contributors of carbon in nutrient-deficient deglaciated soils.
Community Composition and Transcriptional Activity of Ammonia-Oxidizing Prokaryotes of Seagrass Thalassia hemprichii in Coral Reef Ecosystems.

PubMed

Ling, Juan; Lin, Xiancheng; Zhang, Yanying; Zhou, Weiguo; Yang, Qingsong; Lin, Liyun; Zeng, Siquan; Zhang, Ying; Wang, Cong; Ahmad, Manzoor; Long, Lijuan; Dong, Junde

2018-01-01

Seagrasses in coral reef ecosystems play important ecological roles by enhancing coral reef resilience under ocean acidification. However, seagrass primary productivity is typically constrained by limited nitrogen availability. Ammonia oxidation is an important process conducted by ammonia-oxidizing archaea (AOA) and bacteria (AOB), yet little information is available concerning the community structure and potential activity of seagrass AOA and AOB. Therefore, this study investigated the variations in the abundance, diversity and transcriptional activity of AOA and AOB at the DNA and transcript level from four sample types: the leaf, root, rhizosphere sediment and bulk sediment of seagrass Thalassia hemprichii in three coral reef ecosystems. DNA and complementary DNA (cDNA) were used to prepare clone libraries and DNA and cDNA quantitative PCR ( q PCR) assays, targeting the ammonia monooxygenase-subunit ( amo A) genes as biomarkers. Our results indicated that the closest relatives of the obtained archaeal and bacterial amo A gene sequences recovered from DNA and cDNA libraries mainly originated from the marine environment. Moreover, all the obtained AOB sequences belong to the Nitrosomonadales cluster. Nearly all the AOA communities exhibited higher diversity than the AOB communities at the DNA level, but the q PCR data demonstrated that the abundances of AOB communities were higher than that of AOA communities based on both DNA and RNA transcripts. Collectively, most of the samples shared greater community composition similarity with samples from the same location rather than sample type. Furthermore, the abundance of archaeal amo A gene in rhizosphere sediments showed significant relationships with the ammonium concentration of sediments and the nitrogen content of plant tissue (leaf and root) at the DNA level ( P < 0.05). Conversely, no such relationships were found for the AOB communities. This work provides new insight into the nitrogen cycle, particularly nitrification of seagrass meadows in coral reef ecosystems.
The complete mitochondrial genome of the Asian tapirs (Tapirus indicus): the only extant Tapiridae species in the old world.

PubMed

Muangkram, Yuttamol; Wajjwalku, Worawidh; Kaolim, Nongnid; Buddhakosai, Waradee; Kamolnorranath, Sumate; Siriaroonrat, Boripat; Tipkantha, Wanlaya; Dongsaard, Khwanruean; Maikaew, Umaporn; Sanannu, Saowaphang

2016-01-01

Asian tapir (Tapirus indicus) is categorized as Endangered on the 2008 IUCN red list. The first full-length mitochondrial DNA (mtDNA) sequence of Asian tapir is 16,717 bp in length. Base composition shows 34.6% A, 27.2% T, 25.8% C and 12.3% G. Highest polymorphic site is on the control region as typical for many species.
Fluorescent signatures for variable DNA sequences

PubMed Central

Rice, John E.; Reis, Arthur H.; Rice, Lisa M.; Carver-Brown, Rachel K.; Wangh, Lawrence J.

2012-01-01

Life abounds with genetic variations writ in sequences that are often only a few hundred nucleotides long. Rapid detection of these variations for identification of genetic diseases, pathogens and organisms has become the mainstay of molecular science and medicine. This report describes a new, highly informative closed-tube polymerase chain reaction (PCR) strategy for analysis of both known and unknown sequence variations. It combines efficient quantitative amplification of single-stranded DNA targets through LATE-PCR with sets of Lights-On/Lights-Off probes that hybridize to their target sequences over a broad temperature range. Contiguous pairs of Lights-On/Lights-Off probes of the same fluorescent color are used to scan hundreds of nucleotides for the presence of mutations. Sets of probes in different colors can be combined in the same tube to analyze even longer single-stranded targets. Each set of hybridized Lights-On/Lights-Off probes generates a composite fluorescent contour, which is mathematically converted to a sequence-specific fluorescent signature. The versatility and broad utility of this new technology is illustrated in this report by characterization of variant sequences in three different DNA targets: the rpoB gene of Mycobacterium tuberculosis, a sequence in the mitochondrial cytochrome C oxidase subunit 1 gene of nematodes and the V3 hypervariable region of the bacterial 16 s ribosomal RNA gene. We anticipate widespread use of these technologies for diagnostics, species identification and basic research. PMID:22879378
Development of PCR primers specific for the amplification and direct sequencing of gyrB genes from microbacteria, order Actinomycetales.

PubMed

Richert, Kathrin; Brambilla, Evelyne; Stackebrandt, Erko

2005-01-01

PCR primer sets were developed for the specific amplification and sequence analyses encoding the gyrase subunit B (gyrB) of members of the family Microbacteriaceae, class Actinobacteria. The family contains species highly related by 16S rRNA gene sequence analyses. In order to test if the gene sequence analysis of gyrB is appropriate to discriminate between closely related species, we evaluate the 16S rRNA gene phylogeny of its members. As the published universal primer set for gyrB failed to amplify the responding gene of the majority of the 80 type strains of the family, three new primer sets were identified that generated fragments with a composite sequence length of about 900 nt. However, the amplification of all three fragments was successful only in 25% of the 80 type strains. In this study, the substitution frequencies in genes encoding gyrase and 16S rDNA were compared for 10 strains of nine genera. The frequency of gyrB nucleotide substitution is significantly higher than that of the 16S rDNA, and no linear correlation exists between the similarities of both molecules among members of the Microbacteriaceae. The phylogenetic analyses using the gyrB sequences provide higher resolution than using 16S rDNA sequences and seem able to discriminate between closely related species.
Seasonal diversity and dynamics of haptophytes in the Skagerrak, Norway, explored by high-throughput sequencing

PubMed Central

Egge, Elianne Sirnæs; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente

2015-01-01

Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400.000 454 pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September–October (autumn) and lowest in April–May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3–5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. PMID:25893259
Seasonal diversity and dynamics of haptophytes in the Skagerrak, Norway, explored by high-throughput sequencing.

PubMed

Egge, Elianne Sirnaes; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente

2015-06-01

Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400.000 454 pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September-October (autumn) and lowest in April-May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3-5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. © 2015 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Are mutagenic non D-loop direct repeat motifs in mitochondrial DNA under a negative selection pressure?

PubMed Central

Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto

2015-01-01

Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815
The impact of base stacking on the conformations and electrostatics of single-stranded DNA.

PubMed

Plumridge, Alex; Meisburger, Steve P; Andresen, Kurt; Pollack, Lois

2017-04-20

Single-stranded DNA (ssDNA) is notable for its interactions with ssDNA binding proteins (SSBs) during fundamentally important biological processes including DNA repair and replication. Previous work has begun to characterize the conformational and electrostatic properties of ssDNA in association with SSBs. However, the conformational distributions of free ssDNA have been difficult to determine. To capture the vast array of ssDNA conformations in solution, we pair small angle X-ray scattering with novel ensemble fitting methods, obtaining key parameters such as the size, shape and stacking character of strands with different sequences. Complementary ion counting measurements using inductively coupled plasma atomic emission spectroscopy are employed to determine the composition of the ion atmosphere at physiological ionic strength. Applying this combined approach to poly dA and poly dT, we find that the global properties of these sequences are very similar, despite having vastly different propensities for single-stranded helical stacking. These results suggest that a relatively simple mechanism for the binding of ssDNA to non-specific SSBs may be at play, which explains the disparity in binding affinities observed for these systems. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Introduction of a novel 18S rDNA gene arrangement along with distinct ITS region in the saline water microalga Dunaliella

PubMed Central

2010-01-01

Comparison of 18S rDNA gene sequences is a very promising method for identification and classification of living organisms. Molecular identification and discrimination of different Dunaliella species were carried out based on the size of 18S rDNA gene and, number and position of introns in the gene. Three types of 18S rDNA structure have already been reported: the gene with a size of ~1770 bp lacking any intron, with a size of ~2170 bp consisting one intron near 5' terminus, and with a size of ~2570 bp harbouring two introns near 5' and 3' termini. Hereby, we report a new 18S rDNA gene arrangement in terms of intron localization and nucleotide sequence in a Dunaliella isolated from Iranian salt lakes (ABRIINW-M1/2). PCR amplification with genus-specific primers resulted in production of a ~2170 bp DNA band, which is similar to that of D. salina 18S rDNA gene containing only one intron near 5' terminus. Whilst, sequence composition of the gene revealed the lack of any intron near 5' terminus in our isolate. Furthermore, another alteration was observed due to the presence of a 440 bp DNA fragment near 3' terminus. Accordingly, 18S rDNA gene of the isolate is clearly different from those of D. salina and any other Dunaliella species reported so far. Moreover, analysis of ITS region sequence showed the diversity of this region compared to the previously reported species. 18S rDNA and ITS sequences of our isolate were submitted with accesion numbers of EU678868 and EU927373 in NCBI database, respectively. The optimum growth rate of this isolate occured at the salinity level of 1 M NaCl. The maximum carotenoid content under stress condition of intense light (400 μmol photon m-2 s-1), high salinity (4 M NaCl) and deficiency of nitrate and phosphate nutritions reached to 240 ng/cell after 15 days. PMID:20377865
Amplicon-Based Sequencing of Soil Fungi from Wood Preservative Test Sites

PubMed Central

Kirker, Grant T.; Bishell, Amy B.; Jusino, Michelle A.; Palmer, Jonathan M.; Hickey, William J.; Lindner, Daniel L.

2017-01-01

Soil samples were collected from field sites in two AWPA (American Wood Protection Association) wood decay hazard zones in North America. Two field plots at each site were exposed to differing preservative chemistries via in-ground installations of treated wood stakes for approximately 50 years. The purpose of this study is to characterize soil fungal species and to determine if long term exposure to various wood preservatives impacts soil fungal community composition. Soil fungal communities were compared using amplicon-based DNA sequencing of the internal transcribed spacer 1 (ITS1) region of the rDNA array. Data show that soil fungal community composition differs significantly between the two sites and that long-term exposure to different preservative chemistries is correlated with different species composition of soil fungi. However, chemical analyses using ICP-OES found levels of select residual preservative actives (copper, chromium and arsenic) to be similar to naturally occurring levels in unexposed areas. A list of indicator species was compiled for each treatment-site combination; functional guild analyses indicate that long-term exposure to wood preservatives may have both detrimental and stimulatory effects on soil fungal species composition. Fungi with demonstrated capacity to degrade industrial pollutants were found to be highly correlated with areas that experienced long-term exposure to preservative testing. PMID:29093702
Metabarcoding of the kombucha microbial community grown in different microenvironments.

PubMed

Reva, Oleg N; Zaets, Iryna E; Ovcharenko, Leonid P; Kukharenko, Olga E; Shpylova, Switlana P; Podolich, Olga V; de Vera, Jean-Pierre; Kozyrovska, Natalia O

2015-12-01

Introducing of the DNA metabarcoding analysis of probiotic microbial communities allowed getting insight into their functioning and establishing a better control on safety and efficacy of the probiotic communities. In this work the kombucha poly-microbial probiotic community was analysed to study its flexibility under different growth conditions. Environmental DNA sequencing revealed a complex and flexible composition of the kombucha microbial culture (KMC) constituting more bacterial and fungal organisms in addition to those found by cultural method. The community comprised bacterial and yeast components including cultured and uncultivable microorganisms. Culturing the KMC under different conditions revealed the core part of the community which included acetobacteria of two genera Komagataeibacter (former Gluconacetobacter) and Gluconobacter, and representatives of several yeast genera among which Brettanomyces/Dekkera and Pichia (including former Issatchenkia) were dominant. Herbaspirillum spp. and Halomonas spp., which previously had not been described in KMC, were found to be minor but permanent members of the community. The community composition was dependent on the growth conditions. The bacterial component of KMC was relatively stable, but may include additional member-lactobacilli. The yeast species composition was significantly variable. High-throughput sequencing showed complexity and variability of KMC that may affect the quality of the probiotic drink. It was hypothesized that the kombucha core community might recruit some environmental bacteria, particularly lactobacilli, which potentially may contribute to the fermentative capacity of the probiotic drink. As many KMC-associated microorganisms cannot be cultured out of the community, a robust control for community composition should be provided by using DNA metabarcoding.
Genome-Wide Spectra of Transcription Insertions and Deletions Reveal That Slippage Depends on RNA:DNA Hybrid Complementarity

PubMed Central

Traverse, Charles C.

2017-01-01

ABSTRACT Advances in sequencing technologies have enabled direct quantification of genome-wide errors that occur during RNA transcription. These errors occur at rates that are orders of magnitude higher than rates during DNA replication, but due to technical difficulties such measurements have been limited to single-base substitutions and have not yet quantified the scope of transcription insertions and deletions. Previous reporter gene assay findings suggested that transcription indels are produced exclusively by elongation complex slippage at homopolymeric runs, so we enumerated indels across the protein-coding transcriptomes of Escherichia coli and Buchnera aphidicola, which differ widely in their genomic base compositions and incidence of repeat regions. As anticipated from prior assays, transcription insertions prevailed in homopolymeric runs of A and T; however, transcription deletions arose in much more complex sequences and were rarely associated with homopolymeric runs. By reconstructing the relocated positions of the elongation complex as inferred from the sequences inserted or deleted during transcription, we show that continuation of transcription after slippage hinges on the degree of nucleotide complementarity within the RNA:DNA hybrid at the new DNA template location. PMID:28851848
A Novel Computational Method for Detecting DNA Methylation Sites with DNA Sequence Information and Physicochemical Properties.

PubMed

Pan, Gaofeng; Jiang, Limin; Tang, Jijun; Guo, Fei

2018-02-08

DNA methylation is an important biochemical process, and it has a close connection with many types of cancer. Research about DNA methylation can help us to understand the regulation mechanism and epigenetic reprogramming. Therefore, it becomes very important to recognize the methylation sites in the DNA sequence. In the past several decades, many computational methods-especially machine learning methods-have been developed since the high-throughout sequencing technology became widely used in research and industry. In order to accurately identify whether or not a nucleotide residue is methylated under the specific DNA sequence context, we propose a novel method that overcomes the shortcomings of previous methods for predicting methylation sites. We use k -gram, multivariate mutual information, discrete wavelet transform, and pseudo amino acid composition to extract features, and train a sparse Bayesian learning model to do DNA methylation prediction. Five criteria-area under the receiver operating characteristic curve (AUC), Matthew's correlation coefficient (MCC), accuracy (ACC), sensitivity (SN), and specificity-are used to evaluate the prediction results of our method. On the benchmark dataset, we could reach 0.8632 on AUC, 0.8017 on ACC, 0.5558 on MCC, and 0.7268 on SN. Additionally, the best results on two scBS-seq profiled mouse embryonic stem cells datasets were 0.8896 and 0.9511 by AUC, respectively. When compared with other outstanding methods, our method surpassed them on the accuracy of prediction. The improvement of AUC by our method compared to other methods was at least 0.0399 . For the convenience of other researchers, our code has been uploaded to a file hosting service, and can be downloaded from: https://figshare.com/s/0697b692d802861282d3.
PHYLOGENETIC DIVERSITY IN DRINKING WATER BACTERIA IN A DISTRIBUTION SYSTEM SIMULATOR

EPA Science Inventory

This work was carried out to characterize the composition of microbial populations in a distribution system simulator (DSS) by direct sequence analysis of 16S rDNA clone libraries. Bacterial populations were examined in chlorinated distribution water and chloraminated DSS feed an...
Chromosome Evolution in Connection with Repetitive Sequences and Epigenetics in Plants

PubMed Central

Li, Shu-Fen; Su, Ting; Cheng, Guang-Qian; Wang, Bing-Xiao; Li, Xu; Deng, Chuan-Liang; Gao, Wu-Jun

2017-01-01

Chromosome evolution is a fundamental aspect of evolutionary biology. The evolution of chromosome size, structure and shape, number, and the change in DNA composition suggest the high plasticity of nuclear genomes at the chromosomal level. Repetitive DNA sequences, which represent a conspicuous fraction of every eukaryotic genome, particularly in plants, are found to be tightly linked with plant chromosome evolution. Different classes of repetitive sequences have distinct distribution patterns on the chromosomes. Mounting evidence shows that repetitive sequences may play multiple generative roles in shaping the chromosome karyotypes in plants. Furthermore, recent development in our understanding of the repetitive sequences and plant chromosome evolution has elucidated the involvement of a spectrum of epigenetic modification. In this review, we focused on the recent evidence relating to the distribution pattern of repetitive sequences in plant chromosomes and highlighted their potential relevance to chromosome evolution in plants. We also discussed the possible connections between evolution and epigenetic alterations in chromosome structure and repatterning, such as heterochromatin formation, centromere function, and epigenetic-associated transposable element inactivation. PMID:29064432
Selectivity by host plants affects the distribution of arbuscular mycorrhizal fungi: evidence from ITS rDNA sequence metadata.

PubMed

Yang, Haishui; Zang, Yanyan; Yuan, Yongge; Tang, Jianjun; Chen, Xin

2012-04-12

Arbuscular mycorrhizal fungi (AMF) can form obligate symbioses with the vast majority of land plants, and AMF distribution patterns have received increasing attention from researchers. At the local scale, the distribution of AMF is well documented. Studies at large scales, however, are limited because intensive sampling is difficult. Here, we used ITS rDNA sequence metadata obtained from public databases to study the distribution of AMF at continental and global scales. We also used these sequence metadata to investigate whether host plant is the main factor that affects the distribution of AMF at large scales. We defined 305 ITS virtual taxa (ITS-VTs) among all sequences of the Glomeromycota by using a comprehensive maximum likelihood phylogenetic analysis. Each host taxonomic order averaged about 53% specific ITS-VTs, and approximately 60% of the ITS-VTs were host specific. Those ITS-VTs with wide host range showed wide geographic distribution. Most ITS-VTs occurred in only one type of host functional group. The distributions of most ITS-VTs were limited across ecosystem, across continent, across biogeographical realm, and across climatic zone. Non-metric multidimensional scaling analysis (NMDS) showed that AMF community composition differed among functional groups of hosts, and among ecosystem, continent, biogeographical realm, and climatic zone. The Mantel test showed that AMF community composition was significantly correlated with plant community composition among ecosystem, among continent, among biogeographical realm, and among climatic zone. The structural equation modeling (SEM) showed that the effects of ecosystem, continent, biogeographical realm, and climatic zone were mainly indirect on AMF distribution, but plant had strongly direct effects on AMF. The distribution of AMF as indicated by ITS rDNA sequences showed a pattern of high endemism at large scales. This pattern indicates high specificity of AMF for host at different scales (plant taxonomic order and functional group) and high selectivity from host plants for AMF. The effects of ecosystemic, biogeographical, continental and climatic factors on AMF distribution might be mediated by host plants.

Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

PubMed

Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L

2013-01-30

Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

PubMed Central

2013-01-01

Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705
SSR_pipeline--computer software for the identification of microsatellite sequences from paired-end Illumina high-throughput DNA sequence data

USGS Publications Warehouse

Miller, Mark P.; Knaus, Brian J.; Mullins, Thomas D.; Haig, Susan M.

2013-01-01

SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (SSRs; for example, microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains three analysis modules along with a fourth control module that can be used to automate analyses of large volumes of data. The modules are used to (1) identify the subset of paired-end sequences that pass quality standards, (2) align paired-end reads into a single composite DNA sequence, and (3) identify sequences that possess microsatellites conforming to user specified parameters. Each of the three separate analysis modules also can be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc). All modules are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, Windows). The program suite relies on a compiled Python extension module to perform paired-end alignments. Instructions for compiling the extension from source code are provided in the documentation. Users who do not have Python installed on their computers or who do not have the ability to compile software also may choose to download packaged executable files. These files include all Python scripts, a copy of the compiled extension module, and a minimal installation of Python in a single binary executable. See program documentation for more information.
Species classifier choice is a key consideration when analysing low-complexity food microbiome data.

PubMed

Walsh, Aaron M; Crispie, Fiona; O'Sullivan, Orla; Finnegan, Laura; Claesson, Marcus J; Cotter, Paul D

2018-03-20

The use of shotgun metagenomics to analyse low-complexity microbial communities in foods has the potential to be of considerable fundamental and applied value. However, there is currently no consensus with respect to choice of species classification tool, platform, or sequencing depth. Here, we benchmarked the performances of three high-throughput short-read sequencing platforms, the Illumina MiSeq, NextSeq 500, and Ion Proton, for shotgun metagenomics of food microbiota. Briefly, we sequenced six kefir DNA samples and a mock community DNA sample, the latter constructed by evenly mixing genomic DNA from 13 food-related bacterial species. A variety of bioinformatic tools were used to analyse the data generated, and the effects of sequencing depth on these analyses were tested by randomly subsampling reads. Compositional analysis results were consistent between the platforms at divergent sequencing depths. However, we observed pronounced differences in the predictions from species classification tools. Indeed, PERMANOVA indicated that there was no significant differences between the compositional results generated by the different sequencers (p = 0.693, R 2 = 0.011), but there was a significant difference between the results predicted by the species classifiers (p = 0.01, R 2 = 0.127). The relative abundances predicted by the classifiers, apart from MetaPhlAn2, were apparently biased by reference genome sizes. Additionally, we observed varying false-positive rates among the classifiers. MetaPhlAn2 had the lowest false-positive rate, whereas SLIMM had the greatest false-positive rate. Strain-level analysis results were also similar across platforms. Each platform correctly identified the strains present in the mock community, but accuracy was improved slightly with greater sequencing depth. Notably, PanPhlAn detected the dominant strains in each kefir sample above 500,000 reads per sample. Again, the outputs from functional profiling analysis using SUPER-FOCUS were generally accordant between the platforms at different sequencing depths. Finally, and expectedly, metagenome assembly completeness was significantly lower on the MiSeq than either on the NextSeq (p = 0.03) or the Proton (p = 0.011), and it improved with increased sequencing depth. Our results demonstrate a remarkable similarity in the results generated by the three sequencing platforms at different sequencing depths, and, in fact, the choice of bioinformatics methodology had a more evident impact on results than the choice of sequencer did.
Characterization of the Fb-Nof Transposable Element of Drosophila Melanogaster

PubMed Central

Harden, N.; Ashburner, M.

1990-01-01

FB-NOF is a composite transposable element of Drosophila melanogaster. It is composed of foldback sequences, of variable length, which flank a 4-kb NOF sequence with 308-bp inverted repeat termini. The NOF sequence could potentially code for a 120-kD polypeptide. The FB-NOF element is responsible for unstable mutations of the white gene (w(c) and w(DZL)) and is associated with the large TEs of G. Ising. Although most strains of D. melanogaster have 20-30 sites of FB insertion, FB-NOF elements are usually rare, many strains lack this composite element or have only one copy of it. A few strains, including w(DZL) and Basc have many (8-21) copies of FB-NOF, and these show a tendency to insert at ``hot-spots.'' These strains also have an increased number of FB elements. The DNA sequence of the NOF region associated with TE146(Z) has been determined. PMID:2174013
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

PubMed

Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

1987-08-01

To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded.
Identity of the xerophilic species Aspergillus penicillioides: Integrated analysis of the genotypic and phenotypic characters.

PubMed

Tamura, Miki; Kawasaki, Hiroko; Sugiyama, Junta

1999-02-01

We examined the identity of Aspergillus penicillioides, the typical xerophilic and strictly anamorphic species, using an integrated analysis of the genotypic and phenotypic characters. Our experimental methods on two genotypic characters, i.e., DNA base composition using the HPLC method and DNA relatedness using the nitrocellulose filter hybridization technique between A. flavus, A. oryzae, and their close relations revealed a good agreement with the values by buoyant density (for DNA base composition) and spectrophotometric determination (for DNA relatedness) reported by Kurtzman et al. in 1986. On the basis of these comparisons, we examined DNA base composition and DNA relatedness of six selected strains of A. penicillioides, including IFO 8155 (originally described as A. vitricola), one strain of A. restrictus, and the respective strains from Eurotium amstelodami, E. repens, and E. rubrum. As a result, five strains within A. penicillioides, including the neotype strain NRRL 4548, had G+C contents of 46 to 49 mol%, whereas IFO 8155 had 50 mol%. A. restrictus had 52 mol%, and three Eurotium species ranged from 46 to 49 mol%. The DNA relatedness between A. penicillioides (five strains), except for IFO 8155, exhibited values greater than 70%, but the DNA complementarity between four strains and IFO 8155 in A. penicillioides revealed values of less than 40%. DNA relatedness values between three species of Eurotium were 65 to 72%. We determined 18S, 5.8S, and ITS rDNA sequences as other genotypic characters from A. penicillioides (six strains), A. restrictus, and related teleomorphic species of Eurotium. In three phylogenetic trees inferred from these sequences, five strains of A. penicillioides, including the neotype strain, were closely related to each other, whereas IFO 8155 was distantly related and grouped with other xerophilic species. Our results have suggested that A. penicillioides typified by NRRL 4548 and A. penicillioides IFO 8155 (ex holotype of A. vitricola) are not conspecific. The enzyme patterns as a genotypic character and general morphology and conidial ornamentation types as phenotypic characters supported this conclusion. Therefore the name A. vitricola Ohtsuki, typified by the holotype strain IFO 8155, should be revived. Evolutionary affinities among Aspergillus species and related teleomorphs, including the xerophilic taxa, are discussed.
A DNA Barcoding Approach to Characterize Pollen Collected by Honeybees

PubMed Central

Bruni, Ilaria; Scaccabarozzi, Daniela; Sandionigi, Anna; Barbuto, Michela; Casiraghi, Maurizio; Labra, Massimo

2014-01-01

In the present study, we investigated DNA barcoding effectiveness to characterize honeybee pollen pellets, a food supplement largely used for human nutrition due to its therapeutic properties. We collected pollen pellets using modified beehives placed in three zones within an alpine protected area (Grigna Settentrionale Regional Park, Italy). A DNA barcoding reference database, including rbcL and trnH-psbA sequences from 693 plant species (104 sequenced in this study) was assembled. The database was used to identify pollen collected from the hives. Fifty-two plant species were identified at the molecular level. Results suggested rbcL alone could not distinguish among congeneric plants; however, psbA-trnH identified most of the pollen samples at the species level. Substantial variability in pollen composition was observed between the highest elevation locality (Alpe Moconodeno), characterized by arid grasslands and a rocky substrate, and the other two sites (Cornisella and Ortanella) at lower altitudes. Pollen from Ortanella and Cornisella showed the presence of typical deciduous forest species; however in samples collected at Ortanella, pollen of the invasive Lonicera japonica, and the ornamental Pelargonium x hortorum were observed. Our results indicated pollen composition was largely influenced by floristic local biodiversity, plant phenology, and the presence of alien flowering species. Therefore, pollen molecular characterization based on DNA barcoding might serve useful to beekeepers in obtaining honeybee products with specific nutritional or therapeutic characteristics desired by food market demands. PMID:25296114
DNA replication-timing analysis of human chromosome 22 at high resolution and different developmental states.

PubMed

White, Eric J; Emanuelsson, Olof; Scalzo, David; Royce, Thomas; Kosak, Steven; Oakeley, Edward J; Weissman, Sherman; Gerstein, Mark; Groudine, Mark; Snyder, Michael; Schübeler, Dirk

2004-12-21

Duplication of the genome during the S phase of the cell cycle does not occur simultaneously; rather, different sequences are replicated at different times. The replication timing of specific sequences can change during development; however, the determinants of this dynamic process are poorly understood. To gain insights into the contribution of developmental state, genomic sequence, and transcriptional activity to replication timing, we investigated the timing of DNA replication at high resolution along an entire human chromosome (chromosome 22) in two different cell types. The pattern of replication timing was correlated with respect to annotated genes, gene expression, novel transcribed regions of unknown function, sequence composition, and cytological features. We observed that chromosome 22 contains regions of early- and late-replicating domains of 100 kb to 2 Mb, many (but not all) of which are associated with previously described chromosomal bands. In both cell types, expressed sequences are replicated earlier than nontranscribed regions. However, several highly transcribed regions replicate late. Overall, the DNA replication-timing profiles of the two different cell types are remarkably similar, with only nine regions of difference observed. In one case, this difference reflects the differential expression of an annotated gene that resides in this region. Novel transcribed regions with low coding potential exhibit a strong propensity for early DNA replication. Although the cellular function of such transcripts is poorly understood, our results suggest that their activity is linked to the replication-timing program.
Sequence Composition and Gene Content of the Short Arm of Rye (Secale cereale) Chromosome 1

PubMed Central

Fluch, Silvia; Kopecky, Dieter; Burg, Kornel; Šimková, Hana; Taudien, Stefan; Petzold, Andreas; Kubaláková, Marie; Platzer, Matthias; Berenyi, Maria; Krainer, Siegfried; Doležel, Jaroslav; Lelley, Tamas

2012-01-01

Background The purpose of the study is to elucidate the sequence composition of the short arm of rye chromosome 1 (Secale cereale) with special focus on its gene content, because this portion of the rye genome is an integrated part of several hundreds of bread wheat varieties worldwide. Methodology/Principal Findings Multiple Displacement Amplification of 1RS DNA, obtained from flow sorted 1RS chromosomes, using 1RS ditelosomic wheat-rye addition line, and subsequent Roche 454FLX sequencing of this DNA yielded 195,313,589 bp sequence information. This quantity of sequence information resulted in 0.43× sequence coverage of the 1RS chromosome arm, permitting the identification of genes with estimated probability of 95%. A detailed analysis revealed that more than 5% of the 1RS sequence consisted of gene space, identifying at least 3,121 gene loci representing 1,882 different gene functions. Repetitive elements comprised about 72% of the 1RS sequence, Gypsy/Sabrina (13.3%) being the most abundant. More than four thousand simple sequence repeat (SSR) sites mostly located in gene related sequence reads were identified for possible marker development. The existence of chloroplast insertions in 1RS has been verified by identifying chimeric chloroplast-genomic sequence reads. Synteny analysis of 1RS to the full genomes of Oryza sativa and Brachypodium distachyon revealed that about half of the genes of 1RS correspond to the distal end of the short arm of rice chromosome 5 and the proximal region of the long arm of Brachypodium distachyon chromosome 2. Comparison of the gene content of 1RS to 1HS barley chromosome arm revealed high conservation of genes related to chromosome 5 of rice. Conclusions The present study revealed the gene content and potential gene functions on this chromosome arm and demonstrated numerous sequence elements like SSRs and gene-related sequences, which can be utilised for future research as well as in breeding of wheat and rye. PMID:22328922
Development and evaluation of specific PCR primers targeting the ribosomal DNA-internal transcribed spacer (ITS) region of peritrich ciliates in environmental samples

NASA Astrophysics Data System (ADS)

Su, Lei; Zhang, Qianqian; Gong, Jun

2017-07-01

Peritrich ciliates are highly diverse and can be important bacterial grazers in aquatic ecosystems. Morphological identifications of peritrich species and assemblages in the environment are time-consuming and expertise-demanding. In this study, two peritrich-specific PCR primers were newly designed to amplify a fragment including the internal transcribed spacer (ITS) region of ribosomal rDNA from environmental samples. The primers showed high specificity in silico, and in tests with peritrich isolates and environmental DNA. Application of these primers in clone library construction and sequencing yielded exclusively sequences of peritrichs for water and sediment samples. We also found the ITS1, ITS2, ITS, D1 region of 28S rDNA, and ITS+D1 region co-varied with, and generally more variable than, the V9 region of 18S rDNA in peritrichs. The newly designed specific primers thus provide additional tools to study the molecular diversity, community composition, and phylogeography of these ecologically important protists in different systems.
Effect of DNA Extraction Methods on the Apparent Structure of Yak Rumen Microbial Communities as Revealed by 16S rDNA Sequencing.

PubMed

Chen, Ya-Bing; Lan, Dao-Liang; Tang, Cheng; Yang, Xiao-Nong; Li, Jian

2015-01-01

To more efficiently identify the microbial community of the yak rumen, the standardization of DNA extraction is key to ensure fidelity while studying environmental microbial communities. In this study, we systematically compared the efficiency of several extraction methods based on DNA yield, purity, and 16S rDNA sequencing to determine the optimal DNA extraction methods whose DNA products reflect complete bacterial communities. The results indicate that method 6 (hexadecyltrimethylammomium bromide-lysozyme-physical lysis by bead beating) is recommended for the DNA isolation of the rumen microbial community due to its high yield, operational taxonomic unit, bacterial diversity, and excellent cell-breaking capability. The results also indicate that the bead-beating step is necessary to effectively break down the cell walls of all of the microbes, especially Gram-positive bacteria. Another aim of this study was to preliminarily analyze the bacterial community via 16S rDNA sequencing. The microbial community spanned approximately 21 phyla, 35 classes, 75 families, and 112 genera. A comparative analysis showed some variations in the microbial community between yaks and cattle that may be attributed to diet and environmental differences. Interestingly, numerous uncultured or unclassified bacteria were found in yak rumen, suggesting that further research is required to determine the specific functional and ecological roles of these bacteria in yak rumen. In summary, the investigation of the optimal DNA extraction methods and the preliminary evaluation of the bacterial community composition of yak rumen support further identification of the specificity of the rumen microbial community in yak and the discovery of distinct gene resources.
Evidence of Differences between the Communities of Arbuscular Mycorrhizal Fungi Colonizing Galls and Roots of Prunus persica Infected by the Root-Knot Nematode Meloidogyne incognita▿

PubMed Central

Alguacil, Maria del Mar; Torrecillas, Emma; Lozano, Zenaida; Roldán, Antonio

2011-01-01

Arbuscular mycorrhizal fungi (AMF) play important roles as plant protection agents, reducing or suppressing nematode colonization. However, it has never been investigated whether the galls produced in roots by nematode infection are colonized by AMF. This study tested whether galls produced by Meloidogyne incognita infection in Prunus persica roots are colonized by AMF. We also determined the changes in AMF composition and biodiversity mediated by infection with this root-knot nematode. DNA from galls and roots of plants infected by M. incognita and from roots of noninfected plants was extracted, amplified, cloned, and sequenced using AMF-specific primers. Phylogenetic analysis using the small-subunit (SSU) ribosomal DNA (rDNA) data set revealed 22 different AMF sequence types (17 Glomus sequence types, 3 Paraglomus sequence types, 1 Scutellospora sequence type, and 1 Acaulospora sequence type). The highest AMF diversity was found in uninfected roots, followed by infected roots and galls. This study indicates that the galls produced in P. persica roots due to infection with M. incognita were colonized extensively by a community of AMF, belonging to the families Paraglomeraceae and Glomeraceae, that was different from the community detected in roots. Although the function of the AMF in the galls is still unknown, we hypothesize that they act as protection agents against opportunistic pathogens. PMID:21984233
Evidence of differences between the communities of arbuscular mycorrhizal fungi colonizing galls and roots of Prunus persica infected by the root-knot nematode Meloidogyne incognita.

PubMed

Alguacil, Maria del Mar; Torrecillas, Emma; Lozano, Zenaida; Roldán, Antonio

2011-12-01

Arbuscular mycorrhizal fungi (AMF) play important roles as plant protection agents, reducing or suppressing nematode colonization. However, it has never been investigated whether the galls produced in roots by nematode infection are colonized by AMF. This study tested whether galls produced by Meloidogyne incognita infection in Prunus persica roots are colonized by AMF. We also determined the changes in AMF composition and biodiversity mediated by infection with this root-knot nematode. DNA from galls and roots of plants infected by M. incognita and from roots of noninfected plants was extracted, amplified, cloned, and sequenced using AMF-specific primers. Phylogenetic analysis using the small-subunit (SSU) ribosomal DNA (rDNA) data set revealed 22 different AMF sequence types (17 Glomus sequence types, 3 Paraglomus sequence types, 1 Scutellospora sequence type, and 1 Acaulospora sequence type). The highest AMF diversity was found in uninfected roots, followed by infected roots and galls. This study indicates that the galls produced in P. persica roots due to infection with M. incognita were colonized extensively by a community of AMF, belonging to the families Paraglomeraceae and Glomeraceae, that was different from the community detected in roots. Although the function of the AMF in the galls is still unknown, we hypothesize that they act as protection agents against opportunistic pathogens.
Strong transcription blockage mediated by R-loop formation within a G-rich homopurine–homopyrimidine sequence localized in the vicinity of the promoter

PubMed Central

Soo Shin, Jane Hae

2017-01-01

Abstract Guanine-rich (G-rich) homopurine–homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA–DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA–DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription ‘bursting’) and may also have practical implications for the design of expression vectors. PMID:28498974
Strong transcription blockage mediated by R-loop formation within a G-rich homopurine-homopyrimidine sequence localized in the vicinity of the promoter.

PubMed

Belotserkovskii, Boris P; Soo Shin, Jane Hae; Hanawalt, Philip C

2017-06-20

Guanine-rich (G-rich) homopurine-homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA-DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA-DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription 'bursting') and may also have practical implications for the design of expression vectors. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
454 Pyrosequencing to Describe Microbial Eukaryotic Community Composition, Diversity and Relative Abundance: A Test for Marine Haptophytes

PubMed Central

Egge, Elianne; Bittner, Lucie; Andersen, Tom; Audic, Stéphane; de Vargas, Colomban; Edvardsen, Bente

2013-01-01

Next generation sequencing of ribosomal DNA is increasingly used to assess the diversity and structure of microbial communities. Here we test the ability of 454 pyrosequencing to detect the number of species present, and assess the relative abundance in terms of cell numbers and biomass of protists in the phylum Haptophyta. We used a mock community consisting of equal number of cells of 11 haptophyte species and compared targeting DNA and RNA/cDNA, and two different V4 SSU rDNA haptophyte-biased primer pairs. Further, we tested four different bioinformatic filtering methods to reduce errors in the resulting sequence dataset. With sequencing depth of 11000–20000 reads and targeting cDNA with Haptophyta specific primers Hap454 we detected all 11 species. A rarefaction analysis of expected number of species recovered as a function of sampling depth suggested that minimum 1400 reads were required here to recover all species in the mock community. Relative read abundance did not correlate to relative cell numbers. Although the species represented with the largest biomass was also proportionally most abundant among the reads, there was generally a weak correlation between proportional read abundance and proportional biomass of the different species, both with DNA and cDNA as template. The 454 sequencing generated considerable spurious diversity, and more with cDNA than DNA as template. With initial filtering based only on match with barcode and primer we observed 100-fold more operational taxonomic units (OTUs) at 99% similarity than the number of species present in the mock community. Filtering based on quality scores, or denoising with PyroNoise resulted in ten times more OTU99% than the number of species. Denoising with AmpliconNoise reduced the number of OTU99% to match the number of species present in the mock community. Based on our analyses, we propose a strategy to more accurately depict haptophyte diversity using 454 pyrosequencing. PMID:24069303
Replication, checkpoint suppression and structure of centromeric DNA

PubMed Central

Romeo, Francesco; Costanzo, Vincenzo

2016-01-01

ABSTRACT Human centromeres contain large amounts of repetitive DNA sequences known as α satellite DNA, which can be difficult to replicate and whose functional role is unclear. Recently, we have characterized protein composition, structural organization and checkpoint response to stalled replication forks of centromeric chromatin reconstituted in Xenopus laevis egg extract. We showed that centromeric DNA has high affinity for SMC2-4 subunits of condensins and for CENP-A, it is enriched for DNA repair factors and suppresses the ATR checkpoint to ensure its efficient replication. We also showed that centromeric chromatin forms condensins enriched and topologically constrained DNA loops, which likely contribute to the overall structure of the centromere. These findings have important implications on how chromosomes are organized and genome stability is maintained in mammalian cells. PMID:27893298
Characterization and Modulation of Proteins Involved in Sulfur Mustard Vesication

DTIC Science & Technology

2000-06-01

PARP staining was present throughout the nucleus, the DBD showed a more localized punctate pattern in the region of the nucleolus and throughout the...34 oligonucleotide is synthesized that is identical in base composition to the antisense, but had a randomly generated sequence. This is an important control...reversed this inhibitory effect. The roles of PARP in modulating the composition and enzyme activities of the DNA synthesome were further investigated by
Using faecal DNA to determine consumption by kangaroos of plants considered palatable to sheep.

PubMed

Ho, K W; Krebs, G L; McCafferty, P; van Wyngaarden, S P; Addison, J

2010-02-01

Disagreement exists within the scientific community with regards to the level of competition for feed between sheep and kangaroos in the Australian rangelands. The greatest challenge to solving this debate is finding effective means of determining the composition of the diets of these potential grazing competitors. An option is to adopt a non-invasive approach that combines faecal collection and molecular techniques that focus on faecal DNA as the primary source of dietary information. As proof-of-concept, we show that a DNA reference data bank on plant species can be established. This DNA reference data bank was then used as a library to identify plant species in kangaroo faeces collected in the southern rangelands of Western Australia. To enhance the method development and to begin the investigation of competitive grazing between sheep and kangaroos, 16 plant species known to be palatable to sheep were initially targeted for collection. To ensure that only plant sequences were studied, PCR amplification was performed using a universal primer pair previously shown to be specific to the chloroplast transfer RNA leucine (trnL) UAA gene intron. Overall, genus-specific, single and differently sized amplicons were reliably and reproducibly generated; enabling the differentiation of reference plants by PCR product length heterogeneity. However, there were a few plants that could not be clearly differentiated on the basis of size alone. This prompted the adoption of a post-PCR step that enabled further differentiation according to base sequence variation. Restriction endonucleases make sequence-specific cleavages on DNA to produce discrete and reproducible fragments having unique sizes and base compositions. Their availability, affordability and simplicity-of-use put restriction enzyme sequence (RES) profiling as a logical post-PCR step for confirming plant species identity. We demonstrate that PCR-RES profiling of plant and faecal matter is useful for the identification of plants included in the diet of kangaroos. The limitations, potential and the opportunities created for researchers interested in investigating the diet of competing herbivores in the rangelands are discussed.

Mechanism of DNA binding enhancement by hepatitis B virus protein pX.

PubMed

Palmer, C R; Gegnas, L D; Schepartz, A

1997-12-09

At least three hundred million people worldwide are infected with the hepatitis B virus (HBV), and epidemiological studies show a clear correlation between chronic HBV infection and the development of hepatocellular carcinoma. HBV encodes a protein, pX, which abducts the cellular transcriptional machinery in several ways including direct interactions with bZIP transcription factors. These interactions increase the DNA affinities of target bZIP proteins in a DNA sequence-dependent manner. Here we use a series of bZIP peptide models to explore the mechanism by which pX interacts with bZIP proteins. Our results suggest that pX increases bZIP.DNA stability by increasing the stability of the bZIP dimer as well as the affinity of the dimer for DNA. Additional experiments provide evidence for a mechanism in which pX recognizes the composite structure of the peptide.DNA complex, not simply the primary peptide sequence. These experiments provide a framework for understanding how pX alters the patterns of transcription within the nucleus. The similarities between the mechanism proposed for pX and the mechanism previously proposed for the human T-cell leukemia virus protein Tax are discussed.
Biophysical characterization of an integrin-targeted lipopolyplex gene delivery vector.

PubMed

Mustapa, M Firouz Mohd; Bell, Paul C; Hurley, Christopher A; Nicol, Alastair; Guénin, Erwann; Sarkar, Supti; Writer, Michele J; Barker, Susie E; Wong, John B; Pilkington-Miksa, Michael A; Papahadjopoulos-Sternberg, Brigitte; Shamlou, Parviz Ayazi; Hailes, Helen C; Hart, Stephen L; Zicha, Daniel; Tabor, Alethea B

2007-11-13

Nonviral gene delivery vectors now show good therapeutic potential: however, detailed characterization of the composition and macromolecular organization of such particles remains a challenge. This paper describes experiments to elucidate the structure of a ternary, targeted, lipopolyplex synthetic vector, the LID complex. This consists of a lipid component, Lipofectin (L) (1:1 DOTMA:DOPE), plasmid DNA (D), and a dual-function, cationic peptide component (I) containing DNA condensation and integrin-targeting sequences. Fluorophore-labeled lipid, peptide, and DNA components were used to formulate the vector, and the stoichiometry of the particles was established by fluorescence correlation spectroscopy (FCS). The size of the complex was measured by FCS, and the sizes of LID, L, LD, and ID complexes were measured by dynamic light scattering (DLS). Fluorescence quenching experiments and freeze-fracture electron microscopy were then used to demonstrate the arrangement of the lipid, peptide, and DNA components within the complex. These experiments showed that the cationic portion of the peptide, I, interacts with the plasmid DNA, resulting in a tightly condensed DNA-peptide inner core; this is surrounded by a disordered lipid layer, from which the integrin-targeting sequence of the peptide partially protrudes.
Taxonomic and functional assignment of cloned sequences from high Andean forest soil metagenome.

PubMed

Montaña, José Salvador; Jiménez, Diego Javier; Hernández, Mónica; Angel, Tatiana; Baena, Sandra

2012-02-01

Total metagenomic DNA was isolated from high Andean forest soil and subjected to taxonomical and functional composition analyses by means of clone library generation and sequencing. The obtained yield of 1.7 μg of DNA/g of soil was used to construct a metagenomic library of approximately 20,000 clones (in the plasmid p-Bluescript II SK+) with an average insert size of 4 Kb, covering 80 Mb of the total metagenomic DNA. Metagenomic sequences near the plasmid cloning site were sequenced and them trimmed and assembled, obtaining 299 reads and 31 contigs (0.3 Mb). Taxonomic assignment of total sequences was performed by BLASTX, resulting in 68.8, 44.8 and 24.5% classification into taxonomic groups using the metagenomic RAST server v2.0, WebCARMA v1.0 online system and MetaGenome Analyzer v3.8 software, respectively. Most clone sequences were classified as Bacteria belonging to phlya Actinobacteria, Proteobacteria and Acidobacteria. Among the most represented orders were Actinomycetales (34% average), Rhizobiales, Burkholderiales and Myxococcales and with a greater number of sequences in the genus Mycobacterium (7% average), Frankia, Streptomyces and Bradyrhizobium. The vast majority of sequences were associated with the metabolism of carbohydrates, proteins, lipids and catalytic functions, such as phosphatases, glycosyltransferases, dehydrogenases, methyltransferases, dehydratases and epoxide hydrolases. In this study we compared different methods of taxonomic and functional assignment of metagenomic clone sequences to evaluate microbial diversity in an unexplored soil ecosystem, searching for putative enzymes of biotechnological interest and generating important information for further functional screening of clone libraries.
Electrochemical biosensor based on functional composite nanofibers for detection of K-ras gene via multiple signal amplification strategy.

PubMed

Wang, Xiaoying; Shu, Guofang; Gao, Chanchan; Yang, Yu; Xu, Qian; Tang, Meng

2014-12-01

An electrochemical biosensor based on functional composite nanofibers for hybridization detection of specific K-ras gene that is highly associated with colorectal cancer via multiple signal amplification strategy has been developed. The carboxylated multiwalled carbon nanotubes (MWCNTs) doped nylon 6 (PA6) composite nanofibers (MWCNTs-PA6) was prepared using electrospinning, which served as the nanosized backbone for thionine (TH) electropolymerization. The functional composite nanofibers [MWCNTs-PA6-PTH, where PTH is poly(thionine)] used as supporting scaffolds for single-stranded DNA1 (ssDNA1) immobilization can dramatically increase the amount of DNA attachment and the hybridization sensitivity. Through the hybridization reaction, a sandwich format of ssDNA1/K-ras gene/gold nanoparticle-labeled ssDNA2 (AuNPs-ssDNA2) was fabricated, and the AuNPs offered excellent electrochemical signal transduction. The signal amplification was further implemented by forming network-like thiocyanuric acid/gold nanoparticles (TA/AuNPs). A significant sensitivity enhancement was obtained; the detection limit was down to 30fM, and the discriminations were up to 54.3 and 51.9% between the K-ras gene and the one-base mismatched sequences including G/C and A/T mismatched bases, respectively. The amenability of this method to the analyses of K-ras gene from the SW480 colorectal cancer cell lysates was demonstrated. The results are basically consistent with those of the K-ras Kit (HRM: high-resolution melt). The method holds promise for the diagnosis and management of cancer. Copyright © 2014 Elsevier Inc. All rights reserved.
B chromosome dynamics in Prochilodus costatus (Teleostei, Characiformes) and comparisons with supernumerary chromosome system in other Prochilodus species

PubMed Central

Melo, Silvana; Utsunomia, Ricardo; Penitente, Manolo; Sobrinho-Scudeler, Patrícia Elda; Porto-Foresti, Fábio; Oliveira, Claudio; Foresti, Fausto; Dergam, Jorge Abdala

2017-01-01

Abstract Within the genus Prochilodus Agassiz, 1829, five species are known to carry B chromosomes, i.e. chromosomes beyond the usual diploid number that have been traditionally considered as accessory for the genome. Chromosome microdissection and mapping of repetitive DNA sequences are effective tools to assess the DNA content and allow a better understanding about the origin and composition of these elements in an array of species. In this study, a novel characterization of B chromosomes in Prochilodus costatus Valenciennes, 1850 (2n=54) was reported for the first time and their sequence complementarity with the supernumerary chromosomes observed in Prochilodus lineatus (Valenciennes, 1836) and Prochilodus argenteus Agassiz, 1829 was investigated. The hybridization patterns obtained with chromosome painting using the micro B probe of P. costatus and the satDNA SATH1 mapping made it possible to assume homology of sequences between the B chromosomes of these congeneric species. Our results suggest that the origin of B chromosomes in the genus Prochilodus is a phylogenetically old event. PMID:28919971
Single-cell DNA methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics.

PubMed

Farlik, Matthias; Sheffield, Nathan C; Nuzzo, Angelo; Datlinger, Paul; Schönegger, Andreas; Klughammer, Johanna; Bock, Christoph

2015-03-03

Methods for single-cell genome and transcriptome sequencing have contributed to our understanding of cellular heterogeneity, whereas methods for single-cell epigenomics are much less established. Here, we describe a whole-genome bisulfite sequencing (WGBS) assay that enables DNA methylation mapping in very small cell populations (μWGBS) and single cells (scWGBS). Our assay is optimized for profiling many samples at low coverage, and we describe a bioinformatic method that analyzes collections of single-cell methylomes to infer cell-state dynamics. Using these technological advances, we studied epigenomic cell-state dynamics in three in vitro models of cellular differentiation and pluripotency, where we observed characteristic patterns of epigenome remodeling and cell-to-cell heterogeneity. The described method enables single-cell analysis of DNA methylation in a broad range of biological systems, including embryonic development, stem cell differentiation, and cancer. It can also be used to establish composite methylomes that account for cell-to-cell heterogeneity in complex tissue samples. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Polymerase ribozyme efficiency increased by G/T-rich DNA oligonucleotides

PubMed Central

Yao, Chengguo; Müller, Ulrich F.

2011-01-01

The RNA world hypothesis states that the early evolution of life went through a stage where RNA served as genome and as catalyst. The replication of RNA world organisms would have been facilitated by ribozymes that catalyze RNA polymerization. To recapitulate an RNA world in the laboratory, a series of RNA polymerase ribozymes was developed previously. However, these ribozymes have a polymerization efficiency that is too low for self-replication, and the most efficient ribozymes prefer one specific template sequence. The limiting factor for polymerization efficiency is the weak sequence-independent binding to its primer/template substrate. Most of the known polymerase ribozymes bind an RNA heptanucleotide to form the P2 duplex on the ribozyme. By modifying this heptanucleotide, we were able to significantly increase polymerization efficiency. Truncations at the 3′-terminus of this heptanucleotide increased full-length primer extension by 10-fold, on a specific template sequence. In contrast, polymerization on several different template sequences was improved dramatically by replacing the RNA heptanucleotide with DNA oligomers containing randomized sequences of 15 nt. The presence of G and T in the random sequences was sufficient for this effect, with an optimal composition of 60% G and 40% T. Our results indicate that these DNA sequences function by establishing many weak and nonspecific base-pairing interactions to the single-stranded portion of the template. Such low-specificity interactions could have had important functions in an RNA world. PMID:21622900
Distortion of genetically modified organism quantification in processed foods: influence of particle size compositions and heat-induced DNA degradation.

PubMed

Moreano, Francisco; Busch, Ulrich; Engel, Karl-Heinz

2005-12-28

Milling fractions from conventional and transgenic corn were prepared at laboratory scale and used to study the influence of sample composition and heat-induced DNA degradation on the relative quantification of genetically modified organisms (GMO) in food products. Particle size distributions of the obtained fractions (coarse grits, regular grits, meal, and flour) were characterized using a laser diffraction system. The application of two DNA isolation protocols revealed a strong correlation between the degree of comminution of the milling fractions and the DNA yield in the extracts. Mixtures of milling fractions from conventional and transgenic material (1%) were prepared and analyzed via real-time polymerase chain reaction. Accurate quantification of the adjusted GMO content was only possible in mixtures containing conventional and transgenic material in the form of analogous milling fractions, whereas mixtures of fractions exhibiting different particle size distributions delivered significantly over- and underestimated GMO contents depending on their compositions. The process of heat-induced nucleic acid degradation was followed by applying two established quantitative assays showing differences between the lengths of the recombinant and reference target sequences (A, deltal(A) = -25 bp; B, deltal(B) = +16 bp; values related to the amplicon length of the reference gene). Data obtained by the application of method A resulted in underestimated recoveries of GMO contents in the samples of heat-treated products, reflecting the favored degradation of the longer target sequence used for the detection of the transgene. In contrast, data yielded by the application of method B resulted in increasingly overestimated recoveries of GMO contents. The results show how commonly used food technological processes may lead to distortions in the results of quantitative GMO analyses.
Informational Gene Phylogenies Do Not Support a Fourth Domain of Life for Nucleocytoplasmic Large DNA Viruses

PubMed Central

Williams, Tom A.; Embley, T. Martin; Heinz, Eva

2011-01-01

Mimivirus is a nucleocytoplasmic large DNA virus (NCLDV) with a genome size (1.2 Mb) and coding capacity ( 1000 genes) comparable to that of some cellular organisms. Unlike other viruses, Mimivirus and its NCLDV relatives encode homologs of broadly conserved informational genes found in Bacteria, Archaea, and Eukaryotes, raising the possibility that they could be placed on the tree of life. A recent phylogenetic analysis of these genes showed the NCLDVs emerging as a monophyletic group branching between Eukaryotes and Archaea. These trees were interpreted as evidence for an independent “fourth domain” of life that may have contributed DNA processing genes to the ancestral eukaryote. However, the analysis of ancient evolutionary events is challenging, and tree reconstruction is susceptible to bias resulting from non-phylogenetic signals in the data. These include compositional heterogeneity and homoplasy, which can lead to the spurious grouping of compositionally-similar or fast-evolving sequences. Here, we show that these informational gene alignments contain both significant compositional heterogeneity and homoplasy, which were not adequately modelled in the original analysis. When we use more realistic evolutionary models that better fit the data, the resulting trees are unable to reject a simple null hypothesis in which these informational genes, like many other NCLDV genes, were acquired by horizontal transfer from eukaryotic hosts. Our results suggest that a fourth domain is not required to explain the available sequence data. PMID:21698163
Construction and characterization of an in-vivo linear covalently closed DNA vector production system.

PubMed

Nafissi, Nafiseh; Slavcev, Roderick

2012-12-06

While safer than their viral counterparts, conventional non-viral gene delivery DNA vectors offer a limited safety profile. They often result in the delivery of unwanted prokaryotic sequences, antibiotic resistance genes, and the bacterial origins of replication to the target, which may lead to the stimulation of unwanted immunological responses due to their chimeric DNA composition. Such vectors may also impart the potential for chromosomal integration, thus potentiating oncogenesis. We sought to engineer an in vivo system for the quick and simple production of safer DNA vector alternatives that were devoid of non-transgene bacterial sequences and would lethally disrupt the host chromosome in the event of an unwanted vector integration event. We constructed a parent eukaryotic expression vector possessing a specialized manufactured multi-target site called "Super Sequence", and engineered E. coli cells (R-cell) that conditionally produce phage-derived recombinase Tel (PY54), TelN (N15), or Cre (P1). Passage of the parent plasmid vector through R-cells under optimized conditions, resulted in rapid, efficient, and one step in vivo generation of mini lcc--linear covalently closed (Tel/TelN-cell), or mini ccc--circular covalently closed (Cre-cell), DNA constructs, separated from the backbone plasmid DNA. Site-specific integration of lcc plasmids into the host chromosome resulted in chromosomal disruption and 10(5) fold lower viability than that seen with the ccc counterpart. We offer a high efficiency mini DNA vector production system that confers simple, rapid and scalable in vivo production of mini lcc DNA vectors that possess all the benefits of "minicircle" DNA vectors and virtually eliminate the potential for undesirable vector integration events.
The Mitochondrial Genome of Chara vulgaris: Insights into the Mitochondrial DNA Architecture of the Last Common Ancestor of Green Algae and Land PlantsW⃞

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2003-01-01

Mitochondrial DNA (mtDNA) has undergone radical changes during the evolution of green plants, yet little is known about the dynamics of mtDNA evolution in this phylum. Land plant mtDNAs differ from the few green algal mtDNAs that have been analyzed to date by their expanded size, long spacers, and diversity of introns. We have determined the mtDNA sequence of Chara vulgaris (Charophyceae), a green alga belonging to the charophycean order (Charales) that is thought to be the most closely related alga to land plants. This 67,737-bp mtDNA sequence, displaying 68 conserved genes and 27 introns, was compared with those of three angiosperms, the bryophyte Marchantia polymorpha, the charophycean alga Chaetosphaeridium globosum (Coleochaetales), and the green alga Mesostigma viride. Despite important differences in size and intron composition, Chara mtDNA strikingly resembles Marchantia mtDNA; for instance, all except 9 of 68 conserved genes lie within blocks of colinear sequences. Overall, our genome comparisons and phylogenetic analyses provide unequivocal support for a sister-group relationship between the Charales and the land plants. Only four introns in land plant mtDNAs appear to have been inherited vertically from a charalean algar ancestor. We infer that the common ancestor of green algae and land plants harbored a tightly packed, gene-rich, and relatively intron-poor mitochondrial genome. The group II introns in this ancestral genome appear to have spread to new mtDNA sites during the evolution of bryophytes and charalean green algae, accounting for part of the intron diversity found in Chara and land plant mitochondria. PMID:12897260
[Community composition and diversity of endophytic fungi from roots of Sinopodophyllum hexandrum in forest of Upper-north mountain of Qinghai province].

PubMed

Ning, Yi; Li, Yan-Ling; Zhou, Guo-Ying; Yang, Lu-Cun; Xu, Wen-Hua

2016-04-01

High throughput sequencing technology is also called Next Generation Sequencing (NGS), which can sequence hundreds and thousands sequences in different samples at the same time. In the present study, the culture-independent high throughput sequencing technology was applied to sequence the fungi metagenomic DNA of the fungal internal transcribed spacer 1(ITS 1) in the root of Sinopodophyllum hexandrum. Sequencing data suggested that after the quality control, 22 565 reads were remained. Cluster similarity analysis was done based on 97% sequence similarity, which obtained 517 OTUs for the three samples (LD1, LD2 and LD3). All the fungi which identified from all the reads of OTUs based on 0.8 classification thresholds using the software of RDP classifier were classified as 13 classes, 35 orders, 44 family, 55 genera. Among these genera, the genus of Tetracladium was the dominant genera in all samples(35.49%, 68.55% and 12.96%).The Shannon's diversity indices and the Simpson indices of the endophytic fungi in the samples ranged from 1.75-2.92, 0.11-0.32, respectively.This is the first time for applying high through put sequencing technol-ogyto analyze the community composition and diversity of endophytic fungi in the medicinal plant, and the results showed that there were hyper diver sity and high community composition complexity of endophytic fungi in the root of S. hexandrum. It is also proved that the high through put sequencing technology has great advantage for analyzing ecommunity composition and diversity of endophtye in the plant. Copyright© by the Chinese Pharmaceutical Association.
Alterations of microbiota in urine from women with interstitial cystitis

PubMed Central

2012-01-01

Background Interstitial Cystitis (IC) is a chronic inflammatory condition of the bladder with unknown etiology. The aim of this study was to characterize the microbial community present in the urine from IC female patients by 454 high throughput sequencing of the 16S variable regions V1V2 and V6. The taxonomical composition, richness and diversity of the IC microbiota were determined and compared to the microbial profile of asymptomatic healthy female (HF) urine. Results The composition and distribution of bacterial sequences differed between the urine microbiota of IC patients and HFs. Reduced sequence richness and diversity were found in IC patient urine, and a significant difference in the community structure of IC urine in relation to HF urine was observed. More than 90% of the IC sequence reads were identified as belonging to the bacterial genus Lactobacillus, a marked increase compared to 60% in HF urine. Conclusion The 16S rDNA sequence data demonstrates a shift in the composition of the bacterial community in IC urine. The reduced microbial diversity and richness is accompanied by a higher abundance of the bacterial genus Lactobacillus, compared to HF urine. This study demonstrates that high throughput sequencing analysis of urine microbiota in IC patients is a powerful tool towards a better understanding of this enigmatic disease. PMID:22974186
A complete mitochondrial genome sequence of Asian black bear Sichuan subspecies (Ursus thibetanus mupinensis)

PubMed Central

Hou, Wan-ru; Chen, Yu; Wu, Xia; Hu, Jin-chu; Peng, Zheng-song; Yang, Jung; Tang, Zong-xiang; Zhou, Cai-Quan; Li, Yu-ming; Yang, Shi-kui; Du, Yu-jie; Kong, Ling-lu; Ren, Zheng-long; Zhang, Huai-yu; Shuai, Su-rong

2007-01-01

We obtained the complete mitochondrial genome of U.thibetanus mupinensis by DNA sequencing based on the PCR fragments of 18 primers we designed. The results indicate that the mtDNA is 16 868 bp in size, encodes 13 protein genes, 22 tRNA genes, and 2 rRNA genes, with an overall H-strand base composition of 31.2% A, 25.4% C, 15.5% G and 27.9% T. The sequence of the control region (CR) located between tRNA-Pro and tRNA-Phe is 1422 bp in size, consists of 8.43% of the whole genome, GC content is 51.9% and has a 6bp tandem repeat and two 10bp tandem repeats identified by using the Tandem Repeats Finder. U. thibetanus mupinensis mitochondrial genome shares high similarity with those of three other Ursidae: U. americanus (91.46%), U. arctos (89.25%) and U. maritimus (87.66%). PMID:17205108
Degenerative minimalism in the genome of a psyllid endosymbiont.

PubMed

Clark, M A; Baumann, L; Thao, M L; Moran, N A; Baumann, P

2001-03-01

Psyllids, like aphids, feed on plant phloem sap and are obligately associated with prokaryotic endosymbionts acquired through vertical transmission from an ancestral infection. We have sequenced 37 kb of DNA of the genome of Carsonella ruddii, the endosymbiont of psyllids, and found that it has a number of unusual properties revealing a more extreme case of degeneration than was previously reported from studies of eubacterial genomes, including that of the aphid endosymbiont Buchnera aphidicola. Among the unusual properties are an exceptionally low guanine-plus-cytosine content (19.9%), almost complete absence of intergenic spaces, operon fusion, and lack of the usual promoter sequences upstream of 16S rDNA. These features suggest the synthesis of long mRNAs and translational coupling. The most extreme instances of base compositional bias occur in the genes encoding proteins that have less highly conserved amino acid sequences; the guanine-plus-cytosine content of some protein-coding sequences is as low as 10%. The shift in base composition has a large effect on proteins: in polypeptides of C. ruddii, half of the residues consist of five amino acids with codons low in guanine plus cytosine. Furthermore, the proteins of C. ruddii are reduced in size, with an average of about 9% fewer amino acids than in homologous proteins of related bacteria. These observations suggest that the C. ruddii genome is not subject to constraints that limit the evolution of other known eubacteria.
Do neighboring lakes share common taxa of bacterioplankton? Comparison of 16S rDNA fingerprints and sequences from three geographic regions.

PubMed

Lindström, E S; Leskinen, E

2002-07-01

Bacterioplankton community composition was studied in 12 lakes in three different geographic regions in Scandinavia using denaturing gradient gel electrophoresis (DGGE) and sequencing of 16S rDNA. Area-specific abundant taxa were found in the lakes in two of the regions. In the region of Uppland the lakes had an alpha-proteobacterium, belonging to the subgroup Alpha V in common. The Alpha V bacteria appeared to be favored by neutral or higher pH values. The lakes in Lappland were found to harbor Actinobacteria, which appeared to be favored in bog lakes. No abundant taxon was found to be in common for the lakes in Svalbard, the third region studied.
A Highly Sensitive Electrochemical DNA Biosensor from Acrylic-Gold Nano-composite for the Determination of Arowana Fish Gender

NASA Astrophysics Data System (ADS)

Rahman, Mahbubur; Heng, Lee Yook; Futra, Dedi; Chiang, Chew Poh; Rashid, Zulkafli A.; Ling, Tan Ling

2017-08-01

The present research describes a simple method for the identification of the gender of arowana fish ( Scleropages formosus). The DNA biosensor was able to detect specific DNA sequence at extremely low level down to atto M regimes. An electrochemical DNA biosensor based on acrylic microsphere-gold nanoparticle (AcMP-AuNP) hybrid composite was fabricated. Hydrophobic poly(n-butylacrylate-N-acryloxysuccinimide) microspheres were synthesised with a facile and well-established one-step photopolymerization procedure and physically adsorbed on the AuNPs at the surface of a carbon screen printed electrode (SPE). The DNA biosensor was constructed simply by grafting an aminated DNA probe on the succinimide functionalised AcMPs via a strong covalent attachment. DNA hybridisation response was determined by differential pulse voltammetry (DPV) technique using anthraquinone monosulphonic acid redox probe as an electroactive oligonucleotide label (Table 1). A low detection limit at 1.0 × 10-18 M with a wide linear calibration range of 1.0 × 10-18 to 1.0 × 10-8 M ( R 2 = 0.99) can be achieved by the proposed DNA biosensor under optimal conditions. Electrochemical detection of arowana DNA can be completed within 1 hour. Due to its small size and light weight, the developed DNA biosensor holds high promise for the development of functional kit for fish culture usage.
A Novel Low Energy Electron Microscope for DNA Sequencing and Surface Analysis

PubMed Central

Mankos, M.; Shadman, K.; Persson, H.H.J.; N’Diaye, A.T.; Schmid, A.K.; Davis, R.W.

2014-01-01

Monochromatic, aberration-corrected, dual-beam low energy electron microscopy (MAD-LEEM) is a novel technique that is directed towards imaging nanostructures and surfaces with sub-nanometer resolution. The technique combines a monochromator, a mirror aberration corrector, an energy filter, and dual beam illumination in a single instrument. The monochromator reduces the energy spread of the illuminating electron beam, which significantly improves spectroscopic and spatial resolution. Simulation results predict that the novel aberration corrector design will eliminate the second rank chromatic and third and fifth order spherical aberrations, thereby improving the resolution into the sub-nanometer regime at landing energies as low as one hundred electron-Volts. The energy filter produces a beam that can extract detailed information about the chemical composition and local electronic states of non-periodic objects such as nanoparticles, interfaces, defects, and macromolecules. The dual flood illumination eliminates charging effects that are generated when a conventional LEEM is used to image insulating specimens. A potential application for MAD-LEEM is in DNA sequencing, which requires high resolution to distinguish the individual bases and high speed to reduce the cost. The MAD-LEEM approach images the DNA with low electron impact energies, which provides nucleobase contrast mechanisms without organometallic labels. Furthermore, the micron-size field of view when combined with imaging on the fly provides long read lengths, thereby reducing the demand on assembling the sequence. Experimental results from bulk specimens with immobilized single-base oligonucleotides demonstrate that base specific contrast is available with reflected, photo-emitted, and Auger electrons. Image contrast simulations of model rectangular features mimicking the individual nucleotides in a DNA strand have been developed to translate measurements of contrast on bulk DNA to the detectability of individual DNA bases in a sequence. PMID:24524867
A novel low energy electron microscope for DNA sequencing and surface analysis.

PubMed

Mankos, M; Shadman, K; Persson, H H J; N'Diaye, A T; Schmid, A K; Davis, R W

2014-10-01

Monochromatic, aberration-corrected, dual-beam low energy electron microscopy (MAD-LEEM) is a novel technique that is directed towards imaging nanostructures and surfaces with sub-nanometer resolution. The technique combines a monochromator, a mirror aberration corrector, an energy filter, and dual beam illumination in a single instrument. The monochromator reduces the energy spread of the illuminating electron beam, which significantly improves spectroscopic and spatial resolution. Simulation results predict that the novel aberration corrector design will eliminate the second rank chromatic and third and fifth order spherical aberrations, thereby improving the resolution into the sub-nanometer regime at landing energies as low as one hundred electron-Volts. The energy filter produces a beam that can extract detailed information about the chemical composition and local electronic states of non-periodic objects such as nanoparticles, interfaces, defects, and macromolecules. The dual flood illumination eliminates charging effects that are generated when a conventional LEEM is used to image insulating specimens. A potential application for MAD-LEEM is in DNA sequencing, which requires high resolution to distinguish the individual bases and high speed to reduce the cost. The MAD-LEEM approach images the DNA with low electron impact energies, which provides nucleobase contrast mechanisms without organometallic labels. Furthermore, the micron-size field of view when combined with imaging on the fly provides long read lengths, thereby reducing the demand on assembling the sequence. Experimental results from bulk specimens with immobilized single-base oligonucleotides demonstrate that base specific contrast is available with reflected, photo-emitted, and Auger electrons. Image contrast simulations of model rectangular features mimicking the individual nucleotides in a DNA strand have been developed to translate measurements of contrast on bulk DNA to the detectability of individual DNA bases in a sequence. Copyright © 2014 Elsevier B.V. All rights reserved.
A novel low energy electron microscope for DNA sequencing and surface analysis

DOE PAGES

Mankos, M.; Shadman, K.; Persson, H. H. J.; ...

2014-01-31

Monochromatic, aberration-corrected, dual-beam low energy electron microscopy (MAD-LEEM) is a novel technique that is directed towards imaging nanostructures and surfaces with sub-nanometer resolution. The technique combines a monochromator, a mirror aberration corrector, an energy filter, and dual beam illumination in a single instrument. The monochromator reduces the energy spread of the illuminating electron beam, which significantly improves spectroscopic and spatial resolution. Simulation results predict that the novel aberration corrector design will eliminate the second rank chromatic and third and fifth order spherical aberrations, thereby improving the resolution into the sub-nanometer regime at landing energies as low as one hundred electron-Volts.more » The energy filter produces a beam that can extract detailed information about the chemical composition and local electronic states of non-periodic objects such as nanoparticles, interfaces, defects, and macromolecules. The dual flood illumination eliminates charging effects that are generated when a conventional LEEM is used to image insulating specimens. A potential application for MAD-LEEM is in DNA sequencing, which requires high resolution to distinguish the individual bases and high speed to reduce the cost. The MAD-LEEM approach images the DNA with low electron impact energies, which provides nucleobase contrast mechanisms without organometallic labels. Furthermore, the micron-size field of view when combined with imaging on the fly provides long read lengths, thereby reducing the demand on assembling the sequence. Finally, experimental results from bulk specimens with immobilized single-base oligonucleotides demonstrate that base specific contrast is available with reflected, photo-emitted, and Auger electrons. Image contrast simulations of model rectangular features mimicking the individual nucleotides in a DNA strand have been developed to translate measurements of contrast on bulk DNA to the detectability of individual DNA bases in a sequence.« less

Exploring the Impacts of Anthropogenic Disturbance on Seawater and Sediment Microbial Communities in Korean Coastal Waters Using Metagenomics Analysis

PubMed Central

Won, Nam-Il; Kim, Ki-Hwan; Kang, Ji Hyoun; Park, Sang Rul; Lee, Hyuk Je

2017-01-01

The coastal ecosystems are considered as one of the most dynamic and vulnerable environments under various anthropogenic developments and the effects of climate change. Variations in the composition and diversity of microbial communities may be a good indicator for determining whether the marine ecosystems are affected by complex forcing stressors. DNA sequence-based metagenomics has recently emerged as a promising tool for analyzing the structure and diversity of microbial communities based on environmental DNA (eDNA). However, few studies have so far been performed using this approach to assess the impacts of human activities on the microbial communities in marine systems. In this study, using metagenomic DNA sequencing (16S ribosomal RNA gene), we analyzed and compared seawater and sediment communities between sand mining and control (natural) sites in southern coastal waters of Korea to assess whether anthropogenic activities have significantly affected the microbial communities. The sand mining sites harbored considerably lower levels of microbial diversities in the surface seawater community during spring compared with control sites. Moreover, the sand mining areas had distinct microbial taxonomic group compositions, particularly during spring season. The microbial groups detected solely in the sediment load/dredging areas (e.g., Marinobacter, Alcanivorax, Novosphingobium) are known to be involved in degradation of toxic chemicals such as hydrocarbon, oil, and aromatic compounds, and they also contain potential pathogens. This study highlights the versatility of metagenomics in monitoring and diagnosing the impacts of human disturbance on the environmental health of marine ecosystems from eDNA. PMID:28134828
Exploring the Impacts of Anthropogenic Disturbance on Seawater and Sediment Microbial Communities in Korean Coastal Waters Using Metagenomics Analysis.

PubMed

Won, Nam-Il; Kim, Ki-Hwan; Kang, Ji Hyoun; Park, Sang Rul; Lee, Hyuk Je

2017-01-27

The coastal ecosystems are considered as one of the most dynamic and vulnerable environments under various anthropogenic developments and the effects of climate change. Variations in the composition and diversity of microbial communities may be a good indicator for determining whether the marine ecosystems are affected by complex forcing stressors. DNA sequence-based metagenomics has recently emerged as a promising tool for analyzing the structure and diversity of microbial communities based on environmental DNA (eDNA). However, few studies have so far been performed using this approach to assess the impacts of human activities on the microbial communities in marine systems. In this study, using metagenomic DNA sequencing (16S ribosomal RNA gene), we analyzed and compared seawater and sediment communities between sand mining and control (natural) sites in southern coastal waters of Korea to assess whether anthropogenic activities have significantly affected the microbial communities. The sand mining sites harbored considerably lower levels of microbial diversities in the surface seawater community during spring compared with control sites. Moreover, the sand mining areas had distinct microbial taxonomic group compositions, particularly during spring season. The microbial groups detected solely in the sediment load/dredging areas (e.g., Marinobacter, Alcanivorax, Novosphingobium) are known to be involved in degradation of toxic chemicals such as hydrocarbon, oil, and aromatic compounds, and they also contain potential pathogens. This study highlights the versatility of metagenomics in monitoring and diagnosing the impacts of human disturbance on the environmental health of marine ecosystems from eDNA.
Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

PubMed Central

Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

2015-01-01

DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576
Haplotype phasing and inheritance of copy number variants in nuclear families.

PubMed

Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

2015-01-01

DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.
A Method to Assess Bacteriocin Effects on the Gut Microbiota of Mice.

PubMed

Bäuerl, Chrstine; Umu, Özgun C O; Hernandez, Pablo E; Diep, Dzung B; Pérez-Martínez, Gaspar

2017-07-25

Very intriguing questions arise with our advancing knowledge on gut microbiota composition and the relationship with health, particularly relating to the factors that contribute to maintaining the population balance. However, there are limited available methodologies to evaluate these factors. Bacteriocins are antimicrobial peptides produced by many bacteria that may confer a competitive advantage for food acquisition and/or niche establishment. Many probiotic lactic acid bacteria (LAB) strains have great potential to promote human and animal health by preventing the growth of pathogens. They can also be used for immuno-modulation, as they produce bacteriocins. However, the antagonistic activity of bacteriocins is normally determined by laboratory bioassays under well-defined but over-simplified conditions compared to the complex gut environment in humans and animals, where bacteria face multifactorial influences from the host and hundreds of microbial species sharing the same niche. This work describes a complete and efficient procedure to assess the effect of a variety of bacteriocins with different target specificities in a murine system. Changes in the microbiota composition during the bacteriocin treatment are monitored using compositional 16S rDNA sequencing. Our approach uses both the bacteriocin producers and their isogenic non-bacteriocin-producing mutants, the latter giving the ability to distinguish bacteriocin-related from non-bacteriocin-related modifications of the microbiota. The fecal DNA extraction and 16S rDNA sequencing methods are consistent and, together with the bioinformatics, constitute a powerful procedure to find faint changes in the bacterial profiles and to establish correlations, in terms of cholesterol and triglyceride concentration, between bacterial populations and health markers. Our protocol is generic and can thus be used to study other compounds or nutrients with the potential to alter the host microbiota composition, either when studying toxicity or beneficial effects.
Characterization of Mycobacterium bohemicum Isolated from Human, Veterinary, and Environmental Sources

PubMed Central

Torkko, Pirjo; Suomalainen, Sini; Iivanainen, Eila; Suutari, Merja; Paulin, Lars; Rudbäck, Eeva; Tortoli, Enrico; Vincent, Véronique; Mattila, Rauni; Katila, Marja-Leena

2001-01-01

Chemotaxonomic and genetic properties were determined for 14 mycobacterial isolates identified as members of a newly described species Mycobacterium bohemicum. The isolates recovered from clinical, veterinary, and environmental sources were compared for lipid composition, biochemical test results, and sequencing of the 16S ribosomal DNA (rDNA) and the 16S-23S rDNA internal transcribed spacer (ITS) regions. The isolates had a lipid composition that was different from those of other known species. Though the isolates formed a distinct entity, some variations were detected in the features analyzed. Combined results of the phenotypic and genotypic analyses were used to group the isolates into three clusters. The major cluster (cluster A), very homogenous in all respects, comprised the M. bohemicum type strain, nine clinical and veterinary isolates, and two of the five environmental isolates. Three other environmental isolates displayed an insertion of 14 nucleotides in the ITS region; they also differed from cluster A in fatty alcohol composition and produced a positive result in the Tween 80 hydrolysis test. Among these three, two isolates were identical (cluster B), but one isolate (cluster C) had a unique high-performance liquid chromatography profile, and its gas liquid chromatography profile lacked 2-octadecanol, which was present in all other isolates analyzed. Thus, sequence variation in the 16S-23S ITS region was associated with interesting variations in lipid composition. Two of the isolates analyzed were regarded as potential inducers of human or veterinary infections. Each of the environmental isolates, all of which were unrelated to the cases presented, was cultured from the water of a different stream. Hence, natural waters are potential reservoirs of M. bohemicum. PMID:11136772
Detecting in situ copepod diet diversity using molecular technique: development of a copepod/symbiotic ciliate-excluding eukaryote-inclusive PCR protocol.

PubMed

Hu, Simin; Guo, Zhiling; Li, Tao; Carpenter, Edward J; Liu, Sheng; Lin, Senjie

2014-01-01

Knowledge of in situ copepod diet diversity is crucial for accurately describing pelagic food web structure but is challenging to achieve due to lack of an easily applicable methodology. To enable analysis with whole copepod-derived DNAs, we developed a copepod-excluding 18S rDNA-based PCR protocol. Although it is effective in depressing amplification of copepod 18S rDNA, its applicability to detect diverse eukaryotes in both mono- and mixed-species has not been demonstrated. Besides, the protocol suffers from the problem that sequences from symbiotic ciliates are overrepresented in the retrieved 18S rDNA libraries. In this study, we designed a blocking primer to make a combined primer set (copepod/symbiotic ciliate-excluding eukaryote-common: CEEC) to depress PCR amplification of symbiotic ciliate sequences while maximizing the range of eukaryotes amplified. We firstly examined the specificity and efficacy of CEEC by PCR-amplifying DNAs from 16 copepod species, 37 representative organisms that are potential prey of copepods and a natural microplankton sample, and then evaluated the efficiency in reconstructing diet composition by detecting the food of both lab-reared and field-collected copepods. Our results showed that the CEEC primer set can successfully amplify 18S rDNA from a wide range of isolated species and mixed-species samples while depressing amplification of that from copepod and targeted symbiotic ciliate, indicating the universality of CEEC in specifically detecting prey of copepods. All the predetermined food offered to copepods in the laboratory were successfully retrieved, suggesting that the CEEC-based protocol can accurately reconstruct the diets of copepods without interference of copepods and their associated ciliates present in the DNA samples. Our initial application to analyzing the food composition of field-collected copepods uncovered diverse prey species, including those currently known, and those that are unsuspected, as copepod prey. While testing is required, this protocol provides a useful strategy for depicting in situ dietary composition of copepods.
Compositional Bias in Naïve and Chemically-modified Phage-Displayed Libraries uncovered by Paired-end Deep Sequencing.

PubMed

He, Bifang; Tjhung, Katrina F; Bennett, Nicholas J; Chou, Ying; Rau, Andrea; Huang, Jian; Derda, Ratmir

2018-01-19

Understanding the composition of a genetically-encoded (GE) library is instrumental to the success of ligand discovery. In this manuscript, we investigate the bias in GE-libraries of linear, macrocyclic and chemically post-translationally modified (cPTM) tetrapeptides displayed on the M13KE platform, which are produced via trinucleotide cassette synthesis (19 codons) and NNK-randomized codon. Differential enrichment of synthetic DNA {S}, ligated vector {L} (extension and ligation of synthetic DNA into the vector), naïve libraries {N} (transformation of the ligated vector into the bacteria followed by expression of the library for 4.5 hours to yield a "naïve" library), and libraries chemically modified by aldehyde ligation and cysteine macrocyclization {M} characterized by paired-end deep sequencing, detected a significant drop in diversity in {L} → {N}, but only a minor compositional difference in {S} → {L} and {N} → {M}. Libraries expressed at the N-terminus of phage protein pIII censored positively charged amino acids Arg and Lys; libraries expressed between pIII domains N1 and N2 overcame Arg/Lys-censorship but introduced new bias towards Gly and Ser. Interrogation of biases arising from cPTM by aldehyde ligation and cysteine macrocyclization unveiled censorship of sequences with Ser/Phe. Analogous analysis can be used to explore library diversity in new display platforms and optimize cPTM of these libraries.
Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.

PubMed Central

Ananiev, E V; Phillips, R L; Rines, H W

1998-01-01

The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055
Selfish DNA: homing endonucleases find a home.

PubMed

Edgell, David R

2009-02-10

Self-splicing group I introns come in two flavours - those with a homing endonuclease to promote mobility of the intron, and those without an endonuclease. How homing endonucleases and self-splicing introns associate to form a composite selfish genetic element is a question of long-standing interest. Recent work has revealed that a shared characteristic of both introns and endonucleases, the targeting of conserved sequences, may provide the impetus for the evolution of composite mobile genetic elements.
Halophilic archaebacteria from the Kalamkass oil field

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zvyagintseva, I.S.; Belyaev, S.S.; Borzenkov, I.A.

1995-01-01

Two strains of halophilic archaebacteria, growing in a medium containing from 10 to 25% NaCl, were isolated from the brines of the Kalamkass (Mangyshlak) oil field. Both strains are extremely halophilic archaebacteria according to the complex of their phenotypic properties. Strain M-11 was identified as Haloferax mediterranei on the basis of the composition of polar lipids and DNA-DNA homology. The composition of polar lipids and 16S rRNA sequence of strain M-18 allowed us to assign it to the genus Haloferax. This strain differs from the approved species of the genus Haloferax, H. volcanii, and H. mediterranei. However, to describe itmore » as a new species, additional investigations are necessary. 13 refs., 3 figs.« less
A compositional segmentation of the human mitochondrial genome is related to heterogeneities in the guanine mutation rate

PubMed Central

Samuels, David C.; Boys, Richard J.; Henderson, Daniel A.; Chinnery, Patrick F.

2003-01-01

We applied a hidden Markov model segmentation method to the human mitochondrial genome to identify patterns in the sequence, to compare these patterns to the gene structure of mtDNA and to see whether these patterns reveal additional characteristics important for our understanding of genome evolution, structure and function. Our analysis identified three segmentation categories based upon the sequence transition probabilities. Category 2 segments corresponded to the tRNA and rRNA genes, with a greater strand-symmetry in these segments. Category 1 and 3 segments covered the protein- coding genes and almost all of the non-coding D-loop. Compared to category 1, the mtDNA segments assigned to category 3 had much lower guanine abundance. A comparison to two independent databases of mitochondrial mutations and polymorphisms showed that the high substitution rate of guanine in human mtDNA is largest in the category 3 segments. Analysis of synonymous mutations showed the same pattern. This suggests that this heterogeneity in the mutation rate is partly independent of respiratory chain function and is a direct property of the genome sequence itself. This has important implications for our understanding of mtDNA evolution and its use as a ‘molecular clock’ to determine the rate of population and species divergence. PMID:14530452
High-Throughput Sequencing of 16S rRNA Gene Amplicons: Effects of Extraction Procedure, Primer Length and Annealing Temperature

PubMed Central

Sergeant, Martin J.; Constantinidou, Chrystala; Cogan, Tristan; Penn, Charles W.; Pallen, Mark J.

2012-01-01

The analysis of 16S-rDNA sequences to assess the bacterial community composition of a sample is a widely used technique that has increased with the advent of high throughput sequencing. Although considerable effort has been devoted to identifying the most informative region of the 16S gene and the optimal informatics procedures to process the data, little attention has been paid to the PCR step, in particular annealing temperature and primer length. To address this, amplicons derived from 16S-rDNA were generated from chicken caecal content DNA using different annealing temperatures, primers and different DNA extraction procedures. The amplicons were pyrosequenced to determine the optimal protocols for capture of maximum bacterial diversity from a chicken caecal sample. Even at very low annealing temperatures there was little effect on the community structure, although the abundance of some OTUs such as Bifidobacterium increased. Using shorter primers did not reveal any novel OTUs but did change the community profile obtained. Mechanical disruption of the sample by bead beating had a significant effect on the results obtained, as did repeated freezing and thawing. In conclusion, existing primers and standard annealing temperatures captured as much diversity as lower annealing temperatures and shorter primers. PMID:22666455
High-throughput sequencing of 16S rRNA gene amplicons: effects of extraction procedure, primer length and annealing temperature.

PubMed

Sergeant, Martin J; Constantinidou, Chrystala; Cogan, Tristan; Penn, Charles W; Pallen, Mark J

2012-01-01

The analysis of 16S-rDNA sequences to assess the bacterial community composition of a sample is a widely used technique that has increased with the advent of high throughput sequencing. Although considerable effort has been devoted to identifying the most informative region of the 16S gene and the optimal informatics procedures to process the data, little attention has been paid to the PCR step, in particular annealing temperature and primer length. To address this, amplicons derived from 16S-rDNA were generated from chicken caecal content DNA using different annealing temperatures, primers and different DNA extraction procedures. The amplicons were pyrosequenced to determine the optimal protocols for capture of maximum bacterial diversity from a chicken caecal sample. Even at very low annealing temperatures there was little effect on the community structure, although the abundance of some OTUs such as Bifidobacterium increased. Using shorter primers did not reveal any novel OTUs but did change the community profile obtained. Mechanical disruption of the sample by bead beating had a significant effect on the results obtained, as did repeated freezing and thawing. In conclusion, existing primers and standard annealing temperatures captured as much diversity as lower annealing temperatures and shorter primers.
[Correlation of codon biases and potential secondary structures with mRNA translation efficiency in unicellular organisms].

PubMed

Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G

2007-01-01

Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.
Markov models of genome segmentation

NASA Astrophysics Data System (ADS)

Thakur, Vivek; Azad, Rajeev K.; Ramaswamy, Ram

2007-01-01

We introduce Markov models for segmentation of symbolic sequences, extending a segmentation procedure based on the Jensen-Shannon divergence that has been introduced earlier. Higher-order Markov models are more sensitive to the details of local patterns and in application to genome analysis, this makes it possible to segment a sequence at positions that are biologically meaningful. We show the advantage of higher-order Markov-model-based segmentation procedures in detecting compositional inhomogeneity in chimeric DNA sequences constructed from genomes of diverse species, and in application to the E. coli K12 genome, boundaries of genomic islands, cryptic prophages, and horizontally acquired regions are accurately identified.
Impact of library preparation protocols and template quantity on the metagenomic reconstruction of a mock microbial community

DOE PAGES

Bowers, Robert M.; Clum, Alicia; Tice, Hope; ...

2015-10-24

Background: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composi tion and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for largemore » initial DNA template quantities during library preparation. Results: Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions: This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities.« less
Impact of library preparation protocols and template quantity on the metagenomic reconstruction of a mock microbial community

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bowers, Robert M.; Clum, Alicia; Tice, Hope

Background: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composi tion and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for largemore » initial DNA template quantities during library preparation. Results: Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions: This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities.« less
The complete sequence of the mitochondrial genome of the African Penguin (Spheniscus demersus).

PubMed

Labuschagne, Christiaan; Kotzé, Antoinette; Grobler, J Paul; Dalton, Desiré L

2014-01-15

The complete mitochondrial genome of the African Penguin (Spheniscus demersus) was sequenced. The molecule was sequenced via next generation sequencing and primer walking. The size of the genome is 17,346 bp in length. Comparison with the mitochondrial DNA of two other penguin genomes that have so far been reported was conducted namely; Little blue penguin (Eudyptula minor) and the Rockhopper penguin (Eudyptes chrysocome). This analysis made it possible to identify common penguin mitochondrial DNA characteristics. The S. demersus mtDNA genome is very similar, both in composition and length to both the E. chrysocome and E. minor genomes. The gene content of the African penguin mitochondrial genome is typical of vertebrates and all three penguin species have the standard gene order originally identified in the chicken. The control region for S. demersus is located between tRNA-Glu and tRNA-Phe and all three species of penguins contain two sets of similar repeats with varying copy numbers towards the 3' end of the control region, accounting for the size variance. This is the first report of the complete nucleotide sequence for the mitochondrial genome of the African penguin, S. demersus. These results can be subsequently used to provide information for penguin phylogenetic studies and insights into the evolution of genomes. © 2013 Elsevier B.V. All rights reserved.
Novel features of ARS selection in budding yeast Lachancea kluyveri

PubMed Central

2011-01-01

Background The characterization of DNA replication origins in yeast has shed much light on the mechanisms of initiation of DNA replication. However, very little is known about the evolution of origins or the evolution of mechanisms through which origins are recognized by the initiation machinery. This lack of understanding is largely due to the vast evolutionary distances between model organisms in which origins have been examined. Results In this study we have isolated and characterized autonomously replicating sequences (ARSs) in Lachancea kluyveri - a pre-whole genome duplication (WGD) budding yeast. Through a combination of experimental work and rigorous computational analysis, we show that L. kluyveri ARSs require a sequence that is similar but much longer than the ARS Consensus Sequence well defined in Saccharomyces cerevisiae. Moreover, compared with S. cerevisiae and K. lactis, the replication licensing machinery in L. kluyveri seems more tolerant to variations in the ARS sequence composition. It is able to initiate replication from almost all S. cerevisiae ARSs tested and most Kluyveromyces lactis ARSs. In contrast, only about half of the L. kluyveri ARSs function in S. cerevisiae and less than 10% function in K. lactis. Conclusions Our findings demonstrate a replication initiation system with novel features and underscore the functional diversity within the budding yeasts. Furthermore, we have developed new approaches for analyzing biologically functional DNA sequences with ill-defined motifs. PMID:22204614

Novel features of ARS selection in budding yeast Lachancea kluyveri.

PubMed

Liachko, Ivan; Tanaka, Emi; Cox, Katherine; Chung, Shau Chee Claire; Yang, Lu; Seher, Arael; Hallas, Lindsay; Cha, Eugene; Kang, Gina; Pace, Heather; Barrow, Jasmine; Inada, Maki; Tye, Bik-Kwoon; Keich, Uri

2011-12-28

The characterization of DNA replication origins in yeast has shed much light on the mechanisms of initiation of DNA replication. However, very little is known about the evolution of origins or the evolution of mechanisms through which origins are recognized by the initiation machinery. This lack of understanding is largely due to the vast evolutionary distances between model organisms in which origins have been examined. In this study we have isolated and characterized autonomously replicating sequences (ARSs) in Lachancea kluyveri - a pre-whole genome duplication (WGD) budding yeast. Through a combination of experimental work and rigorous computational analysis, we show that L. kluyveri ARSs require a sequence that is similar but much longer than the ARS Consensus Sequence well defined in Saccharomyces cerevisiae. Moreover, compared with S. cerevisiae and K. lactis, the replication licensing machinery in L. kluyveri seems more tolerant to variations in the ARS sequence composition. It is able to initiate replication from almost all S. cerevisiae ARSs tested and most Kluyveromyces lactis ARSs. In contrast, only about half of the L. kluyveri ARSs function in S. cerevisiae and less than 10% function in K. lactis. Our findings demonstrate a replication initiation system with novel features and underscore the functional diversity within the budding yeasts. Furthermore, we have developed new approaches for analyzing biologically functional DNA sequences with ill-defined motifs.
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

PubMed Central

Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

1987-01-01

To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded. Images PMID:2823109
Comparative analysis of bacteria associated with different mosses by 16S rRNA and 16S rDNA sequencing.

PubMed

Tian, Yang; Li, Yan Hong

2017-01-01

To understand the differences of the bacteria associated with different mosses, a phylogenetic study of bacterial communities in three mosses was carried out based on 16S rDNA and 16S rRNA sequencing. The mosses used were Hygroamblystegium noterophilum, Entodon compressus and Grimmia montana, representing hygrophyte, shady plant and xerophyte, respectively. In total, the operational taxonomic units (OTUs), richness and diversity were different regardless of the moss species and the library level. All the examined 1183 clones were assigned to 248 OTUs, 56 genera were assigned in rDNA libraries and 23 genera were determined at the rRNA level. Proteobacteria and Bacteroidetes were considered as the most dominant phyla in all the libraries, whereas abundant Actinobacteria and Acidobacteria were detected in the rDNA library of Entodon compressus and approximately 24.7% clones were assigned to Candidate division TM7 in Grimmia montana at rRNA level. The heatmap showed the bacterial profiles derived from rRNA and rDNA were partly overlapping. However, the principle component analysis of all the profiles derived from rDNA showed sharper differences between the different mosses than that of rRNA-based profiles. This suggests that the metabolically active bacterial compositions in different mosses were more phylogenetically similar and the differences of the bacteria associated with different mosses were mainly detected at the rDNA level. Obtained results clearly demonstrate that combination of 16S rDNA and 16S rRNA sequencing is preferred approach to have a good understanding on the constitution of the microbial communities in mosses. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Evolution of EF-hand calcium-modulated proteins. IV. Exon shuffling did not determine the domain compositions of EF-hand proteins

NASA Technical Reports Server (NTRS)

Kretsinger, R. H.; Nakayama, S.

1993-01-01

In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.
Urinary cell-free DNA is a versatile analyte for monitoring infections of the urinary tract.

PubMed

Burnham, Philip; Dadhania, Darshana; Heyang, Michael; Chen, Fanny; Westblade, Lars F; Suthanthiran, Manikkam; Lee, John Richard; De Vlaminck, Iwijn

2018-06-20

Urinary tract infections are one of the most common infections in humans. Here we tested the utility of urinary cell-free DNA (cfDNA) to comprehensively monitor host and pathogen dynamics in bacterial and viral urinary tract infections. We isolated cfDNA from 141 urine samples from a cohort of 82 kidney transplant recipients and performed next-generation sequencing. We found that urinary cfDNA is highly informative about bacterial and viral composition of the microbiome, antimicrobial susceptibility, bacterial growth dynamics, kidney allograft injury, and host response to infection. These different layers of information are accessible from a single assay and individually agree with corresponding clinical tests based on quantitative PCR, conventional bacterial culture, and urinalysis. In addition, cfDNA reveals the frequent occurrence of pathologies that remain undiagnosed with conventional diagnostic protocols. Our work identifies urinary cfDNA as a highly versatile analyte to monitor infections of the urinary tract.
Application of Stochastic Labeling with Random-Sequence Barcodes for Simultaneous Quantification and Sequencing of Environmental 16S rRNA Genes.

PubMed

Hoshino, Tatsuhiko; Inagaki, Fumio

2017-01-01

Next-generation sequencing (NGS) is a powerful tool for analyzing environmental DNA and provides the comprehensive molecular view of microbial communities. For obtaining the copy number of particular sequences in the NGS library, however, additional quantitative analysis as quantitative PCR (qPCR) or digital PCR (dPCR) is required. Furthermore, number of sequences in a sequence library does not always reflect the original copy number of a target gene because of biases caused by PCR amplification, making it difficult to convert the proportion of particular sequences in the NGS library to the copy number using the mass of input DNA. To address this issue, we applied stochastic labeling approach with random-tag sequences and developed a NGS-based quantification protocol, which enables simultaneous sequencing and quantification of the targeted DNA. This quantitative sequencing (qSeq) is initiated from single-primer extension (SPE) using a primer with random tag adjacent to the 5' end of target-specific sequence. During SPE, each DNA molecule is stochastically labeled with the random tag. Subsequently, first-round PCR is conducted, specifically targeting the SPE product, followed by second-round PCR to index for NGS. The number of random tags is only determined during the SPE step and is therefore not affected by the two rounds of PCR that may introduce amplification biases. In the case of 16S rRNA genes, after NGS sequencing and taxonomic classification, the absolute number of target phylotypes 16S rRNA gene can be estimated by Poisson statistics by counting random tags incorporated at the end of sequence. To test the feasibility of this approach, the 16S rRNA gene of Sulfolobus tokodaii was subjected to qSeq, which resulted in accurate quantification of 5.0 × 103 to 5.0 × 104 copies of the 16S rRNA gene. Furthermore, qSeq was applied to mock microbial communities and environmental samples, and the results were comparable to those obtained using digital PCR and relative abundance based on a standard sequence library. We demonstrated that the qSeq protocol proposed here is advantageous for providing less-biased absolute copy numbers of each target DNA with NGS sequencing at one time. By this new experiment scheme in microbial ecology, microbial community compositions can be explored in more quantitative manner, thus expanding our knowledge of microbial ecosystems in natural environments.
The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.

PubMed

Schnitzler, P; Handermann, M; Szépe, O; Darai, G

1991-06-01

The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV) which has been localized between the coordinates 0.678 to 0.688 of the viral genome was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes for a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes was investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservations were detected at several regions of amino acid residues of the FLDV TK protein when compared to the amino acid sequence of TKs of African swine fever virus, fowlpox virus, shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.
Quantitative Viral Community DNA Analysis Reveals the Dominance of Single-Stranded DNA Viruses in Offshore Upper Bathyal Sediment from Tohoku, Japan

PubMed Central

Yoshida, Mitsuhiro; Mochizuki, Tomohiro; Urayama, Syun-Ichi; Yoshida-Takashima, Yukari; Nishi, Shinro; Hirai, Miho; Nomaki, Hidetaka; Takaki, Yoshihiro; Nunoura, Takuro; Takai, Ken

2018-01-01

Previous studies on marine environmental virology have primarily focused on double-stranded DNA (dsDNA) viruses; however, it has recently been suggested that single-stranded DNA (ssDNA) viruses are more abundant in marine ecosystems. In this study, we performed a quantitative viral community DNA analysis to estimate the relative abundance and composition of both ssDNA and dsDNA viruses in offshore upper bathyal sediment from Tohoku, Japan (water depth = 500 m). The estimated dsDNA viral abundance ranged from 3 × 106 to 5 × 106 genome copies per cm3 sediment, showing values similar to the range of fluorescence-based direct virus counts. In contrast, the estimated ssDNA viral abundance ranged from 1 × 108 to 3 × 109 genome copies per cm3 sediment, thus providing an estimation that the ssDNA viral populations represent 96.3–99.8% of the benthic total DNA viral assemblages. In the ssDNA viral metagenome, most of the identified viral sequences were associated with ssDNA viral families such as Circoviridae and Microviridae. The principle components analysis of the ssDNA viral sequence components from the sedimentary ssDNA viral metagenomic libraries found that the different depth viral communities at the study site all exhibited similar profiles compared with deep-sea sediment ones at other reference sites. Our results suggested that deep-sea benthic ssDNA viruses have been significantly underestimated by conventional direct virus counts and that their contributions to deep-sea benthic microbial mortality and geochemical cycles should be further addressed by such a new quantitative approach. PMID:29467725
Understanding microalgal species composition and contributions in Antarctic glacial melt water through rbcL high throughput sequencing

NASA Astrophysics Data System (ADS)

Barretto, K. M.; Kalmbach, A. J.; de la Torre, J. R.; Falcón, L. I.; Carpenter, E. J.

2016-02-01

The McMurdo Dry Valleys (MDV) in Antarctica present unique research opportunities, both because of the understudied biogeochemical impact of their microbial communities, and their sensitivity to climate change. Despite harsh desiccation, pH, and salinity stress, summer glacial melt water supports life in the MDV in the form of algal mats. These mat communities are complex in structure, with a network of dominant cyanobacteria interspersed with heterotrophic diazotrophs, smaller photoautotrophs, and thick extracellular polymeric substances. Due to their complexity, standard microscopy yields a limited understanding of community assemblages. Our previous high throughput sequencing (HTS) approaches focusing on 16S rRNA have profiled communities with understudied photosynthetic phyla such as Acidobacteria, Gemmatimonadetes, and Chloroflexi. To characterize these phototrophic communities, we are interested in (1) understanding their temporal dynamics and how the dominant cyanobacterial species influence community composition, (2) modeling how pH, nutrients, soil wetness, and temperature act as multivariate drivers of community composition, and (3) establishing a pipeline for HTS of the rbcL gene - which encodes the large subunit of the ubiquitous photosynthetic protein RuBisCO. Our initial screening of community DNA from MDV algal mats has shown the presence of Form IA, IB, and IC cbbL (an rbcL ortholog), and Form ID rbcL - indicating a relatively high degree of photoautotrophic diversity. Soil wetness drives anoxic conditions and we see that it shifts overall microbial composition - we expect photoautotrophs to respond similarly. We also expect photoautotrophic assemblages to shift with pH and soil nutrients. Our deep sequencing efforts suggest an inconsistency between indexing primers and algal DNA that could underestimate cyanobacterial and overestimate eukaryotic abundance. Resolving these issues with new approaches will allow us to more fully understand the dynamics of the MDV.
Complement component 3: characterization and association with mastitis resistance in Egyptian water buffalo and cattle.

PubMed

El-Halawany, Nermin; Abd-El-Monsif, Shawky A; Al-Tohamy Ahmed, F M; Hegazy, Lamees; Abdel-Shafy, Hamdy; Abdel-Latif, Magdy A; Ghazi, Yasser A; Neuhoff, Christiane; Salilew-Wondim, Dessie; Schellander, Karl

2017-03-01

Mastitis is an infectious disease of the mammary gland that leads to reduced milk production and change in milk composition. Complement component C3 plays a major role as a central molecule of the complement cascade involving in killing of microorganisms, either directly or in cooperation with phagocytic cells. C3 cDNA were isolated, from Egyptian buffalo and cattle, sequenced and characterized. The C3 cDNA sequences of buffalo and cattle consist of 5025 and 5019 bp, respectively. Buffalo and cattle C3 cDNAs share 99% of sequence identity with each other. The 4986 bp open reading frame in buffalo encodes a putative protein of 1661 amino acids-as in cattle-and includes all the functional domains. Further, analysis of the C3 cDNA sequences detected six novel single-nucleotide polymorphisms (SNPs) in buffalo and three novel SNPs in cattle. The association analysis of the detected SNPs with milk somatic cell score as an indicator of mastitis revealed that the most significant association in buffalo was found in the C>A substitution (ss: 1752816097) in exon 27, whereas in cattle it was in the C>T substitution (ss: 1752816085) in exon 12. Our findings provide preliminary information about the contribution of C3 polymorphisms to mastitis resistance in buffalo and cattle.
Oncogenic ras-driven cancer cell vesiculation leads to emission of double-stranded DNA capable of interacting with target cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, Tae Hoon; Chennakrishnaiah, Shilpa; Audemard, Eric

2014-08-22

Highlights: • Oncogenic H-ras stimulates emission of extracellular vesicles containing double-stranded DNA. • Vesicle-associated extracellular DNA contains mutant N-ras sequences. • Vesicles mediate intercellular transfer of mutant H-ras DNA to normal fibroblasts where it remains for several weeks. • Fibroblasts exposed to vesicles containing H-ras DNA exhibit increased proliferation. - Abstract: Cell free DNA is often regarded as a source of genetic cancer biomarkers, but the related mechanisms of DNA release, composition and biological activity remain unclear. Here we show that rat epithelial cell transformation by the human H-ras oncogene leads to an increase in production of small, exosomal-like extracellularmore » vesicles by viable cancer cells. These EVs contain chromatin-associated double-stranded DNA fragments covering the entire host genome, including full-length H-ras. Oncogenic N-ras and SV40LT sequences were also found in EVs emitted from spontaneous mouse brain tumor cells. Disruption of acidic sphingomyelinase and the p53/Rb pathway did not block emission of EV-related oncogenic DNA. Exposure of non-transformed RAT-1 cells to EVs containing mutant H-ras DNA led to the uptake and retention of this material for an extended (30 days) but transient period of time, and stimulated cell proliferation. Thus, our study suggests that H-ras-mediated transformation stimulates vesicular emission of this histone-bound oncogene, which may interact with non-transformed cells.« less
Two different size classes of 5S rDNA units coexisting in the same tandem array in the razor clam Ensis macha: is this region suitable for phylogeographic studies?

PubMed

Fernández-Tajes, Juan; Méndez, Josefina

2009-12-01

For a study of 5S ribosomal genes (rDNA) in the razor clam Ensis macha, the 5S rDNA region was amplified and sequenced. Two variants, so-called type I or short repeat (approximately 430 bp) and type II or long repeat (approximately 735 bp), appeared to be the main components of the 5S rDNA of this species. Their spacers differed markedly, both in length and nucleotide composition. The organization of the two variants was investigated by amplifying the genomic DNA with primers based on the sequence of the type I and type II spacers. PCR amplification products with primers EMLbF and EMSbR showed that the long and short repeats are associated within the same tandem array, suggesting an intermixed arrangement of both spacers. Nevertheless, amplifications carried out with inverse primers EMSinvF/R and EMLinvF/R revealed that some short and long repeats are contiguous in the same tandem array. This is the first report of the coexistence of two variable spacers in the same tandem array in bivalve mollusks.
Scar-less multi-part DNA assembly design automation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hillson, Nathan J.

The present invention provides a method of a method of designing an implementation of a DNA assembly. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding flanking homology sequences to each of the DNA oligos. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which tomore » assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding optimized overhang sequences to each of the DNA oligos.« less
Enrichment allows identification of diverse, rare elements in metagenomic resistome-virulome sequencing.

PubMed

Noyes, Noelle R; Weinroth, Maggie E; Parker, Jennifer K; Dean, Chris J; Lakin, Steven M; Raymond, Robert A; Rovira, Pablo; Doster, Enrique; Abdo, Zaid; Martin, Jennifer N; Jones, Kenneth L; Ruiz, Jaime; Boucher, Christina A; Belk, Keith E; Morley, Paul S

2017-10-17

Shotgun metagenomic sequencing is increasingly utilized as a tool to evaluate ecological-level dynamics of antimicrobial resistance and virulence, in conjunction with microbiome analysis. Interest in use of this method for environmental surveillance of antimicrobial resistance and pathogenic microorganisms is also increasing. In published metagenomic datasets, the total of all resistance- and virulence-related sequences accounts for < 1% of all sequenced DNA, leading to limitations in detection of low-abundance resistome-virulome elements. This study describes the extent and composition of the low-abundance portion of the resistome-virulome, using a bait-capture and enrichment system that incorporates unique molecular indices to count DNA molecules and correct for enrichment bias. The use of the bait-capture and enrichment system significantly increased on-target sequencing of the resistome-virulome, enabling detection of an additional 1441 gene accessions and revealing a low-abundance portion of the resistome-virulome that was more diverse and compositionally different than that detected by more traditional metagenomic assays. The low-abundance portion of the resistome-virulome also contained resistance genes with public health importance, such as extended-spectrum betalactamases, that were not detected using traditional shotgun metagenomic sequencing. In addition, the use of the bait-capture and enrichment system enabled identification of rare resistance gene haplotypes that were used to discriminate between sample origins. These results demonstrate that the rare resistome-virulome contains valuable and unique information that can be utilized for both surveillance and population genetic investigations of resistance. Access to the rare resistome-virulome using the bait-capture and enrichment system validated in this study can greatly advance our understanding of microbiome-resistome dynamics.
Bacterial communities in Great Barrier Reef calcareous sediments: Contrasting 16S rDNA libraries from nearshore and outer shelf reefs

NASA Astrophysics Data System (ADS)

Uthicke, S.; McGuire, K.

2007-03-01

Bacterial communities in eight 16S rDNA clone libraries from calcareous sediments were investigated to provide an assessment of the bacterial diversity on sediments of the Great Barrier Reef (GBR) and to investigate differences due to decreased water quality. Sample effort was spread across two locations on each of four coral reefs, with two reefs located nearshore and two reefs on the outer shelf to allow robust statistical comparison of nearshore reefs (subjected to enhanced runoff) and outer shelf reefs (pristine conditions). Out of 221 non-chimeric sequences, 189 (85.5%) were unique and only one sequence occurred in more than one library. Rarefaction analyses and coverage calculations indicated that only a small fraction of the diversity was sampled. Cluster analyses and comparison to published sequences indicated that sequences retrieved belonged to the α, γ and δ subdivision of the Proteobacteria (6.8, 29.4 and 13.6% of the total, respectively), Cytophaga-Flavobacteria-Bacteroidetes (CFB) group (20.4%), Cyanobacteria (5.4%), Planctomycetaceae (7.7%), Verrucomicrobiaceae (6.8%), Acidobacteriaceae (2.7%). Analysis of Similarity (ANOSIM, based on grouping all retrieved sequences into 9 phylogenetic groups) indicated that subtle differences do exist in the community composition between nearshore and outer shelf reefs. Similarity percentage analysis (SIMPER) indicated that Acidobacteriaceae and Cyanobacteriaceae were the main contributors to the dissimilarity. A significant difference between bacteria on nearshore and outer shelf reefs also existed on the molecular level ( FST = 0.008, p = 0.007 for all samples, 0.006, p = 0.022 when repeated sequences within libraries were removed). Thus, bacterial communities on carbonate sediments investigated were highly diverse and differences in community composition may provide important leads for the search for indicator species or communities for water quality differences.
De novo DNA methylation during monkey pre-implantation embryogenesis.

PubMed

Gao, Fei; Niu, Yuyu; Sun, Yi Eve; Lu, Hanlin; Chen, Yongchang; Li, Siguang; Kang, Yu; Luo, Yuping; Si, Chenyang; Yu, Juehua; Li, Chang; Sun, Nianqin; Si, Wei; Wang, Hong; Ji, Weizhi; Tan, Tao

2017-04-01

Critical epigenetic regulation of primate embryogenesis entails DNA methylome changes. Here we report genome-wide composition, patterning, and stage-specific dynamics of DNA methylation in pre-implantation rhesus monkey embryos as well as male and female gametes studied using an optimized tagmentation-based whole-genome bisulfite sequencing method. We show that upon fertilization, both paternal and maternal genomes undergo active DNA demethylation, and genome-wide de novo DNA methylation is also initiated in the same period. By the 8-cell stage, remethylation becomes more pronounced than demethylation, resulting in an increase in global DNA methylation. Promoters of genes associated with oxidative phosphorylation are preferentially remethylated at the 8-cell stage, suggesting that this mode of energy metabolism may not be favored. Unlike in rodents, X chromosome inactivation is not observed during monkey pre-implantation development. Our study provides the first comprehensive illustration of the 'wax and wane' phases of DNA methylation dynamics. Most importantly, our DNA methyltransferase loss-of-function analysis indicates that DNA methylation influences early monkey embryogenesis.
De novo DNA methylation during monkey pre-implantation embryogenesis

PubMed Central

Gao, Fei; Niu, Yuyu; Sun, Yi Eve; Lu, Hanlin; Chen, Yongchang; Li, Siguang; Kang, Yu; Luo, Yuping; Si, Chenyang; Yu, Juehua; Li, Chang; Sun, Nianqin; Si, Wei; Wang, Hong; Ji, Weizhi; Tan, Tao

2017-01-01

Critical epigenetic regulation of primate embryogenesis entails DNA methylome changes. Here we report genome-wide composition, patterning, and stage-specific dynamics of DNA methylation in pre-implantation rhesus monkey embryos as well as male and female gametes studied using an optimized tagmentation-based whole-genome bisulfite sequencing method. We show that upon fertilization, both paternal and maternal genomes undergo active DNA demethylation, and genome-wide de novo DNA methylation is also initiated in the same period. By the 8-cell stage, remethylation becomes more pronounced than demethylation, resulting in an increase in global DNA methylation. Promoters of genes associated with oxidative phosphorylation are preferentially remethylated at the 8-cell stage, suggesting that this mode of energy metabolism may not be favored. Unlike in rodents, X chromosome inactivation is not observed during monkey pre-implantation development. Our study provides the first comprehensive illustration of the 'wax and wane' phases of DNA methylation dynamics. Most importantly, our DNA methyltransferase loss-of-function analysis indicates that DNA methylation influences early monkey embryogenesis. PMID:28233770
Contribution of AT-, GC-, and methylated cytidine-rich DNA to chromatin composition in Malpighian tubule cell nuclei of Panstrongylus megistus (Hemiptera, Reduviidae).

PubMed

Alvarenga, Elenice M; Mondin, Mateus; Rodrigues, Vera L C C; Andrade, Larissa M; Vidal, Benedicto de Campos; Mello, Maria Luiza S

2012-11-01

The Malpighian tubule cell nuclei of male Panstrongylus megistus, a vector of Chagas disease, contain one chromocenter, which is composed solely of the Y chromosome. Considering that different chromosomes contribute to the composition of chromocenters in different triatomini species, the aim of this study was to determine the contribution of AT-, GC-, and methylated cytidine-rich DNA in the chromocenter as well as in euchromatin of Malpighian tubule cell nuclei of P. megistus in comparison with published data for Triatoma infestans. Staining with 4',6-diamidino-2-phenylindole/actinomycin D and chromomycin A(3)/distamycin, immunodetection of 5-methylcytidine and AgNOR test were used. The results revealed AT-rich/GC-poor DNA in the male chromocenter, but equally distributed AT and GC DNA sequences in male and female euchromatin, like in T. infestans. Accumulation of argyrophilic proteins encircling the chromocenter did not always correlate with that of GC-rich DNA. Methylated DNA identified by immunodetection was found sparsely distributed in the euchromatin of both sexes and at some points around the chromocenter edge, but it could not be considered responsible for chromatin condensation in the chromocenter, like in T. infestans. However, unlike in T. infestans, no correlation between the chromocenter AT-rich DNA and nucleolus organizing region (NOR) DNA was found in P. megistus. Copyright © 2011 Elsevier GmbH. All rights reserved.
Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics

DOE PAGES

Rinke, Christian; Low, Serene; Woodcroft, Ben J.; ...

2016-09-22

High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. For this study, we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diversemore » Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (~100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics.« less
Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rinke, Christian; Low, Serene; Woodcroft, Ben J.

High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. For this study, we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diversemore » Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (~100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics.« less

Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics

PubMed Central

Low, Serene; Raina, Jean-Baptiste; Skarshewski, Adam; Le, Xuyen H.; Butler, Margaret K.; Stocker, Roman; Seymour, Justin; Tyson, Gene W.

2016-01-01

High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. Here we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diverse Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (∼100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics. PMID:27688978
Exploring the Gastrointestinal "Nemabiome": Deep Amplicon Sequencing to Quantify the Species Composition of Parasitic Nematode Communities.

PubMed

Avramenko, Russell W; Redman, Elizabeth M; Lewis, Roy; Yazwinski, Thomas A; Wasmuth, James D; Gilleard, John S

2015-01-01

Parasitic helminth infections have a considerable impact on global human health as well as animal welfare and production. Although co-infection with multiple parasite species within a host is common, there is a dearth of tools with which to study the composition of these complex parasite communities. Helminth species vary in their pathogenicity, epidemiology and drug sensitivity and the interactions that occur between co-infecting species and their hosts are poorly understood. We describe the first application of deep amplicon sequencing to study parasitic nematode communities as well as introduce the concept of the gastro-intestinal "nemabiome". The approach is analogous to 16S rDNA deep sequencing used to explore microbial communities, but utilizes the nematode ITS-2 rDNA locus instead. Gastro-intestinal parasites of cattle were used to develop the concept, as this host has many well-defined gastro-intestinal nematode species that commonly occur as complex co-infections. Further, the availability of pure mono-parasite populations from experimentally infected cattle allowed us to prepare mock parasite communities to determine, and correct for, species representation biases in the sequence data. We demonstrate that, once these biases have been corrected, accurate relative quantitation of gastro-intestinal parasitic nematode communities in cattle fecal samples can be achieved. We have validated the accuracy of the method applied to field-samples by comparing the results of detailed morphological examination of L3 larvae populations with those of the sequencing assay. The results illustrate the insights that can be gained into the species composition of parasite communities, using grazing cattle in the mid-west USA as an example. However, both the technical approach and the concept of the 'nemabiome' have a wide range of potential applications in human and veterinary medicine. These include investigations of host-parasite and parasite-parasite interactions during co-infection, parasite epidemiology, parasite ecology and the response of parasite populations to both drug treatments and control programs.
Illumina MiSeq Sequencing for Preliminary Analysis of Microbiome Causing Primary Endodontic Infections in Egypt

PubMed Central

Azab, Marwa Mohamed; Fayyad, Dalia Mukhtar

2018-01-01

The use of high throughput next generation technologies has allowed more comprehensive analysis than traditional Sanger sequencing. The specific aim of this study was to investigate the microbial diversity of primary endodontic infections using Illumina MiSeq sequencing platform in Egyptian patients. Samples were collected from 19 patients in Suez Canal University Hospital (Endodontic Department) using sterile # 15K file and paper points. DNA was extracted using Mo Bio power soil DNA isolation extraction kit followed by PCR amplification and agarose gel electrophoresis. The microbiome was characterized on the basis of the V3 and V4 hypervariable region of the 16S rRNA gene by using paired-end sequencing on Illumina MiSeq device. MOTHUR software was used in sequence filtration and analysis of sequenced data. A total of 1858 operational taxonomic units at 97% similarity were assigned to 26 phyla, 245 families, and 705 genera. Four main phyla Firmicutes, Bacteroidetes, Proteobacteria, and Synergistetes were predominant in all samples. At genus level, Prevotella, Bacillus, Porphyromonas, Streptococcus, and Bacteroides were the most abundant. Illumina MiSeq platform sequencing can be used to investigate oral microbiome composition of endodontic infections. Elucidating the ecology of endodontic infections is a necessary step in developing effective intracanal antimicrobials. PMID:29849646
Population diversity of ammonium oxidizers investigated by specific PCR amplification

USGS Publications Warehouse

Ward, B.B.; Voytek, M.A.; Witzel, K.-P.

1997-01-01

The species composition of ammonia-oxidizing bacteria in aquatic environments was investigated using PCR primers for 16S rRNA genes to amplify specific subsets of the total ammonia-oxidizer population. The specificity of the amplification reactions was determined using total genomic DNA from known nitrifying strains and non-nitrifying strains identified as having similar rDNA sequences. Specificity of amplification was determined both for direct amplification, using the nitrifier specific primers, and with nested amplification, in which the nitrifier primers were used to reamplify a fragment obtained from direct amplification with Eubacterial universal primers. The present level of specificity allows the distinction between Nitrosomonas europaea, Nitrosomonas sp. (marine) and the other known ammonia-oxidizers in the beta subclass of the Proteobacteria. Using total DNA extracted from natural samples, we used direct amplification to determine presence/absence of different species groups. Species composition was found to differ among depths in vertical profiles of lake samples and among samples and enrichments from various other aquatic environments. Nested PCR yielded several more positive reactions, which implies that nitrifier DNA was present in most samples, but often at very low levels.
A genosensor for detection of consensus DNA sequence of Dengue virus using ZnO/Pt-Pd nanocomposites.

PubMed

Singhal, Chaitali; Pundir, C S; Narang, Jagriti

2017-11-15

An electrochemical genosensor based on Zinc oxide/platinum-palladium (ZnO/Pt-Pd) modified fluorine doped tin oxide (FTO) glass plate was fabricated for detection of consensus DNA sequence of Dengue virus (DENV) using methylene blue (MB) as an intercalating agent. To achieve it, probe DNA (PDNA) was immobilized on the surface of ZnO/Pt-Pd nanocomposites modified FTO electrode. The synthesized nano-composites were characterized by high resolution transmission electron microscopy (HRTEM), energy dispersive X-ray analysis (EDX), atomic force microscopy (AFM), scanning electron microscopy (SEM), UV-Vis spectroscopy, X-ray diffraction (XRD) analysis and Fourier transform infra-red (FTIR) spectroscopy. This PDNA modified electrode (PDNA/ZnO/Pt-Pd/FTO) served as a signal amplification platform for the detection of the target hybridized DNA (TDNA). The hybridization between PDNA and TDNA was detected by reduction in current, generated by interaction of anionic mediator, i.e., methylene blue (MB) with free guanine (3'G) of ssDNA. The sensor showed a dynamic linear range of 1 × 10 -6 M to 100 × 10 -6 M with LOD as 4.3 × 10 -5 M and LOQ as 9.5 × 10 -5 M. Till date, majorly serotype specific biosensors for dengue detection have been developed. The genosensor reported here eliminates the possibility of false result as in case of serotype specific DNA sensor. This is the report where conserved sequences present in all the serotypes of Dengue virus has been employed for fabrication of a genosensor. Copyright © 2017 Elsevier B.V. All rights reserved.
Degenerative Minimalism in the Genome of a Psyllid Endosymbiont

PubMed Central

Clark, Marta A.; Baumann, Linda; Thao, MyLo Ly; Moran, Nancy A.; Baumann, Paul

2001-01-01

Psyllids, like aphids, feed on plant phloem sap and are obligately associated with prokaryotic endosymbionts acquired through vertical transmission from an ancestral infection. We have sequenced 37 kb of DNA of the genome of Carsonella ruddii, the endosymbiont of psyllids, and found that it has a number of unusual properties revealing a more extreme case of degeneration than was previously reported from studies of eubacterial genomes, including that of the aphid endosymbiont Buchnera aphidicola. Among the unusual properties are an exceptionally low guanine-plus-cytosine content (19.9%), almost complete absence of intergenic spaces, operon fusion, and lack of the usual promoter sequences upstream of 16S rDNA. These features suggest the synthesis of long mRNAs and translational coupling. The most extreme instances of base compositional bias occur in the genes encoding proteins that have less highly conserved amino acid sequences; the guanine-plus-cytosine content of some protein-coding sequences is as low as 10%. The shift in base composition has a large effect on proteins: in polypeptides of C. ruddii, half of the residues consist of five amino acids with codons low in guanine plus cytosine. Furthermore, the proteins of C. ruddii are reduced in size, with an average of about 9% fewer amino acids than in homologous proteins of related bacteria. These observations suggest that the C. ruddii genome is not subject to constraints that limit the evolution of other known eubacteria. PMID:11222582
Synthetic oligonucleotide antigens modified with locked nucleic acids detect disease specific antibodies

NASA Astrophysics Data System (ADS)

Samuelsen, Simone V.; Solov'Yov, Ilia A.; Balboni, Imelda M.; Mellins, Elizabeth; Nielsen, Christoffer Tandrup; Heegaard, Niels H. H.; Astakhova, Kira

2016-10-01

New techniques to detect and quantify antibodies to nucleic acids would provide a significant advance over current methods, which often lack specificity. We investigate the potential of novel antigens containing locked nucleic acids (LNAs) as targets for antibodies. Particularly, employing molecular dynamics we predict optimal nucleotide composition for targeting DNA-binding antibodies. As a proof of concept, we address a problem of detecting anti-DNA antibodies that are characteristic of systemic lupus erythematosus, a chronic autoimmune disease with multiple manifestations. We test the best oligonucleotide binders in surface plasmon resonance studies to analyze binding and kinetic aspects of interactions between antigens and target DNA. These DNA and LNA/DNA sequences showed improved binding in enzyme-linked immunosorbent assay using human samples of pediatric lupus patients. Our results suggest that the novel method is a promising tool to create antigens for research and point-of-care monitoring of anti-DNA antibodies.
Genetic analysis of Fasciola isolates from cattle in Korea based on second internal transcribed spacer (ITS-2) sequence of nuclear ribosomal DNA.

PubMed

Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won

2011-09-01

Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in Genbank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At the position of 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence for the existence of three forms of Fasciola isolated from native cattle in Korea by genetic approach using ITS-2 sequence.
mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences.

PubMed

Links, Matthew G; Chaban, Bonnie; Hemmingsen, Sean M; Muirhead, Kevin; Hill, Janet E

2013-08-15

Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly.
Chicken skin virome analyzed by high-throughput sequencing shows a composition highly different from human skin.

PubMed

Denesvre, Caroline; Dumarest, Marine; Rémy, Sylvie; Gourichon, David; Eloit, Marc

2015-10-01

Recent studies show that human skin at homeostasis is a complex ecosystem whose virome include circular DNA viruses, especially papillomaviruses and polyomaviruses. To determine the chicken skin virome in comparison with human skin virome, a chicken swabs pool sample from fifteen indoor healthy chickens of five genetic backgrounds was examined for the presence of DNA viruses by high-throughput sequencing (HTS). The results indicate a predominance of herpesviruses from the Mardivirus genus, coming from either vaccinal origin or presumably asymptomatic infection. Despite the high sensitivity of the HTS method used herein to detect small circular DNA viruses, we did not detect any papillomaviruses, polyomaviruses, or circoviruses, indicating that these viruses may not be resident of the chicken skin. The results suggest that the turkey herpesvirus is a resident of chicken skin in vaccinated chickens. This study indicates major differences between the skin viromes of chickens and humans. The origin of this difference remains to be further studied in relation with skin physiology, environment, or virus population dynamics.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

2013-06-25

A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
Molecular simulations of assembly of functionalized spherical nanoparticles

NASA Astrophysics Data System (ADS)

Seifpour, Arezou

Precise assembly of nanoparticles is crucial for creating spatially engineered materials that can be used for photonics, photovoltaic, and metamaterials applications. One way to control nanoparticle assembly is by functionalizing the nanoparticle with ligands, such as polymers, DNA, and proteins, that can manipulate the interactions between the nanoparticles in the medium the particles are placed in. This thesis research aims to design ligands to provide a new route to the programmable assembly of nanoparticles. We first investigate using Monte Carlo simulation the effect of copolymer ligands on nanoparticle assembly. We first study a single nanoparticle grafted with many copolymer chains to understand how monomer sequence (e.g. alternating ABAB, or diblock AxBx) and chemistry of the copolymers affect the grafted chain conformation at various particle diameters, grafting densities, copolymer chain lengths, and monomer-monomer interactions in an implicit small molecule solvent. We find that the size of the grafted chain varies non-monotonically with increasing blockiness of the monomer sequence for a small particle diameter. From this first study, we selected the two sequences with the most different chain conformations---alternating and diblock---and studied the effect of the sequence and a range of monomer chemistries of the copolymer on the characteristics of assembly of multiple copolymer-functionalized nanoparticles. We find that the alternating sequence produces nanoclusters that are relatively isotropic, whereas diblock sequence tends to form anisotropic structures that are smaller and more compact when the block closer to the surface is attractive and larger loosely held together clusters when the outer block is attractive. Next, we conduct molecular dynamics simulations to study the effect of DNA ligands on nanoparticle assembly. Specifically we investigate the effect of grafted DNA strand composition (e.g. G/C content, placement and sequence) and bidispersity in DNA strand lengths on the thermodynamics and structure of assembly of functionalized nanoparticles. We find that higher G/C content increases cluster dissociation temperature for smaller particles. Placement of G/C block inward along the strand decreases number of neighbors within the assembled cluster. Finally, increased bidispersity in DNA strand lengths leads a distribution of inter-particle distances in the assembled cluster.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N [San Leandro, CA; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA; Young, Jennifer A [Berkeley, CA; Clague, David S [Livermore, CA

2011-01-18

A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
Soil Bacterial Community Shift Correlated with Change from Forest to Pasture Vegetation in a Tropical Soil

PubMed Central

Nüsslein, Klaus; Tiedje, James M.

1999-01-01

The change in vegetative cover of a Hawaiian soil from forest to pasture led to significant changes in the composition of the soil bacterial community. DNAs were extracted from both soil habitats and compared for the abundance of guanine-plus-cytosine (G+C) content, by analysis of abundance of phylotypes of small-subunit ribosomal DNA (SSU rDNA) amplified from fractions with 63 and 35% G+C contents, and by phylogenetic analysis of the dominant rDNA clones in the 63% G+C content fraction. All three methods showed differences between the forest and pasture habitats, providing evidence that vegetation had a strong influence on microbial community composition at three levels of taxon resolution. The forest soil DNA had a peak in G+C content of 61%, while the DNA of the pasture soil had a peak in G+C content of 67%. None of the dominant phylotypes found in the forest soil were detected in the pasture soil. For the 63% G+C fraction SSU rDNA sequence analysis of the three most dominant members revealed that their phyla changed from Fibrobacter and Syntrophomonas assemblages in the forest soil to Burkholderia and Rhizobium–Agrobacterium assemblages in the pasture soil. PMID:10427058
Rapid isolation of microsatellite DNAs and identification of polymorphic mitochondrial DNA regions in the fish rotan (Perccottus glenii) invading European Russia

USGS Publications Warehouse

King, Timothy L.; Eackles, Michael S.; Reshetnikov, Andrey N.

2015-01-01

Human-mediated translocations and subsequent large-scale colonization by the invasive fish rotan (Perccottus glenii Dybowski, 1877; Perciformes, Odontobutidae), also known as Amur or Chinese sleeper, has resulted in dramatic transformations of small lentic ecosystems. However, no detailed genetic information exists on population structure, levels of effective movement, or relatedness among geographic populations of P. glenii within the European part of the range. We used massively parallel genomic DNA shotgun sequencing on the semiconductor-based Ion Torrent Personal Genome Machine (PGM) sequencing platform to identify nuclear microsatellite and mitochondrial DNA sequences in P. glenii from European Russia. Here we describe the characterization of nine nuclear microsatellite loci, ascertain levels of allelic diversity, heterozygosity, and demographic status of P. glenii collected from Ilev, Russia, one of several initial introduction points in European Russia. In addition, we mapped sequence reads to the complete P. glenii mitochondrial DNA sequence to identify polymorphic regions. Nuclear microsatellite markers developed for P. glenii yielded sufficient genetic diversity to: (1) produce unique multilocus genotypes; (2) elucidate structure among geographic populations; and (3) provide unique perspectives for analysis of population sizes and historical demographics. Among 4.9 million filtered P. glenii Ion Torrent PGM sequence reads, 11,304 mapped to the mitochondrial genome (NC_020350). This resulted in 100 % coverage of this genome to a mean coverage depth of 102X. A total of 130 variable sites were observed between the publicly available genome from China and the studied composite mitochondrial genome. Among these, 82 were diagnostic and monomorphic between the mitochondrial genomes and distributed among 15 genome regions. The polymorphic sites (N = 48) were distributed among 11 mitochondrial genome regions. Our results also indicate that sequence reads generated from two three-hour runs on the Ion Torrent PGM can generate a sufficient number of nuclear and mitochondrial markers to improve understanding of the evolutionary and ecological dynamics of non-model and in particular, invasive species.
Development of a Prokaryotic Universal Primer for Simultaneous Analysis of Bacteria and Archaea Using Next-Generation Sequencing

PubMed Central

Takahashi, Shunsuke; Tomita, Junko; Nishioka, Kaori; Hisada, Takayoshi; Nishijima, Miyuki

2014-01-01

For the analysis of microbial community structure based on 16S rDNA sequence diversity, sensitive and robust PCR amplification of 16S rDNA is a critical step. To obtain accurate microbial composition data, PCR amplification must be free of bias; however, amplifying all 16S rDNA species with equal efficiency from a sample containing a large variety of microorganisms remains challenging. Here, we designed a universal primer based on the V3-V4 hypervariable region of prokaryotic 16S rDNA for the simultaneous detection of Bacteria and Archaea in fecal samples from crossbred pigs (Landrace×Large white×Duroc) using an Illumina MiSeq next-generation sequencer. In-silico analysis showed that the newly designed universal prokaryotic primers matched approximately 98.0% of Bacteria and 94.6% of Archaea rRNA gene sequences in the Ribosomal Database Project database. For each sequencing reaction performed with the prokaryotic universal primer, an average of 69,330 (±20,482) reads were obtained, of which archaeal rRNA genes comprised approximately 1.2% to 3.2% of all prokaryotic reads. In addition, the detection frequency of Bacteria belonging to the phylum Verrucomicrobia, including members of the classes Verrucomicrobiae and Opitutae, was higher in the NGS analysis using the prokaryotic universal primer than that performed with the bacterial universal primer. Importantly, this new prokaryotic universal primer set had markedly lower bias than that of most previously designed universal primers. Our findings demonstrate that the prokaryotic universal primer set designed in the present study will permit the simultaneous detection of Bacteria and Archaea, and will therefore allow for a more comprehensive understanding of microbial community structures in environmental samples. PMID:25144201
Sequences characterization of microsatellite DNA sequences in Pacific abalone ( Haliotis discus hannai)

NASA Astrophysics Data System (ADS)

Li, Qi; Akihiro, Kijima

2007-01-01

The microsatellite-enriched library was constructed using magnetic bead hybridization selection method, and the microsatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and fifty white colonies were screened using PCR-based technique, and 84 clones were identified to potentially contain microsatellite repeat motif. The 84 clones were sequenced, and 42 microsatellites and 4 minisatellites with a minimum of five repeats were found (13.1% of white colonies screened). Besides the motif of CA contained in the oligoprobe, we also found other 16 types of microsatellite repeats including a dinucleotide repeat, two tetranucleotide repeats, twelve pentanucleotide repeats and a hexanucleotide repeat. According to Weber (1990), the microsatellite sequences obtained could be categorized structurally into perfect repeats (73.3%), imperfect repeats (13.3%), and compound repeats (13.4%). Among the microsatellite repeats, relatively short arrays (<20 repeats) were most abundant, accounting for 75.0%. The largest length of microsatellites was 48 repeats, and the average number of repeats was 13.4. The data on the composition and length distribution of microsatellites obtained in the present study can be useful for choosing the repeat motifs for microsatellite isolation in other abalone species.
Comparison of pectin-degrading fungal communities in temperate forests using glycosyl hydrolase family 28 pectinase primers targeting Ascomycete fungi

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gacura, Matthew D.; Sprockett, Daniel D.; Heidenreich, Bess

Here, fungi have developed a wide assortment of enzymes to break down pectin, a prevalent polymer in plant cell walls that is important in plant defense and structure. One enzyme family used to degrade pectin is the glycosyl hydrolase family 28 (GH28). In this studywe developed primers for the amplification of GH28 coding genes from a database of 293 GH28 sequences from40 fungal genomes. The primerswere used to successfully amplify GH28 pectinases from all Ascomycota cultures tested, but only three out of seven Basidiomycota cultures. In addition, we further tested the primers in PCRs on metagenomic DNA extracted from senescedmore » tree leaves from different forest ecosystems, followed by cloning and sequencing. Taxonomic specificity for Ascomycota GH28 genes was tested by comparing GH28 composition in leaves to internal transcribed spacer (ITS) amplicon composition using pyrosequencing. All sequences obtained from GH28 primers were classified as Ascomycota; in contrast, ITS sequences indicated that fungal communitieswere up to 39% Basidiomycetes. Analysis of leaf samples indicated that both forest stand and ecosystemtype were important in structuring fungal communities. However, site played the prominent role in explaining GH28 composition, whereas ecosystem type was more important for ITS composition, indicating possible genetic drift between populations of fungi. Overall, these primers will have utility in understanding relationships between fungal community composition and ecosystem processes, as well as detection of potentially pathogenic Ascomycetes.« less
Comparison of pectin-degrading fungal communities in temperate forests using glycosyl hydrolase family 28 pectinase primers targeting Ascomycete fungi

DOE PAGES

Gacura, Matthew D.; Sprockett, Daniel D.; Heidenreich, Bess; ...

2016-02-17

Here, fungi have developed a wide assortment of enzymes to break down pectin, a prevalent polymer in plant cell walls that is important in plant defense and structure. One enzyme family used to degrade pectin is the glycosyl hydrolase family 28 (GH28). In this studywe developed primers for the amplification of GH28 coding genes from a database of 293 GH28 sequences from40 fungal genomes. The primerswere used to successfully amplify GH28 pectinases from all Ascomycota cultures tested, but only three out of seven Basidiomycota cultures. In addition, we further tested the primers in PCRs on metagenomic DNA extracted from senescedmore » tree leaves from different forest ecosystems, followed by cloning and sequencing. Taxonomic specificity for Ascomycota GH28 genes was tested by comparing GH28 composition in leaves to internal transcribed spacer (ITS) amplicon composition using pyrosequencing. All sequences obtained from GH28 primers were classified as Ascomycota; in contrast, ITS sequences indicated that fungal communitieswere up to 39% Basidiomycetes. Analysis of leaf samples indicated that both forest stand and ecosystemtype were important in structuring fungal communities. However, site played the prominent role in explaining GH28 composition, whereas ecosystem type was more important for ITS composition, indicating possible genetic drift between populations of fungi. Overall, these primers will have utility in understanding relationships between fungal community composition and ecosystem processes, as well as detection of potentially pathogenic Ascomycetes.« less
Identification of Cellulose-Responsive Bacterial and Fungal Communities in Geographically and Edaphically Different Soils by Using Stable Isotope Probing

PubMed Central

Eichorst, Stephanie A.

2012-01-01

Many bacteria and fungi are known to degrade cellulose in culture, but their combined response to cellulose in different soils is unknown. Replicate soil microcosms amended with [13C]cellulose were used to identify bacterial and fungal communities responsive to cellulose in five geographically and edaphically different soils. The diversity and composition of the cellulose-responsive communities were assessed by DNA-stable isotope probing combined with Sanger sequencing of small-subunit and large-subunit rRNA genes for the bacterial and fungal communities, respectively. In each soil, the 13C-enriched, cellulose-responsive communities were of distinct composition compared to the original soil community or 12C-nonenriched communities. The composition of cellulose-responsive taxa, as identified by sequence operational taxonomic unit (OTU) similarity, differed in each soil. When OTUs were grouped at the bacterial order level, we found that members of the Burkholderiales, Caulobacteriales, Rhizobiales, Sphingobacteriales, Xanthomonadales, and the subdivision 1 Acidobacteria were prevalent in the 13C-enriched DNA in at least three of the soils. The cellulose-responsive fungi were identified as members of the Trichocladium, Chaetomium, Dactylaria, and Arthrobotrys genera, along with two novel Ascomycota clusters, unique to one soil. Although similarities were identified in higher-level taxa among some soils, the composition of cellulose-responsive bacteria and fungi was generally unique to a certain soil type, suggesting a strong potential influence of multiple edaphic factors in shaping the community. PMID:22287013

Strain diversity and host specificity in bee gut symbionts revealed by deep sampling of single copy protein-coding sequences

PubMed Central

Powell, J. Elijah; Ratnayeke, Nalin; Moran, Nancy A.

2017-01-01

High throughput rRNA amplicon surveys of bacterial communities provide a rapid snapshot of taxonomic composition. But strains with nearly identical rRNA sequences often differ in gene repertoires and metabolic capabilities. To assess strain-level variation within Snodgrassella alvi, a gut symbiont of corbiculate bees, we performed deep sequencing on amplicons of a single copy coding gene (minD) as well as the 16S rDNA V4 region. We surveyed honey bees (Apis mellifera) sampled globally and 12 bumble bee species (Bombus) sampled from two regions of the USA. The minD analyses reveal that S. alvi contains far more strain diversity than is evident from 16S rDNA analysis. Many taxa inferred on the basis of 16S rDNA are shared between A. mellifera and Bombus species, but taxa inferred on the basis of minD are never shared and often are restricted to particular Bombus species. Clustering based on minD revealed that gut communities often reflect host species and geographic location. Both minD and 16S rDNA analyses indicate that strain diversity is higher in A. mellifera than in Bombus species. The minD locus flanks a 16S gene, enabling development of strain-specific 16S fluorescent probes to illuminate the spatial relationship of strains within the bee gut. PMID:27482856
Target sites for the transposition of rat long interspersed repeated DNA elements (LINEs) are not random.

PubMed Central

Furano, A V; Somerville, C C; Tsichlis, P N; D'Ambrosio, E

1986-01-01

The long interspersed repeated DNA family of rats (LINE or L1Rn family) contains about 40,000 6.7-kilobase (kb) long members (1). LINE members may be currently mobile since their presence or absence causes allelic variation at three single copy loci (2, 3): insulin 1, Moloney leukemia virus integration 2 (Mlvi-2) (4), and immunoglobulin heavy chain (Igh). To characterize target sites for LINE insertion, we compared the DNA sequences of the unoccupied Mlvi-2 target site, its LINE-containing allele, and several other LINE-containing sites. Although not homologous overall, the target sites share three characteristics: First, depending on the site, they are from 68% to 86% (A+T) compared to 58% (A+T) for total rat DNA (5). Depending on the site, a 7- to 15-bp target site sequence becomes duplicated and flanks the inserted LINE member. The second is a version (0 or 1 mismatch) of the hexanucleotide, TACTCA, which is also present in the LINE member, in a highly conserved region located just before the A-rich right end of the LINE member. The third is a stretch of alternating purine/pyrimidine (PQ). The A-rich right ends of different LINE members vary in length and composition, and the sequence of a particularly long one suggests that it contains the A-rich target site from a previous transposition. PMID:3012480
A silica sands-based method for faithful analysis of microbial communities and DNA isolation from a wide range of species.

PubMed

Liu, Xia; Xu, Yongdong; Li, Zhi; Jiang, Shengwei; Yao, Shuo; Wu, Rina; An, Yingfeng

2018-04-21

A silica sands-based method has been developed to isolate high quality genomic DNAs from cells of animals, plants and microorganisms, such as Hemisalanx prognathus, Spinacia oleracea, Pichia pastoris, Bacillus licheniformis and Escherichia coli. To the best of our knowledge, no DNA isolation method has so wide application until now. In addition, this method and a commercially available kit were compared in analysis of microbial communities using high-throughput 16s rDNA sequencing. As a result, the silica sands-based method was found to be even more efficient in isolating genomic DNA from gram-positive bacteria than the kit, indicating that it would become a very valuable choice to faithfully reflect the composition of microbial communities.
Complementary DNA sequencing and identification of mRNAs from the venomous gland of Agkistrodon piscivorus leucostoma.

PubMed

Jia, Ying; Cantu, Bruno A; Sánchez, Elda E; Pérez, John C

2008-06-15

To advance our knowledge on the snake venom composition and transcripts expressed in venom gland at the molecular level, we constructed a cDNA library from the venom gland of Agkistrodon piscivorus leucostoma for the generation of expressed sequence tags (ESTs) database. From the randomly sequenced 2112 independent clones, we have obtained ESTs for 1309 (62%) cDNAs, which showed significant deduced amino acid sequence similarity (scores >80) to previously characterized proteins in National Center for Biotechnology Information (NCBI) database. Ribosomal proteins make up 47 clones (2%) and the remaining 756 (36%) cDNAs represent either unknown identity or show BLASTX sequence identity scores of <80 with known GenBank accessions. The most highly expressed gene encoding phospholipase A(2) (PLA(2)) accounting for 35% of A. p. leucostoma venom gland cDNAs was identified and further confirmed by crude venom applied to sodium dodecyl sulfate/polyacrylamide gel electrophoresis (SDS-PAGE) electrophoresis and protein sequencing. A total of 180 representative genes were obtained from the sequence assemblies and deposited to EST database. Clones showing sequence identity to disintegrins, thrombin-like enzymes, hemorrhagic toxins, fibrinogen clotting inhibitors and plasminogen activators were also identified in our EST database. These data can be used to develop a research program that will help us identify genes encoding proteins that are of medical importance or proteins involved in the mechanisms of the toxin venom.
Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

NASA Astrophysics Data System (ADS)

Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

2017-07-01

DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.
Classification of Sharks in the Egyptian Mediterranean Waters Using Morphological and DNA Barcoding Approaches

PubMed Central

Moftah, Marie; Abdel Aziz, Sayeda H.; Elramah, Sara; Favereaux, Alexandre

2011-01-01

The identification of species constitutes the first basic step in phylogenetic studies, biodiversity monitoring and conservation. DNA barcoding, i.e. the sequencing of a short standardized region of DNA, has been proposed as a new tool for animal species identification. The present study provides an update on the composition of shark in the Egyptian Mediterranean waters off Alexandria, since the latest study to date was performed 30 years ago, DNA barcoding was used in addition to classical taxonomical methodologies. Thus, 51 specimen were DNA barcoded for a 667 bp region of the mitochondrial COI gene. Although DNA barcoding aims at developing species identification systems, some phylogenetic signals were apparent in the data. In the neighbor-joining tree, 8 major clusters were apparent, each of them containing individuals belonging to the same species, and most with 100% bootstrap value. This study is the first to our knowledge to use DNA barcoding of the mitochondrial COI gene in order to confirm the presence of species Squalus acanthias, Oxynotus centrina, Squatina squatina, Scyliorhinus canicula, Scyliorhinus stellaris, Mustelus mustelus, Mustelus punctulatus and Carcharhinus altimus in the Egyptian Mediterranean waters. Finally, our study is the starting point of a new barcoding database concerning shark composition in the Egyptian Mediterranean waters (Barcoding of Egyptian Mediterranean Sharks [BEMS], http://www.boldsystems.org/views/projectlist.php?&#Barcoding%20Fish%20%28FishBOL%29). PMID:22087242
Genetic composition and connectivity of the Antillean manatee (Trichechus manatus manatus) in Panama

USGS Publications Warehouse

Díaz-Ferguson, Edgardo; Hunter, Margaret; Guzmán, Héctor M.

2017-01-01

Genetic diversity and haplotype composition of the West Indian manatee (Trichechus manatus) population from the San San Pond Sak wetland in Bocas del Toro, Panama was studied using a segment of mitochondrial DNA (D’loop). No genetic information has been published to date for Panamanian populations. Due to the secretive behavior and small population size of the species in the area, DNA extraction was conducted from opportunistically collected fecal (N=20), carcass tissue (N=4) and bone (N=4) samples. However, after DNA processing only 10 samples provided good quality DNA for sequencing (3 fecal, 4 tissue and 3 bone samples). We found three haplotypes in total; two of these haplotypes are reported for the first time, J02 (N=3) and J03 (N=4), and one J01 was previously published (N=3). Genetic diversity showed similar values to previous studies conducted in other Caribbean regions with moderate values of nucleotide diversity (π= 0.00152) and haplotipic diversity (Hd= 0.57). Connectivity assessment was based on sequence similarity, genetic distance and genetic differentiation between San San population and other manatee populations previously studied. The J01 haplotype found in the Panamanian population is shared with populations in the Caribbean mainland and the Gulf of Mexico showing a reduced differentiation corroborated with Fst value between HSSPS and this region of 0.0094. In contrast, comparisons between our sequences and populations in the Eastern Caribbean (South American populations) and North Western Caribbean showed fewer similarities (Fst =0.049 and 0.058, respectively). These results corroborate previous phylogeographic patterns already established for manatee populations and situate Panamanian populations into the Belize and Mexico cluster. In addition, these findings will be a baseline for future studies and comparisons with manatees in other areas of Panama and Central America. These results should be considered to inform management decisions regarding conservation of genetic diversity, future controlled introductions, connectivity and effective population size of the West Indian manatee along the Central American corridor.
Biomonitoring of marine vertebrates in Monterey Bay using eDNA metabarcoding.

PubMed

Andruszkiewicz, Elizabeth A; Starks, Hilary A; Chavez, Francisco P; Sassoubre, Lauren M; Block, Barbara A; Boehm, Alexandria B

2017-01-01

Molecular analysis of environmental DNA (eDNA) can be used to assess vertebrate biodiversity in aquatic systems, but limited work has applied eDNA technologies to marine waters. Further, there is limited understanding of the spatial distribution of vertebrate eDNA in marine waters. Here, we use an eDNA metabarcoding approach to target and amplify a hypervariable region of the mitochondrial 12S rRNA gene to characterize vertebrate communities at 10 oceanographic stations spanning 45 km within the Monterey Bay National Marine Sanctuary (MBNMS). In this study, we collected three biological replicates of small volume water samples (1 L) at 2 depths at each of the 10 stations. We amplified fish mitochondrial DNA using a universal primer set. We obtained 5,644,299 high quality Illumina sequence reads from the environmental samples. The sequence reads were annotated to the lowest taxonomic assignment using a bioinformatics pipeline. The eDNA survey identified, to the lowest taxonomic rank, 7 families, 3 subfamilies, 10 genera, and 72 species of vertebrates at the study sites. These 92 distinct taxa come from 33 unique marine vertebrate families. We observed significantly different vertebrate community composition between sampling depths (0 m and 20/40 m deep) across all stations and significantly different communities at stations located on the continental shelf (<200 m bottom depth) versus in the deeper waters of the canyons of Monterey Bay (>200 m bottom depth). All but 1 family identified using eDNA metabarcoding is known to occur in MBNMS. The study informs the implementation of eDNA metabarcoding for vertebrate biomonitoring.
MitoAge: a database for comparative analysis of mitochondrial DNA, with a special focus on animal longevity.

PubMed

Toren, Dmitri; Barzilay, Thomer; Tacutu, Robi; Lehmann, Gilad; Muradian, Khachik K; Fraifeld, Vadim E

2016-01-04

Mitochondria are the only organelles in the animal cells that have their own genome. Due to a key role in energy production, generation of damaging factors (ROS, heat), and apoptosis, mitochondria and mtDNA in particular have long been considered one of the major players in the mechanisms of aging, longevity and age-related diseases. The rapidly increasing number of species with fully sequenced mtDNA, together with accumulated data on longevity records, provides a new fascinating basis for comparative analysis of the links between mtDNA features and animal longevity. To facilitate such analyses and to support the scientific community in carrying these out, we developed the MitoAge database containing calculated mtDNA compositional features of the entire mitochondrial genome, mtDNA coding (tRNA, rRNA, protein-coding genes) and non-coding (D-loop) regions, and codon usage/amino acids frequency for each protein-coding gene. MitoAge includes 922 species with fully sequenced mtDNA and maximum lifespan records. The database is available through the MitoAge website (www.mitoage.org or www.mitoage.info), which provides the necessary tools for searching, browsing, comparing and downloading the data sets of interest for selected taxonomic groups across the Kingdom Animalia. The MitoAge website assists in statistical analysis of different features of the mtDNA and their correlative links to longevity. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Repetitive DNA in the pea (Pisum sativum L.) genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula

PubMed Central

Macas, Jiří; Neumann, Pavel; Navrátilová, Alice

2007-01-01

Background Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum). Results Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula. Conclusion We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. These data provide a starting point for further investigations of legume plant genomes based on their global comparative analysis and for the development of more sophisticated approaches for data mining. PMID:18031571
Linear Lepidopteran ambidensovirus 1 sequences drive random integration of a reporter gene in transfected Spodoptera frugiperda cells.

PubMed

Rizk, Francine; Laverdure, Sylvain; d'Alençon, Emmanuelle; Bossin, Hervé; Dupressoir, Thierry

2018-01-01

The Lepidopteran ambidensovirus 1 isolated from Junonia coenia (hereafter JcDV) is an invertebrate parvovirus considered as a viral transduction vector as well as a potential tool for the biological control of insect pests. Previous works showed that JcDV-based circular plasmids experimentally integrate into insect cells genomic DNA. In order to approach the natural conditions of infection and possible integration, we generated linear JcDV- gfp based molecules which were transfected into non permissive Spodoptera frugiperda ( Sf9 ) cultured cells. Cells were monitored for the expression of green fluorescent protein (GFP) and DNA was analyzed for integration of transduced viral sequences. Non-structural protein modulation of the VP-gene cassette promoter activity was additionally assayed. We show that linear JcDV-derived molecules are capable of long term genomic integration and sustained transgene expression in Sf9 cells. As expected, only the deletion of both inverted terminal repeats (ITR) or the polyadenylation signals of NS and VP genes dramatically impairs the global transduction/expression efficiency. However, all the integrated viral sequences we characterized appear "scrambled" whatever the viral content of the transfected vector. Despite a strong GFP expression, we were unable to recover any full sequence of the original constructs and found rearranged viral and non-viral sequences as well. Cellular flanking sequences were identified as non-coding ones. On the other hand, the kinetics of GFP expression over time led us to investigate the apparent down-regulation by non-structural proteins of the VP-gene cassette promoter. Altogether, our results show that JcDV-derived sequences included in linear DNA molecules are able to drive efficiently the integration and expression of a foreign gene into the genome of insect cells, whatever their composition, provided that at least one ITR is present. However, the transfected sequences were extensively rearranged with cellular DNA during or after random integration in the host cell genome. Lastly, the non-structural proteins seem to participate in the regulation of p9 promoter activity rather than to the integration of viral sequences.
Sequence Analysis of Changes in Microbial Composition in Different Milk Products During Fermentation and Storage.

PubMed

Zalewska, Barbora; Kaevska, Marija; Slana, Iva

2018-02-01

The objective of this study was to analyze the changes in the microbiota of milk products during fermentation and storage. Two kinds of Yoghurt, one Kefir, and one Acidophilus milk were observed during the fermentation process and storage using 16S rDNA amplicon sequencing. Cow's, goat's, raw and pasteurized milk were also examined. The most represented organisms in all manufactured products were shown to be those of the phylum Firmicutes. In some products, Proteobacteria, Bacteroidetes and Actinobacteria were also present in high amounts.
Large-Scale Concatenation cDNA Sequencing

PubMed Central

Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

1997-01-01

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
Synthesis of DNA

DOEpatents

Mariella, Jr., Raymond P.

2008-11-18

A method of synthesizing a desired double-stranded DNA of a predetermined length and of a predetermined sequence. Preselected sequence segments that will complete the desired double-stranded DNA are determined. Preselected segment sequences of DNA that will be used to complete the desired double-stranded DNA are provided. The preselected segment sequences of DNA are assembled to produce the desired double-stranded DNA.
Composition and Dynamics of Bacterial Communities of a Drinking Water Supply System as Assessed by RNA- and DNA-Based 16S rRNA Gene Fingerprinting

PubMed Central

Eichler, Stefan; Christen, Richard; Höltje, Claudia; Westphal, Petra; Bötel, Julia; Brettar, Ingrid; Mehling, Arndt; Höfle, Manfred G.

2006-01-01

Bacterial community dynamics of a whole drinking water supply system (DWSS) were studied from source to tap. Raw water for this DWSS is provided by two reservoirs with different water characteristics in the Harz mountains of Northern Germany. Samples were taken after different steps of treatment of raw water (i.e., flocculation, sand filtration, and chlorination) and at different points along the supply system to the tap. RNA and DNA were extracted from the sampled water. The 16S rRNA or its genes were partially amplified by reverse transcription-PCR or PCR and analyzed by single-strand conformation polymorphism community fingerprints. The bacterial community structures of the raw water samples from the two reservoirs were very different, but no major changes of these structures occurred after flocculation and sand filtration. Chlorination of the processed raw water strongly affected bacterial community structure, as reflected by the RNA-based fingerprints. This effect was less pronounced for the DNA-based fingerprints. After chlorination, the bacterial community remained rather constant from the storage containers to the tap. Furthermore, the community structure of the tap water did not change substantially for several months. Community composition was assessed by sequencing of abundant bands and phylogenetic analysis of the sequences obtained. The taxonomic compositions of the bacterial communities from both reservoirs were very different at the species level due to their different limnologies. On the other hand, major taxonomic groups, well known to occur in freshwater, such as Alphaproteobacteria, Betaproteobacteria, and Bacteroidetes, were found in both reservoirs. Significant differences in the detection of the major groups were observed between DNA-based and RNA-based fingerprints irrespective of the reservoir. Chlorination of the drinking water seemed to promote growth of nitrifying bacteria. Detailed analysis of the community dynamics of the whole DWSS revealed a significant influence of both source waters on the overall composition of the drinking water microflora and demonstrated the relevance of the raw water microflora for the drinking water microflora provided to the end user. PMID:16517632
Nanopore Technology: A Simple, Inexpensive, Futuristic Technology for DNA Sequencing.

PubMed

Gupta, P D

2016-10-01

In health care, importance of DNA sequencing has been fully established. Sanger's Capillary Electrophoresis DNA sequencing methodology is time consuming, cumbersome, hence become more expensive. Lately, because of its versatility DNA sequencing became house hold name, and therefore, there is an urgent need of simple, fast, inexpensive, DNA sequencing technology. In the beginning of this century efforts were made, and Nanopore DNA sequencing technology was developed; still it is infancy, nevertheless, it is the futuristic technology.
The genome-wide DNA sequence specificity of the anti-tumour drug bleomycin in human cells.

PubMed

Murray, Vincent; Chen, Jon K; Tanaka, Mark M

2016-07-01

The cancer chemotherapeutic agent, bleomycin, cleaves DNA at specific sites. For the first time, the genome-wide DNA sequence specificity of bleomycin breakage was determined in human cells. Utilising Illumina next-generation DNA sequencing techniques, over 200 million bleomycin cleavage sites were examined to elucidate the bleomycin genome-wide DNA selectivity. The genome-wide bleomycin cleavage data were analysed by four different methods to determine the cellular DNA sequence specificity of bleomycin strand breakage. For the most highly cleaved DNA sequences, the preferred site of bleomycin breakage was at 5'-GT* dinucleotide sequences (where the asterisk indicates the bleomycin cleavage site), with lesser cleavage at 5'-GC* dinucleotides. This investigation also determined longer bleomycin cleavage sequences, with preferred cleavage at 5'-GT*A and 5'- TGT* trinucleotide sequences, and 5'-TGT*A tetranucleotides. For cellular DNA, the hexanucleotide DNA sequence 5'-RTGT*AY (where R is a purine and Y is a pyrimidine) was the most highly cleaved DNA sequence. It was striking that alternating purine-pyrimidine sequences were highly cleaved by bleomycin. The highest intensity cleavage sites in cellular and purified DNA were very similar although there were some minor differences. Statistical nucleotide frequency analysis indicated a G nucleotide was present at the -3 position (relative to the cleavage site) in cellular DNA but was absent in purified DNA.
ampliMethProfiler: a pipeline for the analysis of CpG methylation profiles of targeted deep bisulfite sequenced amplicons.

PubMed

Scala, Giovanni; Affinito, Ornella; Palumbo, Domenico; Florio, Ermanno; Monticelli, Antonella; Miele, Gennaro; Chiariotti, Lorenzo; Cocozza, Sergio

2016-11-25

CpG sites in an individual molecule may exist in a binary state (methylated or unmethylated) and each individual DNA molecule, containing a certain number of CpGs, is a combination of these states defining an epihaplotype. Classic quantification based approaches to study DNA methylation are intrinsically unable to fully represent the complexity of the underlying methylation substrate. Epihaplotype based approaches, on the other hand, allow methylation profiles of cell populations to be studied at the single molecule level. For such investigations, next-generation sequencing techniques can be used, both for quantitative and for epihaplotype analysis. Currently available tools for methylation analysis lack output formats that explicitly report CpG methylation profiles at the single molecule level and that have suited statistical tools for their interpretation. Here we present ampliMethProfiler, a python-based pipeline for the extraction and statistical epihaplotype analysis of amplicons from targeted deep bisulfite sequencing of multiple DNA regions. ampliMethProfiler tool provides an easy and user friendly way to extract and analyze the epihaplotype composition of reads from targeted bisulfite sequencing experiments. ampliMethProfiler is written in python language and requires a local installation of BLAST and (optionally) QIIME tools. It can be run on Linux and OS X platforms. The software is open source and freely available at http://amplimethprofiler.sourceforge.net .
Land use type significantly affects microbial gene transcription in soil.

PubMed

Nacke, Heiko; Fischer, Christiane; Thürmer, Andrea; Meinicke, Peter; Daniel, Rolf

2014-05-01

Soil microorganisms play an essential role in sustaining biogeochemical processes and cycling of nutrients across different land use types. To gain insights into microbial gene transcription in forest and grassland soil, we isolated mRNA from 32 sampling sites. After sequencing of generated complementary DNA (cDNA), a total of 5,824,229 sequences could be further analyzed. We were able to assign nonribosomal cDNA sequences to all three domains of life. A dominance of bacterial sequences, which were affiliated to 25 different phyla, was found. Bacterial groups capable of aromatic compound degradation such as Phenylobacterium and Burkholderia were detected in significantly higher relative abundance in forest soil than in grassland soil. Accordingly, KEGG pathway categories related to degradation of aromatic ring-containing molecules (e.g., benzoate degradation) were identified in high abundance within forest soil-derived metatranscriptomic datasets. The impact of land use type forest on community composition and activity is evidently to a high degree caused by the presence of wood breakdown products. Correspondingly, bacterial groups known to be involved in lignin degradation and containing ligninolytic genes such as Burkholderia, Bradyrhizobium, and Azospirillum exhibited increased transcriptional activity in forest soil. Higher solar radiation in grassland presumably induced increased transcription of photosynthesis-related genes within this land use type. This is in accordance with high abundance of photosynthetic organisms and plant-infecting viruses in grassland.
Application of denaturing gradient gel electrophoresis (DGGE) to the analysis of microbial communities of subgingival plaque.

PubMed

Fujimoto, C; Maeda, H; Kokeguchi, S; Takashiba, S; Nishimura, F; Arai, H; Fukui, K; Murayama, Y

2003-08-01

Denaturing gradient gel electrophoresis (DGGE) was applied to the microbiologic examination of subgingival plaque. The PCR primers were designed from conserved nucleotide sequences on 16S ribosomal RNA gene (16SrDNA) with GC rich clamp at the 5'-end. Polymerase chain reaction (PCR) was performed using the primers and genomic DNAs of typical periodontal bacteria. The generated 16SrDNA fragments were separated by denaturing gel. Although the sizes of the amplified DNA fragments were almost the same among the species, 16SrDNAs of the periodontal bacteria were distinguished according to their specific sequences. The microflora of clinical plaque samples were profiled by the PCR-DGGE method, and the dominant 16SrDNA bands were cloned and sequenced. Simultaneously, Actinobacillus actinomycetemcomitans, Porphyromonas gingivalis and Prevotella intermedia were detected by an ordinary PCR method. In the deep periodontal pockets, the bacterial community structures were complicated and P. gingivalis was the most dominant species, whereas the DGGE profiles were simple and Streptococcus or Neisseria species were dominant in the shallow pockets. The species-specific PCR method revealed the presence of A. actinomycetemcomitans, P. gingivalis and P. intermedia in the clinical samples. However, corresponding bands were not always observed in the DGGE profiles, indicating a lower sensitivity of the DGGE method. Although the DGGE method may have a lower sensitivity than the ordinary PCR methods, it could visualize the bacterial qualitative compositions and reveal the major species of the plaque. The DGGE analysis and following sequencing may have the potential to be a promising bacterial examination procedure in periodontal diseases.

A convenient and adaptable package of DNA sequence analysis programs for microcomputers.

PubMed Central

Pustell, J; Kafatos, F C

1982-01-01

We describe a package of DNA data handling and analysis programs designed for microcomputers. The package is convenient for immediate use by persons with little or no computer experience, and has been optimized by trial in our group for a year. By typing a single command, the user enters a system which asks questions or gives instructions in English. The system will enter, alter, and manage sequence files or a restriction enzyme library. It generates the reverse complement, translates, calculates codon usage, finds restriction sites, finds homologies with various degrees of mismatch, and graphs amino acid composition or base frequencies. A number of options for data handling and printing can be used to produce figures for publication. The package will be available in ANSI Standard FORTRAN for use with virtually any FORTRAN compiler. PMID:6278412
Characterization of a fused protein specified by the adenovirus type 2-simian virus 40 hybrid Ad2+ND1 dp2.

PubMed Central

Fey, G; Lewis, J B; Grodzicker, T; Bothwell, A

1979-01-01

The adenovirus type 2-simian virus 40 (SV40) hybrid virus Ad2+ND1 dp2 (E. Lukanidin, manuscript in preparation) specified two proteins (molecular weights, 24,000 and 23,000) that are, in part, products of an insertion of SV40 early DNA sequences. This was demonstrated by translation in vitro from viral mRNA that had been selected by hybridization to SV40 DNA. These two phosphorylated, nonvirion proteins were produced late in infection in amounts similar to adenovirus 2 structural proteins and were closely related to each other in tryptic peptide composition. The portion of SV40 DNA (map units 0.17 to 0.22 on the SV40 genome) coding for these proteins was joined to sequences coding for the amino-terminal part of the adenovirus type 2 structural protein IV (fiber). The Ad2+ND1 dp2 23,000- and 24,000-molecular-weight proteins were hybrid polypeptides, with about two-thirds of their tryptic peptides contributed by the fiber protein and the remainder contributed by SV40 T-antigen. They shared with T-antigen (molecular weight, 96,000) a carboxy-terminal proline-rich tryptic peptide. Together, the tryptic peptide composition of these proteins and the known SV40 DNA sequences suggested the reading frame for the translation of T-antigen. The carboxy terminus for T-anigen would then be located on the SV40 genome map next to the TAA terminator triplet at position 0.175, 910 bases away from the cleavage site of the restriction endonuclease EcoRI. Seven host range mutants from Ad2+ND1 dp2 were isolated that had lost the capacity to propagate on monkey cells. They did not induce detectable levels of the hybrid proteins. Three of these mutants had lost the SV40 DNA insertion that codes in part for these proteins. Thus, in analogy to the Ad2+ND1 30,000-molecular-weight protein, the presence of these proteins correlates with the presence of the helper function for adenovirus replication on monkey cells. Images PMID:225516
First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

PubMed Central

Liu, Di; Zeng, Shao-Hua; Chen, Jian-Jun; Zhang, Yan-Jun; Xiao, Gong; Zhu, Lin-Yao; Wang, Ying

2013-01-01

Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant. PMID:23807511
Sequence and Structure Dependent DNA-DNA Interactions

NASA Astrophysics Data System (ADS)

Kopchick, Benjamin; Qiu, Xiangyun

Molecular forces between dsDNA strands are largely dominated by electrostatics and have been extensively studied. Quantitative knowledge has been accumulated on how DNA-DNA interactions are modulated by varied biological constituents such as ions, cationic ligands, and proteins. Despite its central role in biology, the sequence of DNA has not received substantial attention and ``random'' DNA sequences are typically used in biophysical studies. However, ~50% of human genome is composed of non-random-sequence DNAs, particularly repetitive sequences. Furthermore, covalent modifications of DNA such as methylation play key roles in gene functions. Such DNAs with specific sequences or modifications often take on structures other than the canonical B-form. Here we present series of quantitative measurements of the DNA-DNA forces with the osmotic stress method on different DNA sequences, from short repeats to the most frequent sequences in genome, and to modifications such as bromination and methylation. We observe peculiar behaviors that appear to be strongly correlated with the incurred structural changes. We speculate the causalities in terms of the differences in hydration shell and DNA surface structures.
Studies on bacterial community composition are affected by the time and storage method of the rumen content

PubMed Central

Duarte Messana, Juliana; Takeshi Kishi, Luciano; Lino Dias, Ana Veronica; Berchielli, Telma Teresinha

2017-01-01

The objective of this study was to investigate three storage methods and four storage times for rumen sampling in terms of quality and yield of extracted metagenomic DNA as well as the composition of the rumen bacterial community. One Nellore steer fitted with a ruminal silicone-type cannula was used as a donor of ruminal contents. The experiment comprised 11 experimental groups: pellet control (PC), lyophilized control (LC), P-20: pellet stored frozen at -20°C for a period of 3, 6, and 12 months, P-80: pellet stored frozen at -80°C for a period of 3, 6, and 12 months, and L-20: lyophilized sample stored frozen at -20°C for a period of 3, 6, and 12 months. Metagenomic DNA concentrations were measured spectrophotometrically and fluorometrically and ion torrent sequencing was used to assess the bacterial community composition. The L-20 method could not maintain the yield of DNA during storage. In addition, the P-80 group showed a greater yield of metagenomic DNA than the other groups after 6 months of storage. Rumen samples stored as pellets (P-20 and P-80) resulted in lower richness Chao 1, ACE, and Shannon Wiener indices when compared to PC, while LC and PC were only different in richness ACE. The storage method and storage time influenced the proportions of 14 of 17 phyla identified by sequencing. In the P-20 group, the proportion of Cyanobacteria, Elusimicrobia, Fibrobacteres, Lentisphaerae, Proteobacteria, and Spirochaetes phyla identified was lower than 1%. In the P-80 group, there was an increase in the proportion of the Bacteroidetes phylum (p = 0.010); however, the proportion of Actinobacteria, Chloroflexi, SR1, Synergistetes, TM7, and WPS.2 phyla were unchanged compared to the PC group (p > 0.05). The class Clostridium was the most abundant in all stored groups and increased in its proportion, especially in the L-20 group. The rumen sample storage time significantly reduced the yield of metagenomic DNA extracted. Therefore, the storage method can influence the abundance of phyla, classes, and bacterial families studied in rumen samples and affect the richness and diversity index. PMID:28453579
Studies on bacterial community composition are affected by the time and storage method of the rumen content.

PubMed

Granja-Salcedo, Yury Tatiana; Ramirez-Uscategui, Ricardo Andrés; Machado, Elwi Guillermo; Duarte Messana, Juliana; Takeshi Kishi, Luciano; Lino Dias, Ana Veronica; Berchielli, Telma Teresinha

2017-01-01

The objective of this study was to investigate three storage methods and four storage times for rumen sampling in terms of quality and yield of extracted metagenomic DNA as well as the composition of the rumen bacterial community. One Nellore steer fitted with a ruminal silicone-type cannula was used as a donor of ruminal contents. The experiment comprised 11 experimental groups: pellet control (PC), lyophilized control (LC), P-20: pellet stored frozen at -20°C for a period of 3, 6, and 12 months, P-80: pellet stored frozen at -80°C for a period of 3, 6, and 12 months, and L-20: lyophilized sample stored frozen at -20°C for a period of 3, 6, and 12 months. Metagenomic DNA concentrations were measured spectrophotometrically and fluorometrically and ion torrent sequencing was used to assess the bacterial community composition. The L-20 method could not maintain the yield of DNA during storage. In addition, the P-80 group showed a greater yield of metagenomic DNA than the other groups after 6 months of storage. Rumen samples stored as pellets (P-20 and P-80) resulted in lower richness Chao 1, ACE, and Shannon Wiener indices when compared to PC, while LC and PC were only different in richness ACE. The storage method and storage time influenced the proportions of 14 of 17 phyla identified by sequencing. In the P-20 group, the proportion of Cyanobacteria, Elusimicrobia, Fibrobacteres, Lentisphaerae, Proteobacteria, and Spirochaetes phyla identified was lower than 1%. In the P-80 group, there was an increase in the proportion of the Bacteroidetes phylum (p = 0.010); however, the proportion of Actinobacteria, Chloroflexi, SR1, Synergistetes, TM7, and WPS.2 phyla were unchanged compared to the PC group (p > 0.05). The class Clostridium was the most abundant in all stored groups and increased in its proportion, especially in the L-20 group. The rumen sample storage time significantly reduced the yield of metagenomic DNA extracted. Therefore, the storage method can influence the abundance of phyla, classes, and bacterial families studied in rumen samples and affect the richness and diversity index.
Signatures of Climatic Change In Human Mitochondrial Dna From Europe

NASA Astrophysics Data System (ADS)

Richards, M. B.; Macaulay, V. A.; Torroni, A.; Bandelt, H.-J.

Founder analysis is an approach to analysing non-recombining DNA sequence data, such as variation in the mitochondrial DNA (mtDNA), which aims at identifying and dating migrations into new territory. We applied the approach to about 4,000 human mtDNA sequences from Europe and the Near East, in order to estimate the proportion of modern lineages whose ancestors arrived at various times during the continent's past. We found that the major signal dates to about 15,000 years ago, at the time of rewarming following the Last Glacial Maximum (LGM). There is little or no archaeological evidence for immigration into Europe at this time, and the record indicates that at least parts of southern Europe remained populated during the LGM. Therefore, we interpret this signal as the trace of a bottleneck at the time of the LGM, as a result of the retreat from northern Europe during the peak of the glaciation, followed by a re-expansion from one or more refugial zones. Immigration episodes then figure at the beginning of the Early Upper Palaeolithic, during the Middle Upper Palaeolithic, and with the Neolithic. The impact of the latter on the composition of the European mtDNA pool was evidently rather minor. This result implies that climate is likely to have been a major force shaping human demographic history in Europe.
Deep Investigation of Arabidopsis thaliana Junk DNA Reveals a Continuum between Repetitive Elements and Genomic Dark Matter

PubMed Central

Maumus, Florian; Quesneville, Hadi

2014-01-01

Eukaryotic genomes contain highly variable amounts of DNA with no apparent function. This so-called junk DNA is composed of two components: repeated and repeat-derived sequences (together referred to as the repeatome), and non-annotated sequences also known as genomic dark matter. Because of their high duplication rates as compared to other genomic features, transposable elements are predominant contributors to the repeatome and the products of their decay is thought to be a major source of genomic dark matter. Determining the origin and composition of junk DNA is thus important to help understanding genome evolution as well as host biology. In this study, we have used a combination of tools enabling to show that the repeatome from the small and reducing A. thaliana genome is significantly larger than previously thought. Furthermore, we present the concepts and results from a series of innovative approaches suggesting that a significant amount of the A. thaliana dark matter is of repetitive origin. As a tentative standard for the community, we propose a deep compendium annotation of the A. thaliana repeatome that may help addressing farther genome evolution as well as transcriptional and epigenetic regulation in this model plant. PMID:24709859
Genetic origin and composition of a natural hybrid poplar Populus × jrtyschensis from two distantly related species.

PubMed

Jiang, Dechun; Feng, Jianju; Dong, Miao; Wu, Guili; Mao, Kangshan; Liu, Jianquan

2016-04-18

The factors that contribute to and maintain hybrid zones between distinct species are highly variable, depending on hybrid origins, frequencies and fitness. In this study, we aimed to examine genetic origins, compositions and possible maintenance of Populus × jrtyschensis, an assumed natural hybrid between two distantly related species. This hybrid poplar occurs mainly on the floodplains along the river valleys between the overlapping distributions of the two putative parents. We collected 566 individuals from 45 typical populations of P. × jrtyschensis, P. nigra and P. laurifolia. We genotyped them based on the sequence variations of one maternally inherited chloroplast DNA (cpDNA) fragment and genetic polymorphisms at 20 SSR loci. We further sequenced eight nuclear genes for 168 individuals from 31 populations. Two groups of cpDNA haplotypes characteristic of P. nigra and P. laurifolia respectively were both recovered for P. × jrtyschensis. Genetic structures and coalescent tests of two sets of nuclear population genetic data suggested that P. × jrtyschensis originated from hybridizations between the two assumed parental species. All examined populations of P. × jrtyschensis comprise mainly F1 hybrids from interspecific hybridizations between P. nigra and P. laurifolia. In the habitats of P. × jrtyschensis, there are lower concentrations of soil nitrogen than in the habitats occupied by the other two species. Our extensive examination of the genetic composition of P. × jrtyschensis suggested that it is typical of F1-dominated hybrid zones. This finding plus the low concentration of soil nitrogen in the floodplain soils support the F1-dominated bounded hybrid superiority hypothesis of hybrid zone maintenance for this particular hybrid poplar.
Ancient DNA analysis identifies marine mollusc shells as new metagenomic archives of the past.

PubMed

Der Sarkissian, Clio; Pichereau, Vianney; Dupont, Catherine; Ilsøe, Peter C; Perrigault, Mickael; Butler, Paul; Chauvaud, Laurent; Eiríksson, Jón; Scourse, James; Paillard, Christine; Orlando, Ludovic

2017-09-01

Marine mollusc shells enclose a wealth of information on coastal organisms and their environment. Their life history traits as well as (palaeo-) environmental conditions, including temperature, food availability, salinity and pollution, can be traced through the analysis of their shell (micro-) structure and biogeochemical composition. Adding to this list, the DNA entrapped in shell carbonate biominerals potentially offers a novel and complementary proxy both for reconstructing palaeoenvironments and tracking mollusc evolutionary trajectories. Here, we assess this potential by applying DNA extraction, high-throughput shotgun DNA sequencing and metagenomic analyses to marine mollusc shells spanning the last ~7,000 years. We report successful DNA extraction from shells, including a variety of ancient specimens, and find that DNA recovery is highly dependent on their biomineral structure, carbonate layer preservation and disease state. We demonstrate positive taxonomic identification of mollusc species using a combination of mitochondrial DNA genomes, barcodes, genome-scale data and metagenomic approaches. We also find shell biominerals to contain a diversity of microbial DNA from the marine environment. Finally, we reconstruct genomic sequences of organisms closely related to the Vibrio tapetis bacteria from Manila clam shells previously diagnosed with Brown Ring Disease. Our results reveal marine mollusc shells as novel genetic archives of the past, which opens new perspectives in ancient DNA research, with the potential to reconstruct the evolutionary history of molluscs, microbial communities and pathogens in the face of environmental changes. Other future applications include conservation of endangered mollusc species and aquaculture management. © 2017 John Wiley & Sons Ltd.
Metagenomic analysis of fungal diversity on strawberry plants and the effect of management practices on the fungal community structure of aerial organs

USDA-ARS?s Scientific Manuscript database

Metabarcoding, defined as Next Generation Sequencing (NGS) of amplicons of the ITS2 region (DNA barcode), was used to identify the composition of the fungal community on different strawberry organs i.e. leaves, flowers, and immature and mature fruits grown on a farm using disease and insect control ...
Diversity and Distribution Characteristics of Viruses in Soils of a Marine-Terrestrial Ecotone in East China.

PubMed

Yu, Dan-Ting; Han, Li-Li; Zhang, Li-Mei; He, Ji-Zheng

2018-02-01

A substantial gap remains in our understanding of the abundance, diversity, and ecology of viruses in soil although some advances have been achieved in recent years. In this study, four soil samples according to the salinity gradient from shore to inland in East China have been characterized. Results showed that spherical virus particles represented the largest viral component in all of the four samples. The viromes had remarkably different taxonomic compositions, and most of the sequences were derived from single-stranded DNA viruses, especially from families Microviridae and Circoviridae. Compared with viromes from other aquatic and sediment samples, the community compositions of our four soil viromes resembled each other, meanwhile coastal sample virome closely congregated with sediment and hypersaline viromes, and high salinity paddy soil sample virome was similar with surface sediment virome. Phylogenetic analysis of functional genes showed that four viromes have high diversity of the subfamily Gokushovirinae in family Microviridae and most of Circoviridae replicase protein sequences grouped within the CRESS-DNA viruses. This work provided an initial outline of the viral communities in marine-terrestrial ecotone and will improve our understanding of the ecological functions of soil viruses.
A High-Throughput Process for the Solid-Phase Purification of Synthetic DNA Sequences

PubMed Central

Grajkowski, Andrzej; Cieślak, Jacek; Beaucage, Serge L.

2017-01-01

An efficient process for the purification of synthetic phosphorothioate and native DNA sequences is presented. The process is based on the use of an aminopropylated silica gel support functionalized with aminooxyalkyl functions to enable capture of DNA sequences through an oximation reaction with the keto function of a linker conjugated to the 5′-terminus of DNA sequences. Deoxyribonucleoside phosphoramidites carrying this linker, as a 5′-hydroxyl protecting group, have been synthesized for incorporation into DNA sequences during the last coupling step of a standard solid-phase synthesis protocol executed on a controlled pore glass (CPG) support. Solid-phase capture of the nucleobase- and phosphate-deprotected DNA sequences released from the CPG support is demonstrated to proceed near quantitatively. Shorter than full-length DNA sequences are first washed away from the capture support; the solid-phase purified DNA sequences are then released from this support upon reaction with tetra-n-butylammonium fluoride in dry dimethylsulfoxide (DMSO) and precipitated in tetrahydrofuran (THF). The purity of solid-phase-purified DNA sequences exceeds 98%. The simulated high-throughput and scalability features of the solid-phase purification process are demonstrated without sacrificing purity of the DNA sequences. PMID:28628204
Complete sequences of the highly rearranged molluscan mitochondrial genomes of the scaphopod graptacme eborea and the bivalve mytilus edulis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Boore, Jeffrey L.; Medina, Monica; Rosenberg, Lewis A.

2004-01-31

We have determined the complete sequence of the mitochondrial genome of the scaphopod mollusk Graptacme eborea (Conrad, 1846) (14,492 nts) and completed the sequence of the mitochondrial genome of the bivalve mollusk Mytilus edulis Linnaeus, 1758 (16,740 nts). (The name Graptacme eborea is a revision of the species formerly known as Dentalium eboreum.) G. eborea mtDNA contains the 37 genes that are typically found and has the genes divided about evenly between the two strands, but M. edulis contains an extra trnM and is missing atp8, and has all genes on the same strand. Each has a highly rearranged genemore » order relative to each other and to all other studied mtDNAs. G. eborea mtDNA has almost no strand skew, but the coding strand of M. edulis mtDNA is very rich in G and T. This is reflected in differential codon usage patterns and even in amino acid compositions. G. eborea mtDNA has fewer non-coding nucleotides than any other mtDNA studied to date, with the largest non-coding region being only 24 nt long. Phylogenetic analysis using 2,420 aligned amino acid positions of concatenated proteins weakly supports an association of the scaphopod with gastropods to the exclusion of Bivalvia, Cephalopoda, and Polyplacophora, but is generally unable to convincingly resolve the relationships among major groups of the Lophotrochozoa, in contrast to the good resolution seen for several other major metazoan groups.« less
Optimization of a one-step heat-inducible in vivo mini DNA vector production system.

PubMed

Nafissi, Nafiseh; Sum, Chi Hong; Wettig, Shawn; Slavcev, Roderick A

2014-01-01

While safer than their viral counterparts, conventional circular covalently closed (CCC) plasmid DNA vectors offer a limited safety profile. They often result in the transfer of unwanted prokaryotic sequences, antibiotic resistance genes, and bacterial origins of replication that may lead to unwanted immunostimulatory responses. Furthermore, such vectors may impart the potential for chromosomal integration, thus potentiating oncogenesis. Linear covalently closed (LCC), bacterial sequence free DNA vectors have shown promising clinical improvements in vitro and in vivo. However, the generation of such minivectors has been limited by in vitro enzymatic reactions hindering their downstream application in clinical trials. We previously characterized an in vivo temperature-inducible expression system, governed by the phage λ pL promoter and regulated by the thermolabile λ CI[Ts]857 repressor to produce recombinant protelomerase enzymes in E. coli. In this expression system, induction of recombinant protelomerase was achieved by increasing culture temperature above the 37°C threshold temperature. Overexpression of protelomerase led to enzymatic reactions, acting on genetically engineered multi-target sites called "Super Sequences" that serve to convert conventional CCC plasmid DNA into LCC DNA minivectors. Temperature up-shift, however, can result in intracellular stress responses and may alter plasmid replication rates; both of which may be detrimental to LCC minivector production. We sought to optimize our one-step in vivo DNA minivector production system under various induction schedules in combination with genetic modifications influencing plasmid replication, processing rates, and cellular heat stress responses. We assessed different culture growth techniques, growth media compositions, heat induction scheduling and temperature, induction duration, post-induction temperature, and E. coli genetic background to improve the productivity and scalability of our system, achieving an overall LCC DNA minivector production efficiency of ∼ 90%.We optimized a robust technology conferring rapid, scalable, one-step in vivo production of LCC DNA minivectors with potential application to gene transfer-mediated therapeutics.
The Making of the African mtDNA Landscape

PubMed Central

Salas, Antonio; Richards, Martin; De la Fe, Tomás; Lareu, María-Victoria; Sobrino, Beatriz; Sánchez-Diz, Paula; Macaulay, Vincent; Carracedo, Ángel

2002-01-01

Africa presents the most complex genetic picture of any continent, with a time depth for mitochondrial DNA (mtDNA) lineages >100,000 years. The most recent widespread demographic shift within the continent was most probably the Bantu dispersals, which archaeological and linguistic evidence suggest originated in West Africa 3,000–4,000 years ago, spreading both east and south. Here, we have carried out a thorough phylogeographic analysis of mtDNA variation in a total of 2,847 samples from throughout the continent, including 307 new sequences from southeast African Bantu speakers. The results suggest that the southeast Bantu speakers have a composite origin on the maternal line of descent, with ∼44% of lineages deriving from West Africa, ∼21% from either West or Central Africa, ∼30% from East Africa, and ∼5% from southern African Khoisan-speaking groups. The ages of the major founder types of both West and East African origin are consistent with the likely timing of Bantu dispersals, with those from the west somewhat predating those from the east. Despite this composite picture, the southeastern African Bantu groups are indistinguishable from each other with respect to their mtDNA, suggesting that they either had a common origin at the point of entry into southeastern Africa or have undergone very extensive gene flow since. PMID:12395296
Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.

PubMed

Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene

2017-02-01

Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

PubMed

Yin, Changchuan

2015-04-01

To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
Single-cell genomic sequencing using Multiple Displacement Amplification.

PubMed

Lasken, Roger S

2007-10-01

Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
The complete mitochondrial genome of the big-belly seahorse, Hippocampus abdominalis (Lesson 1827).

PubMed

Wang, Lei; Chen, Zaizhong; Leng, Xiangjun; Gao, Jianzhong; Chen, Xiaowu; Li, Zhongpu; Sun, Peiying; Zhao, Yuming

2016-11-01

In this study, the complete mitogenome sequence of the big-belly seahorse, Hippocampus abdominalis (Lesson, 1827) (Syngnathiformes: Syngnathidae), has been sequenced by the next-generation sequencing method. The assembled mitogenome is 16 521 bp in length which includes 13 protein-coding genes, 22 transfer RNAs, and 2 ribosomal RNAs genes. The overall base composition of the seahorse is 31.1% for A, 23.6% for C, 16.0% for G, 29.3% for T and shows 87% identities similar to tiger tail seahorse, Hippocampus comes. The complete mitogenome of the big-belly seahorse provides essential and important DNA molecular data for further phylogeography and evolutionary analysis for seahorse family.

Overproduction and nucleotide sequence of the respiratory D-lactate dehydrogenase of Escherichia coli.

PubMed Central

Rule, G S; Pratt, E A; Chin, C C; Wold, F; Ho, C

1985-01-01

Recombinant DNA plasmids containing the gene for the membrane-bound D-lactate dehydrogenase (D-LDH) of Escherichia coli linked to the promoter PL from lambda were constructed. After induction, the levels of D-LDH were elevated 300-fold over that of the wild type and amounted to 35% of the total cellular protein. The nucleotide sequence of the D-LDH gene was determined and shown to agree with the amino acid composition and the amino-terminal sequence of the purified enzyme. Removal of the amino-terminal formyl-Met from D-LDH was not inhibited in cells which contained these high levels of D-LDH. Images PMID:3882663
Transposition-mediated DNA re-replication in maize

PubMed Central

Zhang, Jianbo; Zuo, Tao; Wang, Dafang; Peterson, Thomas

2014-01-01

Every DNA segment in a eukaryotic genome normally replicates once and only once per cell cycle to maintain genome stability. We show here that this restriction can be bypassed through alternative transposition, a transposition reaction that utilizes the termini of two separate, nearby transposable elements (TEs). Our results suggest that alternative transposition during S phase can induce re-replication of the TEs and their flanking sequences. The DNA re-replication can spontaneously abort to generate double-strand breaks, which can be repaired to generate Composite Insertions composed of transposon termini flanking segmental duplications of various lengths. These results show how alternative transposition coupled with DNA replication and repair can significantly alter genome structure and may have contributed to rapid genome evolution in maize and possibly other eukaryotes. DOI: http://dx.doi.org/10.7554/eLife.03724.001 PMID:25406063
Mitochondrial DNA control region analysis of three ethnic groups in the Republic of Macedonia

PubMed Central

Jankova-Ajanovska, Renata; Zimmermann, Bettina; Huber, Gabriela; Röck, Alexander W.; Bodner, Martin; Jakovski, Zlatko; Janeska, Biljana; Duma, Aleksej; Parson, Walther

2014-01-01

A total of 444 individuals representing three ethnic groups (Albanians, Turks and Romanies) in the Republic of Macedonia were sequenced in the mitochondrial control region. The mtDNA haplogroup composition differed between the three groups. Our results showed relatively high frequencies of haplogroup H12 in Albanians (8.8%) and less in Turks (3.3%), while haplogroups M5a1 and H7a1a were dominant in Romanies (13.7% and 10.3%, respectively) but rare in the former two. This highlights the importance of regional sampling for forensic mtDNA databasing purposes. These population data will be available on EMPOP under accession numbers EMP00644 (Albanians), EMP00645 (Romanies) and EMP00646 (Turks). PMID:25051224
Paleomicrobiology: a Snapshot of Ancient Microbes and Approaches to Forensic Microbiology

PubMed Central

RIVERA-PEREZ, JESSICA I.; SANTIAGO-RODRIGUEZ, TASHA M.; TORANZOS, GARY A.

2017-01-01

Paleomicrobiology, or the study of ancient microorganisms, has raised both fascination and skepticism for many years. While paleomicrobiology is not a recent field, the application of emerging techniques, such as DNA sequencing, is proving essential and has provided novel information regarding the evolution of viruses, antibiotic resistance, saprophytes, and pathogens, as well as ancient health and disease status, cultural customs, ethnic diets, and historical events. In this review, we highlight the importance of studying ancient microbial DNA, its contributions to current knowledge, and the role that forensic paleomicrobiology has played in deciphering historical enigmas. We also discuss the emerging techniques used to study the microbial composition of ancient samples as well as major concerns that accompany ancient DNA analyses. PMID:27726770
Comparison of pectin-degrading fungal communities in temperate forests using glycosyl hydrolase family 28 pectinase primers targeting Ascomycete fungi.

PubMed

Gacura, Matthew D; Sprockett, Daniel D; Heidenreich, Bess; Blackwood, Christopher B

2016-04-01

Fungi have developed a wide assortment of enzymes to break down pectin, a prevalent polymer in plant cell walls that is important in plant defense and structure. One enzyme family used to degrade pectin is the glycosyl hydrolase family 28 (GH28). In this study we developed primers for the amplification of GH28 coding genes from a database of 293 GH28 sequences from 40 fungal genomes. The primers were used to successfully amplify GH28 pectinases from all Ascomycota cultures tested, but only three out of seven Basidiomycota cultures. In addition, we further tested the primers in PCRs on metagenomic DNA extracted from senesced tree leaves from different forest ecosystems, followed by cloning and sequencing. Taxonomic specificity for Ascomycota GH28 genes was tested by comparing GH28 composition in leaves to internal transcribed spacer (ITS) amplicon composition using pyrosequencing. All sequences obtained from GH28 primers were classified as Ascomycota; in contrast, ITS sequences indicated that fungal communities were up to 39% Basidiomycetes. Analysis of leaf samples indicated that both forest stand and ecosystem type were important in structuring fungal communities. However, site played the prominent role in explaining GH28 composition, whereas ecosystem type was more important for ITS composition, indicating possible genetic drift between populations of fungi. Overall, these primers will have utility in understanding relationships between fungal community composition and ecosystem processes, as well as detection of potentially pathogenic Ascomycetes. Copyright © 2016 Elsevier B.V. All rights reserved.
Natural mummification of the human gut preserves bacteriophage DNA.

PubMed

Santiago-Rodriguez, Tasha M; Fornaciari, Gino; Luciani, Stefania; Dowd, Scot E; Toranzos, Gary A; Marota, Isolina; Cano, Raul J

2016-01-01

The natural mummification process of the human gut represents a unique opportunity to study the resulting microbial community structure and composition. While results are providing insights into the preservation of bacteria, fungi, pathogenic eukaryotes and eukaryotic viruses, no studies have demonstrated that the process of natural mummification also results in the preservation of bacteriophage DNA. We characterized the gut microbiome of three pre-Columbian Andean mummies, namely FI3, FI9 and FI12, and found sequences homologous to viruses. From the sequences attributable to viruses, 50.4% (mummy FI3), 1.0% (mummy FI9) and 84.4% (mummy FI12) were homologous to bacteriophages. Sequences corresponding to the Siphoviridae, Myoviridae, Podoviridae and Microviridae families were identified. Predicted putative bacterial hosts corresponded mainly to the Firmicutes and Proteobacteria, and included Bacillus, Staphylococcus, Clostridium, Escherichia, Vibrio, Klebsiella, Pseudomonas and Yersinia. Predicted functional categories associated with bacteriophages showed a representation of structural, replication, integration and entry and lysis genes. The present study suggests that the natural mummification of the human gut results in the preservation of bacteriophage DNA, representing an opportunity to elucidate the ancient phageome and to hypothesize possible mechanisms of preservation. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Micromonospora halotolerans sp. nov., isolated from the rhizosphere of a Pisum sativum plant.

PubMed

Carro, Lorena; Pukall, Rüdiger; Spröer, Cathrin; Kroppenstedt, Reiner M; Trujillo, Martha E

2013-06-01

A filamentous actinomycete strain designated CR18(T) was isolated on humic acid agar from the rhizosphere of a Pisum sativum plant collected in Spain. This isolate was observed to grow optimally at 28 °C, pH 7.0 and in the presence of 5 % NaCl. Phylogenetic analyses based on the 16S rRNA gene sequence indicated a close relationship with the type strains of Micromonospora chersina and Micromonospora endolithica. A further analysis based on a concatenated DNA sequence stretch of 4,523 bp that included partial sequences of the atpD, gyrB, recA, rpoB and 16S rRNA genes clearly differentiated the new strain from recognized Micromonospora species compared. DNA-DNA hybridization studies further supported the taxonomic position of strain CR18(T) as a novel genomic species. Chemotaxonomic analyses which included whole cell sugars, polar lipids, fatty acid profiles and menaquinone composition confirmed the affiliation of the new strain to the genus Micromonospora and also highlighted differences at the species level. These studies were finally complemented with an array of physiological tests to help differentiate between the new strain and its phylogenetic neighbours. Consequently, strain CR18(T) (= CECT 7890(T) = DSM 45598(T)) is proposed as the type strain of a novel species, Micromonospora halotolerans sp. nov.
Genetic distances and phylogenetic trees of different Awassi sheep populations based on DNA sequencing.

PubMed

Al-Atiyat, R M; Aljumaah, R S

2014-08-27

This study aimed to estimate evolutionary distances and to reconstruct phylogeny trees between different Awassi sheep populations. Thirty-two sheep individuals from three different geographical areas of Jordan and the Kingdom of Saudi Arabia (KSA) were randomly sampled. DNA was extracted from the tissue samples and sequenced using the T7 promoter universal primer. Different phylogenetic trees were reconstructed from 0.64-kb DNA sequences using the MEGA software with the best general time reverse distance model. Three methods of distance estimation were then used. The maximum composite likelihood test was considered for reconstructing maximum likelihood, neighbor-joining and UPGMA trees. The maximum likelihood tree indicated three major clusters separated by cytosine (C) and thymine (T). The greatest distance was shown between the South sheep and North sheep. On the other hand, the KSA sheep as an outgroup showed shorter evolutionary distance to the North sheep population than to the others. The neighbor-joining and UPGMA trees showed quite reliable clusters of evolutionary differentiation of Jordan sheep populations from the Saudi population. The overall results support geographical information and ecological types of the sheep populations studied. Summing up, the resulting phylogeny trees may contribute to the limited information about the genetic relatedness and phylogeny of Awassi sheep in nearby Arab countries.
Assessing the impact of fungicide enostroburin application on bacterial community in wheat phyllosphere.

PubMed

Gu, Likun; Bai, Zhihui; Jin, Bo; Hu, Qing; Wang, Huili; Zhuang, Guoqiang; Zhang, Hongxun

2010-01-01

Fungicides have been used extensively for controlling fungal pathogens of plants. However, little is known regarding the effects that fungicides upon the indigenous bacterial communities within the plant phyllosphere. The aims of this study were to assess the impact of fungicide enostroburin upon bacterial communities in wheat phyllosphere. Culture-independent methodologies of 16S rDNA clone library and 16S rDNA directed polymerase chain reaction with denaturing gradient gel electrophoresis (PCR-DGGE) were used for monitoring the change of bacterial community. The 16S rDNA clone library and PCR-DGGE analysis both confirmed the microbial community of wheat plant phyllosphere were predominantly of the gamma-Proteobacteria phyla. Results from PCR-DGGE analysis indicated a significant change in bacterial community structure within the phyllosphere following fungicide enostroburin application. Bands sequenced within control cultures were predominantly of Pseudomonas genus, but those bands sequenced in the treated samples were predominantly strains of Pantoea genus and Pseudomonas genus. Of interest was the appearance of two DGGE bands following fungicide treatment, one of which had sequence similarities (98%) to Pantoea sp. which might be a competitor of plant pathogens. This study revealed the wheat phyllosphere bacterial community composition and a shift in the bacterial community following fungicide enostroburin application.
Metabarcoding analysis of eukaryotic microbiota in the gut of HIV-infected patients.

PubMed

Hamad, Ibrahim; Abou Abdallah, Rita; Ravaux, Isabelle; Mokhtari, Saadia; Tissot-Dupont, Hervé; Michelle, Caroline; Stein, Andreas; Lagier, Jean-Christophe; Raoult, Didier; Bittar, Fadi

2018-01-01

Research on the relationship between changes in the gut microbiota and human disease, including AIDS, is a growing field. However, studies on the eukaryotic component of the intestinal microbiota have just begun and have not yet been conducted in HIV-infected patients. Moreover, eukaryotic community profiling is influenced by the use of different methodologies at each step of culture-independent techniques. Herein, initially, four DNA extraction protocols were compared to test the efficiency of each method in recovering eukaryotic DNA from fecal samples. Our results revealed that recovering eukaryotic components from fecal samples differs significantly among DNA extraction methods. Subsequently, the composition of the intestinal eukaryotic microbiota was evaluated in HIV-infected patients and healthy volunteers through clone sequencing, high-throughput sequencing of nuclear ribosomal internal transcribed spacers 1 (ITS1) and 2 (ITS2) amplicons and real-time PCRs. Our results revealed that not only richness (Chao-1 index) and alpha diversity (Shannon diversity) differ between HIV-infected patients and healthy volunteers, depending on the molecular strategy used, but also the global eukaryotic community composition, with little overlapping taxa found between techniques. Moreover, our results based on cloning libraries and ITS1/ITS2 metabarcoding sequencing showed significant differences in fungal composition between HIV-infected patients and healthy volunteers, but without distinct clusters separating the two groups. Malassezia restricta was significantly more prevalent in fecal samples of HIV-infected patients, according to cloning libraries, whereas operational taxonomic units (OTUs) belonging to Candida albicans and Candida tropicalis were significantly more abundant in fecal samples of HIV-infected patients compared to healthy subjects in both ITS subregions. Finally, real-time PCR showed the presence of Microsporidia, Giardia lamblia, Blastocystis and Hymenolepis diminuta in different proportions in fecal samples from HIV patients as compared to healthy individuals. Our work revealed that the use of different sequencing approaches can impact the perceived eukaryotic diversity results of the human gut. We also provide a more comprehensive view of the eukaryotic community in the gut of HIV-infected patients through the complementarity of the different molecular techniques used. Combining these various methodologies may provide a gold standard for a more complete characterization of the eukaryotic microbiome in future studies.
Acquisition of New DNA Sequences After Infection of Chicken Cells with Avian Myeloblastosis Virus

PubMed Central

Shoyab, M.; Baluda, M. A.; Evans, R.

1974-01-01

DNA-RNA hybridization studies between 70S RNA from avian myeloblastosis virus (AMV) and an excess of DNA from (i) AMV-induced leukemic chicken myeloblasts or (ii) a mixture of normal and of congenitally infected K-137 chicken embryos producing avian leukosis viruses revealed the presence of fast- and slow-hybridizing virus-specific DNA sequences. However, the leukemic cells contained twice the level of AMV-specific DNA sequences observed in normal chicken embryonic cells. The fast-reacting sequences were two to three times more numerous in leukemic DNA than in DNA from the mixed embryos. The slow-reacting sequences had a reiteration frequency of approximately 9 and 6, in the two respective systems. Both the fast- and the slow-reacting DNA sequences in leukemic cells exhibited a higher Tm (2 C) than the respective DNA sequences in normal cells. In normal and leukemic cells the slow hybrid sequences appeared to have a Tm which was 2 C higher than that of the fast hybrid sequences. Individual non-virus-producing chicken embryos, either group-specific antigen positive or negative, contained 40 to 100 copies of the fast sequences and 2 to 6 copies of the slowly hybridizing sequences per cell genome. Normal rat cells did not contain DNA that hybridized with AMV RNA, whereas non-virus-producing rat cells transformed by B-77 avian sarcoma virus contained only the slowly reacting sequences. The results demonstrate that leukemic cells transformed by AMV contain new AMV-specific DNA sequences which were not present before infection. PMID:16789139
Ultrasensitive sensing platform for platelet-derived growth factor BB detection based on layered molybdenum selenide-graphene composites and Exonuclease III assisted signal amplification.

PubMed

Huang, Ke-Jing; Shuai, Hong-Lei; Zhang, Ji-Zong

2016-03-15

A highly sensitive and ultrasensitive electrochemical aptasensor for platelet-derived growth factor BB (PDGF-BB) detection is fabricated based on layered molybdenum selenide-graphene (MoSe2-Gr) composites and Exonuclease III (Exo III)-aided signal amplification. MoSe2-Gr is prepared by a simple hydrothermal method and used as a promising sensing platform. Exo III has a specifical exo-deoxyribonuclease activity for duplex DNAs in the direction from 3' to 5' terminus, however its activity is limited on the duplex DNAs with more than 4 mismatched terminal bases at 3' ends. Herein, aptamer and complementary DNA (cDNA) sequences are designed with four thymine bases on 3' ends. In the presence of target protein, the aptamer associates with it and facilitates the formation of duplex DNA between cDNA and signal DNA. The duplex DNA then is digested by Exo III and releases cDNA, which hybridizes with signal DNA to perform a new cleavage process. Nevertheless, in the absence of target protein, the aptamer hybridizes with cDNA will inhibit the Exo III-assisted nucleotides cleavage. The signal DNA then hybridizes with capture DNA on the electrode. Subsequently, horse radish peroxidase is fixed on electrode by avidin-biotin reaction and then catalyzes hydrogen peroxide and hydroquinone to produce electrochemical response. Therefore, a bridge can be established between the concentration of target protein and the degree of the attenuation of the obtained signal, providing a quantitative measure of target protein with a broad detection range of 0.0001-1 nM and a detection limit of 20 fM. Copyright © 2015 Elsevier B.V. All rights reserved.
Homogeneity of the 16S rDNA sequence among geographically disparate isolates of Taylorella equigenitalis

PubMed Central

Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC

2006-01-01

Background At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, whose sequence differences occur at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. When cloning, sequencing and comparison of the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France, nucleotide sequence differences were demonstrated at the six loci in the 1,469 nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935
Using Environmental DNA to Census Marine Fishes in a Large Mesocosm

PubMed Central

Kelly, Ryan P.; Port, Jesse A.; Yamahara, Kevan M.; Crowder, Larry B.

2014-01-01

The ocean is a soup of its resident species' genetic material, cast off in the forms of metabolic waste, shed skin cells, or damaged tissue. Sampling this environmental DNA (eDNA) is a potentially powerful means of assessing whole biological communities, a significant advance over the manual methods of environmental sampling that have historically dominated marine ecology and related fields. Here, we estimate the vertebrate fauna in a 4.5-million-liter mesocosm aquarium tank at the Monterey Bay Aquarium of known species composition by sequencing the eDNA from its constituent seawater. We find that it is generally possible to detect mitochondrial DNA of bony fishes sufficient to identify organisms to taxonomic family- or genus-level using a 106 bp fragment of the 12S ribosomal gene. Within bony fishes, we observe a low false-negative detection rate, although we did not detect the cartilaginous fishes or sea turtles present with this fragment. We find that the rank abundance of recovered eDNA sequences correlates with the abundance of corresponding species' biomass in the mesocosm, but the data in hand do not allow us to develop a quantitative relationship between biomass and eDNA abundance. Finally, we find a low false-positive rate for detection of exogenous eDNA, and we were able to diagnose non-native species' tissue in the food used to maintain the mesocosm, underscoring the sensitivity of eDNA as a technique for community-level ecological surveys. We conclude that eDNA has substantial potential to become a core tool for environmental monitoring, but that a variety of challenges remain before reliable quantitative assessments of ecological communities in the field become possible. PMID:24454960
Spatial variations of bacterial community and its relationship with water chemistry in Sanya Bay, South China Sea as determined by DGGE fingerprinting and multivariate analysis.

PubMed

Ling, Juan; Zhang, Yan-Ying; Dong, Jun-De; Wang, You-Shao; Feng, Jing-Bing; Zhou, Wei-Hua

2015-10-01

Bacteria play important roles in the structure and function of marine food webs by utilizing nutrients and degrading the pollutants, and their distribution are determined by surrounding water chemistry to a certain extent. It is vital to investigate the bacterial community's structure and identifying the significant factors by controlling the bacterial distribution in the paper. Flow cytometry showed that the total bacterial abundance ranged from 5.27 × 10(5) to 3.77 × 10(6) cells/mL. Molecular fingerprinting technique, denaturing gradient gel electrophoresis (DGGE) followed by DNA sequencing has been employed to investigate the bacterial community composition. The results were then interpreted through multivariate statistical analysis and tended to explain its relationship to the environmental factors. A total of 270 bands at 83 different positions were detected in DGGE profiles and 29 distinct DGGE bands were sequenced. The predominant bacteria were related to Phyla Protebacteria species (31 %, nine sequences), Cyanobacteria (37.9 %, eleven sequences) and Actinobacteria (17.2 %, five sequences). Other phylogenetic groups identified including Firmicutes (6.9 %, two sequences), Bacteroidetes (3.5 %, one sequences) and Verrucomicrobia (3.5 %, one sequences). Conical correspondence analysis was used to elucidate the relationships between the bacterial community compositions and environmental factors. The results showed that the spatial variations in the bacterial community composition was significantly related to phosphate (P = 0.002, P < 0.01), dissolved organic carbon (P = 0.004, P < 0.01), chemical oxygen demand (P = 0.010, P < 0.05) and nitrite (P = 0.016, P < 0.05). This study revealed the spatial variations of bacterial community and significant environmental factors driving the bacterial composition shift. These results may be valuable for further investigation on the functional microbial structure and expression quantitatively under the polluted environments in the world.
Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

DOEpatents

McCutchen-Maloney, Sandra L.

2002-01-01

DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.
Metagenomic Analysis of Milk of Healthy and Mastitis-Suffering Women.

PubMed

Jiménez, Esther; de Andrés, Javier; Manrique, Marina; Pareja-Tobes, Pablo; Tobes, Raquel; Martínez-Blanch, Juan F; Codoñer, Francisco M; Ramón, Daniel; Fernández, Leónides; Rodríguez, Juan M

2015-08-01

Some studies have been conducted to assess the composition of the bacterial communities inhabiting human milk, but they did not evaluate the presence of other microorganisms, such as fungi, archaea, protozoa, or viruses. This study aimed to compare the metagenome of human milk samples provided by healthy and mastitis-suffering women. DNA was isolated from human milk samples collected from 10 healthy women and 10 women with symptoms of lactational mastitis. Shotgun libraries from total extracted DNA were constructed and the libraries were sequenced by 454 pyrosequencing. The amount of human DNA sequences was ≥ 90% in all the samples. Among the bacterial sequences, the predominant phyla were Proteobacteria, Firmicutes, and Bacteroidetes. The healthy core microbiome included the genera Staphylococcus, Streptococcus, Bacteroides, Faecalibacterium, Ruminococcus, Lactobacillus, and Propionibacterium. At the species level, a high degree of inter-individual variability was observed among healthy women. In contrast, Staphylococcus aureus clearly dominated the microbiome in the samples from the women with acute mastitis whereas high increases in Staphylococcus epidermidis-related reads were observed in the milk of those suffering from subacute mastitis. Fungal and protozoa-related reads were identified in most of the samples, whereas Archaea reads were absent in samples from women with mastitis. Some viral-related sequence reads were also detected. Human milk contains a complex microbial metagenome constituted by the genomes of bacteria, archaea, viruses, fungi, and protozoa. In mastitis cases, the milk microbiome reflects a loss of bacterial diversity and a high increase of the sequences related to the presumptive etiological agents. © The Author(s) 2015.
Biodiversity hot spot on a hot spot: novel extremophile diversity in Hawaiian fumaroles.

PubMed

Wall, Kate; Cornell, Jennifer; Bizzoco, Richard W; Kelley, Scott T

2015-01-06

Fumaroles (steam vents) are the most common, yet least understood, microbial habitat in terrestrial geothermal settings. Long believed too extreme for life, recent advances in sample collection and DNA extraction methods have found that fumarole deposits and subsurface waters harbor a considerable diversity of viable microbes. In this study, we applied culture-independent molecular methods to explore fumarole deposit microbial assemblages in 15 different fumaroles in four geographic locations on the Big Island of Hawai'i. Just over half of the vents yielded sufficient high-quality DNA for the construction of 16S ribosomal RNA gene sequence clone libraries. The bacterial clone libraries contained sequences belonging to 11 recognized bacterial divisions and seven other division-level phylogenetic groups. Archaeal sequences were less numerous, but similarly diverse. The taxonomic composition among fumarole deposits was highly heterogeneous. Phylogenetic analysis found cloned fumarole sequences were related to microbes identified from a broad array of globally distributed ecotypes, including hot springs, terrestrial soils, and industrial waste sites. Our results suggest that fumarole deposits function as an "extremophile collector" and may be a hot spot of novel extremophile biodiversity. © 2015 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
Biodiversity hot spot on a hot spot: novel extremophile diversity in Hawaiian fumaroles

PubMed Central

Wall, Kate; Cornell, Jennifer; Bizzoco, Richard W; Kelley, Scott T

2015-01-01

Fumaroles (steam vents) are the most common, yet least understood, microbial habitat in terrestrial geothermal settings. Long believed too extreme for life, recent advances in sample collection and DNA extraction methods have found that fumarole deposits and subsurface waters harbor a considerable diversity of viable microbes. In this study, we applied culture-independent molecular methods to explore fumarole deposit microbial assemblages in 15 different fumaroles in four geographic locations on the Big Island of Hawai'i. Just over half of the vents yielded sufficient high-quality DNA for the construction of 16S ribosomal RNA gene sequence clone libraries. The bacterial clone libraries contained sequences belonging to 11 recognized bacterial divisions and seven other division-level phylogenetic groups. Archaeal sequences were less numerous, but similarly diverse. The taxonomic composition among fumarole deposits was highly heterogeneous. Phylogenetic analysis found cloned fumarole sequences were related to microbes identified from a broad array of globally distributed ecotypes, including hot springs, terrestrial soils, and industrial waste sites. Our results suggest that fumarole deposits function as an “extremophile collector” and may be a hot spot of novel extremophile biodiversity. PMID:25565172
Assessing the performance of the Oxford Nanopore Technologies MinION

PubMed Central

Laver, T.; Harrison, J.; O’Neill, P.A.; Moore, K.; Farbos, A.; Paszkiewicz, K.; Studholme, D.J.

2015-01-01

The Oxford Nanopore Technologies (ONT) MinION is a new sequencing technology that potentially offers read lengths of tens of kilobases (kb) limited only by the length of DNA molecules presented to it. The device has a low capital cost, is by far the most portable DNA sequencer available, and can produce data in real-time. It has numerous prospective applications including improving genome sequence assemblies and resolution of repeat-rich regions. Before such a technology is widely adopted, it is important to assess its performance and limitations in respect of throughput and accuracy. In this study we assessed the performance of the MinION by re-sequencing three bacterial genomes, with very different nucleotide compositions ranging from 28.6% to 70.7%; the high G + C strain was underrepresented in the sequencing reads. We estimate the error rate of the MinION (after base calling) to be 38.2%. Mean and median read lengths were 2 kb and 1 kb respectively, while the longest single read was 98 kb. The whole length of a 5 kb rRNA operon was covered by a single read. As the first nanopore-based single molecule sequencer available to researchers, the MinION is an exciting prospect; however, the current error rate limits its ability to compete with existing sequencing technologies, though we do show that MinION sequence reads can enhance contiguity of de novo assembly when used in conjunction with Illumina MiSeq data. PMID:26753127

Complete mitochondrial genome of the monogonont rotifer, Brachionus koreanus (Rotifera, Brachionidae).

PubMed

Hwang, Dae-Sik; Suga, Koushirou; Sakakura, Yoshitaka; Park, Heum Gi; Hagiwara, Atsushi; Rhee, Jae-Sung; Lee, Jae-Seong

2014-02-01

The complete mitochondrial genome was obtained from the assembled genome data sequenced by next generation sequencing (NGS) technology from the monogonont rotifer Brachionus koreanus. The mitochondrial genome of B. koreanus was composed of two circular chromosomes designated as mtDNA-I (10,421 bp) and mtDNA-II (11,923 bp). The gene contents of B. koreanus were identical with previously reported B. plicatilis mitochondrial genomes. However, gene orders of B. koreanus showed one rearrangement between the two species. Of 12 protein-coding genes (PCGs), 3 genes (ATP6, ND1, and ND3) had an incomplete stop codon. The A + T base composition of B. koreanus mitochondrial genome was high (68.81%). They also showed anti-G bias (12.03% and 10.97%) on the second and third position of PCGs as well as slight anti-C bias (15.96% and 14.31%) on the first and third position of PCGs.
Diversity of Bacterial Communities in Container Habitats of Mosquitoes

PubMed Central

Ponnusamy, Loganathan; Xu, Ning; Stav, Gil; Wesson, Dawn M.; Schal, Coby

2010-01-01

We investigated the bacterial diversity of microbial communities in water-filled, human-made and natural container habitats of the mosquitoes Aedes aegypti and Aedes albopictus in suburban landscapes of New Orleans, Louisiana in 2003. We collected water samples from three classes of containers, including tires (n=12), cemetery urns (n=23), and miscellaneous containers that included two tree holes (n=19). Total genomic DNA was extracted from water samples, and 16S ribosomal DNA fragments (operational taxonomic units, OTUs) were amplified by PCR and separated by denaturing gradient gel electrophoresis (DGGE). The bacterial communities in containers represented diverse DGGE-DNA banding patterns that were not related to the class of container or to the local spatial distribution of containers. Mean richness and evenness of OTUs were highest in water samples from tires. Bacterial phylotypes were identified by comparative sequence analysis of 90 16S rDNA DGGE band amplicons. The majority of sequences were placed in five major taxa: Alpha-, Beta- and Gammaproteobacteria, Actinobacteria, Bacteroidetes, Cyanobacteria, Firmicutes, and an unclassified group; Proteobacteria and Bacteroidetes were the predominant heterotrophic bacteria in containers. The bacterial communities in human-made containers consisted mainly of undescribed species, and a phylogenetic analysis based on 16S rRNA sequences suggested that species composition was independent of both container type and the spatial distribution of containers. Comparative PCR-based, cultivation-independent rRNA surveys of microbial communities associated with mosquito habitats can provide significant insight into community organization and dynamics of bacterial species. PMID:18373113
A Multi-Omics Approach to Evaluate the Quality of Milk Whey Used in Ricotta Cheese Production

PubMed Central

Sattin, Eleonora; Andreani, Nadia A.; Carraro, Lisa; Lucchini, Rosaria; Fasolato, Luca; Telatin, Andrea; Balzan, Stefania; Novelli, Enrico; Simionati, Barbara; Cardazzo, Barbara

2016-01-01

In the past, milk whey was only a by-product of cheese production, but currently, it has a high commercial value for use in the food industries. However, the regulation of whey management (i.e., storage and hygienic properties) has not been updated, and as a consequence, its microbiological quality is very challenging for food safety. The Next Generation Sequencing (NGS) technique was applied to several whey samples used for Ricotta production to evaluate the microbial community composition in depth using both RNA and DNA as templates for NGS library construction. Whey samples demonstrating a high microbial and aerobic spore load contained mostly Firmicutes; although variable, some samples contained a relevant amount of Gammaproteobacteria. Several lots of whey acquired as raw material for Ricotta production presented defective organoleptic properties. To define the volatile compounds in normal and defective whey samples, a headspace gas chromatography/mass spectrometry (GC/MS) analysis was conducted. The statistical analysis demonstrated that different microbial communities resulted from DNA or cDNA library sequencing, and distinguishable microbiota composed the communities contained in the organoleptic-defective whey samples. PMID:27582735
Influence of long-term repeated prescribed burning on mycelial communities of ectomycorrhizal fungi.

PubMed

Bastias, Brigitte A; Xu, Zhihong; Cairney, John W G

2006-01-01

To demonstrate the efficacy of direct DNA extraction from hyphal ingrowth bags for community profiling of ectomycorrhizal (ECM) mycelia in soil, we applied the method to investigate the influence of long-term repeated prescribed burning on an ECM fungal community. DNA was extracted from hyphal ingrowth bags buried in forest plots that received different prescribed burning treatments for 30 yr, and denaturing gradient gel electrophoresis (DGGE) profiles of partial fungal rDNA internal transcribed spacer (ITS) regions were compared. Restriction fragment length polymorphism (RFLP) and sequence analyses were also used to compare clone assemblages between the treatments. The majority of sequences derived from the ingrowth bags were apparently those of ECM fungi. DGGE profiles for biennially burned plots were significantly different from those of quadrennially burned and unburned control plots. Analysis of clone assemblages indicated that this reflected altered ECM fungal community composition. The results indicate that hyphal ingrowth bags represent a useful method for investigation of ECM mycelial communities, and that frequent long-term prescribed burning can influence below-ground ECM fungal communities.
Methylation patterns of repetitive DNA sequences in germ cells of Mus musculus.

PubMed

Sanford, J; Forrester, L; Chapman, V; Chandley, A; Hastie, N

1984-03-26

The major and the minor satellite sequences of Mus musculus were undermethylated in both sperm and oocyte DNAs relative to the amount of undermethylation observed in adult somatic tissue DNA. This hypomethylation was specific for satellite sequences in sperm DNA. Dispersed repetitive and low copy sequences show a high degree of methylation in sperm DNA; however, a dispersed repetitive sequence was undermethylated in oocyte DNA. This finding suggests a difference in the amount of total genomic DNA methylation between sperm and oocyte DNA. The methylation levels of the minor satellite sequences did not change during spermiogenesis, and were not associated with the onset of meiosis or a specific stage in sperm development.
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
Next generation sequencing yields the complete mitochondrial genome of the Hornlip mullet Plicomugil labiosus (Teleostei: Mugilidae).

PubMed

Shen, Kang-Ning; Chen, Ching-Hung; Hsiao, Chung-Der

2016-05-01

In this study, the complete mitogenome sequence of hornlip mullet Plicomugil labiosus (Teleostei: Mugilidae) has been sequenced by next-generation sequencing method. The assembled mitogenome, consisting of 16,829 bp, had the typical vertebrate mitochondrial gene arrangement, including 13 protein coding genes, 22 transfer RNAs, 2 ribosomal RNAs genes and a non-coding control region of D-loop. D-loop contains 1057 bp length is located between tRNA-Pro and tRNA-Phe. The overall base composition of P. labiosus is 28.0% for A, 29.3% for C, 15.5% for G and 27.2% for T. The complete mitogenome may provide essential and important DNA molecular data for further population, phylogenetic and evolutionary analysis for Mugilidae.
Next generation sequencing yields the complete mitochondrial genome of the largescale mullet, Liza macrolepis (Teleostei: Mugilidae).

PubMed

Shen, Kang-Ning; Tsai, Shiou-Yi; Chen, Ching-Hung; Hsiao, Chung-Der; Durand, Jean-Dominique

2016-11-01

In this study, the complete mitogenome sequence of largescale mullet (Teleostei: Mugilidae) has been sequenced by the next-generation sequencing method. The assembled mitogenome, consisting of 16,832 bp, had the typical vertebrate mitochondrial gene arrangement, including 13 protein-coding genes, 22 transfer RNAs, two ribosomal RNAs genes, and a non-coding control region of D-loop. D-loop which has a length of 1094 bp is located between tRNA-Pro and tRNA-Phe. The overall base composition of largescale mullet is 27.8% for A, 30.1% for C, 16.2% for G, and 25.9% for T. The complete mitogenome may provide essential and important DNA molecular data for further phylogenetic and evolutionary analysis for Mugilidae.
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

PubMed

Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

2014-11-01

As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Comparing Ecological and Genetic Diversity Within the Marine Diatom Genus Pseudo-nitzschia: A Multiregional Synthesis

NASA Astrophysics Data System (ADS)

Hubbard, K.; Bruzek, S.

2016-02-01

The globally distributed marine diatom genus Pseudo-nitzschia consists of approximately 40 species, more than half of which occur in US coastal waters. Here, sensitive genetic tools targeting a variable portion of the internal transcribed spacer 1 (ITS1) region of the rRNA gene were used to assess Pseudo-nitzschia spp. diversity in more than 600 environmental DNA samples collected from US Atlantic, Pacific, and Gulf of Mexico waters. Community-based approaches employed genus-specific primers for environmental DNA fingerprinting and targeted sequencing. For the Gulf of Mexico samples especially, a nested PCR approach (with or without degenerate primers) improved resolution of species diversity. To date, more than 40 unique ITS1 amplicon sizes have been repeatedly observed in ITS1 fingerprints. Targeted sequencing of environmental DNA as well as single chains isolated from live samples indicate that many of these represent novel and known inter- and intra-specific Pseudo-nitzschia diversity. A few species (e.g., P. pungens, P. cuspidata) occur across all three regions, whereas other species and intraspecific variants occurred at local to regional spatial scales only. Generally, species frequently co-occur in complex assemblages, and transitions in Pseudo-nitzschia community composition occur seasonally, prior to bloom initiation, and across (cross-shelf, latitudinal, and vertical) environmental gradients. These observations highlight the dynamic nature of diatom community composition in the marine environment and the importance of classifying diversity at relevant ecological and/or taxonomic scales.
Fungal partner shifts during the evolution of mycoheterotrophy in Neottia.

PubMed

Yagame, Takahiro; Ogura-Tsujita, Yuki; Kinoshita, Akihiko; Iwase, Koji; Yukawa, Tomohisa

2016-09-01

Few previous studies have examined how mycobionts change during the evolution from autotrophy to mycoheterotrophy based on phylogenetic hypotheses. Neottia (Orchidaceae) comprises leafy species that are autotrophic and related leafless mycoheterotrophic species, and the phylogenetic relationships among them have been clarified. Accordingly, Neottia is a suitable taxon for investigating the question above. Here we clarified the diversity of mycobionts in Neottia plants and elucidated changes in the character of symbiotic associations during the evolution of mycoheterotrophy. We sequenced the internal transcribed spacer (ITS) regions of nuclear ribosomal (nr) DNA for mycobionts of Neottia plants. Furthermore, we selected one representative DNA sample from each fungal operational taxonomic unit (OTU) and used it to amplify the large subunit (LSU) nrDNA sequences. Phylogenetic analyses of Sebacinales (basidiomycetes), the dominant mycobiont of Neottia, were conducted and sample-based rarefaction curves generated for the observed mycobiont richness on each OTU. Leafy and leafless species in Neottia were associated with Sebacinales Group B and Sebacinales Group A, respectively. The composition and specificity level of fungal partners varied among Neottia species. Fungal partner composition and specificity level changed with speciation in both leafy and leafless Neottia species. In particular, mycorrhizal associations likely shifted from Sebacinales Group B to Group A during the evolution from autotrophy to mycoheterotrophy. Partner shifts to Sebacinales Group A have also been reported in the evolution of mycoheterotrophy of other plant groups, suggesting that convergence to this fungal group occurs in association with the evolution of mycoheterotrophy. © 2016 Botanical Society of America.
Long-term changes of bacterial and viral compositions in the intestine of a recovered Clostridium difficile patient after fecal microbiota transplantation.

PubMed

Broecker, Felix; Klumpp, Jochen; Schuppler, Markus; Russo, Giancarlo; Biedermann, Luc; Hombach, Michael; Rogler, Gerhard; Moelling, Karin

2016-01-01

Fecal microbiota transplantation (FMT) is an effective treatment for recurrent Clostridium difficile infections (RCDIs). However, long-term effects on the patients' gut microbiota and the role of viruses remain to be elucidated. Here, we characterized bacterial and viral microbiota in the feces of a cured RCDI patient at various time points until 4.5 yr post-FMT compared with the stool donor. Feces were subjected to DNA sequencing to characterize bacteria and double-stranded DNA (dsDNA) viruses including phages. The patient's microbial communities varied over time and showed little overall similarity to the donor until 7 mo post-FMT, indicating ongoing gut microbiota adaption in this time period. After 4.5 yr, the patient's bacteria attained donor-like compositions at phylum, class, and order levels with similar bacterial diversity. Differences in the bacterial communities between donor and patient after 4.5 yr were seen at lower taxonomic levels. C. difficile remained undetectable throughout the entire timespan. This demonstrated sustainable donor feces engraftment and verified long-term therapeutic success of FMT on the molecular level. Full engraftment apparently required longer than previously acknowledged, suggesting the implementation of year-long patient follow-up periods into clinical practice. The identified dsDNA viruses were mainly Caudovirales phages. Unexpectedly, sequences related to giant algae-infecting Chlorella viruses were also detected. Our findings indicate that intestinal viruses may be implicated in the establishment of gut microbiota. Therefore, virome analyses should be included in gut microbiota studies to determine the roles of phages and other viruses-such as Chlorella viruses-in human health and disease, particularly during RCDI.
Enlightenment of Yeast Mitochondrial Homoplasmy: Diversified Roles of Gene Conversion

PubMed Central

Ling, Feng; Mikawa, Tsutomu; Shibata, Takehiko

2011-01-01

Mitochondria have their own genomic DNA. Unlike the nuclear genome, each cell contains hundreds to thousands of copies of mitochondrial DNA (mtDNA). The copies of mtDNA tend to have heterogeneous sequences, due to the high frequency of mutagenesis, but are quickly homogenized within a cell (“homoplasmy”) during vegetative cell growth or through a few sexual generations. Heteroplasmy is strongly associated with mitochondrial diseases, diabetes and aging. Recent studies revealed that the yeast cell has the machinery to homogenize mtDNA, using a common DNA processing pathway with gene conversion; i.e., both genetic events are initiated by a double-stranded break, which is processed into 3′ single-stranded tails. One of the tails is base-paired with the complementary sequence of the recipient double-stranded DNA to form a D-loop (homologous pairing), in which repair DNA synthesis is initiated to restore the sequence lost by the breakage. Gene conversion generates sequence diversity, depending on the divergence between the donor and recipient sequences, especially when it occurs among a number of copies of a DNA sequence family with some sequence variations, such as in immunoglobulin diversification in chicken. MtDNA can be regarded as a sequence family, in which the members tend to be diversified by a high frequency of spontaneous mutagenesis. Thus, it would be interesting to determine why and how double-stranded breakage and D-loop formation induce sequence homogenization in mitochondria and sequence diversification in nuclear DNA. We will review the mechanisms and roles of mtDNA homoplasmy, in contrast to nuclear gene conversion, which diversifies gene and genome sequences, to provide clues toward understanding how the common DNA processing pathway results in such divergent outcomes. PMID:24710143
Compositional searching of CpG islands in the human genome

NASA Astrophysics Data System (ADS)

Luque-Escamilla, Pedro Luis; Martínez-Aroza, José; Oliver, José L.; Gómez-Lopera, Juan Francisco; Román-Roldán, Ramón

2005-06-01

We report on an entropic edge detector based on the local calculation of the Jensen-Shannon divergence with application to the search for CpG islands. CpG islands are pieces of the genome related to gene expression and cell differentiation, and thus to cancer formation. Searching for these CpG islands is a major task in genetics and bioinformatics. Some algorithms have been proposed in the literature, based on moving statistics in a sliding window, but its size may greatly influence the results. The local use of Jensen-Shannon divergence is a completely different strategy: the nucleotide composition inside the islands is different from that in their environment, so a statistical distance—the Jensen-Shannon divergence—between the composition of two adjacent windows may be used as a measure of their dissimilarity. Sliding this double window over the entire sequence allows us to segment it compositionally. The fusion of those segments into greater ones that satisfy certain identification criteria must be achieved in order to obtain the definitive results. We find that the local use of Jensen-Shannon divergence is very suitable in processing DNA sequences for searching for compositionally different structures such as CpG islands, as compared to other algorithms in literature.
"First generation" automated DNA sequencing technology.

PubMed

Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M

2011-10-01

Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
Comparison of American Fisheries Society (AFS) standard fish sampling techniques and environmental DNA for characterizing fish communities in a large reservoir

USGS Publications Warehouse

Perez, Christina R.; Bonar, Scott A.; Amberg, Jon J.; Ladell, Bridget; Rees, Christopher B.; Stewart, William T.; Gill, Curtis J.; Cantrell, Chris; Robinson, Anthony

2017-01-01

Recently, methods involving examination of environmental DNA (eDNA) have shown promise for characterizing fish species presence and distribution in waterbodies. We evaluated the use of eDNA for standard fish monitoring surveys in a large reservoir. Specifically, we compared the presence, relative abundance, biomass, and relative percent composition of Largemouth Bass Micropterus salmoides and Gizzard Shad Dorosoma cepedianum measured through eDNA methods and established American Fisheries Society standard sampling methods for Theodore Roosevelt Lake, Arizona. Catches at electrofishing and gillnetting sites were compared with eDNA water samples at sites, within spatial strata, and over the entire reservoir. Gizzard Shad were detected at a higher percentage of sites with eDNA methods than with boat electrofishing in both spring and fall. In contrast, spring and fall gillnetting detected Gizzard Shad at more sites than eDNA. Boat electrofishing and gillnetting detected Largemouth Bass at more sites than eDNA; the exception was fall gillnetting, for which the number of sites of Largemouth Bass detection was equal to that for eDNA. We observed no relationship between relative abundance and biomass of Largemouth Bass and Gizzard Shad measured by established methods and eDNA copies at individual sites or lake sections. Reservoirwide catch composition for Largemouth Bass and Gizzard Shad (numbers and total weight [g] of fish) as determined through a combination of gear types (boat electrofishing plus gillnetting) was similar to the proportion of total eDNA copies from each species in spring and fall field sampling. However, no similarity existed between proportions of fish caught via spring and fall boat electrofishing and the proportion of total eDNA copies from each species. Our study suggests that eDNA field sampling protocols, filtration, DNA extraction, primer design, and DNA sequencing methods need further refinement and testing before incorporation into standard fish sampling surveys.
Influence of DNA sequence on the structure of minicircles under torsional stress

PubMed Central

Wang, Qian; Irobalieva, Rossitza N.; Chiu, Wah; Schmid, Michael F.; Fogg, Jonathan M.; Zechiedrich, Lynn

2017-01-01

Abstract The sequence dependence of the conformational distribution of DNA under various levels of torsional stress is an important unsolved problem. Combining theory and coarse-grained simulations shows that the DNA sequence and a structural correlation due to topology constraints of a circle are the main factors that dictate the 3D structure of a 336 bp DNA minicircle under torsional stress. We found that DNA minicircle topoisomers can have multiple bend locations under high torsional stress and that the positions of these sharp bends are determined by the sequence, and by a positive mechanical correlation along the sequence. We showed that simulations and theory are able to provide sequence-specific information about individual DNA minicircles observed by cryo-electron tomography (cryo-ET). We provided a sequence-specific cryo-ET tomogram fitting of DNA minicircles, registering the sequence within the geometric features. Our results indicate that the conformational distribution of minicircles under torsional stress can be designed, which has important implications for using minicircle DNA for gene therapy. PMID:28609782
Analysis of DNA Sequences by an Optical Time-Integrating Correlator: Proof-of-Concept Experiments.

DTIC Science & Technology

1992-05-01

DNA ANALYSIS STRATEGY 4 2.1 Representation of DNA Bases 4 2.2 DNA Analysis Strategy 6 3.0 CUSTOM GENERATORS FOR DNA SEQUENCES 10 3.1 Hardware Design 10...of the DNA bases where each base is represented by a 7-bits long pseudorandom sequence. 5 Figure 4: Coarse analysis of a DNA sequence. 7 Figure 5: Fine...a 20-bases long database. 32 xiii LIST OF TABLES PAGE Table 1: Short representations of the DNA bases where each base is represented by 7-bits long
Deep sequencing is an appropriate tool for the selection of unique Hepatitis C virus (HCV) variants after single genomic amplification.

PubMed

Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc'h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine

2017-01-01

Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.
Deep sequencing is an appropriate tool for the selection of unique Hepatitis C virus (HCV) variants after single genomic amplification

PubMed Central

Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc’h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine

2017-01-01

Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus’s but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies. PMID:28362878

Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting

NASA Astrophysics Data System (ADS)

Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.

1997-05-01

Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possible be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic disease can be a very efficient step to reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.
Molecular evolution of ependymin and the phylogenetic resolution of early divergences among euteleost fishes.

PubMed

Ortí, G; Meyer, A

1996-04-01

The rate and pattern of DNA evolution of ependymin, a single-copy gene coding for a highly expressed glycoprotein in the brain matrix of teleost fishes, is characterized and its phylogenetic utility for fish systematics is assessed. DNA sequences were determined from catfish, electric fish, and characiforms and compared with published ependymin sequences from cyprinids, salmon, pike, and herring. Among these groups, ependymin amino acid sequences were highly divergent (up to 60% sequence difference), but had surprisingly similar hydropathy profiles and invariant glycosylation sites, suggesting that functional properties of the proteins are conserved. Comparison of base composition at third codon positions and introns revealed AT-rich introns and GC-rich third codon positions, suggesting that the biased codon usage observed might not be due to mutational bias. Phylogenetic information content of third codon positions was surprisingly high and sufficient to recover the most basal nodes of the tree, in spite of the observation that pairwise distances (at third codon positions) were well above the presumed saturation level. This finding can be explained by the high proportion of phylogenetically informative nonsynonymous changes at third codon positions among these highly divergent proteins. Ependymin DNA sequences have established the first molecular evidence for the monophyly of a group containing salmonids and esociforms. In addition, ependymin suggests a sister group relationship of electric fish (Gymnotiformes) and Characiformes, constituting a significant departure from currently accepted classifications. However, relationships among characiform lineages were not completely resolved by ependymin sequences in spite of seemingly appropriate levels of variation among taxa and considerably low levels of homoplasy in the data (consistency index = 0.7). If the diversification of Characiformes took place in an "explosive" manner, over a relatively short period of time this pattern should also be observed using other phylogenetic markers. Poor conservation of ependymin's primary structure hinders the design of efficient primers for PCR that could be used in wide-ranging fish systematic studies. However, alternative methods like PCR amplification from cDNA used here should provide promising comparative sequence data for the resolution of phylogenetic relationships among other basal lineages of teleost fishes.
Micronuclear DNA of Oxytricha nova contains sequences with autonomously replicating activity in Saccharomyces cerevisiae.

PubMed Central

Colombo, M M; Swanton, M T; Donini, P; Prescott, D M

1984-01-01

Oxytricha nova is a hypotrichous ciliate with micronuclei and macronuclei. Micronuclei, which contain large, chromosomal-sized DNA, are genetically inert but undergo meiosis and exchange during cell mating. Macronuclei, which contain only small, gene-sized DNA molecules, provide all of the nuclear RNA needed to run the cell. After cell mating the macronucleus is derived from a micronucleus, a derivation that includes excision of the genes from chromosomes and elimination of the remaining DNA. The eliminated DNA includes all of the repetitious sequences and approximately 95% of the unique sequences. We cloned large restriction fragments from the micronucleus that confer replication ability on a replication-deficient plasmid in Saccharomyces cerevisiae. Sequences that confer replication ability are called autonomously replicating sequences. The frequency and effectiveness of autonomously replicating sequences in micronuclear DNA are similar to those reported for DNAs of other organisms introduced into yeast cells. Of the 12 micronuclear fragments with autonomously replicating sequence activity, 9 also showed homology to macronuclear DNA, indicating that they contain a macronuclear gene sequence. We conclude from this that autonomously replicating sequence activity is nonrandomly distributed throughout micronuclear DNA and is preferentially associated with those regions of micronuclear DNA that contain genes. Images PMID:6092934
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

PubMed Central

Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

2014-01-01

As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

PubMed

El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

2013-07-01

Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
Affordable hands-on DNA sequencing and genotyping: an exercise for teaching DNA analysis to undergraduates.

PubMed

Shah, Kushani; Thomas, Shelby; Stein, Arnold

2013-01-01

In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs178229931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.
Phylogenetic characterization of a biogas plant microbial community integrating clone library 16S-rDNA sequences and metagenome sequence data obtained by 454-pyrosequencing.

PubMed

Kröber, Magdalena; Bekel, Thomas; Diaz, Naryttza N; Goesmann, Alexander; Jaenicke, Sebastian; Krause, Lutz; Miller, Dimitri; Runte, Kai J; Viehöver, Prisca; Pühler, Alfred; Schlüter, Andreas

2009-06-01

The phylogenetic structure of the microbial community residing in a fermentation sample from a production-scale biogas plant fed with maize silage, green rye and liquid manure was analysed by an integrated approach using clone library sequences and metagenome sequence data obtained by 454-pyrosequencing. Sequencing of 109 clones from a bacterial and an archaeal 16S-rDNA amplicon library revealed that the obtained nucleotide sequences are similar but not identical to 16S-rDNA database sequences derived from different anaerobic environments including digestors and bioreactors. Most of the bacterial 16S-rDNA sequences could be assigned to the phylum Firmicutes with the most abundant class Clostridia and to the class Bacteroidetes, whereas most archaeal 16S-rDNA sequences cluster close to the methanogen Methanoculleus bourgensis. Further sequences of the archaeal library most probably represent so far non-characterised species within the genus Methanoculleus. A similar result derived from phylogenetic analysis of mcrA clone sequences. The mcrA gene product encodes the alpha-subunit of methyl-coenzyme-M reductase involved in the final step of methanogenesis. BLASTn analysis applying stringent settings resulted in assignment of 16S-rDNA metagenome sequence reads to 62 16S-rDNA amplicon sequences thus enabling frequency of abundance estimations for 16S-rDNA clone library sequences. Ribosomal Database Project (RDP) Classifier processing of metagenome 16S-rDNA reads revealed abundance of the phyla Firmicutes, Bacteroidetes and Euryarchaeota and the orders Clostridiales, Bacteroidales and Methanomicrobiales. Moreover, a large fraction of 16S-rDNA metagenome reads could not be assigned to lower taxonomic ranks, demonstrating that numerous microorganisms in the analysed fermentation sample of the biogas plant are still unclassified or unknown.
A species-specific nucleosomal signature defines a periodic distribution of amino acids in proteins.

PubMed

Quintales, Luis; Soriano, Ignacio; Vázquez, Enrique; Segurado, Mónica; Antequera, Francisco

2015-04-01

Nucleosomes are the basic structural units of chromatin. Most of the yeast genome is organized in a pattern of positioned nucleosomes that is stably maintained under a wide range of physiological conditions. In this work, we have searched for sequence determinants associated with positioned nucleosomes in four species of fission and budding yeasts. We show that mononucleosomal DNA follows a highly structured base composition pattern, which differs among species despite the high degree of histone conservation. These nucleosomal signatures are present in transcribed and non-transcribed regions across the genome. In the case of open reading frames, they correctly predict the relative distribution of codons on mononucleosomal DNA, and they also determine a periodicity in the average distribution of amino acids along the proteins. These results establish a direct and species-specific connection between the position of each codon around the histone octamer and protein composition.
Construction and Characterization of an in-vivo Linear Covalently Closed DNA Vector Production System

PubMed Central

2012-01-01

Background While safer than their viral counterparts, conventional non-viral gene delivery DNA vectors offer a limited safety profile. They often result in the delivery of unwanted prokaryotic sequences, antibiotic resistance genes, and the bacterial origins of replication to the target, which may lead to the stimulation of unwanted immunological responses due to their chimeric DNA composition. Such vectors may also impart the potential for chromosomal integration, thus potentiating oncogenesis. We sought to engineer an in vivo system for the quick and simple production of safer DNA vector alternatives that were devoid of non-transgene bacterial sequences and would lethally disrupt the host chromosome in the event of an unwanted vector integration event. Results We constructed a parent eukaryotic expression vector possessing a specialized manufactured multi-target site called “Super Sequence”, and engineered E. coli cells (R-cell) that conditionally produce phage-derived recombinase Tel (PY54), TelN (N15), or Cre (P1). Passage of the parent plasmid vector through R-cells under optimized conditions, resulted in rapid, efficient, and one step in vivo generation of mini lcc—linear covalently closed (Tel/TelN-cell), or mini ccc—circular covalently closed (Cre-cell), DNA constructs, separated from the backbone plasmid DNA. Site-specific integration of lcc plasmids into the host chromosome resulted in chromosomal disruption and 105 fold lower viability than that seen with the ccc counterpart. Conclusion We offer a high efficiency mini DNA vector production system that confers simple, rapid and scalable in vivo production of mini lcc DNA vectors that possess all the benefits of “minicircle” DNA vectors and virtually eliminate the potential for undesirable vector integration events. PMID:23216697
Seasonal succession leads to habitat-dependent differentiation in ribosomal RNA:DNA ratios among freshwater lake bacteria

DOE PAGES

Denef, Vincent J.; Fujimoto, Masanori; Berry, Michelle A.; ...

2016-04-29

Relative abundance profiles of bacterial populations measured by sequencing DNA or RNA of marker genes can widely differ. These differences, made apparent when calculating ribosomal RNA:DNA ratios, have been interpreted as variable activities of bacterial populations. However, inconsistent correlations between ribosomal RNA:DNA ratios and metabolic activity or growth rates have led to a more conservative interpretation of this metric as the cellular protein synthesis potential (PSP). Little is known, particularly in freshwater systems, about how PSP varies for specific taxa across temporal and spatial environmental gradients and how conserved PSP is across bacterial phylogeny. Here, we generated 16S rRNA genemore » sequencing data using simultaneously extracted DNA and RNA from fractionated (free-living and particulate) water samples taken seasonally along a eutrophic freshwater estuary to oligotrophic pelagic transect in Lake Michigan. In contrast to previous reports, we observed frequent clustering of DNA and RNA data from the same sample. Analysis of the overlap in taxa detected at the RNA and DNA level indicated that microbial dormancy may be more common in the estuary, the particulate fraction, and during the stratified period. Across spatiotemporal gradients, PSP was often conserved at the phylum and class levels. PSPs for specific taxa were more similar across habitats in spring than in summer and fall. This was most notable for PSPs of the same taxa when located in the free-living or particulate fractions, but also when contrasting surface to deep, and estuary to Lake Michigan communities. Our results show that community composition assessed by RNA and DNA measurements are more similar than previously assumed in freshwater systems. Furthermore, the similarity between RNA and DNA measurements and taxa-specific PSPs that drive community-level similarities are conditional on spatiotemporal factors.« less
Seasonal succession leads to habitat-dependent differentiation in ribosomal RNA:DNA ratios among freshwater lake bacteria

DOE Office of Scientific and Technical Information (OSTI.GOV)

Denef, Vincent J.; Fujimoto, Masanori; Berry, Michelle A.

Relative abundance profiles of bacterial populations measured by sequencing DNA or RNA of marker genes can widely differ. These differences, made apparent when calculating ribosomal RNA:DNA ratios, have been interpreted as variable activities of bacterial populations. However, inconsistent correlations between ribosomal RNA:DNA ratios and metabolic activity or growth rates have led to a more conservative interpretation of this metric as the cellular protein synthesis potential (PSP). Little is known, particularly in freshwater systems, about how PSP varies for specific taxa across temporal and spatial environmental gradients and how conserved PSP is across bacterial phylogeny. Here, we generated 16S rRNA genemore » sequencing data using simultaneously extracted DNA and RNA from fractionated (free-living and particulate) water samples taken seasonally along a eutrophic freshwater estuary to oligotrophic pelagic transect in Lake Michigan. In contrast to previous reports, we observed frequent clustering of DNA and RNA data from the same sample. Analysis of the overlap in taxa detected at the RNA and DNA level indicated that microbial dormancy may be more common in the estuary, the particulate fraction, and during the stratified period. Across spatiotemporal gradients, PSP was often conserved at the phylum and class levels. PSPs for specific taxa were more similar across habitats in spring than in summer and fall. This was most notable for PSPs of the same taxa when located in the free-living or particulate fractions, but also when contrasting surface to deep, and estuary to Lake Michigan communities. Our results show that community composition assessed by RNA and DNA measurements are more similar than previously assumed in freshwater systems. Furthermore, the similarity between RNA and DNA measurements and taxa-specific PSPs that drive community-level similarities are conditional on spatiotemporal factors.« less
Subspecies composition and founder contribution of the captive U.S. chimpanzee (Pan troglodytes) population.

PubMed

Ely, John J; Dye, Brent; Frels, William I; Fritz, Jo; Gagneux, Pascal; Khun, Henry H; Switzer, William M; Lee, D Rick

2005-10-01

Chimpanzees are presently classified into three subspecies: Pan troglodytes verus from west Africa, P.t. troglodytes from central Africa, and P.t. schweinfurthii from east Africa. A fourth subspecies (P.t. vellerosus), from Cameroon and northern Nigeria, has been proposed. These taxonomic designations are based on geographical origins and are reflected in sequence variation in the first hypervariable region (HVR-I) of the mtDNA D-loop. Although advances have been made in our understanding of chimpanzee phylogenetics, little has been known regarding the subspecies composition of captive chimpanzees. We sequenced part of the mtDNA HVR-I region in 218 African-born population founders and performed a phylogenetic analysis with previously characterized African sequences of known provenance to infer subspecies affiliations. Most founders were P.t. verus (95.0%), distantly followed by the troglodytes schweinfurthii clade (4.6%), and a single P.t. vellerosus (0.4%). Pedigree-based estimates of genomic representation in the descendant population revealed that troglodytes schweinfurthii founder representation was reduced in captivity, vellerosus representation increased due to prolific breeding by a single male, and reproductive variance resulted in uneven representation among male P.t.verus founders. No increase in mortality was evident from between-subspecies interbreeding, indicating a lack of outbreeding depression. Knowledge of subspecies and their genomic representation can form the basis for phylogenetically informed genetic management of extant chimpanzees to preserve rare genetic variation for research, conservation, or possible future breeding. Copyright 2005 Wiley-Liss, Inc.
DNA capture and next-generation sequencing can recover whole mitochondrial genomes from highly degraded samples for human identification

PubMed Central

2013-01-01

Background Mitochondrial DNA (mtDNA) typing can be a useful aid for identifying people from compromised samples when nuclear DNA is too damaged, degraded or below detection thresholds for routine short tandem repeat (STR)-based analysis. Standard mtDNA typing, focused on PCR amplicon sequencing of the control region (HVS I and HVS II), is limited by the resolving power of this short sequence, which misses up to 70% of the variation present in the mtDNA genome. Methods We used in-solution hybridisation-based DNA capture (using DNA capture probes prepared from modern human mtDNA) to recover mtDNA from post-mortem human remains in which the majority of DNA is both highly fragmented (<100 base pairs in length) and chemically damaged. The method ‘immortalises’ the finite quantities of DNA in valuable extracts as DNA libraries, which is followed by the targeted enrichment of endogenous mtDNA sequences and characterisation by next-generation sequencing (NGS). Results We sequenced whole mitochondrial genomes for human identification from samples where standard nuclear STR typing produced only partial profiles or demonstrably failed and/or where standard mtDNA hypervariable region sequences lacked resolving power. Multiple rounds of enrichment can substantially improve coverage and sequencing depth of mtDNA genomes from highly degraded samples. The application of this method has led to the reliable mitochondrial sequencing of human skeletal remains from unidentified World War Two (WWII) casualties approximately 70 years old and from archaeological remains (up to 2,500 years old). Conclusions This approach has potential applications in forensic science, historical human identification cases, archived medical samples, kinship analysis and population studies. In particular the methodology can be applied to any case, involving human or non-human species, where whole mitochondrial genome sequences are required to provide the highest level of maternal lineage discrimination. Multiple rounds of in-solution hybridisation-based DNA capture can retrieve whole mitochondrial genome sequences from even the most challenging samples. PMID:24289217
Thalassospira tepidiphila sp. nov., a polycyclic aromatic hydrocarbon-degrading bacterium isolated from seawater.

PubMed

Kodama, Yumiko; Stiknowati, Lies Indah; Ueki, Atsuko; Ueki, Katsuji; Watanabe, Kazuya

2008-03-01

A Gram-negative, mesophilic bacterial strain, designated 1-1B(T), which degrades polycyclic aromatic hydrocarbons, was isolated from petroleum-contaminated seawater during a bioremediation experiment. A 16S rRNA gene sequence analysis indicated that the isolate was affiliated with the genus Thalassospira in the Alphaproteobacteria; the sequence was found to be most similar to those of Thalassospira profundimaris WP0211(T) (99.8 %), Thalassospira xiamenensis M-5(T) (98.2 %) and Thalassospira lucentensis DSM 14000(T) (98.1 %). However, the levels of DNA-DNA relatedness between strain 1-1B(T) and these type strains were 50.7+/-17.2, 35.7+/-17.8 and 32.0+/-21.1 %, respectively. In addition, strain 1-1B(T) was found to be distinct from the other described species of the genus Thalassospira in terms of some taxonomically important traits, including DNA G+C content, optimum growth temperature, salinity tolerance, utilization of carbon sources and fatty acid composition. Furthermore, strain 1-1B(T) and T. profundimaris were also different with regard to motility and denitrification capacities. On the basis of physiological and DNA-DNA hybridization data, strain 1-1B(T) represents a novel species within the genus Thalassospira, for which the name Thalassospira tepidiphila sp. nov. is proposed. The type strain is 1-1B(T) (=JCM 14578(T) =DSM 18888(T)).
Microbial Analysis of Australian Dry Lake Cores; Analogs For Biogeochemical Processes

NASA Astrophysics Data System (ADS)

Nguyen, A. V.; Baldridge, A. M.; Thomson, B. J.

2014-12-01

Lake Gilmore in Western Australia is an acidic ephemeral lake that is analogous to Martian geochemical processes represented by interbedded phyllosilicates and sulfates. These areas demonstrate remnants of a global-scale change on Mars during the late Noachian era from a neutral to alkaline pH to relatively lower pH in the Hesperian era that continues to persist today. The geochemistry of these areas could possibly be caused by small-scale changes such as microbial metabolism. Two approaches were used to determine the presence of microbes in the Australian dry lake cores: DNA analysis and lipid analysis. Detecting DNA or lipids in the cores will provide evidence of living or deceased organisms since they provide distinct markers for life. Basic DNA analysis consists of extraction, amplification through PCR, plasmid cloning, and DNA sequencing. Once the sequence of unknown DNA is known, an online program, BLAST, will be used to identify the microbes for further analysis. The lipid analysis approach consists of phospholipid fatty acid analysis that is done by Microbial ID, which will provide direct identification any microbes from the presence of lipids. Identified microbes are then compared to mineralogy results from the x-ray diffraction of the core samples to determine if the types of metabolic reactions are consistent with the variation in composition in these analog deposits. If so, it provides intriguing implications for the presence of life in similar Martian deposits.
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.

PubMed

Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

2012-01-01

RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate the DNA sequence or user can upload the sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various application for the sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides the tools for routinely performed sequence analysis by the biological scientists such as DNA replication, reverse compliment generation, transcription, translation, sequence specific information as total number of nucleotide bases, ATGC base contents along with their respective percentages and sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides user friendly environment for sequence analysis. It is freely available. http://www.cemb.edu.pk/sw.html RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.
Ribosomal DNA Organization Before and After Magnification in Drosophila melanogaster

PubMed Central

Bianciardi, Alessio; Boschi, Manuela; Swanson, Ellen E.; Belloni, Massimo; Robbins, Leonard G.

2012-01-01

In all eukaryotes, the ribosomal RNA genes are stably inherited redundant elements. In Drosophila melanogaster, the presence of a Ybb− chromosome in males, or the maternal presence of the Ribosomal exchange (Rex) element, induces magnification: a heritable increase of rDNA copy number. To date, several alternative classes of mechanisms have been proposed for magnification: in situ replication or extra-chromosomal replication, either of which might act on short or extended strings of rDNA units, or unequal sister chromatid exchange. To eliminate some of these hypotheses, none of which has been clearly proven, we examined molecular-variant composition and compared genetic maps of the rDNA in the bb2 mutant and in some magnified bb+ alleles. The genetic markers used are molecular-length variants of IGS sequences and of R1 and R2 mobile elements present in many 28S sequences. Direct comparison of PCR products does not reveal any particularly intensified electrophoretic bands in magnified alleles compared to the nonmagnified bb2 allele. Hence, the increase of rDNA copy number is diluted among multiple variants. We can therefore reject mechanisms of magnification based on multiple rounds of replication of short strings. Moreover, we find no changes of marker order when pre- and postmagnification maps are compared. Thus, we can further restrict the possible mechanisms to two: replication in situ of an extended string of rDNA units or unequal exchange between sister chromatids. PMID:22505623
Direct Detection and Sequencing of Damaged DNA Bases

PubMed Central

2011-01-01

Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications. PMID:22185597
Direct detection and sequencing of damaged DNA bases.

PubMed

Clark, Tyson A; Spittle, Kristi E; Turner, Stephen W; Korlach, Jonas

2011-12-20

Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications.
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1987-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3575113

A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1990-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2333227
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1988-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3368330
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1989-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2654889
Kilo-sequencing: an ordered strategy for rapid DNA sequence data acquisition.

PubMed Central

Barnes, W M; Bevan, M

1983-01-01

A strategy for rapid DNA sequence acquisition in an ordered, nonrandom manner, while retaining all of the conveniences of the dideoxy method with M13 transducing phage DNA template, is described. Target DNA 3 to 14 kb in size can be stably carried by our M13 vectors. Suitable targets are stretches of DNA which lack an enzyme recognition site which is unique on our cloning vectors and adjacent to the sequencing primer; current sites that are so useful when lacking are Pst, Xba, HindIII, BglII, EcoRI. By an in vitro procedure, we cut RF DNA once randomly and once specifically, to create thousands of deletions which start at the unique restriction site adjacent to the dideoxy sequencing primer and extend various distances across the target DNA. Phage carrying a desired size of deletions, whose DNA as template will give rise to DNA sequence data in a desired location along the target DNA, may be purified by electrophoresis alive on agarose gels. Phage running in the same location on the agarose gel thus conveniently give rise to nucleotide sequence data from the same kilobase of target DNA. Images PMID:6298723
Archaeogenetics of Late Iron Age Çemialo Sırtı, Batman: Investigating maternal genetic continuity in north Mesopotamia since the Neolithic.

PubMed

Yaka, Reyhan; Birand, Ayşegül; Yılmaz, Yasemin; Caner, Ceren; Açan, Sinan Can; Gündüzalp, Sidar; Parvizi, Poorya; Erim Özdoğan, Aslı; Togan, İnci; Somel, Mehmet

2018-05-01

North Mesopotamia has witnessed dramatic social change during the Holocene, but the impact of these events on its demographic history is poorly understood. Here, we study this question by analysing genetic data from the recently excavated Late Iron Age settlement of Çemialo Sırtı in Batman, southeast Turkey. Archaeological and radiocarbon evidence indicate that the site was inhabited during the second and first millennia BCE. Çemialo Sırtı reveals nomadic items of the Early Iron Age, as well as items associated with the Late Achaemenid and subsequent Hellenistic Periods. We compare Çemialo Sırtı mitochondrial DNA profiles with earlier and later populations from west Eurasia to describe genetic continuity patterns in the region. A total of 16 Çemialo Sırtı individuals' remains were studied. PCR and Sanger sequencing were used to obtain mitochondrial DNA HVRI-HVRII sequences. We studied haplotype diversity and pairwise genetic distances using F ST , comparing the Çemialo Sırtı population with ancient and modern-day populations from west Eurasia. Coalescent simulations were carried out to test continuity for specific population comparisons. Mitochondrial DNA (mtDNA) haplotypes from 12 Çemialo Sırtı individuals reveal high haplotype diversity in this population, conspicuously higher than early Holocene west Eurasian populations, which supports the notion of increasing population admixture in west Eurasia through the Holocene. In its mtDNA composition, Çemialo Sırtı shows highest affinity to Neolithic north Syria and Neolithic Anatolia among ancient populations studied, and to modern-day southwest Asian populations. Based on population genetic simulations we cannot reject continuity between Neolithic and Iron Age, or between Iron Age and present-day populations of the region. Despite the region's complex sociopolitical history and indication for increased genetic diversity over time, we find no evidence for sharp shifts in north Mesopotamian maternal genetic composition within the last 10,000 years. © 2018 Wiley Periodicals, Inc.
Silicene nanoribbon as a new DNA sequencing device

NASA Astrophysics Data System (ADS)

Alesheikh, Sara; Shahtahmassebi, Nasser; Roknabadi, Mahmood Rezaee; Pilevar Shahri, Raheleh

2018-02-01

The importance of applying DNA sequencing in different fields, results in looking for fast and cheap methods. Nanotechnology helps this development by introducing nanostructures used for DNA sequencing. In this work we study the interaction between zigzag silicene nanoribbon and DNA nucleobases using DFT and non equilibrium Green's function approach, to investigate the possibility of using zigzag silicene nanoribbons as a biosensor for DNA sequencing.
Isolation and characterization of target sequences of the chicken CdxA homeobox gene.

PubMed Central

Margalit, Y; Yarus, S; Shapira, E; Gruenbaum, Y; Fainsod, A

1993-01-01

The DNA binding specificity of the chicken homeodomain protein CDXA was studied. Using a CDXA-glutathione-S-transferase fusion protein, DNA fragments containing the binding site for this protein were isolated. The sources of DNA were oligonucleotides with random sequence and chicken genomic DNA. The DNA fragments isolated were sequenced and tested in DNA binding assays. Sequencing revealed that most DNA fragments are AT rich which is a common feature of homeodomain binding sites. By electrophoretic mobility shift assays it was shown that the different target sequences isolated bind to the CDXA protein with different affinities. The specific sequences bound by the CDXA protein in the genomic fragments isolated, were determined by DNase I footprinting. From the footprinted sequences, the CDXA consensus binding site was determined. The CDXA protein binds the consensus sequence A, A/T, T, A/T, A, T, A/G. The CAUDAL binding site in the ftz promoter is also included in this consensus sequence. When tested, some of the genomic target sequences were capable of enhancing the transcriptional activity of reporter plasmids when introduced into CDXA expressing cells. This study determined the DNA sequence specificity of the CDXA protein and it also shows that this protein can further activate transcription in cells in culture. Images PMID:7909943
Sequence periodicity in nucleosomal DNA and intrinsic curvature.

PubMed

Nair, T Murlidharan

2010-05-17

Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy the more rigid the DNA helix, thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understand their sequence-dependent structures. Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequence from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. The higher order structures associated with nucleosomes of C.elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results points to the possibility of context dependent curvature of varying degrees to be associated with nucleosomal DNA.
Assessing the Fidelity of Ancient DNA Sequences Amplified From Nuclear Genes

PubMed Central

Binladen, Jonas; Wiuf, Carsten; Gilbert, M. Thomas P.; Bunce, Michael; Barnett, Ross; Larson, Greger; Greenwood, Alex D.; Haile, James; Ho, Simon Y. W.; Hansen, Anders J.; Willerslev, Eske

2006-01-01

To date, the field of ancient DNA has relied almost exclusively on mitochondrial DNA (mtDNA) sequences. However, a number of recent studies have reported the successful recovery of ancient nuclear DNA (nuDNA) sequences, thereby allowing the characterization of genetic loci directly involved in phenotypic traits of extinct taxa. It is well documented that postmortem damage in ancient mtDNA can lead to the generation of artifactual sequences. However, as yet no one has thoroughly investigated the damage spectrum in ancient nuDNA. By comparing clone sequences from 23 fossil specimens, recovered from environments ranging from permafrost to desert, we demonstrate the presence of miscoding lesion damage in both the mtDNA and nuDNA, resulting in insertion of erroneous bases during amplification. Interestingly, no significant differences in the frequency of miscoding lesion damage are recorded between mtDNA and nuDNA despite great differences in cellular copy numbers. For both mtDNA and nuDNA, we find significant positive correlations between total sequence heterogeneity and the rates of type 1 transitions (adenine → guanine and thymine → cytosine) and type 2 transitions (cytosine → thymine and guanine → adenine), respectively. Type 2 transitions are by far the most dominant and increase relative to those of type 1 with damage load. The results suggest that the deamination of cytosine (and 5-methyl cytosine) to uracil (and thymine) is the main cause of miscoding lesions in both ancient mtDNA and nuDNA sequences. We argue that the problems presented by postmortem damage, as well as problems with contamination from exogenous sources of conserved nuclear genes, allelic variation, and the reliance on single nucleotide polymorphisms, call for great caution in studies relying on ancient nuDNA sequences. PMID:16299392
MiFish, a set of universal PCR primers for metabarcoding environmental DNA from fishes: detection of more than 230 subtropical marine species

PubMed Central

Miya, M.; Sato, Y.; Fukunaga, T.; Sado, T.; Poulsen, J. Y.; Sato, K.; Minamoto, T.; Yamamoto, S.; Yamanaka, H.; Araki, H.; Kondoh, M.; Iwasaki, W.

2015-01-01

We developed a set of universal PCR primers (MiFish-U/E) for metabarcoding environmental DNA (eDNA) from fishes. Primers were designed using aligned whole mitochondrial genome (mitogenome) sequences from 880 species, supplemented by partial mitogenome sequences from 160 elasmobranchs (sharks and rays). The primers target a hypervariable region of the 12S rRNA gene (163–185 bp), which contains sufficient information to identify fishes to taxonomic family, genus and species except for some closely related congeners. To test versatility of the primers across a diverse range of fishes, we sampled eDNA from four tanks in the Okinawa Churaumi Aquarium with known species compositions, prepared dual-indexed libraries and performed paired-end sequencing of the region using high-throughput next-generation sequencing technologies. Out of the 180 marine fish species contained in the four tanks with reference sequences in a custom database, we detected 168 species (93.3%) distributed across 59 families and 123 genera. These fishes are not only taxonomically diverse, ranging from sharks and rays to higher teleosts, but are also greatly varied in their ecology, including both pelagic and benthic species living in shallow coastal to deep waters. We also sampled natural seawaters around coral reefs near the aquarium and detected 93 fish species using this approach. Of the 93 species, 64 were not detected in the four aquarium tanks, rendering the total number of species detected to 232 (from 70 families and 152 genera). The metabarcoding approach presented here is non-invasive, more efficient, more cost-effective and more sensitive than the traditional survey methods. It has the potential to serve as an alternative (or complementary) tool for biodiversity monitoring that revolutionizes natural resource management and ecological studies of fish communities on larger spatial and temporal scales. PMID:26587265
MiFish, a set of universal PCR primers for metabarcoding environmental DNA from fishes: detection of more than 230 subtropical marine species.

PubMed

Miya, M; Sato, Y; Fukunaga, T; Sado, T; Poulsen, J Y; Sato, K; Minamoto, T; Yamamoto, S; Yamanaka, H; Araki, H; Kondoh, M; Iwasaki, W

2015-07-01

We developed a set of universal PCR primers (MiFish-U/E) for metabarcoding environmental DNA (eDNA) from fishes. Primers were designed using aligned whole mitochondrial genome (mitogenome) sequences from 880 species, supplemented by partial mitogenome sequences from 160 elasmobranchs (sharks and rays). The primers target a hypervariable region of the 12S rRNA gene (163-185 bp), which contains sufficient information to identify fishes to taxonomic family, genus and species except for some closely related congeners. To test versatility of the primers across a diverse range of fishes, we sampled eDNA from four tanks in the Okinawa Churaumi Aquarium with known species compositions, prepared dual-indexed libraries and performed paired-end sequencing of the region using high-throughput next-generation sequencing technologies. Out of the 180 marine fish species contained in the four tanks with reference sequences in a custom database, we detected 168 species (93.3%) distributed across 59 families and 123 genera. These fishes are not only taxonomically diverse, ranging from sharks and rays to higher teleosts, but are also greatly varied in their ecology, including both pelagic and benthic species living in shallow coastal to deep waters. We also sampled natural seawaters around coral reefs near the aquarium and detected 93 fish species using this approach. Of the 93 species, 64 were not detected in the four aquarium tanks, rendering the total number of species detected to 232 (from 70 families and 152 genera). The metabarcoding approach presented here is non-invasive, more efficient, more cost-effective and more sensitive than the traditional survey methods. It has the potential to serve as an alternative (or complementary) tool for biodiversity monitoring that revolutionizes natural resource management and ecological studies of fish communities on larger spatial and temporal scales.
[Current applications of high-throughput DNA sequencing technology in antibody drug research].

PubMed

Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong

2012-03-01

Since the publication of a high-throughput DNA sequencing technology based on PCR reaction was carried out in oil emulsions in 2005, high-throughput DNA sequencing platforms have been evolved to a robust technology in sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation of discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we present a review of current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach of discovery and development of antibody drugs.
Cryo-EM Structures Reveal Mechanism and Inhibition of DNA Targeting by a CRISPR-Cas Surveillance Complex.

PubMed

Guo, Tai Wei; Bartesaghi, Alberto; Yang, Hui; Falconieri, Veronica; Rao, Prashant; Merk, Alan; Eng, Edward T; Raczkowski, Ashleigh M; Fox, Tara; Earl, Lesley A; Patel, Dinshaw J; Subramaniam, Sriram

2017-10-05

Prokaryotic cells possess CRISPR-mediated adaptive immune systems that protect them from foreign genetic elements, such as invading viruses. A central element of this immune system is an RNA-guided surveillance complex capable of targeting non-self DNA or RNA for degradation in a sequence- and site-specific manner analogous to RNA interference. Although the complexes display considerable diversity in their composition and architecture, many basic mechanisms underlying target recognition and cleavage are highly conserved. Using cryoelectron microscopy (cryo-EM), we show that the binding of target double-stranded DNA (dsDNA) to a type I-F CRISPR system yersinia (Csy) surveillance complex leads to large quaternary and tertiary structural changes in the complex that are likely necessary in the pathway leading to target dsDNA degradation by a trans-acting helicase-nuclease. Comparison of the structure of the surveillance complex before and after dsDNA binding, or in complex with three virally encoded anti-CRISPR suppressors that inhibit dsDNA binding, reveals mechanistic details underlying target recognition and inhibition. Published by Elsevier Inc.
DNA fingerprinting, DNA barcoding, and next generation sequencing technology in plants.

PubMed

Sucher, Nikolaus J; Hennell, James R; Carles, Maria C

2012-01-01

DNA fingerprinting of plants has become an invaluable tool in forensic, scientific, and industrial laboratories all over the world. PCR has become part of virtually every variation of the plethora of approaches used for DNA fingerprinting today. DNA sequencing is increasingly used either in combination with or as a replacement for traditional DNA fingerprinting techniques. A prime example is the use of short, standardized regions of the genome as taxon barcodes for biological identification of plants. Rapid advances in "next generation sequencing" (NGS) technology are driving down the cost of sequencing and bringing large-scale sequencing projects into the reach of individual investigators. We present an overview of recent publications that demonstrate the use of "NGS" technology for DNA fingerprinting and DNA barcoding applications.
Mammalian DNA enriched for replication origins is enriched for snap-back sequences.

PubMed

Zannis-Hadjopoulos, M; Kaufmann, G; Martin, R G

1984-11-15

Using the instability of replication loops as a method for the isolation of double-stranded nascent DNA, extruded DNA enriched for replication origins was obtained and denatured. Snap-back DNA, single-stranded DNA with inverted repeats (palindromic sequences), reassociates rapidly into stem-loop structures with zero-order kinetics when conditions are changed from denaturing to renaturing, and can be assayed by chromatography on hydroxyapatite. Origin-enriched nascent DNA strands from mouse, rat and monkey cells growing either synchronously or asynchronously were purified and assayed for the presence of snap-back sequences. The results show that origin-enriched DNA is also enriched for snap-back sequences, implying that some origins for mammalian DNA replication contain or lie near palindromic sequences.
Molecular organization and chromosomal localization of 5S rDNA in Amazonian Engystomops (Anura, Leiuperidae)

PubMed Central

2012-01-01

Background For anurans, knowledge of 5S rDNA is scarce. For Engystomops species, chromosomal homeologies are difficult to recognize due to the high level of inter- and intraspecific cytogenetic variation. In an attempt to better compare the karyotypes of the Amazonian species Engystomops freibergi and Engystomops petersi, and to extend the knowledge of 5S rDNA organization in anurans, the 5S rDNA sequences of Amazonian Engystomops species were isolated, characterized, and mapped. Results Two types of 5S rDNA, which were readily differentiated by their NTS (non-transcribed spacer) sizes and compositions, were isolated from specimens of E. freibergi from Brazil and E. petersi from two Ecuadorian localities (Puyo and Yasuní). In the E. freibergi karyotypes, the entire type I 5S rDNA repeating unit hybridized to the pericentromeric region of 3p, whereas the entire type II 5S rDNA repeating unit mapped to the distal region of 6q, suggesting a differential localization of these sequences. The type I NTS probe clearly detected the 3p pericentromeric region in the karyotypes of E. freibergi and E. petersi from Puyo and the 5p pericentromeric region in the karyotype of E. petersi from Yasuní, but no distal or interstitial signals were observed. Interestingly, this probe also detected many centromeric regions in the three karyotypes, suggesting the presence of a satellite DNA family derived from 5S rDNA. The type II NTS probe detected only distal 6q regions in the three karyotypes, corroborating the differential distribution of the two types of 5S rDNA. Conclusions Because the 5S rDNA types found in Engystomops are related to those of Physalaemus with respect to their nucleotide sequences and chromosomal locations, their origin likely preceded the evolutionary divergence of these genera. In addition, our data indicated homeology between Chromosome 5 in E. petersi from Yasuní and Chromosomes 3 in E. freibergi and E. petersi from Puyo. In addition, the chromosomal location of the type II 5S rDNA corroborates the hypothesis that the Chromosomes 6 of E. petersi and E. freibergi are homeologous despite the great differences observed between the karyotypes of the Yasuní specimens and the others. PMID:22433220
Molecular organization and chromosomal localization of 5S rDNA in Amazonian Engystomops (Anura, Leiuperidae).

PubMed

Rodrigues, Débora Silva; Rivera, Miryan; Lourenço, Luciana Bolsoni

2012-03-20

For anurans, knowledge of 5S rDNA is scarce. For Engystomops species, chromosomal homeologies are difficult to recognize due to the high level of inter- and intraspecific cytogenetic variation. In an attempt to better compare the karyotypes of the Amazonian species Engystomops freibergi and Engystomops petersi, and to extend the knowledge of 5S rDNA organization in anurans, the 5S rDNA sequences of Amazonian Engystomops species were isolated, characterized, and mapped. Two types of 5S rDNA, which were readily differentiated by their NTS (non-transcribed spacer) sizes and compositions, were isolated from specimens of E. freibergi from Brazil and E. petersi from two Ecuadorian localities (Puyo and Yasuní). In the E. freibergi karyotypes, the entire type I 5S rDNA repeating unit hybridized to the pericentromeric region of 3p, whereas the entire type II 5S rDNA repeating unit mapped to the distal region of 6q, suggesting a differential localization of these sequences. The type I NTS probe clearly detected the 3p pericentromeric region in the karyotypes of E. freibergi and E. petersi from Puyo and the 5p pericentromeric region in the karyotype of E. petersi from Yasuní, but no distal or interstitial signals were observed. Interestingly, this probe also detected many centromeric regions in the three karyotypes, suggesting the presence of a satellite DNA family derived from 5S rDNA. The type II NTS probe detected only distal 6q regions in the three karyotypes, corroborating the differential distribution of the two types of 5S rDNA. Because the 5S rDNA types found in Engystomops are related to those of Physalaemus with respect to their nucleotide sequences and chromosomal locations, their origin likely preceded the evolutionary divergence of these genera. In addition, our data indicated homeology between Chromosome 5 in E. petersi from Yasuní and Chromosomes 3 in E. freibergi and E. petersi from Puyo. In addition, the chromosomal location of the type II 5S rDNA corroborates the hypothesis that the Chromosomes 6 of E. petersi and E. freibergi are homeologous despite the great differences observed between the karyotypes of the Yasuní specimens and the others.
Traceability of Plant Diet Contents in Raw Cow Milk Samples

PubMed Central

Ponzoni, Elena; Mastromauro, Francesco; Gianì, Silvia; Breviario, Diego

2009-01-01

The use of molecular marker in the dairy sector is gaining large acceptance as a reliable diagnostic approach for food authenticity and traceability. Using a PCR approach, the rbcL marker, a chloroplast-based gene, was selected to amplify plant DNA fragments in raw cow milk samples collected from stock farms or bought on the Italian market. rbcL-specific DNA fragments could be found in total milk, as well as in the skimmed and the cream fractions. When the PCR amplified fragments were sent to sequence, the nucleotide composition of the chromatogram reflected the multiple contents of the polyphytic diet. PMID:22253982
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequencesmore » in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.« less
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE PAGES

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio; ...

2016-03-09

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequencesmore » in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.« less

Specific minor groove solvation is a crucial determinant of DNA binding site recognition

PubMed Central

Harris, Lydia-Ann; Williams, Loren Dean; Koudelka, Gerald B.

2014-01-01

The DNA sequence preferences of nearly all sequence specific DNA binding proteins are influenced by the identities of bases that are not directly contacted by protein. Discrimination between non-contacted base sequences is commonly based on the differential abilities of DNA sequences to allow narrowing of the DNA minor groove. However, the factors that govern the propensity of minor groove narrowing are not completely understood. Here we show that the differential abilities of various DNA sequences to support formation of a highly ordered and stable minor groove solvation network are a key determinant of non-contacted base recognition by a sequence-specific binding protein. In addition, disrupting the solvent network in the non-contacted region of the binding site alters the protein's ability to recognize contacted base sequences at positions 5–6 bases away. This observation suggests that DNA solvent interactions link contacted and non-contacted base recognition by the protein. PMID:25429976
Census of the bacterial community of the gypsy moth larval midgut by using culturing and culture-independent methods.

PubMed

Broderick, Nichole A; Raffa, Kenneth F; Goodman, Robert M; Handelsman, Jo

2004-01-01

Little is known about bacteria associated with Lepidoptera, the large group of mostly phytophagous insects comprising the moths and butterflies. We inventoried the larval midgut bacteria of a polyphagous foliivore, the gypsy moth (Lymantria dispar L.), whose gut is highly alkaline, by using traditional culturing and culture-independent methods. We also examined the effects of diet on microbial composition. Analysis of individual third-instar larvae revealed a high degree of similarity of microbial composition among insects fed on the same diet. DNA sequence analysis indicated that most of the PCR-amplified 16S rRNA genes belong to the gamma-Proteobacteria and low G+C gram-positive divisions and that the cultured members represented more than half of the phylotypes identified. Less frequently detected taxa included members of the alpha-Proteobacterium, Actinobacterium, and Cytophaga/Flexibacter/Bacteroides divisions. The 16S rRNA gene sequences from 7 of the 15 cultured organisms and 8 of the 9 sequences identified by PCR amplification diverged from previously reported bacterial sequences. The microbial composition of midguts differed substantially among larvae feeding on a sterilized artificial diet, aspen, larch, white oak, or willow. 16S rRNA analysis of cultured isolates indicated that an Enterococcus species and culture-independent analysis indicated that an Entbacter sp. were both present in all larvae, regardless of the feeding substrate; the sequences of these two phylotypes varied less than 1% among individual insects. These results provide the first comprehensive description of the microbial diversity of a lepidopteran midgut and demonstrate that the plant species in the diet influences the composition of the gut bacterial community.
A Method for Preparing DNA Sequencing Templates Using a DNA-Binding Microplate

PubMed Central

Yang, Yu; Hebron, Haroun R.; Hang, Jun

2009-01-01

A DNA-binding matrix was immobilized on the surface of a 96-well microplate and used for plasmid DNA preparation for DNA sequencing. The same DNA-binding plate was used for bacterial growth, cell lysis, DNA purification, and storage. In a single step using one buffer, bacterial cells were lysed by enzymes, and released DNA was captured on the plate simultaneously. After two wash steps, DNA was eluted and stored in the same plate. Inclusion of phosphates in the culture medium was found to enhance the yield of plasmid significantly. Purified DNA samples were used successfully in DNA sequencing with high consistency and reproducibility. Eleven vectors and nine libraries were tested using this method. In 10 μl sequencing reactions using 3 μl sample and 0.25 μl BigDye Terminator v3.1, the results from a 3730xl sequencer gave a success rate of 90–95% and read-lengths of 700 bases or more. The method is fully automatable and convenient for manual operation as well. It enables reproducible, high-throughput, rapid production of DNA with purity and yields sufficient for high-quality DNA sequencing at a substantially reduced cost. PMID:19568455
A New Perspective on Polyploid Fragaria (Strawberry) Genome Composition Based on Large-Scale, Multi-Locus Phylogenetic Analysis

PubMed Central

Yang, Yilong

2017-01-01

Abstract The subgenomic compositions of the octoploid (2n = 8× = 56) strawberry (Fragaria) species, including the economically important cultivated species Fragaria x ananassa, have been a topic of long-standing interest. Phylogenomic approaches utilizing next-generation sequencing technologies offer a new window into species relationships and the subgenomic compositions of polyploids. We have conducted a large-scale phylogenetic analysis of Fragaria (strawberry) species using the Fluidigm Access Array system and 454 sequencing platform. About 24 single-copy or low-copy nuclear genes distributed across the genome were amplified and sequenced from 96 genomic DNA samples representing 16 Fragaria species from diploid (2×) to decaploid (10×), including the most extensive sampling of octoploid taxa yet reported. Individual gene trees were constructed by different tree-building methods. Mosaic genomic structures of diploid Fragaria species consisting of sequences at different phylogenetic positions were observed. Our findings support the presence in octoploid species of genetic signatures from at least five diploid ancestors (F. vesca, F. iinumae, F. bucharica, F. viridis, and at least one additional allele contributor of unknown identity), and questions the extent to which distinct subgenomes are preserved over evolutionary time in the allopolyploid Fragaria species. In addition, our data support divergence between the two wild octoploid species, F. virginiana and F. chiloensis. PMID:29045639
Dendritic Cell-Based Immunotherapy of Breast Cancer: Modulation by CpG DNA

DTIC Science & Technology

2005-09-01

tumor-associated antigens and bacterial DNA oligodeoxynucleotides containing unmethylated CpG sequences (CpG DNA) further augment the immune priming...associated antigens by cytotoxic T lymphocytes, and bacterial DNA oligodeoxy- nucleotides containing unmethylated CpG sequences (CpG DNA) can further...further amplify their immunostimulatory capacity and bacterial DNA oligodeoxynucleotides (ODN) containing unmethylated CpG sequences (CpG DNA) provide such
A rapid and cost-effective method for sequencing pooled cDNA clones by using a combination of transposon insertion and Gateway technology.

PubMed

Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide

2011-09-01

Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.
Biological sequence compression algorithms.

PubMed

Matsumoto, T; Sadakane, K; Imai, H

2000-01-01

Today, more and more DNA sequences are becoming available. The information about DNA sequences are stored in molecular biology databases. The size and importance of these databases will be bigger and bigger in the future, therefore this information must be stored or communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. The standard compression algorithms such as gzip or compress cannot compress DNA sequences, but only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known. One is called palindromes or reverse complements and the other structure is approximate repeats. Several specific algorithms for DNA sequences that use these structures can compress them less than two bits per symbol. In this paper, we improve the CTW so that characteristic structures of DNA sequences are available. Before encoding the next symbol, the algorithm searches an approximate repeat and palindrome using hash and dynamic programming. If there is a palindrome or an approximate repeat with enough length then our algorithm represents it with length and distance. By using this preprocessing, a new program achieves a little higher compression ratio than that of existing DNA-oriented compression algorithms. We also describe new compression algorithm for protein sequences.
Complementation of a red-light-indifferent cyanobacterial mutant.

PubMed Central

Chiang, G G; Schaefer, M R; Grossman, A R

1992-01-01

Many cyanobacteria alter their phycobilisome composition in response to changes in light wavelength in a process termed complementary chromatic adaptation. Mutant strains FdR1 and FdR2 of the filamentous cyanobacterium Fremyella diplosiphon are characterized by aberrant chromatic adaptation. Instead of adjusting to different wavelengths of light, FdR1 and FdR2 behave as if they are always in green light; they do not respond to red light. We have previously reported complementation of FdR1 by conjugal transfer of a wild-type genomic library. The complementing DNA has now been localized by genetic analysis to a region on the rescued genomic subclone that contains a gene designated rcaC. This region of DNA is also able to complement FdR2. Southern blot analysis of genomic DNA from FdR1 and FdR2 indicates that these strains harbor DNA insertions within the rcaC sequence that may have resulted from the activity of transposable genetic elements. The predicted amino acid sequence of RcaC shares strong identity to response regulators of bacterial two-component regulatory systems. This relationship is discussed in the context of the signal-transduction pathway mediating regulation of genes encoding phycobilisome polypeptides during chromatic adaptation. Images PMID:1409650
Influence of structural variation on nuclear localization of DNA-binding polyamide-fluorophore conjugates.

PubMed

Edelson, Benjamin S; Best, Timothy P; Olenyuk, Bogdan; Nickols, Nicholas G; Doss, Raymond M; Foister, Shane; Heckel, Alexander; Dervan, Peter B

2004-01-01

A pivotal step forward in chemical approaches to controlling gene expression is the development of sequence-specific DNA-binding molecules that can enter live cells and traffic to nuclei unaided. DNA-binding polyamides are a class of programmable, sequence-specific small molecules that have been shown to influence a wide variety of protein-DNA interactions. We have synthesized over 100 polyamide-fluorophore conjugates and assayed their nuclear uptake profiles in 13 mammalian cell lines. The compiled dataset, comprising 1300 entries, establishes a benchmark for the nuclear localization of polyamide-dye conjugates. Compounds in this series were chosen to provide systematic variation in several structural variables, including dye composition and placement, molecular weight, charge, ordering of the aromatic and aliphatic amino-acid building blocks and overall shape. Nuclear uptake does not appear to be correlated with polyamide molecular weight or with the number of imidazole residues, although the positions of imidazole residues affect nuclear access properties significantly. Generally negative determinants for nuclear access include the presence of a beta-Ala-tail residue and the lack of a cationic alkyl amine moiety, whereas the presence of an acetylated 2,4-diaminobutyric acid-turn is a positive factor for nuclear localization. We discuss implications of these data on the design of polyamide-dye conjugates for use in biological systems.
Genomic Organization Under Different Environmental Conditions: Hoplosternum Littorale as a Model

PubMed Central

Schneider, Carlos Henrique; Feldberg, Eliana; Baccaro, Fabricio Beggiato; Carvalho, Natália Dayane Moura; Gross, Maria Claudia

2016-01-01

Abstract The Amazon has abundant rivers, streams, and floodplains in both polluted and nonpolluted environments, which show great adaptability. Thus, the goal of this study was to map repetitive DNA sequences in both mitotic chromosomes and erythrocyte micronuclei of tamoatás from polluted and nonpolluted environments and to assess the possible genotoxic effects of these environments. Individuals were collected in Manaus, Amazonas (AM), and submitted to classical and molecular cytogenetic techniques, as well as to a blood micronucleus test. Diploid number equal to 60 chromosomes are present in all individuals, with 18S ribosomal DNA sites present in one chromosome pair and no interstitial telomeric sites on chromosomes. The micronucleus test showed no significant differences in pairwise comparisons between environments or collection sites, but the Rex3 retroelement was dispersed on the chromosomes of individuals from unpolluted environments and compartmentalized in individuals from polluted environments. Divergent numbers of 5S rDNA sites are present in individuals from unpolluted and polluted environments. The mapping of repetitive sequences revealed that micronuclei have different compositions both intra- and interindividually that suggests different regions are lost in the formation of micronuclei, and no single fragile region undergoes breaks, although repetitive DNA elements are involved in this process. PMID:26981695
Detection of DNA Methylation by Whole-Genome Bisulfite Sequencing.

PubMed

Li, Qing; Hermanson, Peter J; Springer, Nathan M

2018-01-01

DNA methylation plays an important role in the regulation of the expression of transposons and genes. Various methods have been developed to assay DNA methylation levels. Bisulfite sequencing is considered to be the "gold standard" for single-base resolution measurement of DNA methylation levels. Coupled with next-generation sequencing, whole-genome bisulfite sequencing (WGBS) allows DNA methylation to be evaluated at a genome-wide scale. Here, we described a protocol for WGBS in plant species with large genomes. This protocol has been successfully applied to assay genome-wide DNA methylation levels in maize and barley. This protocol has also been successfully coupled with sequence capture technology to assay DNA methylation levels in a targeted set of genomic regions.
A comparison of sedimentary DNA and pollen from lake sediments in recording vegetation composition at the Siberian treeline.

PubMed

Niemeyer, Bastian; Epp, Laura S; Stoof-Leichsenring, Kathleen R; Pestryakova, Luidmila A; Herzschuh, Ulrike

2017-11-01

Reliable information on past and present vegetation is important to project future changes, especially for rapidly transitioning areas such as the boreal treeline. To study past vegetation, pollen analysis is common, while current vegetation is usually assessed by field surveys. Application of detailed sedimentary DNA (sedDNA) records has the potential to enhance our understanding of vegetation changes, but studies systematically investigating the power of this proxy are rare to date. This study compares sedDNA metabarcoding and pollen records from surface sediments of 31 lakes along a north-south gradient of increasing forest cover in northern Siberia (Taymyr peninsula) with data from field surveys in the surroundings of the lakes. sedDNA metabarcoding recorded 114 plant taxa, about half of them to species level, while pollen analyses identified 43 taxa, both exceeding the 31 taxa found by vegetation field surveys. Increasing Larix percentages from north to south were consistently recorded by all three methods and principal component analyses based on percentage data of vegetation surveys and DNA sequences separated tundra from forested sites. Comparisons of the ordinations using procrustes and protest analyses show a significant fit among all compared pairs of records. Despite similarities of sedDNA and pollen records, certain idiosyncrasies, such as high percentages of Alnus and Betula in all pollen and high percentages of Salix in all sedDNA spectra, are observable. Our results from the tundra to single-tree tundra transition zone show that sedDNA analyses perform better than pollen in recording site-specific richness (i.e., presence/absence of taxa in the vicinity of the lake) and perform as well as pollen in tracing vegetation composition. © 2017 John Wiley & Sons Ltd.
DNA/RNA hybrid substrates modulate the catalytic activity of purified AID.

PubMed

Abdouni, Hala S; King, Justin J; Ghorbani, Atefeh; Fifield, Heather; Berghuis, Lesley; Larijani, Mani

2018-01-01

Activation-induced cytidine deaminase (AID) converts cytidine to uridine at Immunoglobulin (Ig) loci, initiating somatic hypermutation and class switching of antibodies. In vitro, AID acts on single stranded DNA (ssDNA), but neither double-stranded DNA (dsDNA) oligonucleotides nor RNA, and it is believed that transcription is the in vivo generator of ssDNA targeted by AID. It is also known that the Ig loci, particularly the switch (S) regions targeted by AID are rich in transcription-generated DNA/RNA hybrids. Here, we examined the binding and catalytic behavior of purified AID on DNA/RNA hybrid substrates bearing either random sequences or GC-rich sequences simulating Ig S regions. If substrates were made up of a random sequence, AID preferred substrates composed entirely of DNA over DNA/RNA hybrids. In contrast, if substrates were composed of S region sequences, AID preferred to mutate DNA/RNA hybrids over substrates composed entirely of DNA. Accordingly, AID exhibited a significantly higher affinity for binding DNA/RNA hybrid substrates composed specifically of S region sequences, than any other substrates composed of DNA. Thus, in the absence of any other cellular processes or factors, AID itself favors binding and mutating DNA/RNA hybrids composed of S region sequences. AID:DNA/RNA complex formation and supporting mutational analyses suggest that recognition of DNA/RNA hybrids is an inherent structural property of AID. Copyright © 2017 Elsevier Ltd. All rights reserved.
Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.

PubMed

Schnitzler, P; Darai, G

1989-09-01

The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.
A comparative study of AMF diversity in annual and perennial plant species from semiarid gypsum soils.

NASA Astrophysics Data System (ADS)

Alguacil, M. M.; Torrecillas, E.; Roldán, A.; Díaz, G.; Torres, P.

2012-04-01

The arbuscular mycorrhizal fungi (AMF) communities composition regulate plant interactions and determine the structure of plant communities. In this study we analysed the diversity of AMF in the roots of two perennial gypsophyte plant species, Herniaria fruticosa and Senecio auricula, and an annual herbaceous species, Bromus rubens, growing in a gypsum soil from a semiarid area. The objective was to determine whether perennial and annual host plants support different AMF communities in their roots and whether there are AMF species that might be indicators of specific functional plant roles in these ecosystems. The roots were analysed by nested PCR, cloning, sequencing of the ribosomal DNA small subunit region and phylogenetic analysis. Twenty AMF sequence types, belonging to the Glomus group A, Glomus group B, Diversisporaceae, Acaulosporaceae, Archaeosporaceae and Paraglomeraceae, were identified. Both gypsophyte perennial species had differing compositions of the AMF community and higher diversity when compared with the annual species, showing preferential selection by specific AMF sequences types. B. rubens did not show host specificity, sharing the full composition of its AMF community with both perennial plant species. Seasonal variations in the competitiveness of AM fungi could explain the observed differences in AMF community composition, but this is still a working hypothesis that requires the analysis of further data obtained from a higher number of both annual and perennial plant species in order to be fully tested.
Long-range correlations and charge transport properties of DNA sequences

NASA Astrophysics Data System (ADS)

Liu, Xiao-liang; Ren, Yi; Xie, Qiong-tao; Deng, Chao-sheng; Xu, Hui

2010-04-01

By using Hurst's analysis and transfer approach, the rescaled range functions and Hurst exponents of human chromosome 22 and enterobacteria phage lambda DNA sequences are investigated and the transmission coefficients, Landauer resistances and Lyapunov coefficients of finite segments based on above genomic DNA sequences are calculated. In a comparison with quasiperiodic and random artificial DNA sequences, we find that λ-DNA exhibits anticorrelation behavior characterized by a Hurst exponent 0.5
The prediction of human exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames

DOE Office of Scientific and Technical Information (OSTI.GOV)

Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.

1994-12-31

Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Evolution of helotialean fungi (Leotiomycetes, Pezizomycotina): a nuclear rDNA phylogeny.

PubMed

Wang, Zheng; Binder, Manfred; Schoch, Conrad L; Johnston, Peter R; Spatafora, Joseph W; Hibbett, David S

2006-11-01

The highly divergent characters of morphology, ecology, and biology in the Helotiales make it one of the most problematic groups in traditional classification and molecular phylogeny. Sequences of three rDNA regions, SSU, LSU, and 5.8S rDNA, were generated for 50 helotialean fungi, representing 11 out of 13 families in the current classification. Data sets with different compositions were assembled, and parsimony and Bayesian analyses were performed. The phylogenetic distribution of lifestyle and ecological factors was assessed. Plant endophytism is distributed across multiple clades in the Leotiomycetes. Our results suggest that (1) the inclusion of LSU rDNA and a wider taxon sampling greatly improves resolution of the Helotiales phylogeny, however, the usefulness of rDNA in resolving the deep relationships within the Leotiomycetes is limited; (2) a new class Geoglossomycetes, including Geoglossum, Trichoglossum, and Sarcoleotia, is the basal lineage of the Leotiomyceta; (3) the Leotiomycetes, including the Helotiales, Erysiphales, Cyttariales, Rhytismatales, and Myxotrichaceae, is monophyletic; and (4) nine clades can be recognized within the Helotiales.
Duplication of the genome in normal and cancer cell cycles.

PubMed

Bandura, Jennifer L; Calvi, Brian R

2002-01-01

It is critical to discover the mechanisms of normal cell cycle regulation if we are to fully understand what goes awry in cancer cells. The normal eukaryotic cell tightly regulates the activity of origins of DNA replication so that the genome is duplicated exactly once per cell cycle. Over the last ten years much has been learned concerning the cell cycle regulation of origin activity. It is now clear that the proteins and cell cycle mechanisms that control origin activity are largely conserved from yeast to humans. Despite this conservation, the composition of origins of DNA replication in higher eukaryotes remains ill defined. A DNA consensus for predicting origins has yet to emerge, and it is of some debate whether primary DNA sequence determines where replication initiates. In this review we outline what is known about origin structure and the mechanism of once per cell cycle DNA replication with an emphasis on recent advances in mammalian cells. We discuss the possible relevance of these regulatory pathways for cancer biology and therapy.
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].

PubMed

Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y

2017-08-01

To analyze and detect the whole genome sequence of human mitochondrial DNA （mtDNA） by Ion Torrent PGM™ platform and to study the differences of mtDNA sequence in different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costicartilage, nail, skeletal muscle and oral epithelium. Amplification of whole genome sequence of mtDNA was performed by 4 pairs of primer. Libraries were constructed with Ion Shear™ Plus Reagents kit and Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions on HVⅠ region. The whole genome sequence of mtDNA from all samples were amplified successfully. Six unrelated individuals belonged to 6 different haplotypes. Different tissues in one individual had heteroplasmy difference. The heteroplasmy positions and the mutation positions on HVⅠ region were verified by Sanger sequencing. After a consistency check by the Kappa method, it was found that the results of mtDNA sequence had a high consistency in different tissues. The testing method used in present study for sequencing the whole genome sequence of human mtDNA can detect the heteroplasmy difference in different tissues, which have good consistency. The results provide guidance for the further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine

Adult monozygotic twins discordant for intra-uterine growth have indistinguishable genome-wide DNA methylation profiles.

PubMed

Souren, Nicole Y P; Lutsik, Pavlo; Gasparoni, Gilles; Tierling, Sascha; Gries, Jasmin; Riemenschneider, Matthias; Fryns, Jean-Pierre; Derom, Catherine; Zeegers, Maurice P; Walter, Jörn

2013-05-26

Low birth weight is associated with an increased adult metabolic disease risk. It is widely discussed that poor intra-uterine conditions could induce long-lasting epigenetic modifications, leading to systemic changes in regulation of metabolic genes. To address this, we acquire genome-wide DNA methylation profiles from saliva DNA in a unique cohort of 17 monozygotic monochorionic female twins very discordant for birth weight. We examine if adverse prenatal growth conditions experienced by the smaller co-twins lead to long-lasting DNA methylation changes. Overall, co-twins show very similar genome-wide DNA methylation profiles. Since observed differences are almost exclusively caused by variable cellular composition, an original marker-based adjustment strategy was developed to eliminate such variation at affected CpGs. Among adjusted and unchanged CpGs 3,153 are differentially methylated between the heavy and light co-twins at nominal significance, of which 45 show sensible absolute mean β-value differences. Deep bisulfite sequencing of eight such loci reveals that differences remain in the range of technical variation, arguing against a reproducible biological effect. Analysis of methylation in repetitive elements using methylation-dependent primer extension assays also indicates no significant intra-pair differences. Severe intra-uterine growth differences observed within these monozygotic twins are not associated with long-lasting DNA methylation differences in cells composing saliva, detectable with up-to-date technologies. Additionally, our results indicate that uneven cell type composition can lead to spurious results and should be addressed in epigenomic studies.
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach

PubMed Central

Watson, Mick; Minot, Samuel S.; Rivera, Maria C.; Franklin, Rima B.

2017-01-01

Abstract Background: Environmental metagenomic analysis is typically accomplished by assigning taxonomy and/or function from whole genome sequencing or 16S amplicon sequences. Both of these approaches are limited, however, by read length, among other technical and biological factors. A nanopore-based sequencing platform, MinION™, produces reads that are ≥1 × 104 bp in length, potentially providing for more precise assignment, thereby alleviating some of the limitations inherent in determining metagenome composition from short reads. We tested the ability of sequence data produced by MinION (R7.3 flow cells) to correctly assign taxonomy in single bacterial species runs and in three types of low-complexity synthetic communities: a mixture of DNA using equal mass from four species, a community with one relatively rare (1%) and three abundant (33% each) components, and a mixture of genomic DNA from 20 bacterial strains of staggered representation. Taxonomic composition of the low-complexity communities was assessed by analyzing the MinION sequence data with three different bioinformatic approaches: Kraken, MG-RAST, and One Codex. Results: Long read sequences generated from libraries prepared from single strains using the version 5 kit and chemistry, run on the original MinION device, yielded as few as 224 to as many as 3497 bidirectional high-quality (2D) reads with an average overall study length of 6000 bp. For the single-strain analyses, assignment of reads to the correct genus by different methods ranged from 53.1% to 99.5%, assignment to the correct species ranged from 23.9% to 99.5%, and the majority of misassigned reads were to closely related organisms. A synthetic metagenome sequenced with the same setup yielded 714 high quality 2D reads of approximately 5500 bp that were up to 98% correctly assigned to the species level. Synthetic metagenome MinION libraries generated using version 6 kit and chemistry yielded from 899 to 3497 2D reads with lengths averaging 5700 bp with up to 98% assignment accuracy at the species level. The observed community proportions for “equal” and “rare” synthetic libraries were close to the known proportions, deviating from 0.1% to 10% across all tests. For a 20-species mock community with staggered contributions, a sequencing run detected all but 3 species (each included at <0.05% of DNA in the total mixture), 91% of reads were assigned to the correct species, 93% of reads were assigned to the correct genus, and >99% of reads were assigned to the correct family. Conclusions: At the current level of output and sequence quality (just under 4 × 103 2D reads for a synthetic metagenome), MinION sequencing followed by Kraken or One Codex analysis has the potential to provide rapid and accurate metagenomic analysis where the consortium is comprised of a limited number of taxa. Important considerations noted in this study included: high sensitivity of the MinION platform to the quality of input DNA, high variability of sequencing results across libraries and flow cells, and relatively small numbers of 2D reads per analysis limit. Together, these limited detection of very rare components of the microbial consortia, and would likely limit the utility of MinION for the sequencing of high-complexity metagenomic communities where thousands of taxa are expected. Furthermore, the limitations of the currently available data analysis tools suggest there is considerable room for improvement in the analytical approaches for the characterization of microbial communities using long reads. Nevertheless, the fact that the accurate taxonomic assignment of high-quality reads generated by MinION is approaching 99.5% and, in most cases, the inferred community structure mirrors the known proportions of a synthetic mixture warrants further exploration of practical application to environmental metagenomics as the platform continues to develop and improve. With further improvement in sequence throughput and error rate reduction, this platform shows great promise for precise real-time analysis of the composition and structure of more complex microbial communities. PMID:28327976
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.

PubMed

Brown, Bonnie L; Watson, Mick; Minot, Samuel S; Rivera, Maria C; Franklin, Rima B

2017-03-01

Environmental metagenomic analysis is typically accomplished by assigning taxonomy and/or function from whole genome sequencing or 16S amplicon sequences. Both of these approaches are limited, however, by read length, among other technical and biological factors. A nanopore-based sequencing platform, MinION™, produces reads that are ≥1 × 104 bp in length, potentially providing for more precise assignment, thereby alleviating some of the limitations inherent in determining metagenome composition from short reads. We tested the ability of sequence data produced by MinION (R7.3 flow cells) to correctly assign taxonomy in single bacterial species runs and in three types of low-complexity synthetic communities: a mixture of DNA using equal mass from four species, a community with one relatively rare (1%) and three abundant (33% each) components, and a mixture of genomic DNA from 20 bacterial strains of staggered representation. Taxonomic composition of the low-complexity communities was assessed by analyzing the MinION sequence data with three different bioinformatic approaches: Kraken, MG-RAST, and One Codex. Results: Long read sequences generated from libraries prepared from single strains using the version 5 kit and chemistry, run on the original MinION device, yielded as few as 224 to as many as 3497 bidirectional high-quality (2D) reads with an average overall study length of 6000 bp. For the single-strain analyses, assignment of reads to the correct genus by different methods ranged from 53.1% to 99.5%, assignment to the correct species ranged from 23.9% to 99.5%, and the majority of misassigned reads were to closely related organisms. A synthetic metagenome sequenced with the same setup yielded 714 high quality 2D reads of approximately 5500 bp that were up to 98% correctly assigned to the species level. Synthetic metagenome MinION libraries generated using version 6 kit and chemistry yielded from 899 to 3497 2D reads with lengths averaging 5700 bp with up to 98% assignment accuracy at the species level. The observed community proportions for “equal” and “rare” synthetic libraries were close to the known proportions, deviating from 0.1% to 10% across all tests. For a 20-species mock community with staggered contributions, a sequencing run detected all but 3 species (each included at <0.05% of DNA in the total mixture), 91% of reads were assigned to the correct species, 93% of reads were assigned to the correct genus, and >99% of reads were assigned to the correct family. Conclusions: At the current level of output and sequence quality (just under 4 × 103 2D reads for a synthetic metagenome), MinION sequencing followed by Kraken or One Codex analysis has the potential to provide rapid and accurate metagenomic analysis where the consortium is comprised of a limited number of taxa. Important considerations noted in this study included: high sensitivity of the MinION platform to the quality of input DNA, high variability of sequencing results across libraries and flow cells, and relatively small numbers of 2D reads per analysis limit. Together, these limited detection of very rare components of the microbial consortia, and would likely limit the utility of MinION for the sequencing of high-complexity metagenomic communities where thousands of taxa are expected. Furthermore, the limitations of the currently available data analysis tools suggest there is considerable room for improvement in the analytical approaches for the characterization of microbial communities using long reads. Nevertheless, the fact that the accurate taxonomic assignment of high-quality reads generated by MinION is approaching 99.5% and, in most cases, the inferred community structure mirrors the known proportions of a synthetic mixture warrants further exploration of practical application to environmental metagenomics as the platform continues to develop and improve. With further improvement in sequence throughput and error rate reduction, this platform shows great promise for precise real-time analysis of the composition and structure of more complex microbial communities. © The Author 2017. Published by Oxford University Press.
Composition of the bacterial community in the gut of the pine engraver, Ips pini (Say) (Coloptera) colonizing red pine

Treesearch

Italo Jr. Delalibera; Archana Vasanthakumar; Benjamin J. Burwitz; Patrick D. Schloss; Kier D. Klepzig; Jo Handelsman; Kenneth F. Raffa

2007-01-01

The gut bacterial community of a bark beetle, the pine engraver Ips pini (Say), was characterized using culture-dependent and culture-independent methods. Bacteria from individual guts of larvae, pupae and adults were cultured and DNA was extracted from samples of pooled larval guts. Analysis of 16S rRNA gene sequences amplified directly from the gut...
Sequence periodicity in nucleosomal DNA and intrinsic curvature

PubMed Central

2010-01-01

Background Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy the more rigid the DNA helix, thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understand their sequence-dependent structures. Results Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequence from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. Conclusions The higher order structures associated with nucleosomes of C.elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results points to the possibility of context dependent curvature of varying degrees to be associated with nucleosomal DNA. PMID:20487515
A survey of the sequence-specific interaction of damaging agents with DNA: emphasis on antitumor agents.

PubMed

Murray, V

1999-01-01

This article reviews the literature concerning the sequence specificity of DNA-damaging agents. DNA-damaging agents are widely used in cancer chemotherapy. It is important to understand fully the determinants of DNA sequence specificity so that more effective DNA-damaging agents can be developed as antitumor drugs. There are five main methods of DNA sequence specificity analysis: cleavage of end-labeled fragments, linear amplification with Taq DNA polymerase, ligation-mediated polymerase chain reaction (PCR), single-strand ligation PCR, and footprinting. The DNA sequence specificity in purified DNA and in intact mammalian cells is reviewed for several classes of DNA-damaging agent. These include agents that form covalent adducts with DNA, free radical generators, topoisomerase inhibitors, intercalators and minor groove binders, enzymes, and electromagnetic radiation. The main sites of adduct formation are at the N-7 of guanine in the major groove of DNA and the N-3 of adenine in the minor groove, whereas free radical generators abstract hydrogen from the deoxyribose sugar and topoisomerase inhibitors cause enzyme-DNA cross-links to form. Several issues involved in the determination of the DNA sequence specificity are discussed. The future directions of the field, with respect to cancer chemotherapy, are also examined.
Using pseudoalignment and base quality to accurately quantify microbial community composition

PubMed Central

Novembre, John

2018-01-01

Pooled DNA from multiple unknown organisms arises in a variety of contexts, for example microbial samples from ecological or human health research. Determining the composition of pooled samples can be difficult, especially at the scale of modern sequencing data and reference databases. Here we propose a novel method for taxonomic profiling in pooled DNA that combines the speed and low-memory requirements of k-mer based pseudoalignment with a likelihood framework that uses base quality information to better resolve multiply mapped reads. We apply the method to the problem of classifying 16S rRNA reads using a reference database of known organisms, a common challenge in microbiome research. Using simulations, we show the method is accurate across a variety of read lengths, with different length reference sequences, at different sample depths, and when samples contain reads originating from organisms absent from the reference. We also assess performance in real 16S data, where we reanalyze previous genetic association data to show our method discovers a larger number of quantitative trait associations than other widely used methods. We implement our method in the software Karp, for k-mer based analysis of read pools, to provide a novel combination of speed and accuracy that is uniquely suited for enhancing discoveries in microbial studies. PMID:29659582
iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.

PubMed

Chen, Wei; Feng, Peng-Mian; Lin, Hao; Chou, Kuo-Chen

2014-01-01

In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called "iSS-PseDNC" was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called "pseudo dinucleotide composition" (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor site and splice acceptor site were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing.
Phylogeography above the species level for perennial species in a composite genus

PubMed Central

Tremetsberger, Karin; Ortiz, María Ángeles; Terrab, Anass; Balao, Francisco; Casimiro-Soriguer, Ramón; Talavera, María; Talavera, Salvador

2016-01-01

In phylogeography, DNA sequence and fingerprint data at the population level are used to infer evolutionary histories of species. Phylogeography above the species level is concerned with the genealogical aspects of divergent lineages. Here, we present a phylogeographic study to examine the evolutionary history of a western Mediterranean composite, focusing on the perennial species of Helminthotheca (Asteraceae, Cichorieae). We used molecular markers (amplified fragment length polymorphism (AFLP), internal transcribed spacer and plastid DNA sequences) to infer relationships among populations throughout the distributional range of the group. Interpretation is aided by biogeographic and molecular clock analyses. Four coherent entities are revealed by Bayesian mixture clustering of AFLP data, which correspond to taxa previously recognized at the rank of subspecies. The origin of the group was in western North Africa, from where it expanded across the Strait of Gibraltar to the Iberian Peninsula and across the Strait of Sicily to Sicily. Pleistocene lineage divergence is inferred within western North Africa as well as within the western Iberian region. The existence of the four entities as discrete evolutionary lineages suggests that they should be elevated to the rank of species, yielding H. aculeata, H. comosa, H. maroccana and H. spinosa, whereby the latter two necessitate new combinations. PMID:26644340
Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing

PubMed Central

Chandran, Anandhakumar; Syed, Junetha; Taylor, Rhys D.; Kashiwazaki, Gengo; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

2016-01-01

Chemically engineered small molecules targeting specific genomic sequences play an important role in drug development research. Pyrrole-imidazole polyamides (PIPs) are a group of molecules that can bind to the DNA minor-groove and can be engineered to target specific sequences. Their biological effects rely primarily on their selective DNA binding. However, the binding mechanism of PIPs at the chromatinized genome level is poorly understood. Herein, we report a method using high-throughput sequencing to identify the DNA-alkylating sites of PIP-indole-seco-CBI conjugates. High-throughput sequencing analysis of conjugate 2 showed highly similar DNA-alkylating sites on synthetic oligos (histone-free DNA) and on human genomes (chromatinized DNA context). To our knowledge, this is the first report identifying alkylation sites across genomic DNA by alkylating PIP conjugates using high-throughput sequencing. PMID:27098039
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

PubMed

Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence.

PubMed

Shin, Dong-Ho; Webb, Barbara M; Nakao, Miki; Smith, Sylvia L

2009-07-01

Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and -d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (
Violation of an Evolutionarily Conserved Immunoglobulin Diversity Gene Sequence Preference Promotes Production of dsDNA-Specific IgG Antibodies

PubMed Central

Silva-Sanchez, Aaron; Liu, Cun Ren; Vale, Andre M.; Khass, Mohamed; Kapoor, Pratibha; Elgavish, Ada; Ivanov, Ivaylo I.; Ippolito, Gregory C.; Schelonka, Robert L.; Schoeb, Trenton R.; Burrows, Peter D.; Schroeder, Harry W.

2015-01-01

Variability in the developing antibody repertoire is focused on the third complementarity determining region of the H chain (CDR-H3), which lies at the center of the antigen binding site where it often plays a decisive role in antigen binding. The power of VDJ recombination and N nucleotide addition has led to the common conception that the sequence of CDR-H3 is unrestricted in its variability and random in its composition. Under this view, the immune response is solely controlled by somatic positive and negative clonal selection mechanisms that act on individual B cells to promote production of protective antibodies and prevent the production of self-reactive antibodies. This concept of a repertoire of random antigen binding sites is inconsistent with the observation that diversity (DH) gene segment sequence content by reading frame (RF) is evolutionarily conserved, creating biases in the prevalence and distribution of individual amino acids in CDR-H3. For example, arginine, which is often found in the CDR-H3 of dsDNA binding autoantibodies, is under-represented in the commonly used DH RFs rearranged by deletion, but is a frequent component of rarely used inverted RF1 (iRF1), which is rearranged by inversion. To determine the effect of altering this germline bias in DH gene segment sequence on autoantibody production, we generated mice that by genetic manipulation are forced to utilize an iRF1 sequence encoding two arginines. Over a one year period we collected serial serum samples from these unimmunized, specific pathogen-free mice and found that more than one-fifth of them contained elevated levels of dsDNA-binding IgG, but not IgM; whereas mice with a wild type DH sequence did not. Thus, germline bias against the use of arginine enriched DH sequence helps to reduce the likelihood of producing self-reactive antibodies. PMID:25706374
Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence

PubMed Central

Shin, Dong-Ho; Webb, Barbara M.; Nakao, Miki; Smith, Sylvia L.

2009-01-01

Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and –d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (≤) amino acid identities with each other, 35.4 ~ 39.6% and 62.8 ~ 65.9% with factor I of mammals and banded houndshark (Triakis scyllium), respectively. The modular structure of the GcIf is similar to that of mammals with one notable exception, the presence of a novel shark-specific sequence between the leader peptide (LP) and the factor I membrane attack complex (FIMAC) domain. The cDNA sequences differ only in the size and composition of the shark-specific region (SSR). Sequence analysis of each SSR has identified within the region two novel short sequences (SS1 and SS2) and three repeat sequences (RS1, 2 and 3). Genomic analysis has revealed the existence of three introns between the leader peptide and the FIMAC domain, tentatively designated intron 1, intron 2, and intron 3 which span 4067, 2293 and 2082 bp, respectively. Southern blot analysis suggests the presence of a single gene copy for each cDNA type. Phylogenetic analysis suggests that complement factor I of cartilaginous fish diverged prior to the emergence of mammals. All four GcIf cDNA species are expressed in four different tissues and the liver is the main tissue in which expression level of all four is high. This suggests that the expression of GcIf isotypes is tissue-dependent. PMID:19423168
Identification of antigenic regions on VP2 of African horsesickness virus serotype 3 by using phage-displayed epitope libraries.

PubMed

Bentley, L; Fehrsen, J; Jordaan, F; Huismans, H; du Plessis, D H

2000-04-01

VP2 is an outer capsid protein of African horsesickness virus (AHSV) and is recognized by serotype-discriminatory neutralizing antibodies. With the objective of locating its antigenic regions, a filamentous phage library was constructed that displayed peptides derived from the fragmentation of a cDNA copy of the gene encoding VP2. Peptides ranging in size from approximately 30 to 100 amino acids were fused with pIII, the attachment protein of the display vector, fUSE2. To ensure maximum diversity, the final library consisted of three sub-libraries. The first utilized enzymatically fragmented DNA encoding only the VP2 gene, the second included plasmid sequences, while the third included a PCR step designed to allow different peptide-encoding sequences to recombine before ligation into the vector. The resulting composite library was subjected to immunoaffinity selection with AHSV-specific polyclonal chicken IgY, polyclonal horse immunoglobulins and a monoclonal antibody (MAb) known to neutralize AHSV. Antigenic peptides were located by sequencing the DNA of phages bound by the antibodies. Most antigenic determinants capable of being mapped by this method were located in the N-terminal half of VP2. Important binding areas were mapped with high resolution by identifying the minimum overlapping areas of the selected peptides. The MAb was also used to screen a random 17-mer epitope library. Sequences that may be part of a discontinuous neutralization epitope were identified. The amino acid sequences of the antigenic regions on VP2 of serotype 3 were compared with corresponding regions on three other serotypes, revealing regions with the potential to discriminate AHSV serotypes serologically.
Submolecular Structure and Orientation of Oligonucleotide Duplexes Tethered to Gold Electrodes Probed by Infrared Reflection Absorption Spectroscopy: Effect of the Electrode Potentials.

PubMed

Kékedy-Nagy, László; Ferapontova, Elena E; Brand, Izabella

2017-02-23

Unique electronic and ligand recognition properties of the DNA double helix provide basis for DNA applications in biomolecular electronic and biosensor devices. However, the relation between the structure of DNA at electrified interfaces and its electronic properties is still not well understood. Here, potential-driven changes in the submolecular structure of DNA double helices composed of either adenine-thymine (dAdT) 25 or cytosine-guanine (dGdC) 20 base pairs tethered to the gold electrodes are for the first time analyzed by in situ polarization modulation infrared reflection absorption spectroscopy (PM IRRAS) performed under the electrochemical control. It is shown that the conformation of the DNA duplexes tethered to gold electrodes via the C 6 alkanethiol linker strongly depends on the nucleic acid sequence composition. The tilt of purine and pyrimidine rings of the complementary base pairs (dAdT and dGdC) depends on the potential applied to the electrode. By contrast, neither the conformation nor orientation of the ionic in character phosphate-sugar backbone is affected by the electrode potentials. At potentials more positive than the potential of zero charge (pzc), a gradual tilting of the double helix is observed. In this tilted orientation, the planes of the complementary purine and pyrimidine rings lie ideally parallel to each other. These potentials do not affect the integral stability of the DNA double helix at the charged interface. At potentials more negative than the pzc, DNA helices adopt a vertical to the gold surface orientation. Tilt of the purine and pyrimidine rings depends on the composition of the double helix. In monolayers composed of (dAdT) 25 molecules the rings of the complementary base pairs lie parallel to each other. By contrast, the tilt of purine and pyrimidine rings in (dGdC) 20 helices depends on the potential applied to the electrode. Such potential-induced mobility of the complementary base pairs can destabilize the helix structure at a submolecular level. These pioneer results on the potential-driven changes in the submolecular structure of double stranded DNA adsorbed on conductive supports contribute to further understanding of the potential-driven sequence-specific electronic properties of surface-tethered oligonucleotides.
The cyc1-11 mutation in yeast reverts by recombination with a nonallelic gene: composite genes determining the iso-cytochromes c.

PubMed Central

Ernst, J F; Stewart, J W; Sherman, F

1981-01-01

DNA sequence analysis of a cloned fragment directly established that the cyc1-11 mutation of iso-1-cytochrome c in the yeast Saccharomyces cerevisiae is a two-base-pair substitution that changes the CCA proline codon at amino acid position 76 to a UAA nonsense codon. Analysis of 11 revertant proteins and one cloned revertant gene showed that reversion of the cyc1-11 mutation can occur in three ways: a single base-pair substitution, which produces a serine replacement at position 76; recombination with the nonallelic CYC7 gene of iso-2-cytochrome c, which causes replacement of a segment in the cyc1-11 gene by the corresponding segment of the CYC7 gene; and either a two-base-pair substitution or recombination with the CYC7 gene, which causes the formation of the normal iso-1-cytochrome c sequence. These results demonstrate the occurrence of low frequencies of recombination between nonallelic genes having extensive but not complete homology. The formation of composite genes that share sequences from nonallelic genes may be an evolutionary mechanism for producing protein diversities and for maintaining identical sequences at different loci. Images PMID:6273865
Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors.

PubMed

Adalsteinsson, Viktor A; Ha, Gavin; Freeman, Samuel S; Choudhury, Atish D; Stover, Daniel G; Parsons, Heather A; Gydush, Gregory; Reed, Sarah C; Rotem, Denisse; Rhoades, Justin; Loginov, Denis; Livitz, Dimitri; Rosebrock, Daniel; Leshchiner, Ignaty; Kim, Jaegil; Stewart, Chip; Rosenberg, Mara; Francis, Joshua M; Zhang, Cheng-Zhong; Cohen, Ofir; Oh, Coyin; Ding, Huiming; Polak, Paz; Lloyd, Max; Mahmud, Sairah; Helvie, Karla; Merrill, Margaret S; Santiago, Rebecca A; O'Connor, Edward P; Jeong, Seong H; Leeson, Rachel; Barry, Rachel M; Kramkowski, Joseph F; Zhang, Zhenwei; Polacek, Laura; Lohr, Jens G; Schleicher, Molly; Lipscomb, Emily; Saltzman, Andrea; Oliver, Nelly M; Marini, Lori; Waks, Adrienne G; Harshman, Lauren C; Tolaney, Sara M; Van Allen, Eliezer M; Winer, Eric P; Lin, Nancy U; Nakabayashi, Mari; Taplin, Mary-Ellen; Johannessen, Cory M; Garraway, Levi A; Golub, Todd R; Boehm, Jesse S; Wagle, Nikhil; Getz, Gad; Love, J Christopher; Meyerson, Matthew

2017-11-06

Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.
Centromere and telomere sequence alterations reflect the rapid genome evolution within the carnivorous plant genus Genlisea.

PubMed

Tran, Trung D; Cao, Hieu X; Jovtchev, Gabriele; Neumann, Pavel; Novák, Petr; Fojtová, Miloslava; Vu, Giang T H; Macas, Jiří; Fajkus, Jiří; Schubert, Ingo; Fuchs, Joerg

2015-12-01

Linear chromosomes of eukaryotic organisms invariably possess centromeres and telomeres to ensure proper chromosome segregation during nuclear divisions and to protect the chromosome ends from deterioration and fusion, respectively. While centromeric sequences may differ between species, with arrays of tandemly repeated sequences and retrotransposons being the most abundant sequence types in plant centromeres, telomeric sequences are usually highly conserved among plants and other organisms. The genome size of the carnivorous genus Genlisea (Lentibulariaceae) is highly variable. Here we study evolutionary sequence plasticity of these chromosomal domains at an intrageneric level. We show that Genlisea nigrocaulis (1C = 86 Mbp; 2n = 40) and G. hispidula (1C = 1550 Mbp; 2n = 40) differ as to their DNA composition at centromeres and telomeres. G. nigrocaulis and its close relative G. pygmaea revealed mainly 161 bp tandem repeats, while G. hispidula and its close relative G. subglabra displayed a combination of four retroelements at centromeric positions. G. nigrocaulis and G. pygmaea chromosome ends are characterized by the Arabidopsis-type telomeric repeats (TTTAGGG); G. hispidula and G. subglabra instead revealed two intermingled sequence variants (TTCAGG and TTTCAGG). These differences in centromeric and, surprisingly, also in telomeric DNA sequences, uncovered between groups with on average a > 9-fold genome size difference, emphasize the fast genome evolution within this genus. Such intrageneric evolutionary alteration of telomeric repeats with cytosine in the guanine-rich strand, not yet known for plants, might impact the epigenetic telomere chromatin modification. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
An evolution based biosensor receptor DNA sequence generation algorithm.

PubMed

Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng

2010-01-01

A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.

RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis

PubMed Central

Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

2012-01-01

RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate the DNA sequence or user can upload the sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various application for the sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides the tools for routinely performed sequence analysis by the biological scientists such as DNA replication, reverse compliment generation, transcription, translation, sequence specific information as total number of nucleotide bases, ATGC base contents along with their respective percentages and sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides user friendly environment for sequence analysis. It is freely available. Availability http://www.cemb.edu.pk/sw.html Abbreviations RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language. PMID:23055611
Structural and Thermodynamic Signatures of DNA Recognition by Mycobacterium tuberculosis DnaA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tsodikov, Oleg V.; Biswas, Tapan

An essential protein, DnaA, binds to 9-bp DNA sites within the origin of replication oriC. These binding events are prerequisite to forming an enigmatic nucleoprotein scaffold that initiates replication. The number, sequences, positions, and orientations of these short DNA sites, or DnaA boxes, within the oriCs of different bacteria vary considerably. To investigate features of DnaA boxes that are important for binding Mycobacterium tuberculosis DnaA (MtDnaA), we have determined the crystal structures of the DNA binding domain (DBD) of MtDnaA bound to a cognate MtDnaA-box (at 2.0 {angstrom} resolution) and to a consensus Escherichia coli DnaA-box (at 2.3 {angstrom}). Thesemore » structures, complemented by calorimetric equilibrium binding studies of MtDnaA DBD in a series of DnaA-box variants, reveal the main determinants of DNA recognition and establish the [T/C][T/A][G/A]TCCACA sequence as a high-affinity MtDnaA-box. Bioinformatic and calorimetric analyses indicate that DnaA-box sequences in mycobacterial oriCs generally differ from the optimal binding sequence. This sequence variation occurs commonly at the first 2 bp, making an in vivo mycobacterial DnaA-box effectively a 7-mer and not a 9-mer. We demonstrate that the decrease in the affinity of these MtDnaA-box variants for MtDnaA DBD relative to that of the highest-affinity box TTGTCCACA is less than 10-fold. The understanding of DnaA-box recognition by MtDnaA and E. coli DnaA enables one to map DnaA-box sequences in the genomes of M. tuberculosis and other eubacteria.« less
Response of Spring Diatoms to CO2 Availability in the Western North Pacific as Determined by Next-Generation Sequencing.

PubMed

Endo, Hisashi; Sugie, Koji; Yoshimura, Takeshi; Suzuki, Koji

2016-01-01

Next-generation sequencing (NGS) technologies have enabled us to determine phytoplankton community compositions at high resolution. However, few studies have adopted this approach to assess the responses of natural phytoplankton communities to environmental change. Here, we report the impact of different CO2 levels on spring diatoms in the Oyashio region of the western North Pacific as estimated by NGS of the diatom-specific rbcL gene (DNA), which encodes the large subunit of RubisCO. We also examined the abundance and composition of rbcL transcripts (cDNA) in diatoms to assess their physiological responses to changing CO2 levels. A short-term (3-day) incubation experiment was carried out on-deck using surface Oyashio waters under different pCO2 levels (180, 350, 750, and 1000 μatm) in May 2011. During the incubation, the transcript abundance of the diatom-specific rbcL gene decreased with an increase in seawater pCO2 levels. These results suggest that CO2 fixation capacity of diatoms decreased rapidly under elevated CO2 levels. In the high CO2 treatments (750 and 1000 μatm), diversity of diatom-specific rbcL gene and its transcripts decreased relative to the control treatment (350 μatm), as well as contributions of Chaetocerataceae, Thalassiosiraceae, and Fragilariaceae to the total population, but the contributions of Bacillariaceae increased. In the low CO2 treatment, contributions of Bacillariaceae also increased together with other eukaryotes. These suggest that changes in CO2 levels can alter the community composition of spring diatoms in the Oyashio region. Overall, the NGS technology provided us a deeper understanding of the response of diatoms to changes in CO2 levels in terms of their community composition, diversity, and photosynthetic physiology.
DNA barcode goes two-dimensions: DNA QR code web server.

PubMed

Liu, Chang; Shi, Linchun; Xu, Xiaolan; Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin

2012-01-01

The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, "DNA barcode" actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interests, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications.
TaxI: a software tool for DNA barcoding using distance methods

PubMed Central

Steinke, Dirk; Vences, Miguel; Salzburger, Walter; Meyer, Axel

2005-01-01

DNA barcoding is a promising approach to the diagnosis of biological diversity in which DNA sequences serve as the primary key for information retrieval. Most existing software for evolutionary analysis of DNA sequences was designed for phylogenetic analyses and, hence, those algorithms do not offer appropriate solutions for the rapid, but precise analyses needed for DNA barcoding, and are also unable to process the often large comparative datasets. We developed a flexible software tool for DNA taxonomy, named TaxI. This program calculates sequence divergences between a query sequence (taxon to be barcoded) and each sequence of a dataset of reference sequences defined by the user. Because the analysis is based on separate pairwise alignments this software is also able to work with sequences characterized by multiple insertions and deletions that are difficult to align in large sequence sets (i.e. thousands of sequences) by multiple alignment algorithms because of computational restrictions. Here, we demonstrate the utility of this approach with two datasets of fish larvae and juveniles from Lake Constance and juvenile land snails under different models of sequence evolution. Sets of ribosomal 16S rRNA sequences, characterized by multiple indels, performed as good as or better than cox1 sequence sets in assigning sequences to species, demonstrating the suitability of rRNA genes for DNA barcoding. PMID:16214755
Dna Sequencing

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1995-04-25

A method for sequencing a strand of DNA, including the steps off: providing the strand of DNA; annealing the strand with a primer able to hybridize to the strand to give an annealed mixture; incubating the mixture with four deoxyribonucleoside triphosphates, a DNA polymerase, and at least three deoxyribonucleoside triphosphates in different amounts, under conditions in favoring primer extension to form nucleic acid fragments complementory to the DNA to be sequenced; labelling the nucleic and fragments; separating them and determining the position of the deoxyribonucleoside triphosphates by differences in the intensity of the labels, thereby to determine the DNA sequence.
High-fidelity target sequencing of individual molecules identified using barcode sequences: de novo detection and absolute quantitation of mutations in plasma cell-free DNA from cancer patients.

PubMed

Kukita, Yoji; Matoba, Ryo; Uchida, Junji; Hamakawa, Takuya; Doki, Yuichiro; Imamura, Fumio; Kato, Kikuya

2015-08-01

Circulating tumour DNA (ctDNA) is an emerging field of cancer research. However, current ctDNA analysis is usually restricted to one or a few mutation sites due to technical limitations. In the case of massively parallel DNA sequencers, the number of false positives caused by a high read error rate is a major problem. In addition, the final sequence reads do not represent the original DNA population due to the global amplification step during the template preparation. We established a high-fidelity target sequencing system of individual molecules identified in plasma cell-free DNA using barcode sequences; this system consists of the following two steps. (i) A novel target sequencing method that adds barcode sequences by adaptor ligation. This method uses linear amplification to eliminate the errors introduced during the early cycles of polymerase chain reaction. (ii) The monitoring and removal of erroneous barcode tags. This process involves the identification of individual molecules that have been sequenced and for which the number of mutations have been absolute quantitated. Using plasma cell-free DNA from patients with gastric or lung cancer, we demonstrated that the system achieved near complete elimination of false positives and enabled de novo detection and absolute quantitation of mutations in plasma cell-free DNA. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Sequence-Dependent Diastereospecific and Diastereodivergent Crosslinking of DNA by Decarbamoylmitomycin C.

PubMed

Aguilar, William; Paz, Manuel M; Vargas, Anayatzinc; Clement, Cristina C; Cheng, Shu-Yuan; Champeil, Elise

2018-04-20

Mitomycin C (MC), a potent antitumor drug, and decarbamoylmitomycin C (DMC), a derivative lacking the carbamoyl group, form highly cytotoxic DNA interstrand crosslinks. The major interstrand crosslink formed by DMC is the C1'' epimer of the major crosslink formed by MC. The molecular basis for the stereochemical configuration exhibited by DMC was investigated using biomimetic synthesis. The formation of DNA-DNA crosslinks by DMC is diastereospecific and diastereodivergent: Only the 1''S-diastereomer of the initially formed monoadduct can form crosslinks at GpC sequences, and only the 1''R-diastereomer of the monoadduct can form crosslinks at CpG sequences. We also show that CpG and GpC sequences react with divergent diastereoselectivity in the first alkylation step: 1"S stereochemistry is favored at GpC sequences and 1''R stereochemistry is favored at CpG sequences. Therefore, the first alkylation step results, at each sequence, in the selective formation of the diastereomer able to generate an interstrand DNA-DNA crosslink after the "second arm" alkylation. Examination of the known DNA adduct pattern obtained after treatment of cancer cell cultures with DMC indicates that the GpC sequence is the major target for the formation of DNA-DNA crosslinks in vivo by this drug. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sequencing historical specimens: successful preparation of small specimens with low amounts of degraded DNA.

PubMed

Sproul, John S; Maddison, David R

2017-11-01

Despite advances that allow DNA sequencing of old museum specimens, sequencing small-bodied, historical specimens can be challenging and unreliable as many contain only small amounts of fragmented DNA. Dependable methods to sequence such specimens are especially critical if the specimens are unique. We attempt to sequence small-bodied (3-6 mm) historical specimens (including nomenclatural types) of beetles that have been housed, dried, in museums for 58-159 years, and for which few or no suitable replacement specimens exist. To better understand ideal approaches of sample preparation and produce preparation guidelines, we compared different library preparation protocols using low amounts of input DNA (1-10 ng). We also explored low-cost optimizations designed to improve library preparation efficiency and sequencing success of historical specimens with minimal DNA, such as enzymatic repair of DNA. We report successful sample preparation and sequencing for all historical specimens despite our low-input DNA approach. We provide a list of guidelines related to DNA repair, bead handling, reducing adapter dimers and library amplification. We present these guidelines to facilitate more economical use of valuable DNA and enable more consistent results in projects that aim to sequence challenging, irreplaceable historical specimens. © 2017 John Wiley & Sons Ltd.
i-rDNA: alignment-free algorithm for rapid in silico detection of ribosomal gene fragments from metagenomic sequence data sets.

PubMed

Mohammed, Monzoorul Haque; Ghosh, Tarini Shankar; Chadaram, Sudha; Mande, Sharmila S

2011-11-30

Obtaining accurate estimates of microbial diversity using rDNA profiling is the first step in most metagenomics projects. Consequently, most metagenomic projects spend considerable amounts of time, money and manpower for experimentally cloning, amplifying and sequencing the rDNA content in a metagenomic sample. In the second step, the entire genomic content of the metagenome is extracted, sequenced and analyzed. Since DNA sequences obtained in this second step also contain rDNA fragments, rapid in silico identification of these rDNA fragments would drastically reduce the cost, time and effort of current metagenomic projects by entirely bypassing the experimental steps of primer based rDNA amplification, cloning and sequencing. In this study, we present an algorithm called i-rDNA that can facilitate the rapid detection of 16S rDNA fragments from amongst millions of sequences in metagenomic data sets with high detection sensitivity. Performance evaluation with data sets/database variants simulating typical metagenomic scenarios indicates the significantly high detection sensitivity of i-rDNA. Moreover, i-rDNA can process a million sequences in less than an hour on a simple desktop with modest hardware specifications. In addition to the speed of execution, high sensitivity and low false positive rate, the utility of the algorithmic approach discussed in this paper is immense given that it would help in bypassing the entire experimental step of primer-based rDNA amplification, cloning and sequencing. Application of this algorithmic approach would thus drastically reduce the cost, time and human efforts invested in all metagenomic projects. A web-server for the i-rDNA algorithm is available at http://metagenomics.atc.tcs.com/i-rDNA/
DNA Sequences from Formalin-Fixed Nematodes: Integrating Molecular and Morphological Approaches to Taxonomy

PubMed Central

Thomas, W. Kelley; Vida, J. T.; Frisse, Linda M.; Mundo, Manuel; Baldwin, James G.

1997-01-01

To effectively integrate DNA sequence analysis and classical nematode taxonomy, we must be able to obtain DNA sequences from formalin-fixed specimens. Microdissected sections of nematodes were removed from specimens fixed in formalin, using standard protocols and without destroying morphological features. The fixed sections provided sufficient template for multiple polymerase chain reaction-based DNA sequence analyses. PMID:19274156
A new family of satellite DNA sequences as a major component of centromeric heterochromatin in owls (Strigiformes).

PubMed

Yamada, Kazuhiko; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2004-03-01

We isolated a new family of satellite DNA sequences from HaeIII- and EcoRI-digested genomic DNA of the Blakiston's fish owl ( Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved in the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Sobottka, Marcelo, E-mail: sobottka@mtm.ufsc.br; Hart, Andrew G., E-mail: ahart@dim.uchile.cl

Highlights: {yields} We propose a simple stochastic model to construct primitive DNA sequences. {yields} The model provide an explanation for Chargaff's second parity rule in primitive DNA sequences. {yields} The model is also used to predict a novel type of strand symmetry in primitive DNA sequences. {yields} We extend the results for bacterial DNA sequences and compare distributional properties intrinsic to the model to statistical estimates from 1049 bacterial genomes. {yields} We find out statistical evidences that the novel type of strand symmetry holds for bacterial DNA sequences. -- Abstract: Chargaff's second parity rule for short oligonucleotides states that themore » frequency of any short nucleotide sequence on a strand is approximately equal to the frequency of its reverse complement on the same strand. Recent studies have shown that, with the exception of organellar DNA, this parity rule generally holds for double-stranded DNA genomes and fails to hold for single-stranded genomes. While Chargaff's first parity rule is fully explained by the Watson-Crick pairing in the DNA double helix, a definitive explanation for the second parity rule has not yet been determined. In this work, we propose a model based on a hidden Markov process for approximating the distributional structure of primitive DNA sequences. Then, we use the model to provide another possible theoretical explanation for Chargaff's second parity rule, and to predict novel distributional aspects of bacterial DNA sequences.« less
Whole-Exome Sequencing to Decipher the Genetic Heterogeneity of Hearing Loss in a Chinese Family with Deaf by Deaf Mating

PubMed Central

Qing, Jie; Yan, Denise; Zhou, Yuan; Liu, Qiong; Wu, Weijing; Xiao, Zian; Liu, Yuyuan; Liu, Jia; Du, Lilin; Xie, Dinghua; Liu, Xue Zhong

2014-01-01

Inherited deafness has been shown to have high genetic heterogeneity. For many decades, linkage analysis and candidate gene approaches have been the main tools to elucidate the genetics of hearing loss. However, this associated study design is costly, time-consuming, and unsuitable for small families. This is mainly due to the inadequate numbers of available affected individuals, locus heterogeneity, and assortative mating. Exome sequencing has now become technically feasible and a cost-effective method for detection of disease variants underlying Mendelian disorders due to the recent advances in next-generation sequencing (NGS) technologies. In the present study, we have combined both the Deafness Gene Mutation Detection Array and exome sequencing to identify deafness causative variants in a large Chinese composite family with deaf by deaf mating. The simultaneous screening of the 9 common deafness mutations using the allele-specific PCR based universal array, resulted in the identification of the 1555A>G in the mitochondrial DNA (mtDNA) 12S rRNA in affected individuals in one branch of the family. We then subjected the mutation-negative cases to exome sequencing and identified novel causative variants in the MYH14 and WFS1 genes. This report confirms the effective use of a NGS technique to detect pathogenic mutations in affected individuals who were not candidates for classical genetic studies. PMID:25289672
Links between plant and fungal communities across a deforestation chronosequence in the Amazon rainforest.

PubMed

Mueller, Rebecca C; Paula, Fabiana S; Mirza, Babur S; Rodrigues, Jorge L M; Nüsslein, Klaus; Bohannan, Brendan J M

2014-07-01

Understanding the interactions among microbial communities, plant communities and soil properties following deforestation could provide insights into the long-term effects of land-use change on ecosystem functions, and may help identify approaches that promote the recovery of degraded sites. We combined high-throughput sequencing of fungal rDNA and molecular barcoding of plant roots to estimate fungal and plant community composition in soil sampled across a chronosequence of deforestation. We found significant effects of land-use change on fungal community composition, which was more closely correlated to plant community composition than to changes in soil properties or geographic distance, providing evidence for strong links between above- and below-ground communities in tropical forests.
Gene conversion events and variable degree of homogenization of rDNA loci in cultivars of Brassica napus

PubMed Central

Sochorová, Jana; Coriton, Olivier; Kuderová, Alena; Lunerová, Jana; Chèvre, Anne-Marie; Kovařík, Aleš

2017-01-01

Background and aims Brassica napus (AACC, 2n = 38, oilseed rape) is a relatively recent allotetraploid species derived from the putative progenitor diploid species Brassica rapa (AA, 2n = 20) and Brassica oleracea (CC, 2n = 18). To determine the influence of intensive breeding conditions on the evolution of its genome, we analysed structure and copy number of rDNA in 21 cultivars of B. napus, representative of genetic diversity. Methods We used next-generation sequencing genomic approaches, Southern blot hybridization, expression analysis and fluorescence in situ hybridization (FISH). Subgenome-specific sequences derived from rDNA intergenic spacers (IGS) were used as probes for identification of loci composition on chromosomes. Key Results Most B. napus cultivars (18/21, 86 %) had more A-genome than C-genome rDNA copies. Three cultivars analysed by FISH (‘Darmor’, ‘Yudal’ and ‘Asparagus kale’) harboured the same number (12 per diploid set) of loci. In B. napus ‘Darmor’, the A-genome-specific rDNA probe hybridized to all 12 rDNA loci (eight on the A-genome and four on the C-genome) while the C-genome-specific probe showed weak signals on the C-genome loci only. Deep sequencing revealed high homogeneity of arrays suggesting that the C-genome genes were largely overwritten by the A-genome variants in B. napus ‘Darmor’. In contrast, B. napus ‘Yudal’ showed a lack of gene conversion evidenced by additive inheritance of progenitor rDNA variants and highly localized hybridization signals of subgenome-specific probes on chromosomes. Brassica napus ‘Asparagus kale’ showed an intermediate pattern to ‘Darmor’ and ‘Yudal’. At the expression level, most cultivars (95 %) exhibited stable A-genome nucleolar dominance while one cultivar (‘Norin 9’) showed co-dominance. Conclusions The B. napus cultivars differ in the degree and direction of rDNA homogenization. The prevalent direction of gene conversion (towards the A-genome) correlates with the direction of expression dominance indicating that gene activity may be needed for interlocus gene conversion. PMID:27707747
A Simulation of DNA Sequencing Utilizing 3M Post-It[R] Notes

ERIC Educational Resources Information Center

Christensen, Doug

2009-01-01

An inexpensive and equipment free approach to teaching the technical aspects of DNA sequencing. The activity described requires an instructor with a familiarity of DNA sequencing technology but provides a straight forward method of teaching the technical aspects of sequencing in the absence of expensive sequencing equipment. The final sequence…
Revising the phylogenetic position of the extinct Mascarene Parrot Mascarinus mascarin (Linnaeus 1771) (Aves: Psittaciformes: Psittacidae).

PubMed

Podsiadlowski, Lars; Gamauf, Anita; Töpfer, Till

2017-02-01

The phylogenetic position of the extinct Mascarene Parrot Mascarinus mascarin from La Réunion has been unresolved for centuries. A recent molecular study unexpectedly placed M. mascarin within the clade of phenotypically very different Vasa parrots Coracopsis. Based on DNA extracted from the only other preserved Mascarinus specimen, we show that the previously obtained cytb sequence is probably an artificial composite of partial sequences from two other parrot species and that M. mascarin is indeed a part of the Psittacula diversification, placed close to P. eupatria and P. wardi. Copyright © 2016 Elsevier Inc. All rights reserved.
DNA and RNA sequencing by nanoscale reading through programmable electrophoresis and nanoelectrode-gated tunneling and dielectric detection

DOEpatents

Lee, James W.; Thundat, Thomas G.

2005-06-14

An apparatus and method for performing nucleic acid (DNA and/or RNA) sequencing on a single molecule. The genetic sequence information is obtained by probing through a DNA or RNA molecule base by base at nanometer scale as though looking through a strip of movie film. This DNA sequencing nanotechnology has the theoretical capability of performing DNA sequencing at a maximal rate of about 1,000,000 bases per second. This enhanced performance is made possible by a series of innovations including: novel applications of a fine-tuned nanometer gap for passage of a single DNA or RNA molecule; thin layer microfluidics for sample loading and delivery; and programmable electric fields for precise control of DNA or RNA movement. Detection methods include nanoelectrode-gated tunneling current measurements, dielectric molecular characterization, and atomic force microscopy/electrostatic force microscopy (AFM/EFM) probing for nanoscale reading of the nucleic acid sequences.
The sequence specificity of UV-induced DNA damage in a systematically altered DNA sequence.

PubMed

Khoe, Clairine V; Chung, Long H; Murray, Vincent

2018-06-01

The sequence specificity of UV-induced DNA damage was investigated in a specifically designed DNA plasmid using two procedures: end-labelling and linear amplification. Absorption of UV photons by DNA leads to dimerisation of pyrimidine bases and produces two major photoproducts, cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). A previous study had determined that two hexanucleotide sequences, 5'-GCTC*AC and 5'-TATT*AA, were high intensity UV-induced DNA damage sites. The UV clone plasmid was constructed by systematically altering each nucleotide of these two hexanucleotide sequences. One of the main goals of this study was to determine the influence of single nucleotide alterations on the intensity of UV-induced DNA damage. The sequence 5'-GCTC*AC was designed to examine the sequence specificity of 6-4PPs and the highest intensity 6-4PP damage sites were found at 5'-GTTC*CC nucleotides. The sequence 5'-TATT*AA was devised to investigate the sequence specificity of CPDs and the highest intensity CPD damage sites were found at 5'-TTTT*CG nucleotides. It was proposed that the tetranucleotide DNA sequence, 5'-YTC*Y (where Y is T or C), was the consensus sequence for the highest intensity UV-induced 6-4PP adduct sites; while it was 5'-YTT*C for the highest intensity UV-induced CPD damage sites. These consensus tetranucleotides are composed entirely of consecutive pyrimidines and must have a DNA conformation that is highly productive for the absorption of UV photons. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.

Diversity and dynamics of the DNA- and cDNA-derived compost fungal communities throughout the commercial cultivation process for Agaricus bisporus.

PubMed

McGee, C F; Byrne, H; Irvine, A; Wilson, J

2017-01-01

Commercial cultivation of the button mushroom Agaricus bisporus is performed through the inoculation of a semipasteurized composted material. Pasteurization of the compost material prior to inoculation results in a substrate with a fungal community that becomes dominated by A. bisporus. However, little is known about the composition and activity in the wider fungal community beyond the presence of A. bisporus in compost throughout the mushroom cropping process. In this study, the fungal cropping compost community was characterized by sequencing nuc rDNA ITS1-5.8S-ITS2 amplified from extractable DNA and RNA. The fungal community generated from DNA extracts identified a diverse community containing 211 unique species, although only 51 were identified from cDNA. Agaricus bisporus was found to dominate in the DNA-derived fungal community for the duration of the cropping process. However, analysis of cDNA extracts found A. bisporus to dominate only up to the first crop flush, after which activity decreased sharply and a much broader fungal community became active. This study has highlighted the diverse fungal community that is present in mushroom compost during cropping.
Structural features based genome-wide characterization and prediction of nucleosome organization

PubMed Central

2012-01-01

Background Nucleosome distribution along chromatin dictates genomic DNA accessibility and thus profoundly influences gene expression. However, the underlying mechanism of nucleosome formation remains elusive. Here, taking a structural perspective, we systematically explored nucleosome formation potential of genomic sequences and the effect on chromatin organization and gene expression in S. cerevisiae. Results We analyzed twelve structural features related to flexibility, curvature and energy of DNA sequences. The results showed that some structural features such as DNA denaturation, DNA-bending stiffness, Stacking energy, Z-DNA, Propeller twist and free energy, were highly correlated with in vitro and in vivo nucleosome occupancy. Specifically, they can be classified into two classes, one positively and the other negatively correlated with nucleosome occupancy. These two kinds of structural features facilitated nucleosome binding in centromere regions and repressed nucleosome formation in the promoter regions of protein-coding genes to mediate transcriptional regulation. Based on these analyses, we integrated all twelve structural features in a model to predict more accurately nucleosome occupancy in vivo than the existing methods that mainly depend on sequence compositional features. Furthermore, we developed a novel approach, named DLaNe, that located nucleosomes by detecting peaks of structural profiles, and built a meta predictor to integrate information from different structural features. As a comparison, we also constructed a hidden Markov model (HMM) to locate nucleosomes based on the profiles of these structural features. The result showed that the meta DLaNe and HMM-based method performed better than the existing methods, demonstrating the power of these structural features in predicting nucleosome positions. Conclusions Our analysis revealed that DNA structures significantly contribute to nucleosome organization and influence chromatin structure and gene expression regulation. The results indicated that our proposed methods are effective in predicting nucleosome occupancy and positions and that these structural features are highly predictive of nucleosome organization. The implementation of our DLaNe method based on structural features is available online. PMID:22449207
High taxonomic variability despite stable functional structure across microbial communities.

PubMed

Louca, Stilianos; Jacques, Saulo M S; Pires, Aliny P F; Leal, Juliana S; Srivastava, Diane S; Parfrey, Laura Wegener; Farjalla, Vinicius F; Doebeli, Michael

2016-12-05

Understanding the processes that are driving variation of natural microbial communities across space or time is a major challenge for ecologists. Environmental conditions strongly shape the metabolic function of microbial communities; however, other processes such as biotic interactions, random demographic drift or dispersal limitation may also influence community dynamics. The relative importance of these processes and their effects on community function remain largely unknown. To address this uncertainty, here we examined bacterial and archaeal communities in replicate 'miniature' aquatic ecosystems contained within the foliage of wild bromeliads. We used marker gene sequencing to infer the taxonomic composition within nine metabolic functional groups, and shotgun environmental DNA sequencing to estimate the relative abundances of these groups. We found that all of the bromeliads exhibited remarkably similar functional community structures, but that the taxonomic composition within individual functional groups was highly variable. Furthermore, using statistical analyses, we found that non-neutral processes, including environmental filtering and potentially biotic interactions, at least partly shaped the composition within functional groups and were more important than spatial dispersal limitation and demographic drift. Hence both the functional structure and taxonomic composition within functional groups of natural microbial communities may be shaped by non-neutral and roughly separate processes.
Torque measurements reveal sequence-specific cooperative transitions in supercoiled DNA

PubMed Central

Oberstrass, Florian C.; Fernandes, Louis E.; Bryant, Zev

2012-01-01

B-DNA becomes unstable under superhelical stress and is able to adopt a wide range of alternative conformations including strand-separated DNA and Z-DNA. Localized sequence-dependent structural transitions are important for the regulation of biological processes such as DNA replication and transcription. To directly probe the effect of sequence on structural transitions driven by torque, we have measured the torsional response of a panel of DNA sequences using single molecule assays that employ nanosphere rotational probes to achieve high torque resolution. The responses of Z-forming d(pGpC)n sequences match our predictions based on a theoretical treatment of cooperative transitions in helical polymers. “Bubble” templates containing 50–100 bp mismatch regions show cooperative structural transitions similar to B-DNA, although less torque is required to disrupt strand–strand interactions. Our mechanical measurements, including direct characterization of the torsional rigidity of strand-separated DNA, establish a framework for quantitative predictions of the complex torsional response of arbitrary sequences in their biological context. PMID:22474350
Ancient dna from pleistocene fossils: Preservation, recovery, and utility of ancient genetic information for quaternary research

NASA Astrophysics Data System (ADS)

Yang, Hong

Until recently, recovery and analysis of genetic information encoded in ancient DNA sequences from Pleistocene fossils were impossible. Recent advances in molecular biology offered technical tools to obtain ancient DNA sequences from well-preserved Quaternary fossils and opened the possibilities to directly study genetic changes in fossil species to address various biological and paleontological questions. Ancient DNA studies involving Pleistocene fossil material and ancient DNA degradation and preservation in Quaternary deposits are reviewed. The molecular technology applied to isolate, amplify, and sequence ancient DNA is also presented. Authentication of ancient DNA sequences and technical problems associated with modern and ancient DNA contamination are discussed. As illustrated in recent studies on ancient DNA from proboscideans, it is apparent that fossil DNA sequence data can shed light on many aspects of Quaternary research such as systematics and phylogeny. conservation biology, evolutionary theory, molecular taphonomy, and forensic sciences. Improvement of molecular techniques and a better understanding of DNA degradation during fossilization are likely to build on current strengths and to overcome existing problems, making fossil DNA data a unique source of information for Quaternary scientists.
Enantiospecific recognition of DNA sequences by a proflavine Tröger base.

PubMed

Bailly, C; Laine, W; Demeunynck, M; Lhomme, J

2000-07-05

The DNA interaction of a chiral Tröger base derived from proflavine was investigated by DNA melting temperature measurements and complementary biochemical assays. DNase I footprinting experiments demonstrate that the binding of the proflavine-based Tröger base is both enantio- and sequence-specific. The (+)-isomer poorly interacts with DNA in a non-sequence-selective fashion. In sharp contrast, the corresponding (-)-isomer recognizes preferentially certain DNA sequences containing both A. T and G. C base pairs, such as the motifs 5'-GTT. AAC and 5'-ATGA. TCAT. This is the first experimental demonstration that acridine-type Tröger bases can be used for enantiospecific recognition of DNA sequences. Copyright 2000 Academic Press.
Sensitive detection of mercury and copper ions by fluorescent DNA/Ag nanoclusters in guanine-rich DNA hybridization

NASA Astrophysics Data System (ADS)

Peng, Jun; Ling, Jian; Zhang, Xiu-Qing; Bai, Hui-Ping; Zheng, Liyan; Cao, Qiu-E.; Ding, Zhong-Tao

2015-02-01

In this work, we designed a new fluorescent oligonucleotides-stabilized silver nanoclusters (DNA/AgNCs) probe for sensitive detection of mercury and copper ions. This probe contains two tailored DNA sequence. One is a signal probe contains a cytosine-rich sequence template for AgNCs synthesis and link sequence at both ends. The other is a guanine-rich sequence for signal enhancement and link sequence complementary to the link sequence of the signal probe. After hybridization, the fluorescence of hybridized double-strand DNA/AgNCs is 200-fold enhanced based on the fluorescence enhancement effect of DNA/AgNCs in proximity of guanine-rich DNA sequence. The double-strand DNA/AgNCs probe is brighter and stable than that of single-strand DNA/AgNCs, and more importantly, can be used as novel fluorescent probes for detecting mercury and copper ions. Mercury and copper ions in the range of 6.0-160.0 and 6-240 nM, can be linearly detected with the detection limits of 2.1 and 3.4 nM, respectively. Our results indicated that the analytical parameters of the method for mercury and copper ions detection are much better than which using a single-strand DNA/AgNCs.
Ab initio DNA synthesis by Bst polymerase in the presence of nicking endonucleases Nt.AlwI, Nb.BbvCI, and Nb.BsmI.

PubMed

Antipova, Valeriya N; Zheleznaya, Lyudmila A; Zyrina, Nadezhda V

2014-08-01

In the absence of added DNA, thermophilic DNA polymerases synthesize double-stranded DNA from free dNTPs, which consist of numerous repetitive units (ab initio DNA synthesis). The addition of thermophilic restriction endonuclease (REase), or nicking endonuclease (NEase), effectively stimulates ab initio DNA synthesis and determines the nucleotide sequence of reaction products. We have found that NEases Nt.AlwI, Nb.BbvCI, and Nb.BsmI with non-palindromic recognition sites stimulate the synthesis of sequences organized mainly as palindromes. Moreover, the nucleotide sequence of the palindromes appeared to be dependent on NEase recognition/cleavage modes. Thus, the heterodimeric Nb.BbvCI stimulated the synthesis of palindromes composed of two recognition sites of this NEase, which were separated by AT-reach sequences or (A)n (T)m spacers. Palindromic DNA sequences obtained in the ab initio DNA synthesis with the monomeric NEases Nb.BsmI and Nt.AlwI contained, along with the sites of these NEases, randomly synthesized sequences consisted of blocks of short repeats. These findings could help investigation of the potential abilities of highly productive ab initio DNA synthesis for the creation of DNA molecules with desirable sequence. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase.

PubMed

Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V

2006-10-15

The 16,937-nuceotide sequence of the linear mitochondrial DNA (mt-DNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scypozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities; transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.
Recent patents of nanopore DNA sequencing technology: progress and challenges.

PubMed

Zhou, Jianfeng; Xu, Bingqian

2010-11-01

DNA sequencing techniques witnessed fast development in the last decades, primarily driven by the Human Genome Project. Among the proposed new techniques, Nanopore was considered as a suitable candidate for the single DNA sequencing with ultrahigh speed and very low cost. Several fabrication and modification techniques have been developed to produce robust and well-defined nanopore devices. Many efforts have also been done to apply nanopore to analyze the properties of DNA molecules. By comparing with traditional sequencing techniques, nanopore has demonstrated its distinctive superiorities in main practical issues, such as sample preparation, sequencing speed, cost-effective and read-length. Although challenges still remain, recent researches in improving the capabilities of nanopore have shed a light to achieve its ultimate goal: Sequence individual DNA strand at single nucleotide level. This patent review briefly highlights recent developments and technological achievements for DNA analysis and sequencing at single molecule level, focusing on nanopore based methods.
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

PubMed Central

Benslimane, A A; Dron, M; Hartmann, C; Rode, A

1986-01-01

Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553
Pyrosequencing the Canine Faecal Microbiota: Breadth and Depth of Biodiversity

PubMed Central

Hand, Daniel; Wallis, Corrin; Colyer, Alison; Penn, Charles W.

2013-01-01

Mammalian intestinal microbiota remain poorly understood despite decades of interest and investigation by culture-based and other long-established methodologies. Using high-throughput sequencing technology we now report a detailed analysis of canine faecal microbiota. The study group of animals comprised eleven healthy adult miniature Schnauzer dogs of mixed sex and age, some closely related and all housed in kennel and pen accommodation on the same premises with similar feeding and exercise regimes. DNA was extracted from faecal specimens and subjected to PCR amplification of 16S rDNA, followed by sequencing of the 5′ region that included variable regions V1 and V2. Barcoded amplicons were sequenced by Roche-454 FLX high-throughput pyrosequencing. Sequences were assigned to taxa using the Ribosomal Database Project Bayesian classifier and revealed dominance of Fusobacterium and Bacteroidetes phyla. Differences between animals in the proportions of different taxa, among 10,000 reads per animal, were clear and not supportive of the concept of a “core microbiota”. Despite this variability in prominent genera, littermates were shown to have a more similar faecal microbial composition than unrelated dogs. Diversity of the microbiota was also assessed by assignment of sequence reads into operational taxonomic units (OTUs) at the level of 97% sequence identity. The OTU data were then subjected to rarefaction analysis and determination of Chao1 richness estimates. The data indicated that faecal microbiota comprised possibly as many as 500 to 1500 OTUs. PMID:23382835
Ultra-deep sequencing enables high-fidelity recovery of biodiversity for bulk arthropod samples without PCR amplification

PubMed Central

2013-01-01

Background Next-generation-sequencing (NGS) technologies combined with a classic DNA barcoding approach have enabled fast and credible measurement for biodiversity of mixed environmental samples. However, the PCR amplification involved in nearly all existing NGS protocols inevitably introduces taxonomic biases. In the present study, we developed new Illumina pipelines without PCR amplifications to analyze terrestrial arthropod communities. Results Mitochondrial enrichment directly followed by Illumina shotgun sequencing, at an ultra-high sequence volume, enabled the recovery of Cytochrome c Oxidase subunit 1 (COI) barcode sequences, which allowed for the estimation of species composition at high fidelity for a terrestrial insect community. With 15.5 Gbp Illumina data, approximately 97% and 92% were detected out of the 37 input Operational Taxonomic Units (OTUs), whether the reference barcode library was used or not, respectively, while only 1 novel OTU was found for the latter. Additionally, relatively strong correlation between the sequencing volume and the total biomass was observed for species from the bulk sample, suggesting a potential solution to reveal relative abundance. Conclusions The ability of the new Illumina PCR-free pipeline for DNA metabarcoding to detect small arthropod specimens and its tendency to avoid most, if not all, false positives suggests its great potential in biodiversity-related surveillance, such as in biomonitoring programs. However, further improvement for mitochondrial enrichment is likely needed for the application of the new pipeline in analyzing arthropod communities at higher diversity. PMID:23587339
Next-Generation Sequencing Platforms

NASA Astrophysics Data System (ADS)

Mardis, Elaine R.

2013-06-01

Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Regulatory link between DNA methylation and active demethylation in Arabidopsis

PubMed Central

Lei, Mingguang; Zhang, Huiming; Julian, Russell; Tang, Kai; Xie, Shaojun; Zhu, Jian-Kang

2015-01-01

De novo DNA methylation through the RNA-directed DNA methylation (RdDM) pathway and active DNA demethylation play important roles in controlling genome-wide DNA methylation patterns in plants. Little is known about how cells manage the balance between DNA methylation and active demethylation activities. Here, we report the identification of a unique RdDM target sequence, where DNA methylation is required for maintaining proper active DNA demethylation of the Arabidopsis genome. In a genetic screen for cellular antisilencing factors, we isolated several REPRESSOR OF SILENCING 1 (ros1) mutant alleles, as well as many RdDM mutants, which showed drastically reduced ROS1 gene expression and, consequently, transcriptional silencing of two reporter genes. A helitron transposon element (TE) in the ROS1 gene promoter negatively controls ROS1 expression, whereas DNA methylation of an RdDM target sequence between ROS1 5′ UTR and the promoter TE region antagonizes this helitron TE in regulating ROS1 expression. This RdDM target sequence is also targeted by ROS1, and defective DNA demethylation in loss-of-function ros1 mutant alleles causes DNA hypermethylation of this sequence and concomitantly causes increased ROS1 expression. Our results suggest that this sequence in the ROS1 promoter region serves as a DNA methylation monitoring sequence (MEMS) that senses DNA methylation and active DNA demethylation activities. Therefore, the ROS1 promoter functions like a thermostat (i.e., methylstat) to sense DNA methylation levels and regulates DNA methylation by controlling ROS1 expression. PMID:25733903
Wolbachia association with the tsetse fly, Glossina fuscipes fuscipes, reveals high levels of genetic diversity and complex evolutionary dynamics

PubMed Central

2013-01-01

Background Wolbachia pipientis, a diverse group of α-proteobacteria, can alter arthropod host reproduction and confer a reproductive advantage to Wolbachia-infected females (cytoplasmic incompatibility (CI)). This advantage can alter host population genetics because Wolbachia-infected females produce more offspring with their own mitochondrial DNA (mtDNA) haplotypes than uninfected females. Thus, these host haplotypes become common or fixed (selective sweep). Although simulations suggest that for a CI-mediated sweep to occur, there must be a transient phase with repeated initial infections of multiple individual hosts by different Wolbachia strains, this has not been observed empirically. Wolbachia has been found in the tsetse fly, Glossina fuscipes fuscipes, but it is not limited to a single host haplotype, suggesting that CI did not impact its population structure. However, host population genetic differentiation could have been generated if multiple Wolbachia strains interacted in some populations. Here, we investigated Wolbachia genetic variation in G. f. fuscipes populations of known host genetic composition in Uganda. We tested for the presence of multiple Wolbachia strains using Multi-Locus Sequence Typing (MLST) and for an association between geographic region and host mtDNA haplotype using Wolbachia DNA sequence from a variable locus, groEL (heat shock protein 60). Results MLST demonstrated that some G. f. fuscipes carry Wolbachia strains from two lineages. GroEL revealed high levels of sequence diversity within and between individuals (Haplotype diversity = 0.945). We found Wolbachia associated with 26 host mtDNA haplotypes, an unprecedented result. We observed a geographical association of one Wolbachia lineage with southern host mtDNA haplotypes, but it was non-significant (p = 0.16). Though most Wolbachia-infected host haplotypes were those found in the contact region between host mtDNA groups, this association was non-significant (p = 0.17). Conclusions High Wolbachia sequence diversity and the association of Wolbachia with multiple host haplotypes suggest that different Wolbachia strains infected G. f. fuscipes multiple times independently. We suggest that these observations reflect a transient phase in Wolbachia evolution that is influenced by the long gestation and low reproductive output of tsetse. Although G. f. fuscipes is superinfected with Wolbachia, our data does not support that bidirectional CI has influenced host genetic diversity in Uganda. PMID:23384159
Attomole-level Genomics with Single-molecule Direct DNA, cDNA and RNA Sequencing Technologies.

PubMed

Ozsolak, Fatih

2016-01-01

With the introduction of next-generation sequencing (NGS) technologies in 2005, the domination of microarrays in genomics quickly came to an end due to NGS's superior technical performance and cost advantages. By enabling genetic analysis capabilities that were not possible previously, NGS technologies have started to play an integral role in all areas of biomedical research. This chapter outlines the low-quantity DNA and cDNA sequencing capabilities and applications developed with the Helicos single molecule DNA sequencing technology.
Association of dietary type with fecal microbiota in vegetarians and omnivores in Slovenia.

PubMed

Matijašić, Bojana Bogovič; Obermajer, Tanja; Lipoglavšek, Luka; Grabnar, Iztok; Avguštin, Gorazd; Rogelj, Irena

2014-06-01

The purpose of this study was to discover differences in the human fecal microbiota composition driven by long-term omnivore versus vegan/lacto-vegetarian dietary pattern. In addition, the possible association of demographic characteristics and dietary habits such as consumption of particular foods with the fecal microbiota was examined. This study was conducted on a Slovenian population comprising 31 vegetarian participants (11 lacto-vegetarians and 20 vegans) and 29 omnivore participants. Bacterial DNA was extracted from the frozen fecal samples by Maxwell 16 Tissue DNA Purification Kit (Promega). Relative quantification of selected bacterial groups was performed by real-time PCR. Differences in fecal microbiota composition were evaluated by PCR-DGGE fingerprinting of the V3 16S rRNA region. Participants' demographic characteristics, dietary habits and health status information were collected through a questionnaire. Vegetarian diet was associated with higher ratio (% of group-specific DNA in relation to all bacterial DNA) of Bacteroides-Prevotella, Bacteroides thetaiotaomicron, Clostridium clostridioforme and Faecalibacterium prausnitzii, but with lower ratio (%) of Clostridium cluster XIVa. Real-time PCR also showed a higher concentration and ratio of Enterobacteriaceae (16S rDNA copies/g and %) in female participants (p < 0.05 and p < 0.01) and decrease in Bifidobacterium with age (p < 0.01). DGGE analysis of the 16S rRNA V3 region showed that relative quantity of DGGE bands from certain bacterial groups was lower (Bifidobacterium, Streptococus, Collinsella and Lachnospiraceae) or higher (Subdoligranulum) among vegetarians, indicating the association of dietary type with bacterial community composition. Sequencing of selected DGGE bands revealed the presence of common representatives of fecal microbiota: Bacteroides, Eubacterium, Faecalibacterium, Ruminococcaceae, Bifidobacterium and Lachnospiraceae. Up to 4 % of variance in microbial community analyzed by DGGE could be explained by the vegetarian type of diet. Long-term vegetarian diet contributed to quantity and associated bacterial community shifts in fecal microbiota composition. Consumption of foods of animal origin (eggs, red meat, white meat, milk, yoghurt, other dairy products, fish and seafood) and vegetarian type of diet explained the largest share of variance in microbial community structure. Fecal microbiota composition was also associated with participants' age, gender and body mass.
A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

PubMed Central

Walker, M D; Park, C W; Rosen, A; Aronheim, A

1990-01-01

Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401
Highly multiplexed targeted DNA sequencing from single nuclei.

PubMed

Leung, Marco L; Wang, Yong; Kim, Charissa; Gao, Ruli; Jiang, Jerry; Sei, Emi; Navin, Nicholas E

2016-02-01

Single-cell DNA sequencing methods are challenged by poor physical coverage, high technical error rates and low throughput. To address these issues, we developed a single-cell DNA sequencing protocol that combines flow-sorting of single nuclei, time-limited multiple-displacement amplification (MDA), low-input library preparation, DNA barcoding, targeted capture and next-generation sequencing (NGS). This approach represents a major improvement over our previous single nucleus sequencing (SNS) Nature Protocols paper in terms of generating higher-coverage data (>90%), thereby enabling the detection of genome-wide variants in single mammalian cells at base-pair resolution. Furthermore, by pooling 48-96 single-cell libraries together for targeted capture, this approach can be used to sequence many single-cell libraries in parallel in a single reaction. This protocol greatly reduces the cost of single-cell DNA sequencing, and it can be completed in 5-6 d by advanced users. This single-cell DNA sequencing protocol has broad applications for studying rare cells and complex populations in diverse fields of biological research and medicine.

Capillary electrophoretic separation-based approach to determine the labeling kinetics of oligodeoxynucleotides

PubMed Central

Kanavarioti, Anastassia; Greenman, Kevin L.; Hamalainen, Mark; Jain, Aakriti; Johns, Adam M.; Melville, Chris R.; Kemmish, Kent; Andregg, William

2014-01-01

With the recent advances in electron microscopy (EM), computation, and nanofabrication, the original idea of reading DNA sequence directly from an image can now be tested. One approach is to develop heavy atom labels that can provide the contrast required for EM imaging. While evaluating tentative labels for the respective nucleobases in synthetic oligodeoxynucleotides (oligos), we developed a streamlined capillary electrophoresis (CE) protocol to assess the label stability, reactivity, and selectivity. We report our protocol using osmium tetroxide 2,2′-bipyridine (Osbipy) as a thymidine (T) specific label. The observed rates show that the labeling process is kinetically independent of both the oligo length, and the base composition. The conditions, i.e. temperature, optimal Osbipy concentration, and molar ratio of reagents, to promote 100% conversion of the starting oligo to labeled product were established. Hence the optimized conditions developed with the oligos could be leveraged to allow osmylation of effectively all Ts in single-stranded (ss) DNA, while achieving minimal mislabeling. In addition, the approach and methods employed here may be adapted to the evaluation of other prospective contrasting agents/labels to facilitate next-generation DNA sequencing by EM. PMID:23147698
Nitrogen-fixing and cellulose-producing Gluconacetobacter kombuchae sp. nov., isolated from Kombucha tea.

PubMed

Dutta, Debasree; Gachhui, Ratan

2007-02-01

A few members of the family Acetobacteraceae are cellulose-producers, while only six members fix nitrogen. Bacterial strain RG3T, isolated from Kombucha tea, displays both of these characteristics. A high bootstrap value in the 16S rRNA gene sequence-based phylogenetic analysis supported the position of this strain within the genus Gluconacetobacter, with Gluconacetobacter hansenii LMG 1527T as its nearest neighbour (99.1 % sequence similarity). It could utilize ethanol, fructose, arabinose, glycerol, sorbitol and mannitol, but not galactose or xylose, as sole sources of carbon. Single amino acids such as L-alanine, L-cysteine and L-threonine served as carbon and nitrogen sources for growth of strain RG3T. Strain RG3T produced cellulose in both nitrogen-free broth and enriched medium. The ubiquinone present was Q-10 and the DNA base composition was 55.8 mol% G+C. It exhibited low values of 5.2-27.77 % DNA-DNA relatedness to the type strains of related gluconacetobacters, which placed it within a separate taxon, for which the name Gluconacetobacter kombuchae sp. nov. is proposed, with the type strain RG3T (=LMG 23726T=MTCC 6913T).
Variant ribosomal RNA alleles are conserved and exhibit tissue-specific expression

PubMed Central

Parks, Matthew M.; Kurylo, Chad M.; Dass, Randall A.; Bojmar, Linda; Lyden, David; Vincent, C. Theresa; Blanchard, Scott C.

2018-01-01

The ribosome, the integration point for protein synthesis in the cell, is conventionally considered a homogeneous molecular assembly that only passively contributes to gene expression. Yet, epigenetic features of the ribosomal DNA (rDNA) operon and changes in the ribosome’s molecular composition have been associated with disease phenotypes, suggesting that the ribosome itself may possess inherent regulatory capacity. Analyzing whole-genome sequencing data from the 1000 Genomes Project and the Mouse Genomes Project, we find that rDNA copy number varies widely across individuals, and we identify pervasive intra- and interindividual nucleotide variation in the 5S, 5.8S, 18S, and 28S ribosomal RNA (rRNA) genes of both human and mouse. Conserved rRNA sequence heterogeneities map to functional centers of the assembled ribosome, variant rRNA alleles exhibit tissue-specific expression, and ribosomes bearing variant rRNA alleles are present in the actively translating ribosome pool. These findings provide a critical framework for exploring the possibility that the expression of genomically encoded variant rRNA alleles gives rise to physically and functionally heterogeneous ribosomes that contribute to mammalian physiology and human disease. PMID:29503865
Isolation and characterization of a cDNA from Cuphea lanceolata encoding a beta-ketoacyl-ACP reductase.

PubMed

Klein, B; Pawlowski, K; Höricke-Grandpierre, C; Schell, J; Töpfer, R

1992-05-01

A cDNA encoding beta-ketoacyl-ACP reductase (EC 1.1.1.100), an integral part of the fatty acid synthase type II, was cloned from Cuphea lanceolata. This cDNA of 1276 bp codes for a polypeptide of 320 amino acids with 63 N-terminal residues presumably representing a transit peptide and 257 residues corresponding to the mature protein of 27 kDa. The encoded protein shows strong homology with the amino-terminal sequence and two tryptic peptides from avocado mesocarp beta-ketoacyl-ACP reductase, and its total amino acid composition is highly similar to those of the beta-ketoacyl-ACP reductases of avocado and spinach. Amino acid sequence homologies to polyketide synthase, beta-ketoreductases and short-chain alcohol dehydrogenases are discussed. An engineered fusion protein lacking most of the transit peptide, which was produced in Escherichia coli, was isolated and proved to possess beta-ketoacyl-ACP reductase activity. Hybridization studies revealed that in C. lanceolata beta-ketoacyl-ACP reductase is encoded by a small family of at least two genes and that members of this family are expressed in roots, leaves, flowers and seeds.
A reference linkage map for Eucalyptus

PubMed Central

2012-01-01

Background Genetic linkage maps are invaluable resources in plant research. They provide a key tool for many genetic applications including: mapping quantitative trait loci (QTL); comparative mapping; identifying unlinked (i.e. independent) DNA markers for fingerprinting, population genetics and phylogenetics; assisting genome sequence assembly; relating physical and recombination distances along the genome and map-based cloning of genes. Eucalypts are the dominant tree species in most Australian ecosystems and of economic importance globally as plantation trees. The genome sequence of E. grandis has recently been released providing unprecedented opportunities for genetic and genomic research in the genus. A robust reference linkage map containing sequence-based molecular markers is needed to capitalise on this resource. Several high density linkage maps have recently been constructed for the main commercial forestry species in the genus (E. grandis, E. urophylla and E. globulus) using sequenced Diversity Arrays Technology (DArT) and microsatellite markers. To provide a single reference linkage map for eucalypts a composite map was produced through the integration of data from seven independent mapping experiments (1950 individuals) using a marker-merging method. Results The composite map totalled 1107 cM and contained 4101 markers; comprising 3880 DArT, 213 microsatellite and eight candidate genes. Eighty-one DArT markers were mapped to two or more linkage groups, resulting in the 4101 markers being mapped to 4191 map positions. Approximately 13% of DArT markers mapped to identical map positions, thus the composite map contained 3634 unique loci at an average interval of 0.31 cM. Conclusion The composite map represents the most saturated linkage map yet produced in Eucalyptus. As the majority of DArT markers contained on the map have been sequenced, the map provides a direct link to the E. grandis genome sequence and will serve as an important reference for progressing eucalypt research. PMID:22702473
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE PAGES

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...

2017-07-18

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

PubMed Central

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
Impact of enzymatic digestion on bacterial community composition in CF airway samples.

PubMed

Williamson, Kayla M; Wagner, Brandie D; Robertson, Charles E; Johnson, Emily J; Zemanick, Edith T; Harris, J Kirk

2017-01-01

Previous studies have demonstrated the importance of DNA extraction methods for molecular detection of Staphylococcus, an important bacterial group in cystic fibrosis (CF). We sought to evaluate the effect of enzymatic digestion (EnzD) prior to DNA extraction on bacterial communities identified in sputum and oropharyngeal swab (OP) samples from patients with CF. DNA from 81 samples (39 sputum and 42 OP) collected from 63 patients with CF was extracted in duplicate with and without EnzD. Bacterial communities were determined by rRNA gene sequencing, and measures of alpha and beta diversity were calculated. Principal Coordinate Analysis (PCoA) was used to assess differences at the community level and Wilcoxon Signed Rank tests were used to compare relative abundance (RA) of individual genera for paired samples with and without EnzD. Shannon Diversity Index (alpha-diversity) decreased in sputum and OP samples with the use of EnzD. Larger shifts in community composition were observed for OP samples (beta-diversity, measured by Morisita-Horn), whereas less change in communities was observed for sputum samples. The use of EnzD with OP swabs resulted in significant increase in RA for the genera Gemella ( p < 0.01), Streptococcus ( p < 0.01), and Rothia ( p < 0.01). Staphylococcus ( p < 0.01) was the only genus with a significant increase in RA from sputum, whereas the following genera decreased in RA with EnzD: Veillonella ( p < 0.01), Granulicatella ( p < 0.01), Prevotella ( p < 0.01), and Gemella ( p = 0.02). In OP samples, higher RA of Gram-positive taxa was associated with larger changes in microbial community composition. We show that the application of EnzD to CF airway samples, particularly OP swabs, results in differences in microbial communities detected by sequencing. Use of EnzD can result in large changes in bacterial community composition, and is particularly useful for detection of Staphylococcus in CF OP samples. The enhanced identification of Staphylococcus aureus is a strong indication to utilize EnzD in studies that use OP swabs to monitor CF airway communities.
Mapping the binding site of aflatoxin B/sub 1/ in DNA: systematic analysis of the reactivity of aflatoxin B/sub 1/ with guanines in different DNA sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Benasutti, M.; Ejadi, S.; Whitlow, M.D.

The mutagenic and carcinogenic chemical aflatoxin B/sub 1/ (AFB/sub 1/) reacts almost exclusively at the N(7)-position of guanine following activation to its reactive form, the 8,9-epoxide (AFB/sub 1/ oxide). In general N(7)-guanine adducts yield DNA strand breaks when heated in base, a property that serves as the basis for the Maxam-Gilbert DNA sequencing reaction specific for guanine. Using DNA sequencing methods, other workers have shown that AFB/sub 1/ oxide gives strand breaks at positions of guanines; however, the guanine bands varied in intensity. This phenomenon has been used to infer that AFB/sub 1/ oxide prefers to react with guanines inmore » some sequence contexts more than in others and has been referred to as sequence specificity of binding. Herein, data on the reaction of AFB/sub 1/ oxide with several synthetic DNA polymers with different sequences are presented, and (following hydrolysis) adduct levels are determine by high-pressure liquid chromatography. These results reveal that for AFB/sub 1/ oxide (1) the N(7)-guanine adduct is the major adduct found in all of the DNA polymers, (2) adduct levels vary in different sequences, and, thus, sequence specificity is also observed by this more direct method, and (3) the intensity of bands in DNA sequencing gels is likely to reflect adduct levels formed at the N(7)-position of guanine. Knowing this, a reinvestigation of the reactivity of guanines in different DNA sequences using DNA sequencing methods was undertaken. Methods are developed to determine the X (5'-side) base and the Y (3'-side) base are most influential in determining guanine reactivity. These rules in conjunction with molecular modeling studies were used to assess the binding sites that might be utilized by AFB/sub 1/ oxide in its reaction with DNA.« less
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family me This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
A multilevel ant colony optimization algorithm for classical and isothermic DNA sequencing by hybridization with multiplicity information available.

PubMed

Kwarciak, Kamil; Radom, Marcin; Formanowicz, Piotr

2016-04-01

The classical sequencing by hybridization takes into account a binary information about sequence composition. A given element from an oligonucleotide library is or is not a part of the target sequence. However, the DNA chip technology has been developed and it enables to receive a partial information about multiplicity of each oligonucleotide the analyzed sequence consist of. Currently, it is not possible to assess the exact data of such type but even partial information should be very useful. Two realistic multiplicity information models are taken into consideration in this paper. The first one, called "one and many" assumes that it is possible to obtain information if a given oligonucleotide occurs in a reconstructed sequence once or more than once. According to the second model, called "one, two and many", one is able to receive from biochemical experiment information if a given oligonucleotide is present in an analyzed sequence once, twice or at least three times. An ant colony optimization algorithm has been implemented to verify the above models and to compare with existing algorithms for sequencing by hybridization which utilize the additional information. The proposed algorithm solves the problem with any kind of hybridization errors. Computational experiment results confirm that using even the partial information about multiplicity leads to increased quality of reconstructed sequences. Moreover, they also show that the more precise model enables to obtain better solutions and the ant colony optimization algorithm outperforms the existing ones. Test data sets and the proposed ant colony optimization algorithm are available on: http://bioserver.cs.put.poznan.pl/download/ACO4mSBH.zip. Copyright © 2016 Elsevier Ltd. All rights reserved.
Affordable Hands-On DNA Sequencing and Genotyping: An Exercise for Teaching DNA Analysis to Undergraduates

ERIC Educational Resources Information Center

Shah, Kushani; Thomas, Shelby; Stein, Arnold

2013-01-01

In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C…
DNA Barcode Goes Two-Dimensions: DNA QR Code Web Server

PubMed Central

Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin

2012-01-01

The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, “DNA barcode” actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interests, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications. PMID:22574113
High-resolution characterization of sequence signatures due to non-random cleavage of cell-free DNA.

PubMed

Chandrananda, Dineika; Thorne, Natalie P; Bahlo, Melanie

2015-06-17

High-throughput sequencing of cell-free DNA fragments found in human plasma has been used to non-invasively detect fetal aneuploidy, monitor organ transplants and investigate tumor DNA. However, many biological properties of this extracellular genetic material remain unknown. Research that further characterizes circulating DNA could substantially increase its diagnostic value by allowing the application of more sophisticated bioinformatics tools that lead to an improved signal to noise ratio in the sequencing data. In this study, we investigate various features of cell-free DNA in plasma using deep-sequencing data from two pregnant women (>70X, >50X) and compare them with matched cellular DNA. We utilize a descriptive approach to examine how the biological cleavage of cell-free DNA affects different sequence signatures such as fragment lengths, sequence motifs at fragment ends and the distribution of cleavage sites along the genome. We show that the size distributions of these cell-free DNA molecules are dependent on their autosomal and mitochondrial origin as well as the genomic location within chromosomes. DNA mapping to particular microsatellites and alpha repeat elements display unique size signatures. We show how cell-free fragments occur in clusters along the genome, localizing to nucleosomal arrays and are preferentially cleaved at linker regions by correlating the mapping locations of these fragments with ENCODE annotation of chromatin organization. Our work further demonstrates that cell-free autosomal DNA cleavage is sequence dependent. The region spanning up to 10 positions on either side of the DNA cleavage site show a consistent pattern of preference for specific nucleotides. This sequence motif is present in cleavage sites localized to nucleosomal cores and linker regions but is absent in nucleosome-free mitochondrial DNA. These background signals in cell-free DNA sequencing data stem from the non-random biological cleavage of these fragments. This sequence structure can be harnessed to improve bioinformatics algorithms, in particular for CNV and structural variant detection. Descriptive measures for cell-free DNA features developed here could also be used in biomarker analysis to monitor the changes that occur during different pathological conditions.
Parasitic infections and resource economy of Danish Iron Age settlement through ancient DNA sequencing.

PubMed

Tams, Katrine Wegener; Jensen Søe, Martin; Merkyte, Inga; Valeur Seersholm, Frederik; Henriksen, Peter Steen; Klingenberg, Susanne; Willerslev, Eske; Kjær, Kurt H; Hansen, Anders Johannes; Kapel, Christian Moliin Outzen

2018-01-01

In this study, we screen archaeological soil samples by microscopy and analyse the samples by next generation sequencing to obtain results with parasites at species level and untargeted findings of plant and animal DNA. Three separate sediment layers of an ancient man-made pond in Hoby, Denmark, ranging from 100 BC to 200 AD, were analysed by microscopy for presence of intestinal worm eggs and DNA analysis were performed to identify intestinal worms and dietary components. Ancient DNA of parasites, domestic animals and edible plants revealed a change in use of the pond over time reflecting the household practice in the adjacent Iron Age settlement. The most abundant parasite found belonged to the Ascaris genus, which was not possible to type at species level. For all sediment layers the presence of eggs of the human whipworm Trichuris trichiura and the beef tapeworm Taenia saginata suggests continuous disposal of human faeces in the pond. Moreover, the continuous findings of T. saginata further imply beef consumption and may suggest that cattle were living in the immediate surrounding of the site throughout the period. Findings of additional host-specific parasites suggest fluctuating presence of other domestic animals over time: Trichuris suis (pig), Parascaris univalens (horse), Taenia hydatigena (dog and sheep). Likewise, alternating occurrence of aDNA of edible plants may suggest changes in agricultural practices. Moreover, the composition of aDNA of parasites, plants and vertebrates suggests a significant change in the use of the ancient pond over a period of three centuries.
Analysis of DNA Sequences by An Optical Time-Integrating Correlator: Proof-Of-Concept Experiments.

DTIC Science & Technology

1992-05-01

TABLES xv LIST OF ABBREVIATIONS xvii 1.0 INTRODUCTION 1 2.0 DNA ANALYSIS STRATEGY 4 2.1 Representation of DNA Bases 4 2.2 DNA Analysis Strategy 6 3.0...Zehnder architecture. 3 Figure 3: Short representations of the DNA bases where each base is represented by a 7-bits long pseudorandom sequence. 5... DNA bases where each base is represented by 7-bits long pseudorandom sequences. 4 Table 2: Long representations of the DNA bases with 255-bits maximum
SNP discovery through de novo deep sequencing using the next generation of DNA sequencers

USDA-ARS?s Scientific Manuscript database

The production of high volumes of DNA sequence data using new technologies has permitted more efficient identification of single nucleotide polymorphisms in vertebrate genomes. This chapter presented practical methodology for production and analysis of DNA sequence data for SNP discovery....
A simple procedure for parallel sequence analysis of both strands of 5'-labeled DNA.

PubMed

Razvi, F; Gargiulo, G; Worcel, A

1983-08-01

Ligation of a 5'-labeled DNA restriction fragment results in a circular DNA molecule carrying the two 32Ps at the reformed restriction site. Double digestions of the circular DNA with the original enzyme and a second restriction enzyme cleavage near the labeled site allows direct chemical sequencing of one 5'-labeled DNA strand. Similar double digestions, using an isoschizomer that cleaves differently at the 32P-labeled site, allows direct sequencing of the now 3'-labeled complementary DNA strand. It is possible to directly sequence both strands of cloned DNA inserts by using the above protocol and a multiple cloning site vector that provides the necessary restriction sites. The simultaneous and parallel visualization of both DNA strands eliminates sequence ambiguities. In addition, the labeled circular molecules are particularly useful for single-hit DNA cleavage studies and DNA footprint analysis. As an example, we show here an analysis of the micrococcal nuclease-induced breaks on the two strands of the somatic 5S RNA gene of Xenopus borealis, which suggests that the enzyme may recognize and cleave small AT-containing palindromes along the DNA helix.
A Glimpse into the Satellite DNA Library in Characidae Fish (Teleostei, Characiformes)

PubMed Central

Utsunomia, Ricardo; Ruiz-Ruano, Francisco J.; Silva, Duílio M. Z. A.; Serrano, Érica A.; Rosa, Ivana F.; Scudeler, Patrícia E. S.; Hashimoto, Diogo T.; Oliveira, Claudio; Camacho, Juan Pedro M.; Foresti, Fausto

2017-01-01

Satellite DNA (satDNA) is an abundant fraction of repetitive DNA in eukaryotic genomes and plays an important role in genome organization and evolution. In general, satDNA sequences follow a concerted evolutionary pattern through the intragenomic homogenization of different repeat units. In addition, the satDNA library hypothesis predicts that related species share a series of satDNA variants descended from a common ancestor species, with differential amplification of different satDNA variants. The finding of a same satDNA family in species belonging to different genera within Characidae fish provided the opportunity to test both concerted evolution and library hypotheses. For this purpose, we analyzed here sequence variation and abundance of this satDNA family in ten species, by a combination of next generation sequencing (NGS), PCR and Sanger sequencing, and fluorescence in situ hybridization (FISH). We found extensive between-species variation for the number and size of pericentromeric FISH signals. At genomic level, the analysis of 1000s of DNA sequences obtained by Illumina sequencing and PCR amplification allowed defining 150 haplotypes which were linked in a common minimum spanning tree, where different patterns of concerted evolution were apparent. This also provided a glimpse into the satDNA library of this group of species. In consistency with the library hypothesis, different variants for this satDNA showed high differences in abundance between species, from highly abundant to simply relictual variants. PMID:28855916

Some links on this page may take you to non-federal websites. Their policies may differ from this site.