dna sequence copy: Topics by Science.gov

Sample records for dna sequence copy

DNA copy number, including telomeres and mitochondria, assayed using next-generation sequencing.

PubMed

Castle, John C; Biery, Matthew; Bouzek, Heather; Xie, Tao; Chen, Ronghua; Misura, Kira; Jackson, Stuart; Armour, Christopher D; Johnson, Jason M; Rohl, Carol A; Raymond, Christopher K

2010-04-16

DNA copy number variations occur within populations and aberrations can cause disease. We sought to develop an improved lab-automatable, cost-efficient, accurate platform to profile DNA copy number. We developed a sequencing-based assay of nuclear, mitochondrial, and telomeric DNA copy number that draws on the unbiased nature of next-generation sequencing and incorporates techniques developed for RNA expression profiling. To demonstrate this platform, we assayed UMC-11 cells using 5 million 33 nt reads and found tremendous copy number variation, including regions of single and homogeneous deletions and amplifications to 29 copies; 5 times more mitochondria and 4 times less telomeric sequence than a pool of non-diseased, blood-derived DNA; and that UMC-11 was derived from a male individual. The described assay outputs absolute copy number, outputs an error estimate (p-value), and is more accurate than array-based platforms at high copy number. The platform enables profiling of mitochondrial levels and telomeric length. The assay is lab-automatable and has a genomic resolution and cost that are tunable based on the number of sequence reads.
DNA copy number, including telomeres and mitochondria, assayed using next-generation sequencing

PubMed Central

2010-01-01

Background DNA copy number variations occur within populations and aberrations can cause disease. We sought to develop an improved lab-automatable, cost-efficient, accurate platform to profile DNA copy number. Results We developed a sequencing-based assay of nuclear, mitochondrial, and telomeric DNA copy number that draws on the unbiased nature of next-generation sequencing and incorporates techniques developed for RNA expression profiling. To demonstrate this platform, we assayed UMC-11 cells using 5 million 33 nt reads and found tremendous copy number variation, including regions of single and homogeneous deletions and amplifications to 29 copies; 5 times more mitochondria and 4 times less telomeric sequence than a pool of non-diseased, blood-derived DNA; and that UMC-11 was derived from a male individual. Conclusion The described assay outputs absolute copy number, outputs an error estimate (p-value), and is more accurate than array-based platforms at high copy number. The platform enables profiling of mitochondrial levels and telomeric length. The assay is lab-automatable and has a genomic resolution and cost that are tunable based on the number of sequence reads. PMID:20398377
A comparison of RNA with DNA in template-directed synthesis

NASA Technical Reports Server (NTRS)

Zielinski, M.; Kozlov, I. A.; Orgel, L. E.; Bada, J. L. (Principal Investigator)

2000-01-01

Nonenzymatic template-directed copying of RNA sequences rich in cytidylic acid using nucleoside 5'-(2-methylimidazol-1-yl phosphates) as substrates is substantially more efficient than the copying of corresponding DNA sequences. However, many sequences cannot be copied, and the prospect of replication in this system is remote, even for RNA. Surprisingly, wobble-pairing leads to much more efficient incorporation of G opposite U on RNA templates than of G opposite T on DNA templates.
DNA copy number changes define spatial patterns of heterogeneity in colorectal cancer

PubMed Central

Mamlouk, Soulafa; Childs, Liam Harold; Aust, Daniela; Heim, Daniel; Melching, Friederike; Oliveira, Cristiano; Wolf, Thomas; Durek, Pawel; Schumacher, Dirk; Bläker, Hendrik; von Winterfeld, Moritz; Gastl, Bastian; Möhr, Kerstin; Menne, Andrea; Zeugner, Silke; Redmer, Torben; Lenze, Dido; Tierling, Sascha; Möbs, Markus; Weichert, Wilko; Folprecht, Gunnar; Blanc, Eric; Beule, Dieter; Schäfer, Reinhold; Morkel, Markus; Klauschen, Frederick; Leser, Ulf; Sers, Christine

2017-01-01

Genetic heterogeneity between and within tumours is a major factor determining cancer progression and therapy response. Here we examined DNA sequence and DNA copy-number heterogeneity in colorectal cancer (CRC) by targeted high-depth sequencing of 100 most frequently altered genes. In 97 samples, with primary tumours and matched metastases from 27 patients, we observe inter-tumour concordance for coding mutations; in contrast, gene copy numbers are highly discordant between primary tumours and metastases as validated by fluorescent in situ hybridization. To further investigate intra-tumour heterogeneity, we dissected a single tumour into 68 spatially defined samples and sequenced them separately. We identify evenly distributed coding mutations in APC and TP53 in all tumour areas, yet highly variable gene copy numbers in numerous genes. 3D morpho-molecular reconstruction reveals two clusters with divergent copy number aberrations along the proximal–distal axis indicating that DNA copy number variations are a major source of tumour heterogeneity in CRC. PMID:28120820
Population genetics and molecular evolution of DNA sequences in transposable elements. I. A simulation framework.

PubMed

Kijima, T E; Innan, Hideki

2013-11-01

A population genetic simulation framework is developed to understand the behavior and molecular evolution of DNA sequences of transposable elements. Our model incorporates random transposition and excision of transposable element (TE) copies, two modes of selection against TEs, and degeneration of transpositional activity by point mutations. We first investigated the relationships between the behavior of the copy number of TEs and these parameters. Our results show that when selection is weak, the genome can maintain a relatively large number of TEs, but most of them are less active. In contrast, with strong selection, the genome can maintain only a limited number of TEs but the proportion of active copies is large. In such a case, there could be substantial fluctuations of the copy number over generations. We also explored how DNA sequences of TEs evolve through the simulations. In general, active copies form clusters around the original sequence, while less active copies have long branches specific to themselves, exhibiting a star-shaped phylogeny. It is demonstrated that the phylogeny of TE sequences could be informative to understand the dynamics of TE evolution.
Tandem repeats of the 5' non-transcribed spacer of Tetrahymena rDNA function as high copy number autonomous replicons in the macronucleus but do not prevent rRNA gene dosage regulation.

PubMed Central

Pan, W J; Blackburn, E H

1995-01-01

The rRNA genes in the somatic macronucleus of Tetrahymena thermophila are normally on 21 kb linear palindromic molecules (rDNA). We examined the effect on rRNA gene dosage of transforming T.thermophila macronuclei with plasmid constructs containing a pair of tandemly repeated rDNA replication origin regions unlinked to the rRNA gene. A significant proportion of the plasmid sequences were maintained as high copy circular molecules, eventually consisting solely of tandem arrays of origin regions. As reported previously for cells transformed by a construct in which the same tandem rDNA origins were linked to the rRNA gene [Yu, G.-L. and Blackburn, E. H. (1990) Mol. Cell. Biol., 10, 2070-2080], origin sequences recombined to form linear molecules bearing several tandem repeats of the origin region, as well as rRNA genes. The total number of rDNA origin sequences eventually exceeded rRNA gene copies by approximately 20- to 40-fold and the number of circular replicons carrying only rDNA origin sequences exceeded rRNA gene copies by 2- to 3-fold. However, the rRNA gene dosage was unchanged. Hence, simply monitoring the total number of rDNA origin regions is not sufficient to regulate rRNA gene copy number. Images PMID:7784211
Enlightenment of Yeast Mitochondrial Homoplasmy: Diversified Roles of Gene Conversion

PubMed Central

Ling, Feng; Mikawa, Tsutomu; Shibata, Takehiko

2011-01-01

Mitochondria have their own genomic DNA. Unlike the nuclear genome, each cell contains hundreds to thousands of copies of mitochondrial DNA (mtDNA). The copies of mtDNA tend to have heterogeneous sequences, due to the high frequency of mutagenesis, but are quickly homogenized within a cell (“homoplasmy”) during vegetative cell growth or through a few sexual generations. Heteroplasmy is strongly associated with mitochondrial diseases, diabetes and aging. Recent studies revealed that the yeast cell has the machinery to homogenize mtDNA, using a common DNA processing pathway with gene conversion; i.e., both genetic events are initiated by a double-stranded break, which is processed into 3′ single-stranded tails. One of the tails is base-paired with the complementary sequence of the recipient double-stranded DNA to form a D-loop (homologous pairing), in which repair DNA synthesis is initiated to restore the sequence lost by the breakage. Gene conversion generates sequence diversity, depending on the divergence between the donor and recipient sequences, especially when it occurs among a number of copies of a DNA sequence family with some sequence variations, such as in immunoglobulin diversification in chicken. MtDNA can be regarded as a sequence family, in which the members tend to be diversified by a high frequency of spontaneous mutagenesis. Thus, it would be interesting to determine why and how double-stranded breakage and D-loop formation induce sequence homogenization in mitochondria and sequence diversification in nuclear DNA. We will review the mechanisms and roles of mtDNA homoplasmy, in contrast to nuclear gene conversion, which diversifies gene and genome sequences, to provide clues toward understanding how the common DNA processing pathway results in such divergent outcomes. PMID:24710143
Detection of the free living amoeba Naegleria fowleri by using conventional and real-time PCR based on a single copy DNA sequence.

PubMed

Régoudis, Estelle; Pélandakis, Michel

2016-02-01

The amoeba-flagellate Naegleria fowleri is a causative agent of primary amoebic meningoencephalitis (PAM). This thermophilic species occurs worldwide and tends to proliferate in warm aquatic environment. The PAM cases remain rare but this infection is mostly fatal. Here, we describe a single copy region which has been cloned and sequenced, and was used for both conventional and real-time PCR. Targeting a single-copy DNA sequence allows to directly quantify the N. fowleri cells. The real-time PCR results give a detection limit of 1 copy per reaction with high reproducibility without the need of a Taqman probe. This procedure is of interest as compared to other procedures which are mostly based on the detection of multi-copy DNA associated with a Taqman probe. Copyright © 2015 Elsevier Inc. All rights reserved.
nrDNA:mtDNA copy number ratios as a comparative metric for evolutionary and conservation genetics.

PubMed

Goodall-Copestake, William Paul

2018-05-12

Identifying genetic cues of functional relevance is key to understanding the drivers of evolution and increasingly important for the conservation of biodiversity. This study introduces nuclear ribosomal DNA (nrDNA) to mitochondrial DNA (mtDNA) copy number ratios as a metric with which to screen for this functional genetic variation prior to more extensive omics analyses. To illustrate the metric, quantitative PCR was used to estimate nrDNA (18S) to mtDNA (16S) copy number ratios in muscle tissue from samples of two zooplankton species: Salpa thompsoni caught near Elephant Island (Southern Ocean) and S. fusiformis sampled off Gough Island (South Atlantic). Average 18S:16S ratios in these samples were 9:1 and 3:1, respectively. nrDNA 45S arrays and mitochondrial genomes were then deep sequenced to uncover the sources of intra-individual genetic variation underlying these 18S:16S copy number differences. The deep sequencing profiles obtained were consistent with genetic changes resulting from adaptive processes, including an expansion of nrDNA and damage to mtDNA in S. thompsoni, potentially in response to the polar environment. Beyond this example from zooplankton, nrDNA:mtDNA copy number ratios offer a promising metric to help identify genetic variation of functional relevance in animals more broadly.
Iterated function systems for DNA replication

NASA Astrophysics Data System (ADS)

Gaspard, Pierre

2017-10-01

The kinetic equations of DNA replication are shown to be exactly solved in terms of iterated function systems, running along the template sequence and giving the statistical properties of the copy sequences, as well as the kinetic and thermodynamic properties of the replication process. With this method, different effects due to sequence heterogeneity can be studied, in particular, a transition between linear and sublinear growths in time of the copies, and a transition between continuous and fractal distributions of the local velocities of the DNA polymerase along the template. The method is applied to the human mitochondrial DNA polymerase γ without and with exonuclease proofreading.
Consequences of Normalizing Transcriptomic and Genomic Libraries of Plant Genomes Using a Duplex-Specific Nuclease and Tetramethylammonium Chloride

PubMed Central

Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard

2013-01-01

Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce. PMID:23409088
Consequences of normalizing transcriptomic and genomic libraries of plant genomes using a duplex-specific nuclease and tetramethylammonium chloride.

PubMed

Matvienko, Marta; Kozik, Alexander; Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard

2013-01-01

Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce.
Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data.

PubMed

Favero, F; Joshi, T; Marquard, A M; Birkbak, N J; Krzystanek, M; Li, Q; Szallasi, Z; Eklund, A C

2015-01-01

Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specimen, by intratumor heterogeneity, and by the sheer size of the raw data. In particular, determination of copy number variations from exome sequencing data alone has proven difficult; thus, single nucleotide polymorphism (SNP) arrays have often been used for this task. Recently, algorithms to estimate absolute, but not allele-specific, copy number profiles from tumor sequencing data have been described. We developed Sequenza, a software package that uses paired tumor-normal DNA sequencing data to estimate tumor cellularity and ploidy, and to calculate allele-specific copy number profiles and mutation profiles. We applied Sequenza, as well as two previously published algorithms, to exome sequence data from 30 tumors from The Cancer Genome Atlas. We assessed the performance of these algorithms by comparing their results with those generated using matched SNP arrays and processed by the allele-specific copy number analysis of tumors (ASCAT) algorithm. Comparison between Sequenza/exome and SNP/ASCAT revealed strong correlation in cellularity (Pearson's r = 0.90) and ploidy estimates (r = 0.42, or r = 0.94 after manual inspecting alternative solutions). This performance was noticeably superior to previously published algorithms. In addition, in artificial data simulating normal-tumor admixtures, Sequenza detected the correct ploidy in samples with tumor content as low as 30%. The agreement between Sequenza and SNP array-based copy number profiles suggests that exome sequencing alone is sufficient not only for identifying small scale mutations but also for estimating cellularity and inferring DNA copy number aberrations. © The Author 2014. Published by Oxford University Press on behalf of the European Society for Medical Oncology.
DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation.

PubMed

Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H; Proukakis, Christos

2017-01-01

Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array "waves", and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance.
DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation

PubMed Central

Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M.; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H.

2017-01-01

Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array “waves”, and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance. PMID:28683077
Single-copy gene detection using branched DNA (bDNA) in situ hybridization.

PubMed

Player, A N; Shen, L P; Kenny, D; Antao, V P; Kolberg, J A

2001-05-01

We have developed a branched DNA in situ hybridization (bDNA ISH) method for detection of human papillomavirus (HPV) DNA in whole cells. Using human cervical cancer cell lines with known copies of HPV DNA, we show that the bDNA ISH method is highly sensitive, detecting as few as one or two copies of HPV DNA per cell. By modifying sample pretreatment, viral mRNA or DNA sequences can be detected using the same set of oligonucleotide probes. In experiments performed on mixed populations of cells, the bDNA ISH method is highly specific and can distinguish cells with HPV-16 from cells with HPV-18 DNA. Furthermore, we demonstrate that the bDNA ISH method provides precise localization, yielding positive signals retained within the subcellular compartments in which the target nucleic acid sequences are localized. As an effective and convenient means for nucleic acid detection, the bDNA ISH method is applicable to the detection of cancers and infectious agents. (J Histochem Cytochem 49:603-611, 2001)
Preparation of 13C/15N-labeled oligomers using the polymerase chain reaction

DOEpatents

Chen, Xian; Gupta, Goutam; Bradbury, E. Morton

2001-01-01

Preparation of .sup.13 C/.sup.15 N-labeled DNA oligomers using the polymerase chain reaction (PCR). A PCR based method for uniform (.sup.13 C/.sup.15 N)-labeling of DNA duplexes is described. Multiple copies of a blunt-ended duplex are cloned into a plasmid, each copy containing the sequence of interest and restriction Hinc II sequences at both the 5' and 3' ends. PCR using bi-directional primers and uniformly .sup.13 C/.sup.15 N-labeled dNTP precursors generates labeled DNA duplexes containing multiple copies of the sequence of interest. Twenty-four cycles of PCR, followed by restriction and purification, gave the uniformly .sup.13 C/.sup.15 N-labeled duplex sequence with a 30% yield. Such labeled duplexes find significant applications in multinuclear magnetic resonance spectroscopy.
Evolutionary relationships among Pinus (Pinaceae) subsections inferred from multiple low-copy nuclear loci.

Treesearch

John Syring; Ann Willyard; Richard Cronn; Aaron Liston

2005-01-01

Sequence data from nrITS and cpDNA have failed to fully resolve phylogenetic relationships among Pinus species. Four low-copy nuclear genes, developed from the screening of 73 mapped conifer anchor loci, were sequenced from 12 species representing all subsections. Individual loci do not uniformly support either the nrITS or cpDNA hypotheses and in...
Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics.

PubMed

Straub, Shannon C K; Parks, Matthew; Weitemier, Kevin; Fishbein, Mark; Cronn, Richard C; Liston, Aaron

2012-02-01

Just as Sanger sequencing did more than 20 years ago, next-generation sequencing (NGS) is poised to revolutionize plant systematics. By combining multiplexing approaches with NGS throughput, systematists may no longer need to choose between more taxa or more characters. Here we describe a genome skimming (shallow sequencing) approach for plant systematics. Through simulations, we evaluated optimal sequencing depth and performance of single-end and paired-end short read sequences for assembly of nuclear ribosomal DNA (rDNA) and plastomes and addressed the effect of divergence on reference-guided plastome assembly. We also used simulations to identify potential phylogenetic markers from low-copy nuclear loci at different sequencing depths. We demonstrated the utility of genome skimming through phylogenetic analysis of the Sonoran Desert clade (SDC) of Asclepias (Apocynaceae). Paired-end reads performed better than single-end reads. Minimum sequencing depths for high quality rDNA and plastome assemblies were 40× and 30×, respectively. Divergence from the reference significantly affected plastome assembly, but relatively similar references are available for most seed plants. Deeper rDNA sequencing is necessary to characterize intragenomic polymorphism. The low-copy fraction of the nuclear genome was readily surveyed, even at low sequencing depths. Nearly 160000 bp of sequence from three organelles provided evidence of phylogenetic incongruence in the SDC. Adoption of NGS will facilitate progress in plant systematics, as whole plastome and rDNA cistrons, partial mitochondrial genomes, and low-copy nuclear markers can now be efficiently obtained for molecular phylogenetics studies.
Preselection of EGFR mutations in non-small-cell lung cancer patients by immunohistochemistry: comparison with DNA-sequencing, EGFR wild-type expression, gene copy number gain and clinicopathological data.

PubMed

Gaber, Rania; Watermann, Iris; Kugler, Christian; Vollmer, Ekkehard; Perner, Sven; Reck, Martin; Goldmann, Torsten

2017-01-01

Targeting epidermal growth factor receptor (EGFR) in patients with non-small-cell lung cancer (NSCLC) having EGFR mutations is associated with an improved overall survival. The aim of this study is to verify, if EGFR mutations detected by immunohistochemistry (IHC) is a convincing way to preselect patients for DNA-sequencing and to figure out, the statistical association between EGFR mutation, wild-type EGFR overexpression, gene copy number gain, which are the main factors inducing EGFR tumorigenic activity and the clinicopathological data. Two hundred sixteen tumor tissue samples of primarily chemotherapeutic naïve NSCLC patients were analyzed for EGFR mutations E746-A750del and L858R and correlated with DNA-sequencing. Two hundred six of which were assessed by IHC, using 6B6 and 43B2 specific antibodies followed by DNA-sequencing of positive cases and 10 already genotyped tumor tissues were also included to investigate debugging accuracy of IHC. In addition, EGFR wild-type overexpression was IHC evaluated and EGFR gene copy number determination was performed by fluorescence in situ hybridization (FISH). Forty-one÷206 (19.9%) cases were positive for mutated EGFR by IHC. Eight of them had EGFR mutations of exons 18-21 by DNA-sequencing. Hit rate of 10 already genotyped NSCLC mutated cases was 90% by IHC. Positive association was found between EGFR mutations determined by IHC and both EGFR overexpression and increased gene copy number (p=0.002 and p<0.001, respectively). Additionally, positive association was detected between EGFR mutations, high tumor grade and clinical stage (p<0.001). IHC staining with mutation specific antibodies was demonstrated as a possible useful screening test to preselect patients for DNA-sequencing.

Acquisition of New DNA Sequences After Infection of Chicken Cells with Avian Myeloblastosis Virus

PubMed Central

Shoyab, M.; Baluda, M. A.; Evans, R.

1974-01-01

DNA-RNA hybridization studies between 70S RNA from avian myeloblastosis virus (AMV) and an excess of DNA from (i) AMV-induced leukemic chicken myeloblasts or (ii) a mixture of normal and of congenitally infected K-137 chicken embryos producing avian leukosis viruses revealed the presence of fast- and slow-hybridizing virus-specific DNA sequences. However, the leukemic cells contained twice the level of AMV-specific DNA sequences observed in normal chicken embryonic cells. The fast-reacting sequences were two to three times more numerous in leukemic DNA than in DNA from the mixed embryos. The slow-reacting sequences had a reiteration frequency of approximately 9 and 6, in the two respective systems. Both the fast- and the slow-reacting DNA sequences in leukemic cells exhibited a higher Tm (2 C) than the respective DNA sequences in normal cells. In normal and leukemic cells the slow hybrid sequences appeared to have a Tm which was 2 C higher than that of the fast hybrid sequences. Individual non-virus-producing chicken embryos, either group-specific antigen positive or negative, contained 40 to 100 copies of the fast sequences and 2 to 6 copies of the slowly hybridizing sequences per cell genome. Normal rat cells did not contain DNA that hybridized with AMV RNA, whereas non-virus-producing rat cells transformed by B-77 avian sarcoma virus contained only the slowly reacting sequences. The results demonstrate that leukemic cells transformed by AMV contain new AMV-specific DNA sequences which were not present before infection. PMID:16789139
The DNA of ciliated protozoa.

PubMed Central

Prescott, D M

1994-01-01

Ciliates contain two types of nuclei: a micronucleus and a macronucleus. The micronucleus serves as the germ line nucleus but does not express its genes. The macronucleus provides the nuclear RNA for vegetative growth. Mating cells exchange haploid micronuclei, and a new macronucleus develops from a new diploid micronucleus. The old macronucleus is destroyed. This conversion consists of amplification, elimination, fragmentation, and splicing of DNA sequences on a massive scale. Fragmentation produces subchromosomal molecules in Tetrahymena and Paramecium cells and much smaller, gene-sized molecules in hypotrichous ciliates to which telomere sequences are added. These molecules are then amplified, some to higher copy numbers than others. rDNA is differentially amplified to thousands of copies per macronucleus. Eliminated sequences include transposonlike elements and sequences called internal eliminated sequences that interrupt gene coding regions in the micronuclear genome. Some, perhaps all, of these are excised as circular molecules and destroyed. In at least some hypotrichs, segments of some micronuclear genes are scrambled in a nonfunctional order and are recorded during macronuclear development. Vegetatively growing ciliates appear to possess a mechanism for adjusting copy numbers of individual genes, which corrects gene imbalances resulting from random distribution of DNA molecules during amitosis of the macronucleus. Other distinctive features of ciliate DNA include an altered use of the conventional stop codons. Images PMID:8078435
Detection of Low-Copy-Number Genomic DNA Sequences in Individual Bacterial Cells by Using Peptide Nucleic Acid-Assisted Rolling-Circle Amplification and Fluorescence In Situ Hybridization▿ †

PubMed Central

Smolina, Irina; Lee, Charles; Frank-Kamenetskii, Maxim

2007-01-01

An approach is proposed for in situ detection of short signature DNA sequences present in single copies per bacterial genome. The site is locally opened by peptide nucleic acids, and a circular oligonucleotide is assembled. The amplicon generated by rolling circle amplification is detected by hybridization with fluorescently labeled decorator probes. PMID:17293504
Evidence for a Complex Class of Nonadenylated mRNA in Drosophila

PubMed Central

Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.

1980-01-01

The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 x 106 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 x 107 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 x 104, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246
Plasmid P1 replication: negative control by repeated DNA sequences.

PubMed Central

Chattoraj, D; Cordes, K; Abeles, A

1984-01-01

The incompatibility locus, incA, of the unit-copy plasmid P1 is contained within a fragment that is essentially a set of nine 19-base-pair repeats. One or more copies of the fragment destabilizes the plasmid when present in trans. Here we show that extra copies of incA interfere with plasmid DNA replication and that a deletion of most of incA increases plasmid copy number. Thus, incA is not essential for replication but is required for its control. When cloned in a high-copy-number vector, pieces of the incA fragment that each contain only three repeats destabilize P1 plasmids efficiently. This result makes it unlikely that incA specifies a regulatory product. Our in vivo results suggest that the repeating DNA sequence itself negatively controls replication by titrating a P1-determined protein, RepA, that is essential for replication. Consistent with this hypothesis is the observation that the RepA protein binds to the incA fragment in vitro. Images PMID:6387706
Effect of sustained elevated temperature prior to amplification on template copy number estimation using digital polymerase chain reaction.

PubMed

Bhat, Somanath; McLaughlin, Jacob L H; Emslie, Kerry R

2011-02-21

Digital polymerase chain reaction (dPCR) has the potential to enable accurate quantification of target DNA copy number provided that all target DNA molecules are successfully amplified. Following duplex dPCR analysis from a linear DNA target sequence that contains single copies of two independent template sequences, we have observed that amplification of both templates in a single partition does not always occur. To investigate this finding, we heated the target DNA solution to 95 °C for increasing time intervals and then immediately chilled on ice prior to preparing the dPCR mix. We observed an exponential decline in estimated copy number (R(2)≥ 0.98) of the two template sequences when amplified from either a linearized plasmid or a 388 base pair (bp) amplicon containing the same two template sequences. The distribution of amplifiable templates and the final concentration (copies per µL) were both affected by heat treatment of the samples at 95 °C from 0 s to 30 min. The proportion of target sequences from which only one of the two templates was amplified in a single partition (either 1507 or hmg only) increased over time, while the proportion of target sequences where both templates were amplified (1507 and hmg) in each individual partition declined rapidly from 94% to 52% (plasmid) and 88% to 31% (388 bp amplicon) suggesting an increase in number of targets from which both templates no longer amplify. A 10 min incubation at 95 °C reduced the initial amplifiable template concentration of the plasmid and the 388 bp amplicon by 59% and 91%, respectively. To determine if a similar decrease in amplifiable target occurs during the default pre-activation step of typical PCR amplification protocol, we used mastermixes with a 20 s or 10 min hot-start. The choice of mastermix and consequent pre-activation time did not affect the estimated plasmid concentration. Therefore, we conclude that prolonged exposure of this DNA template to elevated temperatures could lead to significant bias in dPCR measurements. However, care must be taken when designing PCR and non-PCR based experiments by reducing exposure of the DNA template to sustained elevated temperatures in order to improve accuracy in copy number estimation and concentration determination.
DNA replication stress restricts ribosomal DNA copy number.

PubMed

Salim, Devika; Bradford, William D; Freeland, Amy; Cady, Gillian; Wang, Jianmin; Pruitt, Steven C; Gerton, Jennifer L

2017-09-01

Ribosomal RNAs (rRNAs) in budding yeast are encoded by ~100-200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA) locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how "normal" copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2)-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a "normal" rDNA copy number.
Application of Stochastic Labeling with Random-Sequence Barcodes for Simultaneous Quantification and Sequencing of Environmental 16S rRNA Genes.

PubMed

Hoshino, Tatsuhiko; Inagaki, Fumio

2017-01-01

Next-generation sequencing (NGS) is a powerful tool for analyzing environmental DNA and provides the comprehensive molecular view of microbial communities. For obtaining the copy number of particular sequences in the NGS library, however, additional quantitative analysis as quantitative PCR (qPCR) or digital PCR (dPCR) is required. Furthermore, number of sequences in a sequence library does not always reflect the original copy number of a target gene because of biases caused by PCR amplification, making it difficult to convert the proportion of particular sequences in the NGS library to the copy number using the mass of input DNA. To address this issue, we applied stochastic labeling approach with random-tag sequences and developed a NGS-based quantification protocol, which enables simultaneous sequencing and quantification of the targeted DNA. This quantitative sequencing (qSeq) is initiated from single-primer extension (SPE) using a primer with random tag adjacent to the 5' end of target-specific sequence. During SPE, each DNA molecule is stochastically labeled with the random tag. Subsequently, first-round PCR is conducted, specifically targeting the SPE product, followed by second-round PCR to index for NGS. The number of random tags is only determined during the SPE step and is therefore not affected by the two rounds of PCR that may introduce amplification biases. In the case of 16S rRNA genes, after NGS sequencing and taxonomic classification, the absolute number of target phylotypes 16S rRNA gene can be estimated by Poisson statistics by counting random tags incorporated at the end of sequence. To test the feasibility of this approach, the 16S rRNA gene of Sulfolobus tokodaii was subjected to qSeq, which resulted in accurate quantification of 5.0 × 103 to 5.0 × 104 copies of the 16S rRNA gene. Furthermore, qSeq was applied to mock microbial communities and environmental samples, and the results were comparable to those obtained using digital PCR and relative abundance based on a standard sequence library. We demonstrated that the qSeq protocol proposed here is advantageous for providing less-biased absolute copy numbers of each target DNA with NGS sequencing at one time. By this new experiment scheme in microbial ecology, microbial community compositions can be explored in more quantitative manner, thus expanding our knowledge of microbial ecosystems in natural environments.
Multiple horizontal transfers of nuclear ribosomal genes between phylogenetically distinct grass lineages.

PubMed

Mahelka, Václav; Krak, Karol; Kopecký, David; Fehrer, Judith; Šafář, Jan; Bartoš, Jan; Hobza, Roman; Blavet, Nicolas; Blattner, Frank R

2017-02-14

The movement of nuclear DNA from one vascular plant species to another in the absence of fertilization is thought to be rare. Here, nonnative rRNA gene [ribosomal DNA (rDNA)] copies were identified in a set of 16 diploid barley ( Hordeum ) species; their origin was traceable via their internal transcribed spacer (ITS) sequence to five distinct Panicoideae genera, a lineage that split from the Pooideae about 60 Mya. Phylogenetic, cytogenetic, and genomic analyses implied that the nonnative sequences were acquired between 1 and 5 Mya after a series of multiple events, with the result that some current Hordeum sp. individuals harbor up to five different panicoid rDNA units in addition to the native Hordeum rDNA copies. There was no evidence that any of the nonnative rDNA units were transcribed; some showed indications of having been silenced via pseudogenization. A single copy of a Panicum sp. rDNA unit present in H. bogdanii had been interrupted by a native transposable element and was surrounded by about 70 kbp of mostly noncoding sequence of panicoid origin. The data suggest that horizontal gene transfer between vascular plants is not a rare event, that it is not necessarily restricted to one or a few genes only, and that it can be selectively neutral.
Methylation patterns of repetitive DNA sequences in germ cells of Mus musculus.

PubMed

Sanford, J; Forrester, L; Chapman, V; Chandley, A; Hastie, N

1984-03-26

The major and the minor satellite sequences of Mus musculus were undermethylated in both sperm and oocyte DNAs relative to the amount of undermethylation observed in adult somatic tissue DNA. This hypomethylation was specific for satellite sequences in sperm DNA. Dispersed repetitive and low copy sequences show a high degree of methylation in sperm DNA; however, a dispersed repetitive sequence was undermethylated in oocyte DNA. This finding suggests a difference in the amount of total genomic DNA methylation between sperm and oocyte DNA. The methylation levels of the minor satellite sequences did not change during spermiogenesis, and were not associated with the onset of meiosis or a specific stage in sperm development.
Improved PCR-Based Detection of Soil Transmitted Helminth Infections Using a Next-Generation Sequencing Approach to Assay Design.

PubMed

Pilotte, Nils; Papaiakovou, Marina; Grant, Jessica R; Bierwert, Lou Ann; Llewellyn, Stacey; McCarthy, James S; Williams, Steven A

2016-03-01

The soil transmitted helminths are a group of parasitic worms responsible for extensive morbidity in many of the world's most economically depressed locations. With growing emphasis on disease mapping and eradication, the availability of accurate and cost-effective diagnostic measures is of paramount importance to global control and elimination efforts. While real-time PCR-based molecular detection assays have shown great promise, to date, these assays have utilized sub-optimal targets. By performing next-generation sequencing-based repeat analyses, we have identified high copy-number, non-coding DNA sequences from a series of soil transmitted pathogens. We have used these repetitive DNA elements as targets in the development of novel, multi-parallel, PCR-based diagnostic assays. Utilizing next-generation sequencing and the Galaxy-based RepeatExplorer web server, we performed repeat DNA analysis on five species of soil transmitted helminths (Necator americanus, Ancylostoma duodenale, Trichuris trichiura, Ascaris lumbricoides, and Strongyloides stercoralis). Employing high copy-number, non-coding repeat DNA sequences as targets, novel real-time PCR assays were designed, and assays were tested against established molecular detection methods. Each assay provided consistent detection of genomic DNA at quantities of 2 fg or less, demonstrated species-specificity, and showed an improved limit of detection over the existing, proven PCR-based assay. The utilization of next-generation sequencing-based repeat DNA analysis methodologies for the identification of molecular diagnostic targets has the ability to improve assay species-specificity and limits of detection. By exploiting such high copy-number repeat sequences, the assays described here will facilitate soil transmitted helminth diagnostic efforts. We recommend similar analyses when designing PCR-based diagnostic tests for the detection of other eukaryotic pathogens.
Using next-generation sequencing for high resolution multiplex analysis of copy number variation from nanogram quantities of DNA from formalin-fixed paraffin-embedded specimens.

PubMed

Wood, Henry M; Belvedere, Ornella; Conway, Caroline; Daly, Catherine; Chalkley, Rebecca; Bickerdike, Melissa; McKinley, Claire; Egan, Phil; Ross, Lisa; Hayward, Bruce; Morgan, Joanne; Davidson, Leslie; MacLennan, Ken; Ong, Thian K; Papagiannopoulos, Kostas; Cook, Ian; Adams, David J; Taylor, Graham R; Rabbitts, Pamela

2010-08-01

The use of next-generation sequencing technologies to produce genomic copy number data has recently been described. Most approaches, however, reply on optimal starting DNA, and are therefore unsuitable for the analysis of formalin-fixed paraffin-embedded (FFPE) samples, which largely precludes the analysis of many tumour series. We have sought to challenge the limits of this technique with regards to quality and quantity of starting material and the depth of sequencing required. We confirm that the technique can be used to interrogate DNA from cell lines, fresh frozen material and FFPE samples to assess copy number variation. We show that as little as 5 ng of DNA is needed to generate a copy number karyogram, and follow this up with data from a series of FFPE biopsies and surgical samples. We have used various levels of sample multiplexing to demonstrate the adjustable resolution of the methodology, depending on the number of samples and available resources. We also demonstrate reproducibility by use of replicate samples and comparison with microarray-based comparative genomic hybridization (aCGH) and digital PCR. This technique can be valuable in both the analysis of routine diagnostic samples and in examining large repositories of fixed archival material.
Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes

PubMed Central

Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney

2012-01-01

RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amendable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676
Coprolites as a source of information on the genome and diet of the cave hyena

PubMed Central

Bon, Céline; Berthonaud, Véronique; Maksud, Frédéric; Labadie, Karine; Poulain, Julie; Artiguenave, François; Wincker, Patrick; Aury, Jean-Marc; Elalouf, Jean-Marc

2012-01-01

We performed high-throughput sequencing of DNA from fossilized faeces to evaluate this material as a source of information on the genome and diet of Pleistocene carnivores. We analysed coprolites derived from the extinct cave hyena (Crocuta crocuta spelaea), and sequenced 90 million DNA fragments from two specimens. The DNA reads enabled a reconstruction of the cave hyena mitochondrial genome with up to a 158-fold coverage. This genome, and those sequenced from extant spotted (Crocuta crocuta) and striped (Hyaena hyaena) hyena specimens, allows for the establishment of a robust phylogeny that supports a close relationship between the cave and the spotted hyena. We also demonstrate that high-throughput sequencing yields data for cave hyena multi-copy and single-copy nuclear genes, and that about 50 per cent of the coprolite DNA can be ascribed to this species. Analysing the data for additional species to indicate the cave hyena diet, we retrieved abundant sequences for the red deer (Cervus elaphus), and characterized its mitochondrial genome with up to a 3.8-fold coverage. In conclusion, we have demonstrated the presence of abundant ancient DNA in the coprolites surveyed. Shotgun sequencing of this material yielded a wealth of DNA sequences for a Pleistocene carnivore and allowed unbiased identification of diet. PMID:22456883
Plasma DNA tissue mapping by genome-wide methylation sequencing for noninvasive prenatal, cancer, and transplantation assessments

PubMed Central

Sun, Kun; Jiang, Peiyong; Chan, K. C. Allen; Wong, John; Cheng, Yvonne K. Y.; Liang, Raymond H. S.; Chan, Wai-kong; Ma, Edmond S. K.; Chan, Stephen L.; Cheng, Suk Hang; Chan, Rebecca W. Y.; Tong, Yu K.; Ng, Simon S. M.; Wong, Raymond S. M.; Hui, David S. C.; Leung, Tse Ngong; Leung, Tak Y.; Lai, Paul B. S.; Chiu, Rossa W. K.; Lo, Yuk Ming Dennis

2015-01-01

Plasma consists of DNA released from multiple tissues within the body. Using genome-wide bisulfite sequencing of plasma DNA and deconvolution of the sequencing data with reference to methylation profiles of different tissues, we developed a general approach for studying the major tissue contributors to the circulating DNA pool. We tested this method in pregnant women, patients with hepatocellular carcinoma, and subjects following bone marrow and liver transplantation. In most subjects, white blood cells were the predominant contributors to the circulating DNA pool. The placental contributions in the plasma of pregnant women correlated with the proportional contributions as revealed by fetal-specific genetic markers. The graft-derived contributions to the plasma in the transplant recipients correlated with those determined using donor-specific genetic markers. Patients with hepatocellular carcinoma showed elevated plasma DNA contributions from the liver, which correlated with measurements made using tumor-associated copy number aberrations. In hepatocellular carcinoma patients and in pregnant women exhibiting copy number aberrations in plasma, comparison of methylation deconvolution results using genomic regions with different copy number status pinpointed the tissue type responsible for the aberrations. In a pregnant woman diagnosed as having follicular lymphoma during pregnancy, methylation deconvolution indicated a grossly elevated contribution from B cells into the plasma DNA pool and localized B cells as the origin of the copy number aberrations observed in plasma. This method may serve as a powerful tool for assessing a wide range of physiological and pathological conditions based on the identification of perturbed proportional contributions of different tissues into plasma. PMID:26392541
A novel species-specific tandem repeat DNA family from Sinapis arvensis: detection of telomere-like sequences.

PubMed

Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M

1996-08-01

DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.
Ultra-barcoding in cacao (Theobroma spp.; malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA

USDA-ARS?s Scientific Manuscript database

High-throughput next-generation sequencing was used to scan the genome and generate reliable sequence of high copy number regions. Using this method, we examined whole plastid genomes as well as nearly 6000 bases of nuclear ribosomal DNA sequences for nine genotypes of Theobroma cacao and an indivi...
Complexity and Entropy Analysis of DNMT1 Gene

USDA-ARS?s Scientific Manuscript database

Background: The application of complexity information on DNA sequence and protein in biological processes are well established in this study. Available sequences for DNMT1 gene, which is a maintenance methyltransferase is responsible for copying DNA methylation patterns to the daughter strands durin...
Replication and meiotic transmission of yeast ribosomal RNA genes.

PubMed

Brewer, B J; Zakian, V A; Fangman, W L

1980-11-01

The yeast Saccharomyces cerevisiae has approximately 120 genes for the ribosomal RNAs (rDNA) which are organized in tandem within chromosomal DNA. These multiple-copy genes are homogeneous in sequence but can undergo changes in copy number and topology. To determine if these changes reflect unusual features of rDNA metabolism, we have examined both the replication of rDNA in the mitotic cell cycle and the inheritance of rDNA during meiosis. The results indicate that rDNA behaves identically to chromosomal DNA: each rDNA unit is replicated once during the S phase of each cell cycle and each unit is conserved through meiosis. Therefore, the flexibility in copy number and topology of rDNA does not arise from the selective replication of units in each S phase nor by the selective inheritance of units in meiosis.
DNA replication stress restricts ribosomal DNA copy number

PubMed Central

Salim, Devika; Bradford, William D.; Freeland, Amy; Cady, Gillian; Wang, Jianmin

2017-01-01

Ribosomal RNAs (rRNAs) in budding yeast are encoded by ~100–200 repeats of a 9.1kb sequence arranged in tandem on chromosome XII, the ribosomal DNA (rDNA) locus. Copy number of rDNA repeat units in eukaryotic cells is maintained far in excess of the requirement for ribosome biogenesis. Despite the importance of the repeats for both ribosomal and non-ribosomal functions, it is currently not known how “normal” copy number is determined or maintained. To identify essential genes involved in the maintenance of rDNA copy number, we developed a droplet digital PCR based assay to measure rDNA copy number in yeast and used it to screen a yeast conditional temperature-sensitive mutant collection of essential genes. Our screen revealed that low rDNA copy number is associated with compromised DNA replication. Further, subculturing yeast under two separate conditions of DNA replication stress selected for a contraction of the rDNA array independent of the replication fork blocking protein, Fob1. Interestingly, cells with a contracted array grew better than their counterparts with normal copy number under conditions of DNA replication stress. Our data indicate that DNA replication stresses select for a smaller rDNA array. We speculate that this liberates scarce replication factors for use by the rest of the genome, which in turn helps cells complete DNA replication and continue to propagate. Interestingly, tumors from mini chromosome maintenance 2 (MCM2)-deficient mice also show a loss of rDNA repeats. Our data suggest that a reduction in rDNA copy number may indicate a history of DNA replication stress, and that rDNA array size could serve as a diagnostic marker for replication stress. Taken together, these data begin to suggest the selective pressures that combine to yield a “normal” rDNA copy number. PMID:28915237

The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).

PubMed

Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai

2014-12-01

The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. COI gene begins with GTG as start codon, whereas other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as stop codon.
The repeating nucleotide sequence in the repetitive mitochondrial DNA from a "low-density" petite mutant of yeast.

PubMed Central

Van Kreijl, C F; Bos, J L

1977-01-01

The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were use. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequence so far. Images PMID:198740
ATP hydrolysis provides functions that promote rejection of pairings between different copies of long repeated sequences

PubMed Central

Danilowicz, Claudia; Hermans, Laura; Coljee, Vincent; Prévost, Chantal

2017-01-01

Abstract During DNA recombination and repair, RecA family proteins must promote rapid joining of homologous DNA. Repeated sequences with >100 base pair lengths occupy more than 1% of bacterial genomes; however, commitment to strand exchange was believed to occur after testing ∼20–30 bp. If that were true, pairings between different copies of long repeated sequences would usually become irreversible. Our experiments reveal that in the presence of ATP hydrolysis even 75 bp sequence-matched strand exchange products remain quite reversible. Experiments also indicate that when ATP hydrolysis is present, flanking heterologous dsDNA regions increase the reversibility of sequence matched strand exchange products with lengths up to ∼75 bp. Results of molecular dynamics simulations provide insight into how ATP hydrolysis destabilizes strand exchange products. These results inspired a model that shows how pairings between long repeated sequences could be efficiently rejected even though most homologous pairings form irreversible products. PMID:28854739
BATTLE: Biomarker-Based Approaches of Targeted Therapy for Lung Cancer Elimination

DTIC Science & Technology

2008-04-01

although a grade 3 neutropenia was dose-limiting in one importance. Th th ubstrate of the CYP3A4 isoenzyme and P-gp. Its metabolism is sensitive to...tratification in clinis Molecular Pathway Biomarkers Type of Analysis EGFR EGFR Mutation ( exons 18 to 21) DNA sequencing EGFR Increased Copy Number...polysomy/am 1plification) DNA FISH K-Ras/B-Raf K-RAS Mutation (codons 12,13, 61) DNA sequencing B-RAF Mutations ( exons 11 and 15) DNA sequencing
Quantitative analysis of herpes virus sequences from normal tissue and fibropapillomas of marine turtles with real-time PCR

USGS Publications Warehouse

Quackenbush, S.L.; Casey, R.N.; Murcek, R.J.; Paul, T.A.; Work, Thierry M.; Limpus, C.J.; Chaves, A.; duToit, L.; Perez, J.V.; Aguirre, A.A.; Spraker, T.R.; Horrocks, J.A.; Vermeer, L.A.; Balazs, G.S.; Casey, J.W.

2001-01-01

Quantitative real-time PCR has been used to measure fibropapilloma-associated turtle herpesvirus (FPTHV) pol DNA loads in fibropapillomas, fibromas, and uninvolved tissues of green, loggerhead, and olive ridley turtles from Hawaii, Florida, Costa Rica, Australia, Mexico, and the West Indies. The viral DNA loads from tumors obtained from terminal animals were relatively homogenous (range 2a??20 copies/cell), whereas DNA copy numbers from biopsied tumors and skin of otherwise healthy turtles displayed a wide variation (range 0.001a??170 copies/cell) and may reflect the stage of tumor development. FPTHV DNA loads in tumors were 2.5a??4.5 logs higher than in uninvolved skin from the same animal regardless of geographic location, further implying a role for FPTHV in the etiology of fibropapillomatosis. Although FPTHV pol sequences amplified from tumors are highly related to each other, single signature amino acid substitutions distinguish the Australia/Hawaii, Mexico/Costa Rica, and Florida/Caribbean groups.
IDLN-MSP: Idiolocal normalization of real-time methylation-specific PCR for genetic imbalanced DNA specimens.

PubMed

Santourlidis, Simeon; Ghanjati, Foued; Beermann, Agnes; Hermanns, Thomas; Poyet, Cédric

2016-02-01

Sensitive, accurate, and reliable measurements of tumor cell-specific DNA methylation changes are of fundamental importance in cancer diagnosis, prognosis, and monitoring. Real-time methylation-specific PCR (MSP) using intercalating dyes is an established method of choice for this purpose. Here we present a simple but crucial adaptation of this widely applied method that overcomes a major obstacle: genetic abnormalities in the DNA samples, such as aneuploidy or copy number variations, that could result in inaccurate results due to improper normalization if the copy numbers of the target and reference sequences are not the same. In our idiolocal normalization (IDLN) method, the locus for the normalizing, methylation-independent reference amplification is chosen close to the locus of the methylation-dependent target amplification. This ensures that the copy numbers of both the target and reference sequences will be identical in most cases if they are close enough to each other, resulting in accurate normalization and reliable comparative measurements of DNA methylation in clinical samples when using real-time MSP.
Insights on genome size evolution from a miniature inverted repeat transposon driving a satellite DNA.

PubMed

Scalvenzi, Thibault; Pollet, Nicolas

2014-12-01

The genome size in eukaryotes does not correlate well with the number of genes they contain. We can observe this so-called C-value paradox in amphibian species. By analyzing an amphibian genome we asked how repetitive DNA can impact genome size and architecture. We describe here our discovery of a Tc1/mariner miniature inverted-repeat transposon family present in Xenopus frogs. These transposons named miDNA4 are unique since they contain a satellite DNA motif. We found that miDNA4 measured 331 bp, contained 25 bp long inverted terminal repeat sequences and a sequence motif of 119 bp present as a unique copy or as an array of 2-47 copies. We characterized the structure, dynamics, impact and evolution of the miDNA4 family and its satellite DNA in Xenopus frog genomes. This led us to propose a model for the evolution of these two repeated sequences and how they can synergize to increase genome size. Copyright © 2014 Elsevier Inc. All rights reserved.
Evaluation of the authenticity of a highly novel environmental sequence from boreal forest soil using ribosomal RNA secondary structure modeling

Treesearch

D.J. Glass; N. Takebayashi; L. Olson; D.L. Taylor

2013-01-01

The number of sequences from both formally described taxa and uncultured environmental DNA deposited in the International Nucleotide Sequence Databases has increased substantially over the last two decades. Although the majority of these sequences represent authentic gene copies, there is evidence of DNA artifacts in these databases as well. These include lab artifacts...
Two circular chromosomes of unequal copy number make up the mitochondrial genome of the rotifer Brachionus plicatilis.

PubMed

Suga, Koushirou; Mark Welch, David B; Tanaka, Yukari; Sakakura, Yoshitaka; Hagiwara, Atsushi

2008-06-01

The monogonont rotifer Brachionus plicatilis is an emerging model system for a diverse array of questions in limnological ecosystem dynamics, the evolution of sexual recombination, cryptic speciation, and the phylogeny of basal metazoans. We sequenced the complete mitochondrial genome of B. plicatilis sensu strictu NH1L and found that it is composed of 2 circular chromosomes, designated mtDNA-I (11,153 bp) and mtDNA-II (12,672 bp). Hybridization to DNA isolated from mitochondria demonstrated that mtDNA-I is present at 4 times the copy number of mtDNA-II. The only nucleotide similarity between the 2 chromosomes is a 4.9-kbp region of 99.5% identity including a transfer RNA (tRNA) gene and an extensive noncoding region that contains putative D-loop and control sequence. The mtDNA-I chromosome encodes 4 proteins (ATP6, COB, NAD1, and NAD2), 13 tRNAs, and the large and small subunit ribosomal RNAs; mtDNA-II encodes 8 proteins (COX1-3, NAD3-6, and NAD4L) and 9 tRNAs. Gene order is not conserved between B. plicatilis and its closest relative with a sequenced mitochondrial genome, the acanthocephalan Leptorhynchoides thecatus, or other sequenced mitochondrial genomes. Polymerase chain reaction assays and Southern hybridization to DNA from 18 strains of Brachionus suggest that the 2-chromosome structure has been stable for millions of years. The novel organization of the B. plicatilis mitochondrial genome into 2 nearly equal chromosomes of 4-fold different copy number may provide insight into the evolution of metazoan mitochondria and the phylogenetics of rotifers and other basal animal phyla.
Repeated sequence sets in mitochondrial DNA molecules of root knot nematodes (Meloidogyne): nucleotide sequences, genome location and potential for host-race identification.

PubMed Central

Okimoto, R; Chamberlin, H M; Macfarlane, J L; Wolstenholme, D R

1991-01-01

Within a 7 kb segment of the mtDNA molecule of the root knot nematode, Meloidogyne javanica, that lacks standard mitochondrial genes, are three sets of strictly tandemly arranged, direct repeat sequences: approximately 36 copies of a 102 ntp sequence that contains a TaqI site; 11 copies of a 63 ntp sequence, and 5 copies of an 8 ntp sequence. The 7 kb repeat-containing segment is bounded by putative tRNAasp and tRNAf-met genes and the arrangement of sequences within this segment is: the tRNAasp gene; a unique 1,528 ntp segment that contains two highly stable hairpin-forming sequences; the 102 ntp repeat set; the 8 ntp repeat set; a unique 1,068 ntp segment; the 63 ntp repeat set; and the tRNAf-met gene. The nucleotide sequences of the 102 ntp copies and the 63 ntp copies have been conserved among the species examined. Data from Southern hybridization experiments indicate that 102 ntp and 63 ntp repeats occur in the mtDNAs of three, two and two races of M.incognita, M.hapla and M.arenaria, respectively. Nucleotide sequences of the M.incognita Race-3 102 ntp repeat were found to be either identical or highly similar to those of the M.javanica 102 ntp repeat. Differences in migration distance and number of 102 ntp repeat-containing bands seen in Southern hybridization autoradiographs of restriction-digested mtDNAs of M.javanica and the different host races of M.incognita, M.hapla and M.arenaria are sufficient to distinguish the different host races of each species. Images PMID:2027769
Analysis of sequence variability in the macronuclear DNA of Paramecium tetraurelia: A somatic view of the germline

PubMed Central

Duret, Laurent; Cohen, Jean; Jubin, Claire; Dessen, Philippe; Goût, Jean-François; Mousset, Sylvain; Aury, Jean-Marc; Jaillon, Olivier; Noël, Benjamin; Arnaiz, Olivier; Bétermier, Mireille; Wincker, Patrick; Meyer, Eric; Sperling, Linda

2008-01-01

Ciliates are the only unicellular eukaryotes known to separate germinal and somatic functions. Diploid but silent micronuclei transmit the genetic information to the next sexual generation. Polyploid macronuclei express the genetic information from a streamlined version of the genome but are replaced at each sexual generation. The macronuclear genome of Paramecium tetraurelia was recently sequenced by a shotgun approach, providing access to the gene repertoire. The 72-Mb assembly represents a consensus sequence for the somatic DNA, which is produced after sexual events by reproducible rearrangements of the zygotic genome involving elimination of repeated sequences, precise excision of unique-copy internal eliminated sequences (IES), and amplification of the cellular genes to high copy number. We report use of the shotgun sequencing data (>106 reads representing 13× coverage of a completely homozygous clone) to evaluate variability in the somatic DNA produced by these developmental genome rearrangements. Although DNA amplification appears uniform, both of the DNA elimination processes produce sequence heterogeneity. The variability that arises from IES excision allowed identification of hundreds of putative new IESs, compared to 42 that were previously known, and revealed cases of erroneous excision of segments of coding sequences. We demonstrate that IESs in coding regions are under selective pressure to introduce premature termination of translation in case of excision failure. PMID:18256234
Nature and distribution of feline sarcoma virus nucleotide sequences.

PubMed Central

Frankel, A E; Gilbert, J H; Porzig, K J; Scolnick, E M; Aaronson, S A

1979-01-01

The genomes of three independent isolates of feline sarcoma virus (FeSV) were compared by molecular hybridization techniques. Using complementary DNAs prepared from two strains, SM- and ST-FeSV, common complementary DNA'S were selected by sequential hybridization to FeSV and feline leukemia virus RNAs. These DNAs were shown to be highly related among the three independent sarcoma virus isolates. FeSV-specific complementary DNAs were prepared by selection for hybridization by the homologous FeSV RNA and against hybridization by fline leukemia virus RNA. Sarcoma virus-specific sequences of SM-FeSV were shown to differ from those of either ST- or GA-FeSV strains, whereas ST-FeSV-specific DNA shared extensive sequence homology with GA-FeSV. By molecular hybridization, each set of FeSV-specific sequences was demonstrated to be present in normal cat cellular DNA in approximately one copy per haploid genome and was conserved throughout Felidae. In contrast, FeSV-common sequences were present in multiple DNA copies and were found only in Mediterranean cats. The present results are consistent with the concept that each FeSV strain has arisen by a mechanism involving recombination between feline leukemia virus and cat cellular DNA sequences, the latter represented within the cat genome in a manner analogous to that of a cellular gene. PMID:225544
Length Variation, Heteroplasmy and Sequence Divergence in the Mitochondrial DNA of Four Species of Sturgeon (Acipenser)

PubMed Central

Brown, J. R.; Beckenbach, K.; Beckenbach, A. T.; Smith, M. J.

1996-01-01

The extent of mtDNA length variation and heteroplasmy as well as DNA sequences of the control region and two tRNA genes were determined for four North American sturgeon species: Acipenser transmontanus, A. medirostris, A. fulvescens and A. oxyrhnychus. Across the Continental Divide, a division in the occurrence of length variation and heteroplasmy was observed that was concordant with species biogeography as well as with phylogenies inferred from restriction fragment length polymorphisms (RFLP) of whole mtDNA and pairwise comparisons of unique sequences of the control region. In all species, mtDNA length variation was due to repeated arrays of 78-82-bp sequences each containing a D-loop strand synthesis termination associated sequence (TAS). Individual repeats showed greater sequence conservation within individuals and species rather than between species, which is suggestive of concerted evolution. Differences in the frequencies of multiple copy genomes and heteroplasmy among the four species may be ascribed to differences in the rates of recurrent mutation. A mechanism that may offset the high rate of mutation for increased copy number is suggested on the basis that an increase in the number of functional TAS motifs might reduce the frequency of successfully initiated H-strand replications. PMID:8852850
The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).

PubMed

Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu

2017-05-01

The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined based on next-generation sequencing technologies in this study. The genome was 157 203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25 960 bp, which were separated by a large single-copy (LSC) region of 87 065 bp and a small single-copy (SSC) region of 18 218 bp, respectively. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. In these genes, eight harbored a single intron, and two (ycf3 and clpP) contained a couple of introns. The cpDNA AT content of S. hexandrum cpDNA is 61.5%.
DYZ1 arrays show sequence variation between the monozygotic males

PubMed Central

2014-01-01

Background Monozygotic twins (MZT) are an important resource for genetical studies in the context of normal and diseased genomes. In the present study we used DYZ1, a satellite fraction present in the form of tandem arrays on the long arm of the human Y chromosome, as a tool to uncover sequence variations between the monozygotic males. Results We detected copy number variation, frequent insertions and deletions within the sequences of DYZ1 arrays amongst all the three sets of twins used in the present study. MZT1b showed loss of 35 bp compared to that in 1a, whereas 2a showed loss of 31 bp compared to that in 2b. Similarly, 3b showed 10 bp insertion compared to that in 3a. MZT1a germline DNA showed loss of 5 bp and 1b blood DNA showed loss of 26 bp compared to that of 1a blood and 1b germline DNA, respectively. Of the 69 restriction sites detected in DYZ1 arrays, MboII, BsrI, TspEI and TaqI enzymes showed frequent loss and or gain amongst all the 3 pairs studied. MZT1 pair showed loss/gain of VspI, BsrDI, AgsI, PleI, TspDTI, TspEI, TfiI and TaqI restriction sites in both blood and germline DNA. All the three sets of MZT showed differences in the number of DYZ1 copies. FISH signals reflected somatic mosaicism of the DYZ1 copies across the cells. Conclusions DYZ1 showed both sequence and copy number variation between the MZT males. Sequence variation was also noticed between germline and blood DNA samples of the same individual as we observed at least in one set of sample. The result suggests that DYZ1 faithfully records all the genetical changes occurring after the twining which may be ascribed to the environmental factors. PMID:24495361
Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays

PubMed Central

Binder, Hans; Fasold, Mario; Glomb, Torsten

2009-01-01

Background Single nucleotide polymorphism (SNP) arrays are important tools widely used for genotyping and copy number estimation. This technology utilizes the specific affinity of fragmented DNA for binding to surface-attached oligonucleotide DNA probes. We analyze the variability of the probe signals of Affymetrix GeneChip SNP arrays as a function of the probe sequence to identify relevant sequence motifs which potentially cause systematic biases of genotyping and copy number estimates. Methodology/Principal Findings The probe design of GeneChip SNP arrays enables us to disentangle different sources of intensity modulations such as the number of mismatches per duplex, matched and mismatched base pairings including nearest and next-nearest neighbors and their position along the probe sequence. The effect of probe sequence was estimated in terms of triple-motifs with central matches and mismatches which include all 256 combinations of possible base pairings. The probe/target interactions on the chip can be decomposed into nearest neighbor contributions which correlate well with free energy terms of DNA/DNA-interactions in solution. The effect of mismatches is about twice as large as that of canonical pairings. Runs of guanines (G) and the particular type of mismatched pairings formed in cross-allelic probe/target duplexes constitute sources of systematic biases of the probe signals with consequences for genotyping and copy number estimates. The poly-G effect seems to be related to the crowded arrangement of probes which facilitates complex formation of neighboring probes with at minimum three adjacent G's in their sequence. Conclusions The applied method of “triple-averaging” represents a model-free approach to estimate the mean intensity contributions of different sequence motifs which can be applied in calibration algorithms to correct signal values for sequence effects. Rules for appropriate sequence corrections are suggested. PMID:19924253
In Vivo Control of CpG and Non-CpG DNA Methylation by DNA Methyltransferases

PubMed Central

Arand, Julia; Spieler, David; Karius, Tommy; Branco, Miguel R.; Meilinger, Daniela; Meissner, Alexander; Jenuwein, Thomas; Xu, Guoliang; Leonhardt, Heinrich; Wolf, Verena; Walter, Jörn

2012-01-01

The enzymatic control of the setting and maintenance of symmetric and non-symmetric DNA methylation patterns in a particular genome context is not well understood. Here, we describe a comprehensive analysis of DNA methylation patterns generated by high resolution sequencing of hairpin-bisulfite amplicons of selected single copy genes and repetitive elements (LINE1, B1, IAP-LTR-retrotransposons, and major satellites). The analysis unambiguously identifies a substantial amount of regional incomplete methylation maintenance, i.e. hemimethylated CpG positions, with variant degrees among cell types. Moreover, non-CpG cytosine methylation is confined to ESCs and exclusively catalysed by Dnmt3a and Dnmt3b. This sequence position–, cell type–, and region-dependent non-CpG methylation is strongly linked to neighboring CpG methylation and requires the presence of Dnmt3L. The generation of a comprehensive data set of 146,000 CpG dyads was used to apply and develop parameter estimated hidden Markov models (HMM) to calculate the relative contribution of DNA methyltransferases (Dnmts) for de novo and maintenance DNA methylation. The comparative modelling included wild-type ESCs and mutant ESCs deficient for Dnmt1, Dnmt3a, Dnmt3b, or Dnmt3a/3b, respectively. The HMM analysis identifies a considerable de novo methylation activity for Dnmt1 at certain repetitive elements and single copy sequences. Dnmt3a and Dnmt3b contribute de novo function. However, both enzymes are also essential to maintain symmetrical CpG methylation at distinct repetitive and single copy sequences in ESCs. PMID:22761581
Enzyme-Free Replication with Two or Four Bases.

PubMed

Richert, Clemens; Hänle, Elena

2018-05-20

All known forms of life encode their genetic information in a sequence of bases of a genetic polymer and produce copies of their genes via semiconservative replication. How this process started before polymerase enzymes had been evolved is unclear. Enzyme-free copying of short stretches of DNA or RNA sequence has been demonstrated, using activated nucleotides, but not replication. We have developed a methodology for replication. It involves extension with reversible termination, enzyme-free ligation, and strand capture and allowed us to monitor nucleotide incorporation for an entire helical turn of DNA, both during a first and a second round of copying. When tracking replication mass spectrometrically, we found that with all four bases (A/C/G/T) an 'error catastrophe' occurs, with the correct sequence being 'overwhelmed' by incorrect ones. When only C and G were used, approx. half of all daughter strands had the mass of the correct sequence after 20 nonenzymatic copying steps. We conclude that enzyme-free replication is more likely to be successful with the two strongly pairing bases, rather than all four bases of the genetic alphabet. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
CaMV-35S promoter sequence-specific DNA methylation in lettuce.

PubMed

Okumura, Azusa; Shimada, Asahi; Yamasaki, Satoshi; Horino, Takuya; Iwata, Yuji; Koizumi, Nozomu; Nishihara, Masahiro; Mishiba, Kei-ichiro

2016-01-01

We found 35S promoter sequence-specific DNA methylation in lettuce. Additionally, transgenic lettuce plants having a modified 35S promoter lost methylation, suggesting the modified sequence is subjected to the methylation machinery. We previously reported that cauliflower mosaic virus 35S promoter-specific DNA methylation in transgenic gentian (Gentiana triflora × G. scabra) plants occurs irrespective of the copy number and the genomic location of T-DNA, and causes strong gene silencing. To confirm whether 35S-specific methylation can occur in other plant species, transgenic lettuce (Lactuca sativa L.) plants with a single copy of the 35S promoter-driven sGFP gene were produced and analyzed. Among 10 lines of transgenic plants, 3, 4, and 3 lines showed strong, weak, and no expression of sGFP mRNA, respectively. Bisulfite genomic sequencing of the 35S promoter region showed hypermethylation at CpG and CpWpG (where W is A or T) sites in 9 of 10 lines. Gentian-type de novo methylation pattern, consisting of methylated cytosines at CpHpH (where H is A, C, or T) sites, was also observed in the transgenic lettuce lines, suggesting that lettuce and gentian share similar methylation machinery. Four of five transgenic lettuce lines having a single copy of a modified 35S promoter, which was modified in the proposed core target of de novo methylation in gentian, exhibited 35S hypomethylation, indicating that the modified sequence may be the target of the 35S-specific methylation machinery.
Nucleotide Sequence Analysis of RNA Synthesized from Rabbit Globin Complementary DNA

PubMed Central

Poon, Raymond; Paddock, Gary V.; Heindell, Howard; Whitcome, Philip; Salser, Winston; Kacian, Dan; Bank, Arthur; Gambino, Roberto; Ramirez, Francesco

1974-01-01

Rabbit globin complementary DNA made with RNA-dependent DNA polymerase (reverse transcriptase) was used as template for in vitro synthesis of 32P-labeled RNA. The sequences of the nucleotides in most of the fragments resulting from combined ribonuclease T1 and alkaline phosphatase digestion have been determined. Several fragments were long enough to fit uniquely with the α or β globin amino-acid sequences. These data demonstrate that the cDNA was copied from globin mRNA and contained no detectable contaminants. Images PMID:4139714

Inaugural Genomics Automation Congress and the coming deluge of sequencing data.

PubMed

Creighton, Chad J

2010-10-01

Presentations at Select Biosciences's first 'Genomics Automation Congress' (Boston, MA, USA) in 2010 focused on next-generation sequencing and the platforms and methodology around them. The meeting provided an overview of sequencing technologies, both new and emerging. Speakers shared their recent work on applying sequencing to profile cells for various levels of biomolecular complexity, including DNA sequences, DNA copy, DNA methylation, mRNA and microRNA. With sequencing time and costs continuing to drop dramatically, a virtual explosion of very large sequencing datasets is at hand, which will probably present challenges and opportunities for high-level data analysis and interpretation, as well as for information technology infrastructure.
An Integrated Approach for RNA-seq Data Normalization.

PubMed

Yang, Shengping; Mercante, Donald E; Zhang, Kun; Fang, Zhide

2016-01-01

DNA copy number alteration is common in many cancers. Studies have shown that insertion or deletion of DNA sequences can directly alter gene expression, and significant correlation exists between DNA copy number and gene expression. Data normalization is a critical step in the analysis of gene expression generated by RNA-seq technology. Successful normalization reduces/removes unwanted nonbiological variations in the data, while keeping meaningful information intact. However, as far as we know, no attempt has been made to adjust for the variation due to DNA copy number changes in RNA-seq data normalization. In this article, we propose an integrated approach for RNA-seq data normalization. Comparisons show that the proposed normalization can improve power for downstream differentially expressed gene detection and generate more biologically meaningful results in gene profiling. In addition, our findings show that due to the effects of copy number changes, some housekeeping genes are not always suitable internal controls for studying gene expression. Using information from DNA copy number, integrated approach is successful in reducing noises due to both biological and nonbiological causes in RNA-seq data, thus increasing the accuracy of gene profiling.
The effect of input DNA copy number on genotype call and characterising SNP markers in the humpback whale genome using a nanofluidic array.

PubMed

Bhat, Somanath; Polanowski, Andrea M; Double, Mike C; Jarman, Simon N; Emslie, Kerry R

2012-01-01

Recent advances in nanofluidic technologies have enabled the use of Integrated Fluidic Circuits (IFCs) for high-throughput Single Nucleotide Polymorphism (SNP) genotyping (GT). In this study, we implemented and validated a relatively low cost nanofluidic system for SNP-GT with and without Specific Target Amplification (STA). As proof of principle, we first validated the effect of input DNA copy number on genotype call rate using well characterised, digital PCR (dPCR) quantified human genomic DNA samples and then implemented the validated method to genotype 45 SNPs in the humpback whale, Megaptera novaeangliae, nuclear genome. When STA was not incorporated, for a homozygous human DNA sample, reaction chambers containing, on average 9 to 97 copies, showed 100% call rate and accuracy. Below 9 copies, the call rate decreased, and at one copy it was 40%. For a heterozygous human DNA sample, the call rate decreased from 100% to 21% when predicted copies per reaction chamber decreased from 38 copies to one copy. The tightness of genotype clusters on a scatter plot also decreased. In contrast, when the same samples were subjected to STA prior to genotyping a call rate and a call accuracy of 100% were achieved. Our results demonstrate that low input DNA copy number affects the quality of data generated, in particular for a heterozygous sample. Similar to human genomic DNA, a call rate and a call accuracy of 100% was achieved with whale genomic DNA samples following multiplex STA using either 15 or 45 SNP-GT assays. These calls were 100% concordant with their true genotypes determined by an independent method, suggesting that the nanofluidic system is a reliable platform for executing call rates with high accuracy and concordance in genomic sequences derived from biological tissue.
Cytogenetic evidence for asexual evolution of bdelloid rotifers.

PubMed

Mark Welch, Jessica L; Mark Welch, David B; Meselson, Matthew

2004-02-10

DNA sequencing has shown individual bdelloid rotifer genomes to contain two or more diverged copies of every gene examined and has revealed no closely similar copies. These and other findings are consistent with long-term asexual evolution of bdelloids. It is not entirely ruled out, however, that bdelloid genomes consist of previously undetected pairs of sequences so similar as to be identical over the regions sequenced, as might result if bdelloids were highly inbred sexual diploids or polyploids. Here, we employ fluorescent in situ hybridization with cosmid probes to determine the copy number and chromosomal distribution of the heat shock gene hsp82 and adjacent sequences in the bdelloid Philodina roseola. We conclude that the four copies identified by sequencing are the only ones present and that each is on a separate chromosome. Bdelloids therefore are not highly homozygous sexually reproducing diploids or polyploids.
Reduced rDNA Copy Number Does Not Affect “Competitive” Chromosome Pairing in XYY Males of Drosophila melanogaster

PubMed Central

Maggert, Keith A.

2014-01-01

The ribosomal DNA (rDNA) arrays are causal agents in X-Y chromosome pairing in meiosis I of Drosophila males. Despite broad variation in X-linked and Y-linked rDNA copy number, polymorphisms in regulatory/spacer sequences between rRNA genes, and variance in copy number of interrupting R1 and R2 retrotransposable elements, there is little evidence that different rDNA arrays affect pairing efficacy. I investigated whether induced rDNA copy number polymorphisms affect chromosome pairing in a “competitive” situation in which complex pairing configurations were possible using males with XYY constitution. Using a common normal X chromosome, one of two different full-length Y chromosomes, and a third chromosome from a series of otherwise-isogenic rDNA deletions, I detected no differences in X-Y or Y-Y pairing or chromosome segregation frequencies that could not be attributed to random variation alone. This work was performed in the context of an undergraduate teaching program at Texas A&M University, and I discuss the pedagogical utility of this and other such experiments. PMID:24449686
DNA hypomethylation of individual sequences in aborted cloned bovine fetuses.

PubMed

Chen, Tao; Jiang, Yan; Zhang, Yan-Ling; Liu, Jing-He; Hou, Yi; Schatten, Heide; Chen, Da-Yuan; Sun, Qing-Yuan

2005-09-01

Cloned bovines have a much higher abortion rate than those derived in vivo. Available evidence indicates that inappropriate epigenetic reprogramming of donor nuclei is the primary cause of cloning failure. To gain a better understanding of the DNA methylation changes associated with the high abortion rate of cloned bovines, we examined the DNA methylation status of a repeated sequence (satellite I) and the promoter regions of two single-copy genes (interleukin 3/cytokeratin) in aborted cloned fetuses, aborted fetuses derived from artificial insemination (AI), cloned adults and AI adults by bisulfite sequencing and restriction enzyme analysis. Two of four aborted cloned fetuses show very low methylation levels in the two single-copy gene promoter regions. One of the two fetuses also showed undermethylated status in the satellite I sequence. The other two aborted cloned fetuses have similar methylation levels to those of aborted AI fetuses. However, no difference in methylation was observed between cloned adults and AI adults. Our results demonstrate for the first time the undermethylated status of individual sequences in aborted cloned fetuses. These findings suggest that aberrant DNA methylation may contribute to the developmental failure of cloned bovine fetuses.
Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

PubMed

Duyk, G M; Kim, S W; Myers, R M; Cox, D R

1990-11-01

Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons.
Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

PubMed Central

Duyk, G M; Kim, S W; Myers, R M; Cox, D R

1990-01-01

Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons. PMID:2247475
The landscape of actionable genomic alterations in cell-free circulating tumor DNA from 21,807 advanced cancer patients.

PubMed

Zill, Oliver A; Banks, Kimberly C; Fairclough, Stephen R; Mortimer, Stefanie; Vowles, James V; Mokhtari, Reza; Gandara, David R; Mack, Philip C; Odegaard, Justin I; Nagy, Rebecca J; Baca, Arthur M; Eltoukhy, Helmy; Chudova, Darya I; Lanman, Richard B; Talasaz, AmirAli

2018-05-18

Cell-free DNA (cfDNA) sequencing provides a non-invasive method for obtaining actionable genomic information to guide personalized cancer treatment, but the presence of multiple alterations in circulation related to treatment and tumor heterogeneity complicate the interpretation of the observed variants. Experimental Design: We describe the somatic mutation landscape of 70 cancer genes from cfDNA deep-sequencing analysis of 21,807 patients with treated, late-stage cancers across >50 cancer types. To facilitate interpretation of the genomic complexity of circulating tumor DNA in advanced, treated cancer patients, we developed methods to identify cfDNA copy-number driver alterations and cfDNA clonality. Patterns and prevalence of cfDNA alterations in major driver genes for non-small cell lung, breast, and colorectal cancer largely recapitulated those from tumor tissue sequencing compendia (TCGA and COSMIC; r=0.90-0.99), with the principle differences in alteration prevalence being due to patient treatment. This highly sensitive cfDNA sequencing assay revealed numerous subclonal tumor-derived alterations, expected as a result of clonal evolution, but leading to an apparent departure from mutual exclusivity in treatment-naïve tumors. Upon applying novel cfDNA clonality and copy-number driver identification methods, robust mutual exclusivity was observed among predicted truncal driver cfDNA alterations (FDR=5x10 -7 for EGFR and ERBB2 ), in effect distinguishing tumor-initiating alterations from secondary alterations. Treatment-associated resistance, including both novel alterations and parallel evolution, was common in the cfDNA cohort and was enriched in patients with targetable driver alterations (>18.6% patients). Together these retrospective analyses of a large cfDNA sequencing data set reveal subclonal structures and emerging resistance in advanced solid tumors. Copyright ©2018, American Association for Cancer Research.
Patterns of Viral DNA Integration in Cells Transformed by Wild Type or DNA-Binding Protein Mutants of Adenovirus Type 5 and Effect of Chemical Carcinogens on Integration

PubMed Central

Dorsch-Häsler, Karoline; Fisher, Paul B.; Weinstein, I. Bernard; Ginsberg, Harold S.

1980-01-01

The integration pattern of viral DNA was studied in a number of cell lines transformed by wild-type adenovirus type 5 (Ad5 WT) and two mutants of the DNA-binding protein gene, H5ts125 and H5ts107. The effect of chemical carcinogens on the integration of viral DNA was also investigated. Liquid hybridization (C0t) analyses showed that rat embryo cells transformed by Ad5 WT usually contained only the left-hand end of the viral genome, whereas cell lines transformed by H5ts125 or H5ts107 at either the semipermissive (36°C) or nonpermissive (39.5°C) temperature often contained one to five copies of all or most of the entire adenovirus genome. The arrangement of the integrated adenovirus DNA sequences was determined by cleavage of transformed cell DNA with restriction endonucleases XbaI, EcoRI, or HindIII followed by transfer of separated fragments to nitrocellulose paper and hybridization according to the technique of E. M. Southern (J. Mol. Biol. 98: 503-517, 1975). It was found that the adenovirus genome is integrated as a linear sequence covalently linked to host cell DNA; that the viral DNA is integrated into different host DNA sequences in each cell line studied; that in cell lines that contain multiple copies of the Ad5 genome the viral DNA sequences can be integrated in a single set of host cell DNA sequences and not as concatemers; and that chemical carcinogens do not alter the extent or pattern of viral DNA integration. Images PMID:6246266
Short, interspersed, and repetitive DNA sequences in Spiroplasma species.

PubMed

Nur, I; LeBlanc, D J; Tully, J G

1987-03-01

Small fragments of DNA from an 8-kbp plasmid, pRA1, from a plant pathogenic strain of Spiroplasma citri were shown previously to be present in the chromosomal DNA of at least two species of Spiroplasma. We describe here the shot-gun cloning of chromosomal DNA from S. citri Maroc and the identification of two distinct sequences exhibiting homology to pRA1. Further subcloning experiments provided specific molecular probes for the identification of these two sequences in chromosomal DNA from three distinct plant pathogenic species of Spiroplasma. The results of Southern blot hybridization indicated that each of the pRA1-associated sequences is present as multiple copies in short, dispersed, and repetitive sequences in the chromosomes of these three strains. None of the sequences was detectable in chromosomal DNA from an additional nine Spiroplasma strains examined.
Effect of endogenous reference genes on digital PCR assessment of genetically engineered canola events.

PubMed

Demeke, Tigst; Eng, Monika

2018-05-01

Droplet digital PCR (ddPCR) has been used for absolute quantification of genetically engineered (GE) events. Absolute quantification of GE events by duplex ddPCR requires the use of appropriate primers and probes for target and reference gene sequences in order to accurately determine the amount of GE materials. Single copy reference genes are generally preferred for absolute quantification of GE events by ddPCR. Study has not been conducted on a comparison of reference genes for absolute quantification of GE canola events by ddPCR. The suitability of four endogenous reference sequences ( HMG-I/Y , FatA(A), CruA and Ccf) for absolute quantification of GE canola events by ddPCR was investigated. The effect of DNA extraction methods and DNA quality on the assessment of reference gene copy numbers was also investigated. ddPCR results were affected by the use of single vs. two copy reference genes. The single copy, FatA(A), reference gene was found to be stable and suitable for absolute quantification of GE canola events by ddPCR. For the copy numbers measured, the HMG-I/Y reference gene was less consistent than FatA(A) reference gene. The expected ddPCR values were underestimated when CruA and Ccf (two copy endogenous Cruciferin sequences) were used because of high number of copies. It is important to make an adjustment if two copy reference genes are used for ddPCR in order to obtain accurate results. On the other hand, real-time quantitative PCR results were not affected by the use of single vs. two copy reference genes.
Clinical utility of circulating tumor DNA for molecular assessment in pancreatic cancer.

PubMed

Takai, Erina; Totoki, Yasushi; Nakamura, Hiromi; Morizane, Chigusa; Nara, Satoshi; Hama, Natsuko; Suzuki, Masami; Furukawa, Eisaku; Kato, Mamoru; Hayashi, Hideyuki; Kohno, Takashi; Ueno, Hideki; Shimada, Kazuaki; Okusaka, Takuji; Nakagama, Hitoshi; Shibata, Tatsuhiro; Yachida, Shinichi

2015-12-16

Pancreatic ductal adenocarcinoma (PDAC) remains one of the most lethal malignancies. The genomic landscape of the PDAC genome features four frequently mutated genes (KRAS, CDKN2A, TP53, and SMAD4) and dozens of candidate driver genes altered at low frequency, including potential clinical targets. Circulating cell-free DNA (cfDNA) is a promising resource to detect and monitor molecular characteristics of tumors. In the present study, we determined the mutational status of KRAS in plasma cfDNA using multiplex picoliter-droplet digital PCR in 259 patients with PDAC. We constructed a novel modified SureSelect-KAPA-Illumina platform and an original panel of 60 genes. We then performed targeted deep sequencing of cfDNA and matched germline DNA samples in 48 patients who had ≥1% mutant allele frequencies of KRAS in plasma cfDNA. Importantly, potentially targetable somatic mutations were identified in 14 of 48 patients (29.2%) examined by targeted deep sequencing of cfDNA. We also analyzed somatic copy number alterations based on the targeted sequencing data using our in-house algorithm, and potentially targetable amplifications were detected. Assessment of mutations and copy number alterations in plasma cfDNA may provide a prognostic and diagnostic tool to assist decisions regarding optimal therapeutic strategies for PDAC patients.
Sources of PCR-induced distortions in high-throughput sequencing data sets

PubMed Central

Kebschull, Justus M.; Zador, Anthony M.

2015-01-01

PCR permits the exponential and sequence-specific amplification of DNA, even from minute starting quantities. PCR is a fundamental step in preparing DNA samples for high-throughput sequencing. However, there are errors associated with PCR-mediated amplification. Here we examine the effects of four important sources of error—bias, stochasticity, template switches and polymerase errors—on sequence representation in low-input next-generation sequencing libraries. We designed a pool of diverse PCR amplicons with a defined structure, and then used Illumina sequencing to search for signatures of each process. We further developed quantitative models for each process, and compared predictions of these models to our experimental data. We find that PCR stochasticity is the major force skewing sequence representation after amplification of a pool of unique DNA amplicons. Polymerase errors become very common in later cycles of PCR but have little impact on the overall sequence distribution as they are confined to small copy numbers. PCR template switches are rare and confined to low copy numbers. Our results provide a theoretical basis for removing distortions from high-throughput sequencing data. In addition, our findings on PCR stochasticity will have particular relevance to quantification of results from single cell sequencing, in which sequences are represented by only one or a few molecules. PMID:26187991
Transformation of Chloroplast Ribosomal RNA Genes in Chlamydomonas: Molecular and Genetic Characterization of Integration Events

PubMed Central

Newman, S. M.; Boynton, J. E.; Gillham, N. W.; Randolph-Anderson, B. L.; Johnson, A. M.; Harris, E. H.

1990-01-01

Transformation of chloroplast ribosomal RNA (rRNA) genes in Chlamydomonas has been achieved by the biolistic process using cloned chloroplast DNA fragments carrying mutations that confer antibiotic resistance. The sites of exchange employed during the integration of the donor DNA into the recipient genome have been localized using a combination of antibiotic resistance mutations in the 16S and 23S rRNA genes and restriction fragment length polymorphisms that flank these genes. Complete or nearly complete replacement of a region of the chloroplast genome in the recipient cell by the corresponding sequence from the donor plasmid was the most common integration event. Exchange events between the homologous donor and recipient sequences occurred preferentially near the vector:insert junctions. Insertion of the donor rRNA genes and flanking sequences into one inverted repeat of the recipient genome was followed by intramolecular copy correction so that both copies of the inverted repeat acquired identical sequences. Increased frequencies of rRNA gene transformants were achieved by reducing the copy number of the chloroplast genome in the recipient cells and by decreasing the heterology between donor and recipient DNA sequences flanking the selectable markers. In addition to producing bona fide chloroplast rRNA transformants, the biolistic process induced mutants resistant to low levels of streptomycin, typical of nuclear mutations in Chlamydomonas. PMID:1981764
Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.

PubMed Central

Ananiev, E V; Phillips, R L; Rines, H W

1998-01-01

The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055
Distinct Copy Number, Coding Sequence, and Locus Methylation Patterns Underlie Rhg1-Mediated Soybean Resistance to Soybean Cyst Nematode1[W][OPEN

PubMed Central

Cook, David E.; Bayless, Adam M.; Wang, Kai; Guo, Xiaoli; Song, Qijian; Jiang, Jiming; Bent, Andrew F.

2014-01-01

Copy number variation of kilobase-scale genomic DNA segments, beyond presence/absence polymorphisms, can be an important driver of adaptive traits. Resistance to Heterodera glycines (Rhg1) is a widely utilized quantitative trait locus that makes the strongest known contribution to resistance against soybean cyst nematode (SCN), Heterodera glycines, the most damaging pathogen of soybean (Glycine max). Rhg1 was recently discovered to be a complex locus at which resistance-conferring haplotypes carry up to 10 tandem repeat copies of a 31-kb DNA segment, and three disparate genes present on each repeat contribute to SCN resistance. Here, we use whole-genome sequencing, fiber-FISH (fluorescence in situ hybridization), and other methods to discover the genetic variation at Rhg1 across 41 diverse soybean accessions. Based on copy number variation, transcript abundance, nucleic acid polymorphisms, and differentially methylated DNA regions, we find that SCN resistance is associated with multicopy Rhg1 haplotypes that form two distinct groups. The tested high-copy-number Rhg1 accessions, including plant introduction (PI) 88788, contain a flexible number of copies (seven to 10) of the 31-kb Rhg1 repeat. The identified low-copy-number Rhg1 group, including PI 548402 (Peking) and PI 437654, contains three copies of the Rhg1 repeat and a newly identified allele of Glyma18g02590 (a predicted α-SNAP [α-soluble N-ethylmaleimide–sensitive factor attachment protein]). There is strong evidence for a shared origin of the two resistance-conferring multicopy Rhg1 groups and subsequent independent evolution. Differentially methylated DNA regions also were identified within Rhg1 that correlate with SCN resistance. These data provide insights into copy number variation of multigene segments, using as the example a disease resistance trait of high economic importance. PMID:24733883
Identification of Genetic Elements Associated with EPSPS Gene Amplification

PubMed Central

Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.

2013-01-01

Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434
A theory that may explain the Hayflick limit--a means to delete one copy of a repeating sequence during each cell cycle in certain human cells such as fibroblasts.

PubMed

Naveilhan, P; Baudet, C; Jabbour, W; Wion, D

1994-09-01

A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.
A common deletion in two gamma ray induced rat pulmonary tumor cell lines.

PubMed

Van Klaveren, P; De Bruijne, J; Van der Winden, H; Kal, H B; Bentvelzen, P

1994-01-01

Subtraction hybridization was performed on normal WAG/Rij rat DNA with DNA from a syngeneic Ir-192 induced pulmonary tumor cell line L37. The residual DNA was amplified by means of sequence-independent PCR. This procedure yielded a sequence, of which multiple copies are present in normal rat DNA. In the tumor line L37 two restriction fragments hybridizing with this repeat sequence are lacking. In another Ir-192 induced pulmonary tumor line, L33, one of these fragments was also lacking. This indicates a common deletion in the two tumor lines.

Partial bisulfite conversion for unique template sequencing

PubMed Central

Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael

2018-01-01

Abstract We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. PMID:29161423
The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.

PubMed

Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo

2016-05-01

The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicine was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes encoded one intron and three genes have two introns.
Single-Cell-Based Platform for Copy Number Variation Profiling through Digital Counting of Amplified Genomic DNA Fragments.

PubMed

Li, Chunmei; Yu, Zhilong; Fu, Yusi; Pang, Yuhong; Huang, Yanyi

2017-04-26

We develop a novel single-cell-based platform through digital counting of amplified genomic DNA fragments, named multifraction amplification (mfA), to detect the copy number variations (CNVs) in a single cell. Amplification is required to acquire genomic information from a single cell, while introducing unavoidable bias. Unlike prevalent methods that directly infer CNV profiles from the pattern of sequencing depth, our mfA platform denatures and separates the DNA molecules from a single cell into multiple fractions of a reaction mix before amplification. By examining the sequencing result of each fraction for a specific fragment and applying a segment-merge maximum likelihood algorithm to the calculation of copy number, we digitize the sequencing-depth-based CNV identification and thus provide a method that is less sensitive to the amplification bias. In this paper, we demonstrate a mfA platform through multiple displacement amplification (MDA) chemistry. When performing the mfA platform, the noise of MDA is reduced; therefore, the resolution of single-cell CNV identification can be improved to 100 kb. We can also determine the genomic region free of allelic drop-out with mfA platform, which is impossible for conventional single-cell amplification methods.
An improved divergent synthesis of comb-type branched oligodeoxyribonucleotides (bDNA) containing multiple secondary sequences.

PubMed

Horn, T; Chang, C A; Urdea, M S

1997-12-01

The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays.
An improved divergent synthesis of comb-type branched oligodeoxyribonucleotides (bDNA) containing multiple secondary sequences.

PubMed Central

Horn, T; Chang, C A; Urdea, M S

1997-01-01

The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays. PMID:9365265
Insertion sequence typing of Mycobacterium tuberculosis: characterization of a widespread subtype with a single copy of IS6110.

PubMed

Fomukong, N G; Tang, T H; al-Maamary, S; Ibrahim, W A; Ramayah, S; Yates, M; Zainuddin, Z F; Dale, J W

1994-12-01

DNA fingerprinting with the insertion sequence IS6110 (also known as IS986) has become established as a major tool for investigating the spread of tuberculosis. Most strains of Mycobacterium tuberculosis have multiple copies of IS6110, but a small minority carry a single copy only. We have examined selected strains from Malaysia, Tanzania and Oman, in comparison with M. bovis isolates and BCG strains carrying one or two copies of IS6110. The insertion sequence appears to be present in the same position in all these strains, which suggests that in these organisms the element is defective in transposition and that the loss of transposability may have occurred at an early stage in the evolution of the M. tuberculosis complex.
The relationship between mitochondrial DNA copy number and stallion sperm function.

PubMed

Darr, Christa R; Moraes, Luis E; Connon, Richard E; Love, Charles C; Teague, Sheila; Varner, Dickson D; Meyers, Stuart A

2017-05-01

Mitochondrial DNA (mtDNA) copy number has been utilized as a measure of sperm quality in several species including mice, dogs, and humans, and has been suggested as a potential biomarker of fertility in stallion sperm. The results of the present study extend this recent discovery using sperm samples from American Quarter Horse stallions of varying age. By determining copy number of three mitochondrial genes, cytochrome b (CYTB), NADH dehydrogenase 1 (ND1) and NADH dehydrogenase 4 (ND4), instead of a single gene, we demonstrate an improved understanding of mtDNA fate in stallion sperm mitochondria following spermatogenesis. Sperm samples from 37 stallions ranging from 3 to 24 years old were collected at four breeding ranches in north and central Texas during the 2015 breeding season. Samples were analyzed for sperm motion characteristics, nuclear DNA denaturability and mtDNA copy number. Mitochondrial DNA content in individual sperm was determined by real-time qPCR and normalized with a single copy nuclear gene, Beta actin. Exploratory correlation analysis revealed that total motility was negatively correlated with CYTB copy number and sperm chromatin structure. Stallion age did not have a significant effect on copy number for any of the genes. Copy number differences existed between the three genes with CYTB having the greatest number of copies (20.6 ± 1.2 copies, range: 6.0 to 41.1) followed by ND4 (15.5 ± 0.8 copies, range: 6.7 to 27.8) and finally ND1 (12.0 ± 1.0 copies, range: 0.4 to 26.6) (P < 0.05). Varying copy number across mitochondrial genes is likely to be a result of mtDNA fragmentation and degradation since downregulation of sperm mtDNA occurs during spermatogenesis and may be important for normal sperm function. Beta regression analysis suggested that for every unit increase in mtDNA copy number of CYTB, there was a 4% decrease in the odds of sperm movement (P = 0.001). Influential analysis suggested that results are robust and not highly influenced by data from individual stallions despite the low number of stallions sampled with low sperm motility. Further genome sequencing is necessary to investigate if mutations or deletions are the underlying causes of inconsistent copy numbers across mitochondrial genes. In conclusion, we show, for the first time, that increased mtDNA copy number is associated with decreased total sperm motility in stallions. We therefore suggest that mtDNA copy number may be an indicator of defective spermatogenesis in stallions. Copyright © 2017 Elsevier Inc. All rights reserved.
Array-based detection of genetic alterations associated with disease

DOEpatents

Pinkel, Daniel; Albertson, Donna G.; Gray, Joe W.

2017-09-05

The present invention relates to DNA sequences from regions of copy number change on chromosome 20. The sequences can be used in hybridization methods for the identification of chromosomal abnormalities associated with various diseases.
Array-based detection of genetic alterations associated with disease

DOEpatents

Pinkel, Daniel; Albertson, Donna G.; Gray, Joe W.

2007-09-11

The present invention relates to DNA sequences from regions of copy number change on chromosome 20. The sequences can be used in hybridization methods for the identification of chromosomal abnormalities associated with various diseases.
Specific functions of the Rep and Rep׳ proteins of porcine circovirus during copy-release and rolling-circle DNA replication.

PubMed

Cheung, Andrew K

2015-07-01

The roles of two porcine circovirus replication initiator proteins, Rep and Rep׳, in generating copy-release and rolling-circle DNA replication intermediates were determined. Rep uses the supercoiled closed-circular genome (ccc) to initiate leading-strand synthesis (identical to copy-release replication) and generates the single-stranded circular (ssc) genome from the displaced DNA strand. In the process, a minus-genome primer (MGP) necessary for complementary-strand synthesis, from ssc to ccc, is synthesized. Rep׳ cleaves the growing nascent-strand to regenerate the parent ccc molecule. In the process, a Rep׳-DNA hybrid containing the right palindromic sequence (at the origin of DNA replication) is generated. Analysis of the virus particle showed that it is composed of four components: ssc, MGP, capsid protein and a novel Rep-related protein (designated Protein-3). Copyright © 2015. Published by Elsevier Inc.
B-DNA to Z-DNA structural transitions in the SV40 enhancer: stabilization of Z-DNA in negatively supercoiled DNA minicircles

NASA Technical Reports Server (NTRS)

Gruskin, E. A.; Rich, A.

1993-01-01

During replication and transcription, the SV40 control region is subjected to significant levels of DNA unwinding. There are three, alternating purine-pyrimidine tracts within this region that can adopt the Z-DNA conformation in response to negative superhelix density: a single copy of ACACACAT and two copies of ATGCATGC. Since the control region is essential for both efficient transcription and replication, B-DNA to Z-DNA transitions in these vital sequence tracts may have significant biological consequences. We have synthesized DNA minicircles to detect B-DNA to Z-DNA transitions in the SV40 enhancer, and to determine the negative superhelix density required to stabilize the Z-DNA. A variety of DNA sequences, including the entire SV40 enhancer and the two segments of the enhancer with alternating purine-pyrimidine tracts, were incorporated into topologically relaxed minicircles. Negative supercoils were generated, and the resulting topoisomers were resolved by electrophoresis. Using an anti-Z-DNA Fab and an electrophoretic mobility shift assay, Z-DNA was detected in the enhancer-containing minicircles at a superhelix density of -0.05. Fab saturation binding experiments demonstrated that three, independent Z-DNA tracts were stabilized in the supercoiled minicircles. Two other minicircles, each with one of the two alternating purine-pyrimidine tracts, also contained single Z-DNA sites. These results confirm the identities of the Z-DNA-forming sequences within the control region. Moreover, the B-DNA to Z-DNA transitions were detected at superhelix densities observed during normal replication and transcription processes in the SV40 life cycle.
Accurate, high-throughput typing of copy number variation using paralogue ratios from dispersed repeats

PubMed Central

Armour, John A. L.; Palla, Raquel; Zeeuwen, Patrick L. J. M.; den Heijer, Martin; Schalkwijk, Joost; Hollox, Edward J.

2007-01-01

Recent work has demonstrated an unexpected prevalence of copy number variation in the human genome, and has highlighted the part this variation may play in predisposition to common phenotypes. Some important genes vary in number over a high range (e.g. DEFB4, which commonly varies between two and seven copies), and have posed formidable technical challenges for accurate copy number typing, so that there are no simple, cheap, high-throughput approaches suitable for large-scale screening. We have developed a simple comparative PCR method based on dispersed repeat sequences, using a single pair of precisely designed primers to amplify products simultaneously from both test and reference loci, which are subsequently distinguished and quantified via internal sequence differences. We have validated the method for the measurement of copy number at DEFB4 by comparison of results from >800 DNA samples with copy number measurements by MAPH/REDVR, MLPA and array-CGH. The new Paralogue Ratio Test (PRT) method can require as little as 10 ng genomic DNA, appears to be comparable in accuracy to the other methods, and for the first time provides a rapid, simple and inexpensive method for copy number analysis, suitable for application to typing thousands of samples in large case-control association studies. PMID:17175532
Reanalysis of RNA-Sequencing Data Reveals Several Additional Fusion Genes with Multiple Isoforms

PubMed Central

Kangaspeska, Sara; Hultsch, Susanne; Edgren, Henrik; Nicorici, Daniel; Murumägi, Astrid; Kallioniemi, Olli

2012-01-01

RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts. PMID:23119097
Reanalysis of RNA-sequencing data reveals several additional fusion genes with multiple isoforms.

PubMed

Kangaspeska, Sara; Hultsch, Susanne; Edgren, Henrik; Nicorici, Daniel; Murumägi, Astrid; Kallioniemi, Olli

2012-01-01

RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts.
Improvement and Optimization of Two Engineered Phage Resistance Mechanisms in Lactococcus lactis

PubMed Central

McGrath, Stephen; Fitzgerald, Gerald F.; van Sinderen, Douwe

2001-01-01

Homologous replication module genes were identified for four P335 type phages. DNA sequence analysis revealed that all four phages exhibited more than 90% DNA homology for at least two genes, designated rep2009 and orf17. One of these genes, rep2009, codes for a putative replisome organizer protein and contains an assumed origin of phage DNA replication (ori2009), which was identical for all four phages. DNA fragments representing the ori2009 sequence confer a phage-encoded resistance (Per) phenotype on lactococcal hosts when they are supplied on a high-copy-number vector. Furthermore, cloning multiple copies of the ori2009 sequence was found to increase the effectiveness of the Per phenotype conferred. A number of antisense plasmids targeting specific genes of the replication module were constructed. Two separate plasmids targeting rep2009 and orf17 were found to efficiently inhibit proliferation of all four phages by interfering with intracellular phage DNA replication. These results represent two highly effective strategies for inhibiting bacteriophage proliferation, and they also identify a novel gene, orf17, which appears to be important for phage DNA replication. Furthermore, these results indicate that although the actual mechanisms of DNA replication are very similar, if not identical, for all four phages, expression of the replication genes is significantly different in each case. PMID:11157223
The repetitive landscape of the chicken genome.

PubMed

Wicker, Thomas; Robertson, Jon S; Schulze, Stefan R; Feltus, F Alex; Magrini, Vincent; Morrison, Jason A; Mardis, Elaine R; Wilson, Richard K; Peterson, Daniel G; Paterson, Andrew H; Ivarie, Robert

2005-01-01

Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7 x coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available.
The repetitive landscape of the chicken genome

PubMed Central

Wicker, Thomas; Robertson, Jon S.; Schulze, Stefan R.; Feltus, F. Alex; Magrini, Vincent; Morrison, Jason A.; Mardis, Elaine R.; Wilson, Richard K.; Peterson, Daniel G.; Paterson, Andrew H.; Ivarie, Robert

2005-01-01

Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7× coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available. PMID:15256510
Development of a quantitative-competitive PCR for quantification of human cytomegalovirus load and comparison with antigenaemia, viraemia and pp67 RNA detection by nucleic acid sequence-based amplification.

PubMed

Bergallo, M; Costa, C; Tarallo, S; Daniele, R; Merlino, C; Segoloni, G P; Negro Ponzi, A; Cavallo, R

2006-06-01

The human cytomegalovirus (HCMV) is an important pathogen in immunocompromised patients, such as transplant recipients. The use of sensitive and rapid diagnostic assays can have a great impact on antiviral prophylaxis and therapy monitoring and diagnosing active disease. Quantification of HCMV DNA may additionally have prognostic value and guide routine management. The aim of this study was to develop a reliable internally-controlled quantitative-competitive PCR (QC-PCR) for the detection and quantification of HCMV DNA viral load in peripheral blood and compare it with other methods: the HCMV pp65 antigenaemia assay in leukocyte fraction, the HCMV viraemia, both routinely employed in our laboratory, and the nucleic acid sequence-based amplification (NASBA) for detection of HCMV pp67-mRNA. Quantitative-competitive PCR is a procedure for nucleic acid quantification based on co-amplification of competitive templates, the target DNA and a competitor functioning as internal standard. In particular, a standard curve is generated by amplifying 10(2) to 10(5) copies of target pCMV-435 plasmid with 10(4) copies of competitor pCMV-C plasmid. Clinical samples derived from 40 kidney transplant patients were tested by spiking 10(4) copies of pCMV-C into the PCR mix as internal control, and comparing results with the standard curve. Of the 40 patients studied, 39 (97.5%) were positive for HCMV DNA by QC-PCR. While the correlation between the number of pp65-positive cells and the number of HCMV DNA genome copies/mL and the former and the pp67mRNA-positivity were statistically significant, there was no significant correlation between HCMV DNA viral load assayed by QC-PCR and HCMV viraemia. The QC-PCR assay could detect from 10(2) to over 10(7) copies of HCMV DNA with a range of linearity between 10(2) and 10(5) genomes.
From famine to feast? Selecting nuclear DNA sequence loci for plant species-level phylogeny reconstruction

PubMed Central

Hughes, Colin E; Eastwood, Ruth J; Donovan Bailey, C

2005-01-01

Phylogenetic analyses of DNA sequences have prompted spectacular progress in assembling the Tree of Life. However, progress in constructing phylogenies among closely related species, at least for plants, has been less encouraging. We show that for plants, the rapid accumulation of DNA characters at higher taxonomic levels has not been matched by conventional sequence loci at the species level, leaving a lack of well-resolved gene trees that is hindering investigations of many fundamental questions in plant evolutionary biology. The most popular approach to address this problem has been to use low-copy nuclear genes as a source of DNA sequence data. However, this has had limited success because levels of variation among nuclear intron sequences across groups of closely related species are extremely variable and generally lower than conventionally used loci, and because no universally useful low-copy nuclear DNA sequence loci have been developed. This suggests that solutions will, for the most part, be lineage-specific, prompting a move away from ‘universal’ gene thinking for species-level phylogenetics. The benefits and limitations of alternative approaches to locate more variable nuclear loci are discussed and the potential of anonymous non-genic nuclear loci is highlighted. Given the virtually unlimited number of loci that can be generated using these new approaches, it is clear that effective screening will be critical for efficient selection of the most informative loci. Strategies for screening are outlined. PMID:16553318
Molecular inversion probe assay for allelic quantitation

PubMed Central

Ji, Hanlee; Welch, Katrina

2010-01-01

Molecular inversion probe (MIP) technology has been demonstrated to be a robust platform for large-scale dual genotyping and copy number analysis. Applications in human genomic and genetic studies include the possibility of running dual germline genotyping and combined copy number variation ascertainment. MIPs analyze large numbers of specific genetic target sequences in parallel, relying on interrogation of a barcode tag, rather than direct hybridization of genomic DNA to an array. The MIP approach does not replace, but is complementary to many of the copy number technologies being performed today. Some specific advantages of MIP technology include: Less DNA required (37 ng vs. 250 ng), DNA quality less important, more dynamic range (amplifications detected up to copy number 60), allele specific information “cleaner” (less SNP crosstalk/contamination), and quality of markers better (fewer individual MIPs versus SNPs needed to identify copy number changes). MIPs can be considered a candidate gene (targeted whole genome) approach and can find specific areas of interest that otherwise may be missed with other methods. PMID:19488872

Three copies of a single protein II-encoding sequence in the genome of Neisseria gonorrhoeae JS3: evidence for gene conversion and gene duplication.

PubMed

van der Ley, P

1988-11-01

Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
Cloning vector

DOEpatents

Guilfoyle, Richard A.; Smith, Lloyd M.

1994-01-01

A vector comprising a filamentous phage sequence containing a first copy of filamentous phage gene X and other sequences necessary for the phage to propagate is disclosed. The vector also contains a second copy of filamentous phage gene X downstream from a promoter capable of promoting transcription in a bacterial host. In a preferred form of the present invention, the filamentous phage is M13 and the vector additionally includes a restriction endonuclease site located in such a manner as to substantially inactivate the second gene X when a DNA sequence is inserted into the restriction site.
Cloning vector

DOEpatents

Guilfoyle, R.A.; Smith, L.M.

1994-12-27

A vector comprising a filamentous phage sequence containing a first copy of filamentous phage gene X and other sequences necessary for the phage to propagate is disclosed. The vector also contains a second copy of filamentous phage gene X downstream from a promoter capable of promoting transcription in a bacterial host. In a preferred form of the present invention, the filamentous phage is M13 and the vector additionally includes a restriction endonuclease site located in such a manner as to substantially inactivate the second gene X when a DNA sequence is inserted into the restriction site. 2 figures.
Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors.

PubMed

Adalsteinsson, Viktor A; Ha, Gavin; Freeman, Samuel S; Choudhury, Atish D; Stover, Daniel G; Parsons, Heather A; Gydush, Gregory; Reed, Sarah C; Rotem, Denisse; Rhoades, Justin; Loginov, Denis; Livitz, Dimitri; Rosebrock, Daniel; Leshchiner, Ignaty; Kim, Jaegil; Stewart, Chip; Rosenberg, Mara; Francis, Joshua M; Zhang, Cheng-Zhong; Cohen, Ofir; Oh, Coyin; Ding, Huiming; Polak, Paz; Lloyd, Max; Mahmud, Sairah; Helvie, Karla; Merrill, Margaret S; Santiago, Rebecca A; O'Connor, Edward P; Jeong, Seong H; Leeson, Rachel; Barry, Rachel M; Kramkowski, Joseph F; Zhang, Zhenwei; Polacek, Laura; Lohr, Jens G; Schleicher, Molly; Lipscomb, Emily; Saltzman, Andrea; Oliver, Nelly M; Marini, Lori; Waks, Adrienne G; Harshman, Lauren C; Tolaney, Sara M; Van Allen, Eliezer M; Winer, Eric P; Lin, Nancy U; Nakabayashi, Mari; Taplin, Mary-Ellen; Johannessen, Cory M; Garraway, Levi A; Golub, Todd R; Boehm, Jesse S; Wagle, Nikhil; Getz, Gad; Love, J Christopher; Meyerson, Matthew

2017-11-06

Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.
Stable transformation of a mosquito cell line results in extraordinarily high copy numbers of the plasmid.

PubMed Central

Monroe, T J; Muhlmann-Diaz, M C; Kovach, M J; Carlson, J O; Bedford, J S; Beaty, B J

1992-01-01

Stable incorporation of high copy numbers (greater than 10,000 per cell) of a plasmid vector containing a gene conferring resistance to the antibiotic hygromycin was achieved in a cell line derived from the Aedes albopictus mosquito. Plasmid sequences were readily observed by ethidium bromide staining of cellular DNA after restriction endonuclease digestion and agarose gel electrophoresis. The plasmid was demonstrated by in situ hybridization to be present in large arrays integrated in metaphase chromosomes and in minute and double-minute replicating elements. In one subclone, approximately 60,000 copies of the plasmid were organized in a large array that resembles a chromosome, morphologically and in the segregation of its chromatids during anaphase. The original as well as modified versions of the plasmid were rescued by transformation of Escherichia coli using total cellular DNA. Southern blot analyses of recovered plasmids indicate the presence of mosquito-derived sequences. Images PMID:1631052
High-Throughput Amplicon-Based Copy Number Detection of 11 Genes in Formalin-Fixed Paraffin-Embedded Ovarian Tumour Samples by MLPA-Seq

PubMed Central

Kondrashova, Olga; Love, Clare J.; Lunke, Sebastian; Hsu, Arthur L.; Waring, Paul M.; Taylor, Graham R.

2015-01-01

Whilst next generation sequencing can report point mutations in fixed tissue tumour samples reliably, the accurate determination of copy number is more challenging. The conventional Multiplex Ligation-dependent Probe Amplification (MLPA) assay is an effective tool for measurement of gene dosage, but is restricted to around 50 targets due to size resolution of the MLPA probes. By switching from a size-resolved format, to a sequence-resolved format we developed a scalable, high-throughput, quantitative assay. MLPA-seq is capable of detecting deletions, duplications, and amplifications in as little as 5ng of genomic DNA, including from formalin-fixed paraffin-embedded (FFPE) tumour samples. We show that this method can detect BRCA1, BRCA2, ERBB2 and CCNE1 copy number changes in DNA extracted from snap-frozen and FFPE tumour tissue, with 100% sensitivity and >99.5% specificity. PMID:26569395
Abundance of Dioxygenase Genes Similar to Ralstonia sp. Strain U2 nagAc Is Correlated with Naphthalene Concentrations in Coal Tar-Contaminated Freshwater Sediments

PubMed Central

Dionisi, Hebe M.; Chewning, Christopher S.; Morgan, Katherine H.; Menn, Fu-Min; Easter, James P.; Sayler, Gary S.

2004-01-01

We designed a real-time PCR assay able to recognize dioxygenase large-subunit gene sequences with more than 90% similarity to the Ralstonia sp. strain U2 nagAc gene (nagAc-like gene sequences) in order to study the importance of organisms carrying these genes in the biodegradation of naphthalene. Sequencing of PCR products indicated that this real-time PCR assay was specific and able to detect a variety of nagAc-like gene sequences. One to 100 ng of contaminated-sediment total DNA in 25-μl reaction mixtures produced an amplification efficiency of 0.97 without evident PCR inhibition. The assay was applied to surficial freshwater sediment samples obtained in or in close proximity to a coal tar-contaminated Superfund site. Naphthalene concentrations in the analyzed samples varied between 0.18 and 106 mg/kg of dry weight sediment. The assay for nagAc-like sequences indicated the presence of (4.1 ± 0.7) × 103 to (2.9 ± 0.3) × 105 copies of nagAc-like dioxygenase genes per μg of DNA extracted from sediment samples. These values corresponded to (1.2 ± 0.6) × 105 to (5.4 ± 0.4) × 107 copies of this target per g of dry weight sediment when losses of DNA during extraction were taken into account. There was a positive correlation between naphthalene concentrations and nagAc-like gene copies per microgram of DNA (r = 0.89) and per gram of dry weight sediment (r = 0.77). These results provide evidence of the ecological significance of organisms carrying nagAc-like genes in the biodegradation of naphthalene. PMID:15240274
High-resolution mapping and sequence analysis of 597 cDNA clones transcribed from the 1 Mb region in human chromosome 4q16.3 containing Huntington disease gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hadano, S.; Ishida, Y.; Tomiyasu, H.

1994-09-01

To complete a transcription map of the 1 Mb region in human chromosome 4p16.3 containing the Huntington disease (HD) gene, the isolation of cDNA clones are being performed throughout. Our method relies on a direct screening of the cDNA libraries probed with single copy microclones from 3 YAC clones spanning 1 Mbp of the HD gene region. AC-DNAs were isolated by a preparative pulsed-field gel electrophoresis, amplified by both a single unique primer (SUP)-PCR and a linker ligation PCR, and 6 microclone-DNA libraries were generated. Then, 8,640 microclones from these libraries were independently amplified by PCR, and arrayed onto themore » membranes. 800-900 microclones that were not cross-hybridized with total human and yeast genomic DNA, TAC vector DNA, and ribosomal cDNA on a dot hybridization (putatively carrying single copy sequences) were pooled to make 9 probe pools. A total of {approximately}1.8x10{sup 7} plaques from the human brain cDNA libraries was screened with 9 pool-probes, and then 672 positive cDNA clones were obtained. So far, 597 cDNA clones were defined and arrayed onto a map of the 1 Mbp of the HD gene region by hybridization with HD region-specific cosmid contigs and YAC clones. Further characterization including a DNA sequencing and Northern blot analysis is currently underway.« less
Prenatal detection of fetal triploidy from cell-free DNA testing in maternal blood.

PubMed

Nicolaides, Kypros H; Syngelaki, Argyro; del Mar Gil, Maria; Quezada, Maria Soledad; Zinevich, Yana

2014-01-01

To investigate potential performance of cell-free DNA (cfDNA) testing in maternal blood in detecting fetal triploidy. Plasma and buffy coat samples obtained at 11-13 weeks' gestation from singleton pregnancies with diandric triploidy (n=4), digynic triploidy (n=4), euploid fetuses (n=48) were sent to Natera, Inc. (San Carlos, Calif., USA) for cfDNA testing. Multiplex polymerase chain reaction amplification of cfDNA followed by sequencing of single nucleotide polymorphic loci covering chromosomes 13, 18, 21, X, and Y was performed. Sequencing data were analyzed using the NATUS algorithm which identifies copy number for each of the five chromosomes. cfDNA testing provided a result in 44 (91.7%) of the 48 euploid cases and correctly predicted the fetal sex and the presence of two copies each of chromosome 21, 18 and 13. In diandric triploidy, cfDNA testing identified multiple paternal haplotypes (indicating fetal trisomy 21, trisomy 18 and trisomy 13) suggesting the presence of either triploidy or dizygotic twins. In digynic triploidy the fetal fraction corrected for maternal weight and gestational age was below the 0.5th percentile. cfDNA testing by targeted sequencing and allelic ratio analysis of single nucleotide polymorphisms covering chromosomes 21, 18, 13, X, and Y can detect diandric triploidy and raise the suspicion of digynic triploidy. © 2013 S. Karger AG, Basel.
Ribosomal DNA copy loss and repeat instability in ATRX-mutated cancers

PubMed Central

Udugama, Maheshi; Sanij, Elaine; Voon, Hsiao P. J.; Son, Jinbae; Hii, Linda; Henson, Jeremy D.; Chan, F. Lyn; Chang, Fiona T. M.; Liu, Yumei; Pearson, Richard B.; Kalitsis, Paul; Mann, Jeffrey R.; Collas, Philippe; Hannan, Ross D.; Wong, Lee H.

2018-01-01

ATRX (alpha thalassemia/mental retardation X-linked) complexes with DAXX to deposit histone variant H3.3 into repetitive heterochromatin. Recent genome sequencing studies in cancers have revealed mutations in ATRX and their association with ALT (alternative lengthening of telomeres) activation. Here we report depletion of ATRX in mouse ES cells leads to selective loss in ribosomal RNA gene (rDNA) copy number. Supporting this, ATRX-mutated human ALT-positive tumors also show a substantially lower rDNA copy than ALT-negative tumors. Further investigation shows that the rDNA copy loss and repeat instability are caused by a disruption in H3.3 deposition and thus a failure in heterochromatin formation at rDNA repeats in the absence of ATRX. We also find that ATRX-depleted cells are reduced in ribosomal RNA transcription output and show increased sensitivity to RNA polymerase I (Pol I) transcription inhibitor CX5461. In addition, human ALT-positive cancer cell lines are also more sensitive to CX5461 treatment. Our study provides insights into the contribution of ATRX loss of function to tumorigenesis through the loss of rDNA stability and suggests the therapeutic potential of targeting Pol I transcription in ALT cancers. PMID:29669917
Colorimetric molecular diagnosis of the HIV gag gene using DNAzyme and a complementary DNA-extended primer.

PubMed

Kim, Seong U; Batule, Bhagwan S; Mun, Hyoyoung; Byun, Ju-Young; Shim, Won-Bo; Kim, Min-Gon

2018-02-07

We have developed a novel strategy for the colorimetric detection of PCR products by utilizing a target-specific primer modified at the 5'-end with an anti-DNAzyme sequence. A single-stranded DNAzyme sequence folds into a G-quadruplex structure with hemin and shows strong peroxidase activity. When the complementary strand binds to the DNAzyme sequence, it blocks the formation of the G-quadraduplex structure and loses its peroxidase activity. In the presence of the target gene, PCR amplification proceeds, and anti-DNAzyme sequence modified primers present in the reaction mixture form a double strand through primer extension. Therefore, it does not block the DNAzyme sequence. Further, a colorimetric signal is generated by the addition of 2,2'-azino-bis(3-ethylbenzothiazoline-6-sulfonate) (ABTS) and H 2 O 2 at the end of the reaction. We have successfully detected a single copy of the HIV type 1 gag gene in buffer and 10 copies in human serum. The strategy developed could be used to detect DNA and RNA in complex biological samples by simple primer designing that includes DNAzyme and a DNA extended primer.
Partial bisulfite conversion for unique template sequencing.

PubMed

Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael; Levy, Dan

2018-01-25

We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Evolutionary Origin of OwlRep, a Megasatellite DNA Associated with Adaptation of Owl Monkeys to Nocturnal Lifestyle

PubMed Central

Nishihara, Hidenori; Stanyon, Roscoe; Kusumi, Junko; Hirai, Hirohisa

2018-01-01

Abstract Rod cells of many nocturnal mammals have a “non-standard” nuclear architecture, which is called the inverted nuclear architecture. Heterochromatin localizes to the central region of the nucleus. This leads to an efficient light transmission to the outer segments of photoreceptors. Rod cells of diurnal mammals have the conventional nuclear architecture. Owl monkeys (genus Aotus) are the only taxon of simian primates that has a nocturnal or cathemeral lifestyle, and this adaptation is widely thought to be secondary. Their rod cells were shown to exhibit an intermediate chromatin distribution: a spherical heterochromatin block was found in the central region of the nucleus although it was less complete than that of typical nocturnal mammals. We recently demonstrated that the primary DNA component of this heterochromatin block was OwlRep, a megasatellite DNA consisting of 187-bp-long repeat units. However, the origin of OwlRep was not known. Here we show that OwlRep was derived from HSAT6, a simple repeat sequence found in the centromere regions of human chromosomes. HSAT6 occurs widely in primates, suggesting that it was already present in the last common ancestor of extant primates. Notably, Strepsirrhini and Tarsiformes apparently carry a single HSAT6 copy, whereas many species of Simiiformes contain multiple copies. Comparison of nucleotide sequences of these copies revealed the entire process of the OwlRep formation. HSAT6, with or without flanking sequences, was segmentally duplicated in New World monkeys. Then, in the owl monkey linage after its divergence from other New World monkeys, a copy of HSAT6 was tandemly amplified, eventually forming a megasatellite DNA. PMID:29294004
Atypical fibroxanthoma and pleomorphic dermal sarcoma harbor frequent NOTCH1/2 and FAT1 mutations and similar DNA copy number alteration profiles.

PubMed

Griewank, Klaus G; Wiesner, Thomas; Murali, Rajmohan; Pischler, Carina; Müller, Hansgeorg; Koelsche, Christian; Möller, Inga; Franklin, Cindy; Cosgarea, Ioana; Sucker, Antje; Schadendorf, Dirk; Schaller, Jörg; Horn, Susanne; Brenn, Thomas; Mentzel, Thomas

2018-03-01

Atypical fibroxanthomas and pleomorphic dermal sarcomas are tumors arising in sun-damaged skin of elderly patients. They have differing prognoses and are currently distinguished using histological criteria, such as invasion of deeper tissue structures, necrosis and lymphovascular or perineural invasion. To investigate the as-yet poorly understood genetics of these tumors, 41 atypical fibroxanthomas and 40 pleomorphic dermal sarcomas were subjected to targeted next-generation sequencing approaches as well as DNA copy number analysis by comparative genomic hybridization. In an analysis of the entire coding region of 341 oncogenes and tumor suppressor genes in 13 atypical fibroxanthomas using an established hybridization-based next-generation sequencing approach, we found that these tumors harbor a large number of mutations. Gene alterations were identified in more than half of the analyzed samples in FAT1, NOTCH1/2, CDKN2A, TP53, and the TERT promoter. The presence of these alterations was verified in 26 atypical fibroxanthoma and 35 pleomorphic dermal sarcoma samples by targeted amplicon-based next-generation sequencing. Similar mutation profiles in FAT1, NOTCH1/2, CDKN2A, TP53, and the TERT promoter were identified in both atypical fibroxanthoma and pleomorphic dermal sarcoma. Activating RAS mutations (G12 and G13) identified in 3 pleomorphic dermal sarcoma were not found in atypical fibroxanthoma. Comprehensive DNA copy number analysis demonstrated a wide array of different copy number gains and losses, with similar profiles in atypical fibroxanthoma and pleomorphic dermal sarcoma. In summary, atypical fibroxanthoma and pleomorphic dermal sarcoma are highly mutated tumors with recurrent mutations in FAT1, NOTCH1/2, CDKN2A, TP53, and the TERT promoter, and a range of DNA copy number alterations. These findings suggest that atypical fibroxanthomas and pleomorphic dermal sarcomas are genetically related, potentially representing two ends of a common tumor spectrum and distinguishing these entities is at present still best performed using histological criteria.
Dead Element Replicating: Degenerate R2 Element Replication and rDNA Genomic Turnover in the Bacillus rossius Stick Insect (Insecta: Phasmida)

PubMed Central

Martoni, Francesco; Eickbush, Danna G.; Scavariello, Claudia; Luchetti, Andrea; Mantovani, Barbara

2015-01-01

R2 is an extensively investigated non-LTR retrotransposon that specifically inserts into the 28S rRNA gene sequences of a wide range of metazoans, disrupting its functionality. During R2 integration, first strand synthesis can be incomplete so that 5’ end deleted copies are occasionally inserted. While active R2 copies repopulate the locus by retrotransposing, the non-functional truncated elements should frequently be eliminated by molecular drive processes leading to the concerted evolution of the rDNA array(s). Although, multiple R2 lineages have been discovered in the genome of many animals, the rDNA of the stick insect Bacillus rossius exhibits a peculiar situation: it harbors both a canonical, functional R2 element (R2Brfun) as well as a full-length but degenerate element (R2Brdeg). An intensive sequencing survey in the present study reveals that all truncated variants in stick insects are present in multiple copies suggesting they were duplicated by unequal recombination. Sequencing results also demonstrate that all R2Brdeg copies are full-length, i. e. they have no associated 5' end deletions, and functional assays indicate they have lost the active ribozyme necessary for R2 RNA maturation. Although it cannot be completely ruled out, it seems unlikely that the degenerate elements replicate via reverse transcription, exploiting the R2Brfun element enzymatic machinery, but rather via genomic amplification of inserted 28S by unequal recombination. That inactive copies (both R2Brdeg or 5'-truncated elements) are not eliminated in a short term in stick insects contrasts with findings for the Drosophila R2, suggesting a widely different management of rDNA loci and a lower efficiency of the molecular drive while achieving the concerted evolution. PMID:25799008
Development of a chemiluminescence competitive PCR for the detection and quantification of parvovirus B19 DNA using a microplate luminometer.

PubMed

Fini, F; Gallinella, G; Girotti, S; Zerbini, M; Musiani, M

1999-09-01

Quantitative PCR of viral nucleic acids can be useful clinically in diagnosis, risk assessment, and monitoring of antiviral therapy. We wished to develop a chemiluminescence competitive PCR (cPCR) for parvovirus B19. Parvovirus DNA target sequences and competitor sequences were coamplified and directly labeled. Amplified products were then separately hybridized by specific biotin-labeled probes, captured onto streptavidin-coated ELISA microplates, and detected immunoenzymatically using chemiluminescent substrates of peroxidase. Chemiluminescent signals were quantitatively analyzed by a microplate luminometer and were correlated to the amounts of amplified products. Luminol-based systems displayed constant emission but had a higher detection limit (100-1000 genome copies) than the acridan-based system (20 genome copies). The detection limit of chemiluminescent substrates was lower (20 genome copies) than colorimetric substrates (50 genome copies). In chemiluminescence cPCR, the titration curves showed linear correlation above 100 target genome copies. Chemiluminescence cPCR was positive in six serum samples from patients with parvovirus infections and negative in six control sera. The chemiluminescence cPCR appears to be a sensitive and specific method for the quantitative detection of viral DNAs.
The complete chloroplast genome sequence of the chlorophycean green alga Scenedesmus obliquus reveals a compact gene organization and a biased distribution of genes on the two DNA strands

PubMed Central

de Cambiaire, Jean-Charles; Otis, Christian; Lemieux, Claude; Turmel, Monique

2006-01-01

Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. While the basal position of the Prasinophyceae is well established, the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae (UTC) remains uncertain. The five complete chloroplast DNA (cpDNA) sequences currently available for representatives of these classes display considerable variability in overall structure, gene content, gene density, intron content and gene order. Among these genomes, that of the chlorophycean green alga Chlamydomonas reinhardtii has retained the least ancestral features. The two single-copy regions, which are separated from one another by the large inverted repeat (IR), have similar sizes, rather than unequal sizes, and differ radically in both gene contents and gene organizations relative to the single-copy regions of prasinophyte and ulvophyte cpDNAs. To gain insights into the various changes that underwent the chloroplast genome during the evolution of chlorophycean green algae, we have sequenced the cpDNA of Scenedesmus obliquus, a member of a distinct chlorophycean lineage. Results The 161,452 bp IR-containing genome of Scenedesmus features single-copy regions of similar sizes, encodes 96 genes, i.e. only two additional genes (infA and rpl12) relative to its Chlamydomonas homologue and contains seven group I and two group II introns. It is clearly more compact than the four UTC algal cpDNAs that have been examined so far, displays the lowest proportion of short repeats among these algae and shows a stronger bias in clustering of genes on the same DNA strand compared to Chlamydomonas cpDNA. Like the latter genome, Scenedesmus cpDNA displays only a few ancestral gene clusters. The two chlorophycean genomes share 11 gene clusters that are not found in previously sequenced trebouxiophyte and ulvophyte cpDNAs as well as a few genes that have an unusual structure; however, their single-copy regions differ considerably in gene content. Conclusion Our results underscore the remarkable plasticity of the chlorophycean chloroplast genome. Owing to this plasticity, only a sketchy portrait could be drawn for the chloroplast genome of the last common ancestor of Scenedesmus and Chlamydomonas. PMID:16638149
Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

PubMed Central

Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

2015-01-01

DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576
Haplotype phasing and inheritance of copy number variants in nuclear families.

PubMed

Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

2015-01-01

DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.
Single-cell paired-end genome sequencing reveals structural variation per cell cycle

PubMed Central

Voet, Thierry; Kumar, Parveen; Van Loo, Peter; Cooke, Susanna L.; Marshall, John; Lin, Meng-Lay; Zamani Esteki, Masoud; Van der Aa, Niels; Mateiu, Ligia; McBride, David J.; Bignell, Graham R.; McLaren, Stuart; Teague, Jon; Butler, Adam; Raine, Keiran; Stebbings, Lucy A.; Quail, Michael A.; D’Hooghe, Thomas; Moreau, Yves; Futreal, P. Andrew; Stratton, Michael R.; Vermeesch, Joris R.; Campbell, Peter J.

2013-01-01

The nature and pace of genome mutation is largely unknown. Because standard methods sequence DNA from populations of cells, the genetic composition of individual cells is lost, de novo mutations in cells are concealed within the bulk signal and per cell cycle mutation rates and mechanisms remain elusive. Although single-cell genome analyses could resolve these problems, such analyses are error-prone because of whole-genome amplification (WGA) artefacts and are limited in the types of DNA mutation that can be discerned. We developed methods for paired-end sequence analysis of single-cell WGA products that enable (i) detecting multiple classes of DNA mutation, (ii) distinguishing DNA copy number changes from allelic WGA-amplification artefacts by the discovery of matching aberrantly mapping read pairs among the surfeit of paired-end WGA and mapping artefacts and (iii) delineating the break points and architecture of structural variants. By applying the methods, we capture DNA copy number changes acquired over one cell cycle in breast cancer cells and in blastomeres derived from a human zygote after in vitro fertilization. Furthermore, we were able to discover and fine-map a heritable inter-chromosomal rearrangement t(1;16)(p36;p12) by sequencing a single blastomere. The methods will expedite applications in basic genome research and provide a stepping stone to novel approaches for clinical genetic diagnosis. PMID:23630320

Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

PubMed Central

2011-01-01

Background Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. Conclusions The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first step in the development of a community resource for further study of plant-insect co-evolution, anti-herbivore defense, floral developmental genetics, reproductive biology, chemical evolution, population genetics, and comparative genomics using milkweeds, and A. syriaca in particular, as ecological and evolutionary models. PMID:21542930
Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing.

PubMed

Straub, Shannon C K; Fishbein, Mark; Livshultz, Tatyana; Foster, Zachary; Parks, Matthew; Weitemier, Kevin; Cronn, Richard C; Liston, Aaron

2011-05-04

Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first step in the development of a community resource for further study of plant-insect co-evolution, anti-herbivore defense, floral developmental genetics, reproductive biology, chemical evolution, population genetics, and comparative genomics using milkweeds, and A. syriaca in particular, as ecological and evolutionary models.
Event-specific qualitative and quantitative PCR detection of the GMO carnation (Dianthus caryophyllus) variety Moonlite based upon the 5'-transgene integration sequence.

PubMed

Li, P; Jia, J W; Jiang, L X; Zhu, H; Bai, L; Wang, J B; Tang, X M; Pan, A H

2012-04-27

To ensure the implementation of genetically modified organism (GMO)-labeling regulations, an event-specific detection method was developed based on the junction sequence of an exogenous integrant in the transgenic carnation variety Moonlite. The 5'-transgene integration sequence was isolated by thermal asymmetric interlaced PCR. Based upon the 5'-transgene integration sequence, the event-specific primers and TaqMan probe were designed to amplify the fragments, which spanned the exogenous DNA and carnation genomic DNA. Qualitative and quantitative PCR assays were developed employing the designed primers and probe. The detection limit of the qualitative PCR assay was 0.05% for Moonlite in 100 ng total carnation genomic DNA, corresponding to about 79 copies of the carnation haploid genome; the limit of detection and quantification of the quantitative PCR assay were estimated to be 38 and 190 copies of haploid carnation genomic DNA, respectively. Carnation samples with different contents of genetically modified components were quantified and the bias between the observed and true values of three samples were lower than the acceptance criterion (<25%) of the GMO detection method. These results indicated that these event-specific methods would be useful for the identification and quantification of the GMO carnation Moonlite.
Template-Directed Copolymerization, Random Walks along Disordered Tracks, and Fractals

NASA Astrophysics Data System (ADS)

Gaspard, Pierre

2016-12-01

In biology, template-directed copolymerization is the fundamental mechanism responsible for the synthesis of DNA, RNA, and proteins. More than 50 years have passed since the discovery of DNA structure and its role in coding genetic information. Yet, the kinetics and thermodynamics of information processing in DNA replication, transcription, and translation remain poorly understood. Challenging issues are the facts that DNA or RNA sequences constitute disordered media for the motion of polymerases or ribosomes while errors occur in copying the template. Here, it is shown that these issues can be addressed and sequence heterogeneity effects can be quantitatively understood within a framework revealing universal aspects of information processing at the molecular scale. In steady growth regimes, the local velocities of polymerases or ribosomes along the template are distributed as the continuous or fractal invariant set of a so-called iterated function system, which determines the copying error probabilities. The growth may become sublinear in time with a scaling exponent that can also be deduced from the iterated function system.
Repetitive DNA in the pea (Pisum sativum L.) genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula

PubMed Central

Macas, Jiří; Neumann, Pavel; Navrátilová, Alice

2007-01-01

Background Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum). Results Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula. Conclusion We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. These data provide a starting point for further investigations of legume plant genomes based on their global comparative analysis and for the development of more sophisticated approaches for data mining. PMID:18031571
A Children's Oncology Group and TARGET initiative exploring the genetic landscape of Wilms tumor.

PubMed

Gadd, Samantha; Huff, Vicki; Walz, Amy L; Ooms, Ariadne H A G; Armstrong, Amy E; Gerhard, Daniela S; Smith, Malcolm A; Auvil, Jaime M Guidry; Meerzaman, Daoud; Chen, Qing-Rong; Hsu, Chih Hao; Yan, Chunhua; Nguyen, Cu; Hu, Ying; Hermida, Leandro C; Davidsen, Tanja; Gesuwan, Patee; Ma, Yussanne; Zong, Zusheng; Mungall, Andrew J; Moore, Richard A; Marra, Marco A; Dome, Jeffrey S; Mullighan, Charles G; Ma, Jing; Wheeler, David A; Hampton, Oliver A; Ross, Nicole; Gastier-Foster, Julie M; Arold, Stefan T; Perlman, Elizabeth J

2017-10-01

We performed genome-wide sequencing and analyzed mRNA and miRNA expression, DNA copy number, and DNA methylation in 117 Wilms tumors, followed by targeted sequencing of 651 Wilms tumors. In addition to genes previously implicated in Wilms tumors (WT1, CTNNB1, AMER1, DROSHA, DGCR8, XPO5, DICER1, SIX1, SIX2, MLLT1, MYCN, and TP53), we identified mutations in genes not previously recognized as recurrently involved in Wilms tumors, the most frequent being BCOR, BCORL1, NONO, MAX, COL6A3, ASXL1, MAP3K4, and ARID1A. DNA copy number changes resulted in recurrent 1q gain, MYCN amplification, LIN28B gain, and MIRLET7A loss. Unexpected germline variants involved PALB2 and CHEK2. Integrated analyses support two major classes of genetic changes that preserve the progenitor state and/or interrupt normal development.
A remark on copy number variation detection methods.

PubMed

Li, Shuo; Dou, Xialiang; Gao, Ruiqi; Ge, Xinzhou; Qian, Minping; Wan, Lin

2018-01-01

Copy number variations (CNVs) are gain and loss of DNA sequence of a genome. High throughput platforms such as microarrays and next generation sequencing technologies (NGS) have been applied for genome wide copy number losses. Although progress has been made in both approaches, the accuracy and consistency of CNV calling from the two platforms remain in dispute. In this study, we perform a deep analysis on copy number losses on 254 human DNA samples, which have both SNP microarray data and NGS data publicly available from Hapmap Project and 1000 Genomes Project respectively. We show that the copy number losses reported from Hapmap Project and 1000 Genome Project only have < 30% overlap, while these reports are required to have cross-platform (e.g. PCR, microarray and high-throughput sequencing) experimental supporting by their corresponding projects, even though state-of-art calling methods were employed. On the other hand, copy number losses are found directly from HapMap microarray data by an accurate algorithm, i.e. CNVhac, almost all of which have lower read mapping depth in NGS data; furthermore, 88% of which can be supported by the sequences with breakpoint in NGS data. Our results suggest the ability of microarray calling CNVs and the possible introduction of false negatives from the unessential requirement of the additional cross-platform supporting. The inconsistency of CNV reports from Hapmap Project and 1000 Genomes Project might result from the inadequate information containing in microarray data, the inconsistent detection criteria, or the filtration effect of cross-platform supporting. The statistical test on CNVs called from CNVhac show that the microarray data can offer reliable CNV reports, and majority of CNV candidates can be confirmed by raw sequences. Therefore, the CNV candidates given by a good caller could be highly reliable without cross-platform supporting, so additional experimental information should be applied in need instead of necessarily.
Virus-specific DNA sequences present in cells which carry the herpes simplex virus thymidine kinase gene.

PubMed

Minson, A C; Darby, G K; Wildy, P

1979-11-01

Two independently derived cell lines which carry the herpes simplex type 2 thymidine kinase gene have been examined for the presence of HSV-2-specific DNA sequences. Both cell lines contained 1 to 3 copies per cell of a sequence lying within map co-ordinates 0.2 to 0.4 of the HSV-2 genome. Revertant cells, which contained no detectable thymidine kinase, did not contain this DNA sequence. The failure of EcoR1-restricted HSV-2 DNA to act as a donor of the thymidine kinase gene in transformation experiments suggests that the gene lies close to the EcoR1 restriction site within this sequence at a map position of approx. 0.3. The HSV-2 kinase gene is therefore approximately co-linear with the HSV-1 gene.
Intragenomic polymorphisms among high-copy loci: a genus-wide study of nuclear ribosomal DNA in Asclepias (Apocynaceae).

PubMed

Weitemier, Kevin; Straub, Shannon C K; Fishbein, Mark; Liston, Aaron

2015-01-01

Despite knowledge that concerted evolution of high-copy loci is often imperfect, studies that investigate the extent of intragenomic polymorphisms and comparisons across a large number of species are rarely made. We present a bioinformatic pipeline for characterizing polymorphisms within an individual among copies of a high-copy locus. Results are presented for nuclear ribosomal DNA (nrDNA) across the milkweed genus, Asclepias. The 18S-26S portion of the nrDNA cistron of Asclepias syriaca served as a reference for assembly of the region from 124 samples representing 90 species of Asclepias. Reads were mapped back to each individual's consensus and at each position reads differing from the consensus were tallied using a custom perl script. Low frequency polymorphisms existed in all individuals (mean = 5.8%). Most nrDNA positions (91%) were polymorphic in at least one individual, with polymorphic sites being less frequent in subunit regions and loops. Highly polymorphic sites existed in each individual, with highest abundance in the "noncoding" ITS regions. Phylogenetic signal was present in the distribution of intragenomic polymorphisms across the genus. Intragenomic polymorphisms in nrDNA are common in Asclepias, being found at higher frequency than any other study to date. The high and variable frequency of polymorphisms across species highlights concerns that phylogenetic applications of nrDNA may be error-prone. The new analytical approach provided here is applicable to other taxa and other high-copy regions characterized by low coverage genome sequencing (genome skimming).
Cigarette smoking and hOGG1 Ser326Cys polymorphism are associated with 8-OHdG accumulation on mitochondrial DNA in thoracic esophageal squamous cell carcinoma.

PubMed

Lin, Chen-Sung; Wang, Liang-Shun; Chou, Teh-Ying; Hsu, Wen-Hu; Lin, Hui-Chen; Lee, Shu-Yu; Lee, Mau-Hua; Chang, Shi-Chuan; Wei, Yau-Huei

2013-12-01

We examined whether cigarette smoking affects the degrees of oxidative damage (8-hydroxyl-2'-deoxyguanosine [8-OHdG]) on mitochondrial DNA (mtDNA), whether the degree of 8-OHdG accumulation on mtDNA is related to the increased total mtDNA copy number, and whether human 8-oxoguanine DNA glycosylase 1 (hOGG1) Ser326Cys polymorphisms affect the degrees of 8-OHdG accumulation on mtDNA in thoracic esophageal squamous cell carcinoma (TESCC). DNA extracted from microdissected tissues of paired noncancerous esophageal muscles, noncancerous esophageal mucosa, and cancerous TESCC nests (n = 74) along with metastatic lymph nodes (n = 38) of 74 TESCC patients was analyzed. Both the mtDNA copy number and mtDNA integrity were analyzed by quantitative real-time polymerase chain reaction (PCR). The hOGG1 Ser326Cys polymorphisms were identified by restriction fragment length polymorphism PCR and PCR-based direct sequencing. Among noncancerous esophageal mucosa, cancerous TESCC nests, and metastatic lymph nodes, the mtDNA integrity decreased (95.2 to 47.9 to 18.6 %; P < 0.001) and the mtDNA copy number disproportionally increased (0.163 to 0.204 to 0.207; P = 0.026). In TESCC, higher indexes of cigarette smoking (0, 0-20, 20-40, and >40 pack-years) were related to an advanced pathologic N category (P = 0.038), elevated mtDNA copy number (P = 0.013), higher mtDNA copy ratio (P = 0.028), and increased mtDNA integrity (P = 0.069). The TESCC mtDNA integrity in patients with Ser/Ser, Ser/Cys, and Cys/Cys hOGG1 variants decreased stepwise from 65.2 to 52.1 to 41.3 % (P = 0.051). Elevated 8-OHdG accumulations on mtDNA in TESCC were observed. Such accumulations were associated with a compensatory increase in total mtDNA copy number, indexes of cigarette smoking, and hOGG1 Ser326Cys polymorphisms.
The complete chloroplast DNA sequence of the green alga Oltmannsiellopsis viridis reveals a distinctive quadripartite architecture in the chloroplast genome of early diverging ulvophytes

PubMed Central

Pombert, Jean-François; Lemieux, Claude; Turmel, Monique

2006-01-01

Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. The basal position of the Prasinophyceae has been well documented, but the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae is currently debated. The four complete chloroplast DNA (cpDNA) sequences presently available for representatives of these classes have revealed extensive variability in overall structure, gene content, intron composition and gene order. The chloroplast genome of Pseudendoclonium (Ulvophyceae), in particular, is characterized by an atypical quadripartite architecture that deviates from the ancestral type by a large inverted repeat (IR) featuring an inverted rRNA operon and a small single-copy (SSC) region containing 14 genes normally found in the large single-copy (LSC) region. To gain insights into the nature of the events that led to the reorganization of the chloroplast genome in the Ulvophyceae, we have determined the complete cpDNA sequence of Oltmannsiellopsis viridis, a representative of a distinct, early diverging lineage. Results The 151,933 bp IR-containing genome of Oltmannsiellopsis differs considerably from Pseudendoclonium and other chlorophyte cpDNAs in intron content and gene order, but shares close similarities with its ulvophyte homologue at the levels of quadripartite architecture, gene content and gene density. Oltmannsiellopsis cpDNA encodes 105 genes, contains five group I introns, and features many short dispersed repeats. As in Pseudendoclonium cpDNA, the rRNA genes in the IR are transcribed toward the single copy region featuring the genes typically found in the ancestral LSC region, and the opposite single copy region harbours genes characteristic of both the ancestral SSC and LSC regions. The 52 genes that were transferred from the ancestral LSC to SSC region include 12 of those observed in Pseudendoclonium cpDNA. Surprisingly, the overall gene organization of Oltmannsiellopsis cpDNA more closely resembles that of Chlorella (Trebouxiophyceae) cpDNA. Conclusion The chloroplast genome of the last common ancestor of Oltmannsiellopsis and Pseudendoclonium contained a minimum of 108 genes, carried only a few group I introns, and featured a distinctive quadripartite architecture. Numerous changes were experienced by the chloroplast genome in the lineages leading to Oltmannsiellopsis and Pseudendoclonium. Our comparative analyses of chlorophyte cpDNAs support the notion that the Ulvophyceae is sister to the Chlorophyceae. PMID:16472375
DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

PubMed Central

Palzkill, T G; Oliver, S G; Newlon, C S

1986-01-01

Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for 3' conserved sequence in promoting ARS activity. PMID:3529036
Amplification of the entire kanamycin biosynthetic gene cluster during empirical strain improvement of Streptomyces kanamyceticus.

PubMed

Yanai, Koji; Murakami, Takeshi; Bibb, Mervyn

2006-06-20

Streptomyces kanamyceticus 12-6 is a derivative of the wild-type strain developed for industrial kanamycin (Km) production. Southern analysis and DNA sequencing revealed amplification of a large genomic segment including the entire Km biosynthetic gene cluster in the chromosome of strain 12-6. At 145 kb, the amplifiable unit of DNA (AUD) is the largest AUD reported in Streptomyces. Striking repetitive DNA sequences belonging to the clustered regularly interspaced short palindromic repeats family were found in the AUD and may play a role in its amplification. Strain 12-6 contains a mixture of different chromosomes with varying numbers of AUDs, sometimes exceeding 36 copies and producing an amplified region >5.7 Mb. The level of Km production depended on the copy number of the Km biosynthetic gene cluster, suggesting that DNA amplification occurred during strain improvement as a consequence of selection for increased Km resistance. Amplification of DNA segments including entire antibiotic biosynthetic gene clusters might be a common mechanism leading to increased antibiotic production in industrial strains.
Cross-subtype Detection of HIV-1 Using Reverse Transcription and Recombinase Polymerase Amplification

PubMed Central

Lillis, Lorraine; Lehman, Dara A.; Siverson, Joshua B.; Weis, Julie; Cantera, Jason; Parker, Mathew; Piepenburg, Olaf; Overbaugh, Julie; Boyle, David S.

2016-01-01

A low complexity diagnostic test that rapidly and reliably detects HIV infection in infants at the point of care could facilitate early treatment, improving outcomes. However, many infant HIV diagnostics can only be performed in laboratory settings. Recombinase polymerase amplification (RPA) is an isothermal amplification technology that can rapidly amplify proviral DNA from multiple subtypes of HIV-1 in under twenty minutes without complex equipment. In this study we added reverse transcription (RT) to RPA to allow detection of both HIV-1 RNA and DNA. We show that this RT-RPA HIV-1 assay has a limit of detection of 10 to 30 copies of an exact sequence matched DNA or RNA, respectively. In addition, at 100 copies of RNA or DNA, the assay detected 171 of 175 (97.7 %) sequence variants that represent all the major subtypes and recombinant forms of HIV-1 Groups M and O. This data suggests that the application of RT-RPA for the combined detection of HIV-1 viral RNA and proviral DNA may prove a highly sensitive tool for rapid and accurate diagnosis of infant HIV. PMID:26821087
Application and comparison of large-scale solution-based DNA capture-enrichment methods on ancient DNA

PubMed Central

Ávila-Arcos, María C.; Cappellini, Enrico; Romero-Navarro, J. Alberto; Wales, Nathan; Moreno-Mayar, J. Víctor; Rasmussen, Morten; Fordyce, Sarah L.; Montiel, Rafael; Vielle-Calzada, Jean-Philippe; Willerslev, Eske; Gilbert, M. Thomas P.

2011-01-01

The development of second-generation sequencing technologies has greatly benefitted the field of ancient DNA (aDNA). Its application can be further exploited by the use of targeted capture-enrichment methods to overcome restrictions posed by low endogenous and contaminating DNA in ancient samples. We tested the performance of Agilent's SureSelect and Mycroarray's MySelect in-solution capture systems on Illumina sequencing libraries built from ancient maize to identify key factors influencing aDNA capture experiments. High levels of clonality as well as the presence of multiple-copy sequences in the capture targets led to biases in the data regardless of the capture method. Neither method consistently outperformed the other in terms of average target enrichment, and no obvious difference was observed either when two tiling designs were compared. In addition to demonstrating the plausibility of capturing aDNA from ancient plant material, our results also enable us to provide useful recommendations for those planning targeted-sequencing on aDNA. PMID:22355593
The dynamics of genome replication using deep sequencing

PubMed Central

Müller, Carolin A.; Hawkins, Michelle; Retkute, Renata; Malla, Sunir; Wilson, Ray; Blythe, Martin J.; Nakato, Ryuichiro; Komata, Makiko; Shirahige, Katsuhiko; de Moura, Alessandro P.S.; Nieduszynski, Conrad A.

2014-01-01

Eukaryotic genomes are replicated from multiple DNA replication origins. We present complementary deep sequencing approaches to measure origin location and activity in Saccharomyces cerevisiae. Measuring the increase in DNA copy number during a synchronous S-phase allowed the precise determination of genome replication. To map origin locations, replication forks were stalled close to their initiation sites; therefore, copy number enrichment was limited to origins. Replication timing profiles were generated from asynchronous cultures using fluorescence-activated cell sorting. Applying this technique we show that the replication profiles of haploid and diploid cells are indistinguishable, indicating that both cell types use the same cohort of origins with the same activities. Finally, increasing sequencing depth allowed the direct measure of replication dynamics from an exponentially growing culture. This is the first time this approach, called marker frequency analysis, has been successfully applied to a eukaryote. These data provide a high-resolution resource and methodological framework for studying genome biology. PMID:24089142
Exploring the loblolly pine (Pinus taeda L.) genome by BAC sequencing and Cot analysis.

PubMed

Perera, Dinum; Magbanua, Zenaida V; Thummasuwan, Supaphan; Mukherjee, Dipaloke; Arick, Mark; Chouvarine, Philippe; Nairn, Campbell J; Schmutz, Jeremy; Grimwood, Jane; Dean, Jeffrey F D; Peterson, Daniel G

2018-07-15

Loblolly pine (LP; Pinus taeda L.) is an economically and ecologically important tree in the southeastern U.S. To advance understanding of the loblolly pine (LP; Pinus taeda L.) genome, we sequenced and analyzed 100 BAC clones and performed a Cot analysis. The Cot analysis indicates that the genome is composed of 57, 24, and 10% highly-repetitive, moderately-repetitive, and single/low-copy sequences, respectively (the remaining 9% of the genome is a combination of fold back and damaged DNA). Although single/low-copy DNA only accounts for 10% of the LP genome, the amount of single/low-copy DNA in LP is still 14 times the size of the Arabidopsis genome. Since gene numbers in LP are similar to those in Arabidopsis, much of the single/low-copy DNA of LP would appear to be composed of DNA that is both gene- and repeat-poor. Macroarrays prepared from a LP bacterial artificial chromosome (BAC) library were hybridized with probes designed from cell wall synthesis/wood development cDNAs, and 50 of the "targeted" clones were selected for further analysis. An additional 25 clones were selected because they contained few repeats, while 25 more clones were selected at random. The 100 BAC clones were Sanger sequenced and assembled. Of the targeted BACs, 80% contained all or part of the cDNA used to target them. One targeted BAC was found to contain fungal DNA and was eliminated from further analysis. Combinations of similarity-based and ab initio gene prediction approaches were utilized to identify and characterize potential coding regions in the 99 BACs containing LP DNA. From this analysis, we identified 154 gene models (GMs) representing both putative protein-coding genes and likely pseudogenes. Ten of the GMs (all of which were specifically targeted) had enough support to be classified as intact genes. Interestingly, the 154 GMs had statistically indistinguishable (α = 0.05) distributions in the targeted and random BAC clones (15.18 and 12.61 GM/Mb, respectively), whereas the low-repeat BACs contained significantly fewer GMs (7.08 GM/Mb). However, when GM length was considered, the targeted BACs had a significantly greater percentage of their length in GMs (3.26%) when compared to random (1.63%) and low-repeat (0.62%) BACs. The results of our study provide insight into LP evolution and inform ongoing efforts to produce a reference genome sequence for LP, while characterization of genes involved in cell wall production highlights carbon metabolism pathways that can be leveraged for increasing wood production. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Clones from a shooty tobacco crown gall tumor I: deletions, rearrangements and amplifications resulting in irregular T-DNA structures and organizations.

PubMed

Peerbolte, R; Leenhouts, K; Hooykaas-van Slogteren, G M; Hoge, J H; Wullems, G J; Schilperoort, R A

1986-07-01

Transformed clones from a shooty tobacco crown gall tumor, induced byAgrobacterium tumefaciens strain LBA1501, having a Tn1831 insertion in the auxin locus, were investigated for their T-DNA structure and expression. In addition to clones with the expected phenotype, i.e. phytohormone autonomy, regeneration of non-rooting shoots and octopine synthesis (Aut(+)Reg(+)Ocs(+) 'type I' clones), clones were obtained with an aberrant phenotype. Among these were the Aut(-)Reg(-)Ocs(+) 'type II' clones. Two shooty type I clones and three type II callus clones (all randomly chosen) as well as a rooting shoot regenerated from a type II clone via a high kinetin treatment, all had a T-DNA structure which differed significantly from 'regular' T-DNA structures. No Tn1831 DNA sequences were detected in these clones. The two type I clones were identical: they both contained the same highly truncated T-DNA segments. One TL-DNA segment of approximately 0.7 kb, originating form the left part of the TL-region, was present at one copy per diploid tobacco genome. Another segment with a maximum size of about 7 kb was derived from the right hand part of the TL-region and was present at minimally two copies. Three copies of a truncated TR-DNA segment were detected, probably starting at the right TR-DNA border repeat and ending halfway the regular TR-region. Indications have been obtained that at least some of the T-DNA segments are closely linked, sometimes via intervening plant DNA sequences. The type I clones harbored TL-DNA transcripts 4, 6a/b and 3 as well as TR-DNA transcript 0'. The type II clones harbored three to six highly truncated T-DNA segments, originating from the right part of the TL-region. In addition they had TR-DNA segments, similar to those of the type I clones. On Northern blots TR-DNA transcripts 0' and 1' were detected as well as the TL-DNA transcripts 3 and 6a/b and an 1800 bp hybrid transcript (tr.Y) containing gene 6b sequences. Possible origins of the observed irregularities in T-DNA structures are discussed in relation to fidelity of transformation of plant cells viaAgrobacterium.
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons

PubMed Central

Pagano, Johanna F.B.; Ensink, Wim A.; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P.; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J.; Dekker, Rob J.

2017-01-01

5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. PMID:28003516
Intra-isolate genome variation in arbuscular mycorrhizal fungi persists in the transcriptome.

PubMed

Boon, E; Zimmerman, E; Lang, B F; Hijri, M

2010-07-01

Arbuscular mycorrhizal fungi (AMF) are heterokaryotes with an unusual genetic makeup. Substantial genetic variation occurs among nuclei within a single mycelium or isolate. AMF reproduce through spores that contain varying fractions of this heterogeneous population of nuclei. It is not clear whether this genetic variation on the genome level actually contributes to the AMF phenotype. To investigate the extent to which polymorphisms in nuclear genes are transcribed, we analysed the intra-isolate genomic and cDNA sequence variation of two genes, the large subunit ribosomal RNA (LSU rDNA) of Glomus sp. DAOM-197198 (previously known as G. intraradices) and the POL1-like sequence (PLS) of Glomus etunicatum. For both genes, we find high sequence variation at the genome and transcriptome level. Reconstruction of LSU rDNA secondary structure shows that all variants are functional. Patterns of PLS sequence polymorphism indicate that there is one functional gene copy, PLS2, which is preferentially transcribed, and one gene copy, PLS1, which is a pseudogene. This is the first study that investigates AMF intra-isolate variation at the transcriptome level. In conclusion, it is possible that, in AMF, multiple nuclear genomes contribute to a single phenotype.

Chemical synthesis and characterization of branched oligodeoxyribonucleotides (bDNA) for use as signal amplifiers in nucleic acid quantification assays.

PubMed

Horn, T; Chang, C A; Urdea, M S

1997-12-01

The divergent synthesis of bDNA structures is described. This new type of branched DNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branching network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb molecules were assembled on a solid support using parameters optimized for bDNA synthesis. The chemistry was used to synthesize bDNA comb molecules containing 15 secondary sequences. The bDNA comb molecules were elaborated by enzymatic ligation into branched amplification multimers, large bDNA molecules (a total of 1068 nt) containing an average of 36 repeated DNA oligomer sequences, each capable of hybridizing specifically to an alkaline phosphatase-labeled oligonucleotide. The bDNA comb molecules were characterized by electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The branched amplification multimers have been used as signal amplifiers in nucleic acid quantification assays for detection of viral infection. It is possible to detect as few as 50 molecules with bDNA technology.
Chemical synthesis and characterization of branched oligodeoxyribonucleotides (bDNA) for use as signal amplifiers in nucleic acid quantification assays.

PubMed Central

Horn, T; Chang, C A; Urdea, M S

1997-01-01

The divergent synthesis of bDNA structures is described. This new type of branched DNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branching network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb molecules were assembled on a solid support using parameters optimized for bDNA synthesis. The chemistry was used to synthesize bDNA comb molecules containing 15 secondary sequences. The bDNA comb molecules were elaborated by enzymatic ligation into branched amplification multimers, large bDNA molecules (a total of 1068 nt) containing an average of 36 repeated DNA oligomer sequences, each capable of hybridizing specifically to an alkaline phosphatase-labeled oligonucleotide. The bDNA comb molecules were characterized by electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The branched amplification multimers have been used as signal amplifiers in nucleic acid quantification assays for detection of viral infection. It is possible to detect as few as 50 molecules with bDNA technology. PMID:9365266
Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

PubMed

Spielmann, A; Stutz, E

1983-10-25

The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.
The CentO satellite confers translational and rotational phasing on cenH3 nucleosomes in rice centromeres.

PubMed

Zhang, Tao; Talbert, Paul B; Zhang, Wenli; Wu, Yufeng; Yang, Zujun; Henikoff, Jorja G; Henikoff, Steven; Jiang, Jiming

2013-12-10

Plant and animal centromeres comprise megabases of highly repeated satellite sequences, yet centromere function can be specified epigenetically on single-copy DNA by the presence of nucleosomes containing a centromere-specific variant of histone H3 (cenH3). We determined the positions of cenH3 nucleosomes in rice (Oryza sativa), which has centromeres composed of both the 155-bp CentO satellite repeat and single-copy non-CentO sequences. We find that cenH3 nucleosomes protect 90-100 bp of DNA from micrococcal nuclease digestion, sufficient for only a single wrap of DNA around the cenH3 nucleosome core. cenH3 nucleosomes are translationally phased with 155-bp periodicity on CentO repeats, but not on non-CentO sequences. CentO repeats have an ∼10-bp periodicity in WW dinucleotides and in micrococcal nuclease cleavage, providing evidence for rotational phasing of cenH3 nucleosomes on CentO and suggesting that satellites evolve for translational and rotational stabilization of centromeric nucleosomes.
High copy number of highly similar mariner-like transposons in planarian (Platyhelminthe): evidence for a trans-phyla horizontal transfer.

PubMed

Garcia-Fernàndez, J; Bayascas-Ramírez, J R; Marfany, G; Muñoz-Mármol, A M; Casali, A; Baguñà, J; Saló, E

1995-05-01

Several DNA sequences similar to the mariner element were isolated and characterized in the platyhelminthe Dugesia (Girardia) tigrina. They were 1,288 bp long, flanked by two 32 bp-inverted repeats, and contained a single 339 amino acid open-reading frame (ORF) encoding the transposase. The number of copies of this element is approximately 8,000 per haploid genome, constituting a member of the middle-repetitive DNA of Dugesia tigrina. Sequence analysis of several elements showed a high percentage of conservation between the different copies. Most of them presented an intact ORF and the standard signals of actively expressed genes, which suggests that some of them are or have recently been functional transposons. The high degree of similarity shared with other mariner elements from some arthropods, together with the fact that this element is undetectable in other planarian species, strongly suggests a case of horizontal transfer between these two distant phyla.
Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

PubMed Central

Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

2012-01-01

The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697
Ancient DNA in human bone remains from Pompeii archaeological site.

PubMed

Cipollaro, M; Di Bernardo, G; Galano, G; Galderisi, U; Guarino, F; Angelini, F; Cascino, A

1998-06-29

aDNA extraction and amplification procedures have been optimized for Pompeian human bone remains whose diagenesis has been determined by histological analysis. Single copy genes amplification (X and Y amelogenin loci and Y specific alphoid repeat sequences) have been performed and compared with anthropometric data on sexing.
Genome-Wide Stochastic Adaptive DNA Amplification at Direct and Inverted DNA Repeats in the Parasite Leishmania

PubMed Central

Plourde, Marie; Gingras, Hélène; Roy, Gaétan; Lapointe, Andréanne; Leprohon, Philippe; Papadopoulou, Barbara; Corbeil, Jacques; Ouellette, Marc

2014-01-01

Gene amplification of specific loci has been described in all kingdoms of life. In the protozoan parasite Leishmania, the product of amplification is usually part of extrachromosomal circular or linear amplicons that are formed at the level of direct or inverted repeated sequences. A bioinformatics screen revealed that repeated sequences are widely distributed in the Leishmania genome and the repeats are chromosome-specific, conserved among species, and generally present in low copy number. Using sensitive PCR assays, we provide evidence that the Leishmania genome is continuously being rearranged at the level of these repeated sequences, which serve as a functional platform for constitutive and stochastic amplification (and deletion) of genomic segments in the population. This process is adaptive as the copy number of advantageous extrachromosomal circular or linear elements increases upon selective pressure and is reversible when selection is removed. We also provide mechanistic insights on the formation of circular and linear amplicons through RAD51 recombinase-dependent and -independent mechanisms, respectively. The whole genome of Leishmania is thus stochastically rearranged at the level of repeated sequences, and the selection of parasite subpopulations with changes in the copy number of specific loci is used as a strategy to respond to a changing environment. PMID:24844805
Development and in-house validation of the event-specific polymerase chain reaction detection methods for genetically modified soybean MON89788 based on the cloned integration flanking sequence.

PubMed

Liu, Jia; Guo, Jinchao; Zhang, Haibo; Li, Ning; Yang, Litao; Zhang, Dabing

2009-11-25

Various polymerase chain reaction (PCR) methods were developed for the execution of genetically modified organism (GMO) labeling policies, of which an event-specific PCR detection method based on the flanking sequence of exogenous integration is the primary trend in GMO detection due to its high specificity. In this study, the 5' and 3' flanking sequences of the exogenous integration of MON89788 soybean were revealed by thermal asymmetric interlaced PCR. The event-specific PCR primers and TaqMan probe were designed based upon the revealed 5' flanking sequence, and the qualitative and quantitative PCR assays were established employing these designed primers and probes. In qualitative PCR, the limit of detection (LOD) was about 0.01 ng of genomic DNA corresponding to 10 copies of haploid soybean genomic DNA. In the quantitative PCR assay, the LOD was as low as two haploid genome copies, and the limit of quantification was five haploid genome copies. Furthermore, the developed PCR methods were in-house validated by five researchers, and the validated results indicated that the developed event-specific PCR methods can be used for identification and quantification of MON89788 soybean and its derivates.
Real-time PCR detection and quantification of nine potential sources of fecal contamination by analysis of mitochondrial Cytochrome b targets

USGS Publications Warehouse

Schill, W.B.; Mathes, M.V.

2008-01-01

We designed and tested real-time PCR probe/primer sets to detect and quantify Cytochrome b sequences of mitochondrial DNA (mtDNA) from nine vertebrate species of pet (dog), farm (cow, chicken, sheep, horse, pig), wildlife (Canada goose, white-tailed deer), and human. Linear ranges of the assays were from 101 to 108 copies/??l. To formally test the performance of the assays, twenty blinded fecal suspension samples were analyzed by real-time PCR to identify the source of the feces. Sixteen of the twenty samples were correctly and unambiguously identified. Average sensitivity was calculated to be 0.850, while average specificity was found to be 0.994. One beef cow sample was not detected, but mtDNA from 11 other beef cattle of both sexes and varying physiological states was found in concentrations similar (3.45 ?? 107 copies/g) to thatfound in human feces (1.1 ?? 107 copies/g). Thus, environmental conditions and sample handling are probably important factors for successful detection of fecal mtDNA. When sewage samples were analyzed, only human mtDNA (7.2 ?? 104 copies/100 mL) was detected. With a detection threshold of 250 copies/reaction, an efficient concentration and purification method resulted in a final detection limit for human feces of 1.8 mg/100 mL water.
Short interspersed CAN SINE elements as prognostic markers in canine mammary neoplasia.

PubMed

Gelaleti, Gabriela B; Granzotto, Adriana; Leonel, Camila; Jardim, Bruna V; Moschetta, Marina G; Carareto, Claudia M A; Zuccari, Debora Ap P C

2014-01-01

The genome of mammals is characterized by a large number of non-LTR retrotransposons, and among them, the CAN SINEs are characteristics of the canine species. Small amounts of DNA freely circulate in normal blood serum and high amounts are found in human patients with cancer, characterizing it as a candidate tumor-biomarker. The aim of this study was to estimate, through its absolute expression, the number of copies of CAN SINE sequences present in free circulating DNA of female dogs with mammary cancer, in order to correlate with the clinical and pathological characteristics and the follow-up period. The copy number of CAN SINE sequences was estimated by qPCR in 28 female dogs with mammary neoplasia. The univariate analysis showed an increased number of copies in female dogs with mammary tumor in female dogs >10 years old (p=0.02) and tumor time >18 months (p<0.05). The Kaplan-Meier test demonstrated a negative correlation between an increased number of copies and survival time (p=0.03). High amounts of CAN SINE fragments can be good markers for the detection of tumor DNA in blood and may characterize it as a marker of poor prognosis, being related to female dogs with shorter survival times. This estimate can be used as a prognostic marker in non-invasive breast cancer research and is useful in predicting tumor progression and patient monitoring.
Evaluation of plasmid and genomic DNA calibrants used for the quantification of genetically modified organisms.

PubMed

Caprioara-Buda, M; Meyer, W; Jeynov, B; Corbisier, P; Trapmann, S; Emons, H

2012-07-01

The reliable quantification of genetically modified organisms (GMOs) by real-time PCR requires, besides thoroughly validated quantitative detection methods, sustainable calibration systems. The latter establishes the anchor points for the measured value and the measurement unit, respectively. In this paper, the suitability of two types of DNA calibrants, i.e. plasmid DNA and genomic DNA extracted from plant leaves, for the certification of the GMO content in reference materials as copy number ratio between two targeted DNA sequences was investigated. The PCR efficiencies and coefficients of determination of the calibration curves as well as the measured copy number ratios for three powder certified reference materials (CRMs), namely ERM-BF415e (NK603 maize), ERM-BF425c (356043 soya), and ERM-BF427c (98140 maize), originally certified for their mass fraction of GMO, were compared for both types of calibrants. In all three systems investigated, the PCR efficiencies of plasmid DNA were slightly closer to the PCR efficiencies observed for the genomic DNA extracted from seed powders rather than those of the genomic DNA extracted from leaves. Although the mean DNA copy number ratios for each CRM overlapped within their uncertainties, the DNA copy number ratios were significantly different using the two types of calibrants. Based on these observations, both plasmid and leaf genomic DNA calibrants would be technically suitable as anchor points for the calibration of the real-time PCR methods applied in this study. However, the most suitable approach to establish a sustainable traceability chain is to fix a reference system based on plasmid DNA.
ACVP-05: Virus Genetic Analysis from Cell-Free Plasma, Virally Infected Cells or Tissues and Cultured Supernatant Via Single Genome Amplification and Direct Sequencing | Frederick National Laboratory for Cancer Research

Cancer.gov

The Viral Evolution Core within the AIDS and Cancer Virus Program will extract viral RNA/DNA from cell-free or cell-associated samples. Complementary (cDNA) will be generated as needed, and cDNA or DNA will be diluted to a single copy prior to nested
A Comparative Study for Detection of EGFR Mutations in Plasma Cell-Free DNA in Korean Clinical Diagnostic Laboratories

PubMed Central

2018-01-01

Liquid biopsies to genotype the epidermal growth factor receptor (EGFR) for targeted therapy have been implemented in clinical decision-making in the field of lung cancer, but harmonization of detection methods is still scarce among clinical laboratories. We performed a pilot external quality assurance (EQA) scheme to harmonize circulating tumor DNA testing among laboratories. For EQA, we created materials containing different levels of spiked cell-free DNA (cfDNA) in normal plasma. The limit of detection (LOD) of the cobas® EGFR Mutation Test v2 (Roche Molecular Systems) was also evaluated. From November 2016 to June 2017, seven clinical diagnostic laboratories participated in the EQA program. The majority (98.94%) of results obtained using the cobas assay and next-generation sequencing (NGS) were acceptable. Quantitative results from the cobas assay were positively correlated with allele frequencies derived from digital droplet PCR measurements and showed good reproducibility among laboratories. The LOD of the cobas assay was 5~27 copies/mL for p.E746_A750del (exon 19 deletion), 35~70 copies/mL for p.L858R, 18~36 copies/mL for p.T790M, and 15~31 copies/mL for p.A767_V769dup (exon 20 insertion). Deep sequencing of materials (>100,000X depth of coverage) resulted in detection of low-level targets present at frequencies of 0.06~0.13%. Our results indicate that the cobas assay is a reliable and rapid method for detecting EGFR mutations in plasma cfDNA. Careful interpretation is particularly important for p.T790M detection in the setting of relapse. Individual laboratories should optimize NGS performance to maximize clinical utility.
DNA methylation polymorphism in a set of elite rice cultivars and its possible contribution to inter-cultivar differential gene expression.

PubMed

Wang, Yongming; Lin, Xiuyun; Dong, Bo; Wang, Yingdian; Liu, Bao

2004-01-01

RAPD (randomly amplified polymorphic DNA) and ISSR (inter-simple sequence repeat) fingerprinting on HpaII/MspI-digested genomic DNA of nine elite japonica rice cultivars implies inter-cultivar DNA methylation polymorphism. Using both DNA fragments isolated from RAPD or ISSR gels and selected low-copy sequences as probes, methylation-sensitive Southern blot analysis confirms the existence of extensive DNA methylation polymorphism in both genes and DNA repeats among the rice cultivars. The cultivar-specific methylation patterns are stably maintained, and can be used as reliable molecular markers. Transcriptional analysis of four selected sequences (RdRP, AC9, HSP90 and MMR) on leaves and roots from normal and 5-azacytidine-treated seedlings of three representative cultivars shows an association between the transcriptional activity of one of the genes, the mismatch repair (MMR) gene, and its CG methylation patterns.
Facile Recovery of Individual High-Molecular-Weight, Low-Copy-Number Natural Plasmids for Genomic Sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, L.E.; Detter, C,; Barrie, K.

2006-06-01

Sequencing of the large (>50 kb), low-copy-number (<5 per cell) plasmids that mediate horizontal gene transfer has been hindered by the difficulty and expense of isolating DNA from individual plasmids of this class. We report here that a kit method previously devised for purification of bacterial artificial chromosomes (BACs) can be adapted for effective preparation of individual plasmids up to 220 kb from wild gram-negative and gram-positive bacteria. Individual plasmid DNA recovered from less than 10 ml of Escherichia coli, Staphylococcus, and Corynebacterium cultures was of sufficient quantity and quality for construction of highcoverage libraries, as shown by sequencing fivemore » native plasmids ranging in size from 30 kb to 94 kb. We also report recommendations for vector screening to optimize plasmid sequence assembly, preliminary annotation of novel plasmid genomes, and insights on mobile genetic element biology derived from these sequences. Adaptation of this BAC method for large plasmid isolation removes one major technical hurdle to expanding our knowledge of the natural plasmid gene pool.« less
Organization and evolution of highly repeated satellite DNA sequences in plant chromosomes.

PubMed

Sharma, S; Raina, S N

2005-01-01

A major component of the plant nuclear genome is constituted by different classes of repetitive DNA sequences. The structural, functional and evolutionary aspects of the satellite repetitive DNA families, and their organization in the chromosomes is reviewed. The tandem satellite DNA sequences exhibit characteristic chromosomal locations, usually at subtelomeric and centromeric regions. The repetitive DNA family(ies) may be widely distributed in a taxonomic family or a genus, or may be specific for a species, genome or even a chromosome. They may acquire large-scale variations in their sequence and copy number over an evolutionary time-scale. These features have formed the basis of extensive utilization of repetitive sequences for taxonomic and phylogenetic studies. Hybrid polyploids have especially proven to be excellent models for studying the evolution of repetitive DNA sequences. Recent studies explicitly show that some repetitive DNA families localized at the telomeres and centromeres have acquired important structural and functional significance. The repetitive elements are under different evolutionary constraints as compared to the genes. Satellite DNA families are thought to arise de novo as a consequence of molecular mechanisms such as unequal crossing over, rolling circle amplification, replication slippage and mutation that constitute "molecular drive". Copyright 2005 S. Karger AG, Basel.
Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

PubMed

Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L

2013-01-30

Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

PubMed Central

2013-01-01

Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705
Intragenomic polymorphisms among high-copy loci: a genus-wide study of nuclear ribosomal DNA in Asclepias (Apocynaceae)

PubMed Central

Straub, Shannon C.K.; Fishbein, Mark; Liston, Aaron

2015-01-01

Despite knowledge that concerted evolution of high-copy loci is often imperfect, studies that investigate the extent of intragenomic polymorphisms and comparisons across a large number of species are rarely made. We present a bioinformatic pipeline for characterizing polymorphisms within an individual among copies of a high-copy locus. Results are presented for nuclear ribosomal DNA (nrDNA) across the milkweed genus, Asclepias. The 18S-26S portion of the nrDNA cistron of Asclepias syriaca served as a reference for assembly of the region from 124 samples representing 90 species of Asclepias. Reads were mapped back to each individual’s consensus and at each position reads differing from the consensus were tallied using a custom perl script. Low frequency polymorphisms existed in all individuals (mean = 5.8%). Most nrDNA positions (91%) were polymorphic in at least one individual, with polymorphic sites being less frequent in subunit regions and loops. Highly polymorphic sites existed in each individual, with highest abundance in the “noncoding” ITS regions. Phylogenetic signal was present in the distribution of intragenomic polymorphisms across the genus. Intragenomic polymorphisms in nrDNA are common in Asclepias, being found at higher frequency than any other study to date. The high and variable frequency of polymorphisms across species highlights concerns that phylogenetic applications of nrDNA may be error-prone. The new analytical approach provided here is applicable to other taxa and other high-copy regions characterized by low coverage genome sequencing (genome skimming). PMID:25653903

The complete chloroplast genome sequence of Dendrobium officinale.

PubMed

Yang, Pei; Zhou, Hong; Qian, Jun; Xu, Haibin; Shao, Qingsong; Li, Yonghua; Yao, Hui

2016-01-01

The complete chloroplast sequence of Dendrobium officinale, an endangered and economically important traditional Chinese medicine, was reported and characterized. The genome size is 152,018 bp, with 37.5% GC content. A pair of inverted repeats (IRs) of 26,284 bp are separated by a large single-copy region (LSC, 84,944 bp) and a small single-copy region (SSC, 14,506 bp). The complete cp DNA contains 83 protein-coding genes, 39 tRNA genes and 8 rRNA genes. Fourteen genes contained one or two introns.
Differentiation as symbiosis.

PubMed

Chigira, M; Watanabe, H

1994-07-01

Preservation of the identity of DNA is the ultimate goal of multicellular organisms. An abnormal DNA sequence in cells within an individual means its parasitic nature in cell society as shown in tumors. Somatic gene arrangement and gene mutation in development may be considered as de novo formation of parasites. It is likely that the developmental process with genetic alterations means symbiosis between altered cells and germ line cells preserving genetic information without alterations, when somatic alteration of DNA sequence is a major mechanism of differentiation. According to the selfish gene theory of Dawkins, germ line cells permit symbiosis when somatic cell society derives clear profit for the replication of original DNA copies.
A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

PubMed Central

Walker, M D; Park, C W; Rosen, A; Aronheim, A

1990-01-01

Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401
Abnormal plasma DNA profiles in early ovarian cancer using a non-invasive prenatal testing platform: implications for cancer screening.

PubMed

Cohen, Paul A; Flowers, Nicola; Tong, Stephen; Hannan, Natalie; Pertile, Mark D; Hui, Lisa

2016-08-24

Non-invasive prenatal testing (NIPT) identifies fetal aneuploidy by sequencing cell-free DNA in the maternal plasma. Pre-symptomatic maternal malignancies have been incidentally detected during NIPT based on abnormal genomic profiles. This low coverage sequencing approach could have potential for ovarian cancer screening in the non-pregnant population. Our objective was to investigate whether plasma DNA sequencing with a clinical whole genome NIPT platform can detect early- and late-stage high-grade serous ovarian carcinomas (HGSOC). This is a case control study of prospectively-collected biobank samples comprising preoperative plasma from 32 women with HGSOC (16 'early cancer' (FIGO I-II) and 16 'advanced cancer' (FIGO III-IV)) and 32 benign controls. Plasma DNA from cases and controls were sequenced using a commercial NIPT platform and chromosome dosage measured. Sequencing data were blindly analyzed with two methods: (1) Subchromosomal changes were called using an open source algorithm WISECONDOR (WIthin-SamplE COpy Number aberration DetectOR). Genomic gains or losses ≥ 15 Mb were prespecified as "screen positive" calls, and mapped to recurrent copy number variations reported in an ovarian cancer genome atlas. (2) Selected whole chromosome gains or losses were reported using the routine NIPT pipeline for fetal aneuploidy. We detected 13/32 cancer cases using the subchromosomal analysis (sensitivity 40.6 %, 95 % CI, 23.7-59.4 %), including 6/16 early and 7/16 advanced HGSOC cases. Two of 32 benign controls had subchromosomal gains ≥ 15 Mb (specificity 93.8 %, 95 % CI, 79.2-99.2 %). Twelve of the 13 true positive cancer cases exhibited specific recurrent changes reported in HGSOC tumors. The NIPT pipeline resulted in one "monosomy 18" call from the cancer group, and two "monosomy X" calls in the controls. Low coverage plasma DNA sequencing used for prenatal testing detected 40.6 % of all HGSOC, including 38 % of early stage cases. Our findings demonstrate the potential of a high throughput sequencing platform to screen for early HGSOC in plasma based on characteristic multiple segmental chromosome gains and losses. The performance of this approach may be further improved by refining bioinformatics algorithms and targeting selected cancer copy number variations.
Core histone genes of Giardia intestinalis: genomic organization, promoter structure, and expression

PubMed Central

Yee, Janet; Tang, Anita; Lau, Wei-Ling; Ritter, Heather; Delport, Dewald; Page, Melissa; Adam, Rodney D; Müller, Miklós; Wu, Gang

2007-01-01

Background Giardia intestinalis is a protist found in freshwaters worldwide, and is the most common cause of parasitic diarrhea in humans. The phylogenetic position of this parasite is still much debated. Histones are small, highly conserved proteins that associate tightly with DNA to form chromatin within the nucleus. There are two classes of core histone genes in higher eukaryotes: DNA replication-independent histones and DNA replication-dependent ones. Results We identified two copies each of the core histone H2a, H2b and H3 genes, and three copies of the H4 gene, at separate locations on chromosomes 3, 4 and 5 within the genome of Giardia intestinalis, but no gene encoding a H1 linker histone could be recognized. The copies of each gene share extensive DNA sequence identities throughout their coding and 5' noncoding regions, which suggests these copies have arisen from relatively recent gene duplications or gene conversions. The transcription start sites are at triplet A sequences 1–27 nucleotides upstream of the translation start codon for each gene. We determined that a 50 bp region upstream from the start of the histone H4 coding region is the minimal promoter, and a highly conserved 15 bp sequence called the histone motif (him) is essential for its activity. The Giardia core histone genes are constitutively expressed at approximately equivalent levels and their mRNAs are polyadenylated. Competition gel-shift experiments suggest that a factor within the protein complex that binds him may also be a part of the protein complexes that bind other promoter elements described previously in Giardia. Conclusion In contrast to other eukaryotes, the Giardia genome has only a single class of core histone genes that encode replication-independent histones. Our inability to locate a gene encoding the linker histone H1 leads us to speculate that the H1 protein may not be required for the compaction of Giardia's small and gene-rich genome. PMID:17425802
Computational and experimental analysis of DNA shuffling

PubMed Central

Maheshri, Narendra; Schaffer, David V.

2003-01-01

We describe a computational model of DNA shuffling based on the thermodynamics and kinetics of this process. The model independently tracks a representative ensemble of DNA molecules and records their states at every stage of a shuffling reaction. These data can subsequently be analyzed to yield information on any relevant metric, including reassembly efficiency, crossover number, type and distribution, and DNA sequence length distributions. The predictive ability of the model was validated by comparison to three independent sets of experimental data, and analysis of the simulation results led to several unique insights into the DNA shuffling process. We examine a tradeoff between crossover frequency and reassembly efficiency and illustrate the effects of experimental parameters on this relationship. Furthermore, we discuss conditions that promote the formation of useless “junk” DNA sequences or multimeric sequences containing multiple copies of the reassembled product. This model will therefore aid in the design of optimal shuffling reaction conditions. PMID:12626764
Synthetic Biology: Knowledge Accessed by Everyone (Open Sources)

ERIC Educational Resources Information Center

Sánchez Reyes, Patricia Margarita

2016-01-01

Using the principles of biology, along with engineering and with the help of computer, scientists manage to copy. DNA sequences from nature and use them to create new organisms. DNA is created through engineering and computer science managing to create life inside a laboratory. We cannot dismiss the role that synthetic biology could lead in…
The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.

PubMed

Raveendar, Sebastin; Na, Young-Wang; Lee, Jung-Ro; Shim, Donghwan; Ma, Kyung-Ho; Lee, Sok-Young; Chung, Jong-Wook

2015-07-20

Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of LSC; SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeat (SSR) and 48 insertions or deletions variants were found by sequence alignment of Capsicum cp genome. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons.

PubMed

Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M

2017-04-01

5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Copy number variants calling for single cell sequencing data by multi-constrained optimization.

PubMed

Xu, Bo; Cai, Hongmin; Zhang, Changsheng; Yang, Xi; Han, Guoqiang

2016-08-01

Variations in DNA copy number carry important information on genome evolution and regulation of DNA replication in cancer cells. The rapid development of single-cell sequencing technology allows one to explore gene expression heterogeneity among single-cells, thus providing important cancer cell evolution information. Single-cell DNA/RNA sequencing data usually have low genome coverage, which requires an extra step of amplification to accumulate enough samples. However, such amplification will introduce large bias and makes bioinformatics analysis challenging. Accurately modeling the distribution of sequencing data and effectively suppressing the bias influence is the key to success variations analysis. Recent advances demonstrate the technical noises by amplification are more likely to follow negative binomial distribution, a special case of Poisson distribution. Thus, we tackle the problem CNV detection by formulating it into a quadratic optimization problem involving two constraints, in which the underling signals are corrupted by Poisson distributed noises. By imposing the constraints of sparsity and smoothness, the reconstructed read depth signals from single-cell sequencing data are anticipated to fit the CNVs patterns more accurately. An efficient numerical solution based on the classical alternating direction minimization method (ADMM) is tailored to solve the proposed model. We demonstrate the advantages of the proposed method using both synthetic and empirical single-cell sequencing data. Our experimental results demonstrate that the proposed method achieves excellent performance and high promise of success with single-cell sequencing data. Crown Copyright © 2016. Published by Elsevier Ltd. All rights reserved.
Length Variation in Mitochondrial DNA of the Minnow Cyprinella Spiloptera

PubMed Central

Broughton, R. E.; Dowling, T. E.

1994-01-01

Length differences in animal mitochondrial DNA (mtDNA) are common, frequently due to variation in copy number of direct tandem duplications. While such duplications appear to form without great difficulty in some taxonomic groups, they appear to be relatively short-lived, as typical duplication products are geographically restricted within species and infrequently shared among species. To better understand such length variation, we have studied a tandem and direct duplication of approximately 260 bp in the control region of the cyprinid fish, Cyprinella spiloptera. Restriction site analysis of 38 individuals was used to characterize population structure and the distribution of variation in repeat copy number. This revealed two length variants, including individuals with two or three copies of the repeat, and little geographic structure among populations. No standard length (single copy) genomes were found and heteroplasmy, a common feature of length variation in other taxa, was absent. Nucleotide sequence of tandem duplications and flanking regions localized duplication junctions in the phenylalanine tRNA and near the origin of replication. The locations of these junctions and the stability of folded repeat copies support the hypothesized importance of secondary structures in models of duplication formation. PMID:8001785
HIP1 propagates in cyanobacterial DNA via nucleotide substitutions but promotes excision at similar frequencies in Escherichia coli and Synechococcus PCC 7942.

PubMed

Robinson, P J; Cranenburgh, R M; Head, I M; Robinson, N J

1997-04-01

The sequence 5'-GCGATCGC-3', designated HIP1, for highly iterated palindrome, was first identified at the borders of a gene-deletion event and subsequently shown to constitute up to 2.5% of the DNA in some cyanobacteria. It is now reported that HIP1 is polyphyletic, occurring in several distinct cyanobacterial lineages and not defining a clade. HIP1 does not introduce gaps into sequence alignments. It aligns with partial HIP1 sites in related sequences showing that it propagates by nucleotide substitutions rather than insertion. Constructs have been created to determine the frequencies at which deletion events occur between palindromes located within the selectable marker neo. Deletion between HIP1 sites was more frequent in Synechococcus PCC 7942 than deletion between control palindromes, 5'-CCGATCGG-3', designated PAL0. However, this is not due to a recombinase that recognises HIP1 and is peculiar to cyanobacteria because similar deletion frequencies were detected in Escherichia coli. Furthermore, the frequency of deletion of DNA flanked asymmetrically by one HIP1 site and one PAL0 site was less than the frequency of deletion of DNA flanked asymmetrically by identical copies of either palindrome. This is consistent with deletion by copy-choice.
Cross-subtype detection of HIV-1 using reverse transcription and recombinase polymerase amplification.

PubMed

Lillis, Lorraine; Lehman, Dara A; Siverson, Joshua B; Weis, Julie; Cantera, Jason; Parker, Mathew; Piepenburg, Olaf; Overbaugh, Julie; Boyle, David S

2016-04-01

A low complexity diagnostic test that rapidly and reliably detects HIV infection in infants at the point of care could facilitate early treatment, improving outcomes. However, many infant HIV diagnostics can only be performed in laboratory settings. Recombinase polymerase amplification (RPA) is an isothermal amplification technology that can rapidly amplify proviral DNA from multiple subtypes of HIV-1 in under twenty minutes without complex equipment. In this study we added reverse transcription (RT) to RPA to allow detection of both HIV-1 RNA and DNA. We show that this RT-RPA HIV-1 assay has a limit of detection of 10-30 copies of an exact sequence matched DNA or RNA, respectively. In addition, at 100 copies of RNA or DNA, the assay detected 171 of 175 (97.7%) sequence variants that represent all the major subtypes and recombinant forms of HIV-1 Groups M and O. This data suggests that the application of RT-RPA for the combined detection of HIV-1 viral RNA and proviral DNA may prove a highly sensitive tool for rapid and accurate diagnosis of infant HIV. Copyright © 2016 Elsevier B.V. All rights reserved.
Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

PubMed Central

Spielmann, A; Stutz, E

1983-01-01

The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2. PMID:6314279
Molecular authentication of Radix Puerariae Lobatae and Radix Puerariae Thomsonii by ITS and 5S rRNA spacer sequencing.

PubMed

Sun, Ye; Shaw, Pang-Chui; Fung, Kwok-Pui

2007-01-01

In the present study, we examined nuclear DNA sequences in an attempt to reveal the relationships between Pueraria lobata (Willd). Ohwi, P. thomsonii Benth., and P. montana (Lour.) Merr. We found that internal transcribed spacer (ITS) sequences of nuclear ribosomal DNA are highly divergent in P. lobata and P. thomsonii, and four types of ITS with different length are found in the two species. On the other hand, DNA sequences of 5S rRNA gene spacer are highly conserved across multiple copies in P. lobata and P. thomsonii, they could be used to identify P. lobata, P. thomsonii, and P. montana of this complex, and may serve as a useful tool in medical authentication of Radix Puerariae Lobatae and Radix Puerariae Thomsonii.
Retroviral DNA Integration Directed by HIV Integration Protein in Vitro

NASA Astrophysics Data System (ADS)

Bushman, Frederic D.; Fujiwara, Tamio; Craigie, Robert

1990-09-01

Efficient retroviral growth requires integration of a DNA copy of the viral RNA genome into a chromosome of the host. As a first step in analyzing the mechanism of integration of human immunodeficiency virus (HIV) DNA, a cell-free system was established that models the integration reaction. The in vitro system depends on the HIV integration (IN) protein, which was partially purified from insect cells engineered to express IN protein in large quantities. Integration was detected in a biological assay that scores the insertion of a linear DNA containing HIV terminal sequences into a λ DNA target. Some integration products generated in this assay contained five-base pair duplications of the target DNA at the recombination junctions, a characteristic of HIV integration in vivo; the remaining products contained aberrant junctional sequences that may have been produced in a variation of the normal reaction. These results indicate that HIV IN protein is the only viral protein required to insert model HIV DNA sequences into a target DNA in vitro.
New families of site-specific repetitive DNA sequences that comprise constitutive heterochromatin of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia).

PubMed

Yamada, Kazuhiko; Kamimura, Eikichi; Kondo, Mariko; Tsuchiya, Kimiyuki; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2006-02-01

We molecularly cloned new families of site-specific repetitive DNA sequences from BglII- and EcoRI-digested genomic DNA of the Syrian hamster (Mesocricetus auratus, Cricetrinae, Rodentia) and characterized them by chromosome in situ hybridization and filter hybridization. They were classified into six different types of repetitive DNA sequence families according to chromosomal distribution and genome organization. The hybridization patterns of the sequences were consistent with the distribution of C-positive bands and/or Hoechst-stained heterochromatin. The centromeric major satellite DNA and sex chromosome-specific and telomeric region-specific repetitive sequences were conserved in the same genus (Mesocricetus) but divergent in different genera. The chromosome-2-specific sequence was conserved in two genera, Mesocricetus and Cricetulus, and a low copy number of repetitive sequences on the heterochromatic chromosome arms were conserved in the subfamily Cricetinae but not in the subfamily Calomyscinae. By contrast, the other type of repetitive sequences on the heterochromatic chromosome arms, which had sequence similarities to a LINE sequence of rodents, was conserved through the three subfamilies, Cricetinae, Calomyscinae and Murinae. The nucleotide divergence of the repetitive sequences of heterochromatin was well correlated with the phylogenetic relationships of the Cricetinae species, and each sequence has been independently amplified and diverged in the same genome.
Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jackson, P.J.; Walthers, E.A.; Richmond, K.L.

1997-04-01

PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less
The complete chloroplast genome of Tianshan Snow Lotus (Saussurea involucrata), a famous traditional Chinese medicinal plant of the family Asteraceae.

PubMed

Xie, Qing; Shen, Kang-Ning; Hao, Xiuying; Nam, Phan Nhut; Ngoc Hieu, Bui Thi; Chen, Ching-Hung; Zhu, Changqing; Lin, Yen-Chang; Hsiao, Chung-Der

2017-03-01

abtract We decoded the complete chloroplast DNA (cpDNA) sequence of the Tianshan Snow Lotus (Saussurea involucrata), a famous traditional Chinese medicinal plant of the family Asteraceae, by using next-generation sequencing technology. The genome consists of 152 490 bp containing a pair of inverted repeats (IRs) of 25 202 bp, which was separated by a large single-copy region and a small single-copy region of 83 446 bp and 18 639 bp, respectively. The genic regions account for 57.7% of whole cpDNA, and the GC content of the cpDNA was 37.7%. The S. involucrata cpDNA encodes 114 unigenes (82 protein-coding genes, 4 rRNA genes, and 28 tRNA genes). There are eight protein-coding genes (atpF, ndhA, ndhB, rpl2, rpoC1, rps16, clpP, and ycf3) and five tRNA genes (trnA-UGC, trnI-GAU, trnK-UUU, trnL-UAA, and trnV-UAC) containing introns. A phylogenetic analysis of the 11 complete cpDNA from Asteracease showed that S. involucrata is closely related to Centaurea diffusa (Diffuse Knapweed). The complete cpDNA of S. involucrata provides essential and important DNA molecular data for further phylogenetic and evolutionary analysis for Asteraceae.
Correlation of 16S Ribosomal DNA Signature Sequences with Temperature-Dependent Growth Rates of Mesophilic and Psychrotolerant Strains of the Bacillus cereus Group

PubMed Central

Prüß, Birgit M.; Francis, Kevin P.; von Stetten, Felix; Scherer, Siegfried

1999-01-01

Sequences of the 16S ribosomal DNA (rDNA) from psychrotolerant and mesophilic strains of the Bacillus cereus group revealed signatures which were specific for these two thermal groups of bacteria. Further analysis of the genomic DNA from a wide range of food and soil isolates showed that B. cereus group strains have between 6 and 10 copies of 16S rDNA. Moreover, a number of these environmental strains have both rDNA operons with psychrotolerant signatures and rDNA operons with mesophilic signatures. The ability of these isolates to grow at low temperatures correlates with the prevalence of rDNA operons with psychrotolerant signatures, indicating specific nucleotides within the 16S rRNA to play a role in psychrotolerance. PMID:10198030

Telomeres and telomerase.

PubMed Central

Chan, Simon R W L; Blackburn, Elizabeth H

2004-01-01

Telomeres are the protective DNA-protein complexes found at the ends of eukaryotic chromosomes. Telomeric DNA consists of tandem repeats of a simple, often G-rich, sequence specified by the action of telomerase, and complete replication of telomeric DNA requires telomerase. Telomerase is a specialized cellular ribonucleoprotein reverse transcriptase. By copying a short template sequence within its intrinsic RNA moiety, telomerase synthesizes the telomeric DNA strand running 5' to 3' towards the distal end of the chromosome, thus extending it. Fusion of a telomere, either with another telomere or with a broken DNA end, generally constitutes a catastrophic event for genomic stability. Telomerase acts to prevent such fusions. The molecular consequences of telomere failure, and the molecular contributors to telomere function, with an emphasis on telomerase, are discussed here. PMID:15065663
CNV-seq, a new method to detect copy number variation using high-throughput sequencing.

PubMed

Xie, Chao; Tammi, Martti T

2009-03-06

DNA copy number variation (CNV) has been recognized as an important source of genetic variation. Array comparative genomic hybridization (aCGH) is commonly used for CNV detection, but the microarray platform has a number of inherent limitations. Here, we describe a method to detect copy number variation using shotgun sequencing, CNV-seq. The method is based on a robust statistical model that describes the complete analysis procedure and allows the computation of essential confidence values for detection of CNV. Our results show that the number of reads, not the length of the reads is the key factor determining the resolution of detection. This favors the next-generation sequencing methods that rapidly produce large amount of short reads. Simulation of various sequencing methods with coverage between 0.1x to 8x show overall specificity between 91.7 - 99.9%, and sensitivity between 72.2 - 96.5%. We also show the results for assessment of CNV between two individual human genomes.
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells.

PubMed

Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang

2018-01-01

Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. © 2018 Han et al.; Published by Cold Spring Harbor Laboratory Press.
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells

PubMed Central

Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang

2018-01-01

Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. PMID:29208629
Assessing the Fidelity of Ancient DNA Sequences Amplified From Nuclear Genes

PubMed Central

Binladen, Jonas; Wiuf, Carsten; Gilbert, M. Thomas P.; Bunce, Michael; Barnett, Ross; Larson, Greger; Greenwood, Alex D.; Haile, James; Ho, Simon Y. W.; Hansen, Anders J.; Willerslev, Eske

2006-01-01

To date, the field of ancient DNA has relied almost exclusively on mitochondrial DNA (mtDNA) sequences. However, a number of recent studies have reported the successful recovery of ancient nuclear DNA (nuDNA) sequences, thereby allowing the characterization of genetic loci directly involved in phenotypic traits of extinct taxa. It is well documented that postmortem damage in ancient mtDNA can lead to the generation of artifactual sequences. However, as yet no one has thoroughly investigated the damage spectrum in ancient nuDNA. By comparing clone sequences from 23 fossil specimens, recovered from environments ranging from permafrost to desert, we demonstrate the presence of miscoding lesion damage in both the mtDNA and nuDNA, resulting in insertion of erroneous bases during amplification. Interestingly, no significant differences in the frequency of miscoding lesion damage are recorded between mtDNA and nuDNA despite great differences in cellular copy numbers. For both mtDNA and nuDNA, we find significant positive correlations between total sequence heterogeneity and the rates of type 1 transitions (adenine → guanine and thymine → cytosine) and type 2 transitions (cytosine → thymine and guanine → adenine), respectively. Type 2 transitions are by far the most dominant and increase relative to those of type 1 with damage load. The results suggest that the deamination of cytosine (and 5-methyl cytosine) to uracil (and thymine) is the main cause of miscoding lesions in both ancient mtDNA and nuDNA sequences. We argue that the problems presented by postmortem damage, as well as problems with contamination from exogenous sources of conserved nuclear genes, allelic variation, and the reliance on single nucleotide polymorphisms, call for great caution in studies relying on ancient nuDNA sequences. PMID:16299392
Functional DNA quantification guides accurate next-generation sequencing mutation detection in formalin-fixed, paraffin-embedded tumor biopsies

PubMed Central

2013-01-01

The formalin-fixed, paraffin-embedded (FFPE) biopsy is a challenging sample for molecular assays such as targeted next-generation sequencing (NGS). We compared three methods for FFPE DNA quantification, including a novel PCR assay (‘QFI-PCR’) that measures the absolute copy number of amplifiable DNA, across 165 residual clinical specimens. The results reveal the limitations of commonly used approaches, and demonstrate the value of an integrated workflow using QFI-PCR to improve the accuracy of NGS mutation detection and guide changes in input that can rescue low quality FFPE DNA. These findings address a growing need for improved quality measures in NGS-based patient testing. PMID:24001039
Molecular and phylogenetic characterization of the homoeologous EPSP Synthase genes of allohexaploid wheat, Triticum aestivum (L.).

PubMed

Aramrak, Attawan; Kidwell, Kimberlee K; Steber, Camille M; Burke, Ian C

2015-10-23

5-Enolpyruvylshikimate-3-phosphate synthase (EPSPS) is the sixth and penultimate enzyme in the shikimate biosynthesis pathway, and is the target of the herbicide glyphosate. The EPSPS genes of allohexaploid wheat (Triticum aestivum, AABBDD) have not been well characterized. Herein, the three homoeologous copies of the allohexaploid wheat EPSPS gene were cloned and characterized. Genomic and coding DNA sequences of EPSPS from the three related genomes of allohexaploid wheat were isolated using PCR and inverse PCR approaches from soft white spring "Louise'. Development of genome-specific primers allowed the mapping and expression analysis of TaEPSPS-7A1, TaEPSPS-7D1, and TaEPSPS-4A1 on chromosomes 7A, 7D, and 4A, respectively. Sequence alignments of cDNA sequences from wheat and wheat relatives served as a basis for phylogenetic analysis. The three genomic copies of wheat EPSPS differed by insertion/deletion and single nucleotide polymorphisms (SNPs), largely in intron sequences. RT-PCR analysis and cDNA cloning revealed that EPSPS is expressed from all three genomic copies. However, TaEPSPS-4A1 is expressed at much lower levels than TaEPSPS-7A1 and TaEPSPS-7D1 in wheat seedlings. Phylogenetic analysis of 1190-bp cDNA clones from wheat and wheat relatives revealed that: 1) TaEPSPS-7A1 is most similar to EPSPS from the tetraploid AB genome donor, T. turgidum (99.7 % identity); 2) TaEPSPS-7D1 most resembles EPSPS from the diploid D genome donor, Aegilops tauschii (100 % identity); and 3) TaEPSPS-4A1 resembles EPSPS from the diploid B genome relative, Ae. speltoides (97.7 % identity). Thus, EPSPS sequences in allohexaploid wheat are preserved from the most two recent ancestors. The wheat EPSPS genes are more closely related to Lolium multiflorum and Brachypodium distachyon than to Oryza sativa (rice). The three related EPSPS homoeologues of wheat exhibited conservation of the exon/intron structure and of coding region sequence, but contained significant sequence variation within intron regions. The genome-specific primers developed will enable future characterization of natural and induced variation in EPSPS sequence and expression. This can be useful in investigating new causes of glyphosate herbicide resistance.
Whole Exome Sequencing of Pediatric Gastric Adenocarcinoma Reveals an Atypical Presentation of Li-Fraumeni Syndrome

PubMed Central

Chang, Vivian Y.; Federman, Noah; Martinez-Agosto, Julian; Tatishchev, Sergei F.; Nelson, Stanley F.

2014-01-01

Background Gastric adenocarcinoma is a rare diagnosis in childhood. A 14-year old male patient presented with metastatic gastric adenocarcinoma, and a strong family history of colon cancer. Clinical sequencing of CDH1 and APC were negative. Whole exome sequencing was therefore applied to capture the majority of protein-coding regions for the identification of single-nucleotide variants, small insertion/deletions, and copy number abnormalities in the patient’s germline as well as primary tumor. Materials and Methods DNA was extracted from the patient’s blood, primary tumor, and the unaffected mother’s blood. DNA libraries were constructed and sequenced on Illumina HiSeq2000. Data were post-processed using Picard and Samtools, then analyzed with the Genome Analysis Toolkit. Variants were annotated using an in-house Ensembl-based program. Copy number was assessed using ExomeCNV. Results Each sample was sequenced to a mean depth of coverage of greater than 120×. A rare non-synonymous coding SNV in TP53 was identified in the germline. There were 10 somatic cancer protein-damaging variants that were not observed in the unaffected mother genome. ExomeCNV comparing tumor to the patient’s germline, identified abnormal copy number, spanning 6,946 genes. Conclusion We present an unusual case of Li-Fraumeni detected by whole exome sequencing. There were also likely driver somatic mutations in the gastric adenocarcinoma. These results highlight the need for more thorough and broad scale germline and cancer analyses to accurately inform patients of inherited risk to cancer and to identify somatic mutations. PMID:23015295
The construction, identification and partial characterization of plasmids containing guinea-pig milk protein complementary DNA sequences.

PubMed Central

Craig, R K; Hall, L; Parker, D; Campbell, P N

1981-01-01

A complementary DNA (cDNA) plasmid library has been constructed in the plasmid pAT153, using poly(A)-containing RNA isolated from the lactating guinea-pig mammary gland as the starting material. Double stranded cDNA was inserted into the EcoRI site of the plasmid using poly(dA . dT) tails, then transformed into Escherichia coli HB101. From the resulting colonies we have selected and partially characterized plasmids containing cDNA copies of the mRNAs for casein A, casein B, casein C and alpha-lactalbumin. However, the proportion containing casein C cDNA was exceptionally low, and these contained at best 60% of the mRNA sequence. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. PMID:7306038
Subclinical Reactivation and Shed of Infectious Varicella Zoster Virus in Saliva of Astronauts

NASA Technical Reports Server (NTRS)

Cohrs, Randall J.; Mehta, Satish K.; Schmid, D. Scott; Gilden, Donald H.; Pierson, Duane L.

2007-01-01

We have previously detected VZV in healthy astronauts both during spaceflight and shortly after landing. Herein, we show that VZV shed in seropositive astronauts is infectious. A total of 40 saliva samples were obtained from each of the 3 astronauts. From each astronaut, 14 samples were taken 109 to 133 days before liftoff, 1 sample was taken every day during 12 days in space, and one sample was taken for 14 consecutive days beginning the second day after landing. Quantitative PCR was used to detect VZV DNA in saliva. None of 42 preflight saliva samples contained VZV DNA. VZV DNA was detected in saliva from 2 of 3 astronauts. In 1 astronaut, 6 of 12 samples obtained during space flight contained 120 to 2,500 copies of VZV DNA per ml; after landing, 1250 copies of VZV DNA were present on day 2, 45 copies on day 3, and 110 copies on day 5. All samples taken 6 to 15 days after touchdown were negative for VZV DNA. In the second astronaut, 5 of 12 samples obtained during space flight contained 18 to 650 copies of VZV DNA per ml; after landing, 560 copies of VZV DNA were present in saliva on day 2, 340 copies on day 4, 45 copies on day 5, and 23 copes on day 6. All samples taken 7 to 15 days after touchdown were negative for VZV DNA. Saliva taken 2 to 6 days after landing from all 3 astronauts was cultured on human fetal lung cells. After one subcultivation, a cytopathic effect developed in cultures inoculated with saliva from the two astronauts whose saliva contained VZV DNA. Both PCR and immunostaining identified the isolates to be VZV and not HSV-1. Importantly, the astronaut in whom no VZV was detected had a history of zoster 9 years earlier. It is possible that a boost in cell-mediated immunity to VZV which is known to develop after zoster protected him from subclinical reactivation. The genotype of the two VZV isolates was determined by VZV ORF22-based PCR/sequencing along with FRET-based PCR assays that target specific nucleotide polymorphisms. Both VZV isolates were found to be the European genotype which also contained a rare MspI restriction enodnuclease site in VZV ORF62 at position 107,252. These findings extend our previous demonstration of VZV DNA in saliva of astronauts by showing that infectious VZV is also present. Thus, like HSV-1 and HSV-2, VZV can reactivate and shed infectious virus in the absence of clinical disease.
Detection of Merkel Cell Polyomavirus DNA in Serum Samples of Healthy Blood Donors

PubMed Central

Mazzoni, Elisa; Rotondo, John C.; Marracino, Luisa; Selvatici, Rita; Bononi, Ilaria; Torreggiani, Elena; Touzé, Antoine; Martini, Fernanda; Tognon, Mauro G.

2017-01-01

Merkel cell polyomavirus (MCPyV) has been detected in 80% of Merkel cell carcinomas (MCC). In the host, the MCPyV reservoir remains elusive. MCPyV DNA sequences were revealed in blood donor buffy coats. In this study, MCPyV DNA sequences were investigated in the sera (n = 190) of healthy blood donors. Two MCPyV DNA sequences, coding for the viral oncoprotein large T antigen (LT), were investigated using polymerase chain reaction (PCR) methods and DNA sequencing. Circulating MCPyV sequences were detected in sera with a prevalence of 2.6% (5/190), at low-DNA viral load, which is in the range of 1–4 and 1–5 copies/μl by real-time PCR and droplet digital PCR, respectively. DNA sequencing carried out in the five MCPyV-positive samples indicated that the two MCPyV LT sequences which were analyzed belong to the MKL-1 strain. Circulating MCPyV LT sequences are present in blood donor sera. MCPyV-positive samples from blood donors could represent a potential vehicle for MCPyV infection in receivers, whereas an increase in viral load may occur with multiple blood transfusions. In certain patient conditions, such as immune-depression/suppression, additional disease or old age, transfusion of MCPyV-positive samples could be an additional risk factor for MCC onset. PMID:29238698
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.

PubMed

Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V

1985-09-01

The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this.
A genome-specific repetitive DNA sequence from Oryza eichingeri: characterization, localization, and introgression to O. sativa.

PubMed

Yan, H. H.; Liu, G. Q.; Cheng, Z. K.; Li, X. B.; Liu, G. Z.; Min, S. K.; Zhu, L.H.

2002-02-01

In the course of transferring the brown planthopper resistance from a diploid, CC-genome wild rice species, Oryza eichingeri (IRGC acc. 105159 and 105163), to the cultivated rice variety 02428, we have isolated many alien addition and introgression lines. The O. eichingeri chromatin in some of these lines has previously been identified using genomic in situ hybridization and molecular-marker analysis. Here we cloned a tandemly repetitive DNA sequence from O. eichingeri IRGC acc105163, and detected it in 25 introgression lines. This repetitive DNA sequence showed high specificity to the rice CC genome, but was absent from all the four tetraploid species with BBCC or CCDD genomes. The monomer in this repetitive DNA sequence is 325-366-bp long, with a copy number of about 5,000 per 1 C of the O. eichingerigenome, showing 88% homology to a repetitive DNA sequence isolated from Oryza officinalis(2n=2 x=24, CC). Fluorescent in situ hybridization revealed 11 signals distributed over eight O. eichingeri chromosomes, mostly in terminal or subterminal regions.
Bamgineer: Introduction of simulated allele-specific copy number variants into exome and targeted sequence data sets.

PubMed

Samadian, Soroush; Bruce, Jeff P; Pugh, Trevor J

2018-03-01

Somatic copy number variations (CNVs) play a crucial role in development of many human cancers. The broad availability of next-generation sequencing data has enabled the development of algorithms to computationally infer CNV profiles from a variety of data types including exome and targeted sequence data; currently the most prevalent types of cancer genomics data. However, systemic evaluation and comparison of these tools remains challenging due to a lack of ground truth reference sets. To address this need, we have developed Bamgineer, a tool written in Python to introduce user-defined haplotype-phased allele-specific copy number events into an existing Binary Alignment Mapping (BAM) file, with a focus on targeted and exome sequencing experiments. As input, this tool requires a read alignment file (BAM format), lists of non-overlapping genome coordinates for introduction of gains and losses (bed file), and an optional file defining known haplotypes (vcf format). To improve runtime performance, Bamgineer introduces the desired CNVs in parallel using queuing and parallel processing on a local machine or on a high-performance computing cluster. As proof-of-principle, we applied Bamgineer to a single high-coverage (mean: 220X) exome sequence file from a blood sample to simulate copy number profiles of 3 exemplar tumors from each of 10 tumor types at 5 tumor cellularity levels (20-100%, 150 BAM files in total). To demonstrate feasibility beyond exome data, we introduced read alignments to a targeted 5-gene cell-free DNA sequencing library to simulate EGFR amplifications at frequencies consistent with circulating tumor DNA (10, 1, 0.1 and 0.01%) while retaining the multimodal insert size distribution of the original data. We expect Bamgineer to be of use for development and systematic benchmarking of CNV calling algorithms by users using locally-generated data for a variety of applications. The source code is freely available at http://github.com/pughlab/bamgineer.
Quantitative Viral Community DNA Analysis Reveals the Dominance of Single-Stranded DNA Viruses in Offshore Upper Bathyal Sediment from Tohoku, Japan

PubMed Central

Yoshida, Mitsuhiro; Mochizuki, Tomohiro; Urayama, Syun-Ichi; Yoshida-Takashima, Yukari; Nishi, Shinro; Hirai, Miho; Nomaki, Hidetaka; Takaki, Yoshihiro; Nunoura, Takuro; Takai, Ken

2018-01-01

Previous studies on marine environmental virology have primarily focused on double-stranded DNA (dsDNA) viruses; however, it has recently been suggested that single-stranded DNA (ssDNA) viruses are more abundant in marine ecosystems. In this study, we performed a quantitative viral community DNA analysis to estimate the relative abundance and composition of both ssDNA and dsDNA viruses in offshore upper bathyal sediment from Tohoku, Japan (water depth = 500 m). The estimated dsDNA viral abundance ranged from 3 × 106 to 5 × 106 genome copies per cm3 sediment, showing values similar to the range of fluorescence-based direct virus counts. In contrast, the estimated ssDNA viral abundance ranged from 1 × 108 to 3 × 109 genome copies per cm3 sediment, thus providing an estimation that the ssDNA viral populations represent 96.3–99.8% of the benthic total DNA viral assemblages. In the ssDNA viral metagenome, most of the identified viral sequences were associated with ssDNA viral families such as Circoviridae and Microviridae. The principle components analysis of the ssDNA viral sequence components from the sedimentary ssDNA viral metagenomic libraries found that the different depth viral communities at the study site all exhibited similar profiles compared with deep-sea sediment ones at other reference sites. Our results suggested that deep-sea benthic ssDNA viruses have been significantly underestimated by conventional direct virus counts and that their contributions to deep-sea benthic microbial mortality and geochemical cycles should be further addressed by such a new quantitative approach. PMID:29467725
Integrated genomic classification of melanocytic tumors of the central nervous system using mutation analysis, copy number alterations and DNA methylation profiling.

PubMed

Griewank, Klaus; Koelsche, Christian; van de Nes, Johannes A P; Schrimpf, Daniel; Gessi, Marco; Möller, Inga; Sucker, Antje; Scolyer, Richard A; Buckland, Michael E; Murali, Rajmohan; Pietsch, Torsten; von Deimling, Andreas; Schadendorf, Dirk

2018-06-11

In the central nervous system, distinguishing primary leptomeningeal melanocytic tumors from melanoma metastases and predicting their biological behavior solely using histopathologic criteria can be challenging. We aimed to assess the diagnostic and prognostic value of integrated molecular analysis. Targeted next-generation-sequencing, array-based genome-wide methylation analysis and BAP1 immunohistochemistry was performed on the largest cohort of central nervous system melanocytic tumors analyzed to date, incl. 47 primary tumors of the central nervous system, 16 uveal melanomas. 13 cutaneous melanoma metastasis and 2 blue nevus-like melanomas. Gene mutation, DNA-methylation and copy-number profiles were correlated with clinicopathological features. Combining mutation, copy-number and DNA-methylation profiles clearly distinguished cutaneous melanoma metastases from other melanocytic tumors. Primary leptomeningeal melanocytic tumors, uveal melanomas and blue nevus-like melanoma showed common DNA-methylation, copy-number alteration and gene mutation signatures. Notably, tumors demonstrating chromosome 3 monosomy and BAP1 alterations formed a homogeneous subset within this group. Integrated molecular profiling aids in distinguishing primary from metastatic melanocytic tumors of the central nervous system. Primary leptomeningeal melanocytic tumors, uveal melanoma and blue nevus-like melanoma share molecular similarity with chromosome 3 and BAP1 alterations markers of poor prognosis. Copyright ©2018, American Association for Cancer Research.
Cells Comprising the Prostate Cancer Microenvironment Lack Recurrent Clonal Somatic Genomic Aberrations

PubMed Central

Bianchi-Frias, Daniella; Basom, Ryan; Delrow, Jeffrey J; Coleman, Ilsa M; Dakhova, Olga; Qu, Xiaoyu; Fang, Min; Franco, Omar E.; Ericson, Nolan G.; Bielas, Jason H.; Hayward, Simon W.; True, Lawrence; Morrissey, Colm; Brown, Lisha; Bhowmick, Neil A.; Rowley, David; Ittmann, Michael; Nelson, Peter S.

2017-01-01

Prostate cancer-associated stroma (CAS) plays an active role in malignant transformation, tumor progression, and metastasis. Molecular analyses of CAS have demonstrated significant changes in gene expression; however, conflicting evidence exists on whether genomic alterations in benign cells comprising the tumor microenvironment (TME) underlie gene expression changes and oncogenic phenotypes. This study evaluates the nuclear and mitochondrial DNA integrity of prostate carcinoma cells, CAS, matched benign epithelium and benign epithelium-associated stroma by whole genome copy number analyses, targeted sequencing of TP53, and fluorescence in situ hybridization. Comparative genomic hybridization (aCGH) of CAS revealed a copy-neutral diploid genome with only rare and small somatic copy number aberrations (SCNAs). In contrast, several expected recurrent SCNAs were evident in the adjacent prostate carcinoma cells, including gains at 3q, 7p, and 8q, and losses at 8p and 10q. No somatic TP53 mutations were observed in CAS. Mitochondrial DNA (mtDNA) extracted from carcinoma cells and stroma identified 23 somatic mtDNA mutations in neoplastic epithelial cells but only one mutation in stroma. Finally, genomic analyses identified no SCNAs, no loss of heterozygosity (LOH) or copy-neutral LOH in cultured cancer-associated fibroblasts (CAFs), which are known to promote prostate cancer progression in vivo. PMID:26753621
Analysis of Two Cosmid Clones from Chromosome 4 of Drosophila melanogaster Reveals Two New Genes Amid an Unusual Arrangement of Repeated Sequences

PubMed Central

Locke, John; Podemski, Lynn; Roy, Ken; Pilgrim, David; Hodgetts, Ross

1999-01-01

Chromosome 4 from Drosophila melanogaster has several unusual features that distinguish it from the other chromosomes. These include a diffuse appearance in salivary gland polytene chromosomes, an absence of recombination, and the variegated expression of P-element transgenes. As part of a larger project to understand these properties, we are assembling a physical map of this chromosome. Here we report the sequence of two cosmids representing ∼5% of the polytenized region. Both cosmid clones contain numerous repeated DNA sequences, as identified by cross hybridization with labeled genomic DNA, BLAST searches, and dot matrix analysis, which are positioned between and within the transcribed sequences. The repetitive sequences include three copies of the mobile element Hoppel, one copy of the mobile element HB, and 18 DINE repeats. DINE is a novel, short repeated sequence dispersed throughout both cosmid sequences. One cosmid includes the previously described cubitus interruptus (ci) gene and two new genes: that a gene with a predicted amino acid sequence similar to ribosomal protein S3a which is consistent with the Minute(4)101 locus thought to be in the region, and a novel member of the protein family that includes plexin and met–hepatocyte growth factor receptor. The other cosmid contains only the two short 5′-most exons from the zinc-finger-homolog-2 (zfh-2) gene. This is the first extensive sequence analysis of noncoding DNA from chromosome 4. The distribution of the various repeats suggests its organization is similar to the β-heterochromatic regions near the base of the major chromosome arms. Such a pattern may account for the diffuse banding of the polytene chromosome 4 and the variegation of many P-element transgenes on the chromosome. PMID:10022978
Identification and quantification of genetically modified Moonshade carnation lines using conventional and TaqMan real-time polymerase chain reaction methods.

PubMed

Li, Peng; Jia, Junwei; Bai, Lan; Pan, Aihu; Tang, Xueming

2013-07-01

Genetically modified carnation (Dianthus caryophyllus L.) Moonshade was approved for planting and commercialization in several countries from 2004. Developing methods for analyzing Moonshade is necessary for implementing genetically modified organism labeling regulations. In this study, the 5'-transgene integration sequence was isolated using thermal asymmetric interlaced (TAIL)-PCR. Based upon the 5'-transgene integration sequence, conventional and TaqMan real-time PCR assays were established. The relative limit of detection for the conventional PCR assay was 0.05 % for Moonshade using 100 ng total carnation genomic DNA, corresponding to approximately 79 copies of the carnation haploid genome, and the limits of detection and quantification of the TaqMan real-time PCR assay were estimated to be 51 and 254 copies of haploid carnation genomic DNA, respectively. These results are useful for identifying and quantifying Moonshade and its derivatives.
A Children's Oncology Group and TARGET Initiative Exploring the Genetic Landscape of Wilms Tumor

PubMed Central

Gadd, Samantha; Huff, Vicki; Walz, Amy L.; Ooms, Ariadne H.A.G.; Armstrong, Amy E.; Gerhard, Daniela S.; Smith, Malcolm A.; Guidry Auvil, Jaime M.; Meerzaman, Daoud; Chen, Qing-Rong; Hsu, Chih Hao; Yan, Chunhua; Nguyen, Cu; Hu, Ying; Hermida, Leandro C.; Davidsen, Tanja; Gesuwan, Patee; Ma, Yussanne; Zong, Zusheng; Mungall, Andrew J.; Moore, Richard A.; Marra, Marco A.; Dome, Jeffrey S.; Mullighan, Charles G.; Ma, Jing; Wheeler, David A.; Hampton, Oliver A.; Ross, Nicole; Gastier-Foster, Julie M.; Arold, Stefan T.; Perlman, Elizabeth J.

2017-01-01

Genome-wide sequencing, mRNA and miRNA expression, DNA copy number and methylation analyses were performed on 117 Wilms tumors, followed by targeted sequencing of 651 Wilms tumors. In addition to genes previously implicated in Wilms tumors (WT1, CTNNB1, FAM123B, DROSHA, DGCR8, XPO5, DICER1, SIX1, SIX2, MLLT1, MYCN, and TP53), mutations were identified in genes not previously recognized as recurrently involved in Wilms tumors, the most frequent being BCOR, BCORL1, NONO, MAX, COL6A3, ASXL1, MAP3K4, and ARID1A. DNA copy number changes resulted in recurrent 1q gain, MYCN amplification, LIN28B gain, and let-7a loss. Unexpected germline variants involved PALB2 and CHEK2. Integrated analyses support two major classes of genetic changes that preserve the progenitor state and/or interrupt normal development. PMID:28825729

Repetitive sequences in plant nuclear DNA: types, distribution, evolution and function.

PubMed

Mehrotra, Shweta; Goyal, Vinod

2014-08-01

Repetitive DNA sequences are a major component of eukaryotic genomes and may account for up to 90% of the genome size. They can be divided into minisatellite, microsatellite and satellite sequences. Satellite DNA sequences are considered to be a fast-evolving component of eukaryotic genomes, comprising tandemly-arrayed, highly-repetitive and highly-conserved monomer sequences. The monomer unit of satellite DNA is 150-400 base pairs (bp) in length. Repetitive sequences may be species- or genus-specific, and may be centromeric or subtelomeric in nature. They exhibit cohesive and concerted evolution caused by molecular drive, leading to high sequence homogeneity. Repetitive sequences accumulate variations in sequence and copy number during evolution, hence they are important tools for taxonomic and phylogenetic studies, and are known as "tuning knobs" in the evolution. Therefore, knowledge of repetitive sequences assists our understanding of the organization, evolution and behavior of eukaryotic genomes. Repetitive sequences have cytoplasmic, cellular and developmental effects and play a role in chromosomal recombination. In the post-genomics era, with the introduction of next-generation sequencing technology, it is possible to evaluate complex genomes for analyzing repetitive sequences and deciphering the yet unknown functional potential of repetitive sequences. Copyright © 2014 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
A magnetic bead-based method for concentrating DNA from human urine for downstream detection.

PubMed

Bordelon, Hali; Russ, Patricia K; Wright, David W; Haselton, Frederick R

2013-01-01

Due to the presence of PCR inhibitors, PCR cannot be used directly on most clinical samples, including human urine, without pre-treatment. A magnetic bead-based strategy is one potential method to collect biomarkers from urine samples and separate the biomarkers from PCR inhibitors. In this report, a 1 mL urine sample was mixed within the bulb of a transfer pipette containing lyophilized nucleic acid-silica adsorption buffer and silica-coated magnetic beads. After mixing, the sample was transferred from the pipette bulb to a small diameter tube, and captured biomarkers were concentrated using magnetic entrainment of beads through pre-arrayed wash solutions separated by small air gaps. Feasibility was tested using synthetic segments of the 140 bp tuberculosis IS6110 DNA sequence spiked into pooled human urine samples. DNA recovery was evaluated by qPCR. Despite the presence of spiked DNA, no DNA was detectable in unextracted urine samples, presumably due to the presence of PCR inhibitors. However, following extraction with the magnetic bead-based method, we found that ∼50% of spiked TB DNA was recovered from human urine containing roughly 5×10(3) to 5×10(8) copies of IS6110 DNA. In addition, the DNA was concentrated approximately ten-fold into water. The final concentration of DNA in the eluate was 5×10(6), 14×10(6), and 8×10(6) copies/µL for 1, 3, and 5 mL urine samples, respectively. Lyophilized and freshly prepared reagents within the transfer pipette produced similar results, suggesting that long-term storage without refrigeration is possible. DNA recovery increased with the length of the spiked DNA segments from 10±0.9% for a 75 bp DNA sequence to 42±4% for a 100 bp segment and 58±9% for a 140 bp segment. The estimated LOD was 77 copies of DNA/µL of urine. The strategy presented here provides a simple means to achieve high nucleic acid recovery from easily obtained urine samples, which does not contain inhibitors of PCR.
A Magnetic Bead-Based Method for Concentrating DNA from Human Urine for Downstream Detection

PubMed Central

Bordelon, Hali; Russ, Patricia K.; Wright, David W.; Haselton, Frederick R.

2013-01-01

Due to the presence of PCR inhibitors, PCR cannot be used directly on most clinical samples, including human urine, without pre-treatment. A magnetic bead-based strategy is one potential method to collect biomarkers from urine samples and separate the biomarkers from PCR inhibitors. In this report, a 1 mL urine sample was mixed within the bulb of a transfer pipette containing lyophilized nucleic acid-silica adsorption buffer and silica-coated magnetic beads. After mixing, the sample was transferred from the pipette bulb to a small diameter tube, and captured biomarkers were concentrated using magnetic entrainment of beads through pre-arrayed wash solutions separated by small air gaps. Feasibility was tested using synthetic segments of the 140 bp tuberculosis IS6110 DNA sequence spiked into pooled human urine samples. DNA recovery was evaluated by qPCR. Despite the presence of spiked DNA, no DNA was detectable in unextracted urine samples, presumably due to the presence of PCR inhibitors. However, following extraction with the magnetic bead-based method, we found that ∼50% of spiked TB DNA was recovered from human urine containing roughly 5×103 to 5×108 copies of IS6110 DNA. In addition, the DNA was concentrated approximately ten-fold into water. The final concentration of DNA in the eluate was 5×106, 14×106, and 8×106 copies/µL for 1, 3, and 5 mL urine samples, respectively. Lyophilized and freshly prepared reagents within the transfer pipette produced similar results, suggesting that long-term storage without refrigeration is possible. DNA recovery increased with the length of the spiked DNA segments from 10±0.9% for a 75 bp DNA sequence to 42±4% for a 100 bp segment and 58±9% for a 140 bp segment. The estimated LOD was 77 copies of DNA/µL of urine. The strategy presented here provides a simple means to achieve high nucleic acid recovery from easily obtained urine samples, which does not contain inhibitors of PCR. PMID:23861895
Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

PubMed

El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

2013-07-01

Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
Strain diversity and host specificity in bee gut symbionts revealed by deep sampling of single copy protein-coding sequences

PubMed Central

Powell, J. Elijah; Ratnayeke, Nalin; Moran, Nancy A.

2017-01-01

High throughput rRNA amplicon surveys of bacterial communities provide a rapid snapshot of taxonomic composition. But strains with nearly identical rRNA sequences often differ in gene repertoires and metabolic capabilities. To assess strain-level variation within Snodgrassella alvi, a gut symbiont of corbiculate bees, we performed deep sequencing on amplicons of a single copy coding gene (minD) as well as the 16S rDNA V4 region. We surveyed honey bees (Apis mellifera) sampled globally and 12 bumble bee species (Bombus) sampled from two regions of the USA. The minD analyses reveal that S. alvi contains far more strain diversity than is evident from 16S rDNA analysis. Many taxa inferred on the basis of 16S rDNA are shared between A. mellifera and Bombus species, but taxa inferred on the basis of minD are never shared and often are restricted to particular Bombus species. Clustering based on minD revealed that gut communities often reflect host species and geographic location. Both minD and 16S rDNA analyses indicate that strain diversity is higher in A. mellifera than in Bombus species. The minD locus flanks a 16S gene, enabling development of strain-specific 16S fluorescent probes to illuminate the spatial relationship of strains within the bee gut. PMID:27482856
Identification of Single-Copy Orthologous Genes between Physalis and Solanum lycopersicum and Analysis of Genetic Diversity in Physalis Using Molecular Markers

PubMed Central

Wei, Jingli; Hu, Xiaorong; Yang, Jingjing; Yang, Wencai

2012-01-01

The genus Physalis includes a number of commercially important edible and ornamental species. Its high nutritional value and potential medicinal properties leads to the increased commercial interest in the products of this genus worldwide. However, lack of molecular markers prevents the detailed study of genetics and phylogeny in Physalis, which limits the progress of breeding. In the present study, we compared the DNA sequences between Physalis and tomato, and attempted to analyze genetic diversity in Physalis using tomato markers. Blasting 23180 DNA sequences derived from Physalis against the International Tomato Annotation Group (ITAG) Release2.3 Predicted CDS (SL2.40) discovered 3356 single-copy orthologous genes between them. A total of 38 accessions from at least six species of Physalis were subjected to genetic diversity analysis using 97 tomato markers and 25 SSR markers derived from P. peruviana. Majority (73.2%) of tomato markers could amplify DNA fragments from at least one accession of Physalis. Diversity in Physalis at molecular level was also detected. The average Nei’s genetic distance between accessions was 0.3806 with a range of 0.2865 to 0.7091. These results indicated Physalis and tomato had similarity at both molecular marker and DNA sequence levels. Therefore, the molecular markers developed in tomato can be used in genetic study in Physalis. PMID:23166835
Identification of single-copy orthologous genes between Physalis and Solanum lycopersicum and analysis of genetic diversity in Physalis using molecular markers.

PubMed

Wei, Jingli; Hu, Xiaorong; Yang, Jingjing; Yang, Wencai

2012-01-01

The genus Physalis includes a number of commercially important edible and ornamental species. Its high nutritional value and potential medicinal properties leads to the increased commercial interest in the products of this genus worldwide. However, lack of molecular markers prevents the detailed study of genetics and phylogeny in Physalis, which limits the progress of breeding. In the present study, we compared the DNA sequences between Physalis and tomato, and attempted to analyze genetic diversity in Physalis using tomato markers. Blasting 23180 DNA sequences derived from Physalis against the International Tomato Annotation Group (ITAG) Release2.3 Predicted CDS (SL2.40) discovered 3356 single-copy orthologous genes between them. A total of 38 accessions from at least six species of Physalis were subjected to genetic diversity analysis using 97 tomato markers and 25 SSR markers derived from P. peruviana. Majority (73.2%) of tomato markers could amplify DNA fragments from at least one accession of Physalis. Diversity in Physalis at molecular level was also detected. The average Nei's genetic distance between accessions was 0.3806 with a range of 0.2865 to 0.7091. These results indicated Physalis and tomato had similarity at both molecular marker and DNA sequence levels. Therefore, the molecular markers developed in tomato can be used in genetic study in Physalis.
Sites of instability in the human TCF3 (E2A) gene adopt G-quadruplex DNA structures in vitro

PubMed Central

Williams, Jonathan D.; Fleetwood, Sara; Berroyer, Alexandra; Kim, Nayun; Larson, Erik D.

2015-01-01

The formation of highly stable four-stranded DNA, called G-quadruplex (G4), promotes site-specific genome instability. G4 DNA structures fold from repetitive guanine sequences, and increasing experimental evidence connects G4 sequence motifs with specific gene rearrangements. The human transcription factor 3 (TCF3) gene (also termed E2A) is subject to genetic instability associated with severe disease, most notably a common translocation event t(1;19) associated with acute lymphoblastic leukemia. The sites of instability in TCF3 are not randomly distributed, but focused to certain sequences. We asked if G4 DNA formation could explain why TCF3 is prone to recombination and mutagenesis. Here we demonstrate that sequences surrounding the major t(1;19) break site and a region associated with copy number variations both contain G4 sequence motifs. The motifs identified readily adopt G4 DNA structures that are stable enough to interfere with DNA synthesis in physiological salt conditions in vitro. When introduced into the yeast genome, TCF3 G4 motifs promoted gross chromosomal rearrangements in a transcription-dependent manner. Our results provide a molecular rationale for the site-specific instability of human TCF3, suggesting that G4 DNA structures contribute to oncogenic DNA breaks and recombination. PMID:26029241
Ectopic Integration of Transforming DNA Is Rare among Neurospora Transformants Selected for Gene Replacement

PubMed Central

Miao, VPW.; Rountree, M. R.; Selker, E. U.

1995-01-01

In a variety of organisms, DNA-mediated transformation experiments commonly produce transformants with multiple copies of the transforming DNA, including both selected and unselected molecules. Such ``cotransformants'' are much more common than expected from the individual transformation frequencies, suggesting that subpopulations of cells, or nuclei, are particularly competent for transformation. We found that Neurospora crassa transformants selected for gene replacement at the am gene had not efficiently incorporated additional DNA, suggesting that nuclei that undergo transformation by homologous recombination are not highly competent at integration of DNA by illegitimate recombination. Spheroplasts were treated with DNA fragments homologous to am and with an Escherichia coli hph plasmid. Transformants were initially selected for hph (hygromycin(R)), allowed to conidiate to generate homokaryons and then selected for either Am(-) (gene replacements) or hph. Surprisingly, most am replacement strains were hygromycin(S) (124/140) and carried no extraneous DNA (116/140). Most transformants selected for hph also had ectopic copies of am DNA and/or multiple copies of hph sequences (32/35), generally at multiple sites, confirming that efficient cotransformation could occur. To test the implication that cotransformation involving gene replacement and ectopic integration is rare, we compared the yields of am replacement strains with or without prior selection for hph. The initial selection did not appreciably help (or hinder) recovery of strains with replacements. PMID:7789758
Fluorescent in situ hybridisation to amphioxus chromosomes.

PubMed

Castro, Luis Filipe Costa; Holland, Peter William Harold

2002-12-01

We describe an efficient protocol for mapping genes and other DNA sequences to amphioxus chromosomes using fluorescent in situ hybridisation. We apply this method to identify the number and location of ribosomal DNA gene clusters and telomere sequences in metaphase spreads of Branchiostoma floridae. We also describe how the locations of two single copy genes can be mapped relative to each other, and demonstrate this by mapping an amphioxus Pax gene relative to a homologue of the Notch gene. These methods have great potential for performing comparative genomics between amphioxus and vertebrates.
BayMeth: improved DNA methylation quantification for affinity capture sequencing data using a flexible Bayesian approach

PubMed Central

2014-01-01

Affinity capture of DNA methylation combined with high-throughput sequencing strikes a good balance between the high cost of whole genome bisulfite sequencing and the low coverage of methylation arrays. We present BayMeth, an empirical Bayes approach that uses a fully methylated control sample to transform observed read counts into regional methylation levels. In our model, inefficient capture can readily be distinguished from low methylation levels. BayMeth improves on existing methods, allows explicit modeling of copy number variation, and offers computationally efficient analytical mean and variance estimators. BayMeth is available in the Repitools Bioconductor package. PMID:24517713
Detection of viral infection and gene expression in clinical tissue specimens using branched DNA (bDNA) in situ hybridization.

PubMed

Kenny, Daryn; Shen, Lu-Ping; Kolberg, Janice A

2002-09-01

In situ hybridization (ISH) methods for detection of nucleic acid sequences have proved especially powerful for revealing genetic markers and gene expression in a morphological context. Although target and signal amplification technologies have enabled researchers to detect relatively low-abundance molecules in cell extracts, the sensitive detection of nucleic acid sequences in tissue specimens has proved more challenging. We recently reported the development of a branched DNA (bDNA) ISH method for detection of DNA and mRNA in whole cells. Based on bDNA signal amplification technology, bDNA ISH is highly sensitive and can detect one or two copies of DNA per cell. In this study we evaluated bDNA ISH for detection of nucleic acid sequences in tissue specimens. Using normal and human papillomavirus (HPV)-infected cervical biopsy specimens, we explored the cell type-specific distribution of HPV DNA and mRNA by bDNA ISH. We found that bDNA ISH allowed rapid, sensitive detection of nucleic acids with high specificity while preserving tissue morphology. As an adjunct to conventional histopathology, bDNA ISH may improve diagnostic accuracy and prognosis for viral and neoplastic diseases.
Biosynthesis of Lipoic Acid in Arabidopsis: Cloning and Characterization of the cDNA for Lipoic Acid Synthase1

PubMed Central

Yasuno, Rie; Wada, Hajime

1998-01-01

Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
Recent advances in rice genome and chromosome structure research by fluorescence in situ hybridization (FISH).

PubMed

Ohmido, Nobuko; Fukui, Kiichi; Kinoshita, Toshiro

2010-01-01

Fluorescence in situ hybridization (FISH) is an effective method for the physical mapping of genes and repetitive DNA sequences on chromosomes. Physical mapping of unique nucleotide sequences on specific rice chromosome regions was performed using a combination of chromosome identification and highly sensitive FISH. Increases in the detection sensitivity of smaller DNA sequences and improvements in spatial resolution have ushered in a new phase in FISH technology. Thus, it is now possible to perform in situ hybridization on somatic chromosomes, pachytene chromosomes, and even on extended DNA fibers (EDFs). Pachytene-FISH allows the integration of genetic linkage maps and quantitative chromosome maps. Visualization methods using FISH can reveal the spatial organization of the centromere, heterochromatin/euchromatin, and the terminal structures of rice chromosomes. Furthermore, EDF-FISH and the DNA combing technique can resolve a spatial distance of 1 kb between adjacent DNA sequences, and the detection of even a 300-bp target is now feasible. The copy numbers of various repetitive sequences and the sizes of various DNA molecules were quantitatively measured using the molecular combing technique. This review describes the significance of these advances in molecular cytology in rice and discusses future applications in plant studies using visualization techniques.
Autonomous replication and addition of telomerelike sequences to DNA microinjected into Paramecium tetraurelia macronuclei.

PubMed Central

Gilley, D; Preer, J R; Aufderheide, K J; Polisky, B

1988-01-01

Paramecium tetraurelia can be transformed by microinjection of cloned serotype A gene sequences into the macronucleus. Transformants are detected by their ability to express serotype A surface antigen from the injected templates. After injection, the DNA is converted from a supercoiled form to a linear form by cleavage at nonrandom sites. The linear form appears to replicate autonomously as a unit-length molecule and is present in transformants at high copy number. The injected DNA is further processed by the addition of paramecium-type telomeric sequences to the termini of the linear DNA. To examine the fate of injected linear DNA molecules, plasmid pSA14SB DNA containing the A gene was cleaved into two linear pieces, a 14-kilobase (kb) piece containing the A gene and flanking sequences and a 2.2-kb piece consisting of the procaryotic vector. In transformants expressing the A gene, we observed that two linear DNA species were present which correspond to the two species injected. Both species had Paramecium telomerelike sequences added to their termini. For the 2.2-kb DNA, we show that the site of addition of the telomerelike sequences is directly at one terminus and within one nucleotide of the other terminus. These results indicate that injected procaryotic DNA is capable of autonomous replication in Paramecium macronuclei and that telomeric addition in the macronucleus does not require specific recognition sequences. Images PMID:3211128
Population Genetic Structure and Phylogeography of Camellia flavida (Theaceae) Based on Chloroplast and Nuclear DNA Sequences

PubMed Central

Wei, Su-Juan; Lu, Yong-Bin; Ye, Quan-Qing; Tang, Shao-Qing

2017-01-01

Camellia flavida is an endangered species of yellow camellia growing in limestone mountains in southwest China. The current classification of C. flavida into two varieties, var. flavida and var. patens, is controversial. We conducted a genetic analysis of C. flavida to determine its taxonomic structure. A total of 188 individual plants from 20 populations across the entire distribution range in southwest China were analyzed using two DNA fragments: a chloroplast DNA fragment from the small single copy region and a single-copy nuclear gene called phenylalanine ammonia-lyase (PAL). Sequences from both chloroplast and nuclear DNA were highly diverse; with high levels of genetic differentiation and restricted gene flow. This result can be attributed to the high habitat heterogeneity in limestone karst, which isolates C. flavida populations from each other. Our nuclear DNA results demonstrate that there are three differentiated groups within C. flavida: var. flavida 1, var. flavida 2, and var. patens. These genetic groupings are consistent with the morphological characteristics of the plants. We suggest that the samples included in this study constitute three taxa and the var. flavida 2 group is the genuine C. flavida. The three groups should be recognized as three management units for conservation concerns. PMID:28579991
Cattle phenotypes can disguise their maternal ancestry.

PubMed

Srirattana, Kanokwan; McCosker, Kieren; Schatz, Tim; St John, Justin C

2017-06-26

Cattle are bred for, amongst other factors, specific traits, including parasite resistance and adaptation to climate. However, the influence and inheritance of mitochondrial DNA (mtDNA) are not usually considered in breeding programmes. In this study, we analysed the mtDNA profiles of cattle from Victoria (VIC), southern Australia, which is a temperate climate, and the Northern Territory (NT), the northern part of Australia, which has a tropical climate, to determine if the mtDNA profiles of these cattle are indicative of breed and phenotype, and whether these profiles are appropriate for their environments. A phylogenetic tree of the full mtDNA sequences of different breeds of cattle, which were obtained from the NCBI database, showed that the mtDNA profiles of cattle do not always reflect their phenotype as some cattle with Bos taurus phenotypes had Bos indicus mtDNA, whilst some cattle with Bos indicus phenotypes had Bos taurus mtDNA. Using D-loop sequencing, we were able to contrast the phenotypes and mtDNA profiles from different species of cattle from the 2 distinct cattle breeding regions of Australia. We found that 67 of the 121 cattle with Bos indicus phenotypes from NT (55.4%) had Bos taurus mtDNA. In VIC, 92 of the 225 cattle with Bos taurus phenotypes (40.9%) possessed Bos indicus mtDNA. When focusing on oocytes from cattle with the Bos taurus phenotype in VIC, their respective oocytes with Bos indicus mtDNA had significantly lower levels of mtDNA copy number compared with oocytes possessing Bos taurus mtDNA (P < 0.01). However, embryos derived from oocytes with Bos indicus mtDNA had the same ability to develop to the blastocyst stage and the levels of mtDNA copy number in their blastocysts were similar to blastocysts derived from oocytes harbouring Bos taurus mtDNA. Nevertheless, oocytes originating from the Bos indicus phenotype exhibited lower developmental potential due to low mtDNA copy number when compared with oocytes from cattle with a Bos taurus phenotype. The phenotype of cattle is not always related to their mtDNA profiles. MtDNA profiles should be considered for breeding programmes as they also influence phenotypic traits and reproductive capacity in terms of oocyte quality.
Three closely related herpesviruses are associated with fibropapillomatosis in marine turtles

USGS Publications Warehouse

Quackenbush, S.L.; Work, Thierry M.; Balazs, George H.; Casey, Rufina N.; Rovnak, J.; Chaves, A.; duToit, L.; Baines, J.D.; Parrish, C.R.; Bowser, Paul R.; Casey, James W.

1998-01-01

Green turtle fibropapillomatosis is a neoplastic disease of increasingly significant threat to the survivability of this species. Degenerate PCR primers that target highly conserved regions of genes encoding herpesvirus DNA polymerases were used to amplify a DNA sequence from fibropapillomas and fibromas from Hawaiian and Florida green turtles. All of the tumors tested (n= 23) were found to harbor viral DNA, whereas no viral DNA was detected in skin biopsies from tumor-negative turtles. The tissue distribution of the green turtle herpesvirus appears to be generally limited to tumors where viral DNA was found to accumulate at approximately two to five copies per cell and is occasionally detected, only by PCR, in some tissues normally associated with tumor development. In addition, herpesviral DNA was detected in fibropapillomas from two loggerhead and four olive ridley turtles. Nucleotide sequencing of a 483-bp fragment of the turtle herpesvirus DNA polymerase gene determined that the Florida green turtle and loggerhead turtle sequences are identical and differ from the Hawaiian green turtle sequence by five nucleotide changes, which results in two amino acid substitutions. The olive ridley sequence differs from the Florida and Hawaiian green turtle sequences by 15 and 16 nucleotide changes, respectively, resulting in four amino acid substitutions, three of which are unique to the olive ridley sequence. Our data suggest that these closely related turtle herpesviruses are intimately involved in the genesis of fibropapillomatosis.
High resolution optical DNA mapping

NASA Astrophysics Data System (ADS)

Baday, Murat

Many types of diseases including cancer and autism are associated with copy-number variations in the genome. Most of these variations could not be identified with existing sequencing and optical DNA mapping methods. We have developed Multi-color Super-resolution technique, with potential for high throughput and low cost, which can allow us to recognize more of these variations. Our technique has made 10--fold improvement in the resolution of optical DNA mapping. Using a 180 kb BAC clone as a model system, we resolved dense patterns from 108 fluorescent labels of two different colors representing two different sequence-motifs. Overall, a detailed DNA map with 100 bp resolution was achieved, which has the potential to reveal detailed information about genetic variance and to facilitate medical diagnosis of genetic disease.
The impact of targeting repetitive BamHI-W sequences on the sensitivity and precision of EBV DNA quantification.

PubMed

Sanosyan, Armen; Fayd'herbe de Maudave, Alexis; Bollore, Karine; Zimmermann, Valérie; Foulongne, Vincent; Van de Perre, Philippe; Tuaillon, Edouard

2017-01-01

Viral load monitoring and early Epstein-Barr virus (EBV) DNA detection are essential in routine laboratory testing, especially in preemptive management of Post-transplant Lymphoproliferative Disorder. Targeting the repetitive BamHI-W sequence was shown to increase the sensitivity of EBV DNA quantification, but the variability of BamHI-W reiterations was suggested to be a source of quantification bias. We aimed to assess the extent of variability associated with BamHI-W PCR and its impact on the sensitivity of EBV DNA quantification using the 1st WHO international standard, EBV strains and clinical samples. Repetitive BamHI-W- and LMP2 single- sequences were amplified by in-house qPCRs and BXLF-1 sequence by a commercial assay (EBV R-gene™, BioMerieux). Linearity and limits of detection of in-house methods were assessed. The impact of repeated versus single target sequences on EBV DNA quantification precision was tested on B95.8 and Raji cell lines, possessing 11 and 7 copies of the BamHI-W sequence, respectively, and on clinical samples. BamHI-W qPCR demonstrated a lower limit of detection compared to LMP2 qPCR (2.33 log10 versus 3.08 log10 IU/mL; P = 0.0002). BamHI-W qPCR underestimated the EBV DNA load on Raji strain which contained fewer BamHI-W copies than the WHO standard derived from the B95.8 EBV strain (mean bias: - 0.21 log10; 95% CI, -0.54 to 0.12). Comparison of BamHI-W qPCR versus LMP2 and BXLF-1 qPCR showed an acceptable variability between EBV DNA levels in clinical samples with the mean bias being within 0.5 log10 IU/mL EBV DNA, whereas a better quantitative concordance was observed between LMP2 and BXLF-1 assays. Targeting BamHI-W resulted to a higher sensitivity compared to LMP2 but the variable reiterations of BamHI-W segment are associated with higher quantification variability. BamHI-W can be considered for clinical and therapeutic monitoring to detect an early EBV DNA and a dynamic change in viral load.

The impact of targeting repetitive BamHI-W sequences on the sensitivity and precision of EBV DNA quantification

PubMed Central

Fayd’herbe de Maudave, Alexis; Bollore, Karine; Zimmermann, Valérie; Foulongne, Vincent; Van de Perre, Philippe; Tuaillon, Edouard

2017-01-01

Background Viral load monitoring and early Epstein-Barr virus (EBV) DNA detection are essential in routine laboratory testing, especially in preemptive management of Post-transplant Lymphoproliferative Disorder. Targeting the repetitive BamHI-W sequence was shown to increase the sensitivity of EBV DNA quantification, but the variability of BamHI-W reiterations was suggested to be a source of quantification bias. We aimed to assess the extent of variability associated with BamHI-W PCR and its impact on the sensitivity of EBV DNA quantification using the 1st WHO international standard, EBV strains and clinical samples. Methods Repetitive BamHI-W- and LMP2 single- sequences were amplified by in-house qPCRs and BXLF-1 sequence by a commercial assay (EBV R-gene™, BioMerieux). Linearity and limits of detection of in-house methods were assessed. The impact of repeated versus single target sequences on EBV DNA quantification precision was tested on B95.8 and Raji cell lines, possessing 11 and 7 copies of the BamHI-W sequence, respectively, and on clinical samples. Results BamHI-W qPCR demonstrated a lower limit of detection compared to LMP2 qPCR (2.33 log10 versus 3.08 log10 IU/mL; P = 0.0002). BamHI-W qPCR underestimated the EBV DNA load on Raji strain which contained fewer BamHI-W copies than the WHO standard derived from the B95.8 EBV strain (mean bias: - 0.21 log10; 95% CI, -0.54 to 0.12). Comparison of BamHI-W qPCR versus LMP2 and BXLF-1 qPCR showed an acceptable variability between EBV DNA levels in clinical samples with the mean bias being within 0.5 log10 IU/mL EBV DNA, whereas a better quantitative concordance was observed between LMP2 and BXLF-1 assays. Conclusions Targeting BamHI-W resulted to a higher sensitivity compared to LMP2 but the variable reiterations of BamHI-W segment are associated with higher quantification variability. BamHI-W can be considered for clinical and therapeutic monitoring to detect an early EBV DNA and a dynamic change in viral load. PMID:28850597
Nuclear DNA analyses in genetic studies of populations: practice, problems and prospects.

PubMed

Zhang, De-Xing; Hewitt, Godfrey M

2003-03-01

Population-genetic studies have been remarkably productive and successful in the last decade following the invention of PCR technology and the introduction of mitochondrial and microsatellite DNA markers. While mitochondrial DNA has proven powerful for genealogical and evolutionary studies of animal populations, and microsatellite sequences are the most revealing DNA markers available so far for inferring population structure and dynamics, they both have important and unavoidable limitations. To obtain a fuller picture of the history and evolutionary potential of populations, genealogical data from nuclear loci are essential, and the inclusion of other nuclear markers, i.e. single copy nuclear polymorphic (scnp) sequences, is clearly needed. Four major uncertainties for nuclear DNA analyses of populations have been facing us, i.e. the availability of scnp markers for carrying out such analysis, technical laboratory hurdles for resolving haplotypes, difficulty in data analysis because of recombination, low divergence levels and intraspecific multifurcation evolution, and the utility of scnp markers for addressing population-genetic questions. In this review, we discuss the availability of highly polymorphic single copy DNA in the nuclear genome, describe patterns and rate of evolution of nuclear sequences, summarize past empirical and theoretical efforts to recover and analyse data from scnp markers, and examine the difficulties, challenges and opportunities faced in such studies. We show that although challenges still exist, the above-mentioned obstacles are now being removed. Recent advances in technology and increases in statistical power provide the prospect of nuclear DNA analyses becoming routine practice, allowing allele-discriminating characterization of scnp loci and microsatellite loci. This certainly will increase our ability to address more complex questions, and thereby the sophistication of genetic analyses of populations.
Site-Specific Integration of Foreign DNA into Minimal Bacterial and Human Target Sequences Mediated by a Conjugative Relaxase

PubMed Central

Agúndez, Leticia; González-Prieto, Coral; Machón, Cristina; Llosa, Matxalen

2012-01-01

Background Bacterial conjugation is a mechanism for horizontal DNA transfer between bacteria which requires cell to cell contact, usually mediated by self-transmissible plasmids. A protein known as relaxase is responsible for the processing of DNA during bacterial conjugation. TrwC, the relaxase of conjugative plasmid R388, is also able to catalyze site-specific integration of the transferred DNA into a copy of its target, the origin of transfer (oriT), present in a recipient plasmid. This reaction confers TrwC a high biotechnological potential as a tool for genomic engineering. Methodology/Principal Findings We have characterized this reaction by conjugal mobilization of a suicide plasmid to a recipient cell with an oriT-containing plasmid, selecting for the cointegrates. Proteins TrwA and IHF enhanced integration frequency. TrwC could also catalyze integration when it is expressed from the recipient cell. Both Y18 and Y26 catalytic tyrosil residues were essential to perform the reaction, while TrwC DNA helicase activity was dispensable. The target DNA could be reduced to 17 bp encompassing TrwC nicking and binding sites. Two human genomic sequences resembling the 17 bp segment were accepted as targets for TrwC-mediated site-specific integration. TrwC could also integrate the incoming DNA molecule into an oriT copy present in the recipient chromosome. Conclusions/Significance The results support a model for TrwC-mediated site-specific integration. This reaction may allow R388 to integrate into the genome of non-permissive hosts upon conjugative transfer. Also, the ability to act on target sequences present in the human genome underscores the biotechnological potential of conjugative relaxase TrwC as a site-specific integrase for genomic modification of human cells. PMID:22292089
Performance evaluation of DNA copy number segmentation methods.

PubMed

Pierre-Jean, Morgane; Rigaill, Guillem; Neuvial, Pierre

2015-07-01

A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation studies, or based on small real data analyses. To make an objective and reproducible performance assessment, we have designed and implemented a framework to generate realistic DNA copy number profiles of cancer samples with known truth. These profiles are generated by resampling publicly available SNP microarray data from genomic regions with known copy-number state. The original data have been extracted from dilutions series of tumor cell lines with matched blood samples at several concentrations. Therefore, the signal-to-noise ratio of the generated profiles can be controlled through the (known) percentage of tumor cells in the sample. This article describes this framework and its application to a comparison study between methods for segmenting DNA copy number profiles from SNP microarrays. This study indicates that no single method is uniformly better than all others. It also helps identifying pros and cons of the compared methods as a function of biologically informative parameters, such as the fraction of tumor cells in the sample and the proportion of heterozygous markers. This comparison study may be reproduced using the open source and cross-platform R package jointseg, which implements the proposed data generation and evaluation framework: http://r-forge.r-project.org/R/?group_id=1562. © The Author 2014. Published by Oxford University Press.
A calmodulin-like protein (LCALA) is a new Leishmania amazonensis candidate for telomere end-binding protein.

PubMed

Morea, Edna G O; Viviescas, Maria Alejandra; Fernandes, Carlos A H; Matioli, Fabio F; Lira, Cristina B B; Fernandez, Maribel F; Moraes, Barbara S; da Silva, Marcelo S; Storti, Camila B; Fontes, Marcos R M; Cano, Maria Isabel N

2017-11-01

Leishmania spp. telomeres are composed of 5'-TTAGGG-3' repeats associated with proteins. We have previously identified LaRbp38 and LaRPA-1 as proteins that bind the G-rich telomeric strand. At that time, we had also partially characterized a protein: DNA complex, named LaGT1, but we could not identify its protein component. Using protein-DNA interaction and competition assays, we confirmed that LaGT1 is highly specific to the G-rich telomeric single-stranded DNA. Three protein bands, with LaGT1 activity, were isolated from affinity-purified protein extracts in-gel digested, and sequenced de novo using mass spectrometry analysis. In silico analysis of the digested peptide identified them as a putative calmodulin with sequences identical to the T. cruzi calmodulin. In the Leishmania genome, the calmodulin ortholog is present in three identical copies. We cloned and sequenced one of the gene copies, named it LCalA, and obtained the recombinant protein. Multiple sequence alignment and molecular modeling showed that LCalA shares homology to most eukaryotes calmodulin. In addition, we demonstrated that LCalA is nuclear, partially co-localizes with telomeres and binds in vivo the G-rich telomeric strand. Recombinant LCalA can bind specifically and with relative affinity to the G-rich telomeric single-strand and to a 3'G-overhang, and DNA binding is calcium dependent. We have described a novel candidate component of Leishmania telomeres, LCalA, a nuclear calmodulin that binds the G-rich telomeric strand with high specificity and relative affinity, in a calcium-dependent manner. LCalA is the first reported calmodulin that binds in vivo telomeric DNA. Copyright © 2017 Elsevier B.V. All rights reserved.
Dynamics of drug resistance-associated mutations in HIV-1 DNA reverse transcriptase sequence during effective ART.

PubMed

Nouchi, A; Nguyen, T; Valantin, M A; Simon, A; Sayon, S; Agher, R; Calvez, V; Katlama, C; Marcelin, A G; Soulie, C

2018-05-29

To investigate the dynamics of HIV-1 variants archived in cells harbouring drug resistance-associated mutations (DRAMs) to lamivudine/emtricitabine, etravirine and rilpivirine in patients under effective ART free from selective pressure on these DRAMs, in order to assess the possibility of recycling molecules with resistance history. We studied 25 patients with at least one DRAM to lamivudine/emtricitabine, etravirine and/or rilpivirine identified on an RNA sequence in their history and with virological control for at least 5 years under a regimen excluding all drugs from the resistant class. Longitudinal ultra-deep sequencing (UDS) and Sanger sequencing of the reverse transcriptase region were performed on cell-associated HIV-1 DNA samples taken over the 5 years of follow-up. Viral variants harbouring the analysed DRAMs were no longer detected by UDS over the 5 years in 72% of patients, with viruses susceptible to the molecules of interest found after 5 years in 80% of patients with UDS and in 88% of patients with Sanger. Residual viraemia with <50 copies/mL was detected in 52% of patients. The median HIV DNA level remained stable (2.4 at baseline versus 2.1 log10 copies/106 cells 5 years later). These results show a clear trend towards clearance of archived DRAMs to reverse transcriptase inhibitors in cell-associated HIV-1 DNA after a long period of virological control, free from therapeutic selective pressure on these DRAMs, reflecting probable residual replication in some reservoirs of the fittest viruses and leading to persistent evolution of the archived HIV-1 DNA resistance profile.
An Alu-based, MGB Eclipse real-time PCR method for quantitation of human DNA in forensic samples.

PubMed

Nicklas, Janice A; Buel, Eric

2005-09-01

The forensic community needs quick, reliable methods to quantitate human DNA in crime scene samples to replace the laborious and imprecise slot blot method. A real-time PCR based method has the possibility of allowing development of a faster and more quantitative assay. Alu sequences are primate-specific and are found in many copies in the human genome, making these sequences an excellent target or marker for human DNA. This paper describes the development of a real-time Alu sequence-based assay using MGB Eclipse primers and probes. The advantages of this assay are simplicity, speed, less hands-on-time and automated quantitation, as well as a large dynamic range (128 ng/microL to 0.5 pg/microL).
The primary structure of L37--a rat ribosomal protein with a zinc finger-like motif.

PubMed

Chan, Y L; Paz, V; Olvera, J; Wool, I G

1993-04-30

The amino acid sequence of the rat 60S ribosomal subunit protein L37 was deduced from the sequence of nucleotides in a recombinant cDNA. Ribosomal protein L37 has 96 amino acids, the NH2-terminal methionine is removed after translation of the mRNA, and has a molecular weight of 10,939. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type. Hybridization of the cDNA to digests of nuclear DNA suggests that there are 13 or 14 copies of the L37 gene. The mRNA for the protein is about 500 nucleotides in length. Rat L37 is related to Saccharomyces cerevisiae ribosomal protein YL35 and to Caenorhabditis elegans L37. We have identified in the data base a DNA sequence that encodes the chicken homolog of rat L37.
PuLSE: Quality control and quantification of peptide sequences explored by phage display libraries.

PubMed

Shave, Steven; Mann, Stefan; Koszela, Joanna; Kerr, Alastair; Auer, Manfred

2018-01-01

The design of highly diverse phage display libraries is based on assumption that DNA bases are incorporated at similar rates within the randomized sequence. As library complexity increases and expected copy numbers of unique sequences decrease, the exploration of library space becomes sparser and the presence of truly random sequences becomes critical. We present the program PuLSE (Phage Library Sequence Evaluation) as a tool for assessing randomness and therefore diversity of phage display libraries. PuLSE runs on a collection of sequence reads in the fastq file format and generates tables profiling the library in terms of unique DNA sequence counts and positions, translated peptide sequences, and normalized 'expected' occurrences from base to residue codon frequencies. The output allows at-a-glance quantitative quality control of a phage library in terms of sequence coverage both at the DNA base and translated protein residue level, which has been missing from toolsets and literature. The open source program PuLSE is available in two formats, a C++ source code package for compilation and integration into existing bioinformatics pipelines and precompiled binaries for ease of use.
Repetitive DNA loci and their modulation by the non-canonical nucleic acid structures R-loops and G-quadruplexes

PubMed Central

Hall, Amanda C.; Ostrowski, Lauren A.; Mekhail, Karim

2017-01-01

ABSTRACT Cells have evolved intricate mechanisms to maintain genome stability despite allowing mutational changes to drive evolutionary adaptation. Repetitive DNA sequences, which represent the bulk of most genomes, are a major threat to genome stability often driving chromosome rearrangements and disease. The major source of repetitive DNA sequences and thus the most vulnerable constituents of the genome are the rDNA (rDNA) repeats, telomeres, and transposable elements. Maintaining the stability of these loci is critical to overall cellular fitness and lifespan. Therefore, cells have evolved mechanisms to regulate rDNA copy number, telomere length and transposon activity, as well as DNA repair at these loci. In addition, non-canonical structure-forming DNA motifs can also modulate the function of these repetitive DNA loci by impacting their transcription, replication, and stability. Here, we discuss key mechanisms that maintain rDNA repeats, telomeres, and transposons in yeast and human before highlighting emerging roles for non-canonical DNA structures at these repetitive loci. PMID:28406751
[A family of short retroposons (Squaml) from squamate reptiles (Reptilia: Squamata): structure, evolution and correlation with phylogeny].

PubMed

Kosushkin, S A; Borodulina, O R; Solov'eva, E N; Grechko, V V

2008-01-01

We have isolated and characterised sequences of a SINE family specific for squamate reptiles from a genome of lacertid lizard that we called Squam1. Copies are 360-390 bp in length and share a significant similarity with tRNA gene sequence on its 5'-end. This family was also detected by us in DNA of representatives of varanids, iguanids (anolis), gekkonids, and snakes. No signs of it were found in DNA of mammals, birds, amphibians, and crocodiles. Detailed analysis of primary structure of the retroposons obtained by us from genomic libraries or GenBank sequences was carried out. Most taxa possess 2-3 subfamilies of the SINE in their genomes with specific diagnostic features in their primary structure. Individual variability of copies in different families is about 85% and is just slightly lower on the genera level. Comparison of consensus sequences on family level reveals a high degree of structural similarity with a number of specific apomorphic features which makes it a useful marker of phylogeny for this group of reptiles. Snakes do not show specific affinity to varanids when compared to other lizards, as it was suggested earlier.
The molecular genetic makeup of acute lymphoblastic leukemia | Office of Cancer Genomics

Cancer.gov

Abstract: Genomic profiling has transformed our understanding of the genetic basis of acute lymphoblastic leukemia (ALL). Recent years have seen a shift from microarray analysis and candidate gene sequencing to next-generation sequencing. Together, these approaches have shown that many ALL subtypes are characterized by constellations of structural rearrangements, submicroscopic DNA copy number alterations, and sequence mutations, several of which have clear implications for risk stratification and targeted therapeutic intervention.
Evaluation of droplet digital PCR for characterizing plasmid reference material used for quantifying ammonia oxidizers and denitrifiers.

PubMed

Dong, Lianhua; Meng, Ying; Wang, Jing; Liu, Yingying

2014-02-01

DNA reference materials of certified value have a critical function in many analytical processes of DNA measurement. Quantification of amoA genes in ammonia oxidizing bacteria (AOB) and archaea (AOA), and of nirS and nosZ genes in the denitrifiers is very important for determining their distribution and abundance in the natural environment. A plasmid reference material containing nirS, nosZ, amoA-AOB, and amoA-AOA is developed to provide a DNA standard with copy number concentration for ensuring comparability and reliability of quantification of these genes. Droplet digital PCR (ddPCR) was evaluated for characterization of the plasmid reference material. The result revealed that restriction endonuclease digestion of plasmids can improve amplification efficiency and minimize the measurement bias of ddPCR. Compared with the conformation of the plasmid, the size of the DNA fragment containing the target sequence and the location of the restriction site relative to the target sequence are not significant factors affecting plasmid quantification by ddPCR. Liquid chromatography-isotope dilution mass spectrometry (LC-IDMS) was used to provide independent data for quantifying the plasmid reference material. The copy number concentration of the digested plasmid determined by ddPCR agreed well with that determined by LC-IDMS, improving both the accuracy and reliability of the plasmid reference material. The reference value, with its expanded uncertainty (k = 2), of the plasmid reference material was determined to be (5.19 ± 0.41) × 10(9) copies μL(-1) by averaging the results of two independent measurements. Consideration of the factors revealed in this study can improve the reliability and accuracy of ddPCR; thus, this method has the potential to accurately quantify DNA reference materials.
Evaluation of two molecular techniques for rapid detection of the main dermatophytic agents of tinea capitis.

PubMed

Deng, S; Zhou, Z; de Hoog, G S; Wang, X; Abliz, P; Sun, J; Najafzadeh, M J; Pan, W; Lei, W; Zhu, S; Hasimu, H; Zhang, P; Guo, Y; Deng, D; Liao, W

2015-12-01

Tinea capitis is very common in Western China, with the most widespread aetiological agent being Trichophyton violaceum, while Microsporum canis is prevalent in the remainder of China. Conventional diagnostics and internal transcribed spacer (ITS) sequencing analyses have proven relatively limited due to the close phylogenetic relationship of anthropophilic dermatophytes. Therefore, alternative molecular tools with sufficient specificity, reproducibility and sensitivity are necessary. To evaluate two molecular techniques [multiplex ligation-dependent probe amplification (MLPA) and rolling circle amplification (RCA)] for rapid detection of the aetiological agents of tinea capitis, T. violaceum and M. canis. Probes of RCA and MLPA were designed with target sequences in the rDNA ITS gene region. Strains tested consist of 31 T. violaceum, 22 M. canis and 24 reference strains of species that are taxonomically close to the target species. The specificity and reproducibility of RCA and MLPA in detection of T. violaceum and M. canis were both 100% in both species. Sensitivity testing showed that RCA was positive at concentrations down to 1·68 × 10(6) copies of DNA in the TvioRCA probe, and 2·7 × 10(8) copies of DNA in McRCA. MLPA yielded positive results at concentrations of DNA down to 1·68 × 10(1) copies in the TvioMLPA probe and 2·7 × 10(2) in McMLPA. The two techniques were sufficiently specific and sensitive for discriminating the target DNA of T. violaceum and M. canis from that of closely related dermatophytes. RCA and MLPA are advantageous in their reliability and ease of operation compared with standard polymerase chain reaction and conventional methods. © 2015 British Association of Dermatologists.
Transposon-like properties of the major, long repetitive sequence family in the genome of Physarum polycephalum

PubMed Central

Pearston, Douglas H.; Gordon, Mairi; Hardman, Norman

1985-01-01

A family of long, highly-repetitive sequences, referred to previously as `HpaII-repeats', dominates the genome of the eukaryotic slime mould Physarum polycephalum. These sequences are found exclusively in scrambled clusters. They account for about one-half of the total complement of repetitive DNA in Physarum, and represent the major sequence component found in hypermethylated, 20-50 kb segments of Physarum genomic DNA that fail to be cleaved using the restriction endonuclease HpaII. The structure of this abundant repetitive element was investigated by analysing cloned segments derived from the hypermethylated genomic DNA compartment. We show that the `HpaII-repeat' forms part of a larger repetitive DNA structure, ∼8.6 kb in length, with several structural features in common with recognised eukaryotic transposable genetic elements. Scrambled clusters of the sequence probably arise as a result of transposition-like events, during which the element preferentially recombines in either orientation with target sites located in other copies of the same repeated sequence. The target sites for transposition/recombination are not related in sequence but in all cases studied they are potentially capable of promoting the formation of small `cruciforms' or `Z-DNA' structures which might be recognised during the recombination process. ImagesFig. 3.Fig. 4. PMID:16453652
Diversity in copy number and structure of a silkworm morphogenetic gene as a result of domestication.

PubMed

Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

2011-03-01

The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. © 2011 by the Genetics Society of America
Diversity in Copy Number and Structure of a Silkworm Morphogenetic Gene as a Result of Domestication

PubMed Central

Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

2011-01-01

The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. PMID:21242537
Haplotype Detection from Next-Generation Sequencing in High-Ploidy-Level Species: 45S rDNA Gene Copies in the Hexaploid Spartina maritima

PubMed Central

Boutte, Julien; Aliaga, Benoît; Lima, Oscar; Ferreira de Carvalho, Julie; Ainouche, Abdelkader; Macas, Jiri; Rousseau-Gueutin, Mathieu; Coriton, Olivier; Ainouche, Malika; Salmon, Armel

2015-01-01

Gene and whole-genome duplications are widespread in plant nuclear genomes, resulting in sequence heterogeneity. Identification of duplicated genes may be particularly challenging in highly redundant genomes, especially when there are no diploid parents as a reference. Here, we developed a pipeline to detect the different copies in the ribosomal RNA gene family in the hexaploid grass Spartina maritima from next-generation sequencing (Roche-454) reads. The heterogeneity of the different domains of the highly repeated 45S unit was explored by identifying single nucleotide polymorphisms (SNPs) and assembling reads based on shared polymorphisms. SNPs were validated using comparisons with Illumina sequence data sets and by cloning and Sanger (re)sequencing. Using this approach, 29 validated polymorphisms and 11 validated haplotypes were reported (out of 34 and 20, respectively, that were initially predicted by our program). The rDNA domains of S. maritima have similar lengths as those found in other Poaceae, apart from the 5′-ETS, which is approximately two-times longer in S. maritima. Sequence homogeneity was encountered in coding regions and both internal transcribed spacers (ITS), whereas high intragenomic variability was detected in the intergenic spacer (IGS) and the external transcribed spacer (ETS). Molecular cytogenetic analysis by fluorescent in situ hybridization (FISH) revealed the presence of one pair of 45S rDNA signals on the chromosomes of S. maritima instead of three expected pairs for a hexaploid genome, indicating loss of duplicated homeologous loci through the diploidization process. The procedure developed here may be used at any ploidy level and using different sequencing technologies. PMID:26530424
Genome Fragmentation Is Not Confined to the Peridinin Plastid in Dinoflagellates

PubMed Central

Espelund, Mari; Minge, Marianne A.; Gabrielsen, Tove M.; Nederbragt, Alexander J.; Shalchian-Tabrizi, Kamran; Otis, Christian; Turmel, Monique; Lemieux, Claude; Jakobsen, Kjetill S.

2012-01-01

When plastids are transferred between eukaryote lineages through series of endosymbiosis, their environment changes dramatically. Comparison of dinoflagellate plastids that originated from different algal groups has revealed convergent evolution, suggesting that the host environment mainly influences the evolution of the newly acquired organelle. Recently the genome from the anomalously pigmented dinoflagellate Karlodinium veneficum plastid was uncovered as a conventional chromosome. To determine if this haptophyte-derived plastid contains additional chromosomal fragments that resemble the mini-circles of the peridin-containing plastids, we have investigated its genome by in-depth sequencing using 454 pyrosequencing technology, PCR and clone library analysis. Sequence analyses show several genes with significantly higher copy numbers than present in the chromosome. These genes are most likely extrachromosomal fragments, and the ones with highest copy numbers include genes encoding the chaperone DnaK(Hsp70), the rubisco large subunit (rbcL), and two tRNAs (trnE and trnM). In addition, some photosystem genes such as psaB, psaA, psbB and psbD are overrepresented. Most of the dnaK and rbcL sequences are found as shortened or fragmented gene sequences, typically missing the 3′-terminal portion. Both dnaK and rbcL are associated with a common sequence element consisting of about 120 bp of highly conserved AT-rich sequence followed by a trnE gene, possibly serving as a control region. Decatenation assays and Southern blot analysis indicate that the extrachromosomal plastid sequences do not have the same organization or lengths as the minicircles of the peridinin dinoflagellates. The fragmentation of the haptophyte-derived plastid genome K. veneficum suggests that it is likely a sign of a host-driven process shaping the plastid genomes of dinoflagellates. PMID:22719952
Infectious mutants of cassava latent virus generated in vivo from intact recombinant DNA clones containing single copies of the genome.

PubMed Central

Stanley, J; Townsend, R

1986-01-01

Intact recombinant DNAs containing single copies of either component of the cassava latent virus genome can elicit infection when mechanically inoculated to host plants in the presence of the appropriate second component. Characterisation of infectious mutant progeny viruses, by analysis of virus-specific supercoiled DNA intermediates, indicates that most if not all of the cloning vector has been deleted, achieved at least in some cases by intermolecular recombination in vivo between DNAs 1 and 2. Significant rearrangements within the intergenic region of DNA 2, predominantly external to the common region, can be tolerated without loss of infectivity suggesting a somewhat passive role in virus multiplication for the sequences in question. Although packaging constraints might impose limits on the amount of DNA within geminate particles, isolation of an infectious coat protein mutant defective in virion production suggests that packaging is not essential for systemic spread of the viral DNA. Images PMID:2875435

Emerging critical roles of Fe-S clusters in DNA replication and repair

PubMed Central

Fuss, Jill O.; Tsai, Chi-Lin; Ishida, Justin P.; Tainer, John A.

2015-01-01

Fe-S clusters are partners in the origin of life that predate cells, acetyl-CoA metabolism, DNA, and the RNA world. The double helix solved the mystery of DNA replication by base pairing for accurate copying. Yet, for genome stability necessary to life, the double helix has equally important implications for damage repair. Here we examine striking advances that uncover Fe-S cluster roles both in copying the genetic sequence by DNA polymerases and in crucial repair processes for genome maintenance, as mutational defects cause cancer and degenerative disease. Moreover, we examine an exciting, controversial role for Fe-S clusters in a third element required for life – the long-range coordination and regulation of replication and repair events. By their ability to delocalize electrons over both Fe and S centers, Fe-S clusters have unbeatable features for protein conformational control and charge transfer via double-stranded DNA that may fundamentally transform our understanding of life, replication, and repair. PMID:25655665
Characterization of proviruses cloned from mink cell focus-forming virus-infected cellular DNA.

PubMed Central

Khan, A S; Repaske, R; Garon, C F; Chan, H W; Rowe, W P; Martin, M A

1982-01-01

Two proviruses were cloned from EcoRI-digested DNA extracted from mink cells chronically infected with AKR mink cell focus-forming (MCF) 247 murine leukemia virus (MuLV), using a lambda phage host vector system. One cloned MuLV DNA fragment (designated MCF 1) contained sequences extending 6.8 kilobases from an EcoRI restriction site in the 5' long terminal repeat (LTR) to an EcoRI site located in the envelope (env) region and was indistinguishable by restriction endonuclease mapping for 5.1 kilobases (except for the EcoRI site in the LTR) from the 5' end of AKR ecotropic proviral DNA. The DNA segment extending from 5.1 to 6.8 kilobases contained several restriction sites that were not present in the AKR ecotropic provirus. A 0.5-kilobase DNA segment located at the 3' end of MCF 1 DNA contained sequences which hybridized to a xenotropic env-specific DNA probe but not to labeled ecotropic env-specific DNA. This dual character of MCF 1 proviral DNA was also confirmed by analyzing heteroduplex molecules by electron microscopy. The second cloned proviral DNA (designated MCF 2) was a 6.9-kilobase EcoRI DNA fragment which contained LTR sequences at each end and a 2.0-kilobase deletion encompassing most of the env region. The MCF 2 proviral DNA proved to be a useful reagent for detecting LTRs electron microscopically due to the presence of nonoverlapping, terminally located LTR sequences which effected its circularization with DNAs containing homologous LTR sequences. Nucleotide sequence analysis demonstrated the presence of a 104-base-pair direct repeat in the LTR of MCF 2 DNA. In contrast, only a single copy of the reiterated component of the direct repeat was present in MCF 1 DNA. Images PMID:6281459
Quantification of HER2/neu gene amplification by competitive pcr using fluorescent melting curve analysis.

PubMed

Lyon, E; Millson, A; Lowery, M C; Woods, R; Wittwer, C T

2001-05-01

Molecular detection methods for HER2/neu gene amplification include fluorescence in situ hybridization (FISH) and competitive PCR. We designed a quantitative PCR system utilizing fluorescent hybridization probes and a competitor that differed from the HER2/neu sequence by a single base change. Increasing twofold concentrations of competitor were coamplified with DNA from cell lines with various HER2/neu copy numbers at the HER2/neu locus. Competitor DNA was distinguished from the HER2/neu sequence by a fluorescent hybridization probe and melting curve analysis on a fluorescence-monitoring thermal cycler. The percentages of competitor to target peak areas on derivative fluorescence vs temperature curves were used to calculate copy number. Real-time monitoring of the PCR reaction showed comparable relative areas throughout the log phase and during the PCR plateau, indicating that only end-point detection is necessary. The dynamic range was over two logs (2000-250 000 competitor copies) with CVs < 20%. Three cell lines (MRC-5, T-47D, and SK-BR-3) were determined to have gene doses of 1, 3, and 11, respectively. Gene amplification was detected in 3 of 13 tumor samples and was correlated with conventional real-time PCR and FISH analysis. Use of relative peak areas allows gene copy numbers to be quantified against an internal competitive control in < 1 h.
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.

PubMed Central

Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V

1985-01-01

The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this. Images PMID:3016521
Human beta-globin gene polymorphisms characterized in DNA extracted from ancient bones 12,000 years old.

PubMed

Béraud-Colomb, E; Roubin, R; Martin, J; Maroc, N; Gardeisen, A; Trabuchet, G; Goosséns, M

1995-12-01

Analyzing the nuclear DNA from ancient human bones is an essential step to the understanding of genetic diversity in current populations, provided that such systematic studies are experimentally feasible. This article reports the successful extraction and amplification of nuclear DNA from the beta-globin region from 5 of 10 bone specimens up to 12,000 years old. These have been typed for beta-globin frameworks by sequencing through two variable positions and for a polymorphic (AT) chi (T) gamma microsatellite 500 bp upstream of the beta-globin gene. These specimens of human remains are somewhat older than those analyzed in previous nuclear gene sequencing reports and considerably older than those used to study high-copy-number human mtDNA. These results show that the systematic study of nuclear DNA polymorphisms of ancient populations is feasible.
Hyb-Seq: Combining target enrichment and genome skimming for plant phylogenomics1

PubMed Central

Weitemier, Kevin; Straub, Shannon C. K.; Cronn, Richard C.; Fishbein, Mark; Schmickl, Roswitha; McDonnell, Angela; Liston, Aaron

2014-01-01

• Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. • Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca) were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp) followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera) resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. • Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics. PMID:25225629
Electrochemical detection of Francisella tularensis genomic DNA using solid-phase recombinase polymerase amplification.

PubMed

del Río, Jonathan Sabaté; Yehia Adly, Nouran; Acero-Sánchez, Josep Lluis; Henry, Olivier Y F; O'Sullivan, Ciara K

2014-04-15

Solid-phase isothermal DNA amplification was performed exploiting the homology protein recombinase A (recA). The system was primarily tested on maleimide activated microtitre plates as a proof-of-concept and later translated to an electrochemical platform. In both cases, forward primer for Francisella tularensis holarctica genomic DNA was surface immobilised via a thiol or an amino moiety and then elongated during the recA mediated amplification, carried out in the presence of specific target sequence and reverse primers. The formation of the subsequent surface tethered amplicons was either colorimetrically or electrochemically monitored using a horseradish peroxidase (HRP)-labelled DNA secondary probe complementary to the elongated strand. The amplification time was optimised to amplify even low amounts of DNA copies in less than an hour at a constant temperature of 37°C, achieving a limit of detection of 1.3×10(-13) M (4×10(6) copies in 50 μL) for the colorimetric assay and 3.3×10(-14) M (2×10(5) copies in 10 μL) for the chronoamperometric assay. The system was demonstrated to be highly specific with negligible cross-reactivity with non-complementary targets or primers. © 2013 Elsevier B.V. All rights reserved.
nbCNV: a multi-constrained optimization model for discovering copy number variants in single-cell sequencing data.

PubMed

Zhang, Changsheng; Cai, Hongmin; Huang, Jingying; Song, Yan

2016-09-17

Variations in DNA copy number have an important contribution to the development of several diseases, including autism, schizophrenia and cancer. Single-cell sequencing technology allows the dissection of genomic heterogeneity at the single-cell level, thereby providing important evolutionary information about cancer cells. In contrast to traditional bulk sequencing, single-cell sequencing requires the amplification of the whole genome of a single cell to accumulate enough samples for sequencing. However, the amplification process inevitably introduces amplification bias, resulting in an over-dispersing portion of the sequencing data. Recent study has manifested that the over-dispersed portion of the single-cell sequencing data could be well modelled by negative binomial distributions. We developed a read-depth based method, nbCNV to detect the copy number variants (CNVs). The nbCNV method uses two constraints-sparsity and smoothness to fit the CNV patterns under the assumption that the read signals are negatively binomially distributed. The problem of CNV detection was formulated as a quadratic optimization problem, and was solved by an efficient numerical solution based on the classical alternating direction minimization method. Extensive experiments to compare nbCNV with existing benchmark models were conducted on both simulated data and empirical single-cell sequencing data. The results of those experiments demonstrate that nbCNV achieves superior performance and high robustness for the detection of CNVs in single-cell sequencing data.
Accurate quantitation of circulating cell-free mitochondrial DNA in plasma by droplet digital PCR.

PubMed

Ye, Wei; Tang, Xiaojun; Liu, Chu; Wen, Chaowei; Li, Wei; Lyu, Jianxin

2017-04-01

To establish a method for accurate quantitation of circulating cell-free mitochondrial DNA (ccf-mtDNA) in plasma by droplet digital PCR (ddPCR), we designed a ddPCR method to determine the copy number of ccf-mtDNA by amplifying mitochondrial ND1 (MT-ND1). To evaluate the sensitivity and specificity of the method, a recombinant pMD18-T plasmid containing MT-ND1 sequences and mtDNA-deleted (ρ 0 ) HeLa cells were used, respectively. Subsequently, different plasma samples were prepared for ddPCR to evaluate the feasibility of detecting plasma ccf-mtDNA. In the results, the ddPCR method showed high sensitivity and specificity. When the DNA was extracted from plasma prior to ddPCR, the ccf-mtDNA copy number was higher than that measured without extraction. This difference was not due to a PCR inhibitor, such as EDTA-Na 2 , an anti-coagulant in plasma, because standard EDTA-Na 2 concentration (5 mM) did not significantly inhibit ddPCR reactions. The difference might be attributable to plasma exosomal mtDNA, which was 4.21 ± 0.38 copies/μL of plasma, accounting for ∼19% of plasma ccf-mtDNA. Therefore, ddPCR can quickly and reliably detect ccf-mtDNA from plasma with a prior DNA extraction step, providing for a more accurate detection of ccf-mtDNA. The direct use of plasma as a template in ddPCR is suitable for the detection of exogenous cell-free nucleic acids within plasma, but not of nucleic acids that have a vesicle-associated form, such as exosomal mtDNA. Graphical Abstract Designs of the present work. *: Module 1, #: Module 2, &: Module 3.
A multi-locus analysis of phylogenetic relationships within grass subfamily Pooideae (Poaceae) inferred from sequences of nuclear single copy gene regions compared with plastid DNA.

PubMed

Hochbach, Anne; Schneider, Julia; Röser, Martin

2015-06-01

To investigate phylogenetic relationships within the grass subfamily Pooideae we studied about 50 taxa covering all recognized tribes, using one plastid DNA (cpDNA) marker (matK gene-3'trnK exon) and for the first time four nuclear single copy gene loci. DNA sequence information from two parts of the nuclear genes topoisomerase 6 (Topo6) spanning the exons 8-13 and 17-19, the exons 9-13 encoding plastid acetyl-CoA-carboxylase (Acc1) and the partial exon 1 of phytochrome B (PhyB) were generated. Individual and nuclear combined data were evaluated using maximum parsimony, maximum likelihood and Bayesian methods. All of the phylogenetic results show Brachyelytrum and the tribe Nardeae as earliest diverging lineages within the subfamily. The 'core' Pooideae (Hordeeae and the Aveneae/Poeae tribe complex) are also strongly supported, as well as the monophyly of the tribes Brachypodieae, Meliceae and Stipeae (except PhyB). The beak grass tribe Diarrheneae and the tribe Duthieeae are not monophyletic in some of the analyses. However, the combined nuclear DNA (nDNA) tree yields the highest resolution and the best delimitation of the tribes, and provides the following evolutionary hypothesis for the tribes: Brachyelytrum, Nardeae, Duthieeae, Meliceae, Stipeae, Diarrheneae, Brachypodieae and the 'core' Pooideae. Within the individual datasets, the phylogenetic trees obtained from Topo6 exon 8-13 shows the most interesting results. The divergent positions of some clone sequences of Ampelodesmos mauritanicus and Trikeraia pappiformis, for instance, may indicate a hybrid origin of these stipoid taxa. Copyright © 2015 Elsevier Inc. All rights reserved.
Identification and chromosome mapping of repetitive elements in the Astyanax scabripinnis (Teleostei: Characidae) species complex.

PubMed

Barbosa, Patrícia; de Oliveira, Luiz Antonio; Pucci, Marcela Baer; Santos, Mateus Henrique; Moreira-Filho, Orlando; Vicari, Marcelo Ricardo; Nogaroto, Viviane; de Almeida, Mara Cristina; Artoni, Roberto Ferreira

2015-02-01

Most part of the eukaryotic genome is composed of repeated sequences or multiple copies of DNA, which were considered as "junk DNA", and may be associated to the heterochromatin. In this study, three populations of Astyanax aff. scabripinnis from Brazilian rivers of Guaratinguetá and Pindamonhangaba (São Paulo) and a population from Maringá (Paraná) were analyzed concerning the localization of the nucleolar organizer regions (Ag-NORs), the As51 satellite DNA, the 18S ribosomal DNA (rDNA), and the 5S rDNA. Repeated sequences were also isolated and identified by the Cot - 1 method, which indicated similarity (90%) with the LINE UnaL2 retrotransposon. The fluorescence in situ hybridization (FISH) showed the retrotransposon dispersed and more concentrated markers in centromeric and telomeric chromosomal regions. These sequences were co-localized and interspaced with 18S and 5S rDNA and As51, confirmed by fiber-FISH essay. The B chromosome found in these populations pointed to a conspicuous hybridization with LINE probe, which is also co-located in As51 sequences. The NORs were active at unique sites of a homologous pair in the three populations. There were no evidences that transposable elements and repetitive DNA had influence in the transcriptional regulation of ribosomal genes in our analyses.
Correction of the lack of commutability between plasmid DNA and genomic DNA for quantification of genetically modified organisms using pBSTopas as a model.

PubMed

Zhang, Li; Wu, Yuhua; Wu, Gang; Cao, Yinglong; Lu, Changming

2014-10-01

Plasmid calibrators are increasingly applied for polymerase chain reaction (PCR) analysis of genetically modified organisms (GMOs). To evaluate the commutability between plasmid DNA (pDNA) and genomic DNA (gDNA) as calibrators, a plasmid molecule, pBSTopas, was constructed, harboring a Topas 19/2 event-specific sequence and a partial sequence of the rapeseed reference gene CruA. Assays of the pDNA showed similar limits of detection (five copies for Topas 19/2 and CruA) and quantification (40 copies for Topas 19/2 and 20 for CruA) as those for the gDNA. Comparisons of plasmid and genomic standard curves indicated that the slopes, intercepts, and PCR efficiency for pBSTopas were significantly different from CRM Topas 19/2 gDNA for quantitative analysis of GMOs. Three correction methods were used to calibrate the quantitative analysis of control samples using pDNA as calibrators: model a, or coefficient value a (Cva); model b, or coefficient value b (Cvb); and the novel model c or coefficient formula (Cf). Cva and Cvb gave similar estimated values for the control samples, and the quantitative bias of the low concentration sample exceeded the acceptable range within ±25% in two of the four repeats. Using Cfs to normalize the Ct values of test samples, the estimated values were very close to the reference values (bias -13.27 to 13.05%). In the validation of control samples, model c was more appropriate than Cva or Cvb. The application of Cf allowed pBSTopas to substitute for Topas 19/2 gDNA as a calibrator to accurately quantify the GMO.
Inhibition of colorectal cancer genomic copy number alterations and chromosomal fragile site tumor suppressor FHIT and WWOX deletions by DNA mismatch repair

PubMed Central

Gelincik, Ozkan; Blecua, Pedro; Edelmann, Winfried; Kucherlapati, Raju; Zhou, Kathy; Jasin, Maria; Gümüş, Zeynep H.; Lipkin, Steven M.

2017-01-01

Homologous recombination (HR) enables precise DNA repair after DNA double strand breaks (DSBs) using identical sequence templates, whereas homeologous recombination (HeR) uses only partially homologous sequences. Homeologous recombination introduces mutations through gene conversion and genomic deletions through single-strand annealing (SSA). DNA mismatch repair (MMR) inhibits HeR, but the roles of mammalian MMR MutL homologues (MLH1, PMS2 and MLH3) proteins in HeR suppression are poorly characterized. Here, we demonstrate that mouse embryonic fibroblasts (MEFs) carrying Mlh1, Pms2, and Mlh3 mutations have higher HeR rates, by using 7,863 uniquely mapping paired direct repeat sequences (DRs) in the mouse genome as endogenous gene conversion and SSA reporters. Additionally, when DSBs are induced by gamma-radiation, Mlh1, Pms2 and Mlh3 mutant MEFs have higher DR copy number alterations (CNAs), including DR CNA hotspots previously identified in mouse MMR-deficient colorectal cancer (dMMR CRC). Analysis of The Cancer Genome Atlas CRC data revealed that dMMR CRCs have higher genome-wide DR HeR rates than MMR proficient CRCs, and that dMMR CRCs have deletion hotspots in tumor suppressors FHIT/WWOX at chromosomal fragile sites FRA3B and FRA16D (which have elevated DSB rates) flanked by paired homologous DRs and inverted repeats (IR). Overall, these data provide novel insights into the MMR-dependent HeR inhibition mechanism and its role in tumor suppression. PMID:29069730
Comparative Analysis of the Complete Chloroplast Genome of Four Endangered Herbals of Notopterygium

PubMed Central

Yang, Jiao; Yue, Ming; Niu, Chuan; Ma, Xiong-Feng; Li, Zhong-Hu

2017-01-01

Notopterygium H. de Boissieu (Apiaceae) is an endangered perennial herb endemic to China. A good knowledge of phylogenetic evolution and population genomics is conducive to the establishment of effective management and conservation strategies of the genus Notopterygium. In this study, the complete chloroplast (cp) genomes of four Notopterygium species (N. incisum C. C. Ting ex H. T. Chang, N. oviforme R. H. Shan, N. franchetii H. de Boissieu and N. forrestii H. Wolff) were assembled and characterized using next-generation sequencing. We investigated the gene organization, order, size and repeat sequences of the cp genome and constructed the phylogenetic relationships of Notopterygium species based on the chloroplast DNA and nuclear internal transcribed spacer (ITS) sequences. Comparative analysis of plastid genome showed that the cp DNA are the standard double-stranded molecule, ranging from 157,462 bp (N. oviforme) to 159,607 bp (N. forrestii) in length. The circular DNA each contained a large single-copy (LSC) region, a small single-copy (SSC) region, and a pair of inverted repeats (IRs). The cp DNA of four species contained 85 protein-coding genes, 37 transfer RNA (tRNA) genes and 8 ribosomal RNA (rRNA) genes, respectively. We determined the marked conservation of gene content and sequence evolutionary rate in the cp genome of four Notopterygium species. Three genes (psaI, psbI and rpoA) were possibly under positive selection among the four sampled species. Phylogenetic analysis showed that four Notopterygium species formed a monophyletic clade with high bootstrap support. However, the inconsistent interspecific relationships with the genus Notopterygium were identified between the cp DNA and ITS markers. The incomplete lineage sorting, convergence evolution or hybridization, gene infiltration and different sampling strategies among species may have caused the incongruence between the nuclear and cp DNA relationships. The present results suggested that Notopterygium species may have experienced a complex evolutionary history and speciation process. PMID:28422071
Comprehensive Survey of Genetic Diversity in Chloroplast Genomes and 45S nrDNAs within Panax ginseng Species

PubMed Central

Kim, Kyunghee; Lee, Sang-Choon; Lee, Junki; Lee, Hyun Oh; Joh, Ho Jun; Kim, Nam-Hoon; Park, Hyun-Seung; Yang, Tae-Jin

2015-01-01

We report complete sequences of chloroplast (cp) genome and 45S nuclear ribosomal DNA (45S nrDNA) for 11 Panax ginseng cultivars. We have obtained complete sequences of cp and 45S nrDNA, the representative barcoding target sequences for cytoplasm and nuclear genome, respectively, based on low coverage NGS sequence of each cultivar. The cp genomes sizes ranged from 156,241 to 156,425 bp and the major size variation was derived from differences in copy number of tandem repeats in the ycf1 gene and in the intergenic regions of rps16-trnUUG and rpl32-trnUAG. The complete 45S nrDNA unit sequences were 11,091 bp, representing a consensus single transcriptional unit with an intergenic spacer region. Comparative analysis of these sequences as well as those previously reported for three Chinese accessions identified very rare but unique polymorphism in the cp genome within P. ginseng cultivars. There were 12 intra-species polymorphisms (six SNPs and six InDels) among 14 cultivars. We also identified five SNPs from 45S nrDNA of 11 Korean ginseng cultivars. From the 17 unique informative polymorphic sites, we developed six reliable markers for analysis of ginseng diversity and cultivar authentication. PMID:26061692
Quantitative Detection of the Free-Living Amoeba Hartmannella vermiformis in Surface Water by Using Real-Time PCR†

PubMed Central

Kuiper, Melanie W.; Valster, Rinske M.; Wullings, Bart A.; Boonstra, Harry; Smidt, Hauke; van der Kooij, Dick

2006-01-01

A real-time PCR-based method targeting the 18S rRNA gene was developed for the quantitative detection of Hartmannella vermiformis, a free-living amoeba which is a potential host for Legionella pneumophila in warm water systems and cooling towers. The detection specificity was validated using genomic DNA of the closely related amoeba Hartmannella abertawensis as a negative control and sequence analysis of amplified products from environmental samples. Real-time PCR detection of serially diluted DNA extracted from H. vermiformis was linear for microscopic cell counts between 1.14 × 10−1 and 1.14 × 104 cells per PCR. The genome of H. vermiformis harbors multiple copies of the 18S rRNA gene, and an average number (with standard error) of 1,330 ± 127 copies per cell was derived from real-time PCR calibration curves for cell suspensions and plasmid DNA. No significant differences were observed between the 18S rRNA gene copy numbers for trophozoites and cysts of strain ATCC 50237 or between the copy numbers for this strain and strain KWR-1. The developed method was applied to water samples (200 ml) collected from a variety of lakes and rivers serving as sources for drinking water production in The Netherlands. Detectable populations were found in 21 of the 28 samples, with concentrations ranging from 5 to 75 cells/liter. A high degree of similarity (≥98%) was observed between sequences of clones originating from the different surface waters and between these clones and the reference strains. Hence, H. vermiformis, which is highly similar to strains serving as hosts for L. pneumophila, is a common component of the microbial community in fresh surface water. PMID:16957190
Development and in-house validation of the event-specific qualitative and quantitative PCR detection methods for genetically modified cotton MON15985.

PubMed

Jiang, Lingxi; Yang, Litao; Rao, Jun; Guo, Jinchao; Wang, Shu; Liu, Jia; Lee, Seonghun; Zhang, Dabing

2010-02-01

To implement genetically modified organism (GMO) labeling regulations, an event-specific analysis method based on the junction sequence between exogenous integration and host genomic DNA has become the preferential approach for GMO identification and quantification. In this study, specific primers and TaqMan probes based on the revealed 5'-end junction sequence of GM cotton MON15985 were designed, and qualitative and quantitative polymerase chain reaction (PCR) assays were established employing the designed primers and probes. In the qualitative PCR assay, the limit of detection (LOD) was 0.5 g kg(-1) in 100 ng total cotton genomic DNA, corresponding to about 17 copies of haploid cotton genomic DNA, and the LOD and limit of quantification (LOQ) for quantitative PCR assay were 10 and 17 copies of haploid cotton genomic DNA, respectively. Furthermore, the developed quantitative PCR assays were validated in-house by five different researchers. Also, five practical samples with known GM contents were quantified using the developed PCR assay in in-house validation, and the bias between the true and quantification values ranged from 2.06% to 12.59%. This study shows that the developed qualitative and quantitative PCR methods are applicable for the identification and quantification of GM cotton MON15985 and its derivates.
The Organization of Repetitive DNA in the Genomes of Amazonian Lizard Species in the Family Teiidae.

PubMed

Carvalho, Natalia D M; Pinheiro, Vanessa S S; Carmo, Edson J; Goll, Leonardo G; Schneider, Carlos H; Gross, Maria C

2015-01-01

Repetitive DNA is the largest fraction of the eukaryote genome and comprises tandem and dispersed sequences. It presents variations in relation to its composition, number of copies, distribution, dynamics, and genome organization, and participates in the evolutionary diversification of different vertebrate species. Repetitive sequences are usually located in the heterochromatin of centromeric and telomeric regions of chromosomes, contributing to chromosomal structures. Therefore, the aim of this study was to physically map repetitive DNA sequences (5S rDNA, telomeric sequences, tropomyosin gene 1, and retroelements Rex1 and SINE) of mitotic chromosomes of Amazonian species of teiids (Ameiva ameiva, Cnemidophorus sp. 1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin) to understand their genome organization and karyotype evolution. The mapping of repetitive sequences revealed a distinct pattern in Cnemidophorus sp. 1, whereas the other species showed all sequences interspersed in the heterochromatic region. Physical mapping of the tropomyosin 1 gene was performed for the first time in lizards and showed that in addition to being functional, this gene has a structural function similar to the mapped repetitive elements as it is located preferentially in centromeric regions and termini of chromosomes. © 2016 S. Karger AG, Basel.
Complementary DNA cloning of the pear 1-aminocyclopropane-1-carboxylic acid oxidase gene and agrobacterium-mediated anti-sense genetic transformation.

PubMed

Qi, Jing; Dong, Zhen; Zhang, Yu-Xing

2015-12-01

The aim of the present study was to genetically modify plantlets of the Chinese yali pear to reduce their expression of ripening-associated 1-aminocyclopropane-1-carboxylic acid oxidase (ACO) and therefore increase the shelf-life of the fruit. Primers were designed with selectivity for the conserved regions of published ACO gene sequences, and yali complementary DNA (cDNA) cloning was performed by reverse transcription quantitative polymerase chain reaction (PCR). The obtained cDNA fragment contained 831 base pairs, encoding 276 amino acid residues, and shared no less than 94% nucleotide sequence identity with other published ACO genes. The cDNA fragment was inversely inserted into a pBI121 expression vector, between the cauliflower mosaic virus 35S promoter and the nopaline synthase terminator, in order to construct the anti‑sense expression vector of the ACO gene; it was transfected into cultured yali plants using Agrobacterium LBA4404. Four independent transgenic lines of pear plantlets were obtained and validated by PCR analysis. A Southern blot assay revealed that there were three transgenic lines containing a single copy of exogenous gene and one line with double copies. The present study provided germplasm resources for the cultivation of novel storage varieties of pears, therefore providing a reference for further applications of anti‑sense RNA technology in the genetic improvement of pears and other fruit.
Characterization of the Fb-Nof Transposable Element of Drosophila Melanogaster

PubMed Central

Harden, N.; Ashburner, M.

1990-01-01

FB-NOF is a composite transposable element of Drosophila melanogaster. It is composed of foldback sequences, of variable length, which flank a 4-kb NOF sequence with 308-bp inverted repeat termini. The NOF sequence could potentially code for a 120-kD polypeptide. The FB-NOF element is responsible for unstable mutations of the white gene (w(c) and w(DZL)) and is associated with the large TEs of G. Ising. Although most strains of D. melanogaster have 20-30 sites of FB insertion, FB-NOF elements are usually rare, many strains lack this composite element or have only one copy of it. A few strains, including w(DZL) and Basc have many (8-21) copies of FB-NOF, and these show a tendency to insert at ``hot-spots.'' These strains also have an increased number of FB elements. The DNA sequence of the NOF region associated with TE146(Z) has been determined. PMID:2174013

DAMe: a toolkit for the initial processing of datasets with PCR replicates of double-tagged amplicons for DNA metabarcoding analyses.

PubMed

Zepeda-Mendoza, Marie Lisandra; Bohmann, Kristine; Carmona Baez, Aldo; Gilbert, M Thomas P

2016-05-03

DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way. We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe. DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.
Hop stunt viroid: molecular cloning and nucleotide sequence of the complete cDNA copy.

PubMed Central

Ohno, T; Takamatsu, N; Meshi, T; Okada, Y

1983-01-01

The complete cDNA of hop stunt viroid (HSV) has been cloned by the method of Okayama and Berg (Mol.Cell.Biol.2,161-170. (1982] and the complete nucleotide sequence has been established. The covalently closed circular single-stranded HSV RNA consists of 297 nucleotides. The secondary structure predicted for HSV contains 67% of its residues base-paired. The native HSV can possess an extended rod-like structure characteristic of viroids previously established. The central region of the native HSV has a similar structure to the conserved region found in all viroids sequenced so far except for avocado sunblotch viroid. The sequence homologous to the 5'-end of U1a RNA is also found in the sequence of HSV but not in the central conserved region. Images PMID:6312412
Noninvasive Prenatal Testing and Incidental Detection of Occult Maternal Malignancies.

PubMed

Bianchi, Diana W; Chudova, Darya; Sehnert, Amy J; Bhatt, Sucheta; Murray, Kathryn; Prosen, Tracy L; Garber, Judy E; Wilkins-Haug, Louise; Vora, Neeta L; Warsof, Stephen; Goldberg, James; Ziainia, Tina; Halks-Miller, Meredith

2015-07-14

Understanding the relationship between aneuploidy detection on noninvasive prenatal testing (NIPT) and occult maternal malignancies may explain results that are discordant with the fetal karyotype and improve maternal clinical care. To evaluate massively parallel sequencing data for patterns of copy-number variations that might prospectively identify occult maternal malignancies. Case series identified from 125,426 samples submitted between February 15, 2012, and September 30, 2014, from asymptomatic pregnant women who underwent plasma cell-free DNA sequencing for clinical prenatal aneuploidy screening. Analyses were conducted in a clinical laboratory that performs DNA sequencing. Among the clinical samples, abnormal results were detected in 3757 (3%); these were reported to the ordering physician with recommendations for further evaluation. NIPT for fetal aneuploidy screening (chromosomes 13, 18, 21, X, and Y). Detailed genome-wide bioinformatics analysis was performed on available sequencing data from 8 of 10 women with known cancers. Genome-wide copy-number changes in the original NIPT samples and in subsequent serial samples from individual patients when available are reported. Copy-number changes detected in NIPT sequencing data in the known cancer cases were compared with the types of aneuploidies detected in the overall cohort. From a cohort of 125,426 NIPT results, 3757 (3%) were positive for 1 or more aneuploidies involving chromosomes 13, 18, 21, X, or Y. From this set of 3757 samples, 10 cases of maternal cancer were identified. Detailed clinical and sequencing data were obtained in 8. Maternal cancers most frequently occurred with the rare NIPT finding of more than 1 aneuploidy detected (7 known cancers among 39 cases of multiple aneuploidies by NIPT, 18% [95% CI, 7.5%-33.5%]). All 8 cases that underwent further bioinformatics analysis showed unique patterns of nonspecific copy-number gains and losses across multiple chromosomes. In 1 case, blood was sampled after completion of treatment for colorectal cancer and the abnormal pattern was no longer evident. In this preliminary study, a small number of cases of occult malignancy were subsequently diagnosed among pregnant women whose noninvasive prenatal testing results showed discordance with the fetal karyotype. The clinical importance of these findings will require further research.
Environmental distribution, abundance and activity of the Miscellaneous Crenarchaeotal Group

NASA Astrophysics Data System (ADS)

Lloyd, K. G.; Biddle, J.; Teske, A.

2011-12-01

Many marine sedimentary microbes have only been identified by 16S rRNA sequences. Consequently, little is known about the types of metabolism, activity levels, or relative abundance of these groups in marine sediments. We found that one of these uncultured groups, called the Miscellaneous Crenarchaeotal Group (MCG), dominated clone libraries made from reverse transcribed 16S rRNA, and 454 pyrosequenced 16S rRNA genes, in the White Oak River estuary. Primers suitable for quantitative PCR were developed for MCG and used to show that 16S rRNA DNA copy numbers from MCG account for nearly all the archaeal 16S rRNA genes present. RT-qPCR shows much less MCG rRNA than total archaeal rRNA, but comparisons of different primers for each group suggest bias in the RNA-based work relative to the DNA-based work. There is no evidence of a population shift with depth below the sulfate-methane transition zone, suggesting that the metabolism of MCG may not be tied to sulfur or methane cycles. We classified 2,771 new sequences within the SSU Silva 106 database that, along with the classified sequences in the Silva database was used to make an MCG database of 4,646 sequences that allowed us to increase the named subgroups of MCG from 7 to 19. Percent terrestrial sequences in each subgroup is positively correlated with percent of the marine sequences that are nearshore, suggesting that membership in the different subgroups is not random, but dictated by environmental selective pressures. Given their high phylogenetic diversity, ubiquitous distribution in anoxic environments, and high DNA copy number relative to total archaea, members of MCG are most likely anaerobic heterotrophs who are integral to the post-depositional marine carbon cycle.
Ribosomal DNA Organization Before and After Magnification in Drosophila melanogaster

PubMed Central

Bianciardi, Alessio; Boschi, Manuela; Swanson, Ellen E.; Belloni, Massimo; Robbins, Leonard G.

2012-01-01

In all eukaryotes, the ribosomal RNA genes are stably inherited redundant elements. In Drosophila melanogaster, the presence of a Ybb− chromosome in males, or the maternal presence of the Ribosomal exchange (Rex) element, induces magnification: a heritable increase of rDNA copy number. To date, several alternative classes of mechanisms have been proposed for magnification: in situ replication or extra-chromosomal replication, either of which might act on short or extended strings of rDNA units, or unequal sister chromatid exchange. To eliminate some of these hypotheses, none of which has been clearly proven, we examined molecular-variant composition and compared genetic maps of the rDNA in the bb2 mutant and in some magnified bb+ alleles. The genetic markers used are molecular-length variants of IGS sequences and of R1 and R2 mobile elements present in many 28S sequences. Direct comparison of PCR products does not reveal any particularly intensified electrophoretic bands in magnified alleles compared to the nonmagnified bb2 allele. Hence, the increase of rDNA copy number is diluted among multiple variants. We can therefore reject mechanisms of magnification based on multiple rounds of replication of short strings. Moreover, we find no changes of marker order when pre- and postmagnification maps are compared. Thus, we can further restrict the possible mechanisms to two: replication in situ of an extended string of rDNA units or unequal exchange between sister chromatids. PMID:22505623
Mitochondrial Genome Sequences of Nematocera (Lower Diptera): Evidence of Rearrangement following a Complete Genome Duplication in a Winter Crane Fly

PubMed Central

Beckenbach, Andrew T.

2012-01-01

The complete mitochondrial DNA sequences of eight representatives of lower Diptera, suborder Nematocera, along with nearly complete sequences from two other species, are presented. These taxa represent eight families not previously represented by complete mitochondrial DNA sequences. Most of the sequences retain the ancestral dipteran mitochondrial gene arrangement, while one sequence, that of the midge Arachnocampa flava (family Keroplatidae), has an inversion of the trnE gene. The most unusual result is the extensive rearrangement of the mitochondrial genome of a winter crane fly, Paracladura trichoptera (family Trichocera). The pattern of rearrangement indicates that the mechanism of rearrangement involved a tandem duplication of the entire mitochondrial genome, followed by random and nonrandom loss of one copy of each gene. Another winter crane fly retains the ancestral diperan gene arrangement. A preliminary mitochondrial phylogeny of the Diptera is also presented. PMID:22155689
Gene Deletion in Barley Mediated by LTR-retrotransposon BARE

PubMed Central

Shang, Yi; Yang, Fei; Schulman, Alan H.; Zhu, Jinghuan; Jia, Yong; Wang, Junmei; Zhang, Xiao-Qi; Jia, Qiaojun; Hua, Wei; Yang, Jianming; Li, Chengdao

2017-01-01

A poly-row branched spike (prbs) barley mutant was obtained from soaking a two-rowed barley inflorescence in a solution of maize genomic DNA. Positional cloning and sequencing demonstrated that the prbs mutant resulted from a 28 kb deletion including the inflorescence architecture gene HvRA2. Sequence annotation revealed that the HvRA2 gene is flanked by two LTR (long terminal repeat) retrotransposons (BARE) sharing 89% sequence identity. A recombination between the integrase (IN) gene regions of the two BARE copies resulted in the formation of an intact BARE and loss of HvRA2. No maize DNA was detected in the recombination region although the flanking sequences of HvRA2 gene showed over 73% of sequence identity with repetitive sequences on 10 maize chromosomes. It is still unknown whether the interaction of retrotransposons between barley and maize has resulted in the recombination observed in the present study. PMID:28252053
CAPRRESI: Chimera Assembly by Plasmid Recovery and Restriction Enzyme Site Insertion.

PubMed

Santillán, Orlando; Ramírez-Romero, Miguel A; Dávila, Guillermo

2017-06-25

Here, we present chimera assembly by plasmid recovery and restriction enzyme site insertion (CAPRRESI). CAPRRESI benefits from many strengths of the original plasmid recovery method and introduces restriction enzyme digestion to ease DNA ligation reactions (required for chimera assembly). For this protocol, users clone wildtype genes into the same plasmid (pUC18 or pUC19). After the in silico selection of amino acid sequence regions where chimeras should be assembled, users obtain all the synonym DNA sequences that encode them. Ad hoc Perl scripts enable users to determine all synonym DNA sequences. After this step, another Perl script searches for restriction enzyme sites on all synonym DNA sequences. This in silico analysis is also performed using the ampicillin resistance gene (ampR) found on pUC18/19 plasmids. Users design oligonucleotides inside synonym regions to disrupt wildtype and ampR genes by PCR. After obtaining and purifying complementary DNA fragments, restriction enzyme digestion is accomplished. Chimera assembly is achieved by ligating appropriate complementary DNA fragments. pUC18/19 vectors are selected for CAPRRESI because they offer technical advantages, such as small size (2,686 base pairs), high copy number, advantageous sequencing reaction features, and commercial availability. The usage of restriction enzymes for chimera assembly eliminates the need for DNA polymerases yielding blunt-ended products. CAPRRESI is a fast and low-cost method for fusing protein-coding genes.
The 5S rDNA in two Abracris grasshoppers (Ommatolampidinae: Acrididae): molecular and chromosomal organization.

PubMed

Bueno, Danilo; Palacios-Gimenez, Octavio Manuel; Martí, Dardo Andrea; Mariguela, Tatiane Casagrande; Cabral-de-Mello, Diogo Cavalcanti

2016-08-01

The 5S ribosomal DNA (rDNA) sequences are subject of dynamic evolution at chromosomal and molecular levels, evolving through concerted and/or birth-and-death fashion. Among grasshoppers, the chromosomal location for this sequence was established for some species, but little molecular information was obtained to infer evolutionary patterns. Here, we integrated data from chromosomal and nucleotide sequence analysis for 5S rDNA in two Abracris species aiming to identify evolutionary dynamics. For both species, two arrays were identified, a larger sequence (named type-I) that consisted of the entire 5S rDNA gene plus NTS (non-transcribed spacer) and a smaller (named type-II) with truncated 5S rDNA gene plus short NTS that was considered a pseudogene. For type-I sequences, the gene corresponding region contained the internal control region and poly-T motif and the NTS presented partial transposable elements. Between the species, nucleotide differences for type-I were noticed, while type-II was identical, suggesting pseudogenization in a common ancestor. At chromosomal point to view, the type-II was placed in one bivalent, while type-I occurred in multiple copies in distinct chromosomes. In Abracris, the evolution of 5S rDNA was apparently influenced by the chromosomal distribution of clusters (single or multiple location), resulting in a mixed mechanism integrating concerted and birth-and-death evolution depending on the unit.
Applying DNA Barcodes to Identify Closely Related Species of Ferns: A Case Study of the Chinese Adiantum (Pteridaceae)

PubMed Central

Wen, Jun; Ebihara, Atsushi; Li, De-Zhu

2016-01-01

DNA barcoding is a fast-developing technique to identify species by using short and standard DNA sequences. Universal selection of DNA barcodes in ferns remains unresolved. In this study, five plastid regions (rbcL, matK, trnH-psbA, trnL-F and rps4-trnS) and eight nuclear regions (ITS, pgiC, gapC, LEAFY, ITS2, IBR3_2, DET1, and SQD1_1) were screened and evaluated in the fern genus Adiantum from China and neighboring areas. Due to low primer universality (matK) and/or the existence of multiple copies (ITS), the commonly used barcodes matK and ITS were not appropriate for Adiantum. The PCR amplification rate was extremely low in all nuclear genes except for IBR3_2. rbcL had the highest PCR amplification rate (94.33%) and sequencing success rate (90.78%), while trnH-psbA had the highest species identification rate (75%). With the consideration of discriminatory power, cost-efficiency and effort, the two-barcode combination of rbcL+ trnH-psbA seems to be the best choice for barcoding Adiantum, and perhaps basal polypod ferns in general. The nuclear IBR3_2 showed 100% PCR amplification success rate in Adiantum, however, it seemed that only diploid species could acquire clean sequences without cloning. With cloning, IBR3_2 can successfully distinguish cryptic species and hybrid species from their related species. Because hybridization and allopolyploidy are common in ferns, we argue for including a selected group of nuclear loci as barcodes, especially via the next-generation sequencing, as it is much more efficient to obtain single-copy nuclear loci without the cloning procedure. PMID:27603700
Forensic strategy to ensure the quality of sequencing data of mitochondrial DNA in highly degraded samples.

PubMed

Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki

2014-01-01

Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. It was observed that direct sequencing data of the hypervariable segment (HVS) 1 by using primer sets that generate an amplicon of 407 bp (long-primer sets) was different from results obtained by using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, it was unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Simultaneously measuring multiple protein interactions and their correlations in a cell by Protein-interactome Footprinting

PubMed Central

Luo, Si-Wei; Liang, Zhi; Wu, Jia-Rui

2017-01-01

Quantitatively detecting correlations of multiple protein-protein interactions (PPIs) in vivo is a big challenge. Here we introduce a novel method, termed Protein-interactome Footprinting (PiF), to simultaneously measure multiple PPIs in one cell. The principle of PiF is that each target physical PPI in the interactome is simultaneously transcoded into a specific DNA sequence based on dimerization of the target proteins fused with DNA-binding domains. The interaction intensity of each target protein is quantified as the copy number of the specific DNA sequences bound by each fusion protein dimers. Using PiF, we quantitatively reveal dynamic patterns of PPIs and their correlation network in E. coli two-component systems. PMID:28338015
Detection of genetically modified organisms (GMOs) using isothermal amplification of target DNA sequences.

PubMed

Lee, David; La Mura, Maurizio; Allnutt, Theo R; Powell, Wayne

2009-02-02

The most common method of GMO detection is based upon the amplification of GMO-specific DNA amplicons using the polymerase chain reaction (PCR). Here we have applied the loop-mediated isothermal amplification (LAMP) method to amplify GMO-related DNA sequences, 'internal' commonly-used motifs for controlling transgene expression and event-specific (plant-transgene) junctions. We have tested the specificity and sensitivity of the technique for use in GMO studies. Results show that detection of 0.01% GMO in equivalent background DNA was possible and dilutions of template suggest that detection from single copies of the template may be possible using LAMP. This work shows that GMO detection can be carried out using LAMP for routine screening as well as for specific events detection. Moreover, the sensitivity and ability to amplify targets, even with a high background of DNA, here demonstrated, highlights the advantages of this isothermal amplification when applied for GMO detection.
Multiplexed enrichment of rare DNA variants via sequence-selective and temperature-robust amplification

PubMed Central

Wu, Lucia R.; Chen, Sherry X.; Wu, Yalei; Patel, Abhijit A.; Zhang, David Yu

2018-01-01

Rare DNA-sequence variants hold important clinical and biological information, but existing detection techniques are expensive, complex, allele-specific, or don’t allow for significant multiplexing. Here, we report a temperature-robust polymerase-chain-reaction method, which we term blocker displacement amplification (BDA), that selectively amplifies all sequence variants, including single-nucleotide variants (SNVs), within a roughly 20-nucleotide window by 1,000-fold over wild-type sequences. This allows for easy detection and quantitation of hundreds of potential variants originally at ≤0.1% in allele frequency. BDA is compatible with inexpensive thermocycler instrumentation and employs a rationally designed competitive hybridization reaction to achieve comparable enrichment performance across annealing temperatures ranging from 56 °C to 64 °C. To show the sequence generality of BDA, we demonstrate enrichment of 156 SNVs and the reliable detection of single-digit copies. We also show that the BDA detection of rare driver mutations in cell-free DNA samples extracted from the blood plasma of lung-cancer patients is highly consistent with deep sequencing using molecular lineage tags, with a receiver operator characteristic accuracy of 95%. PMID:29805844
Varicella-zoster virus (VZV) origin of DNA replication oriS influences origin-dependent DNA replication and flanking gene transcription.

PubMed

Khalil, Mohamed I; Sommer, Marvin H; Hay, John; Ruyechan, William T; Arvin, Ann M

2015-07-01

The VZV genome has two origins of DNA replication (oriS), each of which consists of an AT-rich sequence and three origin binding protein (OBP) sites called Box A, C and B. In these experiments, the mutation in the core sequence CGC of the Box A and C not only inhibited DNA replication but also inhibited both ORF62 and ORF63 expression in reporter gene assays. In contrast the Box B mutation did not influence DNA replication or flanking gene transcription. These results suggest that efficient DNA replication enhances ORF62 and ORF63 transcription. Recombinant viruses carrying these mutations in both sites and one with a deletion of the whole oriS were constructed. Surprisingly, the recombinant virus lacking both copies of oriS retained the capacity to replicate in melanoma and HELF cells suggesting that VZV has another origin of DNA replication. Copyright © 2015 Elsevier Inc. All rights reserved.
Variation of 45S rDNA intergenic spacers in Arabidopsis thaliana.

PubMed

Havlová, Kateřina; Dvořáčková, Martina; Peiro, Ramon; Abia, David; Mozgová, Iva; Vansáčová, Lenka; Gutierrez, Crisanto; Fajkus, Jiří

2016-11-01

Approximately seven hundred 45S rRNA genes (rDNA) in the Arabidopsis thaliana genome are organised in two 4 Mbp-long arrays of tandem repeats arranged in head-to-tail fashion separated by an intergenic spacer (IGS). These arrays make up 5 % of the A. thaliana genome. IGS are rapidly evolving sequences and frequent rearrangements inside the rDNA loci have generated considerable interspecific and even intra-individual variability which allows to distinguish among otherwise highly conserved rRNA genes. The IGS has not been comprehensively described despite its potential importance in regulation of rDNA transcription and replication. Here we describe the detailed sequence variation in the complete IGS of A. thaliana WT plants and provide the reference/consensus IGS sequence, as well as genomic DNA analysis. We further investigate mutants dysfunctional in chromatin assembly factor-1 (CAF-1) (fas1 and fas2 mutants), which are known to have a reduced number of rDNA copies, and plant lines with restored CAF-1 function (segregated from a fas1xfas2 genetic background) showing major rDNA rearrangements. The systematic rDNA loss in CAF-1 mutants leads to the decreased variability of the IGS and to the occurrence of distinct IGS variants. We present for the first time a comprehensive and representative set of complete IGS sequences, obtained by conventional cloning and by Pacific Biosciences sequencing. Our data expands the knowledge of the A. thaliana IGS sequence arrangement and variability, which has not been available in full and in detail until now. This is also the first study combining IGS sequencing data with RFLP analysis of genomic DNA.
Structural and sequence diversity of the transposon Galileo in the Drosophila willistoni genome.

PubMed

Gonçalves, Juliana W; Valiati, Victor Hugo; Delprat, Alejandra; Valente, Vera L S; Ruiz, Alfredo

2014-09-13

Galileo is one of three members of the P superfamily of DNA transposons. It was originally discovered in Drosophila buzzatii, in which three segregating chromosomal inversions were shown to have been generated by ectopic recombination between Galileo copies. Subsequently, Galileo was identified in six of 12 sequenced Drosophila genomes, indicating its widespread distribution within this genus. Galileo is strikingly abundant in Drosophila willistoni, a neotropical species that is highly polymorphic for chromosomal inversions, suggesting a role for this transposon in the evolution of its genome. We carried out a detailed characterization of all Galileo copies present in the D. willistoni genome. A total of 191 copies, including 133 with two terminal inverted repeats (TIRs), were classified according to structure in six groups. The TIRs exhibited remarkable variation in their length and structure compared to the most complete copy. Three copies showed extended TIRs due to internal tandem repeats, the insertion of other transposable elements (TEs), or the incorporation of non-TIR sequences into the TIRs. Phylogenetic analyses of the transposase (TPase)-encoding and TIR segments yielded two divergent clades, which we termed Galileo subfamilies V and W. Target-site duplications (TSDs) in D. willistoni Galileo copies were 7- or 8-bp in length, with the consensus sequence GTATTAC. Analysis of the region around the TSDs revealed a target site motif (TSM) with a 15-bp palindrome that may give rise to a stem-loop secondary structure. There is a remarkable abundance and diversity of Galileo copies in the D. willistoni genome, although no functional copies were found. The TIRs in particular have a dynamic structure and extend in different ways, but their ends (required for transposition) are more conserved than the rest of the element. The D. willistoni genome harbors two Galileo subfamilies (V and W) that diverged ~9 million years ago and may have descended from an ancestral element in the genome. Galileo shows a significant insertion preference for a 15-bp palindromic TSM.
Mitochondrial genomic variation associated with higher mitochondrial copy number: the Cache County Study on Memory Health and Aging.

PubMed

Ridge, Perry G; Maxwell, Taylor J; Foutz, Spencer J; Bailey, Matthew H; Corcoran, Christopher D; Tschanz, JoAnn T; Norton, Maria C; Munger, Ronald G; O'Brien, Elizabeth; Kerber, Richard A; Cawthon, Richard M; Kauwe, John S K

2014-01-01

The mitochondria are essential organelles and are the location of cellular respiration, which is responsible for the majority of ATP production. Each cell contains multiple mitochondria, and each mitochondrion contains multiple copies of its own circular genome. The ratio of mitochondrial genomes to nuclear genomes is referred to as mitochondrial copy number. Decreases in mitochondrial copy number are known to occur in many tissues as people age, and in certain diseases. The regulation of mitochondrial copy number by nuclear genes has been studied extensively. While mitochondrial variation has been associated with longevity and some of the diseases known to have reduced mitochondrial copy number, the role that the mitochondrial genome itself has in regulating mitochondrial copy number remains poorly understood. We analyzed the complete mitochondrial genomes from 1007 individuals randomly selected from the Cache County Study on Memory Health and Aging utilizing the inferred evolutionary history of the mitochondrial haplotypes present in our dataset to identify sequence variation and mitochondrial haplotypes associated with changes in mitochondrial copy number. Three variants belonging to mitochondrial haplogroups U5A1 and T2 were significantly associated with higher mitochondrial copy number in our dataset. We identified three variants associated with higher mitochondrial copy number and suggest several hypotheses for how these variants influence mitochondrial copy number by interacting with known regulators of mitochondrial copy number. Our results are the first to report sequence variation in the mitochondrial genome that causes changes in mitochondrial copy number. The identification of these variants that increase mtDNA copy number has important implications in understanding the pathological processes that underlie these phenotypes.
Relations between DNA- and RNA-based molecular methods for cyanobacteria and microcystin concentration at Maumee Bay State Park Lakeside Beach, Oregon, Ohio, 2012

USGS Publications Warehouse

Stelzer, Erin A.; Loftin, Keith A.; Struffolino, Pamela

2013-01-01

Water samples were collected from Maumee Bay State Park Lakeside Beach, Oregon, Ohio, during the 2012 recreational season and analyzed for selected cyanobacteria gene sequences by DNA-based quantitative polymerase chain reaction (qPCR) and RNA-based quantitative reverse-transcription polymerase chain reaction (qRT-PCR). Results from the four DNA assays (for quantifying total cyanobacteria, total Microcystis, and Microcystis and Planktothrix strains that possess the microcystin synthetase E (mcyE) gene) and two RNA assays (for quantifying Microcystis and Planktothrix genera that are expressing the microcystin synthetase E (mcyE) gene) were compared to microcystin concentration results determined by an enzyme-linked immunosorbent assay (ELISA). Concentrations of the target in replicate analyses were log10 transformed. The average value of differences in log10 concentrations for the replicates that had at least one detection were found to range from 0.05 to >0.37 copy per 100 milliliters (copy/100 mL) for DNA-based methods and from >0.04 to >0.17 copy/100 mL for RNA-based methods. RNA has a shorter half-life than DNA; consequently, a 24-hour holding-time study was done to determine the effects of holding time on RNA concentrations. Holding-time comparisons for the RNA-based Microcystis toxin mcyE assay showed reductions in the number of copies per 100 milliliters over 24 hours. The log difference between time 2 hours and time 24 hours was >0.37 copy/100 mL, which was higher than the analytical variability (log difference of >0.17 copy/100 mL). Spearman’s correlation analysis indicated that microcystin toxin concentrations were moderately to highly related to DNA-based assay results for total cyanobacteria (rho=0.69), total Microcystis (rho=0.74), and Microcystis strains that possess the mcyE gene (rho=0.81). Microcystin toxin concentrations were strongly related with RNA-based assay results for Microcystis mcyE gene expression (rho=0.95). Correlation analysis could not be done for Planktothrix mcyE gene expression because of too few detections.
Efficient generation of transgenic cattle using the DNA transposon and their analysis by next-generation sequencing

PubMed Central

Yum, Soo-Young; Lee, Song-Jeon; Kim, Hyun-Min; Choi, Woo-Jae; Park, Ji-Hyun; Lee, Won-Wu; Kim, Hee-Soo; Kim, Hyeong-Jong; Bae, Seong-Hun; Lee, Je-Hyeong; Moon, Joo-Yeong; Lee, Ji-Hyun; Lee, Choong-Il; Son, Bong-Jun; Song, Sang-Hoon; Ji, Su-Min; Kim, Seong-Jin; Jang, Goo

2016-01-01

Here, we efficiently generated transgenic cattle using two transposon systems (Sleeping Beauty and Piggybac) and their genomes were analyzed by next-generation sequencing (NGS). Blastocysts derived from microinjection of DNA transposons were selected and transferred into recipient cows. Nine transgenic cattle have been generated and grown-up to date without any health issues except two. Some of them expressed strong fluorescence and the transgene in the oocytes from a superovulating one were detected by PCR and sequencing. To investigate genomic variants by the transgene transposition, whole genomic DNA were analyzed by NGS. We found that preferred transposable integration (TA or TTAA) was identified in their genome. Even though multi-copies (i.e. fifteen) were confirmed, there was no significant difference in genome instabilities. In conclusion, we demonstrated that transgenic cattle using the DNA transposon system could be efficiently generated, and all those animals could be a valuable resource for agriculture and veterinary science. PMID:27324781

Development of novel low-copy nuclear markers for Hieraciinae (Asteraceae) and their perspective for other tribes.

PubMed

Krak, Karol; Alvarez, Inés; Caklová, Petra; Costa, Andrea; Chrtek, Jindrich; Fehrer, Judith

2012-02-01

The development of three low-copy nuclear markers for low taxonomic level phylogenies in Asteraceae with emphasis on the subtribe Hieraciinae is reported. Marker candidates were selected by comparing a Lactuca complementary DNA (cDNA) library with public DNA sequence databases. Interspecific variation and phylogenetic signal of the selected genes were investigated for diploid taxa from the subtribe Hieraciinae and compared to a reference phylogeny. Their ability to cross-amplify was assessed for other Asteraceae tribes. All three markers had higher variation (2.1-4.5 times) than the internal transcribed spacer (ITS) in Hieraciinae. Cross-amplification was successful in at least seven other tribes of the Asteraceae. Only three cases indicating the presence of paralogs or pseudogenes were detected. The results demonstrate the potential of these markers for phylogeny reconstruction in the Hieraciinae as well as in other Asteraceae tribes, especially for very closely related species.
Establishing a novel single-copy primer-internal intron-spanning PCR (spiPCR) procedure for the direct detection of gene doping.

PubMed

Beiter, Thomas; Zimmermann, Martina; Fragasso, Annunziata; Armeanu, Sorin; Lauer, Ulrich M; Bitzer, Michael; Su, Hua; Young, William L; Niess, Andreas M; Simon, Perikles

2008-01-01

So far, the abuse of gene transfer technology in sport, so-called gene doping, is undetectable. However, recent studies in somatic gene therapy indicate that long-term presence of transgenic DNA (tDNA) following various gene transfer protocols can be found in DNA isolated from whole blood using conventional PCR protocols. Application of these protocols for the direct detection of gene doping would require almost complete knowledge about the sequence of the genetic information that has been transferred. Here, we develop and describe the novel single-copy primer-internal intron-spanning PCR (spiPCR) procedure that overcomes this difficulty. Apart from the interesting perspectives that this spiPCR procedure offers in the fight against gene doping, this technology could also be of interest in biodistribution and biosafety studies for gene therapeutic applications.
A pilot analytic study of a research-level, lower-cost human papillomavirus 16, 18, and 45 test.

PubMed

Yang, Hannah P; Walmer, David K; Merisier, Delson; Gage, Julia C; Bell, Laura; Rangwala, Sameera; Shrestha, Niwashin; Kobayashi, Lori; Eder, Paul S; Castle, Philip E

2011-09-01

The analytic performance of a low-cost, research-stage DNA test for the most carcinogenic human papillomavirus (HPV) genotypes (HPV16, HPV18, and HPV45) in aggregate was evaluated among carcinogenic HPV-positive women, which might be used to decide who needs immediate colposcopy in low-resource settings ("triage test"). We found that HPV16/18/45 test agreed well with two DNA tests, a GP5+/6+ genotyping assay (Kappa = 0.77) and a quantitative PCR assay (at a cutpoint of 5000 viral copies) (Kappa = 0.87). DNA sequencing on a subset of 16 HPV16/18/45 positive and 16 HPV16/18/45 negative verified the analytic specificity of the research test. It is concluded that the HPV16/18/45 assay is a promising triage test with a minimum detection of approximately 5000 viral copies, the clinically relevant threshold. Published by Elsevier B.V.
DNA recombination protein-dependent mechanism of homoplasmy and its proposed functions.

PubMed

Shibata, Takehiko; Ling, Feng

2007-01-01

Homoplasmy is a basic genetic state of mitochondria, in which all of the hundreds to thousands of mitochondrial (mt)DNA copies within a cell or an individual have the same nucleotide-sequence. It was recently found that "vegetative segregation" to generate homoplasmic cells is an active process under genetic control. In the yeast Saccharomyces cerevisiae, the Mhr1 protein which catalyzes a key reaction in mtDNA homologous recombination, plays a pivotal role in vegetative segregation. Conversely, within the nuclear genome, homologous DNA recombination causes genetic diversity. Considering these contradictory roles of this key reaction in DNA recombination, possible functions of homoplasmy are discussed.
The 28S–18S rDNA intergenic spacer from Crithidia fasciculata: repeated sequences, length heterogeneity, putative processing sites and potential interactions between U3 small nucleolar RNA and the ribosomal RNA precursor

PubMed Central

Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.

2000-01-01

In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C.fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C.fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C.fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863
Attomolar quantitation of Mycobacterium tuberculosis by asymmetric helicase-dependent isothermal DNA-amplification and electrochemical detection.

PubMed

Barreda-García, Susana; González-Álvarez, María José; de-Los-Santos-Álvarez, Noemí; Palacios-Gutiérrez, Juan José; Miranda-Ordieres, Arturo J; Lobo-Castañón, María Jesús

2015-06-15

A highly sensitive and robust method for the quantification of specific DNA sequences based on coupling asymmetric helicase-dependent DNA amplification to electrochemical detection is described. This method relies on the entrapment of the amplified ssDNA sequences on magnetic beads followed by a post-amplification hybridization assay to provide an added degree of specificity. As a proof-of-concept a 84-bases long sequence specific of Mycobacterium tuberculosis is amplified at 65°C, providing 3×10(6) amplification after 90 min. Using this system 0.5 aM, corresponding to 15 copies of the target gene in 50 µL of sample, can be successfully detected and reliably quantified under isothermal conditions in less than 4h. The assay has been applied to the detection of M. tuberculosis in sputum, pleural fluid and urine samples. Besides this application, the proposed assays is a powerful and general tool for molecular diagnostic that can be applied to the detection of other specific DNA sequences, taking full advantage of the plethora of genomic information now available. Copyright © 2014 Elsevier B.V. All rights reserved.
Characterization of the exogenous insert and development of event-specific PCR detection methods for genetically modified Huanong No. 1 papaya.

PubMed

Guo, Jinchao; Yang, Litao; Liu, Xin; Guan, Xiaoyan; Jiang, Lingxi; Zhang, Dabing

2009-08-26

Genetically modified (GM) papaya (Carica papaya L.), Huanong No. 1, was approved for commercialization in Guangdong province, China in 2006, and the development of the Huanong No. 1 papaya detection method is necessary for implementing genetically modified organism (GMO) labeling regulations. In this study, we reported the characterization of the exogenous integration of GM Huanong No. 1 papaya by means of conventional polymerase chain reaction (PCR) and thermal asymmetric interlaced (TAIL)-PCR strategies. The results suggested that one intact copy of the initial construction was integrated in the papaya genome and which probably resulted in one deletion (38 bp in size) of the host genomic DNA. Also, one unintended insertion of a 92 bp truncated NptII fragment was observed at the 5' end of the exogenous insert. Furthermore, we revealed its 5' and 3' flanking sequences between the insert DNA and the papaya genomic DNA, and developed the event-specific qualitative and quantitative PCR assays for GM Huanong No. 1 papaya based on the 5' integration flanking sequence. The relative limit of detection (LOD) of the qualitative PCR assay was about 0.01% in 100 ng of total papaya genomic DNA, corresponding to about 25 copies of papaya haploid genome. In the quantitative PCR, the limits of detection and quantification (LOD and LOQ) were as low as 12.5 and 25 copies of papaya haploid genome, respectively. In practical sample quantification, the quantified biases between the test and true values of three samples ranged from 0.44% to 4.41%. Collectively, we proposed that all of these results are useful for the identification and quantification of Huanong No. 1 papaya and its derivates.
Detection of Pasteuria penetrans infection in Meloidogyne arenaria race 1 in planta by polymerase chain reaction.

PubMed

Schmidt, L M; Preston, J F; Nong, G; Dickson, D W; Aldrich, H C

2004-06-01

We report on the development of a PCR-based assay to detect Pasteuria penetrans infection of Meloidogyne arenaria in planta using specific primers for recently sequenced sigE, spoIIAB and atpF genes of P. penetrans biotype P20. Amplification of these genes in crude DNA extracts of ground tomato root galls using real-time kinetic PCR distinguished infected from uninfected M. arenaria race 1 by analysis of consensus thresholds for single copy genes. Fluorescent in situ hybridization (FISH) using the sigE primer sequence as a probe shows hybridization to P. penetrans cells in various stages of vegetative (pre-endospore) development. Ratios of gene copies for sigE and 16S rDNA were obtained for P. penetrans and compared to Bacillus subtilis as a genomic paradigm of endospore-forming bacteria. Phylogenetic analysis of the sigE gene from Gram-positive, endospore-forming bacteria finds P. penetrans most closely related Paenbacillus polymyxa. The sporulation genes (spo genes), particularly sigE, have sequence diversity that recommends them for species and biotype differentiation of the numerous Pasteuria isolates that infect a large number of plant-parasitic nematodes.
Genetic characterization of the Bifidobacterium breve UCC 2003 hrcA locus.

PubMed

Ventura, Marco; Canchaya, Carlos; Bernini, Valentina; Del Casale, Antonio; Dellaglio, Franco; Neviani, Erasmo; Fitzgerald, Gerald F; van Sinderen, Douwe

2005-12-01

The bacterial heat shock response is characterized by the elevated expression of a number of chaperone complexes and transcriptional regulators, including the DnaJ and the HrcA proteins. Genome analysis of Bifidobacterium breve UCC 2003 revealed a second copy of a dnaJ gene, named dnaJ2, which is flanked by the hrcA gene in a genetic constellation that appears to be unique to the actinobacteria. Phylogenetic analysis using 53 bacterial dnaJ sequences, including both dnaJ1 and dnaJ2 sequences, suggests that these genes have followed a different evolutionary development. Furthermore, the B. breve UCC 2003 dnaJ2 gene seems to be regulated in a manner that is different from that of the previously characterized dnaJ1 gene. The dnaJ2 gene, which was shown to be part of a 2.3-kb bicistronic operon with hrcA, was induced by osmotic shock but not significantly by heat stress. This induction pattern is unlike those of other characterized dnaJ genes and may be indicative of a unique stress adaptation strategy by this commensal microorganism.
Genetic Characterization of the Bifidobacterium breve UCC 2003 hrcA Locus

PubMed Central

Ventura, Marco; Canchaya, Carlos; Bernini, Valentina; Del Casale, Antonio; Dellaglio, Franco; Neviani, Erasmo; Fitzgerald, Gerald F.; van Sinderen, Douwe

2005-01-01

The bacterial heat shock response is characterized by the elevated expression of a number of chaperone complexes and transcriptional regulators, including the DnaJ and the HrcA proteins. Genome analysis of Bifidobacterium breve UCC 2003 revealed a second copy of a dnaJ gene, named dnaJ2, which is flanked by the hrcA gene in a genetic constellation that appears to be unique to the actinobacteria. Phylogenetic analysis using 53 bacterial dnaJ sequences, including both dnaJ1 and dnaJ2 sequences, suggests that these genes have followed a different evolutionary development. Furthermore, the B. breve UCC 2003 dnaJ2 gene seems to be regulated in a manner that is different from that of the previously characterized dnaJ1 gene. The dnaJ2 gene, which was shown to be part of a 2.3-kb bicistronic operon with hrcA, was induced by osmotic shock but not significantly by heat stress. This induction pattern is unlike those of other characterized dnaJ genes and may be indicative of a unique stress adaptation strategy by this commensal microorganism. PMID:16332909
Development and validation of an rDNA operon based primer walking strategy applicable to de novo bacterial genome finishing

PubMed Central

Eastman, Alexander W.; Yuan, Ze-Chun

2015-01-01

Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms has necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provided a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing projects. PMID:25653642
Unexpected Inheritance: Multiple Integrations of Ancient Bornavirus and Ebolavirus/Marburgvirus Sequences in Vertebrate Genomes

PubMed Central

Belyi, Vladimir A.; Levine, Arnold J.; Skalka, Anna Marie

2010-01-01

Vertebrate genomes contain numerous copies of retroviral sequences, acquired over the course of evolution. Until recently they were thought to be the only type of RNA viruses to be so represented, because integration of a DNA copy of their genome is required for their replication. In this study, an extensive sequence comparison was conducted in which 5,666 viral genes from all known non-retroviral families with single-stranded RNA genomes were matched against the germline genomes of 48 vertebrate species, to determine if such viruses could also contribute to the vertebrate genetic heritage. In 19 of the tested vertebrate species, we discovered as many as 80 high-confidence examples of genomic DNA sequences that appear to be derived, as long ago as 40 million years, from ancestral members of 4 currently circulating virus families with single strand RNA genomes. Surprisingly, almost all of the sequences are related to only two families in the Order Mononegavirales: the Bornaviruses and the Filoviruses, which cause lethal neurological disease and hemorrhagic fevers, respectively. Based on signature landmarks some, and perhaps all, of the endogenous virus-like DNA sequences appear to be LINE element-facilitated integrations derived from viral mRNAs. The integrations represent genes that encode viral nucleocapsid, RNA-dependent-RNA-polymerase, matrix and, possibly, glycoproteins. Integrations are generally limited to one or very few copies of a related viral gene per species, suggesting that once the initial germline integration was obtained (or selected), later integrations failed or provided little advantage to the host. The conservation of relatively long open reading frames for several of the endogenous sequences, the virus-like protein regions represented, and a potential correlation between their presence and a species' resistance to the diseases caused by these pathogens, are consistent with the notion that their products provide some important biological advantage to the species. In addition, the viruses could also benefit, as some resistant species (e.g. bats) may serve as natural reservoirs for their persistence and transmission. Given the stringent limitations imposed in this informatics search, the examples described here should be considered a low estimate of the number of such integration events that have persisted over evolutionary time scales. Clearly, the sources of genetic information in vertebrate genomes are much more diverse than previously suspected. PMID:20686665
Unexpected inheritance: multiple integrations of ancient bornavirus and ebolavirus/marburgvirus sequences in vertebrate genomes.

PubMed

Belyi, Vladimir A; Levine, Arnold J; Skalka, Anna Marie

2010-07-29

Vertebrate genomes contain numerous copies of retroviral sequences, acquired over the course of evolution. Until recently they were thought to be the only type of RNA viruses to be so represented, because integration of a DNA copy of their genome is required for their replication. In this study, an extensive sequence comparison was conducted in which 5,666 viral genes from all known non-retroviral families with single-stranded RNA genomes were matched against the germline genomes of 48 vertebrate species, to determine if such viruses could also contribute to the vertebrate genetic heritage. In 19 of the tested vertebrate species, we discovered as many as 80 high-confidence examples of genomic DNA sequences that appear to be derived, as long ago as 40 million years, from ancestral members of 4 currently circulating virus families with single strand RNA genomes. Surprisingly, almost all of the sequences are related to only two families in the Order Mononegavirales: the Bornaviruses and the Filoviruses, which cause lethal neurological disease and hemorrhagic fevers, respectively. Based on signature landmarks some, and perhaps all, of the endogenous virus-like DNA sequences appear to be LINE element-facilitated integrations derived from viral mRNAs. The integrations represent genes that encode viral nucleocapsid, RNA-dependent-RNA-polymerase, matrix and, possibly, glycoproteins. Integrations are generally limited to one or very few copies of a related viral gene per species, suggesting that once the initial germline integration was obtained (or selected), later integrations failed or provided little advantage to the host. The conservation of relatively long open reading frames for several of the endogenous sequences, the virus-like protein regions represented, and a potential correlation between their presence and a species' resistance to the diseases caused by these pathogens, are consistent with the notion that their products provide some important biological advantage to the species. In addition, the viruses could also benefit, as some resistant species (e.g. bats) may serve as natural reservoirs for their persistence and transmission. Given the stringent limitations imposed in this informatics search, the examples described here should be considered a low estimate of the number of such integration events that have persisted over evolutionary time scales. Clearly, the sources of genetic information in vertebrate genomes are much more diverse than previously suspected.
Multiple copies of a bile acid-inducible gene in Eubacterium sp. strain VPI 12708.

PubMed Central

Gopal-Srivastava, R; Mallonee, D H; White, W B; Hylemon, P B

1990-01-01

Eubacterium sp. strain VPI 12708 is an anaerobic intestinal bacterium which possesses inducible bile acid 7-dehydroxylation activity. Several new polypeptides are produced in this strain following induction with cholic acid. Genes coding for two copies of a bile acid-inducible 27,000-dalton polypeptide (baiA1 and baiA2) have been previously cloned and sequenced. We now report on a gene coding for a third copy of this 27,000-dalton polypeptide (baiA3). The baiA3 gene has been cloned in lambda DASH on an 11.2-kilobase DNA fragment from a partial Sau3A digest of the Eubacterium DNA. DNA sequence analysis of the baiA3 gene revealed 100% homology with the baiA1 gene within the coding region of the 27,000-dalton polypeptides. The baiA2 gene shares 81% sequence identity with the other two genes at the nucleotide level. The flanking nucleotide sequences associated with the baiA1 and baiA3 genes are identical for 930 bases in the 5' direction from the initiation codon and for at least 325 bases in the 3' direction from the stop codon, including the putative promoter regions for the genes. An additional open reading frame (occupying from 621 to 648 bases, depending on the correct start codon) was found in the identical 5' regions associated with the baiA1 and baiA3 clones. The 5' sequence 930 bases upstream from the baiA1 and baiA3 genes was totally divergent. The baiA2 gene, which is part of a large bile acid-inducible operon, showed no homology with the other two genes either in the 5' or 3' direction from the polypeptide coding region, except for a 15-base-pair presumed ribosome-binding site in the 5' region. These studies strongly suggest that a gene duplication (baiA1 and baiA3) has occurred and is stably maintained in this bacterium. Images PMID:2376563
Two copies of mthmg1, encoding a novel mitochondrial HMG-like protein, delay accumulation of mitochondrial DNA deletions in Podospora anserina.

PubMed

Dequard-Chablat, Michelle; Allandt, Cynthia

2002-08-01

In the filamentous fungus Podospora anserina, two degenerative processes which result in growth arrest are associated with mitochondrial genome (mitochondrial DNA [mtDNA]) instability. Senescence is correlated with mtDNA rearrangements and amplification of specific regions (senDNAs). Premature death syndrome is characterized by the accumulation of specific mtDNA deletions. This accumulation is due to indirect effects of the AS1-4 mutation, which alters a cytosolic ribosomal protein gene. The mthmg1 gene has been identified as a double-copy suppressor of premature death. It greatly delays premature death and the accumulation of deletions when it is present in two copies in an ASI-4 context. The duplication of mthmg1 has no significant effect on the wild-type life span or on senDNA patterns. In anAS1+ context, deletion of the mthmg1 gene alters germination, growth, and fertility and reduces the life span. The deltamthmg1 senescent strains display a particular senDNA pattern. This deletion is lethal in an AS1-4 context. According to its physical properties (very basic protein with putative mitochondrial targeting sequence and HMG-type DNA-binding domains) and the cellular localization of an mtHMG1-green fluorescent protein fusion, mtHMG1 appears to be a mitochondrial protein possibly associated with mtDNA. It is noteworthy that it is the first example of a protein combining the two DNA-binding domains, AT-hook motif and HMG-1 boxes. It may be involved in the stability and/or transmission of the mitochondrial genome. To date, no structural homologues have been found in other organisms. However, mtHMG1 displays functional similarities with the Saccharomyces cerevisiae mitochondrial HMG-box protein Abf2.
The 2-micron plasmid as a nonselectable, stable, high copy number yeast vector

NASA Technical Reports Server (NTRS)

Ludwig, D. L.; Bruschi, C. V.

1991-01-01

The endogenous 2-microns plasmid of Saccharomyces cerevisiae has been used extensively for the construction of yeast cloning and expression plasmids because it is a native yeast plasmid that is able to be maintained stably in cells at high copy number. Almost invariably, these plasmid constructs, containing some or all 2-microns sequences, exhibit copy number levels lower than 2-microns and are maintained stably only under selective conditions. We were interested in determining if there was a means by which 2-microns could be utilized for vector construction, without forfeiting either copy number or nonselective stability. We identified sites in the 2-microns plasmid that could be used for the insertion of genetic sequences without disrupting 2-microns coding elements and then assessed subsequent plasmid constructs for stability and copy number in vivo. We demonstrate the utility of a previously described 2-microns recombination chimera, pBH-2L, for the manipulation and transformation of 2-microns as a pure yeast plasmid vector. We show that the HpaI site near the STB element in the 2-microns plasmid can be utilized to clone yeast DNA of at least 3.9 kb with no loss of plasmid stability. Additionally, the copy number of these constructs is as high as levels reported for the endogenous 2-microns.
Cloning of cDNA of major antigen of foot and mouth disease virus and expression in E. coli

NASA Astrophysics Data System (ADS)

Küpper, Hans; Keller, Walter; Kurz, Christina; Forss, Sonja; Schaller, Heinz

1981-02-01

Double-stranded DNA copies of the single-stranded genomic RNA of foot and mouth disease virus have been cloned into the Escherichia coli plasmid pBR322. A restriction map of the viral genome was established and aligned with the biochemical map of foot and mouth disease virus. The coding sequence for structural protein VP1, the major antigen of the virus, was identified and inserted into a plasmid vector where the expression of this sequence is under control of the phage λ PL promoter. In an appropriate host the synthesis of antigenic polypeptide can be demonstrated by radioimmunoassay.
Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas

PubMed Central

Hou, Yu; Guo, Huahu; Cao, Chen; Li, Xianlong; Hu, Boqiang; Zhu, Ping; Wu, Xinglong; Wen, Lu; Tang, Fuchou; Huang, Yanyi; Peng, Jirun

2016-01-01

Single-cell genome, DNA methylome, and transcriptome sequencing methods have been separately developed. However, to accurately analyze the mechanism by which transcriptome, genome and DNA methylome regulate each other, these omic methods need to be performed in the same single cell. Here we demonstrate a single-cell triple omics sequencing technique, scTrio-seq, that can be used to simultaneously analyze the genomic copy-number variations (CNVs), DNA methylome, and transcriptome of an individual mammalian cell. We show that large-scale CNVs cause proportional changes in RNA expression of genes within the gained or lost genomic regions, whereas these CNVs generally do not affect DNA methylation in these regions. Furthermore, we applied scTrio-seq to 25 single cancer cells derived from a human hepatocellular carcinoma tissue sample. We identified two subpopulations within these cells based on CNVs, DNA methylome, or transcriptome of individual cells. Our work offers a new avenue of dissecting the complex contribution of genomic and epigenomic heterogeneities to the transcriptomic heterogeneity within a population of cells. PMID:26902283
Single-cell template strand sequencing by Strand-seq enables the characterization of individual homologs.

PubMed

Sanders, Ashley D; Falconer, Ester; Hills, Mark; Spierings, Diana C J; Lansdorp, Peter M

2017-06-01

The ability to distinguish between genome sequences of homologous chromosomes in single cells is important for studies of copy-neutral genomic rearrangements (such as inversions and translocations), building chromosome-length haplotypes, refining genome assemblies, mapping sister chromatid exchange events and exploring cellular heterogeneity. Strand-seq is a single-cell sequencing technology that resolves the individual homologs within a cell by restricting sequence analysis to the DNA template strands used during DNA replication. This protocol, which takes up to 4 d to complete, relies on the directionality of DNA, in which each single strand of a DNA molecule is distinguished based on its 5'-3' orientation. Culturing cells in a thymidine analog for one round of cell division labels nascent DNA strands, allowing for their selective removal during genomic library construction. To preserve directionality of template strands, genomic preamplification is bypassed and labeled nascent strands are nicked and not amplified during library preparation. Each single-cell library is multiplexed for pooling and sequencing, and the resulting sequence data are aligned, mapping to either the minus or plus strand of the reference genome, to assign template strand states for each chromosome in the cell. The major adaptations to conventional single-cell sequencing protocols include harvesting of daughter cells after a single round of BrdU incorporation, bypassing of whole-genome amplification, and removal of the BrdU + strand during Strand-seq library preparation. By sequencing just template strands, the structure and identity of each homolog are preserved.
DNA extraction protocols cause differences in 16S rRNA amplicon sequencing efficiency but not in community profile composition or structure

DOE PAGES

None

2014-12-01

The recent development of methods applying next-generation sequencing to microbial community characterization has led to the proliferation of these studies in a wide variety of sample types. Yet, variation in the physical properties of environmental samples demands that optimal DNA extraction techniques be explored for each new environment. The microbiota associated with many species of insects offer an extraction challenge as they are frequently surrounded by an armored exoskeleton, inhibiting disruption of the tissues within. In this study, we examine the efficacy of several commonly used protocols for extracting bacterial DNA from ants. While bacterial community composition recovered using Illuminamore » 16S rRNA amplicon sequencing was not detectably biased by any method, the quantity of bacterial DNA varied drastically, reducing the number of samples that could be amplified and sequenced. These results indicate that the concentration necessary for dependable sequencing is around 10,000 copies of target DNA per microliter. Exoskeletal pulverization and tissue digestion increased the reliability of extractions, suggesting that these steps should be included in any study of insect-associated microorganisms that relies on obtaining microbial DNA from intact body segments. Although laboratory and analysis techniques should be standardized across diverse sample types as much as possible, minimal modifications such as these will increase the number of environments in which bacterial communities can be successfully studied.« less

Understanding the mechanisms of protein-DNA interactions

NASA Astrophysics Data System (ADS)

Lavery, Richard

2004-03-01

Structural, biochemical and thermodynamic data on protein-DNA interactions show that specific recognition cannot be reduced to a simple set of binary interactions between the partners (such as hydrogen bonds, ion pairs or steric contacts). The mechanical properties of the partners also play a role and, in the case of DNA, variations in both conformation and flexibility as a function of base sequence can be a significant factor in guiding a protein to the correct binding site. All-atom molecular modeling offers a means of analyzing the role of different binding mechanisms within protein-DNA complexes of known structure. This however requires estimating the binding strengths for the full range of sequences with which a given protein can interact. Since this number grows exponentially with the length of the binding site it is necessary to find a method to accelerate the calculations. We have achieved this by using a multi-copy approach (ADAPT) which allows us to build a DNA fragment with a variable base sequence. The results obtained with this method correlate well with experimental consensus binding sequences. They enable us to show that indirect recognition mechanisms involving the sequence dependent properties of DNA play a significant role in many complexes. This approach also offers a means of predicting protein binding sites on the basis of binding energies, which is complementary to conventional lexical techniques.
Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence

PubMed Central

2011-01-01

Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS) of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii-the diploid source of the wheat D genome, and with a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 was sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. To assess the false positive SNP discovery rate, DNA containing putative SNPs was amplified by PCR from AL8/78 and AS75 and resequenced with the ABI 3730 xl. In a sample of 302 randomly selected putative SNPs, 84.0% in gene regions, 88.0% in repeat junctions, and 81.3% in uncharacterized regions were validated. Conclusion An annotation-based genome-wide SNP discovery pipeline for NGS platforms was developed. The pipeline is suitable for SNP discovery in genomic libraries of complex genomes and does not require a reference genome sequence. The pipeline is applicable to all current NGS platforms, provided that at least one such platform generates relatively long reads. The pipeline package, AGSNP, and the discovered 497,118 Ae. tauschii SNPs can be accessed at (http://avena.pw.usda.gov/wheatD/agsnp.shtml). PMID:21266061
Heterogeneity of three molecular data partition phylogenies of mints related to M. x piperita (Mentha; Lamiaceae).

PubMed

Gobert, V; Moja, S; Taberlet, P; Wink, M

2006-07-01

Phylogenetic reconstructions with molecular tools are now widely used, thanks to advances in PCR and sequencing technologies. The choice of the molecular target still remains a problem because too few comparative data are available. This is particularly true for hybrid taxa, where differential introgression of genome parts leads to incongruity between data sets. We have studied the potential of three data partitions to reconstruct the phylogeny of mints related to M. x piperita. These included nuclear DNA (ITS), chloroplast DNA (non-coding regions trnL intron, intergenic spacers trnL-trnF, and psbA-trnH), and AFLP and ISSR, markers. The taxonomic sampling was composed of hybrids, diploid and polyploid genomes. Since the genealogy of cultivated mint hybrids is known, they represent a model group to compare the usefulness of various molecular markers for phylogeny inference. Incongruities between ITS, chloroplast DNA, and AFLP-ISSR phylogenetic trees were recorded, although DNA fingerprinting data were congruent with morphological classification. Evidence of chloroplast capture events was obtained for M. x piperita. Direct sequencing of ITS led to biased results because of the existence of pseudogenes. Sequencing of cloned ITS further failed to provide evidence of the existence of the two parental copy types for M. x piperita, a sterile hybrid that has had no opportunity for concerted evolution of ITS copies. AFLP-ISSR data clustered M. x piperita with the parent that had the largest genome. This study sheds light on differential of introgression of different genome regions in mint hybrids.
Mitochondrial DNA copy number is regulated in a tissue specific manner by DNA methylation of the nuclear-encoded DNA polymerase gamma A

PubMed Central

Kelly, Richard D. W.; Mahmud, Arsalan; McKenzie, Matthew; Trounce, Ian A.; St John, Justin C.

2012-01-01

DNA methylation is an essential mechanism controlling gene expression during differentiation and development. We investigated the epigenetic regulation of the nuclear-encoded, mitochondrial DNA (mtDNA) polymerase γ catalytic subunit (PolgA) by examining the methylation status of a CpG island within exon 2 of PolgA. Bisulphite sequencing identified low methylation levels (<10%) within exon 2 of mouse oocytes, blastocysts and embryonic stem cells (ESCs), while somatic tissues contained significantly higher levels (>40%). In contrast, induced pluripotent stem (iPS) cells and somatic nuclear transfer ESCs were hypermethylated (>20%), indicating abnormal epigenetic reprogramming. Real time PCR analysis of 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC) immunoprecipitated DNA suggests active DNA methylation and demethylation within exon 2 of PolgA. Moreover, neural differentiation of ESCs promoted de novo methylation and demethylation at the exon 2 locus. Regression analysis demonstrates that cell-specific PolgA expression levels were negatively correlated with DNA methylation within exon 2 and mtDNA copy number. Finally, using chromatin immunoprecipitation (ChIP) against RNA polymerase II (RNApII) phosphorylated on serine 2, we show increased DNA methylation levels are associated with reduced RNApII transcriptional elongation. This is the first study linking nuclear DNA epigenetic regulation with mtDNA regulation during differentiation and cell specialization. PMID:22941637
Comparison of American Fisheries Society (AFS) standard fish sampling techniques and environmental DNA for characterizing fish communities in a large reservoir

USGS Publications Warehouse

Perez, Christina R.; Bonar, Scott A.; Amberg, Jon J.; Ladell, Bridget; Rees, Christopher B.; Stewart, William T.; Gill, Curtis J.; Cantrell, Chris; Robinson, Anthony

2017-01-01

Recently, methods involving examination of environmental DNA (eDNA) have shown promise for characterizing fish species presence and distribution in waterbodies. We evaluated the use of eDNA for standard fish monitoring surveys in a large reservoir. Specifically, we compared the presence, relative abundance, biomass, and relative percent composition of Largemouth Bass Micropterus salmoides and Gizzard Shad Dorosoma cepedianum measured through eDNA methods and established American Fisheries Society standard sampling methods for Theodore Roosevelt Lake, Arizona. Catches at electrofishing and gillnetting sites were compared with eDNA water samples at sites, within spatial strata, and over the entire reservoir. Gizzard Shad were detected at a higher percentage of sites with eDNA methods than with boat electrofishing in both spring and fall. In contrast, spring and fall gillnetting detected Gizzard Shad at more sites than eDNA. Boat electrofishing and gillnetting detected Largemouth Bass at more sites than eDNA; the exception was fall gillnetting, for which the number of sites of Largemouth Bass detection was equal to that for eDNA. We observed no relationship between relative abundance and biomass of Largemouth Bass and Gizzard Shad measured by established methods and eDNA copies at individual sites or lake sections. Reservoirwide catch composition for Largemouth Bass and Gizzard Shad (numbers and total weight [g] of fish) as determined through a combination of gear types (boat electrofishing plus gillnetting) was similar to the proportion of total eDNA copies from each species in spring and fall field sampling. However, no similarity existed between proportions of fish caught via spring and fall boat electrofishing and the proportion of total eDNA copies from each species. Our study suggests that eDNA field sampling protocols, filtration, DNA extraction, primer design, and DNA sequencing methods need further refinement and testing before incorporation into standard fish sampling surveys.
Accurate RNA consensus sequencing for high-fidelity detection of transcriptional mutagenesis-induced epimutations.

PubMed

Reid-Bayliss, Kate S; Loeb, Lawrence A

2017-08-29

Transcriptional mutagenesis (TM) due to misincorporation during RNA transcription can result in mutant RNAs, or epimutations, that generate proteins with altered properties. TM has long been hypothesized to play a role in aging, cancer, and viral and bacterial evolution. However, inadequate methodologies have limited progress in elucidating a causal association. We present a high-throughput, highly accurate RNA sequencing method to measure epimutations with single-molecule sensitivity. Accurate RNA consensus sequencing (ARC-seq) uniquely combines RNA barcoding and generation of multiple cDNA copies per RNA molecule to eliminate errors introduced during cDNA synthesis, PCR, and sequencing. The stringency of ARC-seq can be scaled to accommodate the quality of input RNAs. We apply ARC-seq to directly assess transcriptome-wide epimutations resulting from RNA polymerase mutants and oxidative stress.
Concerted copy number variation balances ribosomal DNA dosage in human and mouse genomes

PubMed Central

Gibbons, John G.; Branco, Alan T.; Godinho, Susana A.; Yu, Shoukai; Lemos, Bernardo

2015-01-01

Tandemly repeated ribosomal DNA (rDNA) arrays are among the most evolutionary dynamic loci of eukaryotic genomes. The loci code for essential cellular components, yet exhibit extensive copy number (CN) variation within and between species. CN might be partly determined by the requirement of dosage balance between the 5S and 45S rDNA arrays. The arrays are nonhomologous, physically unlinked in mammals, and encode functionally interdependent RNA components of the ribosome. Here we show that the 5S and 45S rDNA arrays exhibit concerted CN variation (cCNV). Despite 5S and 45S rDNA elements residing on different chromosomes and lacking sequence similarity, cCNV between these loci is strong, evolutionarily conserved in humans and mice, and manifested across individual genotypes in natural populations and pedigrees. Finally, we observe that bisphenol A induces rapid and parallel modulation of 5S and 45S rDNA CN. Our observations reveal a novel mode of genome variation, indicate that natural selection contributed to the evolution and conservation of cCNV, and support the hypothesis that 5S CN is partly determined by the requirement of dosage balance with the 45S rDNA array. We suggest that human disease variation might be traced to disrupted rDNA dosage balance in the genome. PMID:25583482
rDNA Copy Number Variants Are Frequent Passenger Mutations in Saccharomyces cerevisiae Deletion Collections and de Novo Transformants

PubMed Central

Kwan, Elizabeth X.; Wang, Xiaobin S.; Amemiya, Haley M.; Brewer, Bonita J.; Raghuraman, M. K.

2016-01-01

The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. PMID:27449518
rDNA Copy Number Variants Are Frequent Passenger Mutations in Saccharomyces cerevisiae Deletion Collections and de Novo Transformants.

PubMed

Kwan, Elizabeth X; Wang, Xiaobin S; Amemiya, Haley M; Brewer, Bonita J; Raghuraman, M K

2016-09-08

The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. Copyright © 2016 Kwan et al.
Age-related decline in mitochondrial DNA copy number in isolated human pancreatic islets.

PubMed

Cree, L M; Patel, S K; Pyle, A; Lynn, S; Turnbull, D M; Chinnery, P F; Walker, M

2008-08-01

Pancreatic beta cell function has been shown to decline with age in man. Depletion of mitochondrial DNA (mtDNA) copy number is associated with impaired insulin secretion in pancreatic beta cell lines, and decreased mtDNA copy number has been observed with age in skeletal muscle in man. We investigated whether mtDNA copy number decreases with age in human pancreatic beta cells, which might in turn contribute to the age-related decline in insulin secretory capacity. We quantified mtDNA copy number in isolated human islet preparations from 15 pancreas donors aged between 17 and 75 years. Islets (n = 20) were individually hand-picked and pooled from each donor isolate for the quantification of mtDNA copy number and deleted mtDNA (%), which were determined using real-time PCR methods. There was a significant negative correlation between mtDNA copy number and islet donor age (r = -0.53, p = 0.044). mtDNA copy number was significantly decreased in islet preparations from donors aged > or =50 years (n = 8) compared with those aged <50 years (n = 7) (median [interquartile range]: 418 [236-503] vs 596 [554-729] mtDNA copy number/diploid genome; p = 0.032). None of the islet preparations harboured high levels of deleted mtDNA affecting the major arc. Given the correlation between mtDNA content and respiratory chain activity, the age-related decrease in mtDNA copy number that we observed in human pancreatic islet preparations may contribute to the age-dependent decline in pancreatic beta cell insulin secretory capacity.
Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA.

PubMed

Kane, Nolan; Sveinsson, Saemundur; Dempewolf, Hannes; Yang, Ji Yong; Zhang, Dapeng; Engels, Johannes M M; Cronk, Quentin

2012-02-01

To reliably identify lineages below the species level such as subspecies or varieties, we propose an extension to DNA-barcoding using next-generation sequencing to produce whole organellar genomes and substantial nuclear ribosomal sequence. Because this method uses much longer versions of the traditional DNA-barcoding loci in the plastid and ribosomal DNA, we call our approach ultra-barcoding (UBC). We used high-throughput next-generation sequencing to scan the genome and generate reliable sequence of high copy number regions. Using this method, we examined whole plastid genomes as well as nearly 6000 bases of nuclear ribosomal DNA sequences for nine genotypes of Theobroma cacao and an individual of the related species T. grandiflorum, as well as an additional publicly available whole plastid genome of T. cacao. All individuals of T. cacao examined were uniquely distinguished, and evidence of reticulation and gene flow was observed. Sequence variation was observed in some of the canonical barcoding regions between species, but other regions of the chloroplast were more variable both within species and between species, as were ribosomal spacers. Furthermore, no single region provides the level of data available using the complete plastid genome and rDNA. Our data demonstrate that UBC is a viable, increasingly cost-effective approach for reliably distinguishing varieties and even individual genotypes of T. cacao. This approach shows great promise for applications where very closely related or interbreeding taxa must be distinguished.
Evidence for two transferrin loci in the Salmo trutta genome.

PubMed

Rozman, T; Dovc, P; Marić, S; Kokalj-Vokac, N; Erjavec-Skerget, A; Rab, P; Snoj, A

2008-12-01

To determine the organization of transferrin (TF) locus in the Salmo trutta genome, partial DNA and cDNA sequencing, fluorescent in situ hybridization (FISH) and Salmo salar BAC analysis were performed. TF expression levels and copy number prediction were assessed using real-time PCR. In addition to two previously reported DNA TF variant sequences of S. trutta and Salmo marmoratus (TF1), two novel variant sequences (TF2) were revealed in both species. Variant-specific sequence tags, characterizing two variants for each TF type (TF1 and TF2), were identified in genomic clones from each of the F1 hybrids between S. trutta and S. marmoratus. These clearly documented double heterozygote status at the TF loci. The real-time PCR data showed that each of the two TF types (TF1 and TF2) existed in one copy only and that the transcription of TF2 was considerably lower compared with TF1. Using FISH, hybridization signals were observed on two medium-sized acrocentric chromosomes of S. trutta karyotype. A TF type-specific PCR followed by a restriction analysis revealed the presence of two TF loci in the majority of analysed BAC clones. It was concluded that the TF gene is duplicated in the genome of S. trutta, and that the two TF loci are located adjacent to one another on the same chromosome. The differing transcription levels of TF1 and TF2 appear to depend on the corresponding promoter activity, which at least for TF2 seems to vary between different Salmo congeners.
cWINNOWER algorithm for finding fuzzy dna motifs

NASA Technical Reports Server (NTRS)

Liang, S.; Samanta, M. P.; Biegel, B. A.

2004-01-01

The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if a clique consisting of a sufficiently large number of mutated copies of the motif (i.e., the signals) is present in the DNA sequence. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum detectable clique size qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12,000 for (l, d) = (15, 4). Copyright Imperial College Press.
cWINNOWER Algorithm for Finding Fuzzy DNA Motifs

NASA Technical Reports Server (NTRS)

Liang, Shoudan

2003-01-01

The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if multiple mutated copies of the motif (i.e., the signals) are present in the DNA sequence in sufficient abundance. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum number of detectable motifs qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc, by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12000 for (l,d) = (15,4).
Tagmentation on Microbeads: Restore Long-Range DNA Sequence Information Using Next Generation Sequencing with Library Prepared by Surface-Immobilized Transposomes.

PubMed

Chen, He; Yao, Jiacheng; Fu, Yusi; Pang, Yuhong; Wang, Jianbin; Huang, Yanyi

2018-04-11

The next generation sequencing (NGS) technologies have been rapidly evolved and applied to various research fields, but they often suffer from losing long-range information due to short library size and read length. Here, we develop a simple, cost-efficient, and versatile NGS library preparation method, called tagmentation on microbeads (TOM). This method is capable of recovering long-range information through tagmentation mediated by microbead-immobilized transposomes. Using transposomes with DNA barcodes to identically label adjacent sequences during tagmentation, we can restore inter-read connection of each fragment from original DNA molecule by fragment-barcode linkage after sequencing. In our proof-of-principle experiment, more than 4.5% of the reads are linked with their adjacent reads, and the longest linkage is over 1112 bp. We demonstrate TOM with eight barcodes, but the number of barcodes can be scaled up by an ultrahigh complexity construction. We also show this method has low amplification bias and effectively fits the applications to identify copy number variations.
Conformation of Tax-response elements in the human T-cell leukemia virus type I promoter.

PubMed

Cox, J M; Sloan, L S; Schepartz, A

1995-12-01

HTLV-I Tax is believed to activate viral gene expression by binding bZIP proteins (such as CREB) and increasing their affinities for proviral TRE target sites. Each 21 bp TRE target site contains an imperfect copy of the intrinsically bent CRE target site (the TRE core) surrounded by highly conserved flanking sequences. These flanking sequences are essential for maximal increases in DNA affinity and transactivation, but they are not, apparently, contacted by protein. Here we employ non-denaturing gel electrophoresis to evaluate TRE conformation in the presence and absence of bZIP proteins, and to explore the role of DNA conformation in viral transactivation. Our results show that the TRE-1 flanking sequences modulate the structure and modestly increase the affinity of a CREB bZIP peptide for the TRE-1 core recognition sequence. These flanking sequences are also essential for a maximal increase in stability of the CREB-DNA complex in the presence of Tax. The CRE-like TRE core and the TRE flanking sequences are both essential for formation of stable CREB-TRE-1 and Tax-CREB-TRE-1 complexes. These two DNA segments may have co-evolved into a unique structure capable of recognizing Tax and a bZIP protein.
A family of long intergenic non-coding RNA genes in human chromosomal region 22q11.2 carry a DNA translocation breakpoint/AT-rich sequence

PubMed Central

2018-01-01

FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22)s, in or close to the 22q.11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722
Escaping introns in COI through cDNA barcoding of mushrooms: Pleurotus as a test case.

PubMed

Avin, Farhat A; Subha, Bhassu; Tan, Yee-Shin; Braukmann, Thomas W A; Vikineswary, Sabaratnam; Hebert, Paul D N

2017-09-01

DNA barcoding involves the use of one or more short, standardized DNA fragments for the rapid identification of species. A 648-bp segment near the 5' terminus of the mitochondrial cytochrome c oxidase subunit I (COI) gene has been adopted as the universal DNA barcode for members of the animal kingdom, but its utility in mushrooms is complicated by the frequent occurrence of large introns. As a consequence, ITS has been adopted as the standard DNA barcode marker for mushrooms despite several shortcomings. This study employed newly designed primers coupled with cDNA analysis to examine COI sequence diversity in six species of Pleurotus and compared these results with those for ITS. The ability of the COI gene to discriminate six species of Pleurotus , the commonly cultivated oyster mushroom, was examined by analysis of cDNA. The amplification success, sequence variation within and among species, and the ability to design effective primers was tested. We compared ITS sequences to their COI cDNA counterparts for all isolates. ITS discriminated between all six species, but some sequence results were uninterpretable, because of length variation among ITS copies. By comparison, a complete COI sequences were recovered from all but three individuals of Pleurotus giganteus where only the 5' region was obtained. The COI sequences permitted the resolution of all species when partial data was excluded for P. giganteus . Our results suggest that COI can be a useful barcode marker for mushrooms when cDNA analysis is adopted, permitting identifications in cases where ITS cannot be recovered or where it offers higher resolution when fresh tissue is. The suitability of this approach remains to be confirmed for other mushrooms.
Sensitive and Specific Target Sequences Selected from Retrotransposons of Schistosoma japonicum for the Diagnosis of Schistosomiasis

PubMed Central

Xu, Jing; Zhu, Xing-Quan; Wang, Sheng-Yue; Xia, Chao-Ming

2012-01-01

Background Schistosomiasis japonica is a serious debilitating and sometimes fatal disease. Accurate diagnostic tests play a key role in patient management and control of the disease. However, currently available diagnostic methods are not ideal, and the detection of the parasite DNA in blood samples has turned out to be one of the most promising tools for the diagnosis of schistosomiasis. In our previous investigations, a 230-bp sequence from the highly repetitive retrotransposon SjR2 was identified and it showed high sensitivity and specificity for detecting Schistosoma japonicum DNA in the sera of rabbit model and patients. Recently, 29 retrotransposons were found in S. japonicum genome by our group. The present study highlighted the key factors for selecting a new perspective sensitive target DNA sequence for the diagnosis of schistosomiasis, which can serve as example for other parasitic pathogens. Methodology/Principal Findings In this study, we demonstrated that the key factors based on the bioinformatic analysis for selecting target sequence are the higher genome proportion, repetitive complete copies and partial copies, and active ESTs than the others in the chromosome genome. New primers based on 25 novel retrotransposons and SjR2 were designed and their sensitivity and specificity for detecting S. japonicum DNA were compared. The results showed that a new 303-bp sequence from non-long terminal repeat (LTR) retrotransposon (SjCHGCS19) had high sensitivity and specificity. The 303-bp target sequence was amplified from the sera of rabbit model at 3 d post-infection by nested-PCR and it became negative at 17 weeks post-treatment. Furthermore, the percentage sensitivity of the nested-PCR was 97.67% in 43 serum samples of S. japonicum-infected patients. Conclusions/Significance Our findings highlighted the key factors based on the bioinformatic analysis for selecting target sequence from S. japonicum genome, which provide basis for establishing powerful molecular diagnostic techniques that can be used for monitoring early infection and therapy efficacy to support schistosomiasis control programs. PMID:22479661
Structural Rearrangements in DNA Repair Genes in Breast Cancer

DTIC Science & Technology

2013-10-01

number was measured with the CNV assay from Q biomarkers using a stable region on Chr17 as a control. A line highlights the normal 2 copies. Black...Tanner M, Stokke T, Chen L, Smith HS, Pinkel D, Gray JW, Waldman FM. Detection and mapping of amplified DNA sequences in breast cancer by comparative...1850703 3. Isola JJ, Kallioniemi OP, Chu LW, Fuqua SA, Hilsenbeck SG, Osborne CK, Waldman FM. Genetic aberrations detected by comparative genomic

Respiratory chain complex III deficiency in patients with tRNA-leu mutation.

PubMed

Jiang, J; Wang, X L; Ma, Y Y

2015-12-29

The aim of this study was to investigate the clinical and genetic profiles of mitochondrial disease resulting from deficiencies in the respiratory chain complex III. Three patients, aged between 8 months and 12 years, were recruited for this study. The activities of mitochondrial respiratory chain complexes in the peripheral leucocytes were spectrophotometrically measured. The entire mitochondrial DNA (mtDNA) sequence was analyzed. Samples obtained from the three patients and their families were subjected to restriction fragment length polymorphism and gene sequencing analyses. mtDNA copy numbers of all patients and their mothers were analyzed. The patients displayed nervous system impairment, including motor and mental developmental delay, hypotonia, and motor regression. Two patients also suffered from Leigh syndrome. Assay of the mitochondrial respiratory chain enzymes revealed an isolated complex III deficiency in the three patients. The m.3243 A>G mutation was detected in all patients and their mothers. The mutation loads were 48.3, 57.2, and 45.5% in the patients, and 20.5, 16.4, and 23.6% in their respective mothers. The leukocyte mtDNA copy numbers of the patients and their mothers were within the control range. The clinical manifestation and genetics were observed to be very heterogeneous. Patient carrying an m.3243 A>G mutation may biochemically display a deficiency in the mitochondrial respiratory chain complex III.
Strategy for Sensitive and Specific Detection of Yersinia pestis in Skeletons of the Black Death Pandemic

PubMed Central

Seifert, Lisa; Harbeck, Michaela; Thomas, Astrid; Hoke, Nadja; Zöller, Lothar; Wiechmann, Ingrid; Grupe, Gisela; Scholz, Holger C.; Riehm, Julia M.

2013-01-01

Yersinia pestis has been identified as the causative agent of the Black Death pandemic in the 14th century. However, retrospective diagnostics in human skeletons after more than 600 years are critical. We describe a strategy following a modern diagnostic algorithm and working under strict ancient DNA regime for the identification of medieval human plague victims. An initial screening and DNA quantification assay detected the Y. pestis specific pla gene of the high copy number plasmid pPCP1. Results were confirmed by conventional PCR and sequence analysis targeting both Y. pestis specific virulence plasmids pPCP1 and pMT1. All assays were meticulously validated according to human clinical diagnostics requirements (ISO 15189) regarding efficiency, sensitivity, specificity, and limit of detection (LOD). Assay specificity was 100% tested on 41 clinically relevant bacteria and 29 Y. pseudotuberculosis strains as well as for DNA of 22 Y. pestis strains and 30 previously confirmed clinical human plague samples. The optimized LOD was down to 4 gene copies. 29 individuals from three different multiple inhumations were initially assessed as possible victims of the Black Death pandemic. 7 samples (24%) were positive in the pPCP1 specific screening assay. Confirmation through second target pMT1 specific PCR was successful for 4 of the positive individuals (14%). A maximum of 700 and 560 copies per µl aDNA were quantified in two of the samples. Those were positive in all assays including all repetitions, and are candidates for future continuative investigations such as whole genome sequencing. We discuss that all precautions taken here for the work with aDNA are sufficient to prevent external sample contamination and fulfill the criteria of authenticity. With regard to retrospective diagnostics of a human pathogen and the uniqueness of ancient material we strongly recommend using a careful strategy and validated assays as presented in our study. PMID:24069445
Strategy for sensitive and specific detection of Yersinia pestis in skeletons of the black death pandemic.

PubMed

Seifert, Lisa; Harbeck, Michaela; Thomas, Astrid; Hoke, Nadja; Zöller, Lothar; Wiechmann, Ingrid; Grupe, Gisela; Scholz, Holger C; Riehm, Julia M

2013-01-01

Yersinia pestis has been identified as the causative agent of the Black Death pandemic in the 14(th) century. However, retrospective diagnostics in human skeletons after more than 600 years are critical. We describe a strategy following a modern diagnostic algorithm and working under strict ancient DNA regime for the identification of medieval human plague victims. An initial screening and DNA quantification assay detected the Y. pestis specific pla gene of the high copy number plasmid pPCP1. Results were confirmed by conventional PCR and sequence analysis targeting both Y. pestis specific virulence plasmids pPCP1 and pMT1. All assays were meticulously validated according to human clinical diagnostics requirements (ISO 15189) regarding efficiency, sensitivity, specificity, and limit of detection (LOD). Assay specificity was 100% tested on 41 clinically relevant bacteria and 29 Y. pseudotuberculosis strains as well as for DNA of 22 Y. pestis strains and 30 previously confirmed clinical human plague samples. The optimized LOD was down to 4 gene copies. 29 individuals from three different multiple inhumations were initially assessed as possible victims of the Black Death pandemic. 7 samples (24%) were positive in the pPCP1 specific screening assay. Confirmation through second target pMT1 specific PCR was successful for 4 of the positive individuals (14%). A maximum of 700 and 560 copies per µl aDNA were quantified in two of the samples. Those were positive in all assays including all repetitions, and are candidates for future continuative investigations such as whole genome sequencing. We discuss that all precautions taken here for the work with aDNA are sufficient to prevent external sample contamination and fulfill the criteria of authenticity. With regard to retrospective diagnostics of a human pathogen and the uniqueness of ancient material we strongly recommend using a careful strategy and validated assays as presented in our study.
Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prody, C.A.; Zevin-Sonkin, D.; Gnatt, A.

1987-06-01

To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase and Torpedo electric organ true acetylcholinesterase. Using these probes, the authors isolated several cDNA clones from lambdagt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. Inmore » RNA blots of poly(A)/sup +/ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These finding demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species.« less
Determination of the melon chloroplast and mitochondrial genome sequences reveals that the largest reported mitochondrial genome in plants contains a significant amount of DNA having a nuclear origin

PubMed Central

2011-01-01

Background The melon belongs to the Cucurbitaceae family, whose economic importance among vegetable crops is second only to Solanaceae. The melon has a small genome size (454 Mb), which makes it suitable for molecular and genetic studies. Despite similar nuclear and chloroplast genome sizes, cucurbits show great variation when their mitochondrial genomes are compared. The melon possesses the largest plant mitochondrial genome, as much as eight times larger than that of other cucurbits. Results The nucleotide sequences of the melon chloroplast and mitochondrial genomes were determined. The chloroplast genome (156,017 bp) included 132 genes, with 98 single-copy genes dispersed between the small (SSC) and large (LSC) single-copy regions and 17 duplicated genes in the inverted repeat regions (IRa and IRb). A comparison of the cucumber and melon chloroplast genomes showed differences in only approximately 5% of nucleotides, mainly due to short indels and SNPs. Additionally, 2.74 Mb of mitochondrial sequence, accounting for 95% of the estimated mitochondrial genome size, were assembled into five scaffolds and four additional unscaffolded contigs. An 84% of the mitochondrial genome is contained in a single scaffold. The gene-coding region accounted for 1.7% (45,926 bp) of the total sequence, including 51 protein-coding genes, 4 conserved ORFs, 3 rRNA genes and 24 tRNA genes. Despite the differences observed in the mitochondrial genome sizes of cucurbit species, Citrullus lanatus (379 kb), Cucurbita pepo (983 kb) and Cucumis melo (2,740 kb) share 120 kb of sequence, including the predicted protein-coding regions. Nevertheless, melon contained a high number of repetitive sequences and a high content of DNA of nuclear origin, which represented 42% and 47% of the total sequence, respectively. Conclusions Whereas the size and gene organisation of chloroplast genomes are similar among the cucurbit species, mitochondrial genomes show a wide variety of sizes, with a non-conserved structure both in gene number and organisation, as well as in the features of the noncoding DNA. The transfer of nuclear DNA to the melon mitochondrial genome and the high proportion of repetitive DNA appear to explain the size of the largest mitochondrial genome reported so far. PMID:21854637
Exome-wide Sequencing Shows Low Mutation Rates and Identifies Novel Mutated Genes in Seminomas.

PubMed

Cutcutache, Ioana; Suzuki, Yuka; Tan, Iain Beehuat; Ramgopal, Subhashini; Zhang, Shenli; Ramnarayanan, Kalpana; Gan, Anna; Lee, Heng Hong; Tay, Su Ting; Ooi, Aikseng; Ong, Choon Kiat; Bolthouse, Jonathan T; Lane, Brian R; Anema, John G; Kahnoski, Richard J; Tan, Patrick; Teh, Bin Tean; Rozen, Steven G

2015-07-01

Testicular germ cell tumors are the most common cancer diagnosed in young men, and seminomas are the most common type of these cancers. There have been no exome-wide examinations of genes mutated in seminomas or of overall rates of nonsilent somatic mutations in these tumors. The objective was to analyze somatic mutations in seminomas to determine which genes are affected and to determine rates of nonsilent mutations. Eight seminomas and matched normal samples were surgically obtained from eight patients. DNA was extracted from tissue samples and exome sequenced on massively parallel Illumina DNA sequencers. Single-nucleotide polymorphism chip-based copy number analysis was also performed to assess copy number alterations. The DNA sequencing read data were analyzed to detect somatic mutations including single-nucleotide substitutions and short insertions and deletions. The detected mutations were validated by independent sequencing and further checked for subclonality. The rate of nonsynonymous somatic mutations averaged 0.31 mutations/Mb. We detected nonsilent somatic mutations in 96 genes that were not previously known to be mutated in seminomas, of which some may be driver mutations. Many of the mutations appear to have been present in subclonal populations. In addition, two genes, KIT and KRAS, were affected in two tumors each with mutations that were previously observed in other cancers and are presumably oncogenic. Our study, the first report on exome sequencing of seminomas, detected somatic mutations in 96 new genes, several of which may be targetable drivers. Furthermore, our results show that seminoma mutation rates are five times higher than previously thought, but are nevertheless low compared to other common cancers. Similar low rates are seen in other cancers that also have excellent rates of remission achieved with chemotherapy. We examined the DNA sequences of seminomas, the most common type of testicular germ cell cancer. Our study identified 96 new genes in which mutations occurred during seminoma development, some of which might contribute to cancer development or progression. The study also showed that the rates of DNA mutations during seminoma development are higher than previously thought, but still lower than for other common solid-organ cancers. Such low rates are also observed among other cancers that, like seminomas, show excellent rates of disease remission after chemotherapy. Copyright © 2015 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Alu repeated DNAs are differentially methylated in primate germ cells.

PubMed Central

Rubin, C M; VandeVoort, C A; Teplitz, R L; Schmid, C W

1994-01-01

A significant fraction of Alu repeats in human sperm DNA, previously found to be unmethylated, is nearly completely methylated in DNA from many somatic tissues. A similar fraction of unmethylated Alus is observed here in sperm DNA from rhesus monkey. However, Alus are almost completely methylated at the restriction sites tested in monkey follicular oocyte DNA. The Alu methylation patterns in mature male and female monkey germ cells are consistent with Alu methylation in human germ cell tumors. Alu sequences are hypomethylated in seminoma DNAs and more methylated in a human ovarian dysgerminoma. These results contrast with methylation patterns reported for germ cell single-copy, CpG island, satellite, and L1 sequences. The function of Alu repeats is not known, but differential methylation of Alu repeats in the male and female germ lines suggests that they may serve as markers for genomic imprinting or in maintaining differences in male and female meiosis. Images PMID:7800508
Quantitative detection method of Enterocytozoon hepatopenaei using TaqMan probe real-time PCR.

PubMed

Liu, Ya-Mei; Qiu, Liang; Sheng, An-Zhi; Wan, Xiao-Yuan; Cheng, Dong-Yuan; Huang, Jie

2018-01-01

A TaqMan probe and a pair of specific primers were selected from the small subunit ribosomal DNA (SSU rDNA) sequence of Enterocytozoon hepatopenaei (EHP); this real-time PCR assay was developed and optimized. It showed a good linearity in detecting standards of EHP SSU rDNA fragments from 4 × 10 2 to 4 × 10 8 copies/reaction using the established method. The detection limit of the qPCR method was as low as 4 × 10 1 copies per reaction, which was higher than the conventional PCR and SYBR Green I-based EHP qPCR reported. Using the qPCR assay, EHP was detected in four batches of slow-growing Penaeus vannamei specimens collected from Tianjin and Zhejiang Province in China was detected using qPCR. The results showed that all the hepatopancreas from the slow-growing P. vannamei specimens were detected as EHP-positive. EHP copies of hepatopancreas in some batches had a negative correlation with the body mass index (BMI) of shrimps; however, not all batches of specimens had this negative correlation between EHP copies of hepatopancreas and BMI. This qPCR technique is sensitive, specific and easy to perform (96 tests in <3 h), which provides technical support for the detection and prevention of EHP. Copyright © 2017 Elsevier Inc. All rights reserved.
HER2 copy number of circulating tumour DNA functions as a biomarker to predict and monitor trastuzumab efficacy in advanced gastric cancer.

PubMed

Wang, Haixing; Li, Beifang; Liu, Zhentao; Gong, Jifang; Shao, Lin; Ren, Jun; Niu, Yunyun; Bo, Shiping; Li, Zhongwu; Lai, Yumei; Lu, Sijia; Gao, Jing; Shen, Lin

2018-01-01

HER2 status is significant to trastuzumab therapy; however, it is difficult to determine HER2 status accurately with few pieces of biopsies from advanced gastric cancer (AGC) due to highly heterogeneity and invasive behaviour, which will be investigated in this study. Fifty-six patients with AGC were included in this study. Primary tumour tissues and matched plasmas before medication from 36 patients were retrospectively collected, and the other 20 patients with primary tumour tissues and paired plasmas were prospectively collected. HER2 expression and amplification in 56 tumour tissues were determined by immunohistochemistry (IHC) and dual in situ hybridisation (DISH), and HER2 copy number in 135 circulating tumour DNAs (ctDNAs) was judged by next-generation sequencing. For tumour tissues, HER2 amplification by DISH was most commonly found in patients with HER2 score 3+by IHC. For plasmas, HER2 amplification defined as HER2 copy number >2.22 was identified in 26 of 56 patients. There was a high concordance of HER2 amplification between ctDNA and tumour tissues, suggesting that ctDNA could function as an alternative to screen HER2-targeted population. Moreover, the changes of HER2 copy number in ctDNA could efficiently monitor trastuzumab efficacy, the power of which was superior to commonly used markers carcinoembryonic antigen (CEA) and CA199, suggesting its potential role in clinical practice. ctDNA for HER2 analysis was strongly recommended to serve as a surrogate to screen trastuzumab-suitable population and monitor trastuzumab efficacy. Copyright © 2017 Elsevier Ltd. All rights reserved.
Characterization and assessment of an avian repetitive DNA sequence as an icterid phylogenetic marker.

PubMed

Quinn, J S; Guglich, E; Seutin, G; Lau, R; Marsolais, J; Parna, L; Boag, P T; White, B N

1992-02-01

The first tandemly repeated sequence examined in a passerine bird, a 431-bp PstI fragment named pMAT1, has been cloned from the genome of the brown-headed cowbird (Molothrus ater). The sequence represents about 5-10% of the genome (about 4 x 10(5) copies) and yields prominent ethidium bromide stained bands when genomic DNA cut with a variety of restriction enzymes is electrophoresed in agarose gels. A particularly striking ladder of fragments is apparent when the DNA is cut with HinfI, indicative of a tandem arrangement of the monomer. The cloned PstI monomer has been sequenced, revealing no internal repeated structure. There are sequences that hybridize with pMAT1 found in related nine-primaried oscines but not in more distantly related oscines, suboscines, or nonpasserine species. Little sequence similarity to tandemly repeated PstI cut sequences from the merlin (Falco columbarius), saurus crane (Grus antigone), or Puerto Rican parrot (Amazona vittata) or to HinfI digested sequence from the Toulouse goose (Anser anser) was detected. The isolated sequence was used as a probe to examine DNA samples of eight members of the tribe Icterini. This examination revealed phylogenetically informative characters. The repeat contains cutting sites from a number of restriction enzymes, which, if sufficiently polymorphic, would provide new phylogenetic characters. Sequences like these, conserved within a species, but variable between closely related species, may be very useful for phylogenetic studies of closely related taxa.
Detection of Human Papillomavirus Type 2 Related Sequence in Oral Papilloma

PubMed Central

Yamaguchi, Taihei; Shindoh, Masanobu; Amemiya, Akira; Inoue, Nobuo; Kawamura, Masaaki; Sakaoka, Hiroshi; Inoue, Masakazu; Fujinaga, Kei

1998-01-01

Oral papilloma is a benign tumourous lesion. Part of this lesion is associated with human papillomavirus (HPV) infection. We analysed the genetical and histopathological evidence for HPV type 2 infection in three oral papillomas. Southern blot hybridization showed HPV 2a sequence in one lesion. Cells of the positive specimen appeared to contain high copy numbers of the viral DNA in an episomal state. In situ staining demonstrated virus capsid antigen in koilocytotic cells and surrounding cells in the hyperplastic epithelial layer. Two other specimens contained no HPV sequences by labeled probe of full length linear HPVs 2a, 6b, 11, 16, 18, 31 and 33 DNA under low stringency hybridization conditions. These results showed the possibility that HPV 2 plays a role in oral papilloma. PMID:9699941
Biocompatible artificial DNA linker that is read through by DNA polymerases and is functional in Escherichia coli

PubMed Central

El-Sagheer, Afaf H.; Sanzone, A. Pia; Gao, Rachel; Tavassoli, Ali; Brown, Tom

2011-01-01

A triazole mimic of a DNA phosphodiester linkage has been produced by templated chemical ligation of oligonucleotides functionalized with 5′-azide and 3′-alkyne. The individual azide and alkyne oligonucleotides were synthesized by standard phosphoramidite methods and assembled using a straightforward ligation procedure. This highly efficient chemical equivalent of enzymatic DNA ligation has been used to assemble a 300-mer from three 100-mer oligonucleotides, demonstrating the total chemical synthesis of very long oligonucleotides. The base sequences of the DNA strands containing this artificial linkage were copied during PCR with high fidelity and a gene containing the triazole linker was functional in Escherichia coli. PMID:21709264
Future of human mitochondrial DNA editing technologies.

PubMed

Verechshagina, N; Nikitchina, N; Yamada, Y; Harashima, Н; Tanaka, M; Orishchenko, K; Mazunin, I

2018-05-15

ATP and other metabolites, which are necessary for the development, maintenance, and functioning of bodily cells are all synthesized in the mitochondria. Multiple copies of the genome, present within the mitochondria, together with its maternal inheritance, determine the clinical manifestation and spreading of mutations in mitochondrial DNA (mtDNA). The main obstacle in the way of thorough understanding of mitochondrial biology and the development of gene therapy methods for mitochondrial diseases is the absence of systems that allow to directly change mtDNA sequence. Here, we discuss existing methods of manipulating the level of mtDNA heteroplasmy, as well as the latest systems, that could be used in the future as tools for human mitochondrial genome editing.
DR-78, a novel Drosophila melanogaster genomic DNA fragment highly homologous to the DNA-binding domain of thyroid hormone-retinoic acid-vitamin D receptor subfamily.

PubMed

Martín-Blanco, E; Kornberg, T B

1993-11-16

Degenerate oligodeoxyribonucleotides were designed for both ends of the DNA-binding domain of members of the nuclear receptor superfamily. PCR amplified Drosophila melanogaster DNA was purified and cloned (DR plasmids). Genomic lambda DASH clones were identified at high stringency with an amplified DR-78 plasmid DNA and isolated. The partial sequence shows a very probable open reading frame which would encode a peptide highly homologous to members of the thyroid hormone-retinoic acid-vitamin D receptor subfamily. The fragment corresponds to a single copy gene and was mapped at position 78D of chromosome three by in situ hybridization.
Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.

PubMed Central

Wincker, P; Jubier-Maurin, V; Roizès, G

1987-01-01

Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. Images PMID:3684566
Zaba: a novel miniature transposable element present in genomes of legume plants.

PubMed

Macas, J; Neumann, P; Pozárková, D

2003-08-01

A novel family of miniature transposable elements, named Zaba, was identified in pea (Pisum sativum) and subsequently also in other legume species using computer analysis of their DNA sequences. Zaba elements are 141-190 bp long, generate 10-bp target site duplications, and their terminal inverted repeats make up most of the sequence. Zaba elements thus resemble class 3 foldback transposons. The elements are only moderately repetitive in pea (tens to hundreds copies per haploid genome), but they are present in up to thousands of copies in the genomes of several Medicago and Vicia species. More detailed analysis of the elements from pea, including isolation of new sequences from a genomic library, revealed that a fraction of these elements are truncated, and that their last transposition probably did not occur recently. A search for Zaba sequences in EST databases showed that at least some elements are transcribed, most probably due to their association with genic regions.
Complete plastid genome sequence of goosegrass (Eleusine indica) and comparison with other Poaceae.

PubMed

Zhang, Hui; Hall, Nathan; McElroy, J Scott; Lowe, Elijah K; Goertzen, Leslie R

2017-02-05

Eleusine indica, also known as goosegrass, is a serious weed in at least 42 countries. In this paper we report the complete plastid genome sequence of goosegrass obtained by de novo assembly of paired-end and mate-paired reads generated by Illumina sequencing of total genomic DNA. The goosegrass plastome is a circular molecule of 135,151bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 20,919 bases. The large (LSC) and the small (SSC) single-copy regions span 80,667 bases and 12,646 bases, respectively. The plastome of goosegrass has 38.19% GC content and includes 108 unique genes, of which 76 are protein-coding, 28 are transfer RNA, and 4 are ribosomal RNA. The goosegrass plastome sequence was compared to eight other species of Poaceae. Although generally conserved with respect to Poaceae, this genomic resource will be useful for evolutionary studies within this weed species and the genus Eleusine. Copyright © 2016. Published by Elsevier B.V.
Tumor Cell-Free DNA Copy Number Instability Predicts Therapeutic Response to Immunotherapy.

PubMed

Weiss, Glen J; Beck, Julia; Braun, Donald P; Bornemann-Kolatzki, Kristen; Barilla, Heather; Cubello, Rhiannon; Quan, Walter; Sangal, Ashish; Khemka, Vivek; Waypa, Jordan; Mitchell, William M; Urnovitz, Howard; Schütz, Ekkehard

2017-09-01

Purpose: Chromosomal instability is a fundamental property of cancer, which can be quantified by next-generation sequencing (NGS) from plasma/serum-derived cell-free DNA (cfDNA). We hypothesized that cfDNA could be used as a real-time surrogate for imaging analysis of disease status as a function of response to immunotherapy and as a more reliable tool than tumor biomarkers. Experimental Design: Plasma cfDNA sequences from 56 patients with diverse advanced cancers were prospectively collected and analyzed in a single-blind study for copy number variations, expressed as a quantitative chromosomal number instability (CNI) score versus 126 noncancer controls in a training set of 23 and a blinded validation set of 33. Tumor biomarker concentrations and a surrogate marker for T regulatory cells (Tregs) were comparatively analyzed. Results: Elevated CNI scores were observed in 51 of 56 patients prior to therapy. The blinded validation cohort provided an overall prediction accuracy of 83% (25/30) and a positive predictive value of CNI score for progression of 92% (11/12). The combination of CNI score before cycle (Cy) 2 and 3 yielded a correct prediction for progression in all 13 patients. The CNI score also correctly identified cases of pseudo-tumor progression from hyperprogression. Before Cy2 and Cy3, there was no significant correlation for protein tumor markers, total cfDNA, or surrogate Tregs. Conclusions: Chromosomal instability quantification in plasma cfDNA can serve as an early indicator of response to immunotherapy. The method has the potential to reduce health care costs and disease burden for cancer patients following further validation. Clin Cancer Res; 23(17); 5074-81. ©2017 AACR . ©2017 American Association for Cancer Research.
Molecular phylogeny and character evolution in terete-stemmed Andean opuntias (Cactaceae-Opuntioideae).

PubMed

Ritz, C M; Reiker, J; Charles, G; Hoxey, P; Hunt, D; Lowry, M; Stuppy, W; Taylor, N

2012-11-01

The cacti of tribe Tephrocacteae (Cactaceae-Opuntioideae) are adapted to diverse climatic conditions over a wide area of the southern Andes and adjacent lowlands. They exhibit a range of life forms from geophytes and cushion-plants to dwarf shrubs, shrubs or small trees. To confirm or challenge previous morphology-based classifications and molecular phylogenies, we sampled DNA sequences from the chloroplast trnK/matK region and the nuclear low copy gene phyC and compared the resulting phylogenies with previous data gathered from nuclear ribosomal DNA sequences. The here presented chloroplast and nuclear low copy gene phylogenies were mutually congruent and broadly coincident with the classification based on gross morphology and seed micro-morphology and anatomy. Reconstruction of hypothetical ancestral character states suggested that geophytes and cushion-forming species probably evolved several times from dwarf shrubby precursors. We also traced an increase of embryo size at the expense of the nucellus-derived storage tissue during the evolution of the Tephrocacteae, which is thought to be an evolutionary advantage because nutrients are then more rapidly accessible for the germinating embryo. In contrast to these highly concordant phylogenies, nuclear ribosomal DNA data sampled by a previous study yielded conflicting phylogenetic signals. Secondary structure predictions of ribosomal transcribed spacers suggested that this phylogeny is strongly influenced by the inclusion of paralogous sequence probably arisen by genome duplication during the evolution of this plant group. Copyright © 2012 Elsevier Inc. All rights reserved.
Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny.

PubMed

Urrutia, Eugene; Chen, Hao; Zhou, Zilu; Zhang, Nancy R; Jiang, Yuchao

2018-06-15

Copy number variation is an important and abundant source of variation in the human genome, which has been associated with a number of diseases, especially cancer. Massively parallel next-generation sequencing allows copy number profiling with fine resolution. Such efforts, however, have met with mixed successes, with setbacks arising partly from the lack of reliable analytical methods to meet the diverse and unique challenges arising from the myriad experimental designs and study goals in genetic studies. In cancer genomics, detection of somatic copy number changes and profiling of allele-specific copy number (ASCN) are complicated by experimental biases and artifacts as well as normal cell contamination and cancer subclone admixture. Furthermore, careful statistical modeling is warranted to reconstruct tumor phylogeny by both somatic ASCN changes and single nucleotide variants. Here we describe a flexible computational pipeline, MARATHON, which integrates multiple related statistical software for copy number profiling and downstream analyses in disease genetic studies. MARATHON is publicly available at https://github.com/yuchaojiang/MARATHON. Supplementary data are available at Bioinformatics online.

RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing.

PubMed

Chang, Lun-Ching; Das, Biswajit; Lih, Chih-Jian; Si, Han; Camalier, Corinne E; McGregor, Paul M; Polley, Eric

2016-01-01

With rapid advances in DNA sequencing technologies, whole exome sequencing (WES) has become a popular approach for detecting somatic mutations in oncology studies. The initial intent of WES was to characterize single nucleotide variants, but it was observed that the number of sequencing reads that mapped to a genomic region correlated with the DNA copy number variants (CNVs). We propose a method RefCNV that uses a reference set to estimate the distribution of the coverage for each exon. The construction of the reference set includes an evaluation of the sources of variability in the coverage distribution. We observed that the processing steps had an impact on the coverage distribution. For each exon, we compared the observed coverage with the expected normal coverage. Thresholds for determining CNVs were selected to control the false-positive error rate. RefCNV prediction correlated significantly (r = 0.96-0.86) with CNV measured by digital polymerase chain reaction for MET (7q31), EGFR (7p12), or ERBB2 (17q12) in 13 tumor cell lines. The genome-wide CNV analysis showed a good overall correlation (Spearman's coefficient = 0.82) between RefCNV estimation and publicly available CNV data in Cancer Cell Line Encyclopedia. RefCNV also showed better performance than three other CNV estimation methods in genome-wide CNV analysis.
Porcine MAP3K5 analysis: molecular cloning, characterization, tissue expression pattern, and copy number variations associated with residual feed intake.

PubMed

Pu, L; Zhang, L C; Zhang, J S; Song, X; Wang, L G; Liang, J; Zhang, Y B; Liu, X; Yan, H; Zhang, T; Yue, J W; Li, N; Wu, Q Q; Wang, L X

2016-08-12

Mitogen-activated protein kinase kinase kinase 5 (MAP3K5) is essential for apoptosis, proliferation, differentiation, and immune responses, and is a candidate marker for residual feed intake (RFI) in pig. We cloned the full-length cDNA sequence of porcine MAP3K5 by rapid-amplification of cDNA ends. The 5451-bp gene contains a 5'-untranslated region (UTR) (718 bp), a coding region (3738 bp), and a 3'-UTR (995 bp), and encodes a peptide of 1245 amino acids, which shares 97, 99, 97, 93, 91, and 84% sequence identity with cattle, sheep, human, mouse, chicken, and zebrafish MAP3K5, respectively. The deduced MAP3K5 protein sequence contains two conserved domains: a DUF4071 domain and a protein kinase domain. Phylogenetic analysis showed that porcine MAP3K5 forms a separate branch to vicugna and camel MAP3K5. Tissue expression analysis using real-time quantitative polymerase chain reaction (qRT-PCR) revealed that MAP3K5 was expressed in the heart, liver, spleen, lung, kidney, muscle, fat, pancrea, ileum, and stomach tissues. Copy number variation was detected for porcine MAP3K5 and validated by qRT-PCR. Furthermore, a significant increase in average copy number was detected in the low RFI group when compared to the high RFI group in a Duroc pig population. These results provide useful information regarding the influence of MAP3K5 on RFI in pigs.
High-throughput sequencing of three Lemnoideae (duckweeds) chloroplast genomes from total DNA.

PubMed

Wang, Wenqin; Messing, Joachim

2011-01-01

Chloroplast genomes provide a wealth of information for evolutionary and population genetic studies. Chloroplasts play a particularly important role in the adaption for aquatic plants because they float on water and their major surface is exposed continuously to sunlight. The subfamily of Lemnoideae represents such a collection of aquatic species that because of photosynthesis represents one of the fastest growing plant species on earth. We sequenced the chloroplast genomes from three different genera of Lemnoideae, Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana by high-throughput DNA sequencing of genomic DNA using the SOLiD platform. Unfractionated total DNA contains high copies of plastid DNA so that sequences from the nucleus and mitochondria can easily be filtered computationally. Remaining sequence reads were assembled into contiguous sequences (contigs) using SOLiD software tools. Contigs were mapped to a reference genome of Lemna minor and gaps, selected by PCR, were sequenced on the ABI3730xl platform. This combinatorial approach yielded whole genomic contiguous sequences in a cost-effective manner. Over 1,000-time coverage of chloroplast from total DNA were reached by the SOLiD platform in a single spot on a quadrant slide without purification. Comparative analysis indicated that the chloroplast genome was conserved in gene number and organization with respect to the reference genome of L. minor. However, higher nucleotide substitution, abundant deletions and insertions occurred in non-coding regions of these genomes, indicating a greater genomic dynamics than expected from the comparison of other related species in the Pooideae. Noticeably, there was no transition bias over transversion in Lemnoideae. The data should have immediate applications in evolutionary biology and plant taxonomy with increased resolution and statistical power.
High-Throughput Sequencing of Three Lemnoideae (Duckweeds) Chloroplast Genomes from Total DNA

PubMed Central

Wang, Wenqin; Messing, Joachim

2011-01-01

Background Chloroplast genomes provide a wealth of information for evolutionary and population genetic studies. Chloroplasts play a particularly important role in the adaption for aquatic plants because they float on water and their major surface is exposed continuously to sunlight. The subfamily of Lemnoideae represents such a collection of aquatic species that because of photosynthesis represents one of the fastest growing plant species on earth. Methods We sequenced the chloroplast genomes from three different genera of Lemnoideae, Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana by high-throughput DNA sequencing of genomic DNA using the SOLiD platform. Unfractionated total DNA contains high copies of plastid DNA so that sequences from the nucleus and mitochondria can easily be filtered computationally. Remaining sequence reads were assembled into contiguous sequences (contigs) using SOLiD software tools. Contigs were mapped to a reference genome of Lemna minor and gaps, selected by PCR, were sequenced on the ABI3730xl platform. Conclusions This combinatorial approach yielded whole genomic contiguous sequences in a cost-effective manner. Over 1,000-time coverage of chloroplast from total DNA were reached by the SOLiD platform in a single spot on a quadrant slide without purification. Comparative analysis indicated that the chloroplast genome was conserved in gene number and organization with respect to the reference genome of L. minor. However, higher nucleotide substitution, abundant deletions and insertions occurred in non-coding regions of these genomes, indicating a greater genomic dynamics than expected from the comparison of other related species in the Pooideae. Noticeably, there was no transition bias over transversion in Lemnoideae. The data should have immediate applications in evolutionary biology and plant taxonomy with increased resolution and statistical power. PMID:21931804
A Children's Oncology Group and TARGET initiative exploring the genetic landscape of Wilms tumor. | Office of Cancer Genomics

Cancer.gov

We performed genome-wide sequencing and analyzed mRNA and miRNA expression, DNA copy number, and DNA methylation in 117 Wilms tumors, followed by targeted sequencing of 651 Wilms tumors. In addition to genes previously implicated in Wilms tumors (WT1, CTNNB1, AMER1, DROSHA, DGCR8, XPO5, DICER1, SIX1, SIX2, MLLT1, MYCN, and TP53), we identified mutations in genes not previously recognized as recurrently involved in Wilms tumors, the most frequent being BCOR, BCORL1, NONO, MAX, COL6A3, ASXL1, MAP3K4, and ARID1A.
Evolution in the block: common elements of 5S rDNA organization and evolutionary patterns in distant fish genera.

PubMed

Campo, Daniel; García-Vázquez, Eva

2012-01-01

The 5S rDNA is organized in the genome as tandemly repeated copies of a structural unit composed of a coding sequence plus a nontranscribed spacer (NTS). The coding region is highly conserved in the evolution, whereas the NTS vary in both length and sequence. It has been proposed that 5S rRNA genes are members of a gene family that have arisen through concerted evolution. In this study, we describe the molecular organization and evolution of the 5S rDNA in the genera Lepidorhombus and Scophthalmus (Scophthalmidae) and compared it with already known 5S rDNA of the very different genera Merluccius (Merluccidae) and Salmo (Salmoninae), to identify common structural elements or patterns for understanding 5S rDNA evolution in fish. High intra- and interspecific diversity within the 5S rDNA family in all the genera can be explained by a combination of duplications, deletions, and transposition events. Sequence blocks with high similarity in all the 5S rDNA members across species were identified for the four studied genera, with evidences of intense gene conversion within noncoding regions. We propose a model to explain the evolution of the 5S rDNA, in which the evolutionary units are blocks of nucleotides rather than the entire sequences or single nucleotides. This model implies a "two-speed" evolution: slow within blocks (homogenized by recombination) and fast within the gene family (diversified by duplications and deletions).
Titration of DnaA protein by oriC DnaA-boxes increases dnaA gene expression in Escherichia coli.

PubMed Central

Hansen, F G; Koefoed, S; Sørensen, L; Atlung, T

1987-01-01

Binding of the DnaA protein to its binding sites, the DnaA-boxes (TTATCCACA), was measured by a simple physiological approach. The presence of extra DnaA-boxes in growing cells leads to a derepression of dnaA gene expression, measured as beta-galactosidase activity of a dnaA-lacZ fusion polypeptide. Different DnaA-boxes caused different degrees of derepression indicating that the DnaA protein requires sequences in addition to the DnaA-box for efficient binding. The DnaA-boxes in oriC might act cooperatively in binding of the DnaA protein. The derepressed levels of DnaA protein obtained in a strain carrying an oriC+-pBR322 chimera were very high and sufficient to activate oriC on the chimeric plasmid, which was maintained at a copy number more than three times that of pBR322. PMID:3034578
Characterization of four species of Trichuris (Nematoda: Enoplida) by their second internal transcribed spacer ribosomal DNA sequence.

PubMed

Oliveros, R; Cutillas, C; De Rojas, M; Arias, P

2000-12-01

Adult worms of Trichuris ovis and T. globulosa were collected from Ovis aries (sheep) and Capra hircus (goats). T. suis was isolated from Sus scrofa domestica (swine) and T. leporis was isolated from Lepus europaeus (rabbits) in Spain. Genomic DNA was isolated and a ribosomal internal transcribed spacer (ITS2) was amplified and sequenced using polymerase-chain-reaction (PCR) techniques. The ITS2 of T. ovis and T. globulosa was 407 nucleotides in length and had a GC content of about 62%. Furthermore, the ITS2 of T. suis and T. leporis was 534 and 418 nucleotides in length and had a GC content of about 64.8% and 62.4%, respectively. There was evidence of slight variation in the sequence within individuals of all species analyzed, indicating intraindividual variation in the sequence of different copies of the ribosomal DNA. Furthermore, low-level intraspecific variation was detected. Sequence analyses of ITS2 products of T. ovis and T. globulosa demonstrated no sequence difference between them. Nevertheless, differences were detected between the ITS2 sequences of T. suis, T. leporis, and T. ovis, indicating that Trichuris species can reliably be differentiated by their ITS2 sequences and PCR-linked restriction-fragment-length polymorphism (RFLP).
[Copy number variation of trinucleotide repeat in dynamic mutation sites of autosomal dominant cerebellar ataxias related genes].

PubMed

Chen, Pu; Ma, Mingyi; Shang, Huifang; Su, Dan; Zhang, Sizhong; Yang, Yuan

2009-12-01

To standardize the experimental procedure of the gene test for autosomal dominant cerebellar ataxias (ADCA), and provide the basis for quantitative criteria of the dynamic mutation of spinocerebellar ataxia (SCA) genes in Chinese population. Genotyping of the dynamic mutation loci of the SCA1, SCA2, SCA3, SCA6 and SCA7 genes was performed, using florescence PCR-capillary electrophoresis followed by DNA sequencing, to investigate the variation range of copy number of CAG tandem repeat of the genes in 263 probands of ADCA pedigrees and 261 non-related normal controls. Based on the sequencing result, the bias of the CAG copy number estimation using capillary electrophoresis with different DNA controls was compared to analyze the technical detailes of the electrophresis method in testing the dynamic mutation sites. PCR products containing dynamic mutation loci of the SCA genes showed significantly higher mobility than that of molecular weigh marker with relatively balanced GC content. This was particularly obvious in the SCA2, SCA 6 and SCA7 genes whereas the deviation of copy number could be corrected to +/-1 when known CAG copy number fragments were used as controls. The mobility of PCR products was primarily related to the copy number of CAG repeat when the fragments contained normal CAG repeat. In the 263 ADCA pedigrees, 6 (2.28%) carried SCA1 gene mutation, 8 (3.04%) had SCA2 mutation and 81 (30.80%) harbored SCA3 mutation. The gene mutation of SCA6 and SCA7 was not found. The normal variation range of the CAG repeat was 17-36 copies in SCA1 gene, 13-30 copies in SCA2, 14-39 copies in SCA3, 6-16 copies in SCA6 and 6-13 copies in SCA7. The heterozygosity was 76.1%, 17.7%, 74.4%, 72.1% and 41.3%, respectively. The mutation range of the CAG repeat was 49-56 copies in SCA1 gene, 36-41 copies in SCA2, 59-81 copies in SCA3. Neither homozygous mutation of an SCA gene nor double heterozygous mutation of the SCA genes was observed in the study. The copy number of the CAG repeat in SCA genes could be calculated accurately based on the result of florescence PCR-capillary electrophoresis when limited amount of known repeat copy number controls were used. Our result supported that the notion that SCA3 gene mutation was the most common cause for ADCA, and the obtained data would be helpful for establishing quantitative criteria of the dynamic mutation of the SCA genes in Chinese.
Complete Genome Sequence of Sporisorium scitamineum and Biotrophic Interaction Transcriptome with Sugarcane

PubMed Central

Benevenuto, Juliana; Peters, Leila P.; Carvalho, Giselle; Palhares, Alessandra; Quecine, Maria C.; Nunes, Filipe R. S.; Kmit, Maria C. P.; Wai, Alvan; Hausner, Georg; Aitken, Karen S.; Berkman, Paul J.; Fraser, James A.; Moolhuijzen, Paula M.; Coutinho, Luiz L.; Creste, Silvana; Vieira, Maria L. C.; Kitajima, João P.; Monteiro-Vitorello, Claudia B.

2015-01-01

Sporisorium scitamineum is a biotrophic fungus responsible for the sugarcane smut, a worldwide spread disease. This study provides the complete sequence of individual chromosomes of S. scitamineum from telomere to telomere achieved by a combination of PacBio long reads and Illumina short reads sequence data, as well as a draft sequence of a second fungal strain. Comparative analysis to previous available sequences of another strain detected few polymorphisms among the three genomes. The novel complete sequence described herein allowed us to identify and annotate extended subtelomeric regions, repetitive elements and the mitochondrial DNA sequence. The genome comprises 19,979,571 bases, 6,677 genes encoding proteins, 111 tRNAs and 3 assembled copies of rDNA, out of our estimated number of copies as 130. Chromosomal reorganizations were detected when comparing to sequences of S. reilianum, the closest smut relative, potentially influenced by repeats of transposable elements. Repetitive elements may have also directed the linkage of the two mating-type loci. The fungal transcriptome profiling from in vitro and from interaction with sugarcane at two time points (early infection and whip emergence) revealed that 13.5% of the genes were differentially expressed in planta and particular to each developmental stage. Among them are plant cell wall degrading enzymes, proteases, lipases, chitin modification and lignin degradation enzymes, sugar transporters and transcriptional factors. The fungus also modulates transcription of genes related to surviving against reactive oxygen species and other toxic metabolites produced by the plant. Previously described effectors in smut/plant interactions were detected but some new candidates are proposed. Ten genomic islands harboring some of the candidate genes unique to S. scitamineum were expressed only in planta. RNAseq data was also used to reassure gene predictions. PMID:26065709
Birth and death of genes linked to chromosomal inversion

PubMed Central

Furuta, Yoshikazu; Kawai, Mikihiko; Yahara, Koji; Takahashi, Noriko; Handa, Naofumi; Tsuru, Takeshi; Oshima, Kenshiro; Yoshida, Masaru; Azuma, Takeshi; Hattori, Masahira; Uchiyama, Ikuo; Kobayashi, Ichizo

2011-01-01

The birth and death of genes is central to adaptive evolution, yet the underlying genome dynamics remain elusive. The availability of closely related complete genome sequences helps to follow changes in gene contents and clarify their relationship to overall genome organization. Helicobacter pylori, bacteria in our stomach, are known for their extreme genome plasticity through mutation and recombination and will make a good target for such an analysis. In comparing their complete genome sequences, we found that gain and loss of genes (loci) for outer membrane proteins, which mediate host interaction, occurred at breakpoints of chromosomal inversions. Sequence comparison there revealed a unique mechanism of DNA duplication: DNA duplication associated with inversion. In this process, a DNA segment at one chromosomal locus is copied and inserted, in an inverted orientation, into a distant locus on the same chromosome, while the entire region between these two loci is also inverted. Recognition of this and three more inversion modes, which occur through reciprocal recombination between long or short sequence similarity or adjacent to a mobile element, allowed reconstruction of synteny evolution through inversion events in this species. These results will guide the interpretation of extensive DNA sequencing results for understanding long- and short-term genome evolution in various organisms and in cancer cells. PMID:21212362
Combined Targeted DNA Sequencing in Non-Small Cell Lung Cancer (NSCLC) Using UNCseq and NGScopy, and RNA Sequencing Using UNCqeR for the Detection of Genetic Aberrations in NSCLC

PubMed Central

Walter, Vonn; Patel, Nirali M.; Eberhard, David A.; Hayward, Michele C.; Salazar, Ashley H.; Jo, Heejoon; Soloway, Matthew G.; Wilkerson, Matthew D.; Parker, Joel S.; Yin, Xiaoying; Zhang, Guosheng; Siegel, Marni B.; Rosson, Gary B.; Earp, H. Shelton; Sharpless, Norman E.; Gulley, Margaret L.; Weck, Karen E.

2015-01-01

The recent FDA approval of the MiSeqDx platform provides a unique opportunity to develop targeted next generation sequencing (NGS) panels for human disease, including cancer. We have developed a scalable, targeted panel-based assay termed UNCseq, which involves a NGS panel of over 200 cancer-associated genes and a standardized downstream bioinformatics pipeline for detection of single nucleotide variations (SNV) as well as small insertions and deletions (indel). In addition, we developed a novel algorithm, NGScopy, designed for samples with sparse sequencing coverage to detect large-scale copy number variations (CNV), similar to human SNP Array 6.0 as well as small-scale intragenic CNV. Overall, we applied this assay to 100 snap-frozen lung cancer specimens lacking same-patient germline DNA (07–0120 tissue cohort) and validated our results against Sanger sequencing, SNP Array, and our recently published integrated DNA-seq/RNA-seq assay, UNCqeR, where RNA-seq of same-patient tumor specimens confirmed SNV detected by DNA-seq, if RNA-seq coverage depth was adequate. In addition, we applied the UNCseq assay on an independent lung cancer tumor tissue collection with available same-patient germline DNA (11–1115 tissue cohort) and confirmed mutations using assays performed in a CLIA-certified laboratory. We conclude that UNCseq can identify SNV, indel, and CNV in tumor specimens lacking germline DNA in a cost-efficient fashion. PMID:26076459
Investigation of the mechanism of meiotic DNA cleavage by VMA1-derived endonuclease uncovers a meiotic alteration in chromatin structure around the target site.

PubMed

Fukuda, Tomoyuki; Ohta, Kunihiro; Ohya, Yoshikazu

2006-06-01

VMA1-derived endonuclease (VDE), a homing endonuclease in Saccharomyces cerevisiae, is encoded by the mobile intein-coding sequence within the nuclear VMA1 gene. VDE recognizes and cleaves DNA at the 31-bp VDE recognition sequence (VRS) in the VMA1 gene lacking the intein-coding sequence during meiosis to insert a copy of the intein-coding sequence at the cleaved site. The mechanism underlying the meiosis specificity of VMA1 intein-coding sequence homing remains unclear. We studied various factors that might influence the cleavage activity in vivo and found that VDE binding to the VRS can be detected only when DNA cleavage by VDE takes place, implying that meiosis-specific DNA cleavage is regulated by the accessibility of VDE to its target site. As a possible candidate for the determinant of this accessibility, we analyzed chromatin structure around the VRS and revealed that local chromatin structure near the VRS is altered during meiosis. Although the meiotic chromatin alteration exhibits correlations with DNA binding and cleavage by VDE at the VMA1 locus, such a chromatin alteration is not necessarily observed when the VRS is embedded in ectopic gene loci. This suggests that nucleosome positioning or occupancy around the VRS by itself is not the sole mechanism for the regulation of meiosis-specific DNA cleavage by VDE and that other mechanisms are involved in the regulation.
Investigation of the Mechanism of Meiotic DNA Cleavage by VMA1-Derived Endonuclease Uncovers a Meiotic Alteration in Chromatin Structure around the Target Site

PubMed Central

Fukuda, Tomoyuki; Ohta, Kunihiro; Ohya, Yoshikazu

2006-01-01

VMA1-derived endonuclease (VDE), a homing endonuclease in Saccharomyces cerevisiae, is encoded by the mobile intein-coding sequence within the nuclear VMA1 gene. VDE recognizes and cleaves DNA at the 31-bp VDE recognition sequence (VRS) in the VMA1 gene lacking the intein-coding sequence during meiosis to insert a copy of the intein-coding sequence at the cleaved site. The mechanism underlying the meiosis specificity of VMA1 intein-coding sequence homing remains unclear. We studied various factors that might influence the cleavage activity in vivo and found that VDE binding to the VRS can be detected only when DNA cleavage by VDE takes place, implying that meiosis-specific DNA cleavage is regulated by the accessibility of VDE to its target site. As a possible candidate for the determinant of this accessibility, we analyzed chromatin structure around the VRS and revealed that local chromatin structure near the VRS is altered during meiosis. Although the meiotic chromatin alteration exhibits correlations with DNA binding and cleavage by VDE at the VMA1 locus, such a chromatin alteration is not necessarily observed when the VRS is embedded in ectopic gene loci. This suggests that nucleosome positioning or occupancy around the VRS by itself is not the sole mechanism for the regulation of meiosis-specific DNA cleavage by VDE and that other mechanisms are involved in the regulation. PMID:16757746
Mitochondrial DNA Copy Number in Sleep Duration Discordant Monozygotic Twins.

PubMed

Wrede, Joanna E; Mengel-From, Jonas; Buchwald, Dedra; Vitiello, Michael V; Bamshad, Michael; Noonan, Carolyn; Christiansen, Lene; Christensen, Kaare; Watson, Nathaniel F

2015-10-01

Mitochondrial DNA (mtDNA) copy number is an important component of mitochondrial function and varies with age, disease, and environmental factors. We aimed to determine whether mtDNA copy number varies with habitual differences in sleep duration within pairs of monozygotic twins. Academic clinical research center. 15 sleep duration discordant monozygotic twin pairs (30 twins, 80% female; mean age 42.1 years [SD 15.0]). Sleep duration was phenotyped with wrist actigraphy. Each twin pair included a "normal" (7-9 h/24) and "short" (< 7 h/24) sleeping twin. Fasting peripheral blood leukocyte DNA was assessed for mtDNA copy number via the n-fold difference between qPCR measured mtDNA and nuclear DNA creating an mtDNA measure without absolute units. We used generalized estimating equation linear regression models accounting for the correlated data structure to assess within-pair effects of sleep duration on mtDNA copy number. Mean within-pair sleep duration difference per 24 hours was 94.3 minutes (SD 62.6 min). We found reduced sleep duration (β = 0.06; 95% CI 0.004, 0.12; P < 0.05) and sleep efficiency (β = 0.51; 95% CI 0.06, 0.95; P < 0.05) were significantly associated with reduced mtDNA copy number within twin pairs. Thus every 1-minute decrease in actigraphy-defined sleep duration was associated with a decrease in mtDNA copy number of 0.06. Likewise, a 1% decrease in actigraphy-defined sleep efficiency was associated with a decrease in mtDNA copy number of 0.51. Reduced sleep duration and sleep efficiency were associated with reduced mitochondrial DNA copy number in sleep duration discordant monozygotic twins offering a potential mechanism whereby short sleep impairs health and longevity through mitochondrial stress. © 2015 Associated Professional Sleep Societies, LLC.
Gene conversion events and variable degree of homogenization of rDNA loci in cultivars of Brassica napus

PubMed Central

Sochorová, Jana; Coriton, Olivier; Kuderová, Alena; Lunerová, Jana; Chèvre, Anne-Marie; Kovařík, Aleš

2017-01-01

Background and aims Brassica napus (AACC, 2n = 38, oilseed rape) is a relatively recent allotetraploid species derived from the putative progenitor diploid species Brassica rapa (AA, 2n = 20) and Brassica oleracea (CC, 2n = 18). To determine the influence of intensive breeding conditions on the evolution of its genome, we analysed structure and copy number of rDNA in 21 cultivars of B. napus, representative of genetic diversity. Methods We used next-generation sequencing genomic approaches, Southern blot hybridization, expression analysis and fluorescence in situ hybridization (FISH). Subgenome-specific sequences derived from rDNA intergenic spacers (IGS) were used as probes for identification of loci composition on chromosomes. Key Results Most B. napus cultivars (18/21, 86 %) had more A-genome than C-genome rDNA copies. Three cultivars analysed by FISH (‘Darmor’, ‘Yudal’ and ‘Asparagus kale’) harboured the same number (12 per diploid set) of loci. In B. napus ‘Darmor’, the A-genome-specific rDNA probe hybridized to all 12 rDNA loci (eight on the A-genome and four on the C-genome) while the C-genome-specific probe showed weak signals on the C-genome loci only. Deep sequencing revealed high homogeneity of arrays suggesting that the C-genome genes were largely overwritten by the A-genome variants in B. napus ‘Darmor’. In contrast, B. napus ‘Yudal’ showed a lack of gene conversion evidenced by additive inheritance of progenitor rDNA variants and highly localized hybridization signals of subgenome-specific probes on chromosomes. Brassica napus ‘Asparagus kale’ showed an intermediate pattern to ‘Darmor’ and ‘Yudal’. At the expression level, most cultivars (95 %) exhibited stable A-genome nucleolar dominance while one cultivar (‘Norin 9’) showed co-dominance. Conclusions The B. napus cultivars differ in the degree and direction of rDNA homogenization. The prevalent direction of gene conversion (towards the A-genome) correlates with the direction of expression dominance indicating that gene activity may be needed for interlocus gene conversion. PMID:27707747
A robust method to analyze copy number alterations of less than 100 kb in single cells using oligonucleotide array CGH.

PubMed

Möhlendick, Birte; Bartenhagen, Christoph; Behrens, Bianca; Honisch, Ellen; Raba, Katharina; Knoefel, Wolfram T; Stoecklein, Nikolas H

2013-01-01

Comprehensive genome wide analyses of single cells became increasingly important in cancer research, but remain to be a technically challenging task. Here, we provide a protocol for array comparative genomic hybridization (aCGH) of single cells. The protocol is based on an established adapter-linker PCR (WGAM) and allowed us to detect copy number alterations as small as 56 kb in single cells. In addition we report on factors influencing the success of single cell aCGH downstream of the amplification method, including the characteristics of the reference DNA, the labeling technique, the amount of input DNA, reamplification, the aCGH resolution, and data analysis. In comparison with two other commercially available non-linear single cell amplification methods, WGAM showed a very good performance in aCGH experiments. Finally, we demonstrate that cancer cells that were processed and identified by the CellSearch® System and that were subsequently isolated from the CellSearch® cartridge as single cells by fluorescence activated cell sorting (FACS) could be successfully analyzed using our WGAM-aCGH protocol. We believe that even in the era of next-generation sequencing, our single cell aCGH protocol will be a useful and (cost-) effective approach to study copy number alterations in single cells at resolution comparable to those reported currently for single cell digital karyotyping based on next generation sequencing data.
A quantitative PCR assay for aerobic, vinyl chloride- and ethene-assimilating microorganisms in groundwater.

PubMed

Jin, Yang Oh; Mattes, Timothy E

2010-12-01

Vinyl chloride (VC) is a known human carcinogen that is primarily formed in groundwater via incomplete anaerobic dechlorination of chloroethenes. Aerobic, ethene-degrading bacteria (etheneotrophs), which are capable of both fortuitous and growth-linked VC oxidation, could be important in natural attenuation of VC plumes that escape anaerobic treatment. In this work, we developed a quantitative, real-time PCR (qPCR) assay for etheneotrophs in groundwater. We designed and tested degenerate qPCR primers for two functional genes involved in aerobic, growth-coupled VC- and ethene-oxidation (etnC and etnE). Primer specificity to these target genes was tested by comparison to nucleotide sequence databases, PCR analysis of template DNA extracted from isolates and environmental samples, and sequencing of qPCR products obtained from VC-contaminated groundwater. The assay was made quantitative by constructing standard curves (threshold cycle vs log gene copy number) with DNA amplified from Mycobacterium strain JS60, an etheneotrophic isolate. Analysis of groundwater samples from three different VC-contaminated sites revealed that etnC abundance ranged from 1.6 × 10(3) - 1.0 × 10(5) copies/L groundwater while etnE abundance ranged from 4.3 × 10(3) - 6.3 × 10(5) copies/L groundwater. Our data suggest this novel environmental measurement method will be useful for supporting VC bioremediation strategies, assisting in site closure, and conducting microbial ecology studies involving etheneotrophs.
Computational Evaluation of the Strict Master and Random Template Models of Endogenous Retrovirus Evolution

PubMed Central

Nascimento, Fabrícia F.; Rodrigo, Allen G.

2016-01-01

Transposable elements (TEs) are DNA sequences that are able to replicate and move within and between host genomes. Their mechanism of replication is also shared with endogenous retroviruses (ERVs), which are also a type of TE that represent an ancient retroviral infection within animal genomes. Two models have been proposed to explain TE proliferation in host genomes: the strict master model (SMM), and the random template (or transposon) model (TM). In SMM only a single copy of a given TE lineage is able to replicate, and all other genomic copies of TEs are derived from that master copy. In TM, any element of a given family is able to replicate in the host genome. In this paper, we simulated ERV phylogenetic trees under variations of SMM and TM. To test whether current phylogenetic programs can recover the simulated ERV phylogenies, DNA sequence alignments were simulated and maximum likelihood trees were reconstructed and compared to the simulated phylogenies. Results indicate that visual inspection of phylogenetic trees alone can be misleading. However, if a set of statistical summaries is calculated, we are able to distinguish between models with high accuracy by using a data mining algorithm that we introduce here. We also demonstrate the use of our data mining algorithm with empirical data for the porcine endogenous retrovirus (PERV), an ERV that is able to replicate in human and pig cells in vitro. PMID:27649303
The vestigial olfactory receptor subgenome of odontocete whales: phylogenetic congruence between gene-tree reconciliation and supermatrix methods.

PubMed

McGowen, Michael R; Clark, Clay; Gatesy, John

2008-08-01

The macroevolutionary transition of whales (cetaceans) from a terrestrial quadruped to an obligate aquatic form involved major changes in sensory abilities. Compared to terrestrial mammals, the olfactory system of baleen whales is dramatically reduced, and in toothed whales is completely absent. We sampled the olfactory receptor (OR) subgenomes of eight cetacean species from four families. A multigene tree of 115 newly characterized OR sequences from these eight species and published data for Bos taurus revealed a diverse array of class II OR paralogues in Cetacea. Evolution of the OR gene superfamily in toothed whales (Odontoceti) featured a multitude of independent pseudogenization events, supporting anatomical evidence that odontocetes have lost their olfactory sense. We explored the phylogenetic utility of OR pseudogenes in Cetacea, concentrating on delphinids (oceanic dolphins), the product of a rapid evolutionary radiation that has been difficult to resolve in previous studies of mitochondrial DNA sequences. Phylogenetic analyses of OR pseudogenes using both gene-tree reconciliation and supermatrix methods yielded fully resolved, consistently supported relationships among members of four delphinid subfamilies. Alternative minimizations of gene duplications, gene duplications plus gene losses, deep coalescence events, and nucleotide substitutions plus indels returned highly congruent phylogenetic hypotheses. Novel DNA sequence data for six single-copy nuclear loci and three mitochondrial genes (> 5000 aligned nucleotides) provided an independent test of the OR trees. Nucleotide substitutions and indels in OR pseudogenes showed a very low degree of homoplasy in comparison to mitochondrial DNA and, on average, provided more variation than single-copy nuclear DNA. Our results suggest that phylogenetic analysis of the large OR superfamily will be effective for resolving relationships within Cetacea whether supermatrix or gene-tree reconciliation procedures are used.

Pleistocene climate change and the origin of two desert plant species, Pugionium cornutum and Pugionium dolabratum (Brassicaceae), in northwest China.

PubMed

Wang, Qian; Abbott, Richard J; Yu, Qiu-Shi; Lin, Kao; Liu, Jian-Quan

2013-07-01

Pleistocene climate change has had an important effect in shaping intraspecific genetic variation in many species; however, its role in driving speciation is less clear. We examined the possibility of a Pleistocene origin of the only two representatives of the genus Pugionium (Brassicaceae), Pugionium cornutum and Pugionium dolabratum, which occupy different desert habitats in northwest China. We surveyed sequence variation for internal transcribed spacer (ITS), three chloroplast (cp) DNA fragments, and eight low-copy nuclear genes among individuals sampled from 11 populations of each species across their geographic ranges. One ITS mutation distinguished the two species, whereas mutations in cpDNA and the eight low-copy nuclear gene sequences were not species-specific. Although interspecific divergence varied greatly among nuclear gene sequences, in each case divergence was estimated to have occurred within the Pleistocene when deserts expanded in northwest China. Our findings point to the importance of Pleistocene climate change, in this case an increase in aridity, as a cause of speciation in Pugionium as a result of divergence in different habitats that formed in association with the expansion of deserts in China. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Demonstration of human T-cell lymphotropic virus type I (HTLV-I) from an HTLV-I seronegative south Indian patient with chronic, progressive spastic paraparesis.

PubMed

Nishimura, M; Mingioli, E; McFarlin, D E; Jacobson, S

1993-12-01

Here we describe a human T-cell lymphotropic virus type I (HTLV-I) seronegative patient from South India with a chronic, progressive spastic paraparesis from which HTLV-I has been isolated from peripheral blood lymphocytes. HTLV-I pol and tax viral sequences were detected in DNA from fresh peripheral blood lymphocytes (PBL) by polymerase chain reaction (PCR) and liquid hybridization techniques. Southern blot analysis of the PCR products demonstrated a low copy number of HTLV-I at the level of one viral copy per 10,000 fresh PBL. A long-term CD4+ T-cell line was established from PBL of this patient using recombinant interleukin-2, OKT3, and feeder cells. DNA from these cultured lines was amplified and portions of the HTLV-I long terminal repeat (U3), pol, env, and tax regions were sequenced (a total of 1,115 bp). The sequence data showed that the HTLV-I associated with this patient was 98.8% homologous to prototype HTLV-I. Southern blot analysis also confirmed the presence of full-length HTLV-I. These results indicate that HTLV-I can be demonstrated in an HTLV-I seronegative patient from South India with a chronic progressive neurological disorder.
Identification of structural variation in mouse genomes.

PubMed

Keane, Thomas M; Wong, Kim; Adams, David J; Flint, Jonathan; Reymond, Alexandre; Yalcin, Binnaz

2014-01-01

Structural variation is variation in structure of DNA regions affecting DNA sequence length and/or orientation. It generally includes deletions, insertions, copy-number gains, inversions, and transposable elements. Traditionally, the identification of structural variation in genomes has been challenging. However, with the recent advances in high-throughput DNA sequencing and paired-end mapping (PEM) methods, the ability to identify structural variation and their respective association to human diseases has improved considerably. In this review, we describe our current knowledge of structural variation in the mouse, one of the prime model systems for studying human diseases and mammalian biology. We further present the evolutionary implications of structural variation on transposable elements. We conclude with future directions on the study of structural variation in mouse genomes that will increase our understanding of molecular architecture and functional consequences of structural variation.
Morphology and genome organization of the virus PSV of the hyperthermophilic archaeal genera Pyrobaculum and Thermoproteus: a novel virus family, the Globuloviridae.

PubMed

Häring, Monika; Peng, Xu; Brügger, Kim; Rachel, Reinhard; Stetter, Karl O; Garrett, Roger A; Prangishvili, David

2004-06-01

A novel virus, termed Pyrobaculum spherical virus (PSV), is described that infects anaerobic hyperthermophilic archaea of the genera Pyrobaculum and Thermoproteus. Spherical enveloped virions, about 100 nm in diameter, contain a major multimeric 33-kDa protein and host-derived lipids. A viral envelope encases a superhelical nucleoprotein core containing linear double-stranded DNA. The PSV infection cycle does not cause lysis of host cells. The viral genome was sequenced and contains 28337 bp. The genome is unique for known archaeal viruses in that none of the genes, including that encoding the major structural protein, show any significant sequence matches to genes in public sequence databases. Exceptionally for an archaeal double-stranded DNA virus, almost all the recognizable genes are located on one DNA strand. The ends of the genome consist of 190-bp inverted repeats that contain multiple copies of short direct repeats. The two DNA strands are probably covalently linked at their termini. On the basis of the unusual morphological and genomic properties of this DNA virus, we propose to assign PSV to a new viral family, the Globuloviridae.
Sequence heterogeneity in the two 16S rRNA genes of Phormium yellow leaf phytoplasma.

PubMed Central

Liefting, L W; Andersen, M T; Beever, R E; Gardner, R C; Forster, R L

1996-01-01

Phormium yellow leaf (PYL) phytoplasma causes a lethal disease of the monocotyledon, New Zealand flax (Phormium tenax). The 16S rRNA genes of PYL phytoplasma were amplified from infected flax by PCR and cloned, and the nucleotide sequences were determined. DNA sequencing and Southern hybridization analysis of genomic DNA indicated the presence of two copies of the 16S rRNA gene. The two 16S rRNA genes exhibited sequence heterogeneity in 4 nucleotide positions and could be distinguished by the restriction enzymes BpmI and BsrI. This is the first record in which sequence heterogeneity in the 16S rRNA genes of a phytoplasma has been determined by sequence analysis. A phylogenetic tree based on 16S rRNA gene sequences showed that PYL phytoplasma is most closely related to the stolbur and German grapevine yellows phytoplasmas, which form the stolbur subgroup of the aster yellows group. This phylogenetic position of PYL phytoplasma was supported by 16S/23S spacer region sequence data. PMID:8795200
Crossing the LINE toward genomic instability: LINE-1 retrotransposition in cancer

NASA Astrophysics Data System (ADS)

Kemp, Jacqueline; Longworth, Michelle

2015-12-01

Retrotransposons are repetitive DNA sequences that are positioned throughout the human genome. Retrotransposons are capable of copying themselves and mobilizing new copies to novel genomic locations in a process called retrotransposition. While most retrotransposon sequences in the human genome are incomplete and incapable of mobilization, the LINE-1 retrotransposon, which comprises approximately 17% of the human genome, remains active. The disruption of cellular mechanisms that suppress retrotransposon activity is linked to the generation of aneuploidy, a potential driver of tumor development. When retrotransposons insert into a novel genomic region, they have the potential to disrupt the coding sequence of endogenous genes and alter gene expression, which can lead to deleterious consequences for the organism. Additionally, increased LINE-1 copy numbers provide more chances for recombination events to occur between retrotransposons, which can lead to chromosomal breaks and rearrangements. LINE-1 activity is increased in various cancer cell lines and in patient tissues resected from primary tumors. LINE-1 activity also correlates with increased cancer metastasis. This review aims to give a brief overview of the connections between LINE-1 retrotransposition and the loss of genome stability. We will also discuss the mechanisms that repress retrotransposition in human cells and their links to cancer.
Dancing together and separate again: gymnosperms exhibit frequent changes of fundamental 5S and 35S rRNA gene (rDNA) organisation

PubMed Central

Garcia, S; Kovařík, A

2013-01-01

In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S–5.8S–26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S–18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S–5.8S–26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants. PMID:23512008
Dancing together and separate again: gymnosperms exhibit frequent changes of fundamental 5S and 35S rRNA gene (rDNA) organisation.

PubMed

Garcia, S; Kovařík, A

2013-07-01

In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S-5.8S-26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S-18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S-5.8S-26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants.
Calling Chromosome Alterations, DNA Methylation Statuses, and Mutations in Tumors by Simple Targeted Next-Generation Sequencing: A Solution for Transferring Integrated Pangenomic Studies into Routine Practice?

PubMed

Garinet, Simon; Néou, Mario; de La Villéon, Bruno; Faillot, Simon; Sakat, Julien; Da Fonseca, Juliana P; Jouinot, Anne; Le Tourneau, Christophe; Kamal, Maud; Luscap-Rondof, Windy; Boeva, Valentina; Gaujoux, Sebastien; Vidaud, Michel; Pasmant, Eric; Letourneur, Franck; Bertherat, Jérôme; Assié, Guillaume

2017-09-01

Pangenomic studies identified distinct molecular classes for many cancers, with major clinical applications. However, routine use requires cost-effective assays. We assessed whether targeted next-generation sequencing (NGS) could call chromosomal alterations and DNA methylation status. A training set of 77 tumors and a validation set of 449 (43 tumor types) were analyzed by targeted NGS and single-nucleotide polymorphism (SNP) arrays. Thirty-two tumors were analyzed by NGS after bisulfite conversion, and compared to methylation array or methylation-specific multiplex ligation-dependent probe amplification. Considering allelic ratios, correlation was strong between targeted NGS and SNP arrays (r = 0.88). In contrast, considering DNA copy number, for variations of one DNA copy, correlation was weaker between read counts and SNP array (r = 0.49). Thus, we generated TARGOMICs, optimized for detecting chromosome alterations by combining allelic ratios and read counts generated by targeted NGS. Sensitivity for calling normal, lost, and gained chromosomes was 89%, 72%, and 31%, respectively. Specificity was 81%, 93%, and 98%, respectively. These results were confirmed in the validation set. Finally, TARGOMICs could efficiently align and compute proportions of methylated cytosines from bisulfite-converted DNA from targeted NGS. In conclusion, beyond calling mutations, targeted NGS efficiently calls chromosome alterations and methylation status in tumors. A single run and minor design/protocol adaptations are sufficient. Optimizing targeted NGS should expand translation of genomics to clinical routine. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Inhibition of recombinase polymerase amplification by background DNA: a lateral flow-based method for enriching target DNA.

PubMed

Rohrman, Brittany; Richards-Kortum, Rebecca

2015-02-03

Recombinase polymerase amplification (RPA) may be used to detect a variety of pathogens, often after minimal sample preparation. However, previous work has shown that whole blood inhibits RPA. In this paper, we show that the concentrations of background DNA found in whole blood prevent the amplification of target DNA by RPA. First, using an HIV-1 RPA assay with known concentrations of nonspecific background DNA, we show that RPA tolerates more background DNA when higher HIV-1 target concentrations are present. Then, using three additional assays, we demonstrate that the maximum amount of background DNA that may be tolerated in RPA reactions depends on the DNA sequences used in the assay. We also show that changing the RPA reaction conditions, such as incubation time and primer concentration, has little effect on the ability of RPA to function when high concentrations of background DNA are present. Finally, we develop and characterize a lateral flow-based method for enriching the target DNA concentration relative to the background DNA concentration. This sample processing method enables RPA of 10(4) copies of HIV-1 DNA in a background of 0-14 μg of background DNA. Without lateral flow sample enrichment, the maximum amount of background DNA tolerated is 2 μg when 10(6) copies of HIV-1 DNA are present. This method requires no heating or other external equipment, may be integrated with upstream DNA extraction and purification processes, is compatible with the components of lysed blood, and has the potential to detect HIV-1 DNA in infant whole blood with high proviral loads.
Mitochondrial DNA copy number in peripheral blood cell and hypertension risk among mining workers: a case-control study in Chinese coal miners.

PubMed

Lei, L; Guo, J; Shi, X; Zhang, G; Kang, H; Sun, C; Huang, J; Wang, T

2017-09-01

Alteration of mitochondrial DNA (mtDNA) copy number, which reflects oxidant-induced cell damage, has been observed in a wide range of human diseases. However, whether it correlates with hypertension has not been elucidated. We aimed to explore the association between mtDNA copy number and the risk of hypertension in Chinese coal miners. A case-control study was performed with 378 hypertension patients and 325 healthy controls in a large coal mining group located in North China. Face-to-face interviews were conducted by trained staffs with necessary medical knowledge. The mtDNA copy number was measured by a quantitative real-time PCR assay using DNA extracted from peripheral blood. No significant differences in mtDNA copy number were observed between hypertension patients and healthy controls. However, in both case and control groups, the mtDNA copy number was statistically significantly lower in the elder population (≥45 years old) compared with the younger subjects (<45 years old; 7.17 vs 6.64, P=0.005 and 7.21 vs 6.84, P=0.036). A significantly higher mtDNA copy number could be found in hypertension patients consuming alcohol regularly compared with no alcohol consumption patients (7.09 vs 6.69); mtDNA copy number was also positively correlated with age and alcohol consumption. Hypertension was found significantly correlated with factors such as age, work duration, monthly family income and drinking status. Our results suggest that the mtDNA copy number is not associated with hypertension in coal miners.
Highly sensitive detection of mutations in CHO cell recombinant DNA using multi-parallel single molecule real-time DNA sequencing.

PubMed

Cartwright, Joseph F; Anderson, Karin; Longworth, Joseph; Lobb, Philip; James, David C

2018-06-01

High-fidelity replication of biologic-encoding recombinant DNA sequences by engineered mammalian cell cultures is an essential pre-requisite for the development of stable cell lines for the production of biotherapeutics. However, immortalized mammalian cells characteristically exhibit an increased point mutation frequency compared to mammalian cells in vivo, both across their genomes and at specific loci (hotspots). Thus unforeseen mutations in recombinant DNA sequences can arise and be maintained within producer cell populations. These may affect both the stability of recombinant gene expression and give rise to protein sequence variants with variable bioactivity and immunogenicity. Rigorous quantitative assessment of recombinant DNA integrity should therefore form part of the cell line development process and be an essential quality assurance metric for instances where synthetic/multi-component assemblies are utilized to engineer mammalian cells, such as the assessment of recombinant DNA fidelity or the mutability of single-site integration target loci. Based on Pacific Biosciences (Menlo Park, CA) single molecule real-time (SMRT™) circular consensus sequencing (CCS) technology we developed a rDNA sequence analysis tool to process the multi-parallel sequencing of ∼40,000 single recombinant DNA molecules. After statistical filtering of raw sequencing data, we show that this analytical method is capable of detecting single point mutations in rDNA to a minimum single mutation frequency of 0.0042% (<1/24,000 bases). Using a stable CHO transfectant pool harboring a randomly integrated 5 kB plasmid construct encoding GFP we found that 28% of recombinant plasmid copies contained at least one low frequency (<0.3%) point mutation. These mutations were predominantly found in GC base pairs (85%) and that there was no positional bias in mutation across the plasmid sequence. There was no discernable difference between the mutation frequencies of coding and non-coding DNA. The putative ratio of non-synonymous and synonymous changes within the open reading frames (ORFs) in the plasmid sequence indicates that natural selection does not impact upon the prevalence of these mutations. Here we have demonstrated the abundance of mutations that fall outside of the reported range of detection of next generation sequencing (NGS) and second generation sequencing (SGS) platforms, providing a methodology capable of being utilized in cell line development platforms to identify the fidelity of recombinant genes throughout the production process. © 2018 Wiley Periodicals, Inc.
Comparison of repair of DNA double-strand breaks in identical sequences in primary human fibroblast and immortal hamster-human hybrid cells harboring a single copy of human chromosome 11

NASA Technical Reports Server (NTRS)

Fouladi, B.; Waldren, C. A.; Rydberg, B.; Cooper, P. K.; Chatterjee, A. (Principal Investigator)

2000-01-01

We have optimized a pulsed-field gel electrophoresis assay that measures induction and repair of double-strand breaks (DSBs) in specific regions of the genome (Lobrich et al., Proc. Natl. Acad. Sci. USA 92, 12050-12054, 1995). The increased sensitivity resulting from these improvements makes it possible to analyze the size distribution of broken DNA molecules immediately after the introduction of DSBs and after repair incubation. This analysis shows that the distribution of broken DNA pieces after exposure to sparsely ionizing radiation is consistent with the distribution expected from randomly induced DSBs. It is apparent from the distribution of rejoined DNA pieces after repair incubation that DNA ends continue to rejoin between 3 and 24 h postirradiation and that some of these rejoining events are in fact misrejoining events, since novel restriction fragments both larger and smaller than the original fragment are generated after repair. This improved assay was also used to study the kinetics of DSB rejoining and the extent of misrejoining in identical DNA sequences in human GM38 cells and human-hamster hybrid A(L) cells containing a single human chromosome 11. Despite the numerous differences between these cells, which include species and tissue of origin, levels of TP53, expression of telomerase, and the presence or absence of a homologous chromosome for the restriction fragments examined, the kinetics of rejoining of radiation-induced DSBs and the extent of misrejoining were similar in the two cell lines when studied in the G(1) phase of the cell cycle. Furthermore, DSBs were removed from the single-copy human chromosome in the hamster A(L) cells with similar kinetics and misrejoining frequency as at a locus on this hybrid's CHO chromosomes.
Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues.

PubMed Central

Prody, C A; Zevin-Sonkin, D; Gnatt, A; Goldberg, O; Soreq, H

1987-01-01

To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase (BtChoEase; EC 3.1.1.8) and Torpedo electric organ "true" acetylcholinesterase (AcChoEase; EC 3.1.1.7). Using these probes, we isolated several cDNA clones from lambda gt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. In RNA blots of poly(A)+ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These findings demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species. Images PMID:3035536
Reachability bounds for chemical reaction networks and strand displacement systems.

PubMed

Condon, Anne; Kirkpatrick, Bonnie; Maňuch, Ján

2014-01-01

Chemical reaction networks (CRNs) and DNA strand displacement systems (DSDs) are widely-studied and useful models of molecular programming. However, in order for some DSDs in the literature to behave in an expected manner, the initial number of copies of some reagents is required to be fixed. In this paper we show that, when multiple copies of all initial molecules are present, general types of CRNs and DSDs fail to work correctly if the length of the shortest sequence of reactions needed to produce any given molecule exceeds a threshold that grows polynomially with attributes of the system.
Dasytricha dominance in Surti buffalo rumen revealed by 18S rRNA sequences and real-time PCR assay.

PubMed

Singh, K M; Tripathi, A K; Pandya, P R; Rank, D N; Kothari, R K; Joshi, C G

2011-09-01

The genetic diversity of protozoa in Surti buffalo rumen was studied by amplified ribosomal DNA restriction analysis, 18S rDNA sequence homology and phylogenetic and Real-time PCR analysis methods. Three animals were fed diet comprised green fodder Napier bajra 21 (Pennisetum purpureum), mature pasture grass (Dicanthium annulatum) and concentrate mixture (20% crude protein, 65% total digestible nutrients). A protozoa-specific primer (P-SSU-342f) and a eukarya-specific primer (Medlin B) were used to amplify a 1,360 bp fragment of DNA encoding protozoal small subunit (SSU) ribosomal RNA from rumen fluid. A total of 91 clones were examined and identified 14 different 18S RNA sequences based on PCR-RFLP pattern. These 14 phylotypes were distributed into four genera-based 18S rDNA database sequences and identified as Dasytricha (57 clones), Isotricha (14 clones), Ostracodinium (11 clones) and Polyplastron (9 clones). Phylogenetic analyses were also used to infer the makeup of protozoa communities in the rumen of Surti buffalo. Out of 14 sequences, 8 sequences (69 clones) clustered with the Dasytricha ruminantium-like clone and 4 sequences (13 clones) were also phylogenetically placed with the Isotricha prostoma-like clone. Moreover, 2 phylotypes (9 clones) were related to Polyplastron multivesiculatum-like clone. In addition, the number of 18S rDNA gene copies of Dasytricha ruminantium (0.05% to ciliate protozoa) was higher than Entodinium sp. (2.0 × 10(5) vs. 1.3 × 10(4)) in per ml ruminal fluid.
Chromosome rearrangements via template switching between diverged repeated sequences

PubMed Central

Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.

2014-01-01

Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035
Low incidence of DNA sequence variation in human induced pluripotent stem cells generated by non-integrating plasmid expression

PubMed Central

Cheng, Linzhao; Hansen, Nancy F.; Zhao, Ling; Du, Yutao; Zou, Chunlin; Donovan, Frank X.; Chou, Bin-Kuan; Zhou, Guangyu; Li, Shijie; Dowey, Sarah N.; Ye, Zhaohui; Chandrasekharappa, Settara C.; Yang, Huanming; Mullikin, James C.; Liu, P. Paul

2012-01-01

Summary The utility of induced pluripotent stem cells (iPSCs) as models to study diseases and as sources for cell therapy depends on the integrity of their genomes. Despite recent publications of DNA sequence variations in the iPSCs, the true scope of such changes for the entire genome is not clear. Here we report the whole-genome sequencing of three human iPSC lines derived from two cell types of an adult donor by episomal vectors. The vector sequence was undetectable in the deeply sequenced iPSC lines. We identified 1058–1808 heterozygous single nucleotide variants (SNVs), but no copy number variants, in each iPSC line. Six to twelve of these SNVs were within coding regions in each iPSC line, but ~50% of them are synonymous changes and the remaining are not selectively enriched for known genes associated with cancers. Our data thus suggest that episome-mediated reprogramming is not inherently mutagenic during integration-free iPSC induction. PMID:22385660
Identification of duck plague virus by polymerase chain reaction.

PubMed

Hansen, W R; Brown, S E; Nashold, S W; Knudson, D L

1999-01-01

A polymerase chain reaction (PCR) assay was developed for detecting duck plague virus. A 765-bp EcoRI fragment cloned from the genome of the duck plague vaccine (DP-VAC) virus was sequenced for PCR primer development. The fragment sequence was found by GenBank alignment searches to be similar to the 3' ends of an undefined open reading frame and the gene for DNA polymerase protein in other herpesviruses. Three of four primers sets were found to be specific for the DP-VAC virus and 100% (7/7) of field isolates but did not amplify DNA from inclusion body disease of cranes virus. The specificity of one primer set was tested with genome templates from other avian herpesviruses, including those from a golden eagle, bald eagle, great horned owl, snowy owl, peregrine falcon, prairie falcon, pigeon, psittacine, and chicken (infectious laryngotracheitis), but amplicons were not produced. Hence, this PCR test is highly specific for duck plague virus DNA. Two primer sets were able to detect 1 fg of DNA from the duck plague vaccine strain, equivalent to five genome copies. In addition, the ratio of tissue culture infectious doses to genome copies of duck plague vaccine virus from infected duck embryo cells was determined to be 1:100, making the PCR assay 20 times more sensitive than tissue culture for detecting duck plague virus. The speed, sensitivity, and specificity of this PCR provide a greatly improved diagnostic and research tool for studying the epizootiology of duck plague.
Formation of rings from segments of HeLa-cell nuclear deoxyribonucleic acid

PubMed Central

Hardman, Norman

1974-01-01

Duplex segments of HeLa-cell nuclear DNA were generated by cleavage with DNA restriction endonuclease from Haemophilus influenzae. About 20–25% of the DNA segments produced, when partly degraded with exonuclease III and annealed, were found to form rings visible in the electron microscope. A further 5% of the DNA segments formed structures that were branched in configuration. Similar structures were generated from HeLa-cell DNA, without prior treatment with restriction endonuclease, when the complementary polynucleotide chains were exposed by exonuclease III action at single-chain nicks. After exposure of an average single-chain length of 1400 nucleotides per terminus at nicks in HeLa-cell DNA by exonuclease III, followed by annealing, the physical length of ring closures was estimated and found to be 0.02–0.1μm, or 50–300 base pairs. An almost identical distribution of lengths was recorded for the regions of complementary base sequence responsible for branch formation. It is proposed that most of the rings and branches are formed from classes of reiterated base sequence with an average length of 180 base pairs arranged intermittenly in HeLa-cell DNA. From the rate of formation of branched structures when HeLa-cell DNA segments were heat-denatured and annealed, it is estimated that the reiterated sequences are in families containing approximately 2400–24000 copies. ImagesPLATE 2PLATE 1 PMID:4462738

Determination of fetal DNA fraction from the plasma of pregnant women using sequence read counts.

PubMed

Kim, Sung K; Hannum, Gregory; Geis, Jennifer; Tynan, John; Hogg, Grant; Zhao, Chen; Jensen, Taylor J; Mazloom, Amin R; Oeth, Paul; Ehrich, Mathias; van den Boom, Dirk; Deciu, Cosmin

2015-08-01

This study introduces a novel method, referred to as SeqFF, for estimating the fetal DNA fraction in the plasma of pregnant women and to infer the underlying mechanism that allows for such statistical modeling. Autosomal regional read counts from whole-genome massively parallel single-end sequencing of circulating cell-free DNA (ccfDNA) from the plasma of 25 312 pregnant women were used to train a multivariate model. The pretrained model was then applied to 505 pregnant samples to assess the performance of SeqFF against known methodologies for fetal DNA fraction calculations. Pearson's correlation between chromosome Y and SeqFF for pregnancies with male fetuses from two independent cohorts ranged from 0.932 to 0.938. Comparison between a single-nucleotide polymorphism-based approach and SeqFF yielded a Pearson's correlation of 0.921. Paired-end sequencing suggests that shorter ccfDNA, that is, less than 150 bp in length, is nonuniformly distributed across the genome. Regions exhibiting an increased proportion of short ccfDNA, which are more likely of fetal origin, tend to provide more information in the SeqFF calculations. SeqFF is a robust and direct method to determine fetal DNA fraction. Furthermore, the method is applicable to both male and female pregnancies and can greatly improve the accuracy of noninvasive prenatal testing for fetal copy number variation. © 2015 John Wiley & Sons, Ltd.
Enzymatic repair of selected cross-linked homoduplex molecules enhances nuclear gene rescue from Pompeii and Herculaneum remains.

PubMed

Di Bernardo, Giovanni; Del Gaudio, Stefania; Cammarota, Marcella; Galderisi, Umberto; Cascino, Antonino; Cipollaro, Marilena

2002-02-15

Ancient DNA (aDNA) samples extracted from the bone remains of six equids buried by the Vesuvius eruption in 79 AD were investigated to test pre-amplification and enzymatic repair procedures designed to enhance the rescue of nuclear genes. The extracts, which proved all positive for Equidae mtDNA amplification, proved positive only four times out of 18 when tested for single-copy Equidae nuclear genes (epsilon globin, p53 and gamma interferon). Pre-amplification did not change the number of retrieved aDNA sequences but 10 times out of 14 enzymatic repair restored the amplifiability of the genes analysed, proving that repair increases the rate of successful rescue from 22 to alpha(lambda)mu(omicron)sigma(tau) 80%. These findings support the hypothesis that some of these cross-linked aDNA molecules, which are not completely separated when DNA is extracted under denaturing conditions, become homoduplex substrates for Pol I and/or T4 ligase action upon renaturation. aDNA authenticity is proved by the homology of the nucleotide sequences of loci tested to the corresponding modern Equidae sequences. Data also indicate that cross-linked homoduplex molecules selected by denaturation of the extract are repaired without any chimera formation. The general features of aDNA amplification with and without denaturation and enzymatic repair are discussed.
Enzymatic repair of selected cross-linked homoduplex molecules enhances nuclear gene rescue from Pompeii and Herculaneum remains

PubMed Central

Di Bernardo, Giovanni; Del Gaudio, Stefania; Cammarota, Marcella; Galderisi, Umberto; Cascino, Antonino; Cipollaro, Marilena

2002-01-01

Ancient DNA (aDNA) samples extracted from the bone remains of six equids buried by the Vesuvius eruption in 79 AD were investigated to test pre-amplification and enzymatic repair procedures designed to enhance the rescue of nuclear genes. The extracts, which proved all positive for Equidae mtDNA amplification, proved positive only four times out of 18 when tested for single-copy Equidae nuclear genes (ɛ globin, p53 and γ interferon). Pre-amplification did not change the number of retrieved aDNA sequences but 10 times out of 14 enzymatic repair restored the amplifiability of the genes analysed, proving that repair increases the rate of successful rescue from 22 to αλµοστ 80%. These findings support the hypothesis that some of these cross-linked aDNA molecules, which are not completely separated when DNA is extracted under denaturing conditions, become homoduplex substrates for Pol I and/or T4 ligase action upon renaturation. aDNA authenticity is proved by the homology of the nucleotide sequences of loci tested to the corresponding modern Equidae sequences. Data also indicate that cross-linked homoduplex molecules selected by denaturation of the extract are repaired without any chimera formation. The general features of aDNA amplification with and without denaturation and enzymatic repair are discussed. PMID:11842122
A method for release and multiple strand amplification of small quantities of DNA from endospores of the fastidious bacterium Pasteuria penetrans.

PubMed

Mauchline, T H; Mohan, S; Davies, K G; Schaff, J E; Opperman, C H; Kerry, B R; Hirsch, P R

2010-05-01

To establish a reliable protocol to extract DNA from Pasteuria penetrans endospores for use as template in multiple strand amplification, thus providing sufficient material for genetic analyses. To develop a highly sensitive PCR-based diagnostic tool for P. penetrans. An optimized method to decontaminate endospores, release and purify DNA enabled multiple strand amplification. DNA purity was assessed by cloning and sequencing gyrB and 16S rRNA gene fragments obtained from PCR using generic primers. Samples indicated to be 100%P. penetrans by the gyrB assay were estimated at 46% using the 16S rRNA gene. No bias was detected on cloning and sequencing 12 housekeeping and sporulation gene fragments from amplified DNA. The detection limit by PCR with Pasteuria-specific 16S rRNA gene primers following multiple strand amplification of DNA extracted using the method was a single endospore. Generation of large quantities DNA will facilitate genomic sequencing of P. penetrans. Apparent differences in sample purity are explained by variations in 16S rRNA gene copy number in Eubacteria leading to exaggerated estimations of sample contamination. Detection of single endospores will facilitate investigations of P. penetrans molecular ecology. These methods will advance studies on P. penetrans and facilitate research on other obligate and fastidious micro-organisms where it is currently impractical to obtain DNA in sufficient quantity and quality.
Modulation of Mitochondrial DNA Copy Number to Induce Hepatocytic Differentiation of Human Amniotic Epithelial Cells.

PubMed

Vaghjiani, Vijesh; Cain, Jason E; Lee, William; Vaithilingam, Vijayaganapathy; Tuch, Bernard E; St John, Justin C

2017-10-15

Mitochondrial deoxyribonucleic acid (mtDNA) copy number is tightly regulated during pluripotency and differentiation. There is increased demand of cellular adenosine triphosphate (ATP) during differentiation for energy-intensive cell types such as hepatocytes and neurons to meet the cell's functional requirements. During hepatocyte differentiation, mtDNA copy number should be synchronously increased to generate sufficient ATP through oxidative phosphorylation. Unlike bone marrow mesenchymal cells, mtDNA copy number failed to increase by 28 days of differentiation of human amniotic epithelial cells (hAEC) into hepatocyte-like cells (HLC) despite their expression of some end-stage hepatic markers. This was due to higher levels of DNA methylation at exon 2 of POLGA, the mtDNA-specific replication factor. Treatment with a DNA demethylation agent, 5-azacytidine, resulted in increased mtDNA copy number, reduced DNA methylation at exon 2 of POLGA, and reduced hepatic gene expression. Depletion of mtDNA followed by subsequent differentiation did not increase mtDNA copy number, but reduced DNA methylation at exon 2 of POLGA and increased expression of hepatic and pluripotency genes. We encapsulated hAEC in barium alginate microcapsules and subsequently differentiated them into HLC. Encapsulation resulted in no net increase of mtDNA copy number but a significant reduction in DNA methylation of POLGA. RNAseq analysis showed that differentiated HLC express hepatocyte-specific genes but also increased expression of inflammatory interferon genes. Differentiation in encapsulated cells showed suppression of inflammatory genes as well as increased expression of genes associated with hepatocyte function pathways and networks. This study demonstrates that an increase in classical hepatic gene expression can be achieved in HLC through encapsulation, although they fail to effectively regulate mtDNA copy number.
Complete Chloroplast Genome Sequences of Important Oilseed Crop Sesamum indicum L

PubMed Central

Yi, Dong-Keun; Kim, Ki-Joong

2012-01-01

Sesamum indicum is an important crop plant species for yielding oil. The complete chloroplast (cp) genome of S. indicum (GenBank acc no. JN637766) is 153,324 bp in length, and has a pair of inverted repeat (IR) regions consisting of 25,141 bp each. The lengths of the large single copy (LSC) and the small single copy (SSC) regions are 85,170 bp and 17,872 bp, respectively. Comparative cp DNA sequence analyses of S. indicum with other cp genomes reveal that the genome structure, gene order, gene and intron contents, AT contents, codon usage, and transcription units are similar to the typical angiosperm cp genomes. Nucleotide diversity of the IR region between Sesamum and three other cp genomes is much lower than that of the LSC and SSC regions in both the coding region and noncoding region. As a summary, the regional constraints strongly affect the sequence evolution of the cp genomes, while the functional constraints weakly affect the sequence evolution of cp genomes. Five short inversions associated with short palindromic sequences that form step-loop structures were observed in the chloroplast genome of S. indicum. Twenty-eight different simple sequence repeat loci have been detected in the chloroplast genome of S. indicum. Almost all of the SSR loci were composed of A or T, so this may also contribute to the A-T richness of the cp genome of S. indicum. Seven large repeated loci in the chloroplast genome of S. indicum were also identified and these loci are useful to developing S. indicum-specific cp genome vectors. The complete cp DNA sequences of S. indicum reported in this paper are prerequisite to modifying this important oilseed crop by cp genetic engineering techniques. PMID:22606240
Duplication in DNA Sequences

NASA Astrophysics Data System (ADS)

Ito, Masami; Kari, Lila; Kincaid, Zachary; Seki, Shinnosuke

The duplication and repeat-deletion operations are the basis of a formal language theoretic model of errors that can occur during DNA replication. During DNA replication, subsequences of a strand of DNA may be copied several times (resulting in duplications) or skipped (resulting in repeat-deletions). As formal language operations, iterated duplication and repeat-deletion of words and languages have been well studied in the literature. However, little is known about single-step duplications and repeat-deletions. In this paper, we investigate several properties of these operations, including closure properties of language families in the Chomsky hierarchy and equations involving these operations. We also make progress toward a characterization of regular languages that are generated by duplicating a regular language.
Human Hrs, a tyrosine kinase substrate in growth factor-stimulated cells: cDNA cloning and mapping of the gene to chromosome 17.

PubMed

Lu, L; Komada, M; Kitamura, N

1998-06-15

Hrs is a 115kDa zinc finger protein which is rapidly tyrosine phosphorylated in cells stimulated with various growth factors. We previously purified the protein from a mouse cell line and cloned its cDNA. In the present study, we cloned a human Hrs cDNA from a human placenta cDNA library by cross-hybridization, using the mouse cDNA as a probe, and determined its nucleotide sequence. The human Hrs cDNA encoded a 777-amino-acid protein whose sequence was 93% identical to that of mouse Hrs. Northern blot analysis showed that the Hrs mRNA was about 3.0kb long and was expressed in all the human adult and fetal tissues tested. In addition, we showed by genomic Southern blot analysis that the human Hrs gene was a single-copy gene with a size of about 20kb. Furthermore, the human Hrs gene was mapped to chromosome 17 by Southern blotting of genomic DNAs from human/rodent somatic cell hybrids. Copyright 1998 Elsevier Science B.V. All rights reserved.
Transforming single DNA molecules into fluorescent magnetic particles for detection and enumeration of genetic variations

PubMed Central

Dressman, Devin; Yan, Hai; Traverso, Giovanni; Kinzler, Kenneth W.; Vogelstein, Bert

2003-01-01

Many areas of biomedical research depend on the analysis of uncommon variations in individual genes or transcripts. Here we describe a method that can quantify such variation at a scale and ease heretofore unattainable. Each DNA molecule in a collection of such molecules is converted into a single magnetic particle to which thousands of copies of DNA identical in sequence to the original are bound. This population of beads then corresponds to a one-to-one representation of the starting DNA molecules. Variation within the original population of DNA molecules can then be simply assessed by counting fluorescently labeled particles via flow cytometry. This approach is called BEAMing on the basis of four of its principal components (beads, emulsion, amplification, and magnetics). Millions of individual DNA molecules can be assessed in this fashion with standard laboratory equipment. Moreover, specific variants can be isolated by flow sorting and used for further experimentation. BEAMing can be used for the identification and quantification of rare mutations as well as to study variations in gene sequences or transcripts in specific populations or tissues. PMID:12857956
Accurate measurement of transgene copy number in crop plants using droplet digital PCR.

PubMed

Collier, Ray; Dasgupta, Kasturi; Xing, Yan-Ping; Hernandez, Bryan Tarape; Shao, Min; Rohozinski, Dominica; Kovak, Emma; Lin, Jeanie; de Oliveira, Maria Luiza P; Stover, Ed; McCue, Kent F; Harmon, Frank G; Blechl, Ann; Thomson, James G; Thilmony, Roger

2017-06-01

Genetic transformation is a powerful means for the improvement of crop plants, but requires labor- and resource-intensive methods. An efficient method for identifying single-copy transgene insertion events from a population of independent transgenic lines is desirable. Currently, transgene copy number is estimated by either Southern blot hybridization analyses or quantitative polymerase chain reaction (qPCR) experiments. Southern hybridization is a convincing and reliable method, but it also is expensive, time-consuming and often requires a large amount of genomic DNA and radioactively labeled probes. Alternatively, qPCR requires less DNA and is potentially simpler to perform, but its results can lack the accuracy and precision needed to confidently distinguish between one- and two-copy events in transgenic plants with large genomes. To address this need, we developed a droplet digital PCR-based method for transgene copy number measurement in an array of crops: rice, citrus, potato, maize, tomato and wheat. The method utilizes specific primers to amplify target transgenes, and endogenous reference genes in a single duplexed reaction containing thousands of droplets. Endpoint amplicon production in the droplets is detected and quantified using sequence-specific fluorescently labeled probes. The results demonstrate that this approach can generate confident copy number measurements in independent transgenic lines in these crop species. This method and the compendium of probes and primers will be a useful resource for the plant research community, enabling the simple and accurate determination of transgene copy number in these six important crop species. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
Identification of genomic aberrations associated with lymph node metastasis in diffuse-type gastric cancer.

PubMed

Choi, Ji-Hye; Kim, Young-Bae; Ahn, Ji Mi; Kim, Min Jae; Bae, Won Jung; Han, Sang-Uk; Woo, Hyun Goo; Lee, Dakeun

2018-04-06

Diffuse-type gastric cancer (DGC) is a GC subtype with heterogeneous clinical outcomes. Lymph node metastasis of DGC heralds a dismal progression, which hampers the curative treatment of patients. However, the genomic heterogeneity of DGC remains unknown. To identify genomic variations associated with lymph node metastasis in DGC, we performed whole exome sequencing on 23 cases of DGC and paired non-tumor tissues and compared the mutation profiles according to the presence (N3, n = 13) or absence (N0, n = 10) of regional lymph node metastasis. Overall, we identified 185 recurrently mutated genes in DGC, which included a significant novel mutation at CMTM2, as well as previously known mutations at CDH1, RHOA, and TP53. Noticeably, CMTM2 expression could predict the prognostic outcomes of DGC but not intestinal-type GC (IGC), indicating pivotal roles of CMTM2 in DGC progression. In addition, we identified a recurrent loss of heterozygosity (LOH) of DNA copy numbers at the 3p12-pcen locus in DGC. A comparison of N0 and N3 tumors showed that N3 tumors exhibited more frequent DNA copy number aberrations, including copy-neutral LOH and mutations of CpTpT trinucleotides, than N0 tumors (P = 0.2 × 10 -3 ). In conclusion, DGCs have distinct profiles of somatic mutations and DNA copy numbers according to the status of lymph node metastasis, and this might be helpful in delineating the pathobiology of DGC.
Investigation of Human Cancers for Retrovirus by Low-Stringency Target Enrichment and High-Throughput Sequencing.

PubMed

Vinner, Lasse; Mourier, Tobias; Friis-Nielsen, Jens; Gniadecki, Robert; Dybkaer, Karen; Rosenberg, Jacob; Langhoff, Jill Levin; Cruz, David Flores Santa; Fonager, Jannik; Izarzugaza, Jose M G; Gupta, Ramneek; Sicheritz-Ponten, Thomas; Brunak, Søren; Willerslev, Eske; Nielsen, Lars Peter; Hansen, Anders Johannes

2015-08-19

Although nearly one fifth of all human cancers have an infectious aetiology, the causes for the majority of cancers remain unexplained. Despite the enormous data output from high-throughput shotgun sequencing, viral DNA in a clinical sample typically constitutes a proportion of host DNA that is too small to be detected. Sequence variation among virus genomes complicates application of sequence-specific, and highly sensitive, PCR methods. Therefore, we aimed to develop and characterize a method that permits sensitive detection of sequences despite considerable variation. We demonstrate that our low-stringency in-solution hybridization method enables detection of <100 viral copies. Furthermore, distantly related proviral sequences may be enriched by orders of magnitude, enabling discovery of hitherto unknown viral sequences by high-throughput sequencing. The sensitivity was sufficient to detect retroviral sequences in clinical samples. We used this method to conduct an investigation for novel retrovirus in samples from three cancer types. In accordance with recent studies our investigation revealed no retroviral infections in human B-cell lymphoma cells, cutaneous T-cell lymphoma or colorectal cancer biopsies. Nonetheless, our generally applicable method makes sensitive detection possible and permits sequencing of distantly related sequences from complex material.
Association of Cell-Free DNA Tumor Fraction and Somatic Copy Number Alterations With Survival in Metastatic Triple-Negative Breast Cancer

PubMed Central

Stover, Daniel G.; Parsons, Heather A.; Ha, Gavin; Freeman, Samuel S.; Barry, William T.; Guo, Hao; Choudhury, Atish D.; Gydush, Gregory; Reed, Sarah C.; Rhoades, Justin; Rotem, Denisse; Hughes, Melissa E.; Dillon, Deborah A.; Partridge, Ann H.; Wagle, Nikhil; Krop, Ian E.; Getz, Gad; Golub, Todd R.; Love, J. Christopher; Winer, Eric P.; Tolaney, Sara M.; Lin, Nancy U.

2018-01-01

Purpose Cell-free DNA (cfDNA) offers the potential for minimally invasive genome-wide profiling of tumor alterations without tumor biopsy and may be associated with patient prognosis. Triple-negative breast cancer (TNBC) is characterized by few mutations but extensive somatic copy number alterations (SCNAs), yet little is known regarding SCNAs in metastatic TNBC. We sought to evaluate SCNAs in metastatic TNBC exclusively via cfDNA and determine if cfDNA tumor fraction is associated with overall survival in metastatic TNBC. Patients and Methods In this retrospective cohort study, we identified 164 patients with biopsy-proven metastatic TNBC at a single tertiary care institution who received prior chemotherapy in the (neo)adjuvant or metastatic setting. We performed low-coverage genome-wide sequencing of cfDNA from plasma. Results Without prior knowledge of tumor mutations, we determined tumor fraction of cfDNA for 96.3% of patients and SCNAs for 63.9% of patients. Copy number profiles and percent genome altered were remarkably similar between metastatic and primary TNBCs. Certain SCNAs were more frequent in metastatic TNBCs relative to paired primary tumors and primary TNBCs in publicly available data sets The Cancer Genome Atlas and METABRIC, including chromosomal gains in drivers NOTCH2, AKT2, and AKT3. Prespecified cfDNA tumor fraction threshold of ≥ 10% was associated with significantly worse metastatic survival (median, 6.4 v 15.9 months) and remained significant independent of clinicopathologic factors (hazard ratio, 2.14; 95% CI, 1.4 to 3.8; P < .001). Conclusion We present the largest genomic characterization of metastatic TNBC to our knowledge, exclusively from cfDNA. Evaluation of cfDNA tumor fraction was feasible for nearly all patients, and tumor fraction ≥ 10% is associated with significantly worse survival in this large metastatic TNBC cohort. Specific SCNAs are enriched and prognostic in metastatic TNBC, with implications for metastasis, resistance, and novel therapeutic approaches. PMID:29298117
Association of Cell-Free DNA Tumor Fraction and Somatic Copy Number Alterations With Survival in Metastatic Triple-Negative Breast Cancer.

PubMed

Stover, Daniel G; Parsons, Heather A; Ha, Gavin; Freeman, Samuel S; Barry, William T; Guo, Hao; Choudhury, Atish D; Gydush, Gregory; Reed, Sarah C; Rhoades, Justin; Rotem, Denisse; Hughes, Melissa E; Dillon, Deborah A; Partridge, Ann H; Wagle, Nikhil; Krop, Ian E; Getz, Gad; Golub, Todd R; Love, J Christopher; Winer, Eric P; Tolaney, Sara M; Lin, Nancy U; Adalsteinsson, Viktor A

2018-02-20

Purpose Cell-free DNA (cfDNA) offers the potential for minimally invasive genome-wide profiling of tumor alterations without tumor biopsy and may be associated with patient prognosis. Triple-negative breast cancer (TNBC) is characterized by few mutations but extensive somatic copy number alterations (SCNAs), yet little is known regarding SCNAs in metastatic TNBC. We sought to evaluate SCNAs in metastatic TNBC exclusively via cfDNA and determine if cfDNA tumor fraction is associated with overall survival in metastatic TNBC. Patients and Methods In this retrospective cohort study, we identified 164 patients with biopsy-proven metastatic TNBC at a single tertiary care institution who received prior chemotherapy in the (neo)adjuvant or metastatic setting. We performed low-coverage genome-wide sequencing of cfDNA from plasma. Results Without prior knowledge of tumor mutations, we determined tumor fraction of cfDNA for 96.3% of patients and SCNAs for 63.9% of patients. Copy number profiles and percent genome altered were remarkably similar between metastatic and primary TNBCs. Certain SCNAs were more frequent in metastatic TNBCs relative to paired primary tumors and primary TNBCs in publicly available data sets The Cancer Genome Atlas and METABRIC, including chromosomal gains in drivers NOTCH2, AKT2, and AKT3. Prespecified cfDNA tumor fraction threshold of ≥ 10% was associated with significantly worse metastatic survival (median, 6.4 v 15.9 months) and remained significant independent of clinicopathologic factors (hazard ratio, 2.14; 95% CI, 1.4 to 3.8; P < .001). Conclusion We present the largest genomic characterization of metastatic TNBC to our knowledge, exclusively from cfDNA. Evaluation of cfDNA tumor fraction was feasible for nearly all patients, and tumor fraction ≥ 10% is associated with significantly worse survival in this large metastatic TNBC cohort. Specific SCNAs are enriched and prognostic in metastatic TNBC, with implications for metastasis, resistance, and novel therapeutic approaches.
Development of Conventional and Real-Time Quantitative PCR Assays for Diagnosis and Monitoring of Scabies

PubMed Central

Wong, Samson S. Y.; Poon, Rosana W. S.; Chau, Sandy; Wong, Sally C. Y.; To, Kelvin K. W.; Cheng, Vincent C. C.; Fung, Kitty S. C.

2015-01-01

Scabies remains the most prevalent, endemic, and neglected ectoparasitic infestation globally and can cause institutional outbreaks. The sensitivity of routine microscopy for demonstration of Sarcoptes scabiei mites or eggs in skin scrapings is only about 50%. Except for three studies using conventional or two-tube nested PCR on a small number of cases, no systematic study has been performed to improve the laboratory diagnosis of this important infection. We developed a conventional and a real-time quantitative PCR (qPCR) assay based on the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene of S. scabiei. The cox1 gene is relatively well conserved, with its sequence having no high levels of similarity to the sequences of other human skin mites, pathogenic zoonotic mites, or common house dust mite species. This mitochondrial gene is also present in large quantities in arthropod cells, potentially improving the sensitivity of a PCR-based assay. In our study, both assays were specific and were more sensitive than microscopy in diagnosing scabies, with positive and negative predictive values of 100%. The S. scabiei DNA copy number in the microscopy-positive specimens was significantly higher than that in the microscopy-negative specimens (median S. scabiei DNA copy number, 3.604 versus 2.457 log10 copies per reaction; P = 0.0213). In the patient with crusted scabies, the qPCR assay performed on lesional skin swabs instead of scrapings revealed that the parasite DNA load took about 2 weeks to become negative after treatment. The utility of using lesional skin swabs as an alternative sample for diagnosis of scabies by PCR should be further evaluated. PMID:25903566
Development of Conventional and Real-Time Quantitative PCR Assays for Diagnosis and Monitoring of Scabies.

PubMed

Wong, Samson S Y; Poon, Rosana W S; Chau, Sandy; Wong, Sally C Y; To, Kelvin K W; Cheng, Vincent C C; Fung, Kitty S C; Yuen, K Y

2015-07-01

Scabies remains the most prevalent, endemic, and neglected ectoparasitic infestation globally and can cause institutional outbreaks. The sensitivity of routine microscopy for demonstration of Sarcoptes scabiei mites or eggs in skin scrapings is only about 50%. Except for three studies using conventional or two-tube nested PCR on a small number of cases, no systematic study has been performed to improve the laboratory diagnosis of this important infection. We developed a conventional and a real-time quantitative PCR (qPCR) assay based on the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene of S. scabiei. The cox1 gene is relatively well conserved, with its sequence having no high levels of similarity to the sequences of other human skin mites, pathogenic zoonotic mites, or common house dust mite species. This mitochondrial gene is also present in large quantities in arthropod cells, potentially improving the sensitivity of a PCR-based assay. In our study, both assays were specific and were more sensitive than microscopy in diagnosing scabies, with positive and negative predictive values of 100%. The S. scabiei DNA copy number in the microscopy-positive specimens was significantly higher than that in the microscopy-negative specimens (median S. scabiei DNA copy number, 3.604 versus 2.457 log10 copies per reaction; P = 0.0213). In the patient with crusted scabies, the qPCR assay performed on lesional skin swabs instead of scrapings revealed that the parasite DNA load took about 2 weeks to become negative after treatment. The utility of using lesional skin swabs as an alternative sample for diagnosis of scabies by PCR should be further evaluated. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
The complete chloroplast DNA sequence of Eleutherococcus senticosus (Araliaceae); comparative evolutionary analyses with other three asterids.

PubMed

Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong

2012-05-01

This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.
Mitochondrial DNA Copy Number in Sleep Duration Discordant Monozygotic Twins

PubMed Central

Wrede, Joanna E.; Mengel-From, Jonas; Buchwald, Dedra; Vitiello, Michael V.; Bamshad, Michael; Noonan, Carolyn; Christiansen, Lene; Christensen, Kaare; Watson, Nathaniel F.

2015-01-01

Study Objectives: Mitochondrial DNA (mtDNA) copy number is an important component of mitochondrial function and varies with age, disease, and environmental factors. We aimed to determine whether mtDNA copy number varies with habitual differences in sleep duration within pairs of monozygotic twins. Setting: Academic clinical research center. Participants: 15 sleep duration discordant monozygotic twin pairs (30 twins, 80% female; mean age 42.1 years [SD 15.0]). Design: Sleep duration was phenotyped with wrist actigraphy. Each twin pair included a “normal” (7–9 h/24) and “short” (< 7 h/24) sleeping twin. Fasting peripheral blood leukocyte DNA was assessed for mtDNA copy number via the n-fold difference between qPCR measured mtDNA and nuclear DNA creating an mtDNA measure without absolute units. We used generalized estimating equation linear regression models accounting for the correlated data structure to assess within-pair effects of sleep duration on mtDNA copy number. Measurements and Results: Mean within-pair sleep duration difference per 24 hours was 94.3 minutes (SD 62.6 min). We found reduced sleep duration (β = 0.06; 95% CI 0.004, 0.12; P < 0.05) and sleep efficiency (β = 0.51; 95% CI 0.06, 0.95; P < 0.05) were significantly associated with reduced mtDNA copy number within twin pairs. Thus every 1-minute decrease in actigraphy-defined sleep duration was associated with a decrease in mtDNA copy number of 0.06. Likewise, a 1% decrease in actigraphy-defined sleep efficiency was associated with a decrease in mtDNA copy number of 0.51. Conclusions: Reduced sleep duration and sleep efficiency were associated with reduced mitochondrial DNA copy number in sleep duration discordant monozygotic twins offering a potential mechanism whereby short sleep impairs health and longevity through mitochondrial stress. Citation: Wrede JE, Mengel-From J, Buchwald D, Vitiello MV, Bamshad M, Noonan C, Christiansen L, Christensen K, Watson NF. Mitochondrial DNA copy number in sleep duration discordant monozygotic twins. SLEEP 2015;38(10):1655–1658. PMID:26039967
A paper and plastic device for the combined isothermal amplification and lateral flow detection of Plasmodium DNA.

PubMed

Cordray, Michael S; Richards-Kortum, Rebecca R

2015-11-26

Isothermal amplification techniques are emerging as a promising method for malaria diagnosis since they are capable of detecting extremely low concentrations of parasite target while mitigating the need for infrastructure and training required by other nucleic acid based tests. Recombinase polymerase amplification (RPA) is promising for further development since it operates in a short time frame (<30 min) and produces a product that can be visually detected on a lateral flow dipstick. A self-sealing paper and plastic system that performs both the amplification and detection of a malaria DNA sequence is presented. Primers were designed using the NCBI nBLAST tools and screened using gel electrophoresis. Paper and plastic devices were prototyped using commercial design software and parts were cut using a laser cutter and assembled by hand. Synthetic copies of the Plasmodium 18S gene were spiked into solution and used as targets for the RPA reaction. To test the performance of the device the same samples spiked with synthetic target were run in parallel both in the paper and plastic devices and using conventional bench top methods. Novel RPA primers were developed that bind to sequences present in the four species of Plasmodium which infect humans. The paper and plastic devices were found to be capable of detecting as few as 5 copies/µL of synthetic Plasmodium DNA (50 copies total), comparable to the same reaction run on the bench top. The devices produce visual results in an hour, cost approximately $1, and are self-contained once the device is sealed. The device was capable of carrying out the RPA reaction and detecting meaningful amounts of synthetic Plasmodium DNA in a self-sealing and self-contained device. This device may be a step towards making nucleic acid tests more accessible for malaria detection.
Gene capture from across the grass family in the allohexaploid Elymus repens (L.) Gould (Poaceae, Triticeae) as evidenced by ITS, GBSSI, and molecular cytogenetics.

PubMed

Mahelka, Václav; Kopecký, David

2010-06-01

Four accessions of hexaploid Elymus repens from its native Central European distribution area were analyzed using sequencing of multicopy (internal transcribed spacer, ITS) and single-copy (granule-bound starch synthase I, GBSSI) DNA in concert with genomic and fluorescent in situ hybridization (GISH and FISH) to disentangle its allopolyploid origin. Despite extensive ITS homogenization, nrDNA in E. repens allowed us to identify at least four distinct lineages. Apart from Pseudoroegneria and Hordeum, representing the major genome constituents, the presence of further unexpected alien genetic material, originating from species outside the Triticeae and close to Panicum (Paniceae) and Bromus (Bromeae), was revealed. GBSSI sequences provided information complementary to the ITS. Apart from Pseudoroegneria and Hordeum, two additional gene variants from within the Triticeae were discovered: One was Taeniatherum-like, but the other did not have a close relationship with any of the diploids sampled. GISH results were largely congruent with the sequence-based markers. GISH clearly confirmed Pseudoroegneria and Hordeum as major genome constituents and further showed the presence of a small chromosome segment corresponding to Panicum. It resided in the Hordeum subgenome and probably represents an old acquisition of a Hordeum progenitor. Spotty hybridization signals across all chromosomes after GISH with Taeniatherum and Bromus probes suggested that gene acquisition from these species is more likely due to common ancestry of the grasses or early introgression than to recent hybridization or allopolyploid origin of E. repens. Physical mapping of rDNA loci using FISH revealed that all rDNA loci except one minor were located on Pseudoroegneria-derived chromosomes, which suggests the loss of all Hordeum-derived loci but one. Because homogenization mechanisms seem to operate effectively among Pseudoroegneria-like copies in this species, incomplete ITS homogenization in our samples is probably due to an interstitial position of an individual minor rDNA locus located within the Hordeum-derived subgenome.

Recombinant SINEs are formed at high frequency during induced retrotransposition in vivo.

PubMed

Yadav, Vijay Pal; Mandal, Prabhat Kumar; Bhattacharya, Alok; Bhattacharya, Sudha

2012-05-22

Non-long terminal repeat Retrotransposons are referred to as long interspersed nuclear elements (LINEs) and their non-autonomous partners are short interspersed nuclear elements (SINEs). It is believed that an active SINE copy, upon retrotransposition, generates near identical copies of itself, which subsequently accumulate mutations resulting in sequence polymorphism. Here we show that when a retrotransposition-competent cell line of the parasitic protist Entamoeba histolytica, transfected with a marked SINE copy, is induced to retrotranspose, >20% of the newly retrotransposed copies are neither identical to the marked SINE nor to the mobilized resident SINEs. Rather they are recombinants of resident SINEs and the marked SINE. They are a consequence of retrotransposition and not DNA recombination, as they are absent in cells not expressing the retrotransposition functions. This high-frequency recombination provides a new explanation for the existence of mosaic SINEs, which may impact on genetic analysis of SINE lineages, and measurement of phylogenetic distances.
Mitochondrial DNA in Residual Leukemia Cells in Cerebrospinal Fluid in Children with Acute Lymphoblastic Leukemia

PubMed Central

Egan, Kathryn; Kusao, Ian; Troelstrup, David; Agsalda, Melissa; Shiramizu, Bruce

2010-01-01

This feasibility study was designed to assess the ability to measure mitochondrial DNA (mtDNA) in cerebrospinal fluid (CSF) cells that contributed to minimal disease/persistent or residual disease (MD/PRD) from children with acute lymphoblastic leukemia (ALL). Increase in mtDNA copies in cancer cells has been suggested to play a role in MD/PRD. CSF as well as blood specimens from 6 children were assayed for MD/PRD and mtDNA copy numbers by quantitative real-time polymerase chain reaction. Of 7 MD/PRD-positive specimens, 6 had increased mtDNA copy numbers; while 11 MD/PRD-negative specimens had no increase in mtDNA copy numbers, p < 0.003. This is the first proof-of-concept study to measure mtDNA copy numbers in MD/PRD-positive CSF specimens from children with ALL. Increase of mtDNA copy numbers in MD/PRD childhood ALL cells and its significance as a mechanism for recurrence requires further investigation. Keywords Minimal residual disease; Acute lymphoblastic leukemia; Central nervous system; Cerebrospinal fluid; Mitochondria PMID:21331151
Peripheral artery disease, calf skeletal muscle mitochondrial DNA copy number, and functional performance.

PubMed

McDermott, Mary M; Peterson, Charlotte A; Sufit, Robert; Ferrucci, Luigi; Guralnik, Jack M; Kibbe, Melina R; Polonsky, Tamar S; Tian, Lu; Criqui, Michael H; Zhao, Lihui; Stein, James H; Li, Lingyu; Leeuwenburgh, Christiaan

2018-05-01

In people without lower extremity peripheral artery disease (PAD), mitochondrial DNA copy number declines with aging, and this decline is associated with declines in mitochondrial activity and functional performance. However, whether lower extremity ischemia is associated with lower mitochondrial DNA copy number and whether mitochondrial DNA copy number is associated with the degree of functional impairment in people with PAD is unknown. In people with and without PAD, age 65 years and older, we studied associations of the ankle-brachial index (ABI) with mitochondrial DNA copy number and associations of mitochondrial DNA copy number with functional impairment. Calf muscle biopsies were obtained from 34 participants with PAD (mean age: 73.5 years (SD 6.4), mean ABI: 0.67 (SD 0.15), mean 6-minute walk distance: 1191 feet (SD 223)) and 10 controls without PAD (mean age: 73.1 years (SD 4.7), mean ABI: 1.14 (SD 0.07), mean 6-minute walk distance: 1387 feet (SD 488)). Adjusting for age and sex, lower ABI values were associated with higher mitochondrial DNA copy number, measured in relative copy number (ABI<0.60: 914, ABI 0.60-0.90: 731, ABI 0.90-1.50: 593; p trend=0.016). The association of mitochondrial DNA copy number with the 6-minute walk distance and 4-meter walking velocity differed significantly between participants with versus without PAD ( p-value for interaction=0.001 and p=0.015, respectively). The correlation coefficient between mitochondrial DNA copy number and the 6-minute walk distance was 0.653 ( p=0.056) among people without PAD and -0.254 ( p=0.154) among people with PAD and ABI < 0.90. In conclusion, lower ABI values are associated with increased mitochondrial DNA copy number. Associations of mitochondrial DNA copy number with the 6-minute walk distance and 4-meter walking velocity significantly differed between people with versus without PAD, with stronger positive associations observed in people without PAD than in people with PAD. The cross-sectional and exploratory nature of the analyses precludes conclusions regarding causal inferences. ClinicalTrials.gov Identifier: NCT02246660.
Molecular genetic characterization of the RD-114 gene family of endogenous feline retroviral sequences.

PubMed Central

Reeves, R H; O'Brien, S J

1984-01-01

RD-114 is a replication-competent, xenotropic retrovirus which is homologous to a family of moderately repetitive DNA sequences present at ca. 20 copies in the normal cellular genome of domestic cats. To examine the extent and character of genomic divergence of the RD-114 gene family as well as to assess their positional association within the cat genome, we have prepared a series of molecular clones of endogenous RD-114 DNA segments from a genomic library of cat cellular DNA. Their restriction endonuclease maps were compared with each other as well as to that of the prototype-inducible RD-114 which was molecularly cloned from a chronically infected human cell line. The endogenous sequences analyzed were similar to each other in that they were colinear with RD-114 proviral DNA, were bounded by long terminal redundancies, and conserved many restriction sites in the gag and pol regions. However, the env regions of many of the sequences examined were substantially deleted. Several of the endogenous RD-114 genomes contained a novel envelope sequence which was unrelated to the env gene of the prototype RD-114 env gene but which, like RD-114 and endogenous feline leukemia virus provirus, was found only in species of the genus Felis, and not in other closely related Felidae genera. The endogenous RD-114 sequences each had a distinct cellular flank which indicates that these sequences are not tandem but dispersed nonspecifically throughout the genome. Southern analysis of cat cellular DNA confirmed the conclusions about conserved restriction sites in endogenous sequences and indicated that a single locus may be responsible for the production of the major inducible form of RD-114. Images PMID:6090693
Improving accuracy of DNA diet estimates using food tissue control materials and an evaluation of proxies for digestion bias.

PubMed

Thomas, Austen C; Jarman, Simon N; Haman, Katherine H; Trites, Andrew W; Deagle, Bruce E

2014-08-01

Ecologists are increasingly interested in quantifying consumer diets based on food DNA in dietary samples and high-throughput sequencing of marker genes. It is tempting to assume that food DNA sequence proportions recovered from diet samples are representative of consumer's diet proportions, despite the fact that captive feeding studies do not support that assumption. Here, we examine the idea of sequencing control materials of known composition along with dietary samples in order to correct for technical biases introduced during amplicon sequencing and biological biases such as variable gene copy number. Using the Ion Torrent PGM(©) , we sequenced prey DNA amplified from scats of captive harbour seals (Phoca vitulina) fed a constant diet including three fish species in known proportions. Alongside, we sequenced a prey tissue mix matching the seals' diet to generate tissue correction factors (TCFs). TCFs improved the diet estimates (based on sequence proportions) for all species and reduced the average estimate error from 28 ± 15% (uncorrected) to 14 ± 9% (TCF-corrected). The experimental design also allowed us to infer the magnitude of prey-specific digestion biases and calculate digestion correction factors (DCFs). The DCFs were compared with possible proxies for differential digestion (e.g. fish protein%, fish lipid%) revealing a strong relationship between the DCFs and percent lipid of the fish prey, suggesting prey-specific corrections based on lipid content would produce accurate diet estimates in this study system. These findings demonstrate the value of parallel sequencing of food tissue mixtures in diet studies and offer new directions for future research in quantitative DNA diet analysis. © 2013 John Wiley & Sons Ltd.
Coupling Spore Traps and Quantitative PCR Assays for Detection of the Downy Mildew Pathogens of Spinach (Peronospora effusa) and Beet (P. schachtii)

PubMed Central

Klosterman, Steven J.; Anchieta, Amy; McRoberts, Neil; Koike, Steven T.; Subbarao, Krishna V.; Voglmayr, Hermann; Choi, Young-Joon; Thines, Marco; Martin, Frank N.

2016-01-01

Downy mildew of spinach (Spinacia oleracea), caused by Peronospora effusa, is a production constraint on production worldwide, including in California, where the majority of U.S. spinach is grown. The aim of this study was to develop a real-time quantitative polymerase chain reaction (qPCR) assay for detection of airborne inoculum of P. effusa in California. Among oomycete ribosomal DNA (rDNA) sequences examined for assay development, the highest nucleotide sequence identity was observed between rDNA sequences of P. effusa and P. schachtii, the cause of downy mildew on sugar beet and Swiss chard in the leaf beet group (Beta vulgaris subsp. vulgaris). Single-nucleotide polymorphisms were detected between P. effusa and P. schachtii in the 18S rDNA regions for design of P. effusa- and P. schachtii-specific TaqMan probes and reverse primers. An allele-specific probe and primer amplification method was applied to determine the frequency of both P. effusa and P. schachtii rDNA target sequences in pooled DNA samples, enabling quantification of rDNA of P. effusa from impaction spore trap samples collected from spinach production fields. The rDNA copy numbers of P. effusa were, on average, ≈3,300-fold higher from trap samples collected near an infected field compared with those levels recorded at a site without a nearby spinach field. In combination with disease-conducive weather forecasting, application of the assays may be helpful to time fungicide applications for disease management. PMID:24964150
Comparison of cancer-associated genetic abnormalities in columnar-lined esophagus tissues with and without goblet cells.

PubMed

Bandla, Santhoshi; Peters, Jeffrey H; Ruff, David; Chen, Shiaw-Min; Li, Chieh-Yuan; Song, Kunchang; Thoms, Kimberly; Litle, Virginia R; Watson, Thomas; Chapurin, Nikita; Lada, Michal; Pennathur, Arjun; Luketich, James D; Peterson, Derick; Dulak, Austin; Lin, Lin; Bass, Adam; Beer, David G; Godfrey, Tony E; Zhou, Zhongren

2014-07-01

To determine and compare the frequency of cancer-associated genetic abnormalities in esophageal metaplasia biopsies with and without goblet cells. Barrett's esophagus is associated with increased risk of esophageal adenocarcinoma (EAC), but the appropriate histologic definition of Barrett's esophagus is debated. Intestinal metaplasia (IM) is defined by the presence of goblet cells whereas nongoblet cell metaplasia (NGM) lacks goblet cells. Both have been implicated in EAC risk but this is controversial. Although IM is known to harbor genetic changes associated with EAC, little is known about NGM. We hypothesized that if NGM and IM infer similar EAC risk, then they would harbor similar genetic aberrations in genes associated with EAC. Ninety frozen NGM, IM, and normal tissues from 45 subjects were studied. DNA copy number abnormalities were identified using microarrays and fluorescence in situ hybridization. Targeted sequencing of all exons from 20 EAC-associated genes was performed on metaplasia biopsies using Ion AmpliSeq DNA sequencing. Frequent copy number abnormalities targeting cancer-associated genes were found in IM whereas no such changes were observed in NGM. In 1 subject, fluorescence in situ hybridization confirmed loss of CDKN2A and amplification of chromosome 8 in IM but not in a nearby NGM biopsy. Targeted sequencing revealed 11 nonsynonymous mutations in 16 IM samples and 2 mutations in 19 NGM samples. This study reports the largest and most comprehensive comparison of DNA aberrations in IM and NGM genomes. Our results show that IM has a much higher frequency of cancer-associated mutations than NGM.
Precursors of vertebrate peptide antibiotics dermaseptin b and adenoregulin have extensive sequence identities with precursors of opioid peptides dermorphin, dermenkephalin, and deltorphins.

PubMed

Amiche, M; Ducancel, F; Mor, A; Boulain, J C; Menez, A; Nicolas, P

1994-07-08

The dermaseptins are a family of broad spectrum antimicrobial peptides, 27-34 amino acids long, involved in the defense of the naked skin of frogs against microbial invasion. They are the first vertebrate peptides to show lethal effects against the filamentous fungi responsible for severe opportunistic infections accompanying immunodeficiency syndrome and the use of immunosuppressive agents. A cDNA library was constructed from skin poly(A+) RNA of the arboreal frog Phyllomedusa bicolor and screened with an oligonucleotide probe complementary to the COOH terminus of dermaseptin b. Several clones contained a full-length DNA copy of a 443-nucleotide mRNA that encoded a 78-residue dermaseptin b precursor protein. The deduced precursor contained a putative signal sequence at the NH2 terminus, a 20-residue spacer sequence extremely rich (60%) in glutamic and aspartic acids, and a single copy of a dermaseptin b progenitor sequence at the COOH terminus. One clone contained a complete copy of adenoregulin, a 33-residue peptide reported to enhance the binding of agonists to the A1 adenosine receptor. The mRNAs encoding adenoregulin and dermaseptin b were very similar: 70 and 75% nucleotide identities between the 5'- and 3'-untranslated regions, respectively; 91% amino acid identity between the signal peptides; 82% identity between the acidic spacer sequences; and 38% identity between adenoregulin and dermaseptin b. Because adenoregulin and dermaseptin b have similar precursor designs and antimicrobial spectra, adenoregulin should be considered as a new member of the dermaseptin family and alternatively named dermaseptin b II. Preprodermaseptin b and preproadenoregulin have considerable sequence identities to the precursors encoding the opioid heptapeptides dermorphin, dermenkephalin, and deltorphins. This similarity extended into the 5'-untranslated regions of the mRNAs. These findings suggest that the genes encoding the four preproproteins are all members of the same family despite the fact that they encode end products having very different biological activities. These genes might contain a homologous export exon comprising the 5'-untranslated region, the 22-residue signal peptide, the 20-24-residue acidic spacer, and the basic pair Lys-Arg.
Evaluation of the X-Linked High-Grade Myopia Locus (MYP1) with Cone Dysfunction and Color Vision Deficiencies

PubMed Central

Metlapally, Ravikanth; Michaelides, Michel; Bulusu, Anuradha; Li, Yi-Ju; Schwartz, Marianne; Rosenberg, Thomas; Hunt, David M.; Moore, Anthony T.; Züchner, Stephan; Rickman, Catherine Bowes; Young, Terri L.

2014-01-01

Purpose X-linked high myopia with mild cone dysfunction and color vision defects has been mapped to chromosome Xq28 (MYP1 locus). CXorf2/TEX28 is a nested, intercalated gene within the red-green opsin cone pigment gene tandem array on Xq28. The authors investigated whether TEX28 gene alterations were associated with the Xq28-linked myopia phenotype. Genomic DNA from five pedigrees (with high myopia and either protanopia or deuteranopia) that mapped to Xq28 were screened for TEX28 copy number variations (CNVs) and sequence variants. Methods To examine for CNVs, ultra-high resolution array-comparative genomic hybridization (array-CGH) assays were performed comparing the subject genomic DNA with control samples (two pairs from two pedigrees). Opsin or TEX28 gene-targeted quantitative real-time gene expression assays (comparative CT method) were performed to validate the array-CGH findings. All exons of TEX28, including intron/exon boundaries, were amplified and sequenced using standard techniques. Results Array-CGH findings revealed predicted duplications in affected patient samples. Although only three copies of TEX28 were previously reported within the opsin array, quantitative real-time analysis of the TEX28 targeted assay of affected male or carrier female individuals in these pedigrees revealed either fewer (one) or more (four or five) copies than did related and control unaffected individuals. Sequence analysis of TEX28 did not reveal any variants associated with the disease status. Conclusions CNVs have been proposed to play a role in disease inheritance and susceptibility as they affect gene dosage. TEX28 gene CNVs appear to be associated with the MYP1 X-linked myopia phenotypes. PMID:19098318
DNA methylation-based reclassification of olfactory neuroblastoma.

PubMed

Capper, David; Engel, Nils W; Stichel, Damian; Lechner, Matt; Glöss, Stefanie; Schmid, Simone; Koelsche, Christian; Schrimpf, Daniel; Niesen, Judith; Wefers, Annika K; Jones, David T W; Sill, Martin; Weigert, Oliver; Ligon, Keith L; Olar, Adriana; Koch, Arend; Forster, Martin; Moran, Sebastian; Tirado, Oscar M; Sáinz-Japeado, Miguel; Mora, Jaume; Esteller, Manel; Alonso, Javier; Del Muro, Xavier Garcia; Paulus, Werner; Felsberg, Jörg; Reifenberger, Guido; Glatzel, Markus; Frank, Stephan; Monoranu, Camelia M; Lund, Valerie J; von Deimling, Andreas; Pfister, Stefan; Buslei, Rolf; Ribbat-Idel, Julika; Perner, Sven; Gudziol, Volker; Meinhardt, Matthias; Schüller, Ulrich

2018-05-05

Olfactory neuroblastoma/esthesioneuroblastoma (ONB) is an uncommon neuroectodermal neoplasm thought to arise from the olfactory epithelium. Little is known about its molecular pathogenesis. For this study, a retrospective cohort of n = 66 tumor samples with the institutional diagnosis of ONB was analyzed by immunohistochemistry, genome-wide DNA methylation profiling, copy number analysis, and in a subset, next-generation panel sequencing of 560 tumor-associated genes. DNA methylation profiles were compared to those of relevant differential diagnoses of ONB. Unsupervised hierarchical clustering analysis of DNA methylation data revealed four subgroups among institutionally diagnosed ONB. The largest group (n = 42, 64%, Core ONB) presented with classical ONB histology and no overlap with other classes upon methylation profiling-based t-distributed stochastic neighbor embedding (t-SNE) analysis. A second DNA methylation group (n = 7, 11%) with CpG island methylator phenotype (CIMP) consisted of cases with strong expression of cytokeratin, no or scarce chromogranin A expression and IDH2 hotspot mutation in all cases. T-SNE analysis clustered these cases together with sinonasal carcinoma with IDH2 mutation. Four cases (6%) formed a small group characterized by an overall high level of DNA methylation, but without CIMP. The fourth group consisted of 13 cases that had heterogeneous DNA methylation profiles and strong cytokeratin expression in most cases. In t-SNE analysis, these cases mostly grouped among sinonasal adenocarcinoma, squamous cell carcinoma, and undifferentiated carcinoma. Copy number analysis indicated highly recurrent chromosomal changes among Core ONB with a high frequency of combined loss of chromosome 1-4, 8-10, and 12. NGS sequencing did not reveal highly recurrent mutations in ONB, with the only recurrently mutated genes being TP53 and DNMT3A. In conclusion, we demonstrate that institutionally diagnosed ONB are a heterogeneous group of tumors. Expression of cytokeratin, chromogranin A, the mutational status of IDH2 as well as DNA methylation patterns may greatly aid in the precise classification of ONB.
Capsicum annuum dehydrin, an osmotic-stress gene in hot pepper plants.

PubMed

Chung, Eunsook; Kim, Soo-Yong; Yi, So Young; Choi, Doil

2003-06-30

Osmotic stress-related genes were selected from an EST database constructed from 7 cDNA libraries from different tissues of the hot pepper. A full-length cDNA of Capsicum annuum dehydrin (Cadhn), a late embryogenesis abundant (lea) gene, was selected from the 5' single pass sequenced cDNA clones and sequenced. The deduced polypeptide has 87% identity with potato dehydrin C17, but very little identity with the dehydrin genes of other organisms. It contains a serine-tract (S-segment) and 3 conserved lysine-rich domains (K-segments). Southern blot analysis showed that 2 copies are present in the hot pepper genome. Cadhn was induced by osmotic stress in leaf tissues as well as by the application of abscisic acid. The RNA was most abundant in green fruit. The expression of several osmotic stress-related genes was examined and Cadhn proved to be the most abundantly expressed of these in response to osmotic stress.
Molecular cloning and expression of the gene encoding the kinetoplast-associated type II DNA topoisomerase of Crithidia fasciculata.

PubMed

Pasion, S G; Hines, J C; Aebersold, R; Ray, D S

1992-01-01

A type II DNA topoisomerase, topoIImt, was shown previously to be associated with the kinetoplast DNA of the trypanosomatid Crithidia fasciculata. The gene encoding this kinetoplast-associated topoisomerase has been cloned by immunological screening of a Crithidia genomic expression library with monoclonal antibodies raised against the purified enzyme. The gene CfaTOP2 is a single copy gene and is expressed as a 4.8-kb polyadenylated transcript. The nucleotide sequence of CfaTOP2 has been determined and encodes a predicted polypeptide of 1239 amino acids with a molecular mass of 138,445. The identification of the cloned gene is supported by immunoblot analysis of the beta-galactosidase-CfaTOP2 fusion protein expressed in Escherichia coli and by analysis of tryptic peptide sequences derived from purified topoIImt. CfaTOP2 shares significant homology with nuclear type II DNA topoisomerases of other eukaryotes suggesting that in Crithidia both nuclear and mitochondrial forms of topoisomerase II are encoded by the same gene.
Concerted evolution at the population level: pupfish HindIII satellite DNA sequences.

PubMed Central

Elder, J F; Turner, B J

1994-01-01

The canonical monomers (approximately 170 bp) of an abundant (1.9 x 10(6) copies per diploid genome) satellite DNA sequence family in the genome of Cyprinodon variegatus, a "pupfish" that ranges along the Atlantic coast from Cape Cod to central Mexico, are divergent in base sequence in 10 of 12 samples collected from natural populations. The divergence involves substitutions, deletions, and insertions, is marked in scope (mean pairwise sequence similarity = 61.6%; range = 35-95.9%), is largely confined to the 3' half of the monomer, and is not correlated with the distance among collecting sites. Repetitive cloning and direct genomic sequencing experiments failed to detect intrapopulation and intraindividual variation, suggesting high levels of sequence homogeneity within populations. The satellite sequence has therefore undergone "concerted evolution," at the level of the local population. Concerted evolution has previously almost always been discussed in terms of the divergence of species or higher taxa; its intraspecific occurrence apparently has not been reported previously. The generality of the observation is difficult to evaluate, for although satellite DNAs from a large number of organisms have been studied in detail, there appear to be little or no other data on their sequence variation in natural populations. The relationship (if any) between concerted, population level, satellite DNA divergence and the extent of gene flow/genetic isolation among conspecific natural populations remains to be established. Images PMID:8302879
A feasibility study of colorectal cancer diagnosis via circulating tumor DNA derived CNV detection.

PubMed

Molparia, Bhuvan; Oliveira, Glenn; Wagner, Jennifer L; Spencer, Emily G; Torkamani, Ali

2018-01-01

Circulating tumor DNA (ctDNA) has shown great promise as a biomarker for early detection of cancer. However, due to the low abundance of ctDNA, especially at early stages, it is hard to detect at high accuracies while keeping sequencing costs low. Here we present a pilot stage study to detect large scale somatic copy numbers variations (CNVs), which contribute more molecules to ctDNA signal compared to point mutations, via cell free DNA sequencing. We show that it is possible to detect somatic CNVs in early stage colorectal cancer (CRC) patients and subsequently discriminate them from normal patients. With 25 normal and 24 CRC samples, we achieve 100% specificity (lower bound confidence interval: 86%) and ~79% sensitivity (95% confidence interval: 63% - 95%,), though the performance should be considered with caution given the limited sample size. We report a lack of concordance between the CNVs detected via cfDNA sequencing and CNVs identified in parent tissue samples. However, recent findings suggest that a lack of concordance is expected for CNVs in CRC because of their sub-clonal nature. Finally, the CNVs we detect very likely contribute to cancer progression as they lie in functionally important regions, and have been shown to be associated with CRC specifically. This study paves the path for a larger scale exploration of the potential of CNV detection for both diagnoses and prognoses of cancer.
Mapping copy number variation by population-scale genome sequencing.

PubMed

Mills, Ryan E; Walter, Klaudia; Stewart, Chip; Handsaker, Robert E; Chen, Ken; Alkan, Can; Abyzov, Alexej; Yoon, Seungtai Chris; Ye, Kai; Cheetham, R Keira; Chinwalla, Asif; Conrad, Donald F; Fu, Yutao; Grubert, Fabian; Hajirasouliha, Iman; Hormozdiari, Fereydoun; Iakoucheva, Lilia M; Iqbal, Zamin; Kang, Shuli; Kidd, Jeffrey M; Konkel, Miriam K; Korn, Joshua; Khurana, Ekta; Kural, Deniz; Lam, Hugo Y K; Leng, Jing; Li, Ruiqiang; Li, Yingrui; Lin, Chang-Yun; Luo, Ruibang; Mu, Xinmeng Jasmine; Nemesh, James; Peckham, Heather E; Rausch, Tobias; Scally, Aylwyn; Shi, Xinghua; Stromberg, Michael P; Stütz, Adrian M; Urban, Alexander Eckehart; Walker, Jerilyn A; Wu, Jiantao; Zhang, Yujun; Zhang, Zhengdong D; Batzer, Mark A; Ding, Li; Marth, Gabor T; McVean, Gil; Sebat, Jonathan; Snyder, Michael; Wang, Jun; Ye, Kenny; Eichler, Evan E; Gerstein, Mark B; Hurles, Matthew E; Lee, Charles; McCarroll, Steven A; Korbel, Jan O

2011-02-03

Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.
Expression of the Pasteurella haemolytica leukotoxin is inhibited by a locus that encodes an ATP-binding cassette homolog.

PubMed Central

Highlander, S K; Wickersham, E A; Garza, O; Weinstock, G M

1993-01-01

Multicopy and single-copy chromosomal fusions between the Pasteurella haemolytica leukotoxin regulatory region and the Escherichia coli beta-galactosidase gene have been constructed. These fusions were used as reporters to identify and isolate regulators of leukotoxin expression from a P. haemolytica cosmid library. A cosmid clone, which inhibited leukotoxin expression from multicopy and single-copy protein fusions, was isolated and found to contain the complete leukotoxin gene cluster plus additional upstream sequences. The locus responsible for inhibition of expression from leukotoxin-beta-galactosidase fusions was mapped within these upstream sequences, by transposon mutagenesis with Tn5, and its DNA sequence was determined. The inhibitory activity was found to be associated with a predicted 440-amino-acid reading frame (lapA) that lies within a four-gene arginine transport locus. LapA is predicted to be the nucleotide-binding component of this transport system and shares homology with the Clp family of proteases. Images PMID:8359916
UV Decontamination of MDA Reagents for Single Cell Genomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, Janey; Tighe, Damon; Sczyrba, Alexander

2011-03-18

Single cell genomics, the amplification and sequencing of genomes from single cells, can provide a glimpse into the genetic make-up and thus life style of the vast majority of uncultured microbial cells, making it an immensely powerful and increasingly popular tool. This is accomplished by use of multiple displacement amplification (MDA), which can generate billions of copies of a single bacterial genome producing microgram-range DNA required for shotgun sequencing. Here, we address a key challenge inherent to this approach and propose a solution for the improved recovery of single cell genomes. While DNA-free reagents for the amplification of a singlemore » cell genome are a prerequisite for successful single cell sequencing and analysis, DNA contamination has been detected in various reagents, which poses a considerable challenge. Our study demonstrates the effect of UV irradiation in efficient elimination of exogenous contaminant DNA found in MDA reagents, while maintaining Phi29 activity. Consequently, we also find that increased UV exposure to Phi29 does not adversely affect genome coverage of MDA amplified single cells. While additional challenges in single cell genomics remain to be resolved, the proposed methodology is relatively quick and simple and we believe that its application will be of high value for future single cell sequencing projects.« less
Evaluation of Novel Broad-Range Real-Time PCR Assay for Rapid Detection of Human Pathogenic Fungi in Various Clinical Specimens▿

PubMed Central

Vollmer, Tanja; Störmer, Melanie; Kleesiek, Knut; Dreier, Jens

2008-01-01

In the present study, a novel broad-range real-time PCR was developed for the rapid detection of human pathogenic fungi. The assay targets a part of the 28S large-subunit ribosomal RNA (rDNA) gene. We investigated its application for the most important human pathogenic fungal genera, including Aspergillus, Candida, Cryptococcus, Mucor, Penicillium, Pichia, Microsporum, Trichophyton, and Scopulariopsis. Species were identified in PCR-positive reactions by direct DNA sequencing. A noncompetitive internal control was applied to prevent false-negative results due to PCR inhibition. The minimum detection limit for the PCR was determined to be one 28S rDNA copy per PCR, and the 95% detection limit was calculated to 15 copies per PCR. To assess the clinical applicability of the PCR method, intensive-care patients with artificial respiration and patients with infective endocarditis were investigated. For this purpose, 76 tracheal secretion samples and 70 heart valve tissues were analyzed in parallel by real-time PCR and cultivation. No discrepancies in results were observed between PCR analysis and cultivation methods. Furthermore, the application of the PCR method was investigated for other clinical specimens, including cervical swabs, nail and horny skin scrapings, and serum, blood, and urine samples. The combination of a broad-range real-time PCR and direct sequencing facilitates rapid screening for fungal infection in various clinical specimens. PMID:18385440
Evaluation of novel broad-range real-time PCR assay for rapid detection of human pathogenic fungi in various clinical specimens.

PubMed

Vollmer, Tanja; Störmer, Melanie; Kleesiek, Knut; Dreier, Jens

2008-06-01

In the present study, a novel broad-range real-time PCR was developed for the rapid detection of human pathogenic fungi. The assay targets a part of the 28S large-subunit ribosomal RNA (rDNA) gene. We investigated its application for the most important human pathogenic fungal genera, including Aspergillus, Candida, Cryptococcus, Mucor, Penicillium, Pichia, Microsporum, Trichophyton, and Scopulariopsis. Species were identified in PCR-positive reactions by direct DNA sequencing. A noncompetitive internal control was applied to prevent false-negative results due to PCR inhibition. The minimum detection limit for the PCR was determined to be one 28S rDNA copy per PCR, and the 95% detection limit was calculated to 15 copies per PCR. To assess the clinical applicability of the PCR method, intensive-care patients with artificial respiration and patients with infective endocarditis were investigated. For this purpose, 76 tracheal secretion samples and 70 heart valve tissues were analyzed in parallel by real-time PCR and cultivation. No discrepancies in results were observed between PCR analysis and cultivation methods. Furthermore, the application of the PCR method was investigated for other clinical specimens, including cervical swabs, nail and horny skin scrapings, and serum, blood, and urine samples. The combination of a broad-range real-time PCR and direct sequencing facilitates rapid screening for fungal infection in various clinical specimens.
Unlabeled probes for the detection and typing of herpes simplex virus.

PubMed

Dames, Shale; Pattison, David C; Bromley, L Kathryn; Wittwer, Carl T; Voelkerding, Karl V

2007-10-01

Unlabeled probe detection with a double-stranded DNA (dsDNA) binding dye is one method to detect and confirm target amplification after PCR. Unlabeled probes and amplicon melting have been used to detect small deletions and single-nucleotide polymorphisms in assays where template is in abundance. Unlabeled probes have not been applied to low-level target detection, however. Herpes simplex virus (HSV) was chosen as a model to compare the unlabeled probe method to an in-house reference assay using dual-labeled, minor groove binding probes. A saturating dsDNA dye (LCGreen Plus) was used for real-time PCR. HSV-1, HSV-2, and an internal control were differentiated by PCR amplicon and unlabeled probe melting analysis after PCR. The unlabeled probe technique displayed 98% concordance with the reference assay for the detection of HSV from a variety of archived clinical samples (n = 182). HSV typing using unlabeled probes was 99% concordant (n = 104) to sequenced clinical samples and allowed for the detection of sequence polymorphisms in the amplicon and under the probe. Unlabeled probes and amplicon melting can be used to detect and genotype as few as 10 copies of target per reaction, restricted only by stochastic limitations. The use of unlabeled probes provides an attractive alternative to conventional fluorescence-labeled, probe-based assays for genotyping and detection of HSV and might be useful for other low-copy targets where typing is informative.

Molecular characterization of Theileria orientalis from cattle in Ethiopia.

PubMed

Gebrekidan, Hagos; Gasser, Robin B; Baneth, Gad; Yasur-Landau, Daniel; Nachum-Biala, Yaarit; Hailu, Asrat; Jabbar, Abdul

2016-07-01

This study reports the first molecular characterization of Theileria orientalis in local breeds of cattle in Ethiopia. A conventional PCR utilizing major piroplasm surface protein (MPSP) gene and an established multiplexed tandem PCR (MT-PCR) were used to characterize T. orientalis and to assess the infection intensity, respectively. Of 232 blood samples tested, T. orientalis DNA was detected in only 2.2% of samples using conventional PCR; two genotypes buffeli (1.3%; 3/232) and type 5 (0.9%; 2/232) of T. orientalis were detected. Phylogenetic analysis revealed that the buffeli MPSP sequences from Ethiopia were closely related to those reported from Kenya, Sri Lanka and Myanmar, and type 5 sequences from Ethiopia grouped with those from Korea, Japan, Vietnam and Thailand. A higher number of samples (3.9%; 9/232) were test-positive by MT-PCR and four genotypes (buffeli, chitose, ikeda and type 5) of T. orientalis were detected. The average intensity of infections with genotypes buffeli (DNA copy numbers 11,056) and type 5 (7508) were significantly higher (P<0.0001) than the pathogenic genotype ikeda (61 DNA copies). This first insight into T. orientalis from cattle in Ethiopia using MPSP gene provides a basis for future studies of T. orientalis in various agroclimatic zones and of the impact of oriental theilerosis on cattle in this and other countries of Africa. Copyright © 2016 Elsevier GmbH. All rights reserved.
A single mini-barcode test to screen for Australian mammalian predators from environmental samples

PubMed Central

MacDonald, Anna J; Sarre, Stephen D

2017-01-01

Abstract Identification of species from trace samples is now possible through the comparison of diagnostic DNA fragments against reference DNA sequence databases. DNA detection of animals from non-invasive samples, such as predator faeces (scats) that contain traces of DNA from their species of origin, has proved to be a valuable tool for the management of elusive wildlife. However, application of this approach can be limited by the availability of appropriate genetic markers. Scat DNA is often degraded, meaning that longer DNA sequences, including standard DNA barcoding markers, are difficult to recover. Instead, targeted short diagnostic markers are required to serve as diagnostic mini-barcodes. The mitochondrial genome is a useful source of such trace DNA markers because it provides good resolution at the species level and occurs in high copy numbers per cell. We developed a mini-barcode based on a short (178 bp) fragment of the conserved 12S ribosomal ribonucleic acid mitochondrial gene sequence, with the goal of discriminating amongst the scats of large mammalian predators of Australia. We tested the sensitivity and specificity of our primers and can accurately detect and discriminate amongst quolls, cats, dogs, foxes, and devils from trace DNA samples. Our approach provides a cost-effective, time-efficient, and non-invasive tool that enables identification of all 8 medium-large mammal predators in Australia, including native and introduced species, using a single test. With modification, this approach is likely to be of broad applicability elsewhere. PMID:28810700
Reduced mtDNA copy number increases the sensitivity of tumor cells to chemotherapeutic drugs.

PubMed

Mei, H; Sun, S; Bai, Y; Chen, Y; Chai, R; Li, H

2015-04-02

Many cancer drugs are toxic to cells by activating apoptotic pathways. Previous studies have shown that mitochondria have key roles in apoptosis in mammalian cells, but the role of mitochondrial DNA (mtDNA) copy number variation in the pathogenesis of tumor cell apoptosis remains largely unknown. We used the HEp-2, HNE2, and A549 tumor cell lines to explore the relationship between mtDNA copy number variation and cell apoptosis. We first induced apoptosis in three tumor cell lines and one normal adult human skin fibroblast cell line (HSF) with cisplatin (DDP) or doxorubicin (DOX) treatment and found that the mtDNA copy number significantly increased in apoptotic tumor cells, but not in HSF cells. We then downregulated the mtDNA copy number by transfection with shRNA-TFAM plasmids or treatment with ethidium bromide and found that the sensitivity of tumor cells to DDP or DOX was significantly increased. Furthermore, we observed that levels of reactive oxygen species (ROS) increased significantly in tumor cells with lower mtDNA copy numbers, and this might be related to a low level of antioxidant gene expression. Finally, we rescued the increase of ROS in tumor cells with lipoic acid or N-acetyl-L-cysteine and found that the apoptosis rate decreased. Our studies suggest that the increase of mtDNA copy number is a self-protective mechanism of tumor cells to prevent apoptosis and that reduced mtDNA copy number increases ROS levels in tumor cells, increases the tumor cells' sensitivity to chemotherapeutic drugs, and increases the rate of apoptosis. This research provides evidence that mtDNA copy number variation might be a promising new therapeutic target for the clinical treatment of tumors.
Association of mitochondrial DNA in peripheral blood with depression, anxiety and stress- and adjustment disorders in primary health care patients.

PubMed

Wang, Xiao; Sundquist, Kristina; Rastkhani, Hamideh; Palmér, Karolina; Memon, Ashfaque A; Sundquist, Jan

2017-08-01

Mitochondrial dysfunction may result in a variety of diseases. The objectives here were to examine possible differences in mtDNA copy number between healthy controls and patients with depression, anxiety or stress- and adjustment disorders; the association between mtDNA copy number and disease severity at baseline; and the association between mtDNA copy number and response after an 8-week treatment (mindfulness, cognitive based therapy). A total of 179 patients in primary health care (age 20-64 years) with depression, anxiety and stress- and adjustment disorders, and 320 healthy controls (aged 19-70 years) were included in the study. Relative mtDNA copy number was measured using quantitative real-time PCR on peripheral blood samples. We found that the mean mtDNA copy number was significantly higher in patients compared to controls (84.9 vs 75.9, p<0.0001) at baseline. The difference in mtDNA copy number between patients and controls remained significant after controlling for age and sex (ß=8.13, p<0.0001; linear regression analysis). The mtDNA copy number was significantly associated with Patient Health Questionnaire (PHQ-9) scores (β=0.57, p=0.02) at baseline. After treatment, the change in mtDNA copy number was significantly associated with the treatment response, i.e., change in Hospital Anxiety and Depression Scale (HADS-D) and PHQ-9 scores (ß=1.00, p=0.03 and ß=0.65, p=0.04, respectively), after controlling for baseline scores, age, sex, BMI, smoking status, alcohol drinking and medication. Our findings show that mtDNA copy number is associated with symptoms of depression, anxiety and stress- and adjustment disorders and treatment response in these disorders. Copyright © 2017 Elsevier B.V. and ECNP. All rights reserved.
Transgenesis in fish.

PubMed

Houdebine, L M; Chourrout, D

1991-09-15

Gene transfer into fish embryo is being performed in several species (trout, salmon, carps, tilapia, medaka, goldfish, zebrafish, loach, catfish, etc.). In most cases, pronuclei are not visible and microinjection must be done into the cytoplasm of early embryos. Several million copies of the gene are generally injected. In medaka, transgenesis was attempted by injection of the foreign gene into the nucleus of oocyte. Several reports indicate that the injected DNA was rapidly replicated in the early phase of embryo development, regardless of the origin and the sequence of the foreign DNA. The survival of the injected embryos was reasonably good and a large number reached maturity. The proportion of transgenic animals ranged from 1 to 50% or more, according to species and to experimentators. The reasons for this discrepancy have not been elucidated. In all species, the transgenic animals were mosaic. The copy number of the foreign DNA was different in the various tissues of an animal and a proportion lower than 50% of F1 offsprings received the gene from their parents. This suggests that the foreign DNA was integrated into the fish genome at the two cells stage or later. An examination of the integrated DNA in different cell types of an animal revealed that integration occurred mainly during early development. The transgene was found essentially unrearranged in the fish genome of the founders and offsprings. The transgenes were therefore stably transmitted to progeny in a Mendelian fashion. Southern blot analysis revealed the presence of possible junction fragments and also of minor bands which may result from a rearrangement of the injected DNA. In all species, the integrated DNA appeared mainly as random end-to-end concatemers. In adult trout blood cells, a small proportion of the foreign DNA was maintained in the form of non-integrated concatemers, as judged by the existence of end fragments. The transgenes were generally only poorly expressed. The majority of the injected gene constructs contained essentially mammalian or higher vertebrates sequences. The comparison of the expression efficiency of these constructs in transfected fish and mammalian cells indicates that some of the mammalian DNA sequences are most efficiently understood by the fish cell machinery. Chloramphenicol acetyl transferase gene under the control of promoters from Rous sarcoma virus, and human cytomegalovirus, was expressed in several tissues of transgenic fish. Chicken delta-crystallin gene was expressed in several tissues of transgenic fish.(ABSTRACT TRUNCATED AT 400 WORDS)
Structure of the highly repeated, long interspersed DNA family (LINE or L1Rn) of the rat.

PubMed Central

D'Ambrosio, E; Waitzkin, S D; Witney, F R; Salemme, A; Furano, A V

1986-01-01

We present the DNA sequence of a 6.7-kilobase member of the rat long interspersed repeated DNA family (LINE or L1Rn). This member (LINE 3) is flanked by a perfect 14-base-pair (bp) direct repeat and is a full-length, or close-to-full-length, member of this family. LINE 3 contains an approximately 100-bp A-rich right end, a number of long (greater than 400-bp) open reading frames, and a ca. 200-bp G + C-rich (ca. 60%) cluster near each terminus. Comparison of the LINE 3 sequence with the sequence of about one-half of another member, which we also present, as well as restriction enzyme analysis of the genomic copies of this family, indicates that in length and overall structure LINE 3 is quite typical of the 40,000 or so other genomic members of this family which would account for as much as 10% of the rat genome. Therefore, the rat LINE family is relatively homogeneous, which contrasts with the heterogeneous LINE families in primates and mice. Transcripts corresponding to the entire LINE sequence are abundant in the nuclear RNA of rat liver. The characteristics of the rat LINE family are discussed with respect to the possible function and evolution of this family of DNA sequences. Images PMID:3023845
Droplet digital PCR technology promises new applications and research areas.

PubMed

Manoj, P

2016-01-01

Digital Polymerase Chain Reaction (dPCR) is used to quantify nucleic acids and its applications are in the detection and precise quantification of low-level pathogens, rare genetic sequences, quantification of copy number variants, rare mutations and in relative gene expressions. Here the PCR is performed in large number of reaction chambers or partitions and the reaction is carried out in each partition individually. This separation allows a more reliable collection and sensitive measurement of nucleic acid. Results are calculated by counting amplified target sequence (positive droplets) and the number of partitions in which there is no amplification (negative droplets). The mean number of target sequences was calculated by Poisson Algorithm. Poisson correction compensates the presence of more than one copy of target gene in any droplets. The method provides information with accuracy and precision which is highly reproducible and less susceptible to inhibitors than qPCR. It has been demonstrated in studying variations in gene sequences, such as copy number variants and point mutations, distinguishing differences between expression of nearly identical alleles, assessment of clinically relevant genetic variations and it is routinely used for clonal amplification of samples for NGS methods. dPCR enables more reliable predictors of tumor status and patient prognosis by absolute quantitation using reference normalizations. Rare mitochondrial DNA deletions associated with a range of diseases and disorders as well as aging can be accurately detected with droplet digital PCR.
Characterizing partial AZFc deletions of the Y chromosome with amplicon-specific sequence markers

PubMed Central

Navarro-Costa, Paulo; Pereira, Luísa; Alves, Cíntia; Gusmão, Leonor; Proença, Carmen; Marques-Vidal, Pedro; Rocha, Tiago; Correia, Sónia C; Jorge, Sónia; Neves, António; Soares, Ana P; Nunes, Joaquim; Calhaz-Jorge, Carlos; Amorim, António; Plancha, Carlos E; Gonçalves, João

2007-01-01

Background The AZFc region of the human Y chromosome is a highly recombinogenic locus containing multi-copy male fertility genes located in repeated DNA blocks (amplicons). These AZFc gene families exhibit slight sequence variations between copies which are considered to have functional relevance. Yet, partial AZFc deletions yield phenotypes ranging from normospermia to azoospermia, thwarting definite conclusions on their real impact on fertility. Results The amplicon content of partial AZFc deletion products was characterized with novel amplicon-specific sequence markers. Data indicate that partial AZFc deletions are a male infertility risk [odds ratio: 5.6 (95% CI: 1.6–30.1)] and although high diversity of partial deletion products and sequence conversion profiles were recorded, the AZFc marker profiles detected in fertile men were also observed in infertile men. Additionally, the assessment of rearrangement recurrence by Y-lineage analysis indicated that while partial AZFc deletions occurred in highly diverse samples, haplotype diversity was minimal in fertile men sharing identical marker profiles. Conclusion Although partial AZFc deletion products are highly heterogeneous in terms of amplicon content, this plasticity is not sufficient to account for the observed phenotypical variance. The lack of causative association between the deletion of specific gene copies and infertility suggests that AZFc gene content might be part of a multifactorial network, with Y-lineage evolution emerging as a possible phenotype modulator. PMID:17903263
Targeted next-generation sequencing at copy-number breakpoints for personalized analysis of rearranged ends in solid tumors.

PubMed

Kim, Hyun-Kyoung; Park, Won Cheol; Lee, Kwang Man; Hwang, Hai-Li; Park, Seong-Yeol; Sorn, Sungbin; Chandra, Vishal; Kim, Kwang Gi; Yoon, Woong-Bae; Bae, Joon Seol; Shin, Hyoung Doo; Shin, Jong-Yeon; Seoh, Ju-Young; Kim, Jong-Il; Hong, Kyeong-Man

2014-01-01

The concept of the utilization of rearranged ends for development of personalized biomarkers has attracted much attention owing to its clinical applicability. Although targeted next-generation sequencing (NGS) for recurrent rearrangements has been successful in hematologic malignancies, its application to solid tumors is problematic due to the paucity of recurrent translocations. However, copy-number breakpoints (CNBs), which are abundant in solid tumors, can be utilized for identification of rearranged ends. As a proof of concept, we performed targeted next-generation sequencing at copy-number breakpoints (TNGS-CNB) in nine colon cancer cases including seven primary cancers and two cell lines, COLO205 and SW620. For deduction of CNBs, we developed a novel competitive single-nucleotide polymorphism (cSNP) microarray method entailing CNB-region refinement by competitor DNA. Using TNGS-CNB, 19 specific rearrangements out of 91 CNBs (20.9%) were identified, and two polymerase chain reaction (PCR)-amplifiable rearrangements were obtained in six cases (66.7%). And significantly, TNGS-CNB, with its high positive identification rate (82.6%) of PCR-amplifiable rearrangements at candidate sites (19/23), just from filtering of aligned sequences, requires little effort for validation. Our results indicate that TNGS-CNB, with its utility for identification of rearrangements in solid tumors, can be successfully applied in the clinical laboratory for cancer-relapse and therapy-response monitoring.
Homologous recombination between overlapping thymidine kinase gene fragments stably inserted into a mouse cell genome.

PubMed

Lin, F L; Sternberg, N

1984-05-01

We have constructed a substrate to study homologous recombination between adjacent segments of chromosomal DNA. This substrate, designated lambda tk2 , consists of one completely defective and one partially defective herpes simplex virus thymidine kinase (tk) gene cloned in bacteriophage lambda DNA. The two genes have homologous 984-base-pair sequences and are separated by 3 kilobases of largely vector DNA. When lambda tk2 DNA was transferred into mouse LMtk- cells by the calcium phosphate method, rare TK+ transformants were obtained that contained many (greater than 40) copies of the unrecombined DNA. Tk- revertants, which had lost most of the copies of unrecombined DNA, were isolated from these TK+-transformed lines. Two of these Tk- lines were further studied by analysis of their reversion back to the Tk+ phenotype. They generated ca. 200 Tk+ revertants per 10(8) cells after growth in nonselecting medium for 5 days. All of these Tk+ revertants have an intact tk gene reconstructed by homologous recombination; they also retain various amounts of unrecombined lambda tk2 DNA. Southern blot analysis suggested that at least some of the recombination events involve unequal sister chromatid exchanges. We also tested three agents, mitomycin C, 12-O-tetradecanoyl-phorbol-13-acetate, and mezerein, that are thought to stimulate recombination to determine whether they affect the reversion from Tk- to Tk+. Only mitomycin C increased the number of Tk+ revertants.
A new polymorphic and multicopy MHC gene family related to nonmammalian class I

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leelayuwat, C.; Degli-Esposti, M.A.; Abraham, L.J.

1994-12-31

The authors have used genomic analysis to characterize a region of the central major histocompatibility complex (MHC) spanning {approximately} 300 kilobases (kb) between TNF and HLA-B. This region has been suggested to carry genetic factors relevant to the development of autoimmune diseases such as myasthenia gravis (MG) and insulin dependent diabetes mellitus (IDDM). Genomic sequence was analyzed for coding potential, using two neural network programs, GRAIL and GeneParser. A genomic probe, JAB, containing putative coding sequences (PERB11) located 60 kb centromeric of HLA-B, was used for northern analysis of human tissues. Multiple transcripts were detected. Southern analysis of genomic DNAmore » and overlapping YAC clones, covering the region from BAT1 to HLA-F, indicated that there are at least five copies of PERB11, four of which are located within this region of the MHC. The partial cDNA sequence of PERB11 was obtained from poly-A RNA derived from skeletal muscle. The putative amino acid sequence of PERB11 shares {approximately} 30% identity to MHC class I molecules from various species, including reptiles, chickens, and frogs, as well as to other MHC class I-like molecules, such as the IgG FcR of the mouse and rat and the human Zn-{alpha}2-glycoprotein. From direct comparison of amino acid sequences, it is concluded that PERB11 is a distinct molecule more closely related to nonmammalian than known mammalian MHC class I molecules. Genomic sequence analysis of PERB11 from five MHC ancestral haplotypes (AH) indicated that the gene is polymorphic at both DNA and protein level. The results suggest that the authors have identified a novel polymorphic gene family with multiple copies within the MHC. 48 refs., 10 figs., 2 tabs.« less
Highly conserved intragenic HSV-2 sequences: Results from next-generation sequencing of HSV-2 UL and US regions from genital swabs collected from 3 continents.

PubMed

Johnston, Christine; Magaret, Amalia; Roychoudhury, Pavitra; Greninger, Alexander L; Cheng, Anqi; Diem, Kurt; Fitzgibbon, Matthew P; Huang, Meei-Li; Selke, Stacy; Lingappa, Jairam R; Celum, Connie; Jerome, Keith R; Wald, Anna; Koelle, David M

2017-10-01

Understanding the variability in circulating herpes simplex virus type 2 (HSV-2) genomic sequences is critical to the development of HSV-2 vaccines. Genital lesion swabs containing ≥ 10 7 log 10 copies HSV DNA collected from Africa, the USA, and South America underwent next-generation sequencing, followed by K-mer based filtering and de novo genomic assembly. Sites of heterogeneity within coding regions in unique long and unique short (U L _U S ) regions were identified. Phylogenetic trees were created using maximum likelihood reconstruction. Among 46 samples from 38 persons, 1468 intragenic base-pair substitutions were identified. The maximum nucleotide distance between strains for concatenated U L_ U S segments was 0.4%. Phylogeny did not reveal geographic clustering. The most variable proteins had non-synonymous mutations in < 3% of amino acids. Unenriched HSV-2 DNA can undergo next-generation sequencing to identify intragenic variability. The use of clinical swabs for sequencing expands the information that can be gathered directly from these specimens. Copyright © 2017 Elsevier Inc. All rights reserved.
Chromatin structure and methylation of rat rRNA genes studied by formaldehyde fixation and psoralen cross-linking.

PubMed Central

Stancheva, I; Lucchini, R; Koller, T; Sogo, J M

1997-01-01

By using formaldehyde cross-linking of histones to DNA and gel retardation assays we show that formaldehyde fixation, similar to previously established psoralen photocross-linking, discriminates between nucleosome- packed (inactive) and nucleosome-free (active) fractions of ribosomal RNA genes. By both cross-linking techniques we were able to purify fragments from agarose gels, corresponding to coding, enhancer and promoter sequences of rRNA genes, which were further investigated with respect to DNA methylation. This approach allows us to analyse independently and in detail methylation patterns of active and inactive rRNA gene copies by the combination of Hpa II and Msp I restriction enzymes. We found CpG methylation mainly present in enhancer and promoter regions of inactive rRNA gene copies. The methylation of one single Hpa II site, located in the promoter region, showed particularly strong correlation with the transcriptional activity. PMID:9108154
A Helitron-like Transposon Superfamily from Lepidoptera Disrupts (GAAA)n Microsatellites and is Responsible for Flanking Sequence Similarity within a Microsatellite Family

USDA-ARS?s Scientific Manuscript database

Transposable elements (TEs) are mobile DNA regions that alter host genome structure and gene expression. A novel 588 bp non-autonomous high copy number TE in the Ostrinia nubilalis genome has features in common with miniature inverted-repeat transposable elements (MITEs): high A+T content (62.3%),...
7 CFR 340.2 - Groups of organisms which are or contain plant pests and exemptions.

Code of Federal Regulations, 2010 CFR

2010-01-01

... within the group listed are included as organisms that may be or may contain plant pests, and are... deemed a plant pest for purposes of § 340.2, if the scientific literature refers to the organism as a... DNA or RNA sequences, organelles, plasmids, parts, copies, and/or analogs, of or from any of the...
7 CFR 340.2 - Groups of organisms which are or contain plant pests and exemptions.

Code of Federal Regulations, 2012 CFR

2012-01-01

... within the group listed are included as organisms that may be or may contain plant pests, and are... deemed a plant pest for purposes of § 340.2, if the scientific literature refers to the organism as a... DNA or RNA sequences, organelles, plasmids, parts, copies, and/or analogs, of or from any of the...
7 CFR 340.2 - Groups of organisms which are or contain plant pests and exemptions.

Code of Federal Regulations, 2014 CFR

2014-01-01

... within the group listed are included as organisms that may be or may contain plant pests, and are... deemed a plant pest for purposes of § 340.2, if the scientific literature refers to the organism as a... DNA or RNA sequences, organelles, plasmids, parts, copies, and/or analogs, of or from any of the...
7 CFR 340.2 - Groups of organisms which are or contain plant pests and exemptions.

Code of Federal Regulations, 2011 CFR

2011-01-01

... within the group listed are included as organisms that may be or may contain plant pests, and are... deemed a plant pest for purposes of § 340.2, if the scientific literature refers to the organism as a... DNA or RNA sequences, organelles, plasmids, parts, copies, and/or analogs, of or from any of the...
7 CFR 340.2 - Groups of organisms which are or contain plant pests and exemptions.

Code of Federal Regulations, 2013 CFR

2013-01-01

... within the group listed are included as organisms that may be or may contain plant pests, and are... deemed a plant pest for purposes of § 340.2, if the scientific literature refers to the organism as a... DNA or RNA sequences, organelles, plasmids, parts, copies, and/or analogs, of or from any of the...
Dominant genetics using a yeast genomic library under the control of a strong inducible promoter.

PubMed

Ramer, S W; Elledge, S J; Davis, R W

1992-12-01

In Saccharomyces cerevisiae, numerous genes have been identified by selection from high-copy-number libraries based on "multicopy suppression" or other phenotypic consequences of overexpression. Although fruitful, this approach suffers from two major drawbacks. First, high copy number alone may not permit high-level expression of tightly regulated genes. Conversely, other genes expressed in proportion to dosage cannot be identified if their products are toxic at elevated levels. This work reports construction of a genomic DNA expression library for S. cerevisiae that circumvents both limitations by fusing randomly sheared genomic DNA to the strong, inducible yeast GAL1 promoter, which can be regulated by carbon source. The library obtained contains 5 x 10(7) independent recombinants, representing a breakpoint at every base in the yeast genome. This library was used to examine aberrant gene expression in S. cerevisiae. A screen for dominant activators of yeast mating response identified eight genes that activate the pathway in the absence of exogenous mating pheromone, including one previously unidentified gene. One activator was a truncated STE11 gene lacking approximately 1000 base pairs of amino-terminal coding sequence. In two different clones, the same GAL1 promoter-proximal ATG is in-frame with the coding sequence of STE11, suggesting that internal initiation of translation there results in production of a biologically active, truncated STE11 protein. Thus this library allows isolation based on dominant phenotypes of genes that might have been difficult or impossible to isolate from high-copy-number libraries.

Molecular characterization of Mycobacterium tuberculosis isolated in the State of Parana in southern Brazil.

PubMed

Malaghini, Marcelo; Brockelt, Sonia Regina; Burger, Marion; Kritski, Afrânio; Thomaz-Soccol, Vanete

2009-01-01

Sequence IS6110 has been successfully used throughout the world for characterizing the Mycobacterium tuberculosis lineages. The aim of this study was to obtain data about circulating strains of M. tuberculosis in patients from the State of Parana in southern Brazil. Sixty-two clinical specimens obtained from sputum, bronchial aspirate, biopsy and urine from 62 patients clinically diagnosed with tuberculosis and admitted to the SUS-Brazil - The Brazilian Centralized Health Service System - were genotyped by the mixed-linker PCR DNA fingerprinting technique. The analysis demonstrated that the number of copies of the IS6110 sequence per isolates varied from four to 13 bands, with an average number of 8.5. From this, 93% of the isolates presented multiple copies. Isolates with no copies of the IS6110 element were not observed. The genetic analysis by UPGMA grouped the 62 isolates by similarity into three different groups: the first group contained two strains, the second was composed of 23, and the third, a more heterogeneous group, contained 37 isolates. Only two isolates (3.2%) formed a cluster; in other words, they presented a pattern of polymorphism with similarity above 95%. Such findings suggest that in the State of Parana, illness predominantly develops through reactivation of the latent infection as opposed to exogenous transmission. The methodology used (mixed-linker PCR DNA fingerprinting) allowed for 93.5% differentiation of the isolates tested, and proved to be a powerful tool for differentiation in the molecular genotyping of M. tuberculosis.
[Study of alpha-satellite DNA in cosmid libraries, specific for chromosomes 13, 21, and 22, using fluorescence in situ hybridization].

PubMed

Solov'ev, I V; Iurov, Iu B; Vorsanova, S G; Marcais, B; Rogaev, E I; Kapanadze, B I; Brodianskiĭ, V M; Iankovskiĭ, N K; Roizes, G

1998-11-01

Fluorescent in situ hybridization (FISH) was employed in mapping the alpha-satellite DNA that was revealed in the cosmid libraries specific for human chromosomes 13, 21, and 22. In total, 131 clones were revealed. They contained various elements of centromeric alphoid DNA sequences of acrocentric chromosomes, including those located close to SINEs, LINEs, and classical satellite sequences. The heterochromatin of acrocentric chromosomes was shown to contain two different groups of alphoid sequences: (1) those immediately adjacent to the centromeric regions (alpha 13-1, alpha 21-1, and alpha 22-1 loci) and (2) those located in the short arm of acrocentric chromosomes (alpha 13-2, alpha 21-2, and alpha 22-2 loci). Alphoid DNA sequences from the alpha 13-2, alpha 21-2, and alpha 22-2 loci are apparently not involved in the formation of centromeres and are absent from mitotically stable marker chromosomes with a deleted short arm. Robertsonian translocations t(13q; 21q) and t(14q; 22q), and chromosome 21p-. The heterochromatic regions of chromosomes 13, 21, and 22 were also shown to contain relatively chromosome-specific repetitive sequences of various alphoid DNA families, whose numerous copies occur in other chromosomes. Pools of centromeric alphoid cosmids can be of use in further studies of the structural and functional properties of heterochromatic DNA and the identification of centromeric sequences. Moreover, these clones can be employed in high-resolution mapping and in sequencing the heterochromatic regions of the human genome. The detailed FISH analysis of numerous alphoid cosmid clones allowed the identification of several new, highly specific DNA probes of molecular cytogenetic studies--in particular, the interphase and metaphase analyses of chromosomes 2, 9, 11, 14, 15, 16, 18, 20, 21-13, 22-14, and X.
Alu sequence involvement in transcriptional insulation of the keratin 18 gene in transgenic mice.

PubMed Central

Thorey, I S; Ceceña, G; Reynolds, W; Oshima, R G

1993-01-01

The human keratin 18 (K18) gene is expressed in a variety of adult simple epithelial tissues, including liver, intestine, lung, and kidney, but is not normally found in skin, muscle, heart, spleen, or most of the brain. Transgenic animals derived from the cloned K18 gene express the transgene in appropriate tissues at levels directly proportional to the copy number and independently of the sites of integration. We have investigated in transgenic mice the dependence of K18 gene expression on the distal 5' and 3' flanking sequences and upon the RNA polymerase III promoter of an Alu repetitive DNA transcription unit immediately upstream of the K18 promoter. Integration site-independent expression of tandemly duplicated K18 transgenes requires the presence of either an 825-bp fragment of the 5' flanking sequence or the 3.5-kb 3' flanking sequence. Mutation of the RNA polymerase III promoter of the Alu element within the 825-bp fragment abolishes copy number-dependent expression in kidney but does not abolish integration site-independent expression when assayed in the absence of the 3' flanking sequence of the K18 gene. The characteristics of integration site-independent expression and copy number-dependent expression are separable. In addition, the formation of the chromatin state of the K18 gene, which likely restricts the tissue-specific expression of this gene, is not dependent upon the distal flanking sequences of the 10-kb K18 gene but rather may depend on internal regulatory regions of the gene. Images PMID:7692231
Insertion and deletion mutagenesis of the human cytomegalovirus genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Spaete, R.R.; Mocarski, E.S.

1987-10-01

Studies on human cytomegalovirus (CMV) have been limited by a paucity of molecular genetic techniques available for manipulating the viral genome. The authors have developed methods for site-specific insertion and deletion mutagenesis of CMV utilizing a modified Escherichia coli lacZ gene as a genetic marker. The lacZ gene was placed under the control of the major ..beta.. gene regulatory signals and inserted into the viral genome by homologous recombination, disrupting one of two copies of this ..beta.. gene within the L-component repeats of CMV DNA. They observed high-level expression of ..beta..-galactosidase by the recombinant in a temporally authentic manner, withmore » levels of this enzyme approaching 1% of total protein in infected cells. Thus, CMV is an efficient vector for high-level expression of foreign gene products in human cells. Using back selection of lacZ-deficient virus in the presence of the chromogenic substrate 5-bromo-4-chloro-3-indolyl ..beta..-D-galactoside, they generated random endpoint deletion mutants. Analysis of these mutant revealed that CMV DNA sequences flanking the insert had been removed, thereby establishing this approach as a means of determining whether sequences flanking a lacZ insertion are dispensable for viral growth. In an initial test of the methods, they have shown that 7800 base pairs of one copy of L-component repeat sequences can be deleted without affecting viral growth in human fibroblasts.« less
In and out of the rRNA genes: characterization of Pokey elements in the sequenced Daphnia genome

PubMed Central

2013-01-01

Background Only a few transposable elements are known to exhibit site-specific insertion patterns, including the well-studied R-element retrotransposons that insert into specific sites within the multigene rDNA. The only known rDNA-specific DNA transposon, Pokey (superfamily: piggyBac) is found in the freshwater microcrustacean, Daphnia pulex. Here, we present a genome-wide analysis of Pokey based on the recently completed whole genome sequencing project for D. pulex. Results Phylogenetic analysis of Pokey elements recovered from the genome sequence revealed the presence of four lineages corresponding to two divergent autonomous families and two related lineages of non-autonomous miniature inverted repeat transposable elements (MITEs). The MITEs are also found at the same 28S rRNA gene insertion site as the Pokey elements, and appear to have arisen as deletion derivatives of autonomous elements. Several copies of the full-length Pokey elements may be capable of producing an active transposase. Surprisingly, both families of Pokey possess a series of 200 bp repeats upstream of the transposase that is derived from the rDNA intergenic spacer (IGS). The IGS sequences within the Pokey elements appear to be evolving in concert with the rDNA units. Finally, analysis of the insertion sites of Pokey elements outside of rDNA showed a target preference for sites similar to the specific sequence that is targeted within rDNA. Conclusions Based on the target site preference of Pokey elements and the concerted evolution of a segment of the element with the rDNA unit, we propose an evolutionary path by which the ancestors of Pokey elements have invaded the rDNA niche. We discuss how specificity for the rDNA unit may have evolved and how this specificity has played a role in the long-term survival of these elements in the subgenus Daphnia. PMID:24059783
SeeGH--a software tool for visualization of whole genome array comparative genomic hybridization data.

PubMed

Chi, Bryan; DeLeeuw, Ronald J; Coe, Bradley P; MacAulay, Calum; Lam, Wan L

2004-02-09

Array comparative genomic hybridization (CGH) is a technique which detects copy number differences in DNA segments. Complete sequencing of the human genome and the development of an array representing a tiling set of tens of thousands of DNA segments spanning the entire human genome has made high resolution copy number analysis throughout the genome possible. Since array CGH provides signal ratio for each DNA segment, visualization would require the reassembly of individual data points into chromosome profiles. We have developed a visualization tool for displaying whole genome array CGH data in the context of chromosomal location. SeeGH is an application that translates spot signal ratio data from array CGH experiments to displays of high resolution chromosome profiles. Data is imported from a simple tab delimited text file obtained from standard microarray image analysis software. SeeGH processes the signal ratio data and graphically displays it in a conventional CGH karyotype diagram with the added features of magnification and DNA segment annotation. In this process, SeeGH imports the data into a database, calculates the average ratio and standard deviation for each replicate spot, and links them to chromosome regions for graphical display. Once the data is displayed, users have the option of hiding or flagging DNA segments based on user defined criteria, and retrieve annotation information such as clone name, NCBI sequence accession number, ratio, base pair position on the chromosome, and standard deviation. SeeGH represents a novel software tool used to view and analyze array CGH data. The software gives users the ability to view the data in an overall genomic view as well as magnify specific chromosomal regions facilitating the precise localization of genetic alterations. SeeGH is easily installed and runs on Microsoft Windows 2000 or later environments.
DNA methylation inhibits expression and transposition of the Neurospora Tad retrotransposon.

PubMed

Zhou, Y; Cambareri, E B; Kinsey, J A

2001-06-01

Tad is a LINE-like retrotransposon of the filamentous fungus Neurospora crassa. We have analyzed both expression and transposition of this element using strains with a single copy of Tad located in the 5' noncoding sequences of the am (glutamate dehydrogenase) gene. Tad in this position has been shown to carry a de novo cytosine methylation signal which causes reversible methylation of both Tad and am upstream sequences. Here we find that methylation of the Tad sequences inhibits both Tad expression and transposition. This inhibition can be relieved by the use of 5-azacytidine, a drug which reduces cytosine methylation, or by placing the Tad/am sequences in a dim-2 genetic background.
Development and analysis of a tick-borne encephalitis virus infectious clone using a novel and rapid strategy.

PubMed

Gritsun, T S; Gould, E A

1998-12-01

In less than 1 month we have constructed an infectious clone of attenuated tick-borne encephalitis virus (strain Vasilchenko) from 100 microl of unpurified virus suspension using long high fidelity PCR and a modified bacterial cloning system. Optimization of the 3' antisense primer concentration was essential to achieve PCR synthesis of an 11 kb cDNA copy of RNA from infectious virus. A novel system utilising two antisense primers, a 14-mer for reverse transcription and a 35-mer for long PCR, produced high yields of genomic length cDNA. Use of low copy number Able K cells and an incubation temperature of 28 degrees C increased the genetic stability of cloned cDNA. Clones containing 11 kb cDNA inserts produced colonies of reduced size, thus providing a positive selection system for full length clones. Sequencing of the infectious clone emphasised the improved fidelity of the method compared with conventional PCR and cloning methods. A simple and rapid strategy for genetic manipulation of the infectious clone is also described. These developments represent a significant advance in recombinant technology and should be applicable to positive stranded RNA viruses which cannot easily be purified or genetically manipulated.
The adnAB Locus, Encoding a Putative Helicase-Nuclease Activity, Is Essential in Streptomyces

PubMed Central

Zhang, Lingli; Nguyen, Hoang Chuong; Chipot, Ludovic; Piotrowski, Emilie; Bertrand, Claire

2014-01-01

Homologous recombination is a crucial mechanism that repairs a wide range of DNA lesions, including the most deleterious ones, double-strand breaks (DSBs). This multistep process is initiated by the resection of the broken DNA ends by a multisubunit helicase-nuclease complex exemplified by Escherichia coli RecBCD, Bacillus subtilis AddAB, and newly discovered Mycobacterium tuberculosis AdnAB. Here we show that in Streptomyces, neither recBCD nor addAB homologues could be detected. The only putative helicase-nuclease-encoding genes identified were homologous to M. tuberculosis adnAB genes. These genes are conserved as a single copy in all sequenced genomes of Streptomyces. The disruption of adnAB in Streptomyces ambofaciens and Streptomyces coelicolor could not be achieved unless an ectopic copy was provided, indicating that adnAB is essential for growth. Both adnA and adnB genes were shown to be inducible in response to DNA damage (mitomycin C) and to be independently transcribed. Introduction of S. ambofaciens adnAB genes in an E. coli recB mutant restored viability and resistance to UV light, suggesting that Streptomyces AdnAB could be a functional homologue of RecBCD and be involved in DNA damage resistance. PMID:24837284
RNA metabolism in the regulation of protein synthesis in plants. Progress report, 1975-1979

DOE Office of Scientific and Technical Information (OSTI.GOV)

Key, J L

1979-01-01

The major objectives of the research for the contract period covered by this report were (1) to gain an insight into the sequence organization of the DNA of soybean, emphasizing the arrangement of single copy or unique sequences and repetitive sequences of DNA throughout the genome, (2) to characterize soybean RNAs relative to nucleotide sequence complexity and kinetics of synthesis and turnover of poly A/sup +/ mRNA, and (3) to study ribosomal proteins directed to an analysis of possible changes in proteins which relate to the activation of 80S ribosomes and thus mRNA utilization and protein synthesis in response tomore » environmental stimuli. Even with greatly reduced funding compared to that requested, objectives 1 and 2 were substantially accomplished. Because of reduced funding and the 20-month no cost extension, relatively little progress was made on objective 3. Accordingly objectives 1 and 2 will be summarized in some detail; a brief account of progress is presented on objective 3.« less
A comprehensive resource of genomic, epigenomic and transcriptomic sequencing data for the black truffle Tuber melanosporum

PubMed Central

2014-01-01

Background Tuber melanosporum, also known in the gastronomic community as “truffle”, features one of the largest fungal genomes (125 Mb) with an exceptionally high transposable element (TE) and repetitive DNA content (>58%). The main purpose of DNA methylation in fungi is TE silencing. As obligate outcrossing organisms, truffles are bound to a sexual mode of propagation, which together with TEs is thought to represent a major force driving the evolution of DNA methylation. Thus, it was of interest to examine if and how T. melanosporum exploits DNA methylation to maintain genome integrity. Findings We performed whole-genome DNA bisulfite sequencing and mRNA sequencing on different developmental stages of T. melanosporum; namely, fruitbody (“truffle”), free-living mycelium and ectomycorrhiza. The data revealed a high rate of cytosine methylation (>44%), selectively targeting TEs rather than genes with a strong preference for CpG sites. Whole genome DNA sequencing uncovered multiple TE-enriched, copy number variant regions bearing a significant fraction of hypomethylated and expressed TEs, almost exclusively in free-living mycelium propagated in vitro. Treatment of mycelia with 5-azacytidine partially reduced DNA methylation and increased TE transcription. Our transcriptome assembly also resulted in the identification of a set of novel transcripts from 614 genes. Conclusions The datasets presented here provide valuable and comprehensive (epi)genomic information that can be of interest for evolutionary genomics studies of multicellular (filamentous) fungi, in particular Ascomycetes belonging to the subphylum, Pezizomycotina. Evidence derived from comparative methylome and transcriptome analyses indicates that a non-exhaustive and partly reversible methylation process operates in truffles. PMID:25392735
A comprehensive resource of genomic, epigenomic and transcriptomic sequencing data for the black truffle Tuber melanosporum.

PubMed

Chen, Pao-Yang; Montanini, Barbara; Liao, Wen-Wei; Morselli, Marco; Jaroszewicz, Artur; Lopez, David; Ottonello, Simone; Pellegrini, Matteo

2014-01-01

Tuber melanosporum, also known in the gastronomic community as "truffle", features one of the largest fungal genomes (125 Mb) with an exceptionally high transposable element (TE) and repetitive DNA content (>58%). The main purpose of DNA methylation in fungi is TE silencing. As obligate outcrossing organisms, truffles are bound to a sexual mode of propagation, which together with TEs is thought to represent a major force driving the evolution of DNA methylation. Thus, it was of interest to examine if and how T. melanosporum exploits DNA methylation to maintain genome integrity. We performed whole-genome DNA bisulfite sequencing and mRNA sequencing on different developmental stages of T. melanosporum; namely, fruitbody ("truffle"), free-living mycelium and ectomycorrhiza. The data revealed a high rate of cytosine methylation (>44%), selectively targeting TEs rather than genes with a strong preference for CpG sites. Whole genome DNA sequencing uncovered multiple TE-enriched, copy number variant regions bearing a significant fraction of hypomethylated and expressed TEs, almost exclusively in free-living mycelium propagated in vitro. Treatment of mycelia with 5-azacytidine partially reduced DNA methylation and increased TE transcription. Our transcriptome assembly also resulted in the identification of a set of novel transcripts from 614 genes. The datasets presented here provide valuable and comprehensive (epi)genomic information that can be of interest for evolutionary genomics studies of multicellular (filamentous) fungi, in particular Ascomycetes belonging to the subphylum, Pezizomycotina. Evidence derived from comparative methylome and transcriptome analyses indicates that a non-exhaustive and partly reversible methylation process operates in truffles.
DNA Copy Number Signature to Predict Recurrence in Early Stage Ovarian Cancer

DTIC Science & Technology

2016-08-01

AWARD NUMBER: W81XWH-14-1-0194 TITLE: DNA Copy Number Signature to Predict Recurrence in Early-Stage Ovarian Cancer PRINCIPAL INVESTIGATOR...SUBTITLE 5a. CONTRACT NUMBER DNA Copy Number Signature to Predict Recurrence in Early-Stage Ovarian Cancer 5b. GRANT NUMBER W81XWH-14-1-0194 5c. PROGRAM...determine the copy number gain and loss for early stage high grade ovarian cancers through IlluminaHumanOmniExpress-FFPE BeadChip system • Subtask 1 DNA
CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data

PubMed Central

De, Rajat K.

2015-01-01

Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision. PMID:26291322
CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data.

PubMed

Sinha, Rituparna; Samaddar, Sandip; De, Rajat K

2015-01-01

Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision.
Hacking DNA copy number for circuit engineering.

PubMed

Wu, Feilun; You, Lingchong

2017-07-27

DNA copy number represents an essential parameter in the dynamics of synthetic gene circuits but typically is not explicitly considered. A new study demonstrates how dynamic control of DNA copy number can serve as an effective strategy to program robust oscillations in gene expression circuits.
Mitochondrial DNA copy numbers in pyramidal neurons are decreased and mitochondrial biogenesis transcriptome signaling is disrupted in Alzheimer's disease hippocampi.

PubMed

Rice, Ann C; Keeney, Paula M; Algarzae, Norah K; Ladd, Amy C; Thomas, Ravindar R; Bennett, James P

2014-01-01

Alzheimer's disease (AD) is the major cause of adult-onset dementia and is characterized in its pre-diagnostic stage by reduced cerebral cortical glucose metabolism and in later stages by reduced cortical oxygen uptake, implying reduced mitochondrial respiration. Using quantitative PCR we determined the mitochondrial DNA (mtDNA) gene copy numbers from multiple groups of 15 or 20 pyramidal neurons, GFAP(+) astrocytes and dentate granule neurons isolated using laser capture microdissection, and the relative expression of mitochondrial biogenesis (mitobiogenesis) genes in hippocampi from 10 AD and 9 control (CTL) cases. AD pyramidal but not dentate granule neurons had significantly reduced mtDNA copy numbers compared to CTL neurons. Pyramidal neuron mtDNA copy numbers in CTL, but not AD, positively correlated with cDNA levels of multiple mitobiogenesis genes. In CTL, but not in AD, hippocampal cDNA levels of PGC1α were positively correlated with multiple downstream mitobiogenesis factors. Mitochondrial DNA copy numbers in pyramidal neurons did not correlate with hippocampal Aβ1-42 levels. After 48 h exposure of H9 human neural stem cells to the neurotoxic fragment Aβ25-35, mtDNA copy numbers were not significantly altered. In summary, AD postmortem hippocampal pyramidal neurons have reduced mtDNA copy numbers. Mitochondrial biogenesis pathway signaling relationships are disrupted in AD, but are mostly preserved in CTL. Our findings implicate complex alterations of mitochondria-host cell relationships in AD.
Presence of high-risk human papillomavirus genotype and human immunodeficiency virus DNA in anal high-grade and low-grade squamous intraepithelial lesions.

PubMed

Shiramizu, Bruce; Liang, Chin-Yuan; Agsalda-Garcia, Melissa; Nagata, Ian; Milne, Cris; Zhu, Xuemei; Killeen, Jeffrey; Berry, J Michael; Goodman, Marc T

2013-01-01

Human immunodeficiency virus type 1 (HIV)-infected individuals are at risk for anal cancer, which is caused by human papillomavirus (HPV). The relationship between HIV and HPV that leads to anal cancer remains unclear. Recent data, however, suggest that the continued persistence of HIV DNA in patients treated with combined antiretroviral therapy leads to progression of HIV disease and other HIV-associated complications. Therefore, we investigated the relationship among anal low- and high-grade squamous intraepithelial lesions (LGSIL/HGSIL), high-risk HPV genotypes, and high HIV DNA copy numbers. Anal cytology specimens were assayed for HPV genotype and HIV DNA copy number. High-risk HPV genotypes (odds ratio OR: 3.73; 95% confidence interval CI: 1.08-12.91; p=0.04) and high HIV DNA copy numbers (OR(per 100 HIV DNA copies): 1.13; 95% CI: 1.01-1.27, p=0.04) were both associated with LGSIL/HGSIL. When considering both high-risk HPV genotypes and HIV DNA copy numbers in predicting LGSIL/HGSIL, HIV DNA copy number was significant (OR(per 100 HIV DNA copies): 1.09; 95% CI: 0.96-1.23, p=0.04) but not high-risk HPV genotypes (OR: 2.30, p=0.28), which did not change when adjusted for nadir CD4 cell count and HIV RNA levels. The findings warrant further investigation of HIV DNA and its relationship with HPV in LGSIL/HGSIL pathogenesis.
Molecular cloning of a cDNA encoding the precursor of adenoregulin from frog skin. Relationships with the vertebrate defensive peptides, dermaseptins.

PubMed

Amiche, M; Ducancel, F; Lajeunesse, E; Boulain, J C; Ménez, A; Nicolas, P

1993-03-31

Adenoregulin has recently been isolated from Phyllomedusa skin as a 33 amino acid residues peptide which enhanced binding of agonists to the A1 adenosine receptor. In order to study the structure of the precursor of adenoregulin we constructed a cDNA library from mRNAs extracted from the skin of Phyllomedusa bicolor. We detected the complete nucleotide sequence of a cDNA encoding the adenoregulin biosynthetic precursor. The deduced sequence of the precursor is 81 amino acids long, exhibits a putative signal sequence at the NH2 terminus and contains a single copy of the biologically active peptide at the COOH terminus. Structural and conformational homologies that are observed between adenoregulin and the dermaseptins, antimicrobial peptides exhibiting strong membranolytic activities against various pathogenic agents, suggest that adenoregulin is an additional member of the growing family of cytotropic antimicrobial peptides that allow vertebrate animals to defend themselves against microorganisms. As such, the adenosine receptor regulating activity of adenoregulin could be due to its ability to interact with and disrupt membranes lipid bilayers.
Using circulating cell-free DNA to monitor personalized cancer therapy.

PubMed

Oellerich, Michael; Schütz, Ekkehard; Beck, Julia; Kanzow, Philipp; Plowman, Piers N; Weiss, Glen J; Walson, Philip D

2017-05-01

High-quality genomic analysis is critical for personalized pharmacotherapy in patients with cancer. Tumor-specific genomic alterations can be identified in cell-free DNA (cfDNA) from patient blood samples and can complement biopsies for real-time molecular monitoring of treatment, detection of recurrence, and tracking resistance. cfDNA can be especially useful when tumor tissue is unavailable or insufficient for testing. For blood-based genomic profiling, next-generation sequencing (NGS) and droplet digital PCR (ddPCR) have been successfully applied. The US Food and Drug Administration (FDA) recently approved the first such "liquid biopsy" test for EGFR mutations in patients with non-small cell lung cancer (NSCLC). Such non-invasive methods allow for the identification of specific resistance mutations selected by treatment, such as EGFR T790M, in patients with NSCLC treated with gefitinib. Chromosomal aberration pattern analysis by low coverage whole genome sequencing is a more universal approach based on genomic instability. Gains and losses of chromosomal regions have been detected in plasma tumor-specific cfDNA as copy number aberrations and can be used to compute a genomic copy number instability (CNI) score of cfDNA. A specific CNI index obtained by massive parallel sequencing discriminated those patients with prostate cancer from both healthy controls and men with benign prostatic disease. Furthermore, androgen receptor gene aberrations in cfDNA were associated with therapeutic resistance in metastatic castration resistant prostate cancer. Change in CNI score has been shown to serve as an early predictor of response to standard chemotherapy for various other cancer types (e.g. NSCLC, colorectal cancer, pancreatic ductal adenocarcinomas). CNI scores have also been shown to predict therapeutic responses to immunotherapy. Serial genomic profiling can detect resistance mutations up to 16 weeks before radiographic progression. There is a potential for cost savings when ineffective use of expensive new anticancer drugs is avoided or halted. Challenges for routine implementation of liquid biopsy tests include the necessity of specialized personnel, instrumentation, and software, as well as further development of quality management (e.g. external quality control). Validation of blood-based tumor genomic profiling in additional multicenter outcome studies is necessary; however, cfDNA monitoring can provide clinically important actionable information for precision oncology approaches.

The campaign to DNA barcode all fishes, FISH-BOL.

PubMed

Ward, R D; Hanner, R; Hebert, P D N

2009-02-01

FISH-BOL, the Fish Barcode of Life campaign, is an international research collaboration that is assembling a standardized reference DNA sequence library for all fishes. Analysis is targeting a 648 base pair region of the mitochondrial cytochrome c oxidase I (COI) gene. More than 5000 species have already been DNA barcoded, with an average of five specimens per species, typically vouchers with authoritative identifications. The barcode sequence from any fish, fillet, fin, egg or larva can be matched against these reference sequences using BOLD; the Barcode of Life Data System (http://www.barcodinglife.org). The benefits of barcoding fishes include facilitating species identification, highlighting cases of range expansion for known species, flagging previously overlooked species and enabling identifications where traditional methods cannot be applied. Results thus far indicate that barcodes separate c. 98 and 93% of already described marine and freshwater fish species, respectively. Several specimens with divergent barcode sequences have been confirmed by integrative taxonomic analysis as new species. Past concerns in relation to the use of fish barcoding for species discrimination are discussed. These include hybridization, recent radiations, regional differentiation in barcode sequences and nuclear copies of the barcode region. However, current results indicate these issues are of little concern for the great majority of specimens.
Small gene family encoding an eggshell (chorion) protein of the human parasite Schistosoma mansoni

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bobek, L.A.; Rekosh, D.M.; Lo Verde, P.T.

1988-08-01

The authors isolated six independent genomic clones encoding schistosome chorion or eggshell proteins from a Schistosoma mansoni genomic library. A linkage map of five of the clones spanning 35 kilobase pairs (kbp) of the S. mansoni genome was constructed. The region contained two eggshell protein genes closely linked, separated by 7.5 kbp of intergenic DNA. The two genes of the cluster were arranged in the same orientation, that is, they were transcribed from the same strand. The sixth clone probably represents a third copy of the eggshell gene that is not contained within the 35-kbp region. The 5- end ofmore » the mRNA transcribed from these genes was defined by primer extension directly off the RNA. The ATCAT cap site sequence was homologous to a silkmoth chorion PuTCATT cap site sequence, where Pu indicates any purine. DNA sequence analysis showed that there were no introns in these genes. The DNA sequences of the three genes were very homologous to each other and to a cDNA clone, pSMf61-46, differing only in three or four nucleotices. A multiple TATA box was located at positions -23 to -31, and a CAAAT sequence was located at -52 upstream of the eggshell transcription unit. Comparison of sequences in regions further upstream with silkmoth and Drosophila sequences revealed very short elements that were shared. One such element, TCACGT, recently shown to be an essential cis-regulatory element for silkmoth chorion gene promoter function, was found at a similar position in all three organisms.« less
The cytochrome oxidase subunit I and subunit III genes in Oenothera mitochondria are transcribed from identical promoter sequences

PubMed Central

Hiesel, Rudolf; Schobel, Werner; Schuster, Wolfgang; Brennicke, Axel

1987-01-01

Two loci encoding subunit III of the cytochrome oxidase (COX) in Oenothera mitochondria have been identified from a cDNA library of mitochondrial transcripts. A 657-bp sequence block upstream from the open reading frame is also present in the two copies of the COX subunit I gene and is presumably involved in homologous sequence rearrangement. The proximal points of sequence rearrangements are located 3 bp upstream from the COX I and 1139 bp upstream from the COX III initiation codons. The 5'-termini of both COX I and COX III mRNAs have been mapped in this common sequence confining the promoter region for the Oenothera mitochondrial COX I and COX III genes to the homologous sequence block. ImagesFig. 5. PMID:15981332
Xenopus laevis ribosomal protein genes: isolation of recombinant cDNA clones and study of the genomic organization.

PubMed Central

Bozzoni, I; Beccari, E; Luo, Z X; Amaldi, F

1981-01-01

Poly-A+ mRNA from Xenopus laevis oocytes, partially enriched for r-protein coding capacity has been used as starting material for preparing a cDNA bank in plasmid pBR322. The clones containing sequences specific for r-proteins have been selected by translation of the complementary mRNAs. Clones for six different r-proteins have been identified and utilized as probes for studying their genomic organization. Two gene copies per haploid genome were found for r-proteins L1, L14, S19, and four-five for protein S1, S8 and L32. Moreover a population polymorphism has been observed for the genomic regions containing sequences for r-protein S1, S8 and L14. Images PMID:6112733
New insights into Trypanosoma cruzi evolution, genotyping and molecular diagnostics from satellite DNA sequence analysis.

PubMed

Ramírez, Juan C; Torres, Carolina; Curto, María de Los A; Schijman, Alejandro G

2017-12-01

Trypanosoma cruzi has been subdivided into seven Discrete Typing Units (DTUs), TcI-TcVI and Tcbat. Two major evolutionary models have been proposed to explain the origin of hybrid lineages, but while it is widely accepted that TcV and TcVI are the result of genetic exchange between TcII and TcIII strains, the origin of TcIII and TcIV is still a matter of debate. T. cruzi satellite DNA (SatDNA), comprised of 195 bp units organized in tandem repeats, from both TcV and TcVI stocks were found to have SatDNA copies type TcI and TcII; whereas contradictory results were observed for TcIII stocks and no TcIV sequence has been analyzed yet. Herein, we have gone deeper into this matter analyzing 335 distinct SatDNA sequences from 19 T. cruzi stocks representative of DTUs TcI-TcVI for phylogenetic inference. Bayesian phylogenetic tree showed that all sequences were grouped in three major clusters, which corresponded to sequences from DTUs TcI/III, TcII and TcIV; whereas TcV and TcVI stocks had two sets of sequences distributed into TcI/III and TcII clusters. As expected, the lowest genetic distances were found between TcI and TcIII, and between TcV and TcVI sequences; whereas the highest ones were observed between TcII and TcI/III, and among TcIV sequences and those from the remaining DTUs. In addition, signature patterns associated to specific T. cruzi lineages were identified and new primers that improved SatDNA-based qPCR sensitivity were designed. Our findings support the theory that TcIII is not the result of a hybridization event between TcI and TcII, and that TcIV had an independent origin from the other DTUs, contributing to clarifying the evolutionary history of T. cruzi lineages. Moreover, this work opens the possibility of typing samples from Chagas disease patients with low parasitic loads and improving molecular diagnostic methods of T. cruzi infection based on SatDNA sequence amplification.
Presequence-Independent Mitochondrial Import of DNA Ligase Facilitates Establishment of Cell Lines with Reduced mtDNA Copy Number

PubMed Central

Spadafora, Domenico; Kozhukhar, Natalia; Alexeyev, Mikhail F.

2016-01-01

Due to the essential role played by mitochondrial DNA (mtDNA) in cellular physiology and bioenergetics, methods for establishing cell lines with altered mtDNA content are of considerable interest. Here, we report evidence for the existence in mammalian cells of a novel, low- efficiency, presequence-independent pathway for mitochondrial protein import, which facilitates mitochondrial uptake of such proteins as Chlorella virus ligase (ChVlig) and Escherichia coli LigA. Mouse cells engineered to depend on this pathway for mitochondrial import of the LigA protein for mtDNA maintenance had severely (up to >90%) reduced mtDNA content. These observations were used to establish a method for the generation of mouse cell lines with reduced mtDNA copy number by, first, transducing them with a retrovirus encoding LigA, and then inactivating in these transductants endogenous Lig3 with CRISPR-Cas9. Interestingly, mtDNA depletion to an average level of one copy per cell proceeds faster in cells engineered to maintain mtDNA at low copy number. This makes a low-mtDNA copy number phenotype resulting from dependence on mitochondrial import of DNA ligase through presequence-independent pathway potentially useful for rapidly shifting mtDNA heteroplasmy through partial mtDNA depletion. PMID:27031233
DNA looping by FokI: the impact of synapse geometry on loop topology at varied site orientations

PubMed Central

Rusling, David A.; Laurens, Niels; Pernstich, Christian; Wuite, Gijs J. L.; Halford, Stephen E.

2012-01-01

Most restriction endonucleases, including FokI, interact with two copies of their recognition sequence before cutting DNA. On DNA with two sites they act in cis looping out the intervening DNA. While many restriction enzymes operate symmetrically at palindromic sites, FokI acts asymmetrically at a non-palindromic site. The directionality of its sequence means that two FokI sites can be bridged in either parallel or anti-parallel alignments. Here we show by biochemical and single-molecule biophysical methods that FokI aligns two recognition sites on separate DNA molecules in parallel and that the parallel arrangement holds for sites in the same DNA regardless of whether they are in inverted or repeated orientations. The parallel arrangement dictates the topology of the loop trapped between sites in cis: the loop from inverted sites has a simple 180° bend, while that with repeated sites has a convoluted 360° turn. The ability of FokI to act at asymmetric sites thus enabled us to identify the synapse geometry for sites in trans and in cis, which in turn revealed the relationship between synapse geometry and loop topology. PMID:22362745
A novel, sensitive and label-free loop-mediated isothermal amplification detection method for nucleic acids using luminophore dyes.

PubMed

Roy, Sharmili; Wei, Sim Xiao; Ying, Jean Liew Zhi; Safavieh, Mohammadali; Ahmed, Minhaz Uddin

2016-12-15

Electrochemiluminescence (ECL) has been widely rendered for nucleic acid testing. Here, we integrate loop-mediated isothermal amplification (LAMP) with ECL technique for DNA detection and quantification. The target LAMP DNA bound electrostatically with [Ru(bpy)3](+2) on the carbon electrode surface, and an ECL reaction was triggered by tripropylamine (TPrA) to yield luminescence. We illustrated this method as a new and highly sensitive strategy for the detection of sequence-specific DNA from different meat species at picogram levels. The proposed strategy renders the signal amplification capacities of TPrA and combines LAMP with inherently high sensitivity of the ECL technique, to facilitate the detection of low quantities of DNA. By leveraging this technique, target DNA of Sus scrofa (pork) meat was detected as low as 1pg/µL (3.43×10(-1)copies/µL). In addition, the proposed technique was applied for detection of Bacillus subtilis DNA samples and detection limit of 10pg/µL (2.2×10(3)copies/µL) was achieved. The advantages of being isothermal, sensitive and robust with ability for multiplex detection of bio-analytes makes this method a facile and appealing sensing modality in hand-held devices to be used at the point-of-care (POC). Copyright © 2016 Elsevier B.V. All rights reserved.
Prevalence and persistence of male DNA identified in mixed saliva samples after intense kissing.

PubMed

Kamodyová, Natália; Durdiaková, Jaroslava; Celec, Peter; Sedláčková, Tatiana; Repiská, Gabriela; Sviežená, Barbara; Minárik, Gabriel

2013-01-01

Identification of foreign biological material by genetic profiling is widely used in forensic DNA testing in different cases of sexual violence, sexual abuse or sexual harassment. In all these kinds of sexual assaults, the perpetrator could constrain the victim to kissing. The value of the victim's saliva taken after such an assault has not been investigated in the past with currently widely used molecular methods of extremely high sensitivity (e.g. qPCR) and specificity (e.g. multiplex Y-STR PCR). In our study, 12 voluntary pairs were tested at various intervals after intense kissing and saliva samples were taken from the women to assess the presence of male DNA. Sensitivity-focused assays based on the SRY (single-copy gene) and DYS (multi-copy gene) sequence motifs confirmed the presence of male DNA in female saliva after 10 and even 60min after kissing, respectively. For specificity, standard multiplex Y-STR PCR profiling was performed and male DNA was found in female saliva samples, as the entire Y-STR profile, even after 30min in one sample. Our study confirms that foreign DNA tends to persist for a restricted period of time in the victim's mouth, can be isolated from saliva after prompt collection and can be used as a valuable source of evidence. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Molecular Characterization of Transgenic Events Using Next Generation Sequencing Approach.

PubMed

Guttikonda, Satish K; Marri, Pradeep; Mammadov, Jafar; Ye, Liang; Soe, Khaing; Richey, Kimberly; Cruse, James; Zhuang, Meibao; Gao, Zhifang; Evans, Clive; Rounsley, Steve; Kumpatla, Siva P

2016-01-01

Demand for the commercial use of genetically modified (GM) crops has been increasing in light of the projected growth of world population to nine billion by 2050. A prerequisite of paramount importance for regulatory submissions is the rigorous safety assessment of GM crops. One of the components of safety assessment is molecular characterization at DNA level which helps to determine the copy number, integrity and stability of a transgene; characterize the integration site within a host genome; and confirm the absence of vector DNA. Historically, molecular characterization has been carried out using Southern blot analysis coupled with Sanger sequencing. While this is a robust approach to characterize the transgenic crops, it is both time- and resource-consuming. The emergence of next-generation sequencing (NGS) technologies has provided highly sensitive and cost- and labor-effective alternative for molecular characterization compared to traditional Southern blot analysis. Herein, we have demonstrated the successful application of both whole genome sequencing and target capture sequencing approaches for the characterization of single and stacked transgenic events and compared the results and inferences with traditional method with respect to key criteria required for regulatory submissions.
Organization and transient expression of the gene for human U11 snRNA

PubMed Central

Clemens, Suter-Crazzolara; Walter, Keller

1991-01-01

The nucleotide sequence of U11 small nuclear RNA, a minor U RNA from HeLa cells, was determined. Computer analysis of the sequence (135 residues) predicts two strong hairpin loops which are separated by seventeen nucleotides containing an Sm binding site (AAUUUUUUGG). A synthetic gene was constructed in which the coding region of U11 RNA is under the control of a T7 promoter. This vector can be used to produce U11 RNA in vitro. Southern hybridization and PCR analysis of HeLa genomic DNA suggest that U11 RNA is encoded by a single copy gene, and that at least three genomic regions could be U11 RNA pseudogenes. A HeLa genomic copy of a U11 gene was isolated by inverted PCR. This gene contains the U11 RNA coding sequence and several sequence elements unique for the U RNA genes. These include a Distal Sequence Element (DSE, ATTTGCATA) present between positions −215 and −223 relative to the start of transcription; a Proximal Sequence Element (PSE, TTCACCTTTACCAAAAATG) located between positions −43 and −63 ; and a 3′box (GTTAGGCGAAATATTA) between positions +150 and +166. Transfection of HeLa cells with this gene revealed that it is functioning in vivo and can produce U11 RNA. PMID:1820214
The site-specific ribosomal insertion element type II of Bombyx mori (R2Bm) contains the coding sequence for a reverse transcriptase-like enzyme.

PubMed Central

Burke, W D; Calalang, C C; Eickbush, T H

1987-01-01

Two classes of DNA elements interrupt a fraction of the rRNA repeats of Bombyx mori. We have analyzed by genomic blotting and sequence analysis one class of these elements which we have named R2. These elements occupy approximately 9% of the rDNA units of B. mori and appear to be homologous to the type II rDNA insertions detected in Drosophila melanogaster. Approximately 25 copies of R2 exist within the B. mori genome, of which at least 20 are located at a precise location within otherwise typical rDNA units. Nucleotide sequence analysis has revealed that the 4.2-kilobase-pair R2 element has a single large open reading frame, occupying over 82% of the total length of the element. The central region of this 1,151-amino-acid open reading frame shows homology to the reverse transcriptase enzymes found in retroviruses and certain transposable elements. Amino acid homology of this region is highest to the mobile line 1 elements of mammals, followed by the mitochondrial type II introns of fungi, and the pol gene of retroviruses. Less homology exists with transposable elements of D. melanogaster and Saccharomyces cerevisiae. Two additional regions of sequence homology between L1 and R2 elements were also found outside the reverse transcriptase region. We suggest that the R2 elements are retrotransposons that are site specific in their insertion into the genome. Such mobility would enable these elements to occupy a small fraction of the rDNA units of B. mori despite their continual elimination from the rDNA locus by sequence turnover. Images PMID:2439905
DNA Methylation Patterns in Normal Tissue Correlate more Strongly with Breast Cancer Status than Copy-Number Variants.

PubMed

Gao, Yang; Widschwendter, Martin; Teschendorff, Andrew E

2018-05-04

Normal tissue at risk of neoplastic transformation is characterized by somatic mutations, copy-number variation and DNA methylation changes. It is unclear however, which type of alteration may be more informative of cancer risk. We analyzed genome-wide DNA methylation and copy-number calls from the same DNA assay in a cohort of healthy breast samples and age-matched normal samples collected adjacent to breast cancer. Using statistical methods to adjust for cell type heterogeneity, we show that DNA methylation changes can discriminate normal-adjacent from normal samples better than somatic copy-number variants. We validate this important finding in an independent dataset. These results suggest that DNA methylation alterations in the normal cell of origin may offer better cancer risk prediction and early detection markers than copy-number changes. Copyright © 2018. Published by Elsevier B.V.
The coding region of the UFGT gene is a source of diagnostic SNP markers that allow single-locus DNA genotyping for the assessment of cultivar identity and ancestry in grapevine (Vitis vinifera L.)

PubMed Central

2013-01-01

Background Vitis vinifera L. is one of society’s most important agricultural crops with a broad genetic variability. The difficulty in recognizing grapevine genotypes based on ampelographic traits and secondary metabolites prompted the development of molecular markers suitable for achieving variety genetic identification. Findings Here, we propose a comparison between a multi-locus barcoding approach based on six chloroplast markers and a single-copy nuclear gene sequencing method using five coding regions combined with a character-based system with the aim of reconstructing cultivar-specific haplotypes and genotypes to be exploited for the molecular characterization of 157 V. vinifera accessions. The analysis of the chloroplast target regions proved the inadequacy of the DNA barcoding approach at the subspecies level, and hence further DNA genotyping analyses were targeted on the sequences of five nuclear single-copy genes amplified across all of the accessions. The sequencing of the coding region of the UFGT nuclear gene (UDP-glucose: flavonoid 3-0-glucosyltransferase, the key enzyme for the accumulation of anthocyanins in berry skins) enabled the discovery of discriminant SNPs (1/34 bp) and the reconstruction of 130 V. vinifera distinct genotypes. Most of the genotypes proved to be cultivar-specific, and only few genotypes were shared by more, although strictly related, cultivars. Conclusion On the whole, this technique was successful for inferring SNP-based genotypes of grapevine accessions suitable for assessing the genetic identity and ancestry of international cultivars and also useful for corroborating some hypotheses regarding the origin of local varieties, suggesting several issues of misidentification (synonymy/homonymy). PMID:24298902
The Paramecium germline genome provides a niche for intragenic parasitic DNA: evolutionary dynamics of internal eliminated sequences.

PubMed

Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

2012-01-01

Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of -45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a -10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated.
The Paramecium Germline Genome Provides a Niche for Intragenic Parasitic DNA: Evolutionary Dynamics of Internal Eliminated Sequences

PubMed Central

Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E.; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

2012-01-01

Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated. PMID:23071448
Mitochondrial DNA content and 4977 bp deletion in unfertilized oocytes.

PubMed

Chan, C C W; Liu, V W S; Lau, E Y L; Yeung, W S B; Ng, E H Y; Ho, P C

2005-12-01

Previous studies analysing the incidences of mitochondrial DNA (mtDNA) deletions and mtDNA content in unfertilized oocytes in relation to donors' age have been controversial. The objective of the study was to compare these two parameters in unfertilized oocytes and relate them to the donors' age. Fifty-two women donated 155 unfertilized metaphase II (MII) oocytes. The incidence of 4977 bp deletion was 34.6%, and the mtDNA copy number was 598 350 +/- 265 862. Women >or=35 years of age had a significantly higher incidence of 4977 bp deletion, lower mtDNA copy number, higher FSH level and poorer ovarian response when compared with younger women. The mtDNA copy number was negatively correlated with the donor's age. The higher incidence of mtDNA deletion and lower mtDNA copy number in older women suggested that these two parameters may reflect ovarian ageing.
[Relationship between mitochondrial DNA copy number, membrane potential of human embryo and embryo morphology].

PubMed

Zhao, H; Teng, X M; Li, Y F

2017-11-25

Objective: To explore the relationship between the embryo with the different morphological types in the third day and its mitochondrial copy number, the membrane potential. Methods: Totally 117 embryos with poor development after normal fertilization and were not suitable transferred in the fresh cycle and 106 frozen embryos that were discarded voluntarily by infertility patients with in vitro fertilization-embryo transfer after successful pregnancy were selected. According to evaluation of international standard in embryos, all cleavage stage embryos were divided into class Ⅰ frozen embryo group ( n= 64), class Ⅱ frozen embryo group ( n= 42) and class Ⅲ fresh embryonic group (not transplanted embryos; n= 117). Real-time PCR and confocal microscopy methods were used to detect mitochondrial DNA (mtDNA) copy number and the mitochondrial membrane potential of a single embryo. The differences between embryo quality and mtDNA copy number and membrane potential of each group were compared. Results: The copy number of mtDNA and the mitochondrial membrane potential in class Ⅲ fresh embryonic group [(1.7±1.0)×10(5) copy/μl, 1.56±0.32] were significantly lower than those in class Ⅰ frozen embryo group [(3.4±1.7)×10(5) copy/μl, 2.66±0.21] and class Ⅱ frozen embryo group [(2.6±1.2)×10(5) copy/μl, 1.80±0.32; all P< 0.05]. The copy number of mtDNA and the mitochondrial membrane potential in classⅠ frozen embryo group were significantly higher than those in classⅡ frozen embryo group (both P< 0.05). Conclusion: The mtDNA copy number and the mitochondrial membrane potential of embryos of the better quality embryo are higher.
Colorectal cancer cell lines are representative models of the main molecular subtypes of primary cancer.

PubMed

Mouradov, Dmitri; Sloggett, Clare; Jorissen, Robert N; Love, Christopher G; Li, Shan; Burgess, Antony W; Arango, Diego; Strausberg, Robert L; Buchanan, Daniel; Wormald, Samuel; O'Connor, Liam; Wilding, Jennifer L; Bicknell, David; Tomlinson, Ian P M; Bodmer, Walter F; Mariadason, John M; Sieber, Oliver M

2014-06-15

Human colorectal cancer cell lines are used widely to investigate tumor biology, experimental therapy, and biomarkers. However, to what extent these established cell lines represent and maintain the genetic diversity of primary cancers is uncertain. In this study, we profiled 70 colorectal cancer cell lines for mutations and DNA copy number by whole-exome sequencing and SNP microarray analyses, respectively. Gene expression was defined using RNA-Seq. Cell line data were compared with those published for primary colorectal cancers in The Cancer Genome Atlas. Notably, we found that exome mutation and DNA copy-number spectra in colorectal cancer cell lines closely resembled those seen in primary colorectal tumors. Similarities included the presence of two hypermutation phenotypes, as defined by signatures for defective DNA mismatch repair and DNA polymerase ε proofreading deficiency, along with concordant mutation profiles in the broadly altered WNT, MAPK, PI3K, TGFβ, and p53 pathways. Furthermore, we documented mutations enriched in genes involved in chromatin remodeling (ARID1A, CHD6, and SRCAP) and histone methylation or acetylation (ASH1L, EP300, EP400, MLL2, MLL3, PRDM2, and TRRAP). Chromosomal instability was prevalent in nonhypermutated cases, with similar patterns of chromosomal gains and losses. Although paired cell lines derived from the same tumor exhibited considerable mutation and DNA copy-number differences, in silico simulations suggest that these differences mainly reflected a preexisting heterogeneity in the tumor cells. In conclusion, our results establish that human colorectal cancer lines are representative of the main subtypes of primary tumors at the genomic level, further validating their utility as tools to investigate colorectal cancer biology and drug responses. ©2014 American Association for Cancer Research.
Rat L (long interspersed repeated DNA) elements contain guanine-rich homopurine sequences that induce unpairing of contiguous duplex DNA.

PubMed Central

Usdin, K; Furano, A V

1988-01-01

The L family (long interspersed repeated DNA) of mobile genetic elements is a persistent feature of the mammalian genome. In rats, this family contains approximately equal to 40,000 members and accounts for approximately equal to 10% of the haploid genome. We demonstrate here that the guanine-rich homopurine stretches located at the right end of L-DNA induce oligonucleotide uptake by contiguous duplex DNA. The uptake is dependent on negative supercoiling and the length of the homopurine stretch and occurs even when the L-DNA homopurine stretches are introduced into a different DNA environment. The bound oligomer primes DNA synthesis when DNA polymerase and deoxyribonucleoside triphosphates are added, resulting in a faithful copy of the template to which the oligonucleotide had bound. The implications of this property of the L-DNA guanine-rich homopurine stretches in the amplification, recombination, and dispersal of L elements is discussed. Images PMID:2837766

The effects of DNA supercoiling on G-quadruplex formation.

PubMed

Sekibo, Doreen A T; Fox, Keith R

2017-12-01

Guanine-rich DNAs can fold into four-stranded structures that contain stacks of G-quartets. Bioinformatics studies have revealed that G-rich sequences with the potential to adopt these structures are unevenly distributed throughout genomes, and are especially found in gene promoter regions. With the exception of the single-stranded telomeric DNA, all genomic G-rich sequences will always be present along with their C-rich complements, and quadruplex formation will be in competition with the corresponding Watson-Crick duplex. Quadruplex formation must therefore first require local dissociation (melting) of the duplex strands. Since negative supercoiling is known to facilitate the formation of alternative DNA structures, we have investigated G-quadruplex formation within negatively supercoiled DNA plasmids. Plasmids containing multiple copies of (G3T)n and (G3T4)n repeats, were probed with dimethylsulphate, potassium permanganate and S1 nuclease. While dimethylsulphate footprinting revealed some evidence for G-quadruplex formation in (G3T)n sequences, this was not affected by supercoiling, and permanganate failed to detect exposed thymines in the loop regions. (G3T4)n sequences were not protected from DMS and showed no reaction with permanganate. Similarly, both S1 nuclease and 2D gel electrophoresis of DNA topoisomers did not detect any supercoil-dependent structural transitions. These results suggest that negative supercoiling alone is not sufficient to drive G-quadruplex formation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health

PubMed Central

Martin, William F.

2017-01-01

Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. PMID:28444372
SSR_pipeline--computer software for the identification of microsatellite sequences from paired-end Illumina high-throughput DNA sequence data

USGS Publications Warehouse

Miller, Mark P.; Knaus, Brian J.; Mullins, Thomas D.; Haig, Susan M.

2013-01-01

SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (SSRs; for example, microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains three analysis modules along with a fourth control module that can be used to automate analyses of large volumes of data. The modules are used to (1) identify the subset of paired-end sequences that pass quality standards, (2) align paired-end reads into a single composite DNA sequence, and (3) identify sequences that possess microsatellites conforming to user specified parameters. Each of the three separate analysis modules also can be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc). All modules are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, Windows). The program suite relies on a compiled Python extension module to perform paired-end alignments. Instructions for compiling the extension from source code are provided in the documentation. Users who do not have Python installed on their computers or who do not have the ability to compile software also may choose to download packaged executable files. These files include all Python scripts, a copy of the compiled extension module, and a minimal installation of Python in a single binary executable. See program documentation for more information.
Cytophotometric and biochemical analyses of DNA in pentaploid and diploid Agave species.

PubMed

Cavallini, A; Natali, L; Cionini, G; Castorena-Sanchez, I

1996-04-01

Nuclear DNA content, chromatin structure, and DNA composition were investigated in four Agave species: two diploid, Agave tequilana Weber and Agave angustifolia Haworth var. marginata Hort., and two pentaploid, Agave fourcroydes Lemaire and Agave sisalana Perrine. It was determined that the genome size of pentaploid species is nearly 2.5 times that of diploid ones. Cytophotometric analyses of chromatin structure were performed following Feulgen or DAPI staining to determine optical density profiles of interphase nuclei. Pentaploid species showed higher frequencies of condensed chromatin (heterochromatin) than diploid species. On the other hand, a lower frequency of A-T rich (DAPI stained) heterochromatin was found in pentaploid species than in diploid ones, indicating that heterochromatin in pentaploid species is made up of sequences with base compositions different from those of diploid species. Since thermal denaturation profiles of extracted DNA showed minor variations in the base composition of the genomes of the four species, it is supposed that, in pentaploid species, the large heterochromatin content is not due to an overrepresentation of G-C repetitive sequences but rather to the condensation of nonrepetitive sequences, such as, for example, redundant gene copies switched off in the polyploid complement. It is suggested that speciation in the genus Agave occurs through point mutations and minor DNA rearrangements, as is also indicated by the relative stability of the karyotype of this genus. Key words : Agave, DNA cytophotometry, DNA melting profiles, chromatin structure, genome size.
Classification of Plant Associated Bacteria Using RIF, a Computationally Derived DNA Marker

PubMed Central

Schneider, Kevin L.; Marrero, Glorimar; Alvarez, Anne M.; Presting, Gernot G.

2011-01-01

A DNA marker that distinguishes plant associated bacteria at the species level and below was derived by comparing six sequenced genomes of Xanthomonas, a genus that contains many important phytopathogens. This DNA marker comprises a portion of the dnaA replication initiation factor (RIF). Unlike the rRNA genes, dnaA is a single copy gene in the vast majority of sequenced bacterial genomes, and amplification of RIF requires genus-specific primers. In silico analysis revealed that RIF has equal or greater ability to differentiate closely related species of Xanthomonas than the widely used ribosomal intergenic spacer region (ITS). Furthermore, in a set of 263 Xanthomonas, Ralstonia and Clavibacter strains, the RIF marker was directly sequenced in both directions with a success rate approximately 16% higher than that for ITS. RIF frameworks for Xanthomonas, Ralstonia and Clavibacter were constructed using 682 reference strains representing different species, subspecies, pathovars, races, hosts and geographic regions, and contain a total of 109 different RIF sequences. RIF sequences showed subspecific groupings but did not place strains of X. campestris or X. axonopodis into currently named pathovars nor R. solanacearum strains into their respective races, confirming previous conclusions that pathovar and race designations do not necessarily reflect genetic relationships. The RIF marker also was sequenced for 24 reference strains from three genera in the Enterobacteriaceae: Pectobacterium, Pantoea and Dickeya. RIF sequences of 70 previously uncharacterized strains of Ralstonia, Clavibacter, Pectobacterium and Dickeya matched, or were similar to, those of known reference strains, illustrating the utility of the frameworks to classify bacteria below the species level and rapidly match unknown isolates to reference strains. The RIF sequence frameworks are available at the online RIF database, RIFdb, and can be queried for diagnostic purposes with RIF sequences obtained from unknown strains in both chromatogram and FASTA format. PMID:21533033
Exome copy number variation detection: Use of a pool of unrelated healthy tissue as reference sample.

PubMed

Wenric, Stephane; Sticca, Tiberio; Caberg, Jean-Hubert; Josse, Claire; Fasquelle, Corinne; Herens, Christian; Jamar, Mauricette; Max, Stéphanie; Gothot, André; Caers, Jo; Bours, Vincent

2017-01-01

An increasing number of bioinformatic tools designed to detect CNVs (copy number variants) in tumor samples based on paired exome data where a matched healthy tissue constitutes the reference have been published in the recent years. The idea of using a pool of unrelated healthy DNA as reference has previously been formulated but not thoroughly validated. As of today, the gold standard for CNV calling is still aCGH but there is an increasing interest in detecting CNVs by exome sequencing. We propose to design a metric allowing the comparison of two CNV profiles, independently of the technique used and assessed the validity of using a pool of unrelated healthy DNA instead of a matched healthy tissue as reference in exome-based CNV detection. We compared the CNV profiles obtained with three different approaches (aCGH, exome sequencing with a matched healthy tissue as reference, exome sequencing with a pool of eight unrelated healthy tissue as reference) on three multiple myeloma samples. We show that the usual analyses performed to compare CNV profiles (deletion/amplification ratios and CNV size distribution) lack in precision when confronted with low LRR values, as they only consider the binary status of each CNV. We show that the metric-based distance constitutes a more accurate comparison of two CNV profiles. Based on these analyses, we conclude that a reliable picture of CNV alterations in multiple myeloma samples can be obtained from whole-exome sequencing in the absence of a matched healthy sample. © 2016 WILEY PERIODICALS, INC.
Detection and assessment of copy number variation using PacBio long-read and Illumina sequencing in New Zealand dairy cattle.

PubMed

Couldrey, C; Keehan, M; Johnson, T; Tiplady, K; Winkelman, A; Littlejohn, M D; Scott, A; Kemper, K E; Hayes, B; Davis, S R; Spelman, R J

2017-07-01

Single nucleotide polymorphisms have been the DNA variant of choice for genomic prediction, largely because of the ease of single nucleotide polymorphism genotype collection. In contrast, structural variants (SV), which include copy number variants (CNV), translocations, insertions, and inversions, have eluded easy detection and characterization, particularly in nonhuman species. However, evidence increasingly shows that SV not only contribute a substantial proportion of genetic variation but also have significant influence on phenotypes. Here we present the discovery of CNV in a prominent New Zealand dairy bull using long-read PacBio (Pacific Biosciences, Menlo Park, CA) sequencing technology and the Sniffles SV discovery tool (version 0.0.1; https://github.com/fritzsedlazeck/Sniffles). The CNV identified from long reads were compared with CNV discovered in the same bull from Illumina sequencing using CNVnator (read depth-based tool; Illumina Inc., San Diego, CA) as a means of validation. Subsequently, further validation was undertaken using whole-genome Illumina sequencing of 556 cattle representing the wider New Zealand dairy cattle population. Very limited overlap was observed in CNV discovered from the 2 sequencing platforms, in part because of the differences in size of CNV detected. Only a few CNV were therefore able to be validated using this approach. However, the ability to use CNVnator to genotype the 557 cattle for copy number across all regions identified as putative CNV allowed a genome-wide assessment of transmission level of copy number based on pedigree. The more highly transmissible a putative CNV region was observed to be, the more likely the distribution of copy number was multimodal across the 557 sequenced animals. Furthermore, visual assessment of highly transmissible CNV regions provided evidence supporting the presence of CNV across the sequenced animals. This transmission-based approach was able to confirm a subset of CNV that segregates in the New Zealand dairy cattle population. Genome-wide identification and validation of CNV is an important step toward their inclusion in genomic selection strategies. The Authors. Published by the Federation of Animal Science Societies and Elsevier Inc. on behalf of the American Dairy Science Association®. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/).
Dynamics of actin evolution in dinoflagellates.

PubMed

Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F

2011-04-01

Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provides broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' UnTranslated Region (UTR), although there was still limited amino acid variation between most sequences. Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.
Families of short interspersed elements in the genome of the oomycete plant pathogen, Phytophthora infestans.

PubMed

Whisson, Stephen C; Avrova, Anna O; Lavrova, Olga; Pritchard, Leighton

2005-04-01

The first known families of tRNA-related short interspersed elements (SINEs) in the oomycetes were identified by exploiting the genomic DNA sequence resources for the potato late blight pathogen, Phytophthora infestans. Fifteen families of tRNA-related SINEs, as well as predicted tRNAs, and other possible RNA polymerase III-transcribed sequences were identified. The size of individual elements ranges from 101 to 392 bp, representing sequences present from low (1) to highly abundant (over 2000) copy number in the P. infestans genome, based on quantitative PCR analysis. Putative short direct repeat sequences (6-14 bp) flanking the elements were also identified for eight of the SINEs. Predicted SINEs were named in a series prefixed infSINE (for infestans-SINE). Two SINEs were apparently present as multimers of tRNA-related units; four copies of a related unit for infSINEr, and two unrelated units for infSINEz. Two SINEs, infSINEh and infSINEi, were typically located within 400 bp of each other. These were also the only two elements identified as being actively transcribed in the mycelial stage of P. infestans by RT-PCR. It is possible that infSINEh and infSINEi represent active retrotransposons in P. infestans. Based on the quantitative PCR estimates of copy number for all of the elements identified, tRNA-related SINEs were estimated to comprise 0.3% of the 250 Mb P. infestans genome. InfSINE-related sequences were found to occur in species throughout the genus Phytophthora. However, seven elements were shown to be exclusive to P. infestans.
Contrasting patterns of evolution of 45S and 5S rDNA families uncover new aspects in the genome constitution of the agronomically important grass Thinopyrum intermedium (Triticeae).

PubMed

Mahelka, Václav; Kopecky, David; Baum, Bernard R

2013-09-01

We employed sequencing of clones and in situ hybridization (genomic and fluorescent in situ hybridization [GISH and rDNA-FISH]) to characterize both the sequence variation and genomic organization of 45S (herein ITS1-5.8S-ITS2 region) and 5S (5S gene + nontranscribed spacer) ribosomal DNA (rDNA) families in the allohexaploid grass Thinopyrum intermedium. Both rDNA families are organized within several rDNA loci within all three subgenomes of the allohexaploid species. Both families have undergone different patterns of evolution. The 45S rDNA family has evolved in a concerted manner: internal transcribed spacer (ITS) sequences residing within the arrays of two subgenomes out of three got homogenized toward one major ribotype, whereas the third subgenome contained a minor proportion of distinct unhomogenized copies. Homogenization mechanisms such as unequal crossover and/or gene conversion were coupled with the loss of certain 45S rDNA loci. Unlike in the 45S family, the data suggest that neither interlocus homogenization among homeologous chromosomes nor locus loss occurred in 5S rDNA. Consistently with other Triticeae, the 5S rDNA family in intermediate wheatgrass comprised two distinct array types-the long- and short-spacer unit classes. Within the long and short units, we distinguished five and three different types, respectively, likely representing homeologous unit classes donated by putative parental species. Although the major ITS ribotype corresponds in our phylogenetic analysis to the E-genome species, the minor ribotype corresponds to Dasypyrum. 5S sequences suggested the contributions from Pseudoroegneria, Dasypyrum, and Aegilops. The contribution from Aegilops to the intermediate wheatgrass' genome is a new finding with implications in wheat improvement. We discuss rDNA evolution and potential origin of intermediate wheatgrass.
Homologous recombination between overlapping thymidine kinase gene fragments stably inserted into a mouse cell genome.

PubMed Central

Lin, F L; Sternberg, N

1984-01-01

We have constructed a substrate to study homologous recombination between adjacent segments of chromosomal DNA. This substrate, designated lambda tk2 , consists of one completely defective and one partially defective herpes simplex virus thymidine kinase (tk) gene cloned in bacteriophage lambda DNA. The two genes have homologous 984-base-pair sequences and are separated by 3 kilobases of largely vector DNA. When lambda tk2 DNA was transferred into mouse LMtk- cells by the calcium phosphate method, rare TK+ transformants were obtained that contained many (greater than 40) copies of the unrecombined DNA. Tk- revertants, which had lost most of the copies of unrecombined DNA, were isolated from these TK+-transformed lines. Two of these Tk- lines were further studied by analysis of their reversion back to the Tk+ phenotype. They generated ca. 200 Tk+ revertants per 10(8) cells after growth in nonselecting medium for 5 days. All of these Tk+ revertants have an intact tk gene reconstructed by homologous recombination; they also retain various amounts of unrecombined lambda tk2 DNA. Southern blot analysis suggested that at least some of the recombination events involve unequal sister chromatid exchanges. We also tested three agents, mitomycin C, 12-O-tetradecanoyl-phorbol-13-acetate, and mezerein, that are thought to stimulate recombination to determine whether they affect the reversion from Tk- to Tk+. Only mitomycin C increased the number of Tk+ revertants. Images PMID:6328272
Molecular Analysis and Genomic Organization of Major DNA Satellites in Banana (Musa spp.)

PubMed Central

Čížková, Jana; Hřibová, Eva; Humplíková, Lenka; Christelová, Pavla; Suchánková, Pavla; Doležel, Jaroslav

2013-01-01

Satellite DNA sequences consist of tandemly arranged repetitive units up to thousands nucleotides long in head-to-tail orientation. The evolutionary processes by which satellites arise and evolve include unequal crossing over, gene conversion, transposition and extra chromosomal circular DNA formation. Large blocks of satellite DNA are often observed in heterochromatic regions of chromosomes and are a typical component of centromeric and telomeric regions. Satellite-rich loci may show specific banding patterns and facilitate chromosome identification and analysis of structural chromosome changes. Unlike many other genomes, nuclear genomes of banana (Musa spp.) are poor in satellite DNA and the information on this class of DNA remains limited. The banana cultivars are seed sterile clones originating mostly from natural intra-specific crosses within M. acuminata (A genome) and inter-specific crosses between M. acuminata and M. balbisiana (B genome). Previous studies revealed the closely related nature of the A and B genomes, including similarities in repetitive DNA. In this study we focused on two main banana DNA satellites, which were previously identified in silico. Their genomic organization and molecular diversity was analyzed in a set of nineteen Musa accessions, including representatives of A, B and S (M. schizocarpa) genomes and their inter-specific hybrids. The two DNA satellites showed a high level of sequence conservation within, and a high homology between Musa species. FISH with probes for the satellite DNA sequences, rRNA genes and a single-copy BAC clone 2G17 resulted in characteristic chromosome banding patterns in M. acuminata and M. balbisiana which may aid in determining genomic constitution in interspecific hybrids. In addition to improving the knowledge on Musa satellite DNA, our study increases the number of cytogenetic markers and the number of individual chromosomes, which can be identified in Musa. PMID:23372772
Molecular analysis and genomic organization of major DNA satellites in banana (Musa spp.).

PubMed

Čížková, Jana; Hřibová, Eva; Humplíková, Lenka; Christelová, Pavla; Suchánková, Pavla; Doležel, Jaroslav

2013-01-01

Satellite DNA sequences consist of tandemly arranged repetitive units up to thousands nucleotides long in head-to-tail orientation. The evolutionary processes by which satellites arise and evolve include unequal crossing over, gene conversion, transposition and extra chromosomal circular DNA formation. Large blocks of satellite DNA are often observed in heterochromatic regions of chromosomes and are a typical component of centromeric and telomeric regions. Satellite-rich loci may show specific banding patterns and facilitate chromosome identification and analysis of structural chromosome changes. Unlike many other genomes, nuclear genomes of banana (Musa spp.) are poor in satellite DNA and the information on this class of DNA remains limited. The banana cultivars are seed sterile clones originating mostly from natural intra-specific crosses within M. acuminata (A genome) and inter-specific crosses between M. acuminata and M. balbisiana (B genome). Previous studies revealed the closely related nature of the A and B genomes, including similarities in repetitive DNA. In this study we focused on two main banana DNA satellites, which were previously identified in silico. Their genomic organization and molecular diversity was analyzed in a set of nineteen Musa accessions, including representatives of A, B and S (M. schizocarpa) genomes and their inter-specific hybrids. The two DNA satellites showed a high level of sequence conservation within, and a high homology between Musa species. FISH with probes for the satellite DNA sequences, rRNA genes and a single-copy BAC clone 2G17 resulted in characteristic chromosome banding patterns in M. acuminata and M. balbisiana which may aid in determining genomic constitution in interspecific hybrids. In addition to improving the knowledge on Musa satellite DNA, our study increases the number of cytogenetic markers and the number of individual chromosomes, which can be identified in Musa.
Automatic polymerase chain reaction product detection system for food safety monitoring using zinc finger protein fused to luciferase.

PubMed

Yoshida, Wataru; Kezuka, Aki; Murakami, Yoshiyuki; Lee, Jinhee; Abe, Koichi; Motoki, Hiroaki; Matsuo, Takafumi; Shimura, Nobuaki; Noda, Mamoru; Igimi, Shizunobu; Ikebukuro, Kazunori

2013-11-01

An automatic polymerase chain reaction (PCR) product detection system for food safety monitoring using zinc finger (ZF) protein fused to luciferase was developed. ZF protein fused to luciferase specifically binds to target double stranded DNA sequence and has luciferase enzymatic activity. Therefore, PCR products that comprise ZF protein recognition sequence can be detected by measuring the luciferase activity of the fusion protein. We previously reported that PCR products from Legionella pneumophila and Escherichia coli (E. coli) O157 genomic DNA were detected by Zif268, a natural ZF protein, fused to luciferase. In this study, Zif268-luciferase was applied to detect the presence of Salmonella and coliforms. Moreover, an artificial zinc finger protein (B2) fused to luciferase was constructed for a Norovirus detection system. In the luciferase activity detection assay, several bound/free separation process is required. Therefore, an analyzer that automatically performed the bound/free separation process was developed to detect PCR products using the ZF-luciferase fusion protein. By means of the automatic analyzer with ZF-luciferase fusion protein, target pathogenic genomes were specifically detected in the presence of other pathogenic genomes. Moreover, we succeeded in the detection of 10 copies of E. coli BL21 without extraction of genomic DNA by the automatic analyzer and E. coli was detected with a logarithmic dependency in the range of 1.0×10 to 1.0×10(6) copies. Copyright © 2013 Elsevier B.V. All rights reserved.
Copy number of ArsR reporter plasmid determines its arsenite response and metal specificity.

PubMed

Fang, Yun; Zhu, Chunjie; Chen, Xingjuan; Wang, Yan; Xu, Meiying; Sun, Guoping; Guo, Jun; Yoo, Jinnon; Tie, Cuijuan; Jiang, Xin; Li, Xianqiang

2018-05-16

The key component in bacteria-based biosensors is a transcriptional reporter employed to monitor induction or repression of a reporter gene corresponding to environmental change. In this study, we made a series of reporters in order to achieve highly sensitive detection of arsenite. From these reporters, two biosensors were developed by transformation of Escherichia coli DH5α with pLHPars9 and pLLPars9, consisting of either a high or low copy number plasmid, along with common elements of ArsR-luciferase fusion and addition of two binding sequences, one each from E. coli and Acidithiobacillus ferrooxidans chromosome, in front of the R773 ArsR operon. Both of them were highly sensitive to arsenite, with a low detection limit of 0.04 μM arsenite (~ 5 μg/L). They showed a wide dynamic range of detection up to 50 μM using high copy number pLHPars9 and 100 μM using low copy number pLLPars9. Significantly, they differ in metal specificity, pLLPars9 more specific to arsenite, while pLHPars9 to both arsenite and antimonite. The only difference between pLHPars9 and pLLPars9 is their copy numbers of plasmid and corresponding ratios of ArsR to its binding promoter/operator sequence. Therefore, we propose a working model in which DNA bound-ArsR is different from its free form in metal specificity.
Sequence polymorphisms at the growth hormone GH1/GH2-N and GH2-Z gene copies and their relationship with dairy traits in domestic sheep (Ovis aries).

PubMed

Vacca, G M; Dettori, M L; Balia, F; Luridiana, S; Mura, M C; Carcangiu, V; Pazzola, M

2013-09-01

The purpose was to analyze the growth hormone GH1/GH2-N and GH2-Z gene copies and to assess their possible association with milk traits in Sarda sheep. Two hundred multiparous lactating ewes were monitored. The two gene copies were amplified separately and each was used as template for a nested PCR, to investigate single strand conformation polymorphism (SSCP) of the 5'UTR, exon-1, exon-5 and 3'UTR DNA regions. SSCP analysis revealed marked differences in the number of polymorphic patterns between the two genes. Sequencing revealed five nucleotide changes at the GH1/GH2-N gene. Five nucleotide changes occurred at the GH2-Z gene: one was located in exon-5 (c.556G > A) and resulted in a putative amino acid substitution G186S. All the nucleotide changes were copy-specific, except c.*30delT, which was common to both GH1/GH2-N and GH2-Z. Variability in the promoter regions of each gene might have consequences on the expression level, due to the involvement in potential transcription factor binding sites. Both gene copies influenced milk yield. A correlation with milk protein and casein content was also evidenced. These results may have implications that make them useful for future breeding strategies in dairy sheep breeding.
Intragenomic sequence variation at the ITS1 - ITS2 region and at the 18S and 28S nuclear ribosomal DNA genes of the New Zealand mud snail, Potamopyrgus antipodarum (Hydrobiidae: mollusca)

USGS Publications Warehouse

Hoy, Marshal S.; Rodriguez, Rusty J.

2013-01-01

Molecular genetic analysis was conducted on two populations of the invasive non-native New Zealand mud snail (Potamopyrgus antipodarum), one from a freshwater ecosystem in Devil's Lake (Oregon, USA) and the other from an ecosystem of higher salinity in the Columbia River estuary (Hammond Harbor, Oregon, USA). To elucidate potential genetic differences between the two populations, three segments of nuclear ribosomal DNA (rDNA), the ITS1-ITS2 regions and the 18S and 28S rDNA genes were cloned and sequenced. Variant sequences within each individual were found in all three rDNA segments. Folding models were utilized for secondary structure analysis and results indicated that there were many sequences which contained structure-altering polymorphisms, which suggests they could be nonfunctional pseudogenes. In addition, analysis of molecular variance (AMOVA) was used for hierarchical analysis of genetic variance to estimate variation within and among populations and within individuals. AMOVA revealed significant variation in the ITS region between the populations and among clones within individuals, while in the 5.8S rDNA significant variation was revealed among individuals within the two populations. High levels of intragenomic variation were found in the ITS regions, which are known to be highly variable in many organisms. More interestingly, intragenomic variation was also found in the 18S and 28S rDNA, which has rarely been observed in animals and is so far unreported in Mollusca. We postulate that in these P. antipodarum populations the effects of concerted evolution are diminished due to the fact that not all of the rDNA genes in their polyploid genome should be essential for sustaining cellular function. This could lead to a lessening of selection pressures, allowing mutations to accumulate in some copies, changing them into variant sequences.
Appraising the relevance of DNA copy number loss and gain in prostate cancer using whole genome DNA sequence data

PubMed Central

Van Loo, Peter; Kay, Jonathan D.; Matthews, Lucy; Haase, Kerstin; Clark, Jeremy; Thomas, Sarah; Butler, Adam P.; Gundem, Gunes; Merson, Sue; Luxton, Hayley; Hawkins, Steve; Ghori, Mohammed; Marsden, Luke; Lambert, Adam; Pelvender, Gill; Massie, Charlie E.; Hazell, Steven; Livni, Naomi; Fisher, Cyril; Ogden, Christopher; Kumar, Pardeep; Thompson, Alan; Nicol, David; Yu, Yongwei; Zhang, Hongwei; Isaacs, William; Visakorpi, Tapio; Verrill, Clare; Lynch, Andrew G.; Lu, Yong Jie; Whitaker, Hayley C.; Neal, David E.; Cooper, Colin S.

2017-01-01

A variety of models have been proposed to explain regions of recurrent somatic copy number alteration (SCNA) in human cancer. Our study employs Whole Genome DNA Sequence (WGS) data from tumor samples (n = 103) to comprehensively assess the role of the Knudson two hit genetic model in SCNA generation in prostate cancer. 64 recurrent regions of loss and gain were detected, of which 28 were novel, including regions of loss with more than 15% frequency at Chr4p15.2-p15.1 (15.53%), Chr6q27 (16.50%) and Chr18q12.3 (17.48%). Comprehensive mutation screens of genes, lincRNA encoding sequences, control regions and conserved domains within SCNAs demonstrated that a two-hit genetic model was supported in only a minor proportion of recurrent SCNA losses examined (15/40). We found that recurrent breakpoints and regions of inversion often occur within Knudson model SCNAs, leading to the identification of ZNF292 as a target gene for the deletion at 6q14.3-q15 and NKX3.1 as a two-hit target at 8p21.3-p21.2. The importance of alterations of lincRNA sequences was illustrated by the identification of a novel mutational hotspot at the KCCAT42, FENDRR, CAT1886 and STCAT2 loci at the 16q23.1-q24.3 loss. Our data confirm that the burden of SCNAs is predictive of biochemical recurrence, define nine individual regions that are associated with relapse, and highlight the possible importance of ion channel and G-protein coupled-receptor (GPCR) pathways in cancer development. We concluded that a two-hit genetic model accounts for about one third of SCNA indicating that mechanisms, such haploinsufficiency and epigenetic inactivation, account for the remaining SCNA losses. PMID:28945760
Appraising the relevance of DNA copy number loss and gain in prostate cancer using whole genome DNA sequence data.

PubMed

Camacho, Niedzica; Van Loo, Peter; Edwards, Sandra; Kay, Jonathan D; Matthews, Lucy; Haase, Kerstin; Clark, Jeremy; Dennis, Nening; Thomas, Sarah; Kremeyer, Barbara; Zamora, Jorge; Butler, Adam P; Gundem, Gunes; Merson, Sue; Luxton, Hayley; Hawkins, Steve; Ghori, Mohammed; Marsden, Luke; Lambert, Adam; Karaszi, Katalin; Pelvender, Gill; Massie, Charlie E; Kote-Jarai, Zsofia; Raine, Keiran; Jones, David; Howat, William J; Hazell, Steven; Livni, Naomi; Fisher, Cyril; Ogden, Christopher; Kumar, Pardeep; Thompson, Alan; Nicol, David; Mayer, Erik; Dudderidge, Tim; Yu, Yongwei; Zhang, Hongwei; Shah, Nimish C; Gnanapragasam, Vincent J; Isaacs, William; Visakorpi, Tapio; Hamdy, Freddie; Berney, Dan; Verrill, Clare; Warren, Anne Y; Wedge, David C; Lynch, Andrew G; Foster, Christopher S; Lu, Yong Jie; Bova, G Steven; Whitaker, Hayley C; McDermott, Ultan; Neal, David E; Eeles, Rosalind; Cooper, Colin S; Brewer, Daniel S

2017-09-01

A variety of models have been proposed to explain regions of recurrent somatic copy number alteration (SCNA) in human cancer. Our study employs Whole Genome DNA Sequence (WGS) data from tumor samples (n = 103) to comprehensively assess the role of the Knudson two hit genetic model in SCNA generation in prostate cancer. 64 recurrent regions of loss and gain were detected, of which 28 were novel, including regions of loss with more than 15% frequency at Chr4p15.2-p15.1 (15.53%), Chr6q27 (16.50%) and Chr18q12.3 (17.48%). Comprehensive mutation screens of genes, lincRNA encoding sequences, control regions and conserved domains within SCNAs demonstrated that a two-hit genetic model was supported in only a minor proportion of recurrent SCNA losses examined (15/40). We found that recurrent breakpoints and regions of inversion often occur within Knudson model SCNAs, leading to the identification of ZNF292 as a target gene for the deletion at 6q14.3-q15 and NKX3.1 as a two-hit target at 8p21.3-p21.2. The importance of alterations of lincRNA sequences was illustrated by the identification of a novel mutational hotspot at the KCCAT42, FENDRR, CAT1886 and STCAT2 loci at the 16q23.1-q24.3 loss. Our data confirm that the burden of SCNAs is predictive of biochemical recurrence, define nine individual regions that are associated with relapse, and highlight the possible importance of ion channel and G-protein coupled-receptor (GPCR) pathways in cancer development. We concluded that a two-hit genetic model accounts for about one third of SCNA indicating that mechanisms, such haploinsufficiency and epigenetic inactivation, account for the remaining SCNA losses.
Recombination and evolution of duplicate control regions in the mitochondrial genome of the Asian big-headed turtle, Platysternon megacephalum.

PubMed

Zheng, Chenfei; Nie, Liuwang; Wang, Jue; Zhou, Huaxing; Hou, Huazhen; Wang, Hao; Liu, Juanjuan

2013-01-01

Complete mitochondrial (mt) genome sequences with duplicate control regions (CRs) have been detected in various animal species. In Testudines, duplicate mtCRs have been reported in the mtDNA of the Asian big-headed turtle, Platysternon megacephalum, which has three living subspecies. However, the evolutionary pattern of these CRs remains unclear. In this study, we report the completed sequences of duplicate CRs from 20 individuals belonging to three subspecies of this turtle and discuss the micro-evolutionary analysis of the evolution of duplicate CRs. Genetic distances calculated with MEGA 4.1 using the complete duplicate CR sequences revealed that within turtle subspecies, genetic distances between orthologous copies from different individuals were 0.63% for CR1 and 1.2% for CR2app:addword:respectively, and the average distance between paralogous copies of CR1 and CR2 was 4.8%. Phylogenetic relationships were reconstructed from the CR sequences, excluding the variable number of tandem repeats (VNTRs) at the 3' end using three methods: neighbor-joining, maximum likelihood algorithm, and Bayesian inference. These data show that any two CRs within individuals were more genetically distant from orthologous genes in different individuals within the same subspecies. This suggests independent evolution of the two mtCRs within each P. megacephalum subspecies. Reconstruction of separate phylogenetic trees using different CR components (TAS, CD, CSB, and VNTRs) suggested the role of recombination in the evolution of duplicate CRs. Consequently, recombination events were detected using RDP software with break points at ≈290 bp and ≈1,080 bp. Based on these results, we hypothesize that duplicate CRs in P. megacephalum originated from heterological ancestral recombination of mtDNA. Subsequent recombination could have resulted in homogenization during independent evolutionary events, thus maintaining the functions of duplicate CRs in the mtDNA of P. megacephalum.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.