Sample records for dna sequences recognized

  1. Crystal structure of MboIIA methyltransferase.

    PubMed

    Osipiuk, Jerzy; Walsh, Martin A; Joachimiak, Andrzej

    2003-09-15

    DNA methyltransferases (MTases) are sequence-specific enzymes which transfer a methyl group from S-adenosyl-L-methionine (AdoMet) to the amino group of either cytosine or adenine within a recognized DNA sequence. Methylation of a base in a specific DNA sequence protects DNA from nucleolytic cleavage by restriction enzymes recognizing the same DNA sequence. We have determined at 1.74 A resolution the crystal structure of a beta-class DNA MTase MboIIA (M.MboIIA) from the bacterium Moraxella bovis, the smallest DNA MTase determined to date. M.MboIIA methylates the 3' adenine of the pentanucleotide sequence 5'-GAAGA-3'. The protein crystallizes with two molecules in the asymmetric unit which we propose to resemble the dimer when M.MboIIA is not bound to DNA. The overall structure of the enzyme closely resembles that of M.RsrI. However, the cofactor-binding pocket in M.MboIIA forms a closed structure which is in contrast to the open-form structures of other known MTases.

  2. Crystal structure of MboIIA methyltransferase.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Osipiuk, J.; Walsh, M. A.; Joachimiak, A.

    2003-09-15

    DNA methyltransferases (MTases) are sequence-specific enzymes which transfer a methyl group from S-adenosyl-L-methionine (AdoMet) to the amino group of either cytosine or adenine within a recognized DNA sequence. Methylation of a base in a specific DNA sequence protects DNA from nucleolytic cleavage by restriction enzymes recognizing the same DNA sequence. We have determined at 1.74 {angstrom} resolution the crystal structure of a {beta}-class DNA MTase MboIIA (M {center_dot} MboIIA) from the bacterium Moraxella bovis, the smallest DNA MTase determined to date. M {center_dot} MboIIA methylates the 3' adenine of the pentanucleotide sequence 5'-GAAGA-3'. The protein crystallizes with two molecules inmore » the asymmetric unit which we propose to resemble the dimer when M {center_dot} MboIIA is not bound to DNA. The overall structure of the enzyme closely resembles that of M {center_dot} RsrI. However, the cofactor-binding pocket in M {center_dot} MboIIA forms a closed structure which is in contrast to the open-form structures of other known MTases.« less

  3. Antibody recognition of melphalan adducts characterized using immobilized DNA: enhanced alkylation of G-Rich regions in cells compared to in vitro.

    PubMed

    McCartney, H; Martin, A M; Middleton, P G; Tilby, M J

    2001-01-01

    The bifunctional alkylating agent, melphalan, forms adducts on DNA that are recognized by two previously described monoclonal antibodies, MP5/73 and Amp4/42. Immunoreactivity to MP5/73 was lost when alkylated DNA was exposed to alkaline pH, while Amp4/42 only recognized the structures formed after the alkali treatment. Competitive enzyme-linked immunoadsorbent assays (ELISAs) indicated that in 0.01 and 0.1 M NaOH, loss of immunoreactivity to MP5/73 occurred with half-lives that were at least 2-fold longer than half-lives for gain of immunoreactivity to Amp4/42. This supported previously published evidence that Amp4/42 did not simply recognize all the products formed by alkali treatment of adducts that were initially recognized by MP5/73. Adducts recognized by MP5/73 on RNA were considerably more stable at 100 degrees C and pH 7 than adducts on DNA. This was consistent with the hypothesis that immunorecognition involved N7 guanine adducts and ruled out the involvement of phosphotriesters in immunoreactivity. Synthetic oligodeoxyribonucleotides, covalently immobilized onto 96-well plates, were reacted with melphalan and incubated for various periods with alkali, and then the levels of adducts recognized by each antibody in replicate wells were assayed by a direct binding ELISA. Adducts formed on oligodeoxyguanylic acid were recognized very weakly by Amp4/42, unlike other DNA sequences that were tested. Retention of immobilized DNA during alkali treatment was confirmed by immunoassay of cisplatin adducts. Poor recognition by Amp4/42 of adducts in oligodeoxyguanylic acid was confirmed by a competitive ELISA. Amp4/42, unlike MP5/73, efficiently recognized adducts resulting from alkylation of DNA with chlorambucil. It is concluded that the two antibodies recognized melphalan adducts in different DNA sequence environments and that this explains (a) the different alkali stability of immunoreactive adducts and (b) previous results which showed that, in DNA from melphalan-treated cells, adducts recognized by Amp4/42 formed a smaller proportion of total adducts compared to DNA alkylated in vitro. The results presented here indicate that this was caused by a marked cellular influence on the overall sequence-dependent pattern of DNA alkylation or repair.

  4. Functional specificity of a Hox protein mediated by the recognition of minor groove structure.

    PubMed

    Joshi, Rohit; Passner, Jonathan M; Rohs, Remo; Jain, Rinku; Sosinsky, Alona; Crickmore, Michael A; Jacob, Vinitha; Aggarwal, Aneel K; Honig, Barry; Mann, Richard S

    2007-11-02

    The recognition of specific DNA-binding sites by transcription factors is a critical yet poorly understood step in the control of gene expression. Members of the Hox family of transcription factors bind DNA by making nearly identical major groove contacts via the recognition helices of their homeodomains. In vivo specificity, however, often depends on extended and unstructured regions that link Hox homeodomains to a DNA-bound cofactor, Extradenticle (Exd). Using a combination of structure determination, computational analysis, and in vitro and in vivo assays, we show that Hox proteins recognize specific Hox-Exd binding sites via residues located in these extended regions that insert into the minor groove but only when presented with the correct DNA sequence. Our results suggest that these residues, which are conserved in a paralog-specific manner, confer specificity by recognizing a sequence-dependent DNA structure instead of directly reading a specific DNA sequence.

  5. Recognition of the DNA sequence by an inorganic crystal surface

    PubMed Central

    Sampaolese, Beatrice; Bergia, Anna; Scipioni, Anita; Zuccheri, Giampaolo; Savino, Maria; Samorì, Bruno; De Santis, Pasquale

    2002-01-01

    The sequence-dependent curvature is generally recognized as an important and biologically relevant property of DNA because it is involved in the formation and stability of association complexes with proteins. When a DNA tract, intrinsically curved for the periodical recurrence on the same strand of A-tracts phased with the B-DNA periodicity, is deposited on a flat surface, it exposes to that surface either a T- or an A-rich face. The surface of a freshly cleaved mica crystal recognizes those two faces and preferentially interacts with the former one. Statistical analysis of scanning force microscopy (SFM) images provides evidence of this recognition between an inorganic crystal surface and nanoscale structures of double-stranded DNA. This finding could open the way toward the use of the sequence-dependent adhesion to specific crystal faces for nanotechnological purposes. PMID:12361979

  6. Herpes simplex virus DNA packaging sequences adopt novel structures that are specifically recognized by a component of the cleavage and packaging machinery.

    PubMed

    Adelman, K; Salmon, B; Baines, J D

    2001-03-13

    The product of the herpes simplex virus type 1 U(L)28 gene is essential for cleavage of concatemeric viral DNA into genome-length units and packaging of this DNA into viral procapsids. To address the role of U(L)28 in this process, purified U(L)28 protein was assayed for the ability to recognize conserved herpesvirus DNA packaging sequences. We report that DNA fragments containing the pac1 DNA packaging motif can be induced by heat treatment to adopt novel DNA conformations that migrate faster than the corresponding duplex in nondenaturing gels. Surprisingly, these novel DNA structures are high-affinity substrates for U(L)28 protein binding, whereas double-stranded DNA of identical sequence composition is not recognized by U(L)28 protein. We demonstrate that only one strand of the pac1 motif is responsible for the formation of novel DNA structures that are bound tightly and specifically by U(L)28 protein. To determine the relevance of the observed U(L)28 protein-pac1 interaction to the cleavage and packaging process, we have analyzed the binding affinity of U(L)28 protein for pac1 mutants previously shown to be deficient in cleavage and packaging in vivo. Each of the pac1 mutants exhibited a decrease in DNA binding by U(L)28 protein that correlated directly with the reported reduction in cleavage and packaging efficiency, thereby supporting a role for the U(L)28 protein-pac1 interaction in vivo. These data therefore suggest that the formation of novel DNA structures by the pac1 motif confers added specificity on recognition of DNA packaging sequences by the U(L)28-encoded component of the herpesvirus cleavage and packaging machinery.

  7. Enantiospecific recognition of DNA sequences by a proflavine Tröger base.

    PubMed

    Bailly, C; Laine, W; Demeunynck, M; Lhomme, J

    2000-07-05

    The DNA interaction of a chiral Tröger base derived from proflavine was investigated by DNA melting temperature measurements and complementary biochemical assays. DNase I footprinting experiments demonstrate that the binding of the proflavine-based Tröger base is both enantio- and sequence-specific. The (+)-isomer poorly interacts with DNA in a non-sequence-selective fashion. In sharp contrast, the corresponding (-)-isomer recognizes preferentially certain DNA sequences containing both A. T and G. C base pairs, such as the motifs 5'-GTT. AAC and 5'-ATGA. TCAT. This is the first experimental demonstration that acridine-type Tröger bases can be used for enantiospecific recognition of DNA sequences. Copyright 2000 Academic Press.

  8. Novel division level bacterial diversity in a Yellowstone hot spring.

    PubMed

    Hugenholtz, P; Pitulle, C; Hershberger, K L; Pace, N R

    1998-01-01

    A culture-independent molecular phylogenetic survey was carried out for the bacterial community in Obsidian Pool (OP), a Yellowstone National Park hot spring previously shown to contain remarkable archaeal diversity (S. M. Barns, R. E. Fundyga, M. W. Jeffries, and N. R. Page, Proc. Natl. Acad. Sci. USA 91:1609-1613, 1994). Small-subunit rRNA genes (rDNA) were amplified directly from OP sediment DNA by PCR with universally conserved or Bacteria-specific rDNA primers and cloned. Unique rDNA types among > 300 clones were identified by restriction fragment length polymorphism, and 122 representative rDNA sequences were determined. These were found to represent 54 distinct bacterial sequence types or clusters (> or = 98% identity) of sequences. A majority (70%) of the sequence types were affiliated with 14 previously recognized bacterial divisions (main phyla; kingdoms); 30% were unaffiliated with recognized bacterial divisions. The unaffiliated sequence types (represented by 38 sequences) nominally comprise 12 novel, division level lineages termed candidate divisions. Several OP sequences were nearly identical to those of cultivated chemolithotrophic thermophiles, including the hydrogen-oxidizing Calderobacterium and the sulfate reducers Thermodesulfovibrio and Thermodesulfobacterium, or belonged to monophyletic assemblages recognized for a particular type of metabolism, such as the hydrogen-oxidizing Aquificales and the sulfate-reducing delta-Proteobacteria. The occurrence of such organisms is consistent with the chemical composition of OP (high in reduced iron and sulfur) and suggests a lithotrophic base for primary productivity in this hot spring, through hydrogen oxidation and sulfate reduction. Unexpectedly, no archaeal sequences were encountered in OP clone libraries made with universal primers. Hybridization analysis of amplified OP DNA with domain-specific probes confirmed that the analyzed community rDNA from OP sediment was predominantly bacterial. These results expand substantially our knowledge of the extent of bacterial diversity and call into question the commonly held notion that Archaea dominate hydrothermal environments. Finally, the currently known extent of division level bacterial phylogenetic diversity is collated and summarized.

  9. Gene Identification Algorithms Using Exploratory Statistical Analysis of Periodicity

    NASA Astrophysics Data System (ADS)

    Mukherjee, Shashi Bajaj; Sen, Pradip Kumar

    2010-10-01

    Studying periodic pattern is expected as a standard line of attack for recognizing DNA sequence in identification of gene and similar problems. But peculiarly very little significant work is done in this direction. This paper studies statistical properties of DNA sequences of complete genome using a new technique. A DNA sequence is converted to a numeric sequence using various types of mappings and standard Fourier technique is applied to study the periodicity. Distinct statistical behaviour of periodicity parameters is found in coding and non-coding sequences, which can be used to distinguish between these parts. Here DNA sequences of Drosophila melanogaster were analyzed with significant accuracy.

  10. Specific minor groove solvation is a crucial determinant of DNA binding site recognition

    PubMed Central

    Harris, Lydia-Ann; Williams, Loren Dean; Koudelka, Gerald B.

    2014-01-01

    The DNA sequence preferences of nearly all sequence specific DNA binding proteins are influenced by the identities of bases that are not directly contacted by protein. Discrimination between non-contacted base sequences is commonly based on the differential abilities of DNA sequences to allow narrowing of the DNA minor groove. However, the factors that govern the propensity of minor groove narrowing are not completely understood. Here we show that the differential abilities of various DNA sequences to support formation of a highly ordered and stable minor groove solvation network are a key determinant of non-contacted base recognition by a sequence-specific binding protein. In addition, disrupting the solvent network in the non-contacted region of the binding site alters the protein's ability to recognize contacted base sequences at positions 5–6 bases away. This observation suggests that DNA solvent interactions link contacted and non-contacted base recognition by the protein. PMID:25429976

  11. Mutations altering the cleavage specificity of a homing endonuclease

    PubMed Central

    Seligman, Lenny M.; Chisholm, Karen M.; Chevalier, Brett S.; Chadsey, Meggen S.; Edwards, Samuel T.; Savage, Jeremiah H.; Veillet, Adeline L.

    2002-01-01

    The homing endonuclease I-CreI recognizes and cleaves a particular 22 bp DNA sequence. The crystal structure of I-CreI bound to homing site DNA has previously been determined, leading to a number of predictions about specific protein–DNA contacts. We test these predictions by analyzing a set of endonuclease mutants and a complementary set of homing site mutants. We find evidence that all structurally predicted I-CreI/DNA contacts contribute to DNA recognition and show that these contacts differ greatly in terms of their relative importance. We also describe the isolation of a collection of altered specificity I-CreI derivatives. The in vitro DNA-binding and cleavage properties of two such endonucleases demonstrate that our genetic approach is effective in identifying homing endonucleases that recognize and cleave novel target sequences. PMID:12202772

  12. GENESUS: a two-step sequence design program for DNA nanostructure self-assembly.

    PubMed

    Tsutsumi, Takanobu; Asakawa, Takeshi; Kanegami, Akemi; Okada, Takao; Tahira, Tomoko; Hayashi, Kenshi

    2014-01-01

    DNA has been recognized as an ideal material for bottom-up construction of nanometer scale structures by self-assembly. The generation of sequences optimized for unique self-assembly (GENESUS) program reported here is a straightforward method for generating sets of strand sequences optimized for self-assembly of arbitrarily designed DNA nanostructures by a generate-candidates-and-choose-the-best strategy. A scalable procedure to prepare single-stranded DNA having arbitrary sequences is also presented. Strands for the assembly of various structures were designed and successfully constructed, validating both the program and the procedure.

  13. Two DNA-binding factors recognize specific sequences at silencers, upstream activating sequences, autonomously replicating sequences, and telomeres in Saccharomyces cerevisiae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Buchman, A.R.; Kimmerly, W.J.; Rine, J.

    1988-01-01

    Two DNA-binding factors from Saccharomyces cerevisiae have been characterized, GRFI (general regulatory factor I) and ABFI (ARS-binding factor I), that recognize specific sequences within diverse genetic elements. GRFI bound to sequences at the negative regulatory elements (silencers) of the silent mating type loci HML E and HMR E and to the upstream activating sequence (UAS) required for transcription of the MAT ..cap alpha.. genes. A putative conserved UAS located at genes involved in translation (RPG box) was also recognized by GRFI. In addition, GRFI bound with high affinity to sequences within the (C/sub 1-3/A)-repeat region at yeast telomeres. Binding sitesmore » for GRFI with the highest affinity appeared to be of the form 5'-(A/G)(A/C)ACCCAN NCA(T/C)(T/C)-3', where N is any nucleotide. ABFI-binding sites were located next to autonomously replicating sequences (ARSs) at controlling elements of the silent mating type loci HMR E, HMR I, and HML I and were associated with ARS1, ARS2, and the 2..mu..m plasmid ARS. Two tandem ABFI binding sites were found between the HIS3 and DED1 genes, several kilobase pairs from any ARS, indicating that ABFI-binding sites are not restricted to ARSs. The sequences recognized by AFBI showed partial dyad-symmetry and appeared to be variations of the consensus 5'-TATCATTNNNNACGA-3'. GRFI and ABFI were both abundant DNA-binding factors and did not appear to be encoded by the SIR genes, whose product are required for repression of the silent mating type loci. Together, these results indicate that both GRFI and ABFI play multiple roles within the cell.« less

  14. A rapid, generally applicable method to engineer zinc fingers illustrated by targeting the HIV-1 promoter.

    PubMed

    Isalan, M; Klug, A; Choo, Y

    2001-07-01

    DNA-binding domains with predetermined sequence specificity are engineered by selection of zinc finger modules using phage display, allowing the construction of customized transcription factors. Despite remarkable progress in this field, the available protein-engineering methods are deficient in many respects, thus hampering the applicability of the technique. Here we present a rapid and convenient method that can be used to design zinc finger proteins against a variety of DNA-binding sites. This is based on a pair of pre-made zinc finger phage-display libraries, which are used in parallel to select two DNA-binding domains each of which recognizes given 5 base pair sequences, and whose products are recombined to produce a single protein that recognizes a composite (9 base pair) site of predefined sequence. Engineering using this system can be completed in less than two weeks and yields proteins that bind sequence-specifically to DNA with Kd values in the nanomolar range. To illustrate the technique, we have selected seven different proteins to bind various regions of the human immunodeficiency virus 1 (HIV-1) promoter.

  15. Two-Way Gold Nanoparticle Label-Free Sensing of Specific Sequence and Small Molecule Targets Using Switchable Concatemers.

    PubMed

    Zhu, Longjiao; Shao, Xiangli; Luo, Yunbo; Huang, Kunlung; Xu, Wentao

    2017-05-19

    A two-way colorimetric biosensor based on unmodified gold nanoparticles (GNPs) and a switchable double-stranded DNA (dsDNA) concatemer have been demonstrated. Two hairpin probes (H1 and H2) were first designed that provided the fuels to assemble the dsDNA concatemers via hybridization chain reaction (HCR). A functional hairpin (FH) was rationally designed to recognize the target sequences. All the hairpins contained a single-stranded DNA (ssDNA) loop and sticky end to prevent GNPs from salt-induced aggregation. In the presence of target sequence, the capture probe blocked in the FH recognizes the target to form a duplex DNA, which causes the release of the initiator probe by FH conformational change. This process then starts the alternate-opening of H1 and H2 through HCR, and dsDNA concatemers grow from the target sequence. As a result, unmodified GNPs undergo salt-induced aggregation because the formed dsDNA concatemers are stiffer and provide less stabilization. A light purple-to-blue color variation was observed in the bulk solution, termed the light-off sensing way. Furthermore, H1 ingeniously inserted an aptamer sequence to generate dsDNA concatemers with multiple small molecule binding sites. In the presence of small molecule targets, concatemers can be disassembled into mixtures with ssDNA sticky ends. A blue-to-purple reverse color variation was observed due to the regeneration of the ssDNA, termed the light-on way. The two-way biosensor can detect both nucleic acids and small molecule targets with one sensing device. This switchable sensing element is label-free, enzyme-free, and sophisticated-instrumentation-free. The detection limits of both targets were below nanomolar.

  16. Identification of GATC- and CCGG- recognizing Type II REases and their putative specificity-determining positions using Scan2S—a novel motif scan algorithm with optional secondary structure constraints

    PubMed Central

    Niv, Masha Y.; Skrabanek, Lucy; Roberts, Richard J.; Scheraga, Harold A.; Weinstein, Harel

    2008-01-01

    Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering. PMID:17972284

  17. Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints.

    PubMed

    Niv, Masha Y; Skrabanek, Lucy; Roberts, Richard J; Scheraga, Harold A; Weinstein, Harel

    2008-05-01

    Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering.

  18. A simple procedure for parallel sequence analysis of both strands of 5'-labeled DNA.

    PubMed

    Razvi, F; Gargiulo, G; Worcel, A

    1983-08-01

    Ligation of a 5'-labeled DNA restriction fragment results in a circular DNA molecule carrying the two 32Ps at the reformed restriction site. Double digestions of the circular DNA with the original enzyme and a second restriction enzyme cleavage near the labeled site allows direct chemical sequencing of one 5'-labeled DNA strand. Similar double digestions, using an isoschizomer that cleaves differently at the 32P-labeled site, allows direct sequencing of the now 3'-labeled complementary DNA strand. It is possible to directly sequence both strands of cloned DNA inserts by using the above protocol and a multiple cloning site vector that provides the necessary restriction sites. The simultaneous and parallel visualization of both DNA strands eliminates sequence ambiguities. In addition, the labeled circular molecules are particularly useful for single-hit DNA cleavage studies and DNA footprint analysis. As an example, we show here an analysis of the micrococcal nuclease-induced breaks on the two strands of the somatic 5S RNA gene of Xenopus borealis, which suggests that the enzyme may recognize and cleave small AT-containing palindromes along the DNA helix.

  19. Formation of (DNA)2-LNA triplet with recombinant base recognition: A quantum mechanical study

    NASA Astrophysics Data System (ADS)

    Mall, Vijaya Shri; Tiwari, Rakesh Kumar

    2018-05-01

    The formation of DNA triple helix offers the verity of new possibilities in molecular biology. However its applications are limited to purine and pyrimidine rich sequences recognized by forming Hoogsteen/Reverse Hoogsteen triplets in major groove sites of DNA duplex. To overcome this drawback modification in bases backbone and glucose of nucleotide unit of DNA have been proposed so that the third strand base recognized by both the bases of DNA duplex by forming Recombinant type(R-type) of bonding in mixed sequences. Here we performed Quanrum Mechanical (Hartree-Fock and DFT) methodology on natural DNA and Locked Nucleic Acids(LNA) triplets using 6-31G and some other new advance basis sets. Study suggests energetically stable conformation has been observed for recombinant triplets in order of G-C*G > A-T*A > G-C*C > T-A*T for both type of triplets. Interestingly LNA leads to more stable conformation in all set of triplets, clearly suggests an important biological tool to overcome above mentioned drawbacks.

  20. A knowledge engineering approach to recognizing and extracting sequences of nucleic acids from scientific literature.

    PubMed

    García-Remesal, Miguel; Maojo, Victor; Crespo, José

    2010-01-01

    In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. The latter are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. For such set, we achieved 87.76% precision and 97.70% recall respectively. This method can facilitate different research tasks. These include text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences.

  1. Influence of quasi-specific sites on kinetics of target DNA search by a sequence-specific DNA-binding protein.

    PubMed

    Kemme, Catherine A; Esadze, Alexandre; Iwahara, Junji

    2015-11-10

    Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such "quasi-specific" sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1's association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins.

  2. Influence of Quasi-Specific Sites on Kinetics of Target DNA Search by a Sequence-Specific DNA-Binding Protein

    PubMed Central

    2015-01-01

    Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such “quasi-specific” sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1’s association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins. PMID:26502071

  3. DNA Barcodes for Forensically Important Fly Species in Brazil.

    PubMed

    Koroiva, Ricardo; de Souza, Mirian S; Roque, Fabio de Oliveira; Pepinelli, Mateus

    2018-04-07

    Here, we analyze 248 DNA barcode sequences of 35 fly species of forensic importance in Brazil. DNA barcoding can be effectively used for specimen identification of these species, allowing the unambiguous identification of 31 species, an overall success rate of 88%. Our results show a high rate of success for molecular identification using DNA barcoding sequences and open new perspectives for immature species identification, a subject on which limited forensic investigations exist in Tropical regions. We also address the implications of building a robust forensic DNA barcode database. A geographic bias is recognized for the COI dataset available for forensically important fly species in Brazil, with concentration of sequences from specimens collected mainly in sites located in the Cerrado, Mata Atlântica, and Pampa biomes.

  4. RNA-dependent RNA targeting by CRISPR-Cas9

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Strutt, Steven C.; Torrez, Rachel M.; Kaya, Emine

    Double-stranded DNA (dsDNA) binding and cleavage by Cas9 is a hallmark of type II CRISPR-Cas bacterial adaptive immunity. All known Cas9 enzymes are thought to recognize DNA exclusively as a natural substrate, providing protection against DNA phage and plasmids. Here, we show that Cas9 enzymes from both subtypes II-A and II-C can recognize and cleave single-stranded RNA (ssRNA) by an RNA-guided mechanism that is independent of a protospacer-adjacent motif (PAM) sequence in the target RNA. RNA-guided RNA cleavage is programmable and site-specific, and we find that this activity can be exploited to reduce infection by single-stranded RNA phage in vivo.more » We also demonstrate that Cas9 can direct PAM-independent repression of gene expression in bacteria. In conclusion, these results indicate that a subset of Cas9 enzymes have the ability to act on both DNA and RNA target sequences, and suggest the potential for use in programmable RNA targeting applications.« less

  5. RNA-dependent RNA targeting by CRISPR-Cas9

    DOE PAGES

    Strutt, Steven C.; Torrez, Rachel M.; Kaya, Emine; ...

    2018-01-05

    Double-stranded DNA (dsDNA) binding and cleavage by Cas9 is a hallmark of type II CRISPR-Cas bacterial adaptive immunity. All known Cas9 enzymes are thought to recognize DNA exclusively as a natural substrate, providing protection against DNA phage and plasmids. Here, we show that Cas9 enzymes from both subtypes II-A and II-C can recognize and cleave single-stranded RNA (ssRNA) by an RNA-guided mechanism that is independent of a protospacer-adjacent motif (PAM) sequence in the target RNA. RNA-guided RNA cleavage is programmable and site-specific, and we find that this activity can be exploited to reduce infection by single-stranded RNA phage in vivo.more » We also demonstrate that Cas9 can direct PAM-independent repression of gene expression in bacteria. In conclusion, these results indicate that a subset of Cas9 enzymes have the ability to act on both DNA and RNA target sequences, and suggest the potential for use in programmable RNA targeting applications.« less

  6. RNA-dependent RNA targeting by CRISPR-Cas9

    PubMed Central

    Strutt, Steven C; Torrez, Rachel M; Kaya, Emine; Negrete, Oscar A

    2018-01-01

    Double-stranded DNA (dsDNA) binding and cleavage by Cas9 is a hallmark of type II CRISPR-Cas bacterial adaptive immunity. All known Cas9 enzymes are thought to recognize DNA exclusively as a natural substrate, providing protection against DNA phage and plasmids. Here, we show that Cas9 enzymes from both subtypes II-A and II-C can recognize and cleave single-stranded RNA (ssRNA) by an RNA-guided mechanism that is independent of a protospacer-adjacent motif (PAM) sequence in the target RNA. RNA-guided RNA cleavage is programmable and site-specific, and we find that this activity can be exploited to reduce infection by single-stranded RNA phage in vivo. We also demonstrate that Cas9 can direct PAM-independent repression of gene expression in bacteria. These results indicate that a subset of Cas9 enzymes have the ability to act on both DNA and RNA target sequences, and suggest the potential for use in programmable RNA targeting applications. PMID:29303478

  7. Architecture of a Fur Binding Site: a Comparative Analysis

    PubMed Central

    Lavrrar, Jennifer L.; McIntosh, Mark A.

    2003-01-01

    Fur is an iron-binding transcriptional repressor that recognizes a 19-bp consensus site of the sequence 5′-GATAATGATAATCATTATC-3′. This site can be defined as three adjacent hexamers of the sequence 5′-GATAAT-3′, with the third being slightly imperfect (an F-F-F configuration), or as two hexamers in the forward orientation separated by one base pair from a third hexamer in the reverse orientation (an F-F-x-R configuration). Although Fur can bind synthetic DNA sequences containing the F-F-F arrangement, most natural binding sites are variations of the F-F-x-R arrangement. The studies presented here compared the ability of Fur to recognize synthetic DNA sequences containing two to four adjacent hexamers with binding to sequences containing variations of the F-F-x-R arrangement (including natural operator sequences from the entS and fepB promoter regions of Escherichia coli). Gel retardation assays showed that the F-F-x-R architecture was necessary for high-affinity Fur-DNA interactions and that contiguous hexamers were not recognized as effectively. In addition, the stoichiometry of Fur at each binding site was determined, showing that Fur interacted with its minimal 19-bp binding site as two overlapping dimers. These data confirm the proposed overlapping-dimer binding model, where the unit of interaction with a single Fur dimer is two inverted hexamers separated by a C:G base pair, with two overlapping units comprising the 19-bp consensus binding site required for the high-affinity interaction with two Fur dimers. PMID:12644489

  8. A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

    PubMed Central

    Walker, M D; Park, C W; Rosen, A; Aronheim, A

    1990-01-01

    Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401

  9. Genetic dissection of the consensus sequence for the class 2 and class 3 flagellar promoters

    PubMed Central

    Wozniak, Christopher E.; Hughes, Kelly T.

    2008-01-01

    Summary Computational searches for DNA binding sites often utilize consensus sequences. These search models make assumptions that the frequency of a base pair in an alignment relates to the base pair’s importance in binding and presume that base pairs contribute independently to the overall interaction with the DNA binding protein. These two assumptions have generally been found to be accurate for DNA binding sites. However, these assumptions are often not satisfied for promoters, which are involved in additional steps in transcription initiation after RNA polymerase has bound to the DNA. To test these assumptions for the flagellar regulatory hierarchy, class 2 and class 3 flagellar promoters were randomly mutagenized in Salmonella. Important positions were then saturated for mutagenesis and compared to scores calculated from the consensus sequence. Double mutants were constructed to determine how mutations combined for each promoter type. Mutations in the binding site for FlhD4C2, the activator of class 2 promoters, better satisfied the assumptions for the binding model than did mutations in the class 3 promoter, which is recognized by the σ28 transcription factor. These in vivo results indicate that the activator sites within flagellar promoters can be modeled using simple assumptions but that the DNA sequences recognized by the flagellar sigma factor require more complex models. PMID:18486950

  10. Modification-dependent restriction endonuclease, MspJI, flips 5-methylcytosine out of the DNA helix

    DOE PAGES

    Horton, J. R.; Wang, H.; Mabuchi, M. Y.; ...

    2014-09-27

    MspJI belongs to a family of restriction enzymes that cleave DNA containing 5-methylcytosine (5mC) or 5-hydroxymethylcytosine (5hmC). MspJI is specific for the sequence 5(h)mC-N-N-G or A and cleaves with some variability 9/13 nucleotides downstream. Earlier, we reported the crystal structure of MspJI without DNA and proposed how it might recognize this sequence and catalyze cleavage. Here we report its co-crystal structure with a 27-base pair oligonucleotide containing 5mC. This structure confirms that MspJI acts as a homotetramer and that the modified cytosine is flipped from the DNA helix into an SRA-like-binding pocket. We expected the structure to reveal two DNAmore » molecules bound specifically to the tetramer and engaged with the enzyme's two DNA-cleavage sites. A coincidence of crystal packing precluded this organization, however. We found that each DNA molecule interacted with two adjacent tetramers, binding one specifically and the other non-specifically. The latter interaction, which prevented cleavage-site engagement, also involved base flipping and might represent the sequence-interrogation phase that precedes specific recognition. MspJI is unusual in that DNA molecules are recognized and cleaved by different subunits. Such interchange of function might explain how other complex multimeric restriction enzymes act.« less

  11. Comparison between TRF2 and TRF1 of their telomeric DNA-bound structures and DNA-binding activities

    PubMed Central

    Hanaoka, Shingo; Nagadoi, Aritaka; Nishimura, Yoshifumi

    2005-01-01

    Mammalian telomeres consist of long tandem arrays of double-stranded telomeric TTAGGG repeats packaged by the telomeric DNA-binding proteins TRF1 and TRF2. Both contain a similar C-terminal Myb domain that mediates sequence-specific binding to telomeric DNA. In a DNA complex of TRF1, only the single Myb-like domain consisting of three helices can bind specifically to double-stranded telomeric DNA. TRF2 also binds to double-stranded telomeric DNA. Although the DNA binding mode of TRF2 is likely identical to that of TRF1, TRF2 plays an important role in the t-loop formation that protects the ends of telomeres. Here, to clarify the details of the double-stranded telomeric DNA-binding modes of TRF1 and TRF2, we determined the solution structure of the DNA-binding domain of human TRF2 bound to telomeric DNA; it consists of three helices, and like TRF1, the third helix recognizes TAGGG sequence in the major groove of DNA with the N-terminal arm locating in the minor groove. However, small but significant differences are observed; in contrast to the minor groove recognition of TRF1, in which an arginine residue recognizes the TT sequence, a lysine residue of TRF2 interacts with the TT part. We examined the telomeric DNA-binding activities of both DNA-binding domains of TRF1 and TRF2 and found that TRF1 binds more strongly than TRF2. Based on the structural differences of both domains, we created several mutants of the DNA-binding domain of TRF2 with stronger binding activities compared to the wild-type TRF2. PMID:15608118

  12. Sequence-dependent DNA deformability studied using molecular dynamics simulations.

    PubMed

    Fujii, Satoshi; Kono, Hidetoshi; Takenaka, Shigeori; Go, Nobuhiro; Sarai, Akinori

    2007-01-01

    Proteins recognize specific DNA sequences not only through direct contact between amino acids and bases, but also indirectly based on the sequence-dependent conformation and deformability of the DNA (indirect readout). We used molecular dynamics simulations to analyze the sequence-dependent DNA conformations of all 136 possible tetrameric sequences sandwiched between CGCG sequences. The deformability of dimeric steps obtained by the simulations is consistent with that by the crystal structures. The simulation results further showed that the conformation and deformability of the tetramers can highly depend on the flanking base pairs. The conformations of xATx tetramers show the most rigidity and are not affected by the flanking base pairs and the xYRx show by contrast the greatest flexibility and change their conformations depending on the base pairs at both ends, suggesting tetramers with the same central dimer can show different deformabilities. These results suggest that analysis of dimeric steps alone may overlook some conformational features of DNA and provide insight into the mechanism of indirect readout during protein-DNA recognition. Moreover, the sequence dependence of DNA conformation and deformability may be used to estimate the contribution of indirect readout to the specificity of protein-DNA recognition as well as nucleosome positioning and large-scale behavior of nucleic acids.

  13. Biophysics of protein-DNA interactions and chromosome organization

    PubMed Central

    Marko, John F.

    2014-01-01

    The function of DNA in cells depends on its interactions with protein molecules, which recognize and act on base sequence patterns along the double helix. These notes aim to introduce basic polymer physics of DNA molecules, biophysics of protein-DNA interactions and their study in single-DNA experiments, and some aspects of large-scale chromosome structure. Mechanisms for control of chromosome topology will also be discussed. PMID:25419039

  14. Methods for decoding Cas9 protospacer adjacent motif (PAM) sequences: A brief overview.

    PubMed

    Karvelis, Tautvydas; Gasiunas, Giedrius; Siksnys, Virginijus

    2017-05-15

    Recently the Cas9, an RNA guided DNA endonuclease, emerged as a powerful tool for targeted genome manipulations. Cas9 protein can be reprogrammed to cleave, bind or nick any DNA target by simply changing crRNA sequence, however a short nucleotide sequence, termed PAM, is required to initiate crRNA hybridization to the DNA target. PAM sequence is recognized by Cas9 protein and must be determined experimentally for each Cas9 variant. Exploration of Cas9 orthologs could offer a diversity of PAM sequences and novel biochemical properties that may be beneficial for genome editing applications. Here we briefly review and compare Cas9 PAM identification assays that can be adopted for other PAM-dependent CRISPR-Cas systems. Copyright © 2017 Elsevier Inc. All rights reserved.

  15. Creating a monomeric endonuclease TALE-I-SceI with high specificity and low genotoxicity in human cells.

    PubMed

    Lin, Jianfei; Chen, He; Luo, Ling; Lai, Yongrong; Xie, Wei; Kee, Kehkooi

    2015-01-01

    To correct a DNA mutation in the human genome for gene therapy, homology-directed repair (HDR) needs to be specific and have the lowest off-target effects to protect the human genome from deleterious mutations. Zinc finger nucleases, transcription activator-like effector nuclease (TALEN) and CRISPR-CAS9 systems have been engineered and used extensively to recognize and modify specific DNA sequences. Although TALEN and CRISPR/CAS9 could induce high levels of HDR in human cells, their genotoxicity was significantly higher. Here, we report the creation of a monomeric endonuclease that can recognize at least 33 bp by fusing the DNA-recognizing domain of TALEN (TALE) to a re-engineered homing endonuclease I-SceI. After sequentially re-engineering I-SceI to recognize 18 bp of the human β-globin sequence, the re-engineered I-SceI induced HDR in human cells. When the re-engineered I-SceI was fused to TALE (TALE-ISVB2), the chimeric endonuclease induced the same HDR rate at the human β-globin gene locus as that induced by TALEN, but significantly reduced genotoxicity. We further demonstrated that TALE-ISVB2 specifically targeted at the β-globin sequence in human hematopoietic stem cells. Therefore, this monomeric endonuclease has the potential to be used in therapeutic gene targeting in human cells. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

    PubMed

    El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

    2013-07-01

    Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.

  17. Using FRET to Measure the Angle at Which a Protein Bends DNA: TBP Binding a TATA Box as a Model System

    ERIC Educational Resources Information Center

    Kugel, Jennifer F.

    2008-01-01

    An undergraduate biochemistry laboratory experiment that will teach the technique of fluorescence resonance energy transfer (FRET) while analyzing protein-induced DNA bending is described. The experiment uses the protein TATA binding protein (TBP), which is a general transcription factor that recognizes and binds specific DNA sequences known as…

  18. Previously unknown and highly divergent ssDNA viruses populate the oceans.

    PubMed

    Labonté, Jessica M; Suttle, Curtis A

    2013-11-01

    Single-stranded DNA (ssDNA) viruses are economically important pathogens of plants and animals, and are widespread in oceans; yet, the diversity and evolutionary relationships among marine ssDNA viruses remain largely unknown. Here we present the results from a metagenomic study of composite samples from temperate (Saanich Inlet, 11 samples; Strait of Georgia, 85 samples) and subtropical (46 samples, Gulf of Mexico) seawater. Most sequences (84%) had no evident similarity to sequenced viruses. In total, 608 putative complete genomes of ssDNA viruses were assembled, almost doubling the number of ssDNA viral genomes in databases. These comprised 129 genetically distinct groups, each represented by at least one complete genome that had no recognizable similarity to each other or to other virus sequences. Given that the seven recognized families of ssDNA viruses have considerable sequence homology within them, this suggests that many of these genetic groups may represent new viral families. Moreover, nearly 70% of the sequences were similar to one of these genomes, indicating that most of the sequences could be assigned to a genetically distinct group. Most sequences fell within 11 well-defined gene groups, each sharing a common gene. Some of these encoded putative replication and coat proteins that had similarity to sequences from viruses infecting eukaryotes, suggesting that these were likely from viruses infecting eukaryotic phytoplankton and zooplankton.

  19. Cloning, sequencing, and expression of dnaK-operon proteins from the thermophilic bacterium Thermus thermophilus.

    PubMed

    Osipiuk, J; Joachimiak, A

    1997-09-12

    We propose that the dnaK operon of Thermus thermophilus HB8 is composed of three functionally linked genes: dnaK, grpE, and dnaJ. The dnaK and dnaJ gene products are most closely related to their cyanobacterial homologs. The DnaK protein sequence places T. thermophilus in the plastid Hsp70 subfamily. In contrast, the grpE translated sequence is most similar to GrpE from Clostridium acetobutylicum, a Gram-positive anaerobic bacterium. A single promoter region, with homology to the Escherichia coli consensus promoter sequences recognized by the sigma70 and sigma32 transcription factors, precedes the postulated operon. This promoter is heat-shock inducible. The dnaK mRNA level increased more than 30 times upon 10 min of heat shock (from 70 degrees C to 85 degrees C). A strong transcription terminating sequence was found between the dnaK and grpE genes. The individual genes were cloned into pET expression vectors and the thermophilic proteins were overproduced at high levels in E. coli and purified to homogeneity. The recombinant T. thermophilus DnaK protein was shown to have a weak ATP-hydrolytic activity, with an optimum at 90 degrees C. The ATPase was stimulated by the presence of GrpE and DnaJ. Another open reading frame, coding for ClpB heat-shock protein, was found downstream of the dnaK operon.

  20. BplI, a new BcgI-like restriction endonuclease, which recognizes a symmetric sequence.

    PubMed Central

    Vitkute, J; Maneliene, Z; Petrusyte, M; Janulaitis, A

    1997-01-01

    Bcg I and Bcg I-like restriction endonucleases cleave double stranded DNA specifically on both sides of their asymmetric recognition sequences which are interrupted by several ambiguous base pairs. Their heterosubunit structure, bifunctionality and stimulation by AdoMet make them different from other classified restriction enzymes. Here we report on a new Bcg I-like restriction endonuclease, Bpl I from Bacillus pumilus , which in contrast to all other Bcg I-like enzymes, recognizes a symmetric interrupted sequence, and which, like Bcg I, cleaves double stranded DNA upstream and downstream of its recognition sequence (8/13)GAGN5CTC(13/8). Like Bcg I, Bpl I is a bifunctional enzyme revealing both DNA cleavage and methyltransferase activities. There are two polypeptides in the homogeneous preparation of Bpl I with molecular masses of approximately 74 and 37 kDa. The sizes of the Bpl I subunits are close to those of Bcg I, but the proportion 1:1 in the final preparation is different from that of 2:1 in Bcg I. Low activity observed with Mg2+increases >100-fold in the presence of AdoMet. Even with AdoMet though, specific cleavage is incomplete. S -adenosylhomocysteine (AdoHcy) or sinefungin can replace AdoMet in the cleavage reaction. AdoHcy activated Bpl I yields complete cleavage of DNA. PMID:9358150

  1. TFBSshape: a motif database for DNA shape features of transcription factor binding sites.

    PubMed

    Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W; Gordân, Raluca; Rohs, Remo

    2014-01-01

    Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein-DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone.

  2. TFBSshape: a motif database for DNA shape features of transcription factor binding sites

    PubMed Central

    Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo

    2014-01-01

    Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955

  3. FA-SAT Is an Old Satellite DNA Frozen in Several Bilateria Genomes

    PubMed Central

    Chaves, Raquel; Ferreira, Daniela; Mendes-da-Silva, Ana; Meles, Susana; Adega, Filomena

    2017-01-01

    Abstract In recent years, a growing body of evidence has recognized the tandem repeat sequences, and specifically satellite DNA, as a functional class of sequences in the genomic “dark matter.” Using an original, complementary, and thus an eclectic experimental design, we show that the cat archetypal satellite DNA sequence, FA-SAT, is “frozen” conservatively in several Bilateria genomes. We found different genomic FA-SAT architectures, and the interspersion pattern was conserved. In Carnivora genomes, the FA-SAT-related sequences are also amplified, with the predominance of a specific FA-SAT variant, at the heterochromatic regions. We inspected the cat genome project to locate FA-SAT array flanking regions and revealed an intensive intermingling with transposable elements. Our results also show that FA-SAT-related sequences are transcribed and that the most abundant FA-SAT variant is not always the most transcribed. We thus conclude that the DNA sequences of FA-SAT and their transcripts are “frozen” in these genomes. Future work is needed to disclose any putative function that these sequences may play in these genomes. PMID:29608678

  4. The structures of non-CG-repeat Z-DNAs co-crystallized with the Z-DNA-binding domain, hZ alpha(ADAR1).

    PubMed

    Ha, Sung Chul; Choi, Jongkeun; Hwang, Hye-Yeon; Rich, Alexander; Kim, Yang-Gyun; Kim, Kyeong Kyu

    2009-02-01

    The Z-DNA conformation preferentially occurs at alternating purine-pyrimidine repeats, and is specifically recognized by Z alpha domains identified in several Z-DNA-binding proteins. The binding of Z alpha to foreign or chromosomal DNA in various sequence contexts is known to influence various biological functions, including the DNA-mediated innate immune response and transcriptional modulation of gene expression. For these reasons, understanding its binding mode and the conformational diversity of Z alpha bound Z-DNAs is of considerable importance. However, structural studies of Z alpha bound Z-DNA have been mostly limited to standard CG-repeat DNAs. Here, we have solved the crystal structures of three representative non-CG repeat DNAs, d(CACGTG)(2), d(CGTACG)(2) and d(CGGCCG)(2) complexed to hZ alpha(ADAR1) and compared those structures with that of hZ alpha(ADAR1)/d(CGCGCG)(2) and the Z alpha-free Z-DNAs. hZ alpha(ADAR1) bound to each of the three Z-DNAs showed a well conserved binding mode with very limited structural deviation irrespective of the DNA sequence, although varying numbers of residues were in contact with Z-DNA. Z-DNAs display less structural alterations in the Z alpha-bound state than in their free form, thereby suggesting that conformational diversities of Z-DNAs are restrained by the binding pocket of Z alpha. These data suggest that Z-DNAs are recognized by Z alpha through common conformational features regardless of the sequence and structural alterations.

  5. Sequence Discrimination by Alternatively Spliced Isoforms of a DNA Binding Zinc Finger Domain

    NASA Astrophysics Data System (ADS)

    Gogos, Joseph A.; Hsu, Tien; Bolton, Jesse; Kafatos, Fotis C.

    1992-09-01

    Two major developmentally regulated isoforms of the Drosophila chorion transcription factor CF2 differ by an extra zinc finger within the DNA binding domain. The preferred DNA binding sites were determined and are distinguished by an internal duplication of TAT in the site recognized by the isoform with the extra finger. The results are consistent with modular interactions between zinc fingers and trinucleotides and also suggest rules for recognition of AT-rich DNA sites by zinc finger proteins. The results show how modular finger interactions with trinucleotides can be used, in conjunction with alternative splicing, to alter the binding specificity and increase the spectrum of sites recognized by a DNA binding domain. Thus, CF2 may potentially regulate distinct sets of target genes during development.

  6. A multiple-alignment based primer design algorithm for genetically highly variable DNA targets

    PubMed Central

    2013-01-01

    Background Primer design for highly variable DNA sequences is difficult, and experimental success requires attention to many interacting constraints. The advent of next-generation sequencing methods allows the investigation of rare variants otherwise hidden deep in large populations, but requires attention to population diversity and primer localization in relatively conserved regions, in addition to recognized constraints typically considered in primer design. Results Design constraints include degenerate sites to maximize population coverage, matching of melting temperatures, optimizing de novo sequence length, finding optimal bio-barcodes to allow efficient downstream analyses, and minimizing risk of dimerization. To facilitate primer design addressing these and other constraints, we created a novel computer program (PrimerDesign) that automates this complex procedure. We show its powers and limitations and give examples of successful designs for the analysis of HIV-1 populations. Conclusions PrimerDesign is useful for researchers who want to design DNA primers and probes for analyzing highly variable DNA populations. It can be used to design primers for PCR, RT-PCR, Sanger sequencing, next-generation sequencing, and other experimental protocols targeting highly variable DNA samples. PMID:23965160

  7. On the Sequence-Directed Nature of Human Gene Mutation: The Role of Genomic Architecture and the Local DNA Sequence Environment in Mediating Gene Mutations Underlying Human Inherited Disease

    PubMed Central

    Cooper, David N.; Bacolla, Albino; Férec, Claude; Vasquez, Karen M.; Kehrer-Sawatzki, Hildegard; Chen, Jian-Min

    2011-01-01

    Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher-order features of the genomic architecture. The human genome is now recognized to contain ‘pervasive architectural flaws’ in that certain DNA sequences are inherently mutation-prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of non-canonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair, and may serve to increase mutation frequencies in generalized fashion (i.e. both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease. PMID:21853507

  8. Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

    PubMed

    Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

    2015-01-01

    The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.

  9. Phylogenetic position of the pentastomida and [pan]crustacean relationships

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lavrov, Dennis V.; Brown, Wesley M.; Boore, Jeffrey L.

    2004-01-31

    Pentastomids are a small group of vermiform animals with unique morphology and parasitic lifestyle. They are generally recognized as being related to the Arthropoda, however the nature of this relationship is controversial. We have determined the complete sequence of the mitochondrial DNA (mtDNA) of the pentastomid Armillifer armillatus and complete, or nearly complete, mtDNA sequences from representatives of four previously unsampled groups of Crustacea: Remipedia (Speleonectes tulumensis), Cephalocarida (Hutchinsoniella macracantha), Cirripedia (Pollicipes polymerus), and Branchiura (Argulus americanus). Analyses of the mtDNA gene arrangements and sequences determined in this study indicate unambiguously that pentastomids are a group of modified crustaceans likelymore » related to branchiurans. In addition, gene arrangement comparisons strongly support an unforeseen assemblage of pentastomids with maxillopod and cephalocarid crustaceans, to the exclusion of remipedes, branchiopods, malacos tracans and insects.« less

  10. Data Release: DNA barcodes of plant species collected for the Global Genome Initiative for Gardens Program, National Museum of Natural History, Smithsonian Institution

    PubMed Central

    Zúñiga, Jose D.; Gostel, Morgan R.; Mulcahy, Daniel G.; Barker, Katharine; Asia Hill; Sedaghatpour, Maryam; Vo, Samantha Q.; Funk, Vicki A.; Coddington, Jonathan A.

    2017-01-01

    Abstract The Global Genome Initiative has sequenced and released 1961 DNA barcodes for genetic samples obtained as part of the Global Genome Initiative for Gardens Program. The dataset includes barcodes for 29 plant families and 309 genera that did not have sequences flagged as barcodes in GenBank and sequences from officially recognized barcoding genetic markers meet the data standard of the Consortium for the Barcode of Life. The genetic samples were deposited in the Smithsonian Institution’s National Museum of Natural History Biorepository and their records were made public through the Global Genome Biodiversity Network’s portal. The DNA barcodes are now available on GenBank. PMID:29118648

  11. Structure and DNA-Binding Sites of the SWI1 AT-rich Interaction Domain (ARID) Suggest Determinants for Sequence-Specific DNA Recognition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Suhkmann; Zhang, Ziming; Upchurch, Sean

    2004-04-16

    2 ARID is a homologous family of DNA-binding domains that occur in DNA binding proteins from a wide variety of species, ranging from yeast to nematodes, insects, mammals and plants. SWI1, a member of the SWI/SNF protein complex that is involved in chromatin remodeling during transcription, contains the ARID motif. The ARID domain of human SWI1 (also known as p270) does not select for a specific DNA sequence from a random sequence pool. The lack of sequence specificity shown by the SWI1 ARID domain stands in contrast to the other characterized ARID domains, which recognize specific AT-rich sequences. We havemore » solved the three-dimensional structure of human SWI1 ARID using solution NMR methods. In addition, we have characterized non-specific DNA-binding by the SWI1 ARID domain. Results from this study indicate that a flexible long internal loop in ARID motif is likely to be important for sequence specific DNA-recognition. The structure of human SWI1 ARID domain also represents a distinct structural subfamily. Studies of ARID indicate that boundary of the DNA binding structural and functional domains can extend beyond the sequence homologous region in a homologous family of proteins. Structural studies of homologous domains such as ARID family of DNA-binding domains should provide information to better predict the boundary of structural and functional domains in structural genomic studies. Key Words: ARID, SWI1, NMR, structural genomics, protein-DNA interaction.« less

  12. Structural Analysis of HMGD-DNA Complexes Reveal Influence of Intercalation on Sequence Selectivity and DNA Bending

    PubMed Central

    Churchill, Mair E.A.; Klass, Janet; Zoetewey, David L.

    2010-01-01

    The ubiquitous eukaryotic High-Mobility-Group-Box (HMGB) chromosomal proteins promote many chromatin-mediated cellular activities through their non-sequence-specific binding and bending of DNA. Minor groove DNA binding by the HMG box results in substantial DNA bending toward the major groove owing to electrostatic interactions, shape complementarity and DNA intercalation that occurs at two sites. Here, the structures of the complexes formed with DNA by a partially DNA intercalation-deficient mutant of Drosophila melanogaster HMGD have been determined by X-ray crystallography at a resolution of 2.85 Å. The six proteins and fifty base pairs of DNA in the crystal structure revealed a variety of bound conformations. All of the proteins bound in the minor groove, bridging DNA molecules, presumably because these DNA regions are easily deformed. The loss of the primary site of DNA intercalation decreased overall DNA bending and shape complementarity. However, DNA bending at the secondary site of intercalation was retained and most protein-DNA contacts were preserved. The mode of binding resembles the HMGB1-boxA-cisplatin-DNA complex, which also lacks a primary intercalating residue. This study provides new insights into the binding mechanisms used by HMG boxes to recognize varied DNA structures and sequences as well as modulate DNA structure and DNA bending. PMID:20800069

  13. In vitro selection of DNA elements highly responsive to the human T-cell lymphotropic virus type I transcriptional activator, Tax.

    PubMed

    Paca-Uccaralertkun, S; Zhao, L J; Adya, N; Cross, J V; Cullen, B R; Boros, I M; Giam, C Z

    1994-01-01

    The human T-cell lymphotropic virus type I (HTLV-I) transactivator, Tax, the ubiquitous transcriptional factor cyclic AMP (cAMP) response element-binding protein (CREB protein), and the 21-bp repeats in the HTLV-I transcriptional enhancer form a ternary nucleoprotein complex (L. J. Zhao and C. Z. Giam, Proc. Natl. Acad. Sci. USA 89:7070-7074, 1992). Using an antibody directed against the COOH-terminal region of Tax along with purified Tax and CREB proteins, we selected DNA elements bound specifically by the Tax-CREB complex in vitro. Two distinct but related groups of sequences containing the cAMP response element (CRE) flanked by long runs of G and C residues in the 5' and 3' regions, respectively, were preferentially recognized by Tax-CREB. In contrast, CREB alone binds only to CRE motifs (GNTGACG[T/C]) without neighboring G- or C-rich sequences. The Tax-CREB-selected sequences bear a striking resemblance to the 5' or 3' two-thirds of the HTLV-I 21-bp repeats and are highly inducible by Tax. Gel electrophoretic mobility shift assays, DNA transfection, and DNase I footprinting analyses indicated that the G- and C-rich sequences flanking the CRE motif are crucial for Tax-CREB-DNA ternary complex assembly and Tax transactivation but are not in direct contact with the Tax-CREB complex. These data show that Tax recruits CREB to form a multiprotein complex that specifically recognizes the viral 21-bp repeats. The expanded DNA binding specificity of Tax-CREB and the obligatory role the ternary Tax-CREB-DNA complex plays in transactivation reveal a novel mechanism for regulating the transcriptional activity of leucine zipper proteins like CREB.

  14. Murine J774 Macrophages Recognize LPS/IFN-g, Non-CpG DNA or Two-CpG DNA-containing Sequences as Immunologically Distinct

    PubMed Central

    Crosby, Lynn; Casey, Warren; Morgan, Kevin; Ni, Hong; Yoon, Lawrence; Easton, Marilyn; Misukonis, Mary; Burleson, Gary; Ghosh, Dipak K.

    2010-01-01

    Specific bacterial lipopolysaccharides (LPS), IFN-γ, and unmethylated cytosine or guanosine-phosphorothioate containing DNAs (CpG) activate host immunity, influencing infectious responses. Macrophages detect, inactivate and destroy infectious particles, and synthetic CpG sequences invoke similar responses of the innate immune system. Previously, murine macrophage J774 cells treated with CpG induced the expression of nitric oxide synthase 2 (NOS2) and cyclo-oxygenase 2 (COX2) mRNA and protein. In this study murine J774 macrophages were exposed to vehicle, interferon γ + lipopolysaccharide (IFN-g/LPS), non-CpG (SAK1), or two-CpG sequence-containing DNA (SAK2) for 0–18 hr and gene expression changes measured. A large number of immunostimulatory and inflammatory changes were observed. SAK2 was a stronger activator of TNFα- and chemokine expression-related changes than LPS/IFN-g. Up regulation included tumor necrosis factor receptor superfamily genes (TNFRSF’s), IL-1 receptor signaling via stress-activated protein kinase (SAPK), NF-κB activation, hemopoietic maturation factors and sonic hedgehog/wingless integration site (SHH/Wnt) pathway genes. Genes of the TGF-β pathway were down regulated. In contrast, LPS/IFN-g -treated cells showed increased levels for TGF-β signaling genes, which may be linked to the observed up regulation of numerous collagens and down regulation of Wnt pathway genes. SAK1 produced distinct changes from LPS/IFN-g or SAK2. Therefore, J774 macrophages recognize LPS/IFN-g, non-CpG DNA or two-CpG DNA-containing sequences as immunologically distinct. PMID:20097302

  15. Recognition of the Xenopus ribosomal core promoter by the transcription factor xUBF involves multiple HMG box domains and leads to an xUBF interdomain interaction.

    PubMed

    Leblanc, B; Read, C; Moss, T

    1993-02-01

    The interaction of the ribosomal transcription factor xUBF with the RNA polymerase I core promoter of Xenopus laevis has been studied both at the DNA and protein levels. It is shown that a single xUBF-DNA complex forms over the 40S initiation site (+1) and involves at least the DNA sequences between -20 and +60 bp. DNA sequences upstream of +10 and downstream of +18 are each sufficient to direct complex formation independently. HMG box 1 of xUBF independently recognizes the sequences -20 to -1 and +1 to +22 and the addition of the N-terminal dimerization domain to HMG box 1 stabilizes its interaction with these sequences approximately 10-fold. HMG boxes 2/3 interact with the DNA downstream of +22 and can independently position xUBF across the initiation site. The C-terminal segment of xUBF, HMG boxes 4, 5 or the acidic domain, directly or indirectly interact with HMG box 1, making the core promoter sequences between -11 and -15 hypersensitive to DNase. This interaction also requires the DNA sequences between +17 and +32, i.e. the HMG box 2/3 binding site. The data suggest extensive folding of the core promoter within the xUBF complex.

  16. SRY, like HMG1, recognizes sharp angles in DNA.

    PubMed Central

    Ferrari, S; Harley, V R; Pontiggia, A; Goodfellow, P N; Lovell-Badge, R; Bianchi, M E

    1992-01-01

    HMG boxes are DNA binding domains present in chromatin proteins, general transcription factors for nucleolar and mitochondrial RNA polymerases, and gene- and tissue-specific transcriptional regulators. The HMG boxes of HMG1, an abundant component of chromatin, interact specifically with four-way junctions, DNA structures that are cross-shaped and contain angles of approximately 60 and 120 degrees between their arms. We show here also that the HMG box of SRY, the protein that determines the expression of male-specific genes in humans, recognizes four-way junction DNAs irrespective of their sequence. In addition, when SRY binds to linear duplex DNA containing its specific target AACAAAG, it produces a sharp bend. Therefore, the interaction between HMG boxes and DNA appears to be predominantly structure-specific. The production of the recognition of a kink in DNA can serve several distinct functions, such as the repair of DNA lesions, the folding of DNA segments with bound transcriptional factors into productive complexes or the wrapping of DNA in chromatin. Images PMID:1425584

  17. Recognition of Local DNA Structures by p53 Protein

    PubMed Central

    Brázda, Václav; Coufal, Jan

    2017-01-01

    p53 plays critical roles in regulating cell cycle, apoptosis, senescence and metabolism and is commonly mutated in human cancer. These roles are achieved by interaction with other proteins, but particularly by interaction with DNA. As a transcription factor, p53 is well known to bind consensus target sequences in linear B-DNA. Recent findings indicate that p53 binds with higher affinity to target sequences that form cruciform DNA structure. Moreover, p53 binds very tightly to non-B DNA structures and local DNA structures are increasingly recognized to influence the activity of wild-type and mutant p53. Apart from cruciform structures, p53 binds to quadruplex DNA, triplex DNA, DNA loops, bulged DNA and hemicatenane DNA. In this review, we describe local DNA structures and summarize information about interactions of p53 with these structural DNA motifs. These recent data provide important insights into the complexity of the p53 pathway and the functional consequences of wild-type and mutant p53 activation in normal and tumor cells. PMID:28208646

  18. Two dimensional molecular electronics spectroscopy for molecular fingerprinting, DNA sequencing, and cancerous DNA recognition.

    PubMed

    Rajan, Arunkumar Chitteth; Rezapour, Mohammad Reza; Yun, Jeonghun; Cho, Yeonchoo; Cho, Woo Jong; Min, Seung Kyu; Lee, Geunsik; Kim, Kwang S

    2014-02-25

    Laser-driven molecular spectroscopy of low spatial resolution is widely used, while electronic current-driven molecular spectroscopy of atomic scale resolution has been limited because currents provide only minimal information. However, electron transmission of a graphene nanoribbon on which a molecule is adsorbed shows molecular fingerprints of Fano resonances, i.e., characteristic features of frontier orbitals and conformations of physisorbed molecules. Utilizing these resonance profiles, here we demonstrate two-dimensional molecular electronics spectroscopy (2D MES). The differential conductance with respect to bias and gate voltages not only distinguishes different types of nucleobases for DNA sequencing but also recognizes methylated nucleobases which could be related to cancerous cell growth. This 2D MES could open an exciting field to recognize single molecule signatures at atomic resolution. The advantages of the 2D MES over the one-dimensional (1D) current analysis can be comparable to those of 2D NMR over 1D NMR analysis.

  19. Sequence Dependent Interactions Between DNA and Single-Walled Carbon Nanotubes

    NASA Astrophysics Data System (ADS)

    Roxbury, Daniel

    It is known that single-stranded DNA adopts a helical wrap around a single-walled carbon nanotube (SWCNT), forming a water-dispersible hybrid molecule. The ability to sort mixtures of SWCNTs based on chirality (electronic species) has recently been demonstrated using special short DNA sequences that recognize certain matching SWCNTs of specific chirality. This thesis investigates the intricacies of DNA-SWCNT sequence-specific interactions through both experimental and molecular simulation studies. The DNA-SWCNT binding strengths were experimentally quantified by studying the kinetics of DNA replacement by a surfactant on the surface of particular SWCNTs. Recognition ability was found to correlate strongly with measured binding strength, e.g. DNA sequence (TAT)4 was found to bind 20 times stronger to the (6,5)-SWCNT than sequence (TAT)4T. Next, using replica exchange molecular dynamics (REMD) simulations, equilibrium structures formed by (a) single-strands and (b) multiple-strands of 12-mer oligonucleotides adsorbed on various SWCNTs were explored. A number of structural motifs were discovered in which the DNA strand wraps around the SWCNT and 'stitches' to itself via hydrogen bonding. Great variability among equilibrium structures was observed and shown to be directly influenced by DNA sequence and SWCNT type. For example, the (6,5)-SWCNT DNA recognition sequence, (TAT)4, was found to wrap in a tight single-stranded right-handed helical conformation. In contrast, DNA sequence T12 forms a beta-barrel left-handed structure on the same SWCNT. These are the first theoretical indications that DNA-based SWCNT selectivity can arise on a molecular level. In a biomedical collaboration with the Mayo Clinic, pathways for DNA-SWCNT internalization into healthy human endothelial cells were explored. Through absorbance spectroscopy, TEM imaging, and confocal fluorescence microscopy, we showed that intracellular concentrations of SWCNTs far exceeded those of the incubation solution, which suggested an energy-dependent pathway. Additionally, by means of pharmacological inhibition and vector-induced gene knockout studies, the DNA-SWCNTs were shown to enter the cells via Rac1-mediated macropinocytosis.

  20. Curated collection of yeast transcription factor DNA binding specificity data reveals novel structural and gene regulatory insights

    PubMed Central

    2011-01-01

    Background Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking. Results We analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs. Conclusions The comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles. PMID:22189060

  1. A paper-based device for double-stranded DNA detection with Zif268

    NASA Astrophysics Data System (ADS)

    Zhang, Daohong

    2017-05-01

    Here, a small analytical device was fabricated on both nitrocellulose membrane and filter paper, for the detection of biotinylated double-stranded DNA (dsDNA) from 1 nM. Zif268 was utilized for capturing the target DNA, which was a zinc finger protein that recognized only a dsDNA with specific sequence. Therefore, this detection platform could be utilized for PCR result detection, with the well-designed primers (interpolate both biotin and Zif268 binding sequence). The result of the assay could be recorded by a camera-phone, and analyzed with software. The whole assay finished within 1 hour. Due to the easy fabrication, operation and disposal of this device, this method can be employed in point-of-care detection or on-site monitoring.

  2. The twilight zone of cis element alignments.

    PubMed

    Sebastian, Alvaro; Contreras-Moreira, Bruno

    2013-02-01

    Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not been fully addressed yet. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein-DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein-DNA interfaces is significantly more conserved than their sequence and measures how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments.

  3. The twilight zone of cis element alignments

    PubMed Central

    Sebastian, Alvaro; Contreras-Moreira, Bruno

    2013-01-01

    Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not been fully addressed yet. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein–DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein–DNA interfaces is significantly more conserved than their sequence and measures how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments. PMID:23268451

  4. Cooperative interactions between paired domain and homeodomain.

    PubMed

    Jun, S; Desplan, C

    1996-09-01

    The Pax proteins are a family of transcriptional regulators involved in many developmental processes in all higher eukaryotes. They are characterized by the presence of a paired domain (PD), a bipartite DNA binding domain composed of two helix-turn-helix (HTH) motifs,the PAI and RED domains. The PD is also often associated with a homeodomain (HD) which is itself able to form homo- and hetero-dimers on DNA. Many of these proteins therefore contain three HTH motifs each able to recognize DNA. However, all PDs recognize highly related DNA sequences, and most HDs also recognize almost identical sites. We show here that different Pax proteins use multiple combinations of their HTHs to recognize several types of target sites. For instance, the Drosophila Paired protein can bind, in vitro, exclusively through its PAI domain, or through a dimer of its HD, or through cooperative interaction between PAI domain and HD. However, prd function in vivo requires the synergistic action of both the PAI domain and the HD. Pax proteins with only a PD appear to require both PAI and RED domains, while a Pax-6 isoform and a new Pax protein, Lune, may rely on the RED domain and HD. We propose a model by which Pax proteins recognize different target genes in vivo through various combinations of their DNA binding domains, thus expanding their recognition repertoire.

  5. Investigation of the Causes of Breast Cancer at the Cellular Level: Isolation of In Vivo Binding Sites of the Human Origin Recognition Complex

    DTIC Science & Technology

    2002-08-01

    We study the process of DNA replication in proliferating human cells. Our efforts are directed to the identification and characterization of proteins...that promote DNA replication (initiators) as well as the DNA sequences recognized by them (replicators) . We have focused in a group of initiator...to be a critical factor for the coordination of DNA replication with the cell division cycle. hOrclp levels are higher between the exit of mitosis and

  6. Development of a Diagnostic Tool to Detect DNA Methylation Biomarkers for Early-Stage Lung Cancer

    DTIC Science & Technology

    2015-02-01

    include: 1) a DNA recognition domain that recognizes the specific DNA sequence of interest and 2) one half of the leucine zipper pair. The second...piece will include 1) the second half of the leucine zipper pair, 2) a flexible linker flanked by a FRET pair that determines the local (within 30 bp...each other to determine the resolution of our probes. All DNA fragments are methylated using bacterial methyltransferase. Since only a single CG

  7. High resolution optical DNA mapping

    NASA Astrophysics Data System (ADS)

    Baday, Murat

    Many types of diseases including cancer and autism are associated with copy-number variations in the genome. Most of these variations could not be identified with existing sequencing and optical DNA mapping methods. We have developed Multi-color Super-resolution technique, with potential for high throughput and low cost, which can allow us to recognize more of these variations. Our technique has made 10--fold improvement in the resolution of optical DNA mapping. Using a 180 kb BAC clone as a model system, we resolved dense patterns from 108 fluorescent labels of two different colors representing two different sequence-motifs. Overall, a detailed DNA map with 100 bp resolution was achieved, which has the potential to reveal detailed information about genetic variance and to facilitate medical diagnosis of genetic disease.

  8. Helix Unwinding and Base Flipping Enable Human MTERF1 to Terminate Mitochondrial Transcription

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yakubovskaya, E.; Mejia, E; Byrnes, J

    2010-01-01

    Defects in mitochondrial gene expression are associated with aging and disease. Mterf proteins have been implicated in modulating transcription, replication and protein synthesis. We have solved the structure of a member of this family, the human mitochondrial transcriptional terminator MTERF1, bound to dsDNA containing the termination sequence. The structure indicates that upon sequence recognition MTERF1 unwinds the DNA molecule, promoting eversion of three nucleotides. Base flipping is critical for stable binding and transcriptional termination. Additional structural and biochemical results provide insight into the DNA binding mechanism and explain how MTERF1 recognizes its target sequence. Finally, we have demonstrated that themore » mitochondrial pathogenic G3249A and G3244A mutations interfere with key interactions for sequence recognition, eliminating termination. Our results provide insight into the role of mterf proteins and suggest a link between mitochondrial disease and the regulation of mitochondrial transcription.« less

  9. DNA microdevice for electrochemical detection of Escherichia coli 0157:H7 molecular markers.

    PubMed

    Berganza, J; Olabarria, G; García, R; Verdoy, D; Rebollo, A; Arana, S

    2007-04-15

    An electrochemical DNA sensor based on the hybridization recognition of a single-stranded DNA (ssDNA) probe immobilized onto a gold electrode to its complementary ssDNA is presented. The DNA probe is bound on gold surface electrode by using self-assembled monolayer (SAM) technology. An optimized mixed SAM with a blocking molecule preventing the nonspecific adsorption on the electrode surface has been prepared. In this paper, a DNA biosensor is designed by means of the immobilization of a single stranded DNA probe on an electrochemical transducer surface to recognize specifically Escherichia coli (E. coli) 0157:H7 complementary target DNA sequence via cyclic voltammetry experiments. The 21 mer DNA probe including a C6 alkanethiol group at the 5' phosphate end has been synthesized to form the SAM onto the gold surface through the gold sulfur bond. The goal of this paper has been to design, characterise and optimise an electrochemical DNA sensor. In order to investigate the oligonucleotide probe immobilization and the hybridization detection, experiments with different concentration of DNA and mismatch sequences have been performed. This microdevice has demonstrated the suitability of oligonucleotide Self-assembled monolayers (SAMs) on gold as immobilization method. The DNA probes deposited on gold surface have been functional and able to detect changes in bases sequence in a 21-mer oligonucleotide.

  10. Single-stranded DNA cleavage by divergent CRISPR-Cas9 enzymes

    PubMed Central

    Ma, Enbo; Harrington, Lucas B.; O’Connell, Mitchell R.; Zhou, Kaihong; Doudna, Jennifer A.

    2015-01-01

    Summary Double-stranded DNA (dsDNA) cleavage by Cas9 is a hallmark of type II CRISPR-Cas immune systems. Cas9–guide RNA complexes recognize 20-base-pair sequences in DNA and generate a site-specific double-strand break, a robust activity harnessed for genome editing. DNA recognition by all studied Cas9 enzymes requires a protospacer adjacent motif (PAM) next to the target site. We show that Cas9 enzymes from evolutionarily divergent bacteria can recognize and cleave single-stranded DNA (ssDNA) by an RNA-guided, PAM-independent recognition mechanism. Comparative analysis shows that in contrast to the type II-A S. pyogenes Cas9 that is widely used for genome engineering, the smaller type II-C Cas9 proteins have limited dsDNA binding and unwinding activity and promiscuous guide-RNA specificity. These results indicate that inefficiency of type II-C Cas9 enzymes for genome editing results from a limited ability to cleave dsDNA, and suggest that ssDNA cleavage was an ancestral function of the Cas9 enzyme family. PMID:26545076

  11. Forensic strategy to ensure the quality of sequencing data of mitochondrial DNA in highly degraded samples.

    PubMed

    Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki

    2014-01-01

    Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. It was observed that direct sequencing data of the hypervariable segment (HVS) 1 by using primer sets that generate an amplicon of 407 bp (long-primer sets) was different from results obtained by using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, it was unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  12. Programmable DNA-Guided Artificial Restriction Enzymes.

    PubMed

    Enghiad, Behnam; Zhao, Huimin

    2017-05-19

    Restriction enzymes are essential tools for recombinant DNA technology that have revolutionized modern biological research. However, they have limited sequence specificity and availability. Here we report a Pyrococcus furiosus Argonaute (PfAgo) based platform for generating artificial restriction enzymes (AREs) capable of recognizing and cleaving DNA sequences at virtually any arbitrary site and generating defined sticky ends of varying length. Short DNA guides are used to direct PfAgo to target sites for cleavage at high temperatures (>87 °C) followed by reannealing of the cleaved single stranded DNAs. We used this platform to generate over 18 AREs for DNA fingerprinting and molecular cloning of PCR-amplified or genomic DNAs. These AREs work as efficiently as their naturally occurring counterparts, and some of them even do not have any naturally occurring counterparts, demonstrating easy programmability, generality, versatility, and high efficiency for this new technology.

  13. Rapid detection of IHNV by molecular padlock recognition and surface-associated isothermal amplification

    NASA Astrophysics Data System (ADS)

    McCarthy, Erik L.; Egeler, Teressa J.; Bickerstaff, Lee E.; Pereira da Cunha, Mauricio; Millard, Paul J.

    2005-11-01

    RNA sequences derived from infectious hematopoeitic necrosis virus (IHNV) could be detected using a combination of surface-associated molecular padlock DNA probes (MPP) and rolling circle amplification (RCA) in microcapillary tubes. DNA oligonucleotides with base sequences identical to RNA obtained from IHNV were recognized by MPP. Circularized MPP were then captured on the inner surface of glass microcapillary tubes by immobilized DNA oligonucleotide primers. Extension of the immobilized primers by isothermal RCA gave rise to DNA concatamers, which were in turn bound by the fluorescent reporter SYBR Green II nucleic acid stain, and measured by microfluorimetry. Surface-associated molecular padlock technology, combined with isothermal RCA, exhibited high selectivity and sensitivity without thermal cycling. This technology is applicable to direct RNA and DNA detection, permitting detection of a variety of viral or bacterial pathogens.

  14. Characterization of a restriction-modification system of the thermotolerant methylotroph Bacillus methanolicus.

    PubMed Central

    Cue, D; Lam, H; Hanson, R S; Flickinger, M C

    1996-01-01

    We report the isolation of a restriction endonuclease, BmeTI, an isoschizomer of BclI, that recognizes the DNA sequence 5' TGATCA 3'. We also report that BmeTI sites are modified to TGm6ATCA. These findings provide the basis for devising strategies to prevent BmeTI restriction of any DNA introduced into Bacillus methanolicus. PMID:8975604

  15. Structural basis of DNA target recognition by the B3 domain of Arabidopsis epigenome reader VAL1

    PubMed Central

    Sasnauskas, Giedrius; Kauneckaitė, Kotryna; Siksnys, Virginijus

    2018-01-01

    Abstract Arabidopsis thaliana requires a prolonged period of cold exposure during winter to initiate flowering in a process termed vernalization. Exposure to cold induces epigenetic silencing of the FLOWERING LOCUS C (FLC) gene by Polycomb group (PcG) proteins. A key role in this epigenetic switch is played by transcriptional repressors VAL1 and VAL2, which specifically recognize Sph/RY DNA sequences within FLC via B3 DNA binding domains, and mediate recruitment of PcG silencing machinery. To understand the structural mechanism of site-specific DNA recognition by VAL1, we have solved the crystal structure of VAL1 B3 domain (VAL1-B3) bound to a 12 bp oligoduplex containing the canonical Sph/RY DNA sequence 5′-CATGCA-3′/5′-TGCATG-3′. We find that VAL1-B3 makes H-bonds and van der Waals contacts to DNA bases of all six positions of the canonical Sph/RY element. In agreement with the structure, in vitro DNA binding studies show that VAL1-B3 does not tolerate substitutions at any position of the 5′-TGCATG-3′ sequence. The VAL1-B3–DNA structure presented here provides a structural model for understanding the specificity of plant B3 domains interacting with the Sph/RY and other DNA sequences. PMID:29660015

  16. Direct sequencing of hepatitis A virus and norovirus RT-PCR products from environmentally contaminated oyster using M13-tailed primers.

    PubMed

    Williams-Woods, Jacquelina; González-Escalona, Narjol; Burkhardt, William

    2011-12-01

    Human norovirus (HuNoV) and hepatitis A (HAV) are recognized as leading causes of non-bacterial foodborne associated illnesses in the United States. DNA sequencing is generally considered the standard for accurate viral genotyping in support of epidemiological investigations. Due to the genetic diversity of noroviruses (NoV), degenerate primer sets are often used in conventional reverse transcription (RT) PCR and real-time RT-quantitative PCR (RT-qPCR) for the detection of these viruses and cDNA fragments are generally cloned prior to sequencing. HAV detection methods that are sensitive and specific for real-time RT-qPCR yields small fragments sizes of 89-150bp, which can be difficult to sequence. In order to overcome these obstacles, norovirus and HAV primers were tailed with M13 forward and reverse primers. This modification increases the sequenced product size and allows for direct sequencing of the amplicons utilizing complementary M13 primers. HuNoV and HAV cDNA products from environmentally contaminated oysters were analyzed using this method. Alignments of the sequenced samples revealed ≥95% nucleotide identities. Tailing NoV and HAV primers with M13 sequence increases the cDNA product size, offers an alternative to cloning, and allows for rapid, accurate and direct sequencing of cDNA products produced by conventional or real time RT-qPCR assays. Published by Elsevier B.V.

  17. Informative priors based on transcription factor structural class improve de novo motif discovery.

    PubMed

    Narlikar, Leelavati; Gordân, Raluca; Ohler, Uwe; Hartemink, Alexander J

    2006-07-15

    An important problem in molecular biology is to identify the locations at which a transcription factor (TF) binds to DNA, given a set of DNA sequences believed to be bound by that TF. In previous work, we showed that information in the DNA sequence of a binding site is sufficient to predict the structural class of the TF that binds it. In particular, this suggests that we can predict which locations in any DNA sequence are more likely to be bound by certain classes of TFs than others. Here, we argue that traditional methods for de novo motif finding can be significantly improved by adopting an informative prior probability that a TF binding site occurs at each sequence location. To demonstrate the utility of such an approach, we present priority, a powerful new de novo motif finding algorithm. Using data from TRANSFAC, we train three classifiers to recognize binding sites of basic leucine zipper, forkhead, and basic helix loop helix TFs. These classifiers are used to equip priority with three class-specific priors, in addition to a default prior to handle TFs of other classes. We apply priority and a number of popular motif finding programs to sets of yeast intergenic regions that are reported by ChIP-chip to be bound by particular TFs. priority identifies motifs the other methods fail to identify, and correctly predicts the structural class of the TF recognizing the identified binding sites. Supplementary material and code can be found at http://www.cs.duke.edu/~amink/.

  18. DNA barcode-based delineation of putative species: efficient start for taxonomic workflows

    PubMed Central

    Kekkonen, Mari; Hebert, Paul D N

    2014-01-01

    The analysis of DNA barcode sequences with varying techniques for cluster recognition provides an efficient approach for recognizing putative species (operational taxonomic units, OTUs). This approach accelerates and improves taxonomic workflows by exposing cryptic species and decreasing the risk of synonymy. This study tested the congruence of OTUs resulting from the application of three analytical methods (ABGD, BIN, GMYC) to sequence data for Australian hypertrophine moths. OTUs supported by all three approaches were viewed as robust, but 20% of the OTUs were only recognized by one or two of the methods. These OTUs were examined for three criteria to clarify their status. Monophyly and diagnostic nucleotides were both uninformative, but information on ranges was useful as sympatric sister OTUs were viewed as distinct, while allopatric OTUs were merged. This approach revealed 124 OTUs of Hypertrophinae, a more than twofold increase from the currently recognized 51 species. Because this analytical protocol is both fast and repeatable, it provides a valuable tool for establishing a basic understanding of species boundaries that can be validated with subsequent studies. PMID:24479435

  19. Context influences on TALE–DNA binding revealed by quantitative profiling

    PubMed Central

    Rogers, Julia M.; Barrera, Luis A.; Reyon, Deepak; Sander, Jeffry D.; Kellis, Manolis; Joung, J Keith; Bulyk, Martha L.

    2015-01-01

    Transcription activator-like effector (TALE) proteins recognize DNA using a seemingly simple DNA-binding code, which makes them attractive for use in genome engineering technologies that require precise targeting. Although this code is used successfully to design TALEs to target specific sequences, off-target binding has been observed and is difficult to predict. Here we explore TALE–DNA interactions comprehensively by quantitatively assaying the DNA-binding specificities of 21 representative TALEs to ∼5,000–20,000 unique DNA sequences per protein using custom-designed protein-binding microarrays (PBMs). We find that protein context features exert significant influences on binding. Thus, the canonical recognition code does not fully capture the complexity of TALE–DNA binding. We used the PBM data to develop a computational model, Specificity Inference For TAL-Effector Design (SIFTED), to predict the DNA-binding specificity of any TALE. We provide SIFTED as a publicly available web tool that predicts potential genomic off-target sites for improved TALE design. PMID:26067805

  20. Context influences on TALE-DNA binding revealed by quantitative profiling.

    PubMed

    Rogers, Julia M; Barrera, Luis A; Reyon, Deepak; Sander, Jeffry D; Kellis, Manolis; Joung, J Keith; Bulyk, Martha L

    2015-06-11

    Transcription activator-like effector (TALE) proteins recognize DNA using a seemingly simple DNA-binding code, which makes them attractive for use in genome engineering technologies that require precise targeting. Although this code is used successfully to design TALEs to target specific sequences, off-target binding has been observed and is difficult to predict. Here we explore TALE-DNA interactions comprehensively by quantitatively assaying the DNA-binding specificities of 21 representative TALEs to ∼5,000-20,000 unique DNA sequences per protein using custom-designed protein-binding microarrays (PBMs). We find that protein context features exert significant influences on binding. Thus, the canonical recognition code does not fully capture the complexity of TALE-DNA binding. We used the PBM data to develop a computational model, Specificity Inference For TAL-Effector Design (SIFTED), to predict the DNA-binding specificity of any TALE. We provide SIFTED as a publicly available web tool that predicts potential genomic off-target sites for improved TALE design.

  1. Binding to the DNA Minor Groove by Heterocyclic Dications: From AT Specific Monomers to GC Recognition with Dimers

    PubMed Central

    Nanjunda, Rupesh; Wilson, W. David

    2012-01-01

    Compounds that bind in the DNA minor groove have provided critical information on DNA molecular recognition, they have found extensive uses in biotechnology and they are providing clinically useful drugs against diseases as diverse as cancer and sleeping sickness. This review focuses on the development of clinically useful heterocyclic diamidine minor groove binders. These compounds have shown us that the classical model for minor groove binding in AT DNA sequences must be expanded in several ways: compounds with nonstandard shapes can bind strongly to the groove, water can be directly incorporated into the minor groove complex in an interfacial interaction, and the compounds can form cooperative stacked dimers to recognize GC and mixed AT/GC base pair sequences. PMID:23255206

  2. Naumovozyma Kurtzman (2008)

    USDA-ARS?s Scientific Manuscript database

    This chapter describes the ascomycetous yeast genus Naumovozyma, which was recognized from multigene deoxyribonucleic acid (DNA) sequence analysis. The genus has two describes species, which were formerly classified in the genus Saccharomyces. The species reproduce by multilateral budding but do not...

  3. Contamination of sequence databases with adaptor sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yoshikawa, Takeo; Sanders, A.R.; Detera-Wadleigh, S.D.

    Because of the exponential increase in the amount of DNA sequences being added to the public databases on a daily basis, it has become imperative to identify sources of contamination rapidly. Previously, contaminations of sequence databases have been reported to alert the scientific community to the problem. These contaminations can be divided into two categories. The first category comprises host sequences that have been difficult for submitters to manage or control. Examples include anomalous sequences derived from Escherichia coli, which are inserted into the chromosomes (and plasmids) of the bacterial hosts. Insertion sequences are highly mobile and are capable ofmore » transposing themselves into plasmids during cloning manipulation. Another example of the first category is the infection with yeast genomic DNA or with bacterial DNA of some commercially available cDNA libraries from Clontech. The second category of database contamination is due to the inadvertent inclusion of nonhost sequences. This category includes incorporation of cloning-vector sequences and multicloning sites in the database submission. M13-derived artifacts have been common, since M13-based vectors have been widely used for subcloning DNA fragments. Recognizing this problem, the National Center for Biotechnology Information (NCBI) started to screen, in April 1994, all sequences directly submitted to GenBank, against a set of vector data retrieved from GenBank by use of key-word searches, such as {open_quotes}vector.{close_quotes} In this report, we present evidence for another sequence artifact that is widespread but that, to our knowledge, has not yet been reported. 11 refs., 1 tab.« less

  4. MICA: desktop software for comprehensive searching of DNA databases

    PubMed Central

    Stokes, William A; Glick, Benjamin S

    2006-01-01

    Background Molecular biologists work with DNA databases that often include entire genomes. A common requirement is to search a DNA database to find exact matches for a nondegenerate or partially degenerate query. The software programs available for such purposes are normally designed to run on remote servers, but an appealing alternative is to work with DNA databases stored on local computers. We describe a desktop software program termed MICA (K-Mer Indexing with Compact Arrays) that allows large DNA databases to be searched efficiently using very little memory. Results MICA rapidly indexes a DNA database. On a Macintosh G5 computer, the complete human genome could be indexed in about 5 minutes. The indexing algorithm recognizes all 15 characters of the DNA alphabet and fully captures the information in any DNA sequence, yet for a typical sequence of length L, the index occupies only about 2L bytes. The index can be searched to return a complete list of exact matches for a nondegenerate or partially degenerate query of any length. A typical search of a long DNA sequence involves reading only a small fraction of the index into memory. As a result, searches are fast even when the available RAM is limited. Conclusion MICA is suitable as a search engine for desktop DNA analysis software. PMID:17018144

  5. Global DNA methylation analysis using methyl-sensitive amplification polymorphism (MSAP).

    PubMed

    Yaish, Mahmoud W; Peng, Mingsheng; Rothstein, Steven J

    2014-01-01

    DNA methylation is a crucial epigenetic process which helps control gene transcription activity in eukaryotes. Information regarding the methylation status of a regulatory sequence of a particular gene provides important knowledge of this transcriptional control. DNA methylation can be detected using several methods, including sodium bisulfite sequencing and restriction digestion using methylation-sensitive endonucleases. Methyl-Sensitive Amplification Polymorphism (MSAP) is a technique used to study the global DNA methylation status of an organism and hence to distinguish between two individuals based on the DNA methylation status determined by the differential digestion pattern. Therefore, this technique is a useful method for DNA methylation mapping and positional cloning of differentially methylated genes. In this technique, genomic DNA is first digested with a methylation-sensitive restriction enzyme such as HpaII, and then the DNA fragments are ligated to adaptors in order to facilitate their amplification. Digestion using a methylation-insensitive isoschizomer of HpaII, MspI is used in a parallel digestion reaction as a loading control in the experiment. Subsequently, these fragments are selectively amplified by fluorescently labeled primers. PCR products from different individuals are compared, and once an interesting polymorphic locus is recognized, the desired DNA fragment can be isolated from a denaturing polyacrylamide gel, sequenced and identified based on DNA sequence similarity to other sequences available in the database. We will use analysis of met1, ddm1, and atmbd9 mutants and wild-type plants treated with a cytidine analogue, 5-azaC, or zebularine to demonstrate how to assess the genetic modulation of DNA methylation in Arabidopsis. It should be noted that despite the fact that MSAP is a reliable technique used to fish for polymorphic methylated loci, its power is limited to the restriction recognition sites of the enzymes used in the genomic DNA digestion.

  6. A systematic molecular dynamics study of nearest-neighbor effects on base pair and base pair step conformations and fluctuations in B-DNA

    PubMed Central

    Lavery, Richard; Zakrzewska, Krystyna; Beveridge, David; Bishop, Thomas C.; Case, David A.; Cheatham, Thomas; Dixit, Surjit; Jayaram, B.; Lankas, Filip; Laughton, Charles; Maddocks, John H.; Michon, Alexis; Osman, Roman; Orozco, Modesto; Perez, Alberto; Singh, Tanya; Spackova, Nada; Sponer, Jiri

    2010-01-01

    It is well recognized that base sequence exerts a significant influence on the properties of DNA and plays a significant role in protein–DNA interactions vital for cellular processes. Understanding and predicting base sequence effects requires an extensive structural and dynamic dataset which is currently unavailable from experiment. A consortium of laboratories was consequently formed to obtain this information using molecular simulations. This article describes results providing information not only on all 10 unique base pair steps, but also on all possible nearest-neighbor effects on these steps. These results are derived from simulations of 50–100 ns on 39 different DNA oligomers in explicit solvent and using a physiological salt concentration. We demonstrate that the simulations are converged in terms of helical and backbone parameters. The results show that nearest-neighbor effects on base pair steps are very significant, implying that dinucleotide models are insufficient for predicting sequence-dependent behavior. Flanking base sequences can notably lead to base pair step parameters in dynamic equilibrium between two conformational sub-states. Although this study only provides limited data on next-nearest-neighbor effects, we suggest that such effects should be analyzed before attempting to predict the sequence-dependent behavior of DNA. PMID:19850719

  7. In vitro fluorescence studies of transcription factor IIB-DNA interaction.

    PubMed

    Górecki, Andrzej; Figiel, Małgorzata; Dziedzicka-Wasylewska, Marta

    2015-01-01

    General transcription factor TFIIB is one of the basal constituents of the preinitiation complex of eukaryotic RNA polymerase II, acting as a bridge between the preinitiation complex and the polymerase, and binding promoter DNA in an asymmetric manner, thereby defining the direction of the transcription. Methods of fluorescence spectroscopy together with circular dichroism spectroscopy were used to observe conformational changes in the structure of recombinant human TFIIB after binding to specific DNA sequence. To facilitate the exploration of the structural changes, several site-directed mutations have been introduced altering the fluorescence properties of the protein. Our observations showed that binding of specific DNA sequences changed the protein structure and dynamics, and TFIIB may exist in two conformational states, which can be described by a different microenvironment of W52. Fluorescence studies using both intrinsic and exogenous fluorophores showed that these changes significantly depended on the recognition sequence and concerned various regions of the protein, including those interacting with other transcription factors and RNA polymerase II. DNA binding can cause rearrangements in regions of proteins interacting with the polymerase in a manner dependent on the recognized sequences, and therefore, influence the gene expression.

  8. Bipartite recognition of target RNAs activates DNA cleavage by the Type III-B CRISPR–Cas system

    PubMed Central

    Elmore, Joshua R.; Sheppard, Nolan F.; Ramia, Nancy; Deighan, Trace; Li, Hong; Terns, Rebecca M.; Terns, Michael P.

    2016-01-01

    CRISPR–Cas systems eliminate nucleic acid invaders in bacteria and archaea. The effector complex of the Type III-B Cmr system cleaves invader RNAs recognized by the CRISPR RNA (crRNA ) of the complex. Here we show that invader RNAs also activate the Cmr complex to cleave DNA. As has been observed for other Type III systems, Cmr eliminates plasmid invaders in Pyrococcus furiosus by a mechanism that depends on transcription of the crRNA target sequence within the plasmid. Notably, we found that the target RNA per se induces DNA cleavage by the Cmr complex in vitro. DNA cleavage activity does not depend on cleavage of the target RNA but notably does require the presence of a short sequence adjacent to the target sequence within the activating target RNA (rPAM [RNA protospacer-adjacent motif]). The activated complex does not require a target sequence (or a PAM) in the DNA substrate. Plasmid elimination by the P. furiosus Cmr system also does not require the Csx1 (CRISPR-associated Rossman fold [CARF] superfamily) protein. Plasmid silencing depends on the HD nuclease and Palm domains of the Cmr2 (Cas10 superfamily) protein. The results establish the Cmr complex as a novel DNA nuclease activated by invader RNAs containing a crRNA target sequence and a rPAM. PMID:26848045

  9. A septal chromosome segregator protein evolved into a conjugative DNA-translocator protein

    PubMed Central

    Sepulveda, Edgardo; Vogelmann, Jutta

    2011-01-01

    Streptomycetes, Gram-positive soil bacteria well known for the production of antibiotics feature a unique conjugative DNA transfer system. In contrast to classical conjugation which is characterized by the secretion of a pilot protein covalently linked to a single-stranded DNA molecule, in Streptomyces a double-stranded DNA molecule is translocated during conjugative transfer. This transfer involves a single plasmid encoded protein, TraB. A detailed biochemical and biophysical characterization of TraB, revealed a close relationship to FtsK, mediating chromosome segregation during bacterial cell division. TraB translocates plasmid DNA by recognizing 8-bp direct repeats located in a specific plasmid region clt. Similar sequences accidentally also occur on chromosomes and have been shown to be bound by TraB. We suggest that TraB mobilizes chromosomal genes by the interaction with these chromosomal clt-like sequences not relying on the integration of the conjugative plasmid into the chromosome. PMID:22479692

  10. Isolation of a cDNA Encoding a Granule-Bound 152-Kilodalton Starch-Branching Enzyme in Wheat1

    PubMed Central

    Båga, Monica; Nair, Ramesh B.; Repellin, Anne; Scoles, Graham J.; Chibbar, Ravindra N.

    2000-01-01

    Screening of a wheat (Triticum aestivum) cDNA library for starch-branching enzyme I (SBEI) genes combined with 5′-rapid amplification of cDNA ends resulted in isolation of a 4,563-bp composite cDNA, Sbe1c. Based on sequence alignment to characterized SBEI cDNA clones isolated from plants, the SBEIc predicted from the cDNA sequence was produced with a transit peptide directing the polypeptide into plastids. Furthermore, the predicted mature form of SBEIc was much larger (152 kD) than previously characterized plant SBEI (80–100 kD) and contained a partial duplication of SBEI sequences. The first SBEI domain showed high amino acid similarity to a 74-kD wheat SBEI-like protein that is inactive as a branching enzyme when expressed in Escherichia coli. The second SBEI domain on SBEIc was identical in sequence to a functional 87-kD SBEI produced in the wheat endosperm. Immunoblot analysis of proteins produced in developing wheat kernels demonstrated that the 152-kD SBEIc was, in contrast to the 87- to 88-kD SBEI, preferentially associated with the starch granules. Proteins similar in size and recognized by wheat SBEI antibodies were also present in Triticum monococcum, Triticum tauschii, and Triticum turgidum subsp. durum. PMID:10982440

  11. The Role of DNA Barcodes in Understanding and Conservation of Mammal Diversity in Southeast Asia

    PubMed Central

    Francis, Charles M.; Borisenko, Alex V.; Ivanova, Natalia V.; Eger, Judith L.; Lim, Burton K.; Guillén-Servent, Antonio; Kruskop, Sergei V.; Mackie, Iain; Hebert, Paul D. N.

    2010-01-01

    Background Southeast Asia is recognized as a region of very high biodiversity, much of which is currently at risk due to habitat loss and other threats. However, many aspects of this diversity, even for relatively well-known groups such as mammals, are poorly known, limiting ability to develop conservation plans. This study examines the value of DNA barcodes, sequences of the mitochondrial COI gene, to enhance understanding of mammalian diversity in the region and hence to aid conservation planning. Methodology and Principal Findings DNA barcodes were obtained from nearly 1900 specimens representing 165 recognized species of bats. All morphologically or acoustically distinct species, based on classical taxonomy, could be discriminated with DNA barcodes except four closely allied species pairs. Many currently recognized species contained multiple barcode lineages, often with deep divergence suggesting unrecognized species. In addition, most widespread species showed substantial genetic differentiation across their distributions. Our results suggest that mammal species richness within the region may be underestimated by at least 50%, and there are higher levels of endemism and greater intra-specific population structure than previously recognized. Conclusions DNA barcodes can aid conservation and research by assisting field workers in identifying species, by helping taxonomists determine species groups needing more detailed analysis, and by facilitating the recognition of the appropriate units and scales for conservation planning. PMID:20838635

  12. Insights into the Emergent Bacterial Pathogen Cronobacter spp., Generated by Multilocus Sequence Typing and Analysis

    PubMed Central

    Joseph, Susan; Forsythe, Stephen J.

    2012-01-01

    Cronobacter spp. (previously known as Enterobacter sakazakii) is a bacterial pathogen affecting all age groups, with particularly severe clinical complications in neonates and infants. One recognized route of infection being the consumption of contaminated infant formula. As a recently recognized bacterial pathogen of considerable importance and regulatory control, appropriate detection, and identification schemes are required. The application of multilocus sequence typing (MLST) and analysis (MLSA) of the seven alleles atpD, fusA, glnS, gltB, gyrB, infB, and ppsA (concatenated length 3036 base pairs) has led to considerable advances in our understanding of the genus. This approach is supported by both the reliability of DNA sequencing over subjective phenotyping and the establishment of a MLST database which has open access and is also curated; http://www.pubMLST.org/cronobacter. MLST has been used to describe the diversity of the newly recognized genus, instrumental in the formal recognition of new Cronobacter species (C. universalis and C. condimenti) and revealed the high clonality of strains and the association of clonal complex 4 with neonatal meningitis cases. Clearly the MLST approach has considerable benefits over the use of non-DNA sequence based methods of analysis for newly emergent bacterial pathogens. The application of MLST and MLSA has dramatically enabled us to better understand this opportunistic bacterium which can cause irreparable damage to a newborn baby’s brain, and has contributed to improved control measures to protect neonatal health. PMID:23189075

  13. SNPs in putative regulatory regions identified by human mouse comparative sequencing and transcription factor binding site data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Banerjee, Poulabi; Bahlo, Melanie; Schwartz, Jody R.

    2002-01-01

    Genome wide disease association analysis using SNPs is being explored as a method for dissecting complex genetic traits and a vast number of SNPs have been generated for this purpose. As there are cost and throughput limitations of genotyping large numbers of SNPs and statistical issues regarding the large number of dependent tests on the same data set, to make association analysis practical it has been proposed that SNPs should be prioritized based on likely functional importance. The most easily identifiable functional SNPs are coding SNPs (cSNPs) and accordingly cSNPs have been screened in a number of studies. SNPs inmore » gene regulatory sequences embedded in noncoding DNA are another class of SNPs suggested for prioritization due to their predicted quantitative impact on gene expression. The main challenge in evaluating these SNPs, in contrast to cSNPs is a lack of robust algorithms and databases for recognizing regulatory sequences in noncoding DNA. Approaches that have been previously used to delineate noncoding sequences with gene regulatory activity include cross-species sequence comparisons and the search for sequences recognized by transcription factors. We combined these two methods to sift through mouse human genomic sequences to identify putative gene regulatory elements and subsequently localized SNPs within these sequences in a 1 Megabase (Mb) region of human chromosome 5q31, orthologous to mouse chromosome 11 containing the Interleukin cluster.« less

  14. Are Africans, Europeans, and Asians different "races"? A guided-inquiry lab for introducing undergraduate students to genetic diversity and preparing them to study natural selection.

    PubMed

    Kalinowski, Steven T; Andrews, Tessa M; Leonard, Mary J; Snodgrass, Meagan

    2012-01-01

    Many students do not recognize that individual organisms within populations vary, and this may make it difficult for them to recognize the essential role variation plays in natural selection. Also, many students have weak scientific reasoning skills, and this makes it difficult for them to recognize misconceptions they might have. This paper describes a 2-h laboratory for college students that introduces them to genetic diversity and gives them practice using hypothetico-deductive reasoning. In brief, the lab presents students with DNA sequences from Africans, Europeans, and Asians, and asks students to determine whether people from each continent qualify as distinct "races." Comparison of the DNA sequences shows that people on each continent are not more similar to one another than to people on other continents, and therefore do not qualify as distinct races. Ninety-four percent of our students reported that the laboratory was interesting, and 79% reported that it was a valuable learning experience. We developed and used a survey to measure the extent to which students recognized variation and its significance within populations and showed that the lab increased student awareness of variation. We also showed that the lab improved the ability of students to construct hypothetico-deductive arguments.

  15. Distribution of cytotoxic and DNA ADP-ribosylating activity in crude extracts from butterflies among the family Pieridae

    PubMed Central

    Matsumoto, Yasuko; Nakano, Tsuyoshi; Yamamoto, Masafumi; Matsushima-Hibiya, Yuko; Odagiri, Ken-Ichi; Yata, Osamu; Koyama, Kotaro; Sugimura, Takashi; Wakabayashi, Keiji

    2008-01-01

    Cabbage butterflies, Pieris rapae and Pieris brassicae, contain strong cytotoxic proteins, designated as pierisin-1 and -2, against cancer cell lines. These proteins exhibit DNA ADP-ribosylating activity. To determine the distribution of substances with cytotoxicity and DNA ADP-ribosylating activity among other species, crude extracts from 20 species of the family Pieridae were examined for cytotoxicity in HeLa cells and DNA ADP-ribosylating activity. Both activities were detected in extracts from 13 species: subtribes Pierina (Pieris rapae, Pieris canidia, Pieris napi, Pieris melete, Pieris brassicae, Pontia daplidice, and Talbotia naganum), Aporiina (Aporia gigantea, Aporia crataegi, Aporia hippia, and Delias pasithoe), and Appiadina (Appias nero and Appias paulina). All of these extracts contained substances recognized by anti-pierisin-1 antibodies, with a molecular mass of ≈100 kDa established earlier for pierisin-1. Moreover, sequences containing NAD-binding sites, conserved in ADP-ribosyltransferases, were amplified from genomic DNA from 13 species of butterflies with cytotoxicity and DNA ADP-ribosylating activity by PCR. Extracts from seven species, Appias lyncida, Leptosia nina, Anthocharis scolymus, Eurema hecabe, Catopsilia pomona, Catopsilia scylla, and Colias erate, showed neither cytotoxicity nor DNA ADP-ribosylating activity, and did not contain substances recognized by anti-pierisin-1 antibodies. Sequences containing NAD-binding sites were not amplified from genomic DNA from these seven species. Thus, pierisin-like proteins, showing cytotoxicity and DNA ADP-ribosylating activity, are suggested to be present in the extracts from butterflies not only among the subtribe Pierina, but also among the subtribes Aporiina and Appiadina. These findings offer insight to understanding the nature of DNA ADP-ribosylating activity in the butterfly. PMID:18256183

  16. Conformation of Tax-response elements in the human T-cell leukemia virus type I promoter.

    PubMed

    Cox, J M; Sloan, L S; Schepartz, A

    1995-12-01

    HTLV-I Tax is believed to activate viral gene expression by binding bZIP proteins (such as CREB) and increasing their affinities for proviral TRE target sites. Each 21 bp TRE target site contains an imperfect copy of the intrinsically bent CRE target site (the TRE core) surrounded by highly conserved flanking sequences. These flanking sequences are essential for maximal increases in DNA affinity and transactivation, but they are not, apparently, contacted by protein. Here we employ non-denaturing gel electrophoresis to evaluate TRE conformation in the presence and absence of bZIP proteins, and to explore the role of DNA conformation in viral transactivation. Our results show that the TRE-1 flanking sequences modulate the structure and modestly increase the affinity of a CREB bZIP peptide for the TRE-1 core recognition sequence. These flanking sequences are also essential for a maximal increase in stability of the CREB-DNA complex in the presence of Tax. The CRE-like TRE core and the TRE flanking sequences are both essential for formation of stable CREB-TRE-1 and Tax-CREB-TRE-1 complexes. These two DNA segments may have co-evolved into a unique structure capable of recognizing Tax and a bZIP protein.

  17. Methylobacterium phyllosphaerae sp. nov., a pink-pigmented, facultative methylotroph from the phyllosphere of rice.

    PubMed

    Madhaiyan, Munusamy; Poonguzhali, Selvaraj; Kwon, Soon-Wo; Sa, Tong-Min

    2009-01-01

    A pink-pigmented, aerobic, facultatively methylotrophic bacterial strain, CBMB27T, isolated from leaf tissues of rice (Oryza sativa L. 'Dong-Jin'), was analysed using a polyphasic taxonomic approach. Comparative 16S rRNA gene sequence-based phylogenetic analysis placed the strain in a clade with the species Methylobacterium oryzae, Methylobacterium fujisawaense and Methylobacterium mesophilicum; strain CBMB27T showed sequence similarities of 98.3, 98.5 and 97.3 %, respectively, to the type strains of these three species. DNA-DNA hybridization experiments revealed low levels (<38 %) of DNA-DNA relatedness between strain CBMB27T and its closest relatives. The sequence of the 1-aminocyclopropane-1-carboxylate deaminase gene (acdS) in strain CBMB27T differed from those of close relatives. The major fatty acid of the isolate was C(18 : 1)omega7c and the G+C content of the genomic DNA was 66.8 mol%. Based on the results of 16S rRNA gene sequence analysis, DNA-DNA hybridization, and physiological and biochemical characterization, which enabled the isolate to be differentiated from all recognized species of the genus Methylobacterium, it was concluded that strain CBMB27T represents a novel species in the genus Methylobacterium for which the name Methylobacterium phyllosphaerae sp. nov. is proposed (type strain CBMB27T =LMG 24361T =KACC 11716T =DSM 19779T).

  18. Highly Iterated Palindromic Sequences (HIPs) and Their Relationship to DNA Methyltransferases

    PubMed Central

    Elhai, Jeff

    2015-01-01

    The sequence GCGATCGC (Highly Iterated Palindrome, HIP1) is commonly found in high frequency in cyanobacterial genomes. An important clue to its function may be the presence of two orphan DNA methyltransferases that recognize internal sequences GATC and CGATCG. An examination of genomes from 97 cyanobacteria, both free-living and obligate symbionts, showed that there are exceptional cases in which HIP1 is at a low frequency or nearly absent. In some of these cases, it appears to have been replaced by a different GC-rich palindromic sequence, alternate HIPs. When HIP1 is at a high frequency, GATC- and CGATCG-specific methyltransferases are generally present in the genome. When an alternate HIP is at high frequency, a methyltransferase specific for that sequence is present. The pattern of 1-nt deviations from HIP1 sequences is biased towards the first and last nucleotides, i.e., those distinguish CGATCG from HIP1. Taken together, the results point to a role of DNA methylation in the creation or functioning of HIP sites. A model is presented that postulates the existence of a GmeC-dependent mismatch repair system whose activity creates and maintains HIP sequences. PMID:25789551

  19. Highly Iterated Palindromic Sequences (HIPs) and Their Relationship to DNA Methyltransferases.

    PubMed

    Elhai, Jeff

    2015-03-17

    The sequence GCGATCGC (Highly Iterated Palindrome, HIP1) is commonly found in high frequency in cyanobacterial genomes. An important clue to its function may be the presence of two orphan DNA methyltransferases that recognize internal sequences GATC and CGATCG. An examination of genomes from 97 cyanobacteria, both free-living and obligate symbionts, showed that there are exceptional cases in which HIP1 is at a low frequency or nearly absent. In some of these cases, it appears to have been replaced by a different GC-rich palindromic sequence, alternate HIPs. When HIP1 is at a high frequency, GATC- and CGATCG-specific methyltransferases are generally present in the genome. When an alternate HIP is at high frequency, a methyltransferase specific for that sequence is present. The pattern of 1-nt deviations from HIP1 sequences is biased towards the first and last nucleotides, i.e., those distinguish CGATCG from HIP1. Taken together, the results point to a role of DNA methylation in the creation or functioning of HIP sites. A model is presented that postulates the existence of a GmeC-dependent mismatch repair system whose activity creates and maintains HIP sequences.

  20. Molecular characterization of a distinct begomovirus species from Vernonia cinerea and its associated DNA-beta using the bacteriophage Phi 29 DNA polymerase.

    PubMed

    Packialakshmi, R M; Srivastava, N; Girish, K R; Usha, R

    2010-08-01

    Vernonia cinerea plants with yellow vein symptoms were collected around crop fields in Madurai. A portion (550 bp) of the AV1 gene amplified using degenerate primers from the total DNA purified from diseased leaf sample was cloned and sequenced. Specific primers derived from the above sequence were used to amplify 2,745 nucleotides with the typical genome organization of begomoviral DNA A (EMBL Accession No. AM182232). Sequence comparison with other begomoviruses revealed the greatest identity (82.4%) with Emilia yellow vein virus (EmYVV-[Fz1]) from China and less than 80% with all other known begomoviruses. The International Committee on Taxonomy of Viruses (ICTV) has therefore recognized Vernonia yellow vein virus (VeYVV) as a distinct begomovirus species. Conventional PCR could not amplify the DNA B or DNA beta from the diseased tissue. However, the beta DNA (1364 bp) associated with the disease was obtained (Accession No. FN435836) by the rolling circle amplification-restriction fragment length polymorphism method (RCA-RFLP) using Phi 29 DNA polymerase. Sequence analysis shows that DNA beta of VeYVV has the highest identity (56.8%) with DNA beta of Sigesbeckia yellow vein Guangxi betasatellite (SibYVGxB-[CN: Gx111:05]) and 56-53% with DNA beta associated with other begomoviruses. This is the first report of the molecular characterization of VeYVV from V. cinerea in India. The complete molecular characterization, phylogenetic analysis, and putative recombination events in VeYVV are reported.

  1. A duplex DNA-gold nanoparticle probe composed as a colorimetric biosensor for sequence-specific DNA-binding proteins.

    PubMed

    Ahn, Junho; Choi, Yeonweon; Lee, Ae-Ree; Lee, Joon-Hwa; Jung, Jong Hwa

    2016-03-21

    Using duplex DNA-AuNP aggregates, a sequence-specific DNA-binding protein, SQUAMOSA Promoter-binding-Like protein 12 (SPL-12), was directly determined by SPL-12-duplex DNA interaction-based colorimetric actions of DNA-Au assemblies. In order to prepare duplex DNA-Au aggregates, thiol-modified DNA 1 and DNA 2 were attached onto the surface of AuNPs, respectively, by the salt-aging method and then the DNA-attached AuNPs were mixed. Duplex-DNA-Au aggregates having the average size of 160 nm diameter and the maximum absorption at 529 nm were able to recognize SPL-12 and reached the equivalent state by the addition of ∼30 equivalents of SPL-12 accompanying a color change from red to blue with a red shift of the maximum absorption at 570 nm. As a result, the aggregation size grew to about 247 nm. Also, at higher temperatures of the mixture of duplex-DNA-Au aggregate solution and SPL-12, the equivalent state was reached rapidly. On the contrary, in the control experiment using Bovine Serum Albumin (BSA), no absorption band shift of duplex-DNA-Au aggregates was observed.

  2. Structural insight into the specificity of the B3 DNA-binding domains provided by the co-crystal structure of the C-terminal fragment of BfiI restriction enzyme

    PubMed Central

    Golovenko, Dmitrij; Manakova, Elena; Zakrys, Linas; Zaremba, Mindaugas; Sasnauskas, Giedrius; Gražulis, Saulius; Siksnys, Virginijus

    2014-01-01

    The B3 DNA-binding domains (DBDs) of plant transcription factors (TF) and DBDs of EcoRII and BfiI restriction endonucleases (EcoRII-N and BfiI-C) share a common structural fold, classified as the DNA-binding pseudobarrel. The B3 DBDs in the plant TFs recognize a diverse set of target sequences. The only available co-crystal structure of the B3-like DBD is that of EcoRII-N (recognition sequence 5′-CCTGG-3′). In order to understand the structural and molecular mechanisms of specificity of B3 DBDs, we have solved the crystal structure of BfiI-C (recognition sequence 5′-ACTGGG-3′) complexed with 12-bp cognate oligoduplex. Structural comparison of BfiI-C–DNA and EcoRII-N–DNA complexes reveals a conserved DNA-binding mode and a conserved pattern of interactions with the phosphodiester backbone. The determinants of the target specificity are located in the loops that emanate from the conserved structural core. The BfiI-C–DNA structure presented here expands a range of templates for modeling of the DNA-bound complexes of the B3 family of plant TFs. PMID:24423868

  3. Identification of the DNA-Binding Domains of Human Replication Protein A That Recognize G-Quadruplex DNA

    PubMed Central

    Prakash, Aishwarya; Natarajan, Amarnath; Marky, Luis A.; Ouellette, Michel M.; Borgstahl, Gloria E. O.

    2011-01-01

    Replication protein A (RPA), a key player in DNA metabolism, has 6 single-stranded DNA-(ssDNA-) binding domains (DBDs) A-F. SELEX experiments with the DBDs-C, -D, and -E retrieve a 20-nt G-quadruplex forming sequence. Binding studies show that RPA-DE binds preferentially to the G-quadruplex DNA, a unique preference not observed with other RPA constructs. Circular dichroism experiments show that RPA-CDE-core can unfold the G-quadruplex while RPA-DE stabilizes it. Binding studies show that RPA-C binds pyrimidine- and purine-rich sequences similarly. This difference between RPA-C and RPA-DE binding was also indicated by the inability of RPA-CDE-core to unfold an oligonucleotide containing a TC-region 5′ to the G-quadruplex. Molecular modeling studies of RPA-DE and telomere-binding proteins Pot1 and Stn1 reveal structural similarities between the proteins and illuminate potential DNA-binding sites for RPA-DE and Stn1. These data indicate that DBDs of RPA have different ssDNA recognition properties. PMID:21772997

  4. Engineering and Application of Zinc Finger Proteins and TALEs for Biomedical Research.

    PubMed

    Kim, Moon-Soo; Kini, Anu Ganesh

    2017-08-01

    Engineered DNA-binding domains provide a powerful technology for numerous biomedical studies due to their ability to recognize specific DNA sequences. Zinc fingers (ZF) are one of the most common DNA-binding domains and have been extensively studied for a variety of applications, such as gene regulation, genome engineering and diagnostics. Another novel DNA-binding domain known as a transcriptional activator-like effector (TALE) has been more recently discovered, which has a previously undescribed DNA-binding mode. Due to their modular architecture and flexibility, TALEs have been rapidly developed into artificial gene targeting reagents. Here, we describe the methods used to design these DNA-binding proteins and their key applications in biomedical research.

  5. Investigation of the mechanism of meiotic DNA cleavage by VMA1-derived endonuclease uncovers a meiotic alteration in chromatin structure around the target site.

    PubMed

    Fukuda, Tomoyuki; Ohta, Kunihiro; Ohya, Yoshikazu

    2006-06-01

    VMA1-derived endonuclease (VDE), a homing endonuclease in Saccharomyces cerevisiae, is encoded by the mobile intein-coding sequence within the nuclear VMA1 gene. VDE recognizes and cleaves DNA at the 31-bp VDE recognition sequence (VRS) in the VMA1 gene lacking the intein-coding sequence during meiosis to insert a copy of the intein-coding sequence at the cleaved site. The mechanism underlying the meiosis specificity of VMA1 intein-coding sequence homing remains unclear. We studied various factors that might influence the cleavage activity in vivo and found that VDE binding to the VRS can be detected only when DNA cleavage by VDE takes place, implying that meiosis-specific DNA cleavage is regulated by the accessibility of VDE to its target site. As a possible candidate for the determinant of this accessibility, we analyzed chromatin structure around the VRS and revealed that local chromatin structure near the VRS is altered during meiosis. Although the meiotic chromatin alteration exhibits correlations with DNA binding and cleavage by VDE at the VMA1 locus, such a chromatin alteration is not necessarily observed when the VRS is embedded in ectopic gene loci. This suggests that nucleosome positioning or occupancy around the VRS by itself is not the sole mechanism for the regulation of meiosis-specific DNA cleavage by VDE and that other mechanisms are involved in the regulation.

  6. Investigation of the Mechanism of Meiotic DNA Cleavage by VMA1-Derived Endonuclease Uncovers a Meiotic Alteration in Chromatin Structure around the Target Site

    PubMed Central

    Fukuda, Tomoyuki; Ohta, Kunihiro; Ohya, Yoshikazu

    2006-01-01

    VMA1-derived endonuclease (VDE), a homing endonuclease in Saccharomyces cerevisiae, is encoded by the mobile intein-coding sequence within the nuclear VMA1 gene. VDE recognizes and cleaves DNA at the 31-bp VDE recognition sequence (VRS) in the VMA1 gene lacking the intein-coding sequence during meiosis to insert a copy of the intein-coding sequence at the cleaved site. The mechanism underlying the meiosis specificity of VMA1 intein-coding sequence homing remains unclear. We studied various factors that might influence the cleavage activity in vivo and found that VDE binding to the VRS can be detected only when DNA cleavage by VDE takes place, implying that meiosis-specific DNA cleavage is regulated by the accessibility of VDE to its target site. As a possible candidate for the determinant of this accessibility, we analyzed chromatin structure around the VRS and revealed that local chromatin structure near the VRS is altered during meiosis. Although the meiotic chromatin alteration exhibits correlations with DNA binding and cleavage by VDE at the VMA1 locus, such a chromatin alteration is not necessarily observed when the VRS is embedded in ectopic gene loci. This suggests that nucleosome positioning or occupancy around the VRS by itself is not the sole mechanism for the regulation of meiosis-specific DNA cleavage by VDE and that other mechanisms are involved in the regulation. PMID:16757746

  7. Structural Basis for the Altered PAM Recognition by Engineered CRISPR-Cpf1.

    PubMed

    Nishimasu, Hiroshi; Yamano, Takashi; Gao, Linyi; Zhang, Feng; Ishitani, Ryuichiro; Nureki, Osamu

    2017-07-06

    The RNA-guided Cpf1 nuclease cleaves double-stranded DNA targets complementary to the CRISPR RNA (crRNA), and it has been harnessed for genome editing technologies. Recently, Acidaminococcus sp. BV3L6 (AsCpf1) was engineered to recognize altered DNA sequences as the protospacer adjacent motif (PAM), thereby expanding the target range of Cpf1-mediated genome editing. Whereas wild-type AsCpf1 recognizes the TTTV PAM, the RVR (S542R/K548V/N552R) and RR (S542R/K607R) variants can efficiently recognize the TATV and TYCV PAMs, respectively. However, their PAM recognition mechanisms remained unknown. Here we present the 2.0 Å resolution crystal structures of the RVR and RR variants bound to a crRNA and its target DNA. The structures revealed that the RVR and RR variants primarily recognize the PAM-complementary nucleotides via the substituted residues. Our high-resolution structures delineated the altered PAM recognition mechanisms of the AsCpf1 variants, providing a basis for the further engineering of CRISPR-Cpf1. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. [Phylogenetic relationships among members of the subfamily sedoideae (Crassulaceae) inferred from the ITS region sequences of nuclear rDNA].

    PubMed

    Goncharova, S B; Artiukova, E V; Goncharov, A A

    2006-06-01

    Nucleotide sequences of the nuclear rDNA ITS regions were determined in 20 species of the subfamily Sedoideae (Crassulaceae). The phylogenetic relationships of these species with other members of the subfamily, occurring mainly in Southeast Asia, were analyzed. It was shown that the genus Orostachys was not monophyletic; its typical subsection was reliably included into the clade of the genus Hylotelephium. Synapomorphic substitutions and indels, specific for the subsection Orostachys, were detected in ITS1. Sister relationships were established between clades Aizopsis and Phedimus, based on which they can be recognized as isolated genera.

  9. Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries.

    PubMed

    Lam, Kathy N; Charles, Trevor C

    2015-01-01

    Clone libraries provide researchers with a powerful resource to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, and allowed the mining of novel enzymes. Libraries are often constructed by cloning large inserts into cosmid or fosmid vectors. Recently, there have been reports of GC bias in fosmid metagenomic libraries, and it was speculated to be a result of fragmentation and loss of AT-rich sequences during cloning. However, evidence in the literature suggests that transcriptional activity or gene product toxicity may play a role. To explore possible mechanisms responsible for sequence bias in clone libraries, we constructed a cosmid library from a human microbiome sample and sequenced DNA from different steps during library construction: crude extract DNA, size-selected DNA, and cosmid library DNA. We confirmed a GC bias in the final cosmid library, and we provide evidence that the bias is not due to fragmentation and loss of AT-rich sequences but is likely occurring after DNA is introduced into Escherichia coli. To investigate the influence of strong constitutive transcription, we searched the sequence data for promoters and found that rpoD/σ(70) promoter sequences were underrepresented in the cosmid library. Furthermore, when we examined the genomes of taxa that were differentially abundant in the cosmid library relative to the original sample, we found the bias to be more correlated with the number of rpoD/σ(70) consensus sequences in the genome than with simple GC content. The GC bias of metagenomic libraries does not appear to be due to DNA fragmentation. Rather, analysis of promoter sequences provides support for the hypothesis that strong constitutive transcription from sequences recognized as rpoD/σ(70) consensus-like in E. coli may lead to instability, causing loss of the plasmid or loss of the insert DNA that gives rise to the transcription. Despite widespread use of E. coli to propagate foreign DNA in metagenomic libraries, the effects of in vivo transcriptional activity on clone stability are not well understood. Further work is required to tease apart the effects of transcription from those of gene product toxicity.

  10. Directed evolution of the TALE N-terminal domain for recognition of all 5' bases.

    PubMed

    Lamb, Brian M; Mercer, Andrew C; Barbas, Carlos F

    2013-11-01

    Transcription activator-like effector (TALE) proteins can be designed to bind virtually any DNA sequence. General guidelines for design of TALE DNA-binding domains suggest that the 5'-most base of the DNA sequence bound by the TALE (the N0 base) should be a thymine. We quantified the N0 requirement by analysis of the activities of TALE transcription factors (TALE-TF), TALE recombinases (TALE-R) and TALE nucleases (TALENs) with each DNA base at this position. In the absence of a 5' T, we observed decreases in TALE activity up to >1000-fold in TALE-TF activity, up to 100-fold in TALE-R activity and up to 10-fold reduction in TALEN activity compared with target sequences containing a 5' T. To develop TALE architectures that recognize all possible N0 bases, we used structure-guided library design coupled with TALE-R activity selections to evolve novel TALE N-terminal domains to accommodate any N0 base. A G-selective domain and broadly reactive domains were isolated and characterized. The engineered TALE domains selected in the TALE-R format demonstrated modularity and were active in TALE-TF and TALEN architectures. Evolved N-terminal domains provide effective and unconstrained TALE-based targeting of any DNA sequence as TALE binding proteins and designer enzymes.

  11. Rapid electrochemical assessment of tumor suppressor gene methylations in raw human serum, and tumor cells and tissues using immuno-magnetic beads and selective DNA hybridization.

    PubMed

    Povedano, Eloy; Valverde, Alejandro; Ruiz-Valdepeñas Montiel, Víctor; Pedrero, María; Yáñez-Sedeño, Paloma; Barderas, Rodrigo; San Segundo-Acosta, Pablo; Peláez-García, Alberto; Mendiola, Marta; Hardisson, David; Campuzano, Susana; Pingarron, José Manuel

    2018-05-09

    We report a rapid and sensitive electrochemical strategy for the detection of gene-specific 5-methylcytosine DNA methylation. Magnetic beads (MBs) modified with an antibody specific for 5-methylcytosines (5-mC) are employed for the selective capture of any 5-mC methylated single-stranded (ss)DNA sequence. A flanking region next to the 5-mCs of the captured methylated ssDNA is recognized by selective hybridization with a synthetic biotinylated DNA sequence, further labeled with an HRP streptavidin conjugate. Amperometric transduction at disposable screen-printed carbon electrodes (SPCEs) is employed. The developed biosensor exhibits a dynamic range from 3.9 to 500 pM and a detection limit of 1.2 pM for the methylated synthetic sequence of the tumor suppressor gene O-6-methylguanine-DNA methyltransferase (MGMT) promoter region. The applicability of this strategy is demonstrated through the 45 min-analysis of specific methylation in the MGMT promoter region directly in raw spiked human serum samples and in genomic DNA extracted from U-87 glioblastoma cells and paraffin-embedded brain tumor tissues without any amplification and pretreatment step. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. End Joining-Mediated Gene Expression in Mammalian Cells Using PCR-Amplified DNA Constructs that Contain Terminator in Front of Promoter.

    PubMed

    Nakamura, Mikiko; Suzuki, Ayako; Akada, Junko; Tomiyoshi, Keisuke; Hoshida, Hisashi; Akada, Rinji

    2015-12-01

    Mammalian gene expression constructs are generally prepared in a plasmid vector, in which a promoter and terminator are located upstream and downstream of a protein-coding sequence, respectively. In this study, we found that front terminator constructs-DNA constructs containing a terminator upstream of a promoter rather than downstream of a coding region-could sufficiently express proteins as a result of end joining of the introduced DNA fragment. By taking advantage of front terminator constructs, FLAG substitutions, and deletions were generated using mutagenesis primers to identify amino acids specifically recognized by commercial FLAG antibodies. A minimal epitope sequence for polyclonal FLAG antibody recognition was also identified. In addition, we analyzed the sequence of a C-terminal Ser-Lys-Leu peroxisome localization signal, and identified the key residues necessary for peroxisome targeting. Moreover, front terminator constructs of hepatitis B surface antigen were used for deletion analysis, leading to the identification of regions required for the particle formation. Collectively, these results indicate that front terminator constructs allow for easy manipulations of C-terminal protein-coding sequences, and suggest that direct gene expression with PCR-amplified DNA is useful for high-throughput protein analysis in mammalian cells.

  13. Structural impact of complete CpG methylation within target DNA on specific complex formation of the inducible transcription factor Egr-1.

    PubMed

    Zandarashvili, Levani; White, Mark A; Esadze, Alexandre; Iwahara, Junji

    2015-07-08

    The inducible transcription factor Egr-1 binds specifically to 9-bp target sequences containing two CpG sites that can potentially be methylated at four cytosine bases. Although it appears that complete CpG methylation would make an unfavorable steric clash in the previous crystal structures of the complexes with unmethylated or partially methylated DNA, our affinity data suggest that DNA recognition by Egr-1 is insensitive to CpG methylation. We have determined, at a 1.4-Å resolution, the crystal structure of the Egr-1 zinc-finger complex with completely methylated target DNA. Structural comparison of the three different methylation states reveals why Egr-1 can recognize the target sequences regardless of CpG methylation. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  14. Molecular determinants of origin discrimination by Orc1 initiators in archaea.

    PubMed

    Dueber, Erin C; Costa, Alessandro; Corn, Jacob E; Bell, Stephen D; Berger, James M

    2011-05-01

    Unlike bacteria, many eukaryotes initiate DNA replication from genomic sites that lack apparent sequence conservation. These loci are identified and bound by the origin recognition complex (ORC), and subsequently activated by a cascade of events that includes recruitment of an additional factor, Cdc6. Archaeal organisms generally possess one or more Orc1/Cdc6 homologs, belonging to the Initiator clade of ATPases associated with various cellular activities (AAA(+)) superfamily; however, these proteins recognize specific sequences within replication origins. Atomic resolution studies have shown that archaeal Orc1 proteins contact double-stranded DNA through an N-terminal AAA(+) domain and a C-terminal winged-helix domain (WHD), but use remarkably few base-specific contacts. To investigate the biochemical effects of these associations, we mutated the DNA-interacting elements of the Orc1-1 and Orc1-3 paralogs from the archaeon Sulfolobus solfataricus, and tested their effect on origin binding and deformation. We find that the AAA(+) domain has an unpredicted role in controlling the sequence selectivity of DNA binding, despite an absence of base-specific contacts to this region. Our results show that both the WHD and ATPase region influence origin recognition by Orc1/Cdc6, and suggest that not only DNA sequence, but also local DNA structure help define archaeal initiator binding sites. © The Author(s) 2011. Published by Oxford University Press.

  15. Muricauda lutimaris sp. nov., isolated from a tidal flat of the Yellow Sea.

    PubMed

    Yoon, Jung-Hoon; Kang, So-Jung; Jung, Yong-Taek; Oh, Tae-Kwang

    2008-07-01

    A Gram-negative, non-motile, rod-shaped bacterial strain, SMK-108(T), was isolated from a tidal flat of the Yellow Sea in Korea and was subjected to a polyphasic taxonomic investigation. Strain SMK-108(T) grew optimally at pH 7.0-8.0 and at 30 degrees C. It contained MK-6 as the predominant menaquinone. The major fatty acids were iso-C(17 : 0) 3-OH, iso-C(15 : 1) and iso-C(15 : 0). The DNA G+C content was 41.1 mol%. Comparative 16S rRNA gene sequence analysis showed that strain SMK-108(T) was related most closely to members of the genus Muricauda, exhibiting 96.6-98.8 % sequence similarity to the type strains of recognized Muricauda species. Strain SMK-108(T) was distinguishable from recognized Muricauda species on the basis of differential phenotypic characteristics, levels of DNA-DNA relatedness and phylogenetic distinctiveness. This organism is thus considered to represent a novel species of the genus Muricauda, for which the name Muricauda lutimaris sp. nov. is proposed. The type strain is SMK-108(T) (=KCTC 22173(T) =CCUG 55324(T)).

  16. Are Africans, Europeans, and Asians Different “Races”? A Guided-Inquiry Lab for Introducing Undergraduate Students to Genetic Diversity and Preparing Them to Study Natural Selection

    PubMed Central

    Kalinowski, Steven T.; Andrews, Tessa M.; Leonard, Mary J.; Snodgrass, Meagan

    2012-01-01

    Many students do not recognize that individual organisms within populations vary, and this may make it difficult for them to recognize the essential role variation plays in natural selection. Also, many students have weak scientific reasoning skills, and this makes it difficult for them to recognize misconceptions they might have. This paper describes a 2-h laboratory for college students that introduces them to genetic diversity and gives them practice using hypothetico-deductive reasoning. In brief, the lab presents students with DNA sequences from Africans, Europeans, and Asians, and asks students to determine whether people from each continent qualify as distinct “races.” Comparison of the DNA sequences shows that people on each continent are not more similar to one another than to people on other continents, and therefore do not qualify as distinct races. Ninety-four percent of our students reported that the laboratory was interesting, and 79% reported that it was a valuable learning experience. We developed and used a survey to measure the extent to which students recognized variation and its significance within populations and showed that the lab increased student awareness of variation. We also showed that the lab improved the ability of students to construct hypothetico-deductive arguments. PMID:22665587

  17. Canis mtDNA HV1 database: a web-based tool for collecting and surveying Canis mtDNA HV1 haplotype in public database.

    PubMed

    Thai, Quan Ke; Chung, Dung Anh; Tran, Hoang-Dung

    2017-06-26

    Canine and wolf mitochondrial DNA haplotypes, which can be used for forensic or phylogenetic analyses, have been defined in various schemes depending on the region analyzed. In recent studies, the 582 bp fragment of the HV1 region is most commonly used. 317 different canine HV1 haplotypes have been reported in the rapidly growing public database GenBank. These reported haplotypes contain several inconsistencies in their haplotype information. To overcome this issue, we have developed a Canis mtDNA HV1 database. This database collects data on the HV1 582 bp region in dog mitochondrial DNA from the GenBank to screen and correct the inconsistencies. It also supports users in detection of new novel mutation profiles and assignment of new haplotypes. The Canis mtDNA HV1 database (CHD) contains 5567 nucleotide entries originating from 15 subspecies in the species Canis lupus. Of these entries, 3646 were haplotypes and grouped into 804 distinct sequences. 319 sequences were recognized as previously assigned haplotypes, while the remaining 485 sequences had new mutation profiles and were marked as new haplotype candidates awaiting further analysis for haplotype assignment. Of the 3646 nucleotide entries, only 414 were annotated with correct haplotype information, while 3232 had insufficient or lacked haplotype information and were corrected or modified before storing in the CHD. The CHD can be accessed at http://chd.vnbiology.com . It provides sequences, haplotype information, and a web-based tool for mtDNA HV1 haplotyping. The CHD is updated monthly and supplies all data for download. The Canis mtDNA HV1 database contains information about canine mitochondrial DNA HV1 sequences with reconciled annotation. It serves as a tool for detection of inconsistencies in GenBank and helps identifying new HV1 haplotypes. Thus, it supports the scientific community in naming new HV1 haplotypes and to reconcile existing annotation of HV1 582 bp sequences.

  18. The Relationship Between Human Nucleolar Organizer Regions and Nucleoli, Probed by 3D-ImmunoFISH.

    PubMed

    van Sluis, Marjolein; van Vuuren, Chelly; McStay, Brian

    2016-01-01

    3D-immunoFISH is a valuable technique to compare the localization of DNA sequences and proteins in cells where three-dimensional structure has been preserved. As nucleoli contain a multitude of protein factors dedicated to ribosome biogenesis and form around specific chromosomal loci, 3D-immunoFISH is a particularly relevant technique for their study. In human cells, nucleoli form around transcriptionally active ribosomal gene (rDNA) arrays termed nucleolar organizer regions (NORs) positioned on the p-arms of each of the acrocentric chromosomes. Here, we provide a protocol for fixing and permeabilizing human cells grown on microscope slides such that nucleolar proteins can be visualized using antibodies and NORs visualized by DNA FISH. Antibodies against UBF recognize transcriptionally active rDNA/NORs and NOP52 antibodies provide a convenient way of visualizing the nucleolar volume. We describe a probe designed to visualize rDNA and introduce a probe comprised of NOR distal sequences, which can be used to identify or count individual NORs.

  19. Active Site Sharing and Subterminal Hairpin Recognition in a New Class of DNA Transposases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ronning, Donald R.; Guynet, Catherine; Ton-Hoang, Bao

    2010-07-20

    Many bacteria harbor simple transposable elements termed insertion sequences (IS). In Helicobacter pylori, the chimeric IS605 family elements are particularly interesting due to their proximity to genes encoding gastric epithelial invasion factors. Protein sequences of IS605 transposases do not bear the hallmarks of other well-characterized transposases. We have solved the crystal structure of full-length transposase (TnpA) of a representative member, ISHp608. Structurally, TnpA does not resemble any characterized transposase; rather, it is related to rolling circle replication (RCR) proteins. Consistent with RCR, Mg{sup 2+} and a conserved tyrosine, Tyr127, are essential for DNA nicking and the formation of a covalentmore » intermediate between TnpA and DNA. TnpA is dimeric, contains two shared active sites, and binds two DNA stem loops representing the conserved inverted repeats near each end of ISHp608. The cocrystal structure with stem-loop DNA illustrates how this family of transposases specifically recognizes and pairs ends, necessary steps during transposition.« less

  20. [Tale nucleases--new tool for genome editing].

    PubMed

    Glazkova, D V; Shipulin, G A

    2014-01-01

    The ability to introduce targeted changes in the genome of living cells or entire organisms enables researchers to meet the challenges of basic life sciences, biotechnology and medicine. Knockdown of target genes in the zygotes gives the opportunity to investigate the functions of these genes in different organisms. Replacement of single nucleotide in the DNA sequence allows to correct mutations in genes and thus to cure hereditary diseases. Adding transgene to specific genomic.loci can be used in biotechnology for generation of organisms with certain properties or cell lines for biopharmaceutical production. Such manipulations of gene sequences in their natural chromosomal context became possible after the emergence of the technology called "genome editing". This technology is based on the induction of a double-strand break in a specific genomic target DNA using endonucleases that recognize the unique sequences in the genome and on subsequent recovery of DNA integrity through the use of cellular repair mechanisms. A necessary tool for the genome editing is a custom-designed endonuclease which is able to recognize selected sequences. The emergence of a new type of programmable endonucleases, which were constructed on the basis of bacterial proteins--TAL-effectors (Transcription activators like effector), has become an important stage in the development of technology and promoted wide spread of the genome editing. This article reviews the history of the discovery of TAL effectors and creation of TALE nucleases, and describes their advantages over zinc finger endonucleases that appeared earlier. A large section is devoted to description of genetic modifications that can be performed using the genome editing.

  1. DNA barcoding as an aid for species identification in austral black flies (Insecta: Diptera: Simuliidae).

    PubMed

    Hernández-Triana, Luis M; Montes De Oca, Fernanda; Prosser, Sean W J; Hebert, Paul D N; Gregory, T Ryan; McMurtrie, Shelley

    2017-04-01

    In this paper, the utility of a partial sequence of the COI gene, the DNA barcoding region, for the identification of species of black flies in the austral region was assessed. Twenty-eight morphospecies were analyzed: eight of the genus Austrosimulium (four species in the subgenus Austrosimulium s. str., three species in the subgenus Novaustrosimulium, and one species unassigned to subgenus), two of the genus Cnesia, eight of Gigantodax, three of Paracnephia, one of Paraustrosimulium, and six of Simulium (subgenera Morops, Nevermannia, and Pternaspatha). The neighbour-joining tree derived from the DNA barcode sequences grouped most specimens according to species or species groups recognized by morphotaxonomic studies. Intraspecific sequence divergences within morphologically distinct species ranged from 0% to 1.8%, while higher divergences (2%-4.2%) in certain species suggested the presence of cryptic diversity. The existence of well-defined groups within S. simile revealed the likely inclusion of cryptic diversity. DNA barcodes also showed that specimens identified as C. dissimilis, C. nr. pussilla, and C. ornata might be conspecific, suggesting possible synonymy. DNA barcoding combined with a sound morphotaxonomic framework would provide an effective approach for the identification of black flies in the region.

  2. Sequence variation between 462 human individuals fine-tunes functional sites of RNA processing

    NASA Astrophysics Data System (ADS)

    Ferreira, Pedro G.; Oti, Martin; Barann, Matthias; Wieland, Thomas; Ezquina, Suzana; Friedländer, Marc R.; Rivas, Manuel A.; Esteve-Codina, Anna; Estivill, Xavier; Guigó, Roderic; Dermitzakis, Emmanouil; Antonarakis, Stylianos; Meitinger, Thomas; Strom, Tim M.; Palotie, Aarno; François Deleuze, Jean; Sudbrak, Ralf; Lerach, Hans; Gut, Ivo; Syvänen, Ann-Christine; Gyllensten, Ulf; Schreiber, Stefan; Rosenstiel, Philip; Brunner, Han; Veltman, Joris; Hoen, Peter A. C. T.; Jan van Ommen, Gert; Carracedo, Angel; Brazma, Alvis; Flicek, Paul; Cambon-Thomsen, Anne; Mangion, Jonathan; Bentley, David; Hamosh, Ada; Rosenstiel, Philip; Strom, Tim M.; Lappalainen, Tuuli; Guigó, Roderic; Sammeth, Michael

    2016-09-01

    Recent advances in the cost-efficiency of sequencing technologies enabled the combined DNA- and RNA-sequencing of human individuals at the population-scale, making genome-wide investigations of the inter-individual genetic impact on gene expression viable. Employing mRNA-sequencing data from the Geuvadis Project and genome sequencing data from the 1000 Genomes Project we show that the computational analysis of DNA sequences around splice sites and poly-A signals is able to explain several observations in the phenotype data. In contrast to widespread assessments of statistically significant associations between DNA polymorphisms and quantitative traits, we developed a computational tool to pinpoint the molecular mechanisms by which genetic markers drive variation in RNA-processing, cataloguing and classifying alleles that change the affinity of core RNA elements to their recognizing factors. The in silico models we employ further suggest RNA editing can moonlight as a splicing-modulator, albeit less frequently than genomic sequence diversity. Beyond existing annotations, we demonstrate that the ultra-high resolution of RNA-Seq combined from 462 individuals also provides evidence for thousands of bona fide novel elements of RNA processing—alternative splice sites, introns, and cleavage sites—which are often rare and lowly expressed but in other characteristics similar to their annotated counterparts.

  3. AzaHx, a novel fluorescent, DNA minor groove and G·C recognition element: Synthesis and DNA binding properties of a p-anisyl-4-aza-benzimidazole-pyrrole-imidazole (azaHx-PI) polyamide.

    PubMed

    Satam, Vijay; Babu, Balaji; Patil, Pravin; Brien, Kimberly A; Olson, Kevin; Savagian, Mia; Lee, Megan; Mepham, Andrew; Jobe, Laura Beth; Bingham, John P; Pett, Luke; Wang, Shuo; Ferrara, Maddi; Bruce, Chrystal D; Wilson, W David; Lee, Moses; Hartley, John A; Kiakos, Konstantinos

    2015-09-01

    The design, synthesis, and DNA binding properties of azaHx-PI or p-anisyl-4-aza-benzimidazole-pyrrole-imidazole (5) are described. AzaHx, 2-(p-anisyl)-4-aza-benzimidazole-5-carboxamide, is a novel, fluorescent DNA recognition element, derived from Hoechst 33258 to recognize G·C base pairs. Supported by theoretical data, the results from DNase I footprinting, CD, ΔT(M), and SPR studies provided evidence that an azaHx/IP pairing, formed from antiparallel stacking of two azaHx-PI molecules in a side-by-side manner in the minor groove, selectively recognized a C-G doublet. AzaHx-PI was found to target 5'-ACGCGT-3', the Mlu1 Cell Cycle Box (MCB) promoter sequence with specificity and significant affinity (K(eq) 4.0±0.2×10(7) M(-1)). Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Expression of exogenous DNA methyltransferases: application in molecular and cell biology.

    PubMed

    Dyachenko, O V; Tarlachkov, S V; Marinitch, D V; Shevchuk, T V; Buryanov, Y I

    2014-02-01

    DNA methyltransferases might be used as powerful tools for studies in molecular and cell biology due to their ability to recognize and modify nitrogen bases in specific sequences of the genome. Methylation of the eukaryotic genome using exogenous DNA methyltransferases appears to be a promising approach for studies on chromatin structure. Currently, the development of new methods for targeted methylation of specific genetic loci using DNA methyltransferases fused with DNA-binding proteins is especially interesting. In the present review, expression of exogenous DNA methyltransferase for purposes of in vivo analysis of the functional chromatin structure along with investigation of the functional role of DNA methylation in cell processes are discussed, as well as future prospects for application of DNA methyltransferases in epigenetic therapy and in plant selection.

  5. A universal colorimetry for nucleic acids and aptamer-specific ligands detection based on DNA hybridization amplification.

    PubMed

    Li, Shuang; Shang, Xinxin; Liu, Jia; Wang, Yujie; Guo, Yingshu; You, Jinmao

    2017-07-01

    We present a universal amplified-colorimetric for detecting nucleic acid targets or aptamer-specific ligand targets based on gold nanoparticle-DNA (GNP-DNA) hybridization chain reaction (HCR). The universal arrays consisted of capture probe and hairpin DNA-GNP. First, capture probe recognized target specificity and released the initiator sequence. Then dispersed hairpin DNA modified GNPs were cross-linked to form aggregates through HCR events triggered by initiator sequence. As the aggregates accumulate, a significant red-to purple color change can be easily visualized by the naked eye. We used miRNA target sequence (miRNA-203) and aptamer-specific ligand (ATP) as target molecules for this proof-of-concept experiment. Initiator sequence (DNA2) was released from the capture probe (MNP/DNA1/2 conjugates) under the strong competitiveness of miRNA-203. Hairpin DNA (H1 and H2) can be complementary with the help of initiator DNA2 to form GNP-H1/GNP-H2 aggregates. The absorption ratio (A 620 /A 520 ) values of solutions were a sensitive function of miRNA-203 concentration covering from 1.0 × 10 -11  M to 9.0 × 10 -10  M, and as low as 1.0 × 10 -11  M could be detected. At the same time, the color changed from light wine red to purple and then to light blue have occurred in the solution. For ATP, initiator sequence (5'-end of DNA3) was released from the capture probe (DNA3) under the strong combination of aptamer-ATP. The present colorimetric for specific detection of ATP exhibited good sensitivity and 1.0 × 10 -8  M ATP could be detected. The proposed strategy also showed good performances for qualitative analysis and quantitative analysis of intracellular nucleic acids and aptamer-specific ligands. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Selection of a DNA barcode for Nectriaceae from fungal whole-genomes.

    PubMed

    Zeng, Zhaoqing; Zhao, Peng; Luo, Jing; Zhuang, Wenying; Yu, Zhihe

    2012-01-01

    A DNA barcode is a short segment of sequence that is able to distinguish species. A barcode must ideally contain enough variation to distinguish every individual species and be easily obtained. Fungi of Nectriaceae are economically important and show high species diversity. To establish a standard DNA barcode for this group of fungi, the genomes of Neurospora crassa and 30 other filamentous fungi were compared. The expect value was treated as a criterion to recognize homologous sequences. Four candidate markers, Hsp90, AAC, CDC48, and EF3, were tested for their feasibility as barcodes in the identification of 34 well-established species belonging to 13 genera of Nectriaceae. Two hundred and fifteen sequences were analyzed. Intra- and inter-specific variations and the success rate of PCR amplification and sequencing were considered as important criteria for estimation of the candidate markers. Ultimately, the partial EF3 gene met the requirements for a good DNA barcode: No overlap was found between the intra- and inter-specific pairwise distances. The smallest inter-specific distance of EF3 gene was 3.19%, while the largest intra-specific distance was 1.79%. In addition, there was a high success rate in PCR and sequencing for this gene (96.3%). CDC48 showed sufficiently high sequence variation among species, but the PCR and sequencing success rate was 84% using a single pair of primers. Although the Hsp90 and AAC genes had higher PCR and sequencing success rates (96.3% and 97.5%, respectively), overlapping occurred between the intra- and inter-specific variations, which could lead to misidentification. Therefore, we propose the EF3 gene as a possible DNA barcode for the nectriaceous fungi.

  7. Bioinformatics Approaches for Fetal DNA Fraction Estimation in Noninvasive Prenatal Testing

    PubMed Central

    Peng, Xianlu Laura; Jiang, Peiyong

    2017-01-01

    The discovery of cell-free fetal DNA molecules in plasma of pregnant women has created a paradigm shift in noninvasive prenatal testing (NIPT). Circulating cell-free DNA in maternal plasma has been increasingly recognized as an important proxy to detect fetal abnormalities in a noninvasive manner. A variety of approaches for NIPT using next-generation sequencing have been developed, which have been rapidly transforming clinical practices nowadays. In such approaches, the fetal DNA fraction is a pivotal parameter governing the overall performance and guaranteeing the proper clinical interpretation of testing results. In this review, we describe the current bioinformatics approaches developed for estimating the fetal DNA fraction and discuss their pros and cons. PMID:28230760

  8. Bioinformatics Approaches for Fetal DNA Fraction Estimation in Noninvasive Prenatal Testing.

    PubMed

    Peng, Xianlu Laura; Jiang, Peiyong

    2017-02-20

    The discovery of cell-free fetal DNA molecules in plasma of pregnant women has created a paradigm shift in noninvasive prenatal testing (NIPT). Circulating cell-free DNA in maternal plasma has been increasingly recognized as an important proxy to detect fetal abnormalities in a noninvasive manner. A variety of approaches for NIPT using next-generation sequencing have been developed, which have been rapidly transforming clinical practices nowadays. In such approaches, the fetal DNA fraction is a pivotal parameter governing the overall performance and guaranteeing the proper clinical interpretation of testing results. In this review, we describe the current bioinformatics approaches developed for estimating the fetal DNA fraction and discuss their pros and cons.

  9. Ribosomal DNA sequence heterogeneity reflects intraspecies phylogenies and predicts genome structure in two contrasting yeast species.

    PubMed

    West, Claire; James, Stephen A; Davey, Robert P; Dicks, Jo; Roberts, Ian N

    2014-07-01

    The ribosomal RNA encapsulates a wealth of evolutionary information, including genetic variation that can be used to discriminate between organisms at a wide range of taxonomic levels. For example, the prokaryotic 16S rDNA sequence is very widely used both in phylogenetic studies and as a marker in metagenomic surveys and the internal transcribed spacer region, frequently used in plant phylogenetics, is now recognized as a fungal DNA barcode. However, this widespread use does not escape criticism, principally due to issues such as difficulties in classification of paralogous versus orthologous rDNA units and intragenomic variation, both of which may be significant barriers to accurate phylogenetic inference. We recently analyzed data sets from the Saccharomyces Genome Resequencing Project, characterizing rDNA sequence variation within multiple strains of the baker's yeast Saccharomyces cerevisiae and its nearest wild relative Saccharomyces paradoxus in unprecedented detail. Notably, both species possess single locus rDNA systems. Here, we use these new variation datasets to assess whether a more detailed characterization of the rDNA locus can alleviate the second of these phylogenetic issues, sequence heterogeneity, while controlling for the first. We demonstrate that a strong phylogenetic signal exists within both datasets and illustrate how they can be used, with existing methodology, to estimate intraspecies phylogenies of yeast strains consistent with those derived from whole-genome approaches. We also describe the use of partial Single Nucleotide Polymorphisms, a type of sequence variation found only in repetitive genomic regions, in identifying key evolutionary features such as genome hybridization events and show their consistency with whole-genome Structure analyses. We conclude that our approach can transform rDNA sequence heterogeneity from a problem to a useful source of evolutionary information, enabling the estimation of highly accurate phylogenies of closely related organisms, and discuss how it could be extended to future studies of multilocus rDNA systems. [concerted evolution; genome hydridisation; phylogenetic analysis; ribosomal DNA; whole genome sequencing; yeast]. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  10. MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

    PubMed

    Ozaki, Haruka; Iwasaki, Wataru

    2016-08-01

    As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Uncommonly isolated clinical Pseudomonas: identification and phylogenetic assignation.

    PubMed

    Mulet, M; Gomila, M; Ramírez, A; Cardew, S; Moore, E R B; Lalucat, J; García-Valdés, E

    2017-02-01

    Fifty-two Pseudomonas strains that were difficult to identify at the species level in the phenotypic routine characterizations employed by clinical microbiology laboratories were selected for genotypic-based analysis. Species level identifications were done initially by partial sequencing of the DNA dependent RNA polymerase sub-unit D gene (rpoD). Two other gene sequences, for the small sub-unit ribosonal RNA (16S rRNA) and for DNA gyrase sub-unit B (gyrB) were added in a multilocus sequence analysis (MLSA) study to confirm the species identifications. These sequences were analyzed with a collection of reference sequences from the type strains of 161 Pseudomonas species within an in-house multi-locus sequence analysis database. Whole-cell matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) analyses of these strains complemented the DNA sequenced-based phylogenetic analyses and were observed to be in accordance with the results of the sequence data. Twenty-three out of 52 strains were assigned to 12 recognized species not commonly detected in clinical specimens and 29 (56 %) were considered representatives of at least ten putative new species. Most strains were distributed within the P. fluorescens and P. aeruginosa lineages. The value of rpoD sequences in species-level identifications for Pseudomonas is emphasized. The correct species identifications of clinical strains is essential for establishing the intrinsic antibiotic resistance patterns and improved treatment plans.

  12. Biorecognition by DNA oligonucleotides after Exposure to Photoresists and Resist Removers

    PubMed Central

    Dean, Stacey L.; Morrow, Thomas J.; Patrick, Sue; Li, Mingwei; Clawson, Gary; Mayer, Theresa S.; Keating, Christine D.

    2013-01-01

    Combining biological molecules with integrated circuit technology is of considerable interest for next generation sensors and biomedical devices. Current lithographic microfabrication methods, however, were developed for compatibility with silicon technology rather than bioorganic molecules and consequently it cannot be assumed that biomolecules will remain attached and intact during on-chip processing. Here, we evaluate the effects of three common photoresists (Microposit S1800 series, PMGI SF6, and Megaposit SPR 3012) and two photoresist removers (acetone and 1165 remover) on the ability of surface-immobilized DNA oligonucleotides to selectively recognize their reverse-complementary sequence. Two common DNA immobilization methods were compared: adsorption of 5′-thiolated sequences directly to gold nanowires and covalent attachment of 5′-thiolated sequences to surface amines on silica coated nanowires. We found that acetone had deleterious effects on selective hybridization as compared to 1165 remover, presumably due to incomplete resist removal. Use of the PMGI photoresist, which involves a high temperature bake step, was detrimental to the later performance of nanowire-bound DNA in hybridization assays, especially for DNA attached via thiol adsorption. The other three photoresists did not substantially degrade DNA binding capacity or selectivity for complementary DNA sequences. To determine if the lithographic steps caused more subtle damage, we also tested oligonucleotides containing a single base mismatch. Finally, a two-step photolithographic process was developed and used in combination with dielectrophoretic nanowire assembly to produce an array of doubly-contacted, electrically isolated individual nanowire components on a chip. Post-fabrication fluorescence imaging indicated that nanowire-bound DNA was present and able to selectively bind complementary strands. PMID:23952639

  13. A DNA barcode library for ground beetles (Insecta, Coleoptera, Carabidae) of Germany: The genus Bembidion Latreille, 1802 and allied taxa

    PubMed Central

    Raupach, Michael J.; Hannig, Karsten; Morinière, Jérome; Hendrich, Lars

    2016-01-01

    Abstract As molecular identification method, DNA barcoding based on partial cytochrome c oxidase subunit 1 (COI) sequences has been proven to be a useful tool for species determination in many insect taxa including ground beetles. In this study we tested the effectiveness of DNA barcodes to discriminate species of the ground beetle genus Bembidion and some closely related taxa of Germany. DNA barcodes were obtained from 819 individuals and 78 species, including sequences from previous studies as well as more than 300 new generated DNA barcodes. We found a 1:1 correspondence between BIN and traditionally recognized species for 69 species (89%). Low interspecific distances with maximum pairwise K2P values below 2.2% were found for three species pairs, including two species pairs with haplotype sharing (Bembidion atrocaeruleum/Bembidion varicolor and Bembidion guttula/Bembidion mannerheimii). In contrast to this, deep intraspecific sequence divergences with distinct lineages were revealed for two species (Bembidion geniculatum/Ocys harpaloides). Our study emphasizes the use of DNA barcodes for the identification of the analyzed ground beetles species and represents an important step in building-up a comprehensive barcode library for the Carabidae in Germany and Central Europe as well. PMID:27408547

  14. Plasmodium falciparum Nucleosomes Exhibit Reduced Stability and Lost Sequence Dependent Nucleosome Positioning

    PubMed Central

    Silberhorn, Elisabeth; Schwartz, Uwe; Symelka, Anne; de Koning-Ward, Tania; Längst, Gernot

    2016-01-01

    The packaging and organization of genomic DNA into chromatin represents an additional regulatory layer of gene expression, with specific nucleosome positions that restrict the accessibility of regulatory DNA elements. The mechanisms that position nucleosomes in vivo are thought to depend on the biophysical properties of the histones, sequence patterns, like phased di-nucleotide repeats and the architecture of the histone octamer that folds DNA in 1.65 tight turns. Comparative studies of human and P. falciparum histones reveal that the latter have a strongly reduced ability to recognize internal sequence dependent nucleosome positioning signals. In contrast, the nucleosomes are positioned by AT-repeat sequences flanking nucleosomes in vivo and in vitro. Further, the strong sequence variations in the plasmodium histones, compared to other mammalian histones, do not present adaptations to its AT-rich genome. Human and parasite histones bind with higher affinity to GC-rich DNA and with lower affinity to AT-rich DNA. However, the plasmodium nucleosomes are overall less stable, with increased temperature induced mobility, decreased salt stability of the histones H2A and H2B and considerable reduced binding affinity to GC-rich DNA, as compared with the human nucleosomes. In addition, we show that plasmodium histone octamers form the shortest known nucleosome repeat length (155bp) in vitro and in vivo. Our data suggest that the biochemical properties of the parasite histones are distinct from the typical characteristics of other eukaryotic histones and these properties reflect the increased accessibility of the P. falciparum genome. PMID:28033404

  15. Retrotransposon insertion targeting: a mechanism for homogenization of centromere sequences on nonhomologous chromosomes.

    PubMed

    Birchler, James A; Presting, Gernot G

    2012-04-01

    The centromeres of most eukaryotic organisms consist of highly repetitive arrays that are similar across nonhomologous chromosomes. These sequences evolve rapidly, thus posing a mystery as to how such arrays can be homogenized. Recent work in species in which centromere-enriched retrotransposons occur indicates that these elements preferentially insert into the centromeric regions. In two different Arabidopsis species, a related element was recognized in which the specificity for such targeting was altered. These observations provide a partial explanation for how homogenization of centromere DNA sequences occurs.

  16. Megabase sequencing of human genome by ordered-shotgun-sequencing (OSS) strategy

    NASA Astrophysics Data System (ADS)

    Chen, Ellson Y.

    1997-05-01

    So far we have used OSS strategy to sequence over 2 megabases DNA in large-insert clones from regions of human X chromosomes with different characteristic levels of GC content. The method starts by randomly fragmenting a BAC, YAC or PAC to 8-12 kb pieces and subcloning those into lambda phage. Insert-ends of these clones are sequenced and overlapped to create a partial map. Complete sequencing is then done on a minimal tiling path of selected subclones, recursively focusing on those at the edges of contigs to facilitate mergers of clones across the entire target. To reduce manual labor, PCR processes have been adapted to prepare sequencing templates throughout the entire operation. The streamlined process can thus lend itself to further automation. The OSS approach is suitable for large- scale genomic sequencing, providing considerable flexibility in the choice of subclones or regions for more or less intensive sequencing. For example, subclones containing contaminating host cell DNA or cloning vector can be recognized and ignored with minimal sequencing effort; regions overlapping a neighboring clone already sequenced need not be redone; and segments containing tandem repeats or long repetitive sequences can be spotted early on and targeted for additional attention.

  17. A new restriction endonuclease from Citrobacter freundii

    PubMed Central

    Janulaitis, A.A.; Stakenas, P.S.; Lebedenko, E.N.; Berlin, Yu.A.

    1982-01-01

    CfrI, a new restriction endonuclease of unique substrate specificity, has been isolated from a Citrobacter freundii strain. The enzyme recognizes a degenerated sequence PyGGCCPu in double-strand DNA and cleaves it between Py and G residues to yield 5′ -protruding tetranucleotide ends GGCC. Images PMID:6294607

  18. Zygosaccharomyces kombuchaensis, a new ascosporogenous yeast from 'Kombucha tea'.

    PubMed

    Kurtzman, C P; Robnett, C J; Basehoar-Powers, E

    2001-07-01

    A new ascosporogenous yeast, Zygosaccharomyces kombuchaensis sp. n. (type strain NRRL YB-4811, CBS 8849), is described; it was isolated from Kombucha tea, a popular fermented tea-based beverage. The four known strains of the new species have identical nucleotide sequences in domain D1/D2 of 26S rDNA. Phylogenetic analysis of D1/D2 and 18S rDNA sequences places Z. kombuchaensis near Zygosaccharomyces lentus. The two species are indistinguishable on standard physiological tests used for yeast identification, but can be recognized from differences in restriction fragment length polymorphism patterns obtained by digestion of 18S-ITS1 amplicons with the restriction enzymes DdeI and MboI.

  19. A Children's Oncology Group and TARGET initiative exploring the genetic landscape of Wilms tumor. | Office of Cancer Genomics

    Cancer.gov

    We performed genome-wide sequencing and analyzed mRNA and miRNA expression, DNA copy number, and DNA methylation in 117 Wilms tumors, followed by targeted sequencing of 651 Wilms tumors. In addition to genes previously implicated in Wilms tumors (WT1, CTNNB1, AMER1, DROSHA, DGCR8, XPO5, DICER1, SIX1, SIX2, MLLT1, MYCN, and TP53), we identified mutations in genes not previously recognized as recurrently involved in Wilms tumors, the most frequent being BCOR, BCORL1, NONO, MAX, COL6A3, ASXL1, MAP3K4, and ARID1A.

  20. Lactobacillus hammesii sp. nov., isolated from French sourdough.

    PubMed

    Valcheva, Rosica; Korakli, Maher; Onno, Bernard; Prévost, Hervé; Ivanova, Iskra; Ehrmann, Matthias A; Dousset, Xavier; Gänzle, Michael G; Vogel, Rudi F

    2005-03-01

    Twenty morphologically different strains were chosen from French wheat sourdough isolates. Cells were Gram-positive, non-spore-forming, non-motile rods. The isolates were identified using amplified-fragment length polymorphism, randomly amplified polymorphic DNA and 16S rRNA gene sequence analysis. All isolates were members of the genus Lactobacillus. They were identified as representing Lactobacillus plantarum, Lactobacillus paralimentarius, Lactobacillus sanfranciscensis, Lactobacillus spicheri and Lactobacillus sakei. However, two isolates (LP38(T) and LP39) could be clearly discriminated from recognized Lactobacillus species on the basis of genotyping methods. 16S rRNA gene sequence similarity and DNA-DNA relatedness data indicate that the two strains belong to a novel Lactobacillus species, for which the name Lactobacillus hammesii is proposed. The type strain is LP38(T) (=DSM 16381(T)=CIP 108387(T)=TMW 1.1236(T)).

  1. Impact of Somatic Mutations in the D-Loop of Mitochondrial DNA on the Survival of Oral Squamous Cell Carcinoma Patients

    PubMed Central

    Lin, Jin-Ching; Wang, Chen-Chi; Jiang, Rong-San; Wang, Wen-Yi; Liu, Shih-An

    2015-01-01

    Objectives The aim of this study was to investigate somatic mutations in the D-loop of mitochondrial DNA (mtDNA) and their impact on survival in oral squamous cell carcinoma patients. Materials and Methods Surgical specimen confirmed by pathological examination and corresponding non-cancerous tissues were collected from 120 oral squamous cell carcinoma patients. The sequence in the D-loop of mtDNA from non-cancerous tissues was compared with that from paired cancer samples and any sequence differences were recognized as somatic mutations. Results Somatic mutations in the D-loop of mtDNA were identified in 75 (62.5%) oral squamous cell carcinoma patients and most of them occurred in the poly-C tract. Although there were no significant differences in demographic and tumor-related features between participants with and without somatic mutation, the mutation group had a better survival rate (5 year disease-specific survival rate: 64.0% vs. 43.0%, P = 0.0266). Conclusion Somatic mutation in D-loop of mtDNA was associated with a better survival in oral squamous cell carcinoma patients. PMID:25906372

  2. Modeling Structure-Function Relationships in Synthetic DNA Sequences using Attribute Grammars

    PubMed Central

    Cai, Yizhi; Lux, Matthew W.; Adam, Laura; Peccoud, Jean

    2009-01-01

    Recognizing that certain biological functions can be associated with specific DNA sequences has led various fields of biology to adopt the notion of the genetic part. This concept provides a finer level of granularity than the traditional notion of the gene. However, a method of formally relating how a set of parts relates to a function has not yet emerged. Synthetic biology both demands such a formalism and provides an ideal setting for testing hypotheses about relationships between DNA sequences and phenotypes beyond the gene-centric methods used in genetics. Attribute grammars are used in computer science to translate the text of a program source code into the computational operations it represents. By associating attributes with parts, modifying the value of these attributes using rules that describe the structure of DNA sequences, and using a multi-pass compilation process, it is possible to translate DNA sequences into molecular interaction network models. These capabilities are illustrated by simple example grammars expressing how gene expression rates are dependent upon single or multiple parts. The translation process is validated by systematically generating, translating, and simulating the phenotype of all the sequences in the design space generated by a small library of genetic parts. Attribute grammars represent a flexible framework connecting parts with models of biological function. They will be instrumental for building mathematical models of libraries of genetic constructs synthesized to characterize the function of genetic parts. This formalism is also expected to provide a solid foundation for the development of computer assisted design applications for synthetic biology. PMID:19816554

  3. Signatures of DNA Methylation across Insects Suggest Reduced DNA Methylation Levels in Holometabola

    PubMed Central

    Provataris, Panagiotis; Meusemann, Karen; Niehuis, Oliver; Grath, Sonja; Misof, Bernhard

    2018-01-01

    Abstract It has been experimentally shown that DNA methylation is involved in the regulation of gene expression and the silencing of transposable element activity in eukaryotes. The variable levels of DNA methylation among different insect species indicate an evolutionarily flexible role of DNA methylation in insects, which due to a lack of comparative data is not yet well-substantiated. Here, we use computational methods to trace signatures of DNA methylation across insects by analyzing transcriptomic and genomic sequence data from all currently recognized insect orders. We conclude that: 1) a functional methylation system relying exclusively on DNA methyltransferase 1 is widespread across insects. 2) DNA methylation has potentially been lost or extremely reduced in species belonging to springtails (Collembola), flies and relatives (Diptera), and twisted-winged parasites (Strepsiptera). 3) Holometabolous insects display signs of reduced DNA methylation levels in protein-coding sequences compared with hemimetabolous insects. 4) Evolutionarily conserved insect genes associated with housekeeping functions tend to display signs of heavier DNA methylation in comparison to the genomic/transcriptomic background. With this comparative study, we provide the much needed basis for experimental and detailed comparative analyses required to gain a deeper understanding on the evolution and function of DNA methylation in insects. PMID:29697817

  4. Two distinct DNA sequences recognized by transcription factors represent enthalpy and entropy optima

    PubMed Central

    Yin, Yimeng; Das, Pratyush K; Jolma, Arttu; Zhu, Fangjie; Popov, Alexander; Xu, You; Nilsson, Lennart

    2018-01-01

    Most transcription factors (TFs) can bind to a population of sequences closely related to a single optimal site. However, some TFs can bind to two distinct sequences that represent two local optima in the Gibbs free energy of binding (ΔG). To determine the molecular mechanism behind this effect, we solved the structures of human HOXB13 and CDX2 bound to their two optimal DNA sequences, CAATAAA and TCGTAAA. Thermodynamic analyses by isothermal titration calorimetry revealed that both sites were bound with similar ΔG. However, the interaction with the CAA sequence was driven by change in enthalpy (ΔH), whereas the TCG site was bound with similar affinity due to smaller loss of entropy (ΔS). This thermodynamic mechanism that leads to at least two local optima likely affects many macromolecular interactions, as ΔG depends on two partially independent variables ΔH and ΔS according to the central equation of thermodynamics, ΔG = ΔH - TΔS. PMID:29638214

  5. Directed evolution of the TALE N-terminal domain for recognition of all 5′ bases

    PubMed Central

    Lamb, Brian M.; Mercer, Andrew C.; Barbas, Carlos F.

    2013-01-01

    Transcription activator-like effector (TALE) proteins can be designed to bind virtually any DNA sequence. General guidelines for design of TALE DNA-binding domains suggest that the 5′-most base of the DNA sequence bound by the TALE (the N0 base) should be a thymine. We quantified the N0 requirement by analysis of the activities of TALE transcription factors (TALE-TF), TALE recombinases (TALE-R) and TALE nucleases (TALENs) with each DNA base at this position. In the absence of a 5′ T, we observed decreases in TALE activity up to >1000-fold in TALE-TF activity, up to 100-fold in TALE-R activity and up to 10-fold reduction in TALEN activity compared with target sequences containing a 5′ T. To develop TALE architectures that recognize all possible N0 bases, we used structure-guided library design coupled with TALE-R activity selections to evolve novel TALE N-terminal domains to accommodate any N0 base. A G-selective domain and broadly reactive domains were isolated and characterized. The engineered TALE domains selected in the TALE-R format demonstrated modularity and were active in TALE-TF and TALEN architectures. Evolved N-terminal domains provide effective and unconstrained TALE-based targeting of any DNA sequence as TALE binding proteins and designer enzymes. PMID:23980031

  6. Universal Readers Based on Hydrogen Bonding or π-π Stacking for Identification of DNA Nucleotides in Electron Tunnel Junctions.

    PubMed

    Biswas, Sovan; Sen, Suman; Im, JongOne; Biswas, Sudipta; Krstic, Predrag; Ashcroft, Brian; Borges, Chad; Zhao, Yanan; Lindsay, Stuart; Zhang, Peiming

    2016-12-27

    A reader molecule, which recognizes all the naturally occurring nucleobases in an electron tunnel junction, is required for sequencing DNA by a recognition tunneling (RT) technique, referred to as a universal reader. In the present study, we have designed a series of heterocyclic carboxamides based on hydrogen bonding and a large-sized pyrene ring based on a π-π stacking interaction as universal reader candidates. Each of these compounds was synthesized to bear a thiolated linker for attachment to metal electrodes and examined for their interactions with naturally occurring DNA nucleosides and nucleotides by 1 H NMR, ESI-MS, computational calculations, and surface plasmon resonance. RT measurements were carried out in a scanning tunnel microscope. All of these molecules generated electrical signals with DNA nucleotides in tunneling junctions under physiological conditions (phosphate buffered aqueous solution, pH 7.4). Using a support vector machine as a tool for data analysis, we found that these candidates distinguished among naturally occurring DNA nucleotides with the accuracy of pyrene (by π-π stacking interactions) > azole carboxamides (by hydrogen-bonding interactions). In addition, the pyrene reader operated efficiently in a larger tunnel junction. However, the azole carboxamide could read abasic (AP) monophosphate, a product from spontaneous base hydrolysis or an intermediate of base excision repair. Thus, we envision that sequencing DNA using both π-π stacking and hydrogen-bonding-based universal readers in parallel should generate more comprehensive genome sequences than sequencing based on either reader molecule alone.

  7. Differing roles for zinc fingers in DNA recognition: Structure of a six-finger transcription factor IIIA complex

    PubMed Central

    Nolte, Robert T.; Conlin, Rachel M.; Harrison, Stephen C.; Brown, Raymond S.

    1998-01-01

    The crystal structure of the six NH2-terminal zinc fingers of Xenopus laevis transcription factor IIIA (TFIIIA) bound with 31 bp of the 5S rRNA gene promoter has been determined at 3.1 Å resolution. Individual zinc fingers are positioned differently in the major groove and across the minor groove of DNA to span the entire length of the duplex. These results show how TFIIIA can recognize several separated DNA sequences by using fewer fingers than necessary for continuous winding in the major groove. PMID:9501194

  8. The fluorescently responsive 3-(naphthalen-1-ylethynyl)-3-deaza-2'-deoxyguanosine discriminates cytidine via the DNA minor groove.

    PubMed

    Suzuki, Azusa; Yanagi, Masaki; Takeda, Takuya; Hudson, Robert H E; Saito, Yoshio

    2017-09-26

    A new environmentally responsive fluorescent nucleoside, 3-(naphthalen-1-ylethynyl)-3-deaza-2'-deoxyguanosine ( 3nz G), has been synthesized. The nucleoside, 3nz G, exhibited solvatochromic properties and when introduced into ODN probes it was able to recognize 2'-deoxycytidine in target strands by a distinct change in its emission wavelength through probing microenvironmental changes in the DNA minor groove. Thus, 3nz G has the potential for use as a fluorescent probe molecule for micro-structural studies of nucleic acids including the detection of single-base alterations in target DNA sequences.

  9. The recognition and modification sites for the bacterial type I restriction systems KpnAI, StySEAI, StySENI and StySGI

    PubMed Central

    Kasarjian, Julie K. A.; Hidaka, Masumi; Horiuchi, Takashi; Iida, Masatake; Ryu, Junichi

    2004-01-01

    Using an in vivo plasmid transformation method, we have determined the DNA sequences recognized by the KpnAI, StySEAI, StySENI and StySGI R-M systems from Klebsiella oxytoca strain M5a1, Salmonella eastbourne, Salmonella enteritidis and Salmonella gelsenkirchen, respectively. These type I restriction-modification systems were originally identified using traditional phage assay, and described here is the plasmid transformation test and computer program used to determine their DNA recognition sequences. For this test, we constructed two sets of plasmids, pL and pE, that contain phage lambda and Escherichia coli K-12 chromosomal DNA fragments, respectively. Further, using the methylation sensitivities of various known type II restriction enzymes, we identified the target adenines for methylation (listed in bold italics below as A or T in case of the complementary strand). The recognition sequence and methylation sites are GAA(6N)TGCC (KpnAI), ACA(6N)TYCA (StySEAI), CGA(6N)TACC (StySENI) and TAAC(7N)RTCG (StySGI). These DNA recognition sequences all have a typical type I bipartite pattern and represent three novel specificities and one isoschizomer (StySENI). For confirmation, oligonucleotides containing each of the predicted sequences were synthesized, cloned into plasmid pMECA and transformed into each strain, resulting in a large reduction in efficiency of transformation (EOT). PMID:15199175

  10. An Evolutionary/Biochemical Connection Between Promoter- and Primer-Dependent Polymerases Revealed by Selective Evolution of Ligands by Exponential Enrichment (SELEX).

    PubMed

    Fenstermacher, Katherine J; Achuthan, Vasudevan; Schneider, Thomas D; DeStefano, Jeffrey J

    2018-01-16

    DNA polymerases (DNAPs) recognize 3' recessed termini on duplex DNA and carry out nucleotide catalysis. Unlike promoter-specific RNA polymerases (RNAPs), no sequence specificity is required for binding or initiation of catalysis. Despite this, previous results indicate that viral reverse transcriptases bind much more tightly to DNA primers that mimic the polypurine tract. In the current report, primer sequences that bind with high affinity to Taq and Klenow polymerases were identified using a modified Selective Evolution of Ligands by Exponential Enrichment (SELEX) approach. Two Taq -specific primers that bound ∼10 (Taq1) and over 100 (Taq2) times more stably than controls to Taq were identified. Taq1 contained 8 nucleotides (5' -CACTAAAG-3') that matched the phage T3 RNAP "core" promoter. Both primers dramatically outcompeted primers with similar binding thermodynamics in PCR reactions. Similarly, exonuclease minus Klenow polymerase also selected a high affinity primer that contained a related core promoter sequence from phage T7 RNAP (5' -ACTATAG-3'). For both Taq and Klenow, even small modifications to the sequence resulted in large losses in binding affinity suggesting that binding was highly sequence-specific. The results are discussed in the context of possible effects on multi-primer (multiplex) PCR assays, molecular information theory, and the evolution of RNAPs and DNAPs. Importance This work further demonstrates that primer-dependent DNA polymerases can have strong sequence biases leading to dramatically tighter binding to specific sequences. These may be related to biological function, or be a consequences of the structural architecture of the enzyme. New sequence specificity for Taq and Klenow polymerases were uncovered and among them were sequences that contained the core promoter elements from T3 and T7 phage RNA polymerase promoters. This suggests the intriguing possibility that phage RNA polymerases exploited intrinsic binding affinities of ancestral DNA polymerases to develop their promotors. Conversely, DNA polymerases could have evolved from related RNA polymerases and retained the intrinsic binding preference despite there being no clear function for such a preference in DNA biology. Copyright © 2018 American Society for Microbiology.

  11. Mitochondrial DNA sequences of 37 collar-spined echinostomes (Digenea: Echinostomatidae) in Thailand and Lao PDR reveals presence of two species: Echinostoma revolutum and E. miyagawai.

    PubMed

    Nagataki, Mitsuru; Tantrawatpan, Chairat; Agatsuma, Takeshi; Sugiura, Tetsuro; Duenngai, Kunyarat; Sithithaworn, Paiboon; Andrews, Ross H; Petney, Trevor N; Saijuntha, Weerachai

    2015-10-01

    The "37 collar-spined" or "revolutum" group of echinostomes is recognized as a species complex. The identification of members of this complex by morphological taxonomic characters is difficult and confusing, and hence, molecular analyses are a useful alternative method for molecular systematic studies. The current study examined the genetic diversity of those 37 collar-spined echinostomes which are recognized morphologically as Echinostoma revolutum in Thailand and Lao PDR using the cytochrome c oxidase subunit 1 (CO1) and the NADH dehydrogenase subunit 1 (ND1) sequences. On the basis of molecular investigations, at least two species of 37 collar-spined echinostomes exist in Southeast Asia, namely E. revolutum and Echinostoma miyagawai. The specimens examined in this study, coming from ducks in Thailand and Lao PDR, were compared to isolates from America, Europe and Australia for which DNA sequences are available in public databases. Haplotype analysis detected 6 and 26 haplotypes when comparing the CO1 sequences of E. revolutum and E. miyagawai, respectively, from different geographical isolates from Thailand and Lao PDR. The phylogenetic trees, ND1 haplotype network and genetic differentiation (ɸST) analyses showed that E. revolutum were genetically different on a continental scale, i.e. Eurasian and American lineages. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. DNA motifs associated with aberrant CpG island methylation.

    PubMed

    Feltus, F Alex; Lee, Eva K; Costello, Joseph F; Plass, Christoph; Vertino, Paula M

    2006-05-01

    Epigenetic silencing involving the aberrant methylation of promoter region CpG islands is widely recognized as a tumor suppressor silencing mechanism in cancer. However, the molecular pathways underlying aberrant DNA methylation remain elusive. Recently we showed that, on a genome-wide level, CpG island loci differ in their intrinsic susceptibility to aberrant methylation and that this susceptibility can be predicted based on underlying sequence context. These data suggest that there are sequence/structural features that contribute to the protection from or susceptibility to aberrant methylation. Here we use motif elicitation coupled with classification techniques to identify DNA sequence motifs that selectively define methylation-prone or methylation-resistant CpG islands. Motifs common to 28 methylation-prone or 47 methylation-resistant CpG island-containing genomic fragments were determined using the MEME and MAST algorithms (). The five most discriminatory motifs derived from methylation-prone sequences were found to be associated with CpG islands in general and were nonrandomly distributed throughout the genome. In contrast, the eight most discriminatory motifs derived from the methylation-resistant CpG islands were randomly distributed throughout the genome. Interestingly, this latter group tended to associate with Alu and other repetitive sequences. Used together, the frequency of occurrence of these motifs successfully discriminated methylation-prone and methylation-resistant CpG island groups with an accuracy of 87% after 10-fold cross-validation. The motifs identified here are candidate methylation-targeting or methylation-protection DNA sequences.

  13. First report of the root-rot pathogen, Armillaria nabsnona, from Hawaii

    Treesearch

    J. W. Hanna; N. B. Klopfenstein; M. -S. Kim

    2007-01-01

    The genus Armillaria (2) and Armillaria mellea sensu lato (3) have been reported previously from Hawaii. However, Armillaria species in Hawaii have not been previously identified by DNA sequences, compatibility tests, or other methods that distinguish currently recognized taxa. In August 2005, Armillaria rhizomorphs and mycelial bark fans were collected from two...

  14. Recognition of AT-Rich DNA Binding Sites by the MogR Repressor

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shen, Aimee; Higgins, Darren E.; Panne, Daniel

    2009-07-22

    The MogR transcriptional repressor of the intracellular pathogen Listeria monocytogenes recognizes AT-rich binding sites in promoters of flagellar genes to downregulate flagellar gene expression during infection. We describe here the 1.8 A resolution crystal structure of MogR bound to the recognition sequence 5' ATTTTTTAAAAAAAT 3' present within the flaA promoter region. Our structure shows that MogR binds as a dimer. Each half-site is recognized in the major groove by a helix-turn-helix motif and in the minor groove by a loop from the symmetry-related molecule, resulting in a 'crossover' binding mode. This oversampling through minor groove interactions is important for specificity.more » The MogR binding site has structural features of A-tract DNA and is bent by approximately 52 degrees away from the dimer. The structure explains how MogR achieves binding specificity in the AT-rich genome of L. monocytogenes and explains the evolutionary conservation of A-tract sequence elements within promoter regions of MogR-regulated flagellar genes.« less

  15. Campylobacter fetus subsp. testudinum subsp. nov., isolated from humans and reptiles.

    PubMed

    Fitzgerald, Collette; Tu, Zheng Chao; Patrick, Mary; Stiles, Tracy; Lawson, Andy J; Santovenia, Monica; Gilbert, Maarten J; van Bergen, Marcel; Joyce, Kevin; Pruckler, Janet; Stroika, Steven; Duim, Birgitta; Miller, William G; Loparev, Vladimir; Sinnige, Jan C; Fields, Patricia I; Tauxe, Robert V; Blaser, Martin J; Wagenaar, Jaap A

    2014-09-01

    A polyphasic study was undertaken to determine the taxonomic position of 13 Campylobacter fetus-like strains from humans (n = 8) and reptiles (n = 5). The results of matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) MS and genomic data from sap analysis, 16S rRNA gene and hsp60 sequence comparison, pulsed-field gel electrophoresis, amplified fragment length polymorphism analysis, DNA-DNA hybridization and whole genome sequencing demonstrated that these strains are closely related to C. fetus but clearly differentiated from recognized subspecies of C. fetus. Therefore, this unique cluster of 13 strains represents a novel subspecies within the species C. fetus, for which the name Campylobacter fetus subsp. testudinum subsp. nov. is proposed, with strain 03-427(T) ( = ATCC BAA-2539(T) = LMG 27499(T)) as the type strain. Although this novel taxon could not be differentiated from C. fetus subsp. fetus and C. fetus subsp. venerealis using conventional phenotypic tests, MALDI-TOF MS revealed the presence of multiple phenotypic biomarkers which distinguish Campylobacter fetus subsp. testudinum subsp. nov. from recognized subspecies of C. fetus.

  16. Rapid Detection and Identification of a Pathogen's DNA Using Phi29 DNA Polymerase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xu, Y.; Dunn, J.; Gao, S.

    2008-10-31

    Zoonotic pathogens including those transmitted by insect vectors are some of the most deadly of all infectious diseases known to mankind. A number of these agents have been further weaponized and are widely recognized as being potentially significant biothreat agents. We describe a novel method based on multiply-primed rolling circle in vitro amplification for profiling genomic DNAs to permit rapid, cultivation-free differential detection and identification of circular plasmids in infectious agents. Using Phi29 DNA polymerase and a two-step priming reaction we could reproducibly detect and characterize by DNA sequencing circular DNA from Borrelia burgdorferi B31 in DNA samples containing asmore » little as 25 pg of Borrelia DNA amongst a vast excess of human DNA. This simple technology can ultimately be adapted as a sensitive method to detect specific DNA from both known and unknown pathogens in a wide variety of complex environments.« less

  17. A conserved mechanism for replication origin recognition and binding in archaea.

    PubMed

    Majerník, Alan I; Chong, James P J

    2008-01-15

    To date, methanogens are the only group within the archaea where firing DNA replication origins have not been demonstrated in vivo. In the present study we show that a previously identified cluster of ORB (origin recognition box) sequences do indeed function as an origin of replication in vivo in the archaeon Methanothermobacter thermautotrophicus. Although the consensus sequence of ORBs in M. thermautotrophicus is somewhat conserved when compared with ORB sequences in other archaea, the Cdc6-1 protein from M. thermautotrophicus (termed MthCdc6-1) displays sequence-specific binding that is selective for the MthORB sequence and does not recognize ORBs from other archaeal species. Stabilization of in vitro MthORB DNA binding by MthCdc6-1 requires additional conserved sequences 3' to those originally described for M. thermautotrophicus. By testing synthetic sequences bearing mutations in the MthORB consensus sequence, we show that Cdc6/ORB binding is critically dependent on the presence of an invariant guanine found in all archaeal ORB sequences. Mutation of a universally conserved arginine residue in the recognition helix of the winged helix domain of archaeal Cdc6-1 shows that specific origin sequence recognition is dependent on the interaction of this arginine residue with the invariant guanine. Recognition of a mutated origin sequence can be achieved by mutation of the conserved arginine residue to a lysine or glutamine residue. Thus despite a number of differences in protein and DNA sequences between species, the mechanism of origin recognition and binding appears to be conserved throughout the archaea.

  18. A Children's Oncology Group and TARGET initiative exploring the genetic landscape of Wilms tumor.

    PubMed

    Gadd, Samantha; Huff, Vicki; Walz, Amy L; Ooms, Ariadne H A G; Armstrong, Amy E; Gerhard, Daniela S; Smith, Malcolm A; Auvil, Jaime M Guidry; Meerzaman, Daoud; Chen, Qing-Rong; Hsu, Chih Hao; Yan, Chunhua; Nguyen, Cu; Hu, Ying; Hermida, Leandro C; Davidsen, Tanja; Gesuwan, Patee; Ma, Yussanne; Zong, Zusheng; Mungall, Andrew J; Moore, Richard A; Marra, Marco A; Dome, Jeffrey S; Mullighan, Charles G; Ma, Jing; Wheeler, David A; Hampton, Oliver A; Ross, Nicole; Gastier-Foster, Julie M; Arold, Stefan T; Perlman, Elizabeth J

    2017-10-01

    We performed genome-wide sequencing and analyzed mRNA and miRNA expression, DNA copy number, and DNA methylation in 117 Wilms tumors, followed by targeted sequencing of 651 Wilms tumors. In addition to genes previously implicated in Wilms tumors (WT1, CTNNB1, AMER1, DROSHA, DGCR8, XPO5, DICER1, SIX1, SIX2, MLLT1, MYCN, and TP53), we identified mutations in genes not previously recognized as recurrently involved in Wilms tumors, the most frequent being BCOR, BCORL1, NONO, MAX, COL6A3, ASXL1, MAP3K4, and ARID1A. DNA copy number changes resulted in recurrent 1q gain, MYCN amplification, LIN28B gain, and MIRLET7A loss. Unexpected germline variants involved PALB2 and CHEK2. Integrated analyses support two major classes of genetic changes that preserve the progenitor state and/or interrupt normal development.

  19. [Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].

    PubMed

    Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou

    2002-01-01

    To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.

  20. Extending the language of DNA molecular recognition by polyamides: unexpected influence of imidazole and pyrrole arrangement on binding affinity and specificity.

    PubMed

    Buchmueller, Karen L; Staples, Andrew M; Howard, Cameron M; Horick, Sarah M; Uthe, Peter B; Le, N Minh; Cox, Kari K; Nguyen, Binh; Pacheco, Kimberly A O; Wilson, W David; Lee, Moses

    2005-01-19

    Pyrrole (Py) and imidazole (Im) polyamides can be designed to target specific DNA sequences. The effect that the pyrrole and imidazole arrangement, plus DNA sequence, have on sequence specificity and binding affinity has been investigated using DNA melting (DeltaT(M)), circular dichroism (CD), and surface plasmon resonance (SPR) studies. SPR results obtained from a complete set of triheterocyclic polyamides show a dramatic difference in the affinity of f-ImPyIm for its cognate DNA (K(eq) = 1.9 x 10(8) M(-1)) and f-PyPyIm for its cognate DNA (K(eq) = 5.9 x 10(5) M(-1)), which could not have been anticipated prior to characterization of these compounds. Moreover, f-ImPyIm has a 10-fold greater affinity for CGCG than distamycin A has for its cognate, AATT. To understand this difference, the triamide dimers are divided into two structural groupings: central and terminal pairings. The four possible central pairings show decreasing selectivity and affinity for their respective cognate sequences: -ImPy > -PyPy- > -PyIm- approximately -ImIm-. These results extend the language of current design motifs for polyamide sequence recognition to include the use of "words" for recognizing two adjacent base pairs, rather than "letters" for binding to single base pairs. Thus, polyamides designed to target Watson-Crick base pairs should utilize the strength of -ImPy- and -PyPy- central pairings. The f/Im and f/Py terminal groups yielded no advantage for their respective C/G or T/A base pairs. The exception is with the -ImPy- central pairing, for which f/Im has a 10-fold greater affinity for C/G than f/Py has for T/A.

  1. AP1 Keeps Chromatin Poised for Action | Center for Cancer Research

    Cancer.gov

    The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins called chromatin that compacts the DNA in the nucleus, strongly restricting access to DNA sequences. As a result, regulatory factors only interact with a small subset of their potential binding elements in a given cell to regulate genes. How factors recognize and select sites in chromatin across the genome is not well understood -- but several discoveries in CCR’s Laboratory of Receptor Biology and Gene Expression (LRBGE) have shed light on the mechanisms that direct factors to DNA.

  2. Structural Basis for Sequence-specific DNA Recognition by an Arabidopsis WRKY Transcription Factor*

    PubMed Central

    Yamasaki, Kazuhiko; Kigawa, Takanori; Watanabe, Satoru; Inoue, Makoto; Yamasaki, Tomoko; Seki, Motoaki; Shinozaki, Kazuo; Yokoyama, Shigeyuki

    2012-01-01

    The WRKY family transcription factors regulate plant-specific reactions that are mostly related to biotic and abiotic stresses. They share the WRKY domain, which recognizes a DNA element (TTGAC(C/T)) termed the W-box, in target genes. Here, we determined the solution structure of the C-terminal WRKY domain of Arabidopsis WRKY4 in complex with the W-box DNA by NMR. A four-stranded β-sheet enters the major groove of DNA in an atypical mode termed the β-wedge, where the sheet is nearly perpendicular to the DNA helical axis. Residues in the conserved WRKYGQK motif contact DNA bases mainly through extensive apolar contacts with thymine methyl groups. The importance of these contacts was verified by substituting the relevant T bases with U and by surface plasmon resonance analyses of DNA binding. PMID:22219184

  3. Natrinema gari sp. nov., a halophilic archaeon isolated from fish sauce in Thailand.

    PubMed

    Tapingkae, Wanaporn; Tanasupawat, Somboon; Itoh, Takashi; Parkin, Kirk L; Benjakul, Soottawat; Visessanguan, Wonnop; Valyasevi, Ruud

    2008-10-01

    Two Gram-negative, rod-shaped, halophilic archaea, designated strains HIS40-3(T) and HDS3-1, were isolated from anchovy fish sauce (nam-pla) collected from two different locations in Thailand. The two strains were able to grow at 20-60 degrees C (optimum 37-40 degrees C), at 1.7-5.1 M NaCl (optimum 2.6-3.4 M NaCl) and at pH 5.5-8.5 (optimum pH 6.0-6.5). Hypotonic treatment with less than 1.7 M NaCl caused cell lysis. The major polar lipids of the isolates were C(20)C(20) and C(20)C(25) derivatives of phosphatidylglycerol, phosphatidylglycerol phosphate methyl ester, phosphatidylglycerol sulfate, two glycolipids and one unidentified lipid. The DNA G+C contents were 64.0-65.4 mol%. In addition to phenotypic and chemotaxonomic characteristics, phylogenetic analysis based on 16S rRNA gene sequence similarities showed that strains HIS40-3(T) and HDS3-1 were related most closely to species of the genus Natrinema. Levels of 16S rRNA gene sequence similarity between strains HIS40-3(T) and HDS3-1 and the type strains of recognized Natrinema species were 99.1-96.6 %. The two novel strains could be distinguished from recognized Natrinema species on the basis of low levels of DNA-DNA relatedness and differences in whole-cell protein patterns and phenotypic properties. Levels of 16S rRNA gene sequence similarity and DNA-DNA relatedness between the two strains were 99.7 and 77.7 %, respectively, suggesting that they should be classified as representing a single species. Based on these taxonomic data, strains HIS40-3(T) and HDS3-1 are considered to represent a novel species of the genus Natrinema, for which the name Natrinema gari sp. nov. is proposed. The type strain is HIS40-3(T) (=BCC 24370(T) =JCM 14663(T) =PCU 303(T)).

  4. Crystal Structures of SlyA Protein, a Master Virulence Regulator of Salmonella, in Free and DNA-bound States

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dolan, Kyle T.; Duguid, Erica M.; He, Chuan

    2011-11-17

    SlyA is a master virulence regulator that controls the transcription of numerous genes in Salmonella enterica. We present here crystal structures of SlyA by itself and bound to a high-affinity DNA operator sequence in the slyA gene. SlyA interacts with DNA through direct recognition of a guanine base by Arg-65, as well as interactions between conserved Arg-86 and the minor groove and a large network of non-base-specific contacts with the sugar phosphate backbone. Our structures, together with an unpublished structure of SlyA bound to the small molecule effector salicylate (Protein Data Bank code 3DEU), reveal that, unlike many other MarRmore » family proteins, SlyA dissociates from DNA without large conformational changes when bound to this effector. We propose that SlyA and other MarR global regulators rely more on indirect readout of DNA sequence to exert control over many genes, in contrast to proteins (such as OhrR) that recognize a single operator.« less

  5. Molecular Architecture of Full-length TRF1 Favors Its Interaction with DNA.

    PubMed

    Boskovic, Jasminka; Martinez-Gago, Jaime; Mendez-Pertuz, Marinela; Buscato, Alberto; Martinez-Torrecuadrada, Jorge Luis; Blasco, Maria A

    2016-10-07

    Telomeres are specific DNA-protein structures found at both ends of eukaryotic chromosomes that protect the genome from degradation and from being recognized as double-stranded breaks. In vertebrates, telomeres are composed of tandem repeats of the TTAGGG sequence that are bound by a six-subunit complex called shelterin. Molecular mechanisms of telomere functions remain unknown in large part due to lack of structural data on shelterins, shelterin complex, and its interaction with the telomeric DNA repeats. TRF1 is one of the best studied shelterin components; however, the molecular architecture of the full-length protein remains unknown. We have used single-particle electron microscopy to elucidate the structure of TRF1 and its interaction with telomeric DNA sequence. Our results demonstrate that full-length TRF1 presents a molecular architecture that assists its interaction with telometic DNA and at the same time makes TRFH domains accessible to other TRF1 binding partners. Furthermore, our studies suggest hypothetical models on how other proteins as TIN2 and tankyrase contribute to regulate TRF1 function. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  6. Molecular Architecture of Full-length TRF1 Favors Its Interaction with DNA*

    PubMed Central

    Boskovic, Jasminka; Martinez-Gago, Jaime; Mendez-Pertuz, Marinela; Buscato, Alberto; Martinez-Torrecuadrada, Jorge Luis; Blasco, Maria A.

    2016-01-01

    Telomeres are specific DNA-protein structures found at both ends of eukaryotic chromosomes that protect the genome from degradation and from being recognized as double-stranded breaks. In vertebrates, telomeres are composed of tandem repeats of the TTAGGG sequence that are bound by a six-subunit complex called shelterin. Molecular mechanisms of telomere functions remain unknown in large part due to lack of structural data on shelterins, shelterin complex, and its interaction with the telomeric DNA repeats. TRF1 is one of the best studied shelterin components; however, the molecular architecture of the full-length protein remains unknown. We have used single-particle electron microscopy to elucidate the structure of TRF1 and its interaction with telomeric DNA sequence. Our results demonstrate that full-length TRF1 presents a molecular architecture that assists its interaction with telometic DNA and at the same time makes TRFH domains accessible to other TRF1 binding partners. Furthermore, our studies suggest hypothetical models on how other proteins as TIN2 and tankyrase contribute to regulate TRF1 function. PMID:27563064

  7. Sequencing our way towards understanding global eukaryotic biodiversity

    PubMed Central

    Bik, Holly M.; Porazinska, Dorota L.; Creer, Simon; Caporaso, J. Gregory; Knight, Rob; Thomas, W. Kelley

    2011-01-01

    Microscopic eukaryotes are abundant, diverse, and fill critical ecological roles across every ecosystem on earth, yet there is a well-recognized gap in our understanding of their global biodiversity. Fundamental advances in DNA sequencing and bioinformatics now allow accurate en masse biodiversity assessments of microscopic eukaryotes from environmental samples. Despite a promising outlook, the field of eukaryotic marker gene surveys faces significant challenges: how to generate data that is most useful to the community, especially in the face of evolving sequencing technology and bioinformatics pipelines, and how to incorporate an expanding number of target genes. PMID:22244672

  8. Dynamic epigenetic states of maize centromeres

    PubMed Central

    Liu, Yalin; Su, Handong; Zhang, Jing; Liu, Yang; Han, Fangpu; Birchler, James A.

    2015-01-01

    The centromere is a specialized chromosomal region identified as the major constriction, upon which the kinetochore complex is formed, ensuring accurate chromosome orientation and segregation during cell division. The rapid evolution of centromere DNA sequence and the conserved centromere function are two contradictory aspects of centromere biology. Indeed, the sole presence of genetic sequence is not sufficient for centromere formation. Various dicentric chromosomes with one inactive centromere have been recognized. It has also been found that de novo centromere formation is common on fragments in which centromeric DNA sequences are lost. Epigenetic factors play important roles in centromeric chromatin assembly and maintenance. Non-disjunction of the supernumerary B chromosome centromere is independent of centromere function, but centromere pairing during early prophase of meiosis I requires an active centromere. This review discusses recent studies in maize about genetic and epigenetic elements regulating formation and maintenance of centromere chromatin, as well as centromere behavior in meiosis. PMID:26579154

  9. A protocol for isolating insect mitochondrial genomes: a case study of NUMT in Melipona flavolineata (Hymenoptera: Apidae).

    PubMed

    Françoso, Elaine; Gomes, Fernando; Arias, Maria Cristina

    2016-07-01

    Nuclear mitochondrial DNA insertions (NUMTs) are mitochondrial DNA sequences that have been transferred into the nucleus and are recognized by the presence of indels and stop codons. Although NUMTs have been identified in a diverse range of species, their discovery was frequently accidental. Here, our initial goal was to develop and standardize a simple method for isolating NUMTs from the nuclear genome of a single bee. Subsequently, we tested our new protocol by determining whether the indels and stop codons of the cytochrome c oxidase subunit I (COI) sequence of Melipona flavolineata are of nuclear origin. The new protocol successfully demonstrated the presence of a COI NUMT. In addition to NUMT investigations, the protocol described here will also be very useful for studying mitochondrial mutations related to diseases and for sequencing complete mitochondrial genomes with high read coverage by Next-Generation technology.

  10. Dynamic epigenetic states of maize centromeres.

    PubMed

    Liu, Yalin; Su, Handong; Zhang, Jing; Liu, Yang; Han, Fangpu; Birchler, James A

    2015-01-01

    The centromere is a specialized chromosomal region identified as the major constriction, upon which the kinetochore complex is formed, ensuring accurate chromosome orientation and segregation during cell division. The rapid evolution of centromere DNA sequence and the conserved centromere function are two contradictory aspects of centromere biology. Indeed, the sole presence of genetic sequence is not sufficient for centromere formation. Various dicentric chromosomes with one inactive centromere have been recognized. It has also been found that de novo centromere formation is common on fragments in which centromeric DNA sequences are lost. Epigenetic factors play important roles in centromeric chromatin assembly and maintenance. Non-disjunction of the supernumerary B chromosome centromere is independent of centromere function, but centromere pairing during early prophase of meiosis I requires an active centromere. This review discusses recent studies in maize about genetic and epigenetic elements regulating formation and maintenance of centromere chromatin, as well as centromere behavior in meiosis.

  11. Label-free detection of DNA hybridization using carbon nanotube network field-effect transistors

    NASA Astrophysics Data System (ADS)

    Star, Alexander; Tu, Eugene; Niemann, Joseph; Gabriel, Jean-Christophe P.; Joiner, C. Steve; Valcke, Christian

    2006-01-01

    We report carbon nanotube network field-effect transistors (NTNFETs) that function as selective detectors of DNA immobilization and hybridization. NTNFETs with immobilized synthetic oligonucleotides have been shown to specifically recognize target DNA sequences, including H63D single-nucleotide polymorphism (SNP) discrimination in the HFE gene, responsible for hereditary hemochromatosis. The electronic responses of NTNFETs upon single-stranded DNA immobilization and subsequent DNA hybridization events were confirmed by using fluorescence-labeled oligonucleotides and then were further explored for label-free DNA detection at picomolar to micromolar concentrations. We have also observed a strong effect of DNA counterions on the electronic response, thus suggesting a charge-based mechanism of DNA detection using NTNFET devices. Implementation of label-free electronic detection assays using NTNFETs constitutes an important step toward low-cost, low-complexity, highly sensitive and accurate molecular diagnostics. hemochromatosis | SNP | biosensor

  12. Preparation of metagenomic libraries from naturally occurring marine viruses.

    PubMed

    Solonenko, Sergei A; Sullivan, Matthew B

    2013-01-01

    Microbes are now well recognized as major drivers of the biogeochemical cycling that fuels the Earth, and their viruses (phages) are known to be abundant and important in microbial mortality, horizontal gene transfer, and modulating microbial metabolic output. Investigation of environmental phages has been frustrated by an inability to culture the vast majority of naturally occurring diversity coupled with the lack of robust, quantitative, culture-independent methods for studying this uncultured majority. However, for double-stranded DNA phages, a quantitative viral metagenomic sample-to-sequence workflow now exists. Here, we review these advances with special emphasis on the technical details of preparing DNA sequencing libraries for metagenomic sequencing from environmentally relevant low-input DNA samples. Library preparation steps broadly involve manipulating the sample DNA by fragmentation, end repair and adaptor ligation, size fractionation, and amplification. One critical area of future research and development is parallel advances for alternate nucleic acid types such as single-stranded DNA and RNA viruses that are also abundant in nature. Combinations of recent advances in fragmentation (e.g., acoustic shearing and tagmentation), ligation reactions (adaptor-to-template ratio reference table availability), size fractionation (non-gel-sizing), and amplification (linear amplification for deep sequencing and linker amplification protocols) enhance our ability to generate quantitatively representative metagenomic datasets from low-input DNA samples. Such datasets are already providing new insights into the role of viruses in marine systems and will continue to do so as new environments are explored and synergies and paradigms emerge from large-scale comparative analyses. © 2013 Elsevier Inc. All rights reserved.

  13. CasA mediates Cas3-catalyzed target degradation during CRISPR RNA-guided interference.

    PubMed

    Hochstrasser, Megan L; Taylor, David W; Bhat, Prashant; Guegler, Chantal K; Sternberg, Samuel H; Nogales, Eva; Doudna, Jennifer A

    2014-05-06

    In bacteria, the clustered regularly interspaced short palindromic repeats (CRISPR)-associated (Cas) DNA-targeting complex Cascade (CRISPR-associated complex for antiviral defense) uses CRISPR RNA (crRNA) guides to bind complementary DNA targets at sites adjacent to a trinucleotide signature sequence called the protospacer adjacent motif (PAM). The Cascade complex then recruits Cas3, a nuclease-helicase that catalyzes unwinding and cleavage of foreign double-stranded DNA (dsDNA) bearing a sequence matching that of the crRNA. Cascade comprises the CasA-E proteins and one crRNA, forming a structure that binds and unwinds dsDNA to form an R loop in which the target strand of the DNA base pairs with the 32-nt RNA guide sequence. Single-particle electron microscopy reconstructions of dsDNA-bound Cascade with and without Cas3 reveal that Cascade positions the PAM-proximal end of the DNA duplex at the CasA subunit and near the site of Cas3 association. The finding that the DNA target and Cas3 colocalize with CasA implicates this subunit in a key target-validation step during DNA interference. We show biochemically that base pairing of the PAM region is unnecessary for target binding but critical for Cas3-mediated degradation. In addition, the L1 loop of CasA, previously implicated in PAM recognition, is essential for Cas3 activation following target binding by Cascade. Together, these data show that the CasA subunit of Cascade functions as an essential partner of Cas3 by recognizing DNA target sites and positioning Cas3 adjacent to the PAM to ensure cleavage.

  14. Poxvirus uracil-DNA glycosylase-An unusual member of the family I uracil-DNA glycosylases: Poxvirus Uracil-DNA Glycosylase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schormann, Norbert; Zhukovskaya, Natalia; Bedwell, Gregory

    We report that uracil-DNA glycosylases are ubiquitous enzymes, which play a key role repairing damages in DNA and in maintaining genomic integrity by catalyzing the first step in the base excision repair pathway. Within the superfamily of uracil-DNA glycosylases family I enzymes or UNGs are specific for recognizing and removing uracil from DNA. These enzymes feature conserved structural folds, active site residues and use common motifs for DNA binding, uracil recognition and catalysis. Within this family the enzymes of poxviruses are unique and most remarkable in terms of amino acid sequences, characteristic motifs and more importantly for their novel non-enzymaticmore » function in DNA replication. UNG of vaccinia virus, also known as D4, is the most extensively characterized UNG of the poxvirus family. D4 forms an unusual heterodimeric processivity factor by attaching to a poxvirus-specific protein A20, which also binds to the DNA polymerase E9 and recruits other proteins necessary for replication. D4 is thus integrated in the DNA polymerase complex, and its DNA-binding and DNA scanning abilities couple DNA processivity and DNA base excision repair at the replication fork. In conclusion, the adaptations necessary for taking on the new function are reflected in the amino acid sequence and the three-dimensional structure of D4. We provide an overview of the current state of the knowledge on the structure-function relationship of D4.« less

  15. The effects of cytosine methylation on general transcription factors

    NASA Astrophysics Data System (ADS)

    Jin, Jianshi; Lian, Tengfei; Gu, Chan; Yu, Kai; Gao, Yi Qin; Su, Xiao-Dong

    2016-07-01

    DNA methylation on CpG sites is the most common epigenetic modification. Recently, methylation in a non-CpG context was found to occur widely on genomic DNA. Moreover, methylation of non-CpG sites is a highly controlled process, and its level may vary during cellular development. To study non-CpG methylation effects on DNA/protein interactions, we have chosen three human transcription factors (TFs): glucocorticoid receptor (GR), brain and muscle ARNT-like 1 (BMAL1) - circadian locomotor output cycles kaput (CLOCK) and estrogen receptor (ER) with methylated or unmethylated DNA binding sequences, using single-molecule and isothermal titration calorimetry assays. The results demonstrated that these TFs interact with methylated DNA with different effects compared with their cognate DNA sequences. The effects of non-CpG methylation on transcriptional regulation were validated by cell-based luciferase assay at protein level. The mechanisms of non-CpG methylation influencing DNA-protein interactions were investigated by crystallographic analyses and molecular dynamics simulation. With BisChIP-seq assays in HEK-293T cells, we found that GR can recognize highly methylated sites within chromatin in cells. Therefore, we conclude that non-CpG methylation of DNA can provide a mechanism for regulating gene expression through directly affecting the binding of TFs.

  16. DNA Barcode Identification of Freshwater Snails in the Family Bithyniidae from Thailand

    PubMed Central

    Kulsantiwong, Jutharat; Prasopdee, Sattrachai; Ruangsittichai, Jiraporn; Ruangjirachuporn, Wipaporn; Boonmars, Thidarut; Viyanant, Vithoon; Pierossi, Paola; Hebert, Paul D. N.; Tesana, Smarn

    2013-01-01

    Freshwater snails in the family Bithyniidae are the first intermediate host for Southeast Asian liver fluke (Opisthorchis viverrini), the causative agent of opisthorchiasis. Unfortunately, the subtle morphological characters that differentiate species in this group are not easily discerned by non-specialists. This is a serious matter because the identification of bithyniid species is a fundamental prerequisite for better understanding of the epidemiology of this disease. Because DNA barcoding, the analysis of sequence diversity in the 5’ region of the mitochondrial COI gene, has shown strong performance in other taxonomic groups, we decided to test its capacity to resolve 10 species/ subspecies of bithyniids from Thailand. Our analysis of 217 specimens indicated that COI sequences delivered species-level identification for 9 of 10 currently recognized species. The mean intraspecific divergence of COI was 2.3% (range 0-9.2 %), whereas sequence divergences between congeneric species averaged 8.7% (range 0-22.2 %). Although our results indicate that DNA barcoding can differentiate species of these medically-important snails, we also detected evidence for the presence of one overlooked species and one possible case of synonymy. PMID:24223896

  17. Construction and characterization of a normalized cDNA library of Nannochloropsis oculata (Eustigmatophyceae)

    NASA Astrophysics Data System (ADS)

    Yu, Jianzhong; Ma, Xiaolei; Pan, Kehou; Yang, Guanpin; Yu, Wengong

    2010-07-01

    We constructed and characterized a normalized cDNA library of Nannochloropsis oculata CS-179, and obtained 905 nonredundant sequences (NRSs) ranging from 431-1 756 bp in length. Among them, 496 were very similar to nonredundant ones in the GenBank ( E ≤1.0e-05), and 349 ESTs had significant hits with the clusters of eukaryotic orthologous groups (KOG). Bases G and/or C at the third position of codons of 14 amino acid residues suggested a strong bias in the conserved domain of 362 NRSs (>60%). We also identified the unigenes encoding phosphorus and nitrogen transporters, suggesting that N. oculata could efficiently transport and metabolize phosphorus and nitrogen, and recognized the unigenes that involved in biosynthesis and storage of both fatty acids and polyunsaturated fatty acids (PUFAs), which will facilitate the demonstration of eicosapentaenoic acid (EPA) biosynthesis pathway of N. oculata. In comparison with the original cDNA library, the normalized library significantly increased the efficiencies of random sequencing and rarely expressed genes discovering, and decreased the frequency of abundant gene sequences.

  18. Mechanism of DNA binding enhancement by hepatitis B virus protein pX.

    PubMed

    Palmer, C R; Gegnas, L D; Schepartz, A

    1997-12-09

    At least three hundred million people worldwide are infected with the hepatitis B virus (HBV), and epidemiological studies show a clear correlation between chronic HBV infection and the development of hepatocellular carcinoma. HBV encodes a protein, pX, which abducts the cellular transcriptional machinery in several ways including direct interactions with bZIP transcription factors. These interactions increase the DNA affinities of target bZIP proteins in a DNA sequence-dependent manner. Here we use a series of bZIP peptide models to explore the mechanism by which pX interacts with bZIP proteins. Our results suggest that pX increases bZIP.DNA stability by increasing the stability of the bZIP dimer as well as the affinity of the dimer for DNA. Additional experiments provide evidence for a mechanism in which pX recognizes the composite structure of the peptide.DNA complex, not simply the primary peptide sequence. These experiments provide a framework for understanding how pX alters the patterns of transcription within the nucleus. The similarities between the mechanism proposed for pX and the mechanism previously proposed for the human T-cell leukemia virus protein Tax are discussed.

  19. Characterization of monomeric DNA-binding protein Histone H1 in Leishmania braziliensis.

    PubMed

    Carmelo, Emma; González, Gloria; Cruz, Teresa; Osuna, Antonio; Hernández, Mariano; Valladares, Basilio

    2011-08-01

    Histone H1 in Leishmania presents relevant differences compared to higher eukaryote counterparts, such as the lack of a DNA-binding central globular domain. Despite that, it is apparently fully functional since its differential expression levels have been related to changes in chromatin condensation and infectivity, among other features. The localization and the aggregation state of L. braziliensis H1 has been determined by immunolocalization, mass spectrometry, cross-linking and electrophoretic mobility shift assays. Analysis of H1 sequences from the Leishmania Genome Database revealed that our protein is included in a very divergent group of histones H1 that is present only in L. braziliensis. An antibody raised against recombinant L. braziliensis H1 recognized specifically that protein by immunoblot in L. braziliensis extracts, but not in other Leishmania species, a consequence of the sequence divergences observed among Leishmania species. Mass spectrometry analysis and in vitro DNA-binding experiments have also proven that L. braziliensis H1 is monomeric in solution, but oligomerizes upon binding to DNA. Finally, despite the lack of a globular domain, L. braziliensis H1 is able to form complexes with DNA in vitro, with higher affinity for supercoiled compared to linear DNA.

  20. Recognition of DNA abasic site nanocavity by fluorophore-switched probe: Suitable for all sequence environments

    NASA Astrophysics Data System (ADS)

    Wang, Ying; Hu, Yuehua; Wu, Tao; Zhang, Lihua; Liu, Hua; Zhou, Xiaoshun; Shao, Yong

    2016-01-01

    Removal of a damaged base in DNA produces an abasic site (AP site) nanocavity. If left un-repaired in vivo by the specific enzyme, this nanocavity will result in nucleotide mutation in the following DNA replication. Therefore, selective recognition of AP site nanocavity by small molecules is important for identification of such DNA damage and development of genetic drugs. In this work, we investigate the fluorescence behavior of isoquinoline alkaloids including palmatine (PAL), berberine (BER), epiberberine (EPI), jatrorrhizine (JAT), coptisine (COP), coralyne (COR), worenine (WOR), berberrubine (BEU), sanguinarine (SAN), chelerythrine (CHE), and nitidine (NIT) upon binding with the AP nanocavity. PAL is screened out as the most efficient fluorophore-switched probe to recognize the AP nanocavity over the fully matched DNA. Its fluorescence enhancement occurs for all of the AP nanocavity sequence environments, which has not been achieved by the previously used probes. The bridged π conjugation effect should partially contribute to the AP nanocavity-specific fluorescence, as opposed to the solvent effect. Due to the strong binding with the AP nanocavity, PAL will find wide applications in the DNA damage recognition and sensor development.

  1. The Staphylococcus aureus pSK41 plasmid-encoded ArtA protein is a master regulator of plasmid transmission genes and contains a RHH motif used in alternate DNA-binding modes.

    PubMed

    Ni, Lisheng; Jensen, Slade O; Ky Tonthat, Nam; Berg, Tracey; Kwong, Stephen M; Guan, Fiona H X; Brown, Melissa H; Skurray, Ronald A; Firth, Neville; Schumacher, Maria A

    2009-11-01

    Plasmids harbored by Staphylococcus aureus are a major contributor to the spread of bacterial multi-drug resistance. Plasmid conjugation and partition are critical to the dissemination and inheritance of such plasmids. Here, we demonstrate that the ArtA protein encoded by the S. aureus multi-resistance plasmid pSK41 is a global transcriptional regulator of pSK41 genes, including those involved in conjugation and segregation. ArtA shows no sequence homology to any structurally characterized DNA-binding protein. To elucidate the mechanism by which it specifically recognizes its DNA site, we obtained the structure of ArtA bound to its cognate operator, ACATGACATG. The structure reveals that ArtA is representative of a new family of ribbon-helix-helix (RHH) DNA-binding proteins that contain extended, N-terminal basic motifs. Strikingly, unlike most well-studied RHH proteins ArtA binds its cognate operators as a dimer. However, we demonstrate that it is also able to recognize an atypical operator site by binding as a dimer-of-dimers and the extended N-terminal regions of ArtA were shown to be essential for this dimer-of-dimer binding mode. Thus, these data indicate that ArtA is a master regulator of genes critical for both horizontal and vertical transmission of pSK41 and that it can recognize DNA utilizing alternate binding modes.

  2. The Staphylococcus aureus pSK41 plasmid-encoded ArtA protein is a master regulator of plasmid transmission genes and contains a RHH motif used in alternate DNA-binding modes

    PubMed Central

    Ni, Lisheng; Jensen, Slade O.; Ky Tonthat, Nam; Berg, Tracey; Kwong, Stephen M.; Guan, Fiona H. X.; Brown, Melissa H.; Skurray, Ronald A.; Firth, Neville; Schumacher, Maria A.

    2009-01-01

    Plasmids harbored by Staphylococcus aureus are a major contributor to the spread of bacterial multi-drug resistance. Plasmid conjugation and partition are critical to the dissemination and inheritance of such plasmids. Here, we demonstrate that the ArtA protein encoded by the S. aureus multi-resistance plasmid pSK41 is a global transcriptional regulator of pSK41 genes, including those involved in conjugation and segregation. ArtA shows no sequence homology to any structurally characterized DNA-binding protein. To elucidate the mechanism by which it specifically recognizes its DNA site, we obtained the structure of ArtA bound to its cognate operator, ACATGACATG. The structure reveals that ArtA is representative of a new family of ribbon–helix–helix (RHH) DNA-binding proteins that contain extended, N-terminal basic motifs. Strikingly, unlike most well-studied RHH proteins ArtA binds its cognate operators as a dimer. However, we demonstrate that it is also able to recognize an atypical operator site by binding as a dimer-of-dimers and the extended N-terminal regions of ArtA were shown to be essential for this dimer-of-dimer binding mode. Thus, these data indicate that ArtA is a master regulator of genes critical for both horizontal and vertical transmission of pSK41 and that it can recognize DNA utilizing alternate binding modes. PMID:19759211

  3. Detection of Bartonella Species in the Blood of Veterinarians and Veterinary Technicians: A Newly Recognized Occupational Hazard?

    PubMed Central

    Maggi, Ricardo G.; Ferguson, Brandy; Varkey, Jay; Park, Lawrence P.; Breitschwerdt, Edward B.

    2014-01-01

    Abstract Background: Bartonella species are important emerging pathogens in human and veterinary medicine. In the context of their daily activities, veterinary professionals have frequent animal contact and arthropod exposures. Detection of Bartonella spp. using traditional culture methods has been limited by poor sensitivity, making it difficult to determine the prevalence of infection in this population. We have developed a detection method combining enrichment culture and molecular amplification, which increases testing sensitivity. Methods: We performed a cross-sectional study to determine the prevalence of detectable Bartonella spp. in the blood of veterinary personnel and nonveterinary control subjects. Bartonella was detected by enrichment blood culture with conventional PCR followed by DNA sequencing. Results were correlated with epidemiological variables and symptoms. Results: We detected DNA from at least one Bartonella species in 32 (28%) of the 114 veterinary subjects. After DNA sequencing, the Bartonella species could be determined for 27 of the 32 infected subjects, including B. henselae in 15 (56%), B. vinsonii subsp. berkhoffii in seven (26%), B. koehlerae in six (22%), and a B. volans–like sequence in one (4%). Seventy percent of Bartonella-positive subjects described headache compared with 40% of uninfected veterinarians (p=0.009). Irritability was also reported more commonly by infected subjects (68% vs. 43%, p=0.04). Conclusions: Our study supports an emerging body of evidence that cryptic Bartonella bloodstream infection may be more frequent in humans than previously recognized and may induce symptoms. Longitudinal studies are needed to determine the natural course and clinical features of Bartonella infection. PMID:25072986

  4. A Novel Computational Method for Detecting DNA Methylation Sites with DNA Sequence Information and Physicochemical Properties.

    PubMed

    Pan, Gaofeng; Jiang, Limin; Tang, Jijun; Guo, Fei

    2018-02-08

    DNA methylation is an important biochemical process, and it has a close connection with many types of cancer. Research about DNA methylation can help us to understand the regulation mechanism and epigenetic reprogramming. Therefore, it becomes very important to recognize the methylation sites in the DNA sequence. In the past several decades, many computational methods-especially machine learning methods-have been developed since the high-throughout sequencing technology became widely used in research and industry. In order to accurately identify whether or not a nucleotide residue is methylated under the specific DNA sequence context, we propose a novel method that overcomes the shortcomings of previous methods for predicting methylation sites. We use k -gram, multivariate mutual information, discrete wavelet transform, and pseudo amino acid composition to extract features, and train a sparse Bayesian learning model to do DNA methylation prediction. Five criteria-area under the receiver operating characteristic curve (AUC), Matthew's correlation coefficient (MCC), accuracy (ACC), sensitivity (SN), and specificity-are used to evaluate the prediction results of our method. On the benchmark dataset, we could reach 0.8632 on AUC, 0.8017 on ACC, 0.5558 on MCC, and 0.7268 on SN. Additionally, the best results on two scBS-seq profiled mouse embryonic stem cells datasets were 0.8896 and 0.9511 by AUC, respectively. When compared with other outstanding methods, our method surpassed them on the accuracy of prediction. The improvement of AUC by our method compared to other methods was at least 0.0399 . For the convenience of other researchers, our code has been uploaded to a file hosting service, and can be downloaded from: https://figshare.com/s/0697b692d802861282d3.

  5. Lactobacillus delbrueckii subsp. jakobsenii subsp. nov., isolated from dolo wort, an alcoholic fermented beverage in Burkina Faso.

    PubMed

    Adimpong, David B; Nielsen, Dennis S; Sørensen, Kim I; Vogensen, Finn K; Sawadogo-Lingani, Hagrétou; Derkx, Patrick M F; Jespersen, Lene

    2013-10-01

    Lactobacillus delbrueckii is divided into five subspecies based on phenotypic and genotypic differences. A novel isolate, designated ZN7a-9(T), was isolated from malted sorghum wort used for making an alcoholic beverage (dolo) in Burkina Faso. The results of 16S rRNA gene sequencing, DNA-DNA hybridization and peptidoglycan cell-wall structure type analyses indicated that it belongs to the species L. delbrueckii. The genome sequence of isolate ZN7a-9(T) was determined by Illumina-based sequencing. Multilocus sequence typing (MLST) and split-decomposition analyses were performed on seven concatenated housekeeping genes obtained from the genome sequence of strain ZN7a-9(T) together with 41 additional L. delbrueckii strains. The results of the MLST and split-decomposition analyses could not establish the exact subspecies of L. delbrueckii represented by strain ZN7a-9(T) as it clustered with L. delbrueckii strains unassigned to any of the recognized subspecies of L. delbrueckii. Strain ZN7a-9(T) additionally differed from the recognized type strains of the subspecies of L. delbrueckii with respect to its carbohydrate fermentation profile. In conclusion, the cumulative results indicate that strain ZN7a-9(T) represents a novel subspecies of L. delbrueckii closely related to Lactobacillus delbrueckii subsp. lactis and Lactobacillus delbrueckii subsp. delbrueckii for which the name Lactobacillus delbrueckii subsp. jakobsenii subsp. nov. is proposed. The type strain is ZN7a-9(T) = DSM 26046(T) = LMG 27067(T).

  6. Caulobacter crescentus Cell Cycle-Regulated DNA Methyltransferase Uses a Novel Mechanism for Substrate Recognition.

    PubMed

    Woodcock, Clayton B; Yakubov, Aziz B; Reich, Norbert O

    2017-08-01

    Caulobacter crescentus relies on DNA methylation by the cell cycle-regulated methyltransferase (CcrM) in addition to key transcription factors to control the cell cycle and direct cellular differentiation. CcrM is shown here to efficiently methylate its cognate recognition site 5'-GANTC-3' in single-stranded and hemimethylated double-stranded DNA. We report the K m , k cat , k methylation , and K d for single-stranded and hemimethylated substrates, revealing discrimination of 10 7 -fold for noncognate sequences. The enzyme also shows a similar discrimination against single-stranded RNA. Two independent assays clearly show that CcrM is highly processive with single-stranded and hemimethylated DNA. Collectively, the data provide evidence that CcrM and other DNA-modifying enzymes may use a new mechanism to recognize DNA in a key epigenetic process.

  7. Short-Sequence DNA Repeats in Prokaryotic Genomes

    PubMed Central

    van Belkum, Alex; Scherer, Stewart; van Alphen, Loek; Verbrugh, Henri

    1998-01-01

    Short-sequence DNA repeat (SSR) loci can be identified in all eukaryotic and many prokaryotic genomes. These loci harbor short or long stretches of repeated nucleotide sequence motifs. DNA sequence motifs in a single locus can be identical and/or heterogeneous. SSRs are encountered in many different branches of the prokaryote kingdom. They are found in genes encoding products as diverse as microbial surface components recognizing adhesive matrix molecules and specific bacterial virulence factors such as lipopolysaccharide-modifying enzymes or adhesins. SSRs enable genetic and consequently phenotypic flexibility. SSRs function at various levels of gene expression regulation. Variations in the number of repeat units per locus or changes in the nature of the individual repeat sequences may result from recombination processes or polymerase inadequacy such as slipped-strand mispairing (SSM), either alone or in combination with DNA repair deficiencies. These rather complex phenomena can occur with relative ease, with SSM approaching a frequency of 10−4 per bacterial cell division and allowing high-frequency genetic switching. Bacteria use this random strategy to adapt their genetic repertoire in response to selective environmental pressure. SSR-mediated variation has important implications for bacterial pathogenesis and evolutionary fitness. Molecular analysis of changes in SSRs allows epidemiological studies on the spread of pathogenic bacteria. The occurrence, evolution and function of SSRs, and the molecular methods used to analyze them are discussed in the context of responsiveness to environmental factors, bacterial pathogenicity, epidemiology, and the availability of full-genome sequences for increasing numbers of microorganisms, especially those that are medically relevant. PMID:9618442

  8. footprintDB: a database of transcription factors with annotated cis elements and binding interfaces.

    PubMed

    Sebastian, Alvaro; Contreras-Moreira, Bruno

    2014-01-15

    Traditional and high-throughput techniques for determining transcription factor (TF) binding specificities are generating large volumes of data of uneven quality, which are scattered across individual databases. FootprintDB integrates some of the most comprehensive freely available libraries of curated DNA binding sites and systematically annotates the binding interfaces of the corresponding TFs. The first release contains 2422 unique TF sequences, 10 112 DNA binding sites and 3662 DNA motifs. A survey of the included data sources, organisms and TF families was performed together with proprietary database TRANSFAC, finding that footprintDB has a similar coverage of multicellular organisms, while also containing bacterial regulatory data. A search engine has been designed that drives the prediction of DNA motifs for input TFs, or conversely of TF sequences that might recognize input regulatory sequences, by comparison with database entries. Such predictions can also be extended to a single proteome chosen by the user, and results are ranked in terms of interface similarity. Benchmark experiments with bacterial, plant and human data were performed to measure the predictive power of footprintDB searches, which were able to correctly recover 10, 55 and 90% of the tested sequences, respectively. Correctly predicted TFs had a higher interface similarity than the average, confirming its diagnostic value. Web site implemented in PHP,Perl, MySQL and Apache. Freely available from http://floresta.eead.csic.es/footprintdb.

  9. DNA mutation motifs in the genes associated with inherited diseases.

    PubMed

    Růžička, Michal; Kulhánek, Petr; Radová, Lenka; Čechová, Andrea; Špačková, Naďa; Fajkusová, Lenka; Réblová, Kamila

    2017-01-01

    Mutations in human genes can be responsible for inherited genetic disorders and cancer. Mutations can arise due to environmental factors or spontaneously. It has been shown that certain DNA sequences are more prone to mutate. These sites are termed hotspots and exhibit a higher mutation frequency than expected by chance. In contrast, DNA sequences with lower mutation frequencies than expected by chance are termed coldspots. Mutation hotspots are usually derived from a mutation spectrum, which reflects particular population where an effect of a common ancestor plays a role. To detect coldspots/hotspots unaffected by population bias, we analysed the presence of germline mutations obtained from HGMD database in the 5-nucleotide segments repeatedly occurring in genes associated with common inherited disorders, in particular, the PAH, LDLR, CFTR, F8, and F9 genes. Statistically significant sequences (mutational motifs) rarely associated with mutations (coldspots) and frequently associated with mutations (hotspots) exhibited characteristic sequence patterns, e.g. coldspots contained purine tract while hotspots showed alternating purine-pyrimidine bases, often with the presence of CpG dinucleotide. Using molecular dynamics simulations and free energy calculations, we analysed the global bending properties of two selected coldspots and two hotspots with a G/T mismatch. We observed that the coldspots were inherently more flexible than the hotspots. We assume that this property might be critical for effective mismatch repair as DNA with a mutation recognized by MutSα protein is noticeably bent.

  10. Hand gesture recognition by analysis of codons

    NASA Astrophysics Data System (ADS)

    Ramachandra, Poornima; Shrikhande, Neelima

    2007-09-01

    The problem of recognizing gestures from images using computers can be approached by closely understanding how the human brain tackles it. A full fledged gesture recognition system will substitute mouse and keyboards completely. Humans can recognize most gestures by looking at the characteristic external shape or the silhouette of the fingers. Many previous techniques to recognize gestures dealt with motion and geometric features of hands. In this thesis gestures are recognized by the Codon-list pattern extracted from the object contour. All edges of an image are described in terms of sequence of Codons. The Codons are defined in terms of the relationship between maxima, minima and zeros of curvature encountered as one traverses the boundary of the object. We have concentrated on a catalog of 24 gesture images from the American Sign Language alphabet (Letter J and Z are ignored as they are represented using motion) [2]. The query image given as an input to the system is analyzed and tested against the Codon-lists, which are shape descriptors for external parts of a hand gesture. We have used the Weighted Frequency Indexing Transform (WFIT) approach which is used in DNA sequence matching for matching the Codon-lists. The matching algorithm consists of two steps: 1) the query sequences are converted to short sequences and are assigned weights and, 2) all the sequences of query gestures are pruned into match and mismatch subsequences by the frequency indexing tree based on the weights of the subsequences. The Codon sequences with the most weight are used to determine the most precise match. Once a match is found, the identified gesture and corresponding interpretation are shown as output.

  11. Templated sequence insertion polymorphisms in the human genome

    NASA Astrophysics Data System (ADS)

    Onozawa, Masahiro; Aplan, Peter

    2016-11-01

    Templated Sequence Insertion Polymorphism (TSIP) is a recently described form of polymorphism recognized in the human genome, in which a sequence that is templated from a distant genomic region is inserted into the genome, seemingly at random. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; Class 1 TSIPs show features of insertions that are mediated via the LINE-1 ORF2 protein, including 1) target-site duplication (TSD), 2) polyadenylation 10-30 nucleotides downstream of a “cryptic” polyadenylation signal, and 3) preference for insertion at a 5’-TTTT/A-3’ sequence. In contrast, class 2 TSIPs show features consistent with repair of a DNA double-strand break via insertion of a DNA “patch” that is derived from a distant genomic region. Survey of a large number of normal human volunteers demonstrates that most individuals have 25-30 TSIPs, and that these TSIPs track with specific geographic regions. Similar to other forms of human polymorphism, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases.

  12. Molecular Phylogenetics and Systematics of the Bivalve Family Ostreidae Based on rRNA Sequence-Structure Models and Multilocus Species Tree

    PubMed Central

    Salvi, Daniele; Macali, Armando; Mariottini, Paolo

    2014-01-01

    The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematic of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on a concatenated (ML and BA) and coalescent based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassotreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663

  13. Molecular phylogenetics and systematics of the bivalve family Ostreidae based on rRNA sequence-structure models and multilocus species tree.

    PubMed

    Salvi, Daniele; Macali, Armando; Mariottini, Paolo

    2014-01-01

    The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematic of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on a concatenated (ML and BA) and coalescent based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized [corrected]. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics.

  14. A 'new lease of life': FnCpf1 possesses DNA cleavage activity for genome editing in human cells.

    PubMed

    Tu, Mengjun; Lin, Li; Cheng, Yilu; He, Xiubin; Sun, Huihui; Xie, Haihua; Fu, Junhao; Liu, Changbao; Li, Jin; Chen, Ding; Xi, Haitao; Xue, Dongyu; Liu, Qi; Zhao, Junzhao; Gao, Caixia; Song, Zongming; Qu, Jia; Gu, Feng

    2017-11-02

    Cpf1 nucleases were recently reported to be highly specific and programmable nucleases with efficiencies comparable to those of SpCas9. AsCpf1 and LbCpf1 require a single crRNA and recognize a 5'-TTTN-3' protospacer adjacent motif (PAM) at the 5' end of the protospacer for genome editing. For widespread application in precision site-specific human genome editing, the range of sequences that AsCpf1 and LbCpf1 can recognize is limited due to the size of this PAM. To address this limitation, we sought to identify a novel Cpf1 nuclease with simpler PAM requirements. Specifically, here we sought to test and engineer FnCpf1, one reported Cpf1 nuclease (FnCpf1) only requires 5'-TTN-3' as a PAM but does not exhibit detectable levels of nuclease-induced indels at certain locus in human cells. Surprisingly, we found that FnCpf1 possesses DNA cleavage activity in human cells at multiple loci. We also comprehensively and quantitatively examined various FnCpf1 parameters in human cells, including spacer sequence, direct repeat sequence and the PAM sequence. Our study identifies FnCpf1 as a new member of the Cpf1 family for human genome editing with distinctive characteristics, which shows promise as a genome editing tool with the potential for both research and therapeutic applications. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. A ‘new lease of life’: FnCpf1 possesses DNA cleavage activity for genome editing in human cells

    PubMed Central

    Tu, Mengjun; Lin, Li; Cheng, Yilu; He, Xiubin; Sun, Huihui; Xie, Haihua; Fu, Junhao; Liu, Changbao; Li, Jin; Chen, Ding; Xi, Haitao; Xue, Dongyu; Liu, Qi; Zhao, Junzhao; Gao, Caixia; Song, Zongming; Qu, Jia

    2017-01-01

    Abstract Cpf1 nucleases were recently reported to be highly specific and programmable nucleases with efficiencies comparable to those of SpCas9. AsCpf1 and LbCpf1 require a single crRNA and recognize a 5′-TTTN-3′ protospacer adjacent motif (PAM) at the 5′ end of the protospacer for genome editing. For widespread application in precision site-specific human genome editing, the range of sequences that AsCpf1 and LbCpf1 can recognize is limited due to the size of this PAM. To address this limitation, we sought to identify a novel Cpf1 nuclease with simpler PAM requirements. Specifically, here we sought to test and engineer FnCpf1, one reported Cpf1 nuclease (FnCpf1) only requires 5′-TTN-3′ as a PAM but does not exhibit detectable levels of nuclease-induced indels at certain locus in human cells. Surprisingly, we found that FnCpf1 possesses DNA cleavage activity in human cells at multiple loci. We also comprehensively and quantitatively examined various FnCpf1 parameters in human cells, including spacer sequence, direct repeat sequence and the PAM sequence. Our study identifies FnCpf1 as a new member of the Cpf1 family for human genome editing with distinctive characteristics, which shows promise as a genome editing tool with the potential for both research and therapeutic applications. PMID:28977650

  16. Identification of Clinical Isolates of Actinomyces Species by Amplified 16S Ribosomal DNA Restriction Analysis

    PubMed Central

    Hall, Val; Talbot, P. R.; Stubbs, S. L.; Duerden, B. I.

    2001-01-01

    Amplified 16S ribosomal DNA (rDNA) restriction analysis (ARDRA), using enzymes HaeIII and HpaII, was applied to 176 fresh and 299 stored clinical isolates of putative Actinomyces spp. referred to the Anaerobe Reference Unit of the Public Health Laboratory Service for confirmation of identity. Results were compared with ARDRA results obtained previously for reference strains and with conventional phenotypic reactions. Identities of some strains were confirmed by analysis of partial 16S rDNA sequences. Of the 475 isolates, 331 (70%) were clearly assigned to recognized Actinomyces species, including 94 isolates assigned to six recently described species. A further 52 isolates in 12 ARDRA profiles were designated as apparently resembling recognized species, and 44 isolates, in 18 novel profiles, were confirmed as members of genera other than Actinomyces. The identities of 48 isolates in nine profiles remain uncertain, and they may represent novel species of Actinomyces. For the majority of species, phenotypic results, published reactions for the species, and ARDRA profiles concurred. However, of 113 stored isolates originally identified as A. meyeri or resembling A. meyeri by phenotypic tests, only 21 were confirmed as A. meyeri by ARDRA; 63 were reassigned as A. turicensis, 7 as other recognized species, and 22 as unidentified actinomycetes. Analyses of incidence and clinical associations of Actinomyces spp. add to the currently sparse knowledge of some recently described species. PMID:11574572

  17. A Rapid Method to Test for Chloroplast DNA Involvement in Atrazine Resistance

    PubMed Central

    McNally, Sheila; Bettini, Priscilla; Sevignac, Mireille; Darmency, Henry; Gasquez, Jacques; Dron, Michel

    1987-01-01

    A point mutation in the chloroplast psbA gene at codon 264 resulting in an animo acid substitution (ser-gly) manifests itself as atrazine resistance in all recognized weed species studied to date. The single base substitution overlaps a highly conserved Mae1 restriction site which is present in susceptible but not in resistant plants. This restriction enzyme, recently commercialized, has been used to show that it is now possible to discriminate rapidly between the two biotypes without the need for DNA sequencing. Images Fig. 1 PMID:16665229

  18. RAP80, ubiquitin and SUMO in the DNA damage response.

    PubMed

    Lombardi, Patrick M; Matunis, Michael J; Wolberger, Cynthia

    2017-08-01

    A decade has passed since the first reported connection between RAP80 and BRCA1 in DNA double-strand break repair. Despite the initial identification of RAP80 as a factor localizing BRCA1 to DNA double-strand breaks and potentially promoting homologous recombination, there is increasing evidence that RAP80 instead suppresses homologous recombination to fine-tune the balance of competing DNA repair processes during the S/G 2 phase of the cell cycle. RAP80 opposes homologous recombination by inhibiting DNA end-resection and sequestering BRCA1 into the BRCA1-A complex. Ubiquitin and SUMO modifications of chromatin at DNA double-strand breaks recruit RAP80, which contains distinct sequence motifs that recognize ubiquitin and SUMO. Here, we review RAP80's role in repressing homologous recombination at DNA double-strand breaks and how this role is facilitated by its ability to bind ubiquitin and SUMO modifications.

  19. Lactobacillus delbrueckii subsp. sunkii subsp. nov., isolated from sunki, a traditional Japanese pickle.

    PubMed

    Kudo, Yuko; Oki, Kaihei; Watanabe, Koichi

    2012-11-01

    Although four strains of bacteria isolated from sunki, a traditional Japanese, non-salted pickle, were initially identified as Lactobacillus delbrueckii, the molecular and phenotypic characteristics of the strains did not match those of any of the four recognized subspecies of L. delbrueckii. Together, the results of phenotypic characterization, DNA-DNA hybridizations (in which the relatedness values between the novel strains and type strains of the recognized subspecies of L. delbrueckii were all >88.7%) and 16S rRNA gene sequence, amplified fragment length polymorphism (AFLP) and whole-cell MALDI-TOF/MS spectral pattern analyses indicated that the four novel strains represented a single, novel subspecies, for which the name Lactobacillus delbrueckii subsp. sunkii subsp. nov. is proposed. The type strain is YIT 11221(T) (=JCM 17838(T) =DSM 24966(T)).

  20. Jannaschia seohaensis sp. nov., isolated from a tidal flat sediment.

    PubMed

    Yoon, Jung-Hoon; Kang, So-Jung; Park, Sooyeon; Oh, Ki-Hoon; Oh, Tae-Kwang

    2010-01-01

    A Gram-negative, motile and pleomorphic bacterial strain, SMK-146(T), was isolated from a tidal flat sediment of the Yellow Sea, Korea, and its taxonomic position was investigated. Strain SMK-146(T) grew optimally at pH 7.0-8.0 and 30 degrees C. It contained Q-10 as the predominant ubiquinone and C(18 : 1)omega7c and 11-methyl C(18 : 1)omega7c as the major fatty acids. The major polar lipids were phosphatidylcholine, phosphatidylglycerol and phosphatidylethanolamine. The DNA G+C content was 68.4 mol%. Phylogenetic analysis based on 16S rRNA gene sequences showed that strain SMK-146(T) belongs to the genus Jannaschia. Strain SMK-146(T) exhibited 16S rRNA gene sequence similarity values of 95.3-97.0 % to the type strains of the five recognized Jannaschia species. The mean DNA-DNA relatedness value between strain SMK-146(T) and Jannaschia seosinensis KCCM 42114(T), the closest phylogenetic neighbour, was 17 %. Differential phenotypic properties also revealed that strain SMK-146(T) differs from the recognized Jannaschia species. On the basis of phenotypic, phylogenetic and genetic data, strain SMK-146(T) represents a novel species of the genus Jannaschia, for which the name Jannaschia seohaensis sp. nov. is proposed. The type strain is SMK-146(T) (=KCTC 22172(T) =CCUG 55326(T)).

  1. Conservation genetics of North American freshwater mussels Amblema and Megalonaias

    USGS Publications Warehouse

    Mulvey, M.; Lydeard, C.; Pyer, D.L.; Hicks, K.M.; Brim-Box, J.; Williams, J.D.; Butler, R.S.

    1997-01-01

    Freshwater bivalves are among the most endangered groups of organisms in North America. Efforts to protect the declining mussel fauna are confounded by ambiguities associated with recognition of distinct evolutionary entities or species. This, in part, is due to the paucity of reliable morphological characters for differentiating taxa. We have employed allozymes and DNA sequence data to search for diagnosably distinct evolutionary entities within two problematic genera of unionid mussels, Amblema and Megalonaias. Within the genus Amblema three species are recognized based on our DNA sequence data for the mitochondrial 16S rRNA and allozyme data (Amblema neislerii, A. plicata, and A. elliotti). Only one taxonomically distinct entity is recognized within the genus Megalonaias—M. nervosa. Megalonaias boykiniana of the Apalachicolan Region is not diagnosable and does not warrant specific taxonomic status. Interestingly, Megalonaias from west of the Mississippi River, including the Mississippi, exhibited an allozyme and mtDNA haplotype frequency shift suggestive of an east-west dichotomy. The results of this study eliminate one subspecies of Amblema and increase the range of A. plicata. This should not affect the conservation status of “currently stable” assigned to A. plicata by Williams et al. (1993). The conservation status of A. elliotti needs to be reexamined because its distribution appears to be limited to the Coosa River System in Alabama and Georgia.

  2. The multi-zinc finger protein ZNF217 contacts DNA through a two-finger domain.

    PubMed

    Nunez, Noelia; Clifton, Molly M K; Funnell, Alister P W; Artuz, Crisbel; Hallal, Samantha; Quinlan, Kate G R; Font, Josep; Vandevenne, Marylène; Setiyaputra, Surya; Pearson, Richard C M; Mackay, Joel P; Crossley, Merlin

    2011-11-04

    Classical C2H2 zinc finger proteins are among the most abundant transcription factors found in eukaryotes, and the mechanisms through which they recognize their target genes have been extensively investigated. In general, a tandem array of three fingers separated by characteristic TGERP links is required for sequence-specific DNA recognition. Nevertheless, a significant number of zinc finger proteins do not contain a hallmark three-finger array of this type, raising the question of whether and how they contact DNA. We have examined the multi-finger protein ZNF217, which contains eight classical zinc fingers. ZNF217 is implicated as an oncogene and in repressing the E-cadherin gene. We show that two of its zinc fingers, 6 and 7, can mediate contacts with DNA. We examine its putative recognition site in the E-cadherin promoter and demonstrate that this is a suboptimal site. NMR analysis and mutagenesis is used to define the DNA binding surface of ZNF217, and we examine the specificity of the DNA binding activity using fluorescence anisotropy titrations. Finally, sequence analysis reveals that a variety of multi-finger proteins also contain two-finger units, and our data support the idea that these may constitute a distinct subclass of DNA recognition motif.

  3. The Multi-zinc Finger Protein ZNF217 Contacts DNA through a Two-finger Domain*

    PubMed Central

    Nunez, Noelia; Clifton, Molly M. K.; Funnell, Alister P. W.; Artuz, Crisbel; Hallal, Samantha; Quinlan, Kate G. R.; Font, Josep; Vandevenne, Marylène; Setiyaputra, Surya; Pearson, Richard C. M.; Mackay, Joel P.; Crossley, Merlin

    2011-01-01

    Classical C2H2 zinc finger proteins are among the most abundant transcription factors found in eukaryotes, and the mechanisms through which they recognize their target genes have been extensively investigated. In general, a tandem array of three fingers separated by characteristic TGERP links is required for sequence-specific DNA recognition. Nevertheless, a significant number of zinc finger proteins do not contain a hallmark three-finger array of this type, raising the question of whether and how they contact DNA. We have examined the multi-finger protein ZNF217, which contains eight classical zinc fingers. ZNF217 is implicated as an oncogene and in repressing the E-cadherin gene. We show that two of its zinc fingers, 6 and 7, can mediate contacts with DNA. We examine its putative recognition site in the E-cadherin promoter and demonstrate that this is a suboptimal site. NMR analysis and mutagenesis is used to define the DNA binding surface of ZNF217, and we examine the specificity of the DNA binding activity using fluorescence anisotropy titrations. Finally, sequence analysis reveals that a variety of multi-finger proteins also contain two-finger units, and our data support the idea that these may constitute a distinct subclass of DNA recognition motif. PMID:21908891

  4. Knowledge-Based Elastic Potentials for Docking Drugs or Proteins with Nucleic Acids

    PubMed Central

    Ge, Wei; Schneider, Bohdan; Olson, Wilma K.

    2005-01-01

    Elastic ellipsoidal functions defined by the observed hydration patterns around the DNA bases provide a new basis for measuring the recognition of ligands in the grooves of double-helical structures. Here a set of knowledge-based potentials suitable for quantitative description of such behavior is extracted from the observed positions of water molecules and amino acid atoms that form hydrogen bonds with the nitrogenous bases in high resolution crystal structures. Energies based on the displacement of hydrogen-bonding sites on drugs in DNA-crystal complexes relative to the preferred locations of water binding around the heterocyclic bases are low, pointing to the reliability of the potentials and the apparent displacement of water molecules by drug atoms in these structures. The validity of the energy functions has been further examined in a series of sequence substitution studies based on the structures of DNA bound to polyamides that have been designed to recognize the minor-groove edges of Watson-Crick basepairs. The higher energies of binding to incorrect sequences superimposed (without conformational adjustment or displacement of polyamide ligands) on observed high resolution structures confirm the hypothesis that the drug subunits associate with specific DNA bases. The knowledge-based functions also account satisfactorily for the measured free energies of DNA-polyamide association in solution and the observed sites of polyamide binding on nucleosomal DNA. The computations are generally consistent with mechanisms by which minor-groove binding ligands are thought to recognize DNA basepairs. The calculations suggest that the asymmetric distributions of hydrogen-bond-forming atoms on the minor-groove edge of the basepairs may underlie ligand discrimination of G·C from C·G pairs, in addition to the commonly believed role of steric hindrance. The analysis of polyamide-bound nucleosomal structures reveals other discrepancies in the expected chemical design, including unexpected contacts to DNA and modified basepair targets of some ligands. The ellipsoidal potentials thus appear promising as a mathematical tool for the study of drug- and protein-DNA interactions and for gaining new insights into DNA-binding mechanisms. PMID:15501936

  5. Halobacillus alkaliphilus sp. nov., a halophilic bacterium isolated from a salt lake in Fuente de Piedra, southern Spain.

    PubMed

    Romano, Ida; Finore, Ilaria; Nicolaus, Giancarlo; Huertas, F Javier; Lama, Licia; Nicolaus, Barbara; Poli, Annarita

    2008-04-01

    A Gram-positive, spore-forming, halophilic bacterial strain, FP5T, was isolated from a salt lake in southern Spain and subjected to a polyphasic taxonomic study. Strain FP5T was strictly aerobic. Cells were coccoidal, occurring singly or in clusters. The cell-wall peptidoglycan type of strain FP5T was A4 beta based on l-Orn-d-Asp. Strain FP5T was characterized chemotaxonomically by having MK-7 as the major menaquinone and anteiso-C15 : 0, anteiso-C17 : 0, iso-C15 : 0 and iso-C16 : 0 as the main fatty acids. The isolate grew optimally at 37 degrees C and in presence of 10 % NaCl; no growth was observed in the absence of NaCl. The DNA G+C content was 43.5 mol%. Phylogenetic analyses based on 16S rRNA gene sequences showed that strain FP5T falls within the evolutionary radiation of species of the genus Halobacillus. Levels of 16S rRNA gene sequence similarity between strain FP5T and the type strains of nine recognized Halobacillus species were in the range 97.0-99.0 %. Levels of DNA-DNA relatedness indicated that strain FP5T represents a genomic species that is distinct from recognized Halobacillus species. Strain FP5T could be differentiated from recognized Halobacillus species based on several phenotypic characteristics. On the basis of phenotypic, phylogenetic and genomic data, strain FP5T is considered to represent a novel species of the genus Halobacillus, for which the name Halobacillus alkaliphilus sp. nov. is proposed. The type strain is FP5T (=DSM 18525T =ATCC BAA-1361T).

  6. Molecular Dynamics Simulations of DNA-Free and DNA-Bound TAL Effectors

    PubMed Central

    Wan, Hua; Hu, Jian-ping; Li, Kang-shun; Tian, Xu-hong; Chang, Shan

    2013-01-01

    TAL (transcriptional activator-like) effectors (TALEs) are DNA-binding proteins, containing a modular central domain that recognizes specific DNA sequences. Recently, the crystallographic studies of TALEs revealed the structure of DNA-recognition domain. In this article, molecular dynamics (MD) simulations are employed to study two crystal structures of an 11.5-repeat TALE, in the presence and absence of DNA, respectively. The simulated results indicate that the specific binding of RVDs (repeat-variable diresidues) with DNA leads to the markedly reduced fluctuations of tandem repeats, especially at the two ends. In the DNA-bound TALE system, the base-specific interaction is formed mainly by the residue at position 13 within a TAL repeat. Tandem repeats with weak RVDs are unfavorable for the TALE-DNA binding. These observations are consistent with experimental studies. By using principal component analysis (PCA), the dominant motions are open-close movements between the two ends of the superhelical structure in both DNA-free and DNA-bound TALE systems. The open-close movements are found to be critical for the recognition and binding of TALE-DNA based on the analysis of free energy landscape (FEL). The conformational analysis of DNA indicates that the 5′ end of DNA target sequence has more remarkable structural deformability than the other sites. Meanwhile, the conformational change of DNA is likely associated with the specific interaction of TALE-DNA. We further suggest that the arrangement of N-terminal repeats with strong RVDs may help in the design of efficient TALEs. This study provides some new insights into the understanding of the TALE-DNA recognition mechanism. PMID:24130757

  7. Genetic and epigenetic mutations affect the DNA binding capability of human ZFP57 in transient neonatal diabetes type 1

    PubMed Central

    Baglivo, Ilaria; Esposito, Sabrina; De Cesare, Lucia; Sparago, Angela; Anvar, Zahra; Riso, Vincenzo; Cammisa, Marco; Fattorusso, Roberto; Grimaldi, Giovanna; Riccio, Andrea; Pedone, Paolo V.

    2013-01-01

    In the mouse, ZFP57 contains three classical Cys2His2 zinc finger domains (ZF) and recognizes the methylated TGCmetCGC target sequence using the first and the second ZFs. In this study, we demonstrate that the human ZFP57 (hZFP57) containing six Cys2His2 ZFs, binds the same methylated sequence through the third and the fourth ZFs, and identify the aminoacids critical for DNA interaction. In addition, we present evidences indicating that hZFP57 mutations and hypomethylation of the TNDM1 ICR both associated with Transient Neonatal Diabetes Mellitus type 1 result in loss of hZFP57 binding to the TNDM1 locus, likely causing PLAGL1 activation. PMID:23499433

  8. Recognition of DNA bulges by dinuclear iron(II) metallosupramolecular helicates.

    PubMed

    Malina, Jaroslav; Hannon, Michael J; Brabec, Viktor

    2014-02-01

    Bulged DNA structures are of general biological significance because of their important roles in a number of biochemical processes. Compounds capable of targeting bulged DNA sequences can be used as probes for studying their role in nucleic acid function, or could even have significant therapeutic potential. The interaction of [Fe(2)L(3)](4+) metallosupramolecular helicates (L = C(25)H(20)N(4)) with DNA duplexes containing bulges has been studied by measurement of the DNA melting temperature and gel electrophoresis. This study was aimed at exploring binding affinities of the helicates for DNA bulges of various sizes and nucleotide sequences. The studies reported herein reveal that both enantiomers of [Fe(2)L(3)](4+) bind to DNA bulges containing at least two unpaired nucleotides. In addition, these helicates show considerably enhanced affinity for duplexes containing unpaired pyrimidines in the bulge and/or pyrimidines flanking the bulge on both sides. We suggest that the bulge creates the structural motif, such as the triangular prismatic pocket formed by the unpaired bulge bases, to accommodate the [Fe(2)L(3)](4+) helicate molecule, and is probably responsible for the affinity for duplexes with a varying number of bulge bases. Our results reveal that DNA bulges represent another example of unusual DNA structures recognized by dinuclear iron(II) ([Fe(2)L(3)](4+)) supramolecular helicates. © 2013 FEBS.

  9. DNA Barcoding the Geometrid Fauna of Bavaria (Lepidoptera): Successes, Surprises, and Questions

    PubMed Central

    Hausmann, Axel; Haszprunar, Gerhard; Hebert, Paul D. N.

    2011-01-01

    Background The State of Bavaria is involved in a research program that will lead to the construction of a DNA barcode library for all animal species within its territorial boundaries. The present study provides a comprehensive DNA barcode library for the Geometridae, one of the most diverse of insect families. Methodology/Principal Findings This study reports DNA barcodes for 400 Bavarian geometrid species, 98 per cent of the known fauna, and approximately one per cent of all Bavarian animal species. Although 98.5% of these species possess diagnostic barcode sequences in Bavaria, records from neighbouring countries suggest that species-level resolution may be compromised in up to 3.5% of cases. All taxa which apparently share barcodes are discussed in detail. One case of modest divergence (1.4%) revealed a species overlooked by the current taxonomic system: Eupithecia goossensiata Mabille, 1869 stat.n. is raised from synonymy with Eupithecia absinthiata (Clerck, 1759) to species rank. Deep intraspecific sequence divergences (>2%) were detected in 20 traditionally recognized species. Conclusions/Significance The study emphasizes the effectiveness of DNA barcoding as a tool for monitoring biodiversity. Open access is provided to a data set that includes records for 1,395 geometrid specimens (331 species) from Bavaria, with 69 additional species from neighbouring regions. Taxa with deep intraspecific sequence divergences are undergoing more detailed analysis to ascertain if they represent cases of cryptic diversity. PMID:21423340

  10. A phylogenetic study of Laeliinae (Orchidaceae) based on combined nuclear and plastid DNA sequences

    PubMed Central

    van den Berg, Cássio; Higgins, Wesley E.; Dressler, Robert L.; Whitten, W. Mark; Soto-Arenas, Miguel A.; Chase, Mark W.

    2009-01-01

    Background and Aims Laeliinae are a neotropical orchid subtribe with approx. 1500 species in 50 genera. In this study, an attempt is made to assess generic alliances based on molecular phylogenetic analysis of DNA sequence data. Methods Six DNA datasets were gathered: plastid trnL intron, trnL-F spacer, matK gene and trnK introns upstream and dowstream from matK and nuclear ITS rDNA. Data were analysed with maximum parsimony (MP) and Bayesian analysis with mixed models (BA). Key Results Although relationships between Laeliinae and outgroups are well supported, within the subtribe sequence variation is low considering the broad taxonomic range covered. Localized incongruence between the ITS and plastid trees was found. A combined tree followed the ITS trees more closely, but the levels of support obtained with MP were low. The Bayesian analysis recovered more well-supported nodes. The trees from combined MP and BA allowed eight generic alliances to be recognized within Laeliinae, all of which show trends in morphological characters but lack unambiguous synapomorphies. Conclusions By using combined plastid and nuclear DNA data in conjunction with mixed-models Bayesian inference, it is possible to delimit smaller groups within Laeliinae and discuss general patterns of pollination and hybridization compatibility. Furthermore, these small groups can now be used for further detailed studies to explain morphological evolution and diversification patterns within the subtribe. PMID:19423551

  11. Secondary structure model of the RNA recognized by the reverse transcriptase from the R2 retrotransposable element.

    PubMed Central

    Mathews, D H; Banerjee, A R; Luan, D D; Eickbush, T H; Turner, D H

    1997-01-01

    RNA transcripts corresponding to the 250-nt 3' untranslated region of the R2 non-LTR retrotransposable element are recognized by the R2 reverse transcriptase and are sufficient to serve as templates in the target DNA-primed reverse transcription (TPRT) reaction. The R2 protein encoded by the Bombyx mori R2 can recognize this region from both the B. mori and Drosophila melanogaster R2 elements even though these regions show little nucleotide sequence identity. A model for the RNA secondary structure of the 3' untranslated region of the D. melanogaster R2 retrotransposon was developed by sequence comparison of 10 species aided by free energy minimization. Chemical modification experiments are consistent with this prediction. A secondary structure model for the 3' untranslated region of R2 RNA from the R2 element from B. mori was obtained by a combination of chemical modification data and free energy minimization. These two secondary structure models, found independently, share several common sites. This study shows the utility of combining free energy minimization, sequence comparison, and chemical modification to model an RNA secondary structure. PMID:8990394

  12. Opaque-2 is a transcriptional activator that recognizes a specific target site in 22-kD zein genes.

    PubMed Central

    Schmidt, R J; Ketudat, M; Aukerman, M J; Hoschek, G

    1992-01-01

    opaque-2 (o2) is a regulatory locus in maize that plays an essential role in controlling the expression of genes encoding the 22-kD zein proteins. Through DNase I footprinting and DNA binding analyses, we have identified the binding site for the O2 protein (O2) in the promoter of 22-kD zein genes. The sequence in the 22-kD zein gene promoter that is recognized by O2 is similar to the target site recognized by other "basic/leucine zipper" (bZIP) proteins in that it contains an ACGT core that is necessary for DNA binding. The site is located in the -300 region relative to the translation start and lies about 20 bp downstream of the highly conserved zein gene sequence motif known as the "prolamin box." Employing gel mobility shift assays, we used O2 antibodies and nuclear extracts from an o2 null mutant to demonstrate that the O2 protein in maize endosperm nuclei recognizes the target site in the zein gene promoter. Mobility shift assays using nuclear proteins from an o2 null mutant indicated that other endosperm proteins in addition to O2 can bind the O2 target site and that O2 may be associated with one of these proteins. We also demonstrated that in yeast cells the O2 protein can activate expression of a lacZ gene containing a multimer of the O2 target sequence as part of its promoter, thus confirming its role as a transcriptional activator. A computer-assisted search indicated that the O2 target site is not present in the promoters of zein genes other than those of the 22-kD class. These data suggest a likely explanation at the molecular level for the differential effect of o2 mutations on expression of certain members of the zein gene family. PMID:1392590

  13. The oxidative DNA glycosylases of Mycobacterium tuberculosis exhibit different substrate preferences from their Escherichia coli counterparts

    PubMed Central

    Guo, Yin; Bandaru, Viswanath; Jaruga, Pawel; Zhao, Xiaobei; Burrows, Cynthia J.; Iwai, Shigenori; Dizdaroglu, Miral; Bond, Jeffrey P.; Wallace, Susan S.

    2010-01-01

    The DNA glycosylases that remove oxidized DNA bases fall into two general families: the Fpg/Nei family and the Nth superfamily. Based on protein sequence alignments, we identified four putative Fpg/Nei family members, as well as a putative Nth protein in Mycobacterium tuberculosis H37Rv. All four Fpg/Nei proteins were successfully overexpressed using a bicistronic vector created in our laboratory. The MtuNth protein was also overexpressed in soluble form. The substrate specificities of the purified enzymes were characterized in vitro with oligodeoxynucleotide substrates containing single lesions. Some were further characterized by gas chromatography/mass spectrometry (GC/MS) analysis of products released from γ-irradiated DNA. MtuFpg1 has a substrate specificity similar to that of EcoFpg. Both EcoFpg and MtuFpg1 are more efficient at removing spiroiminodihydantoin (Sp) than 7,8-dihydro-8-oxoguanine (8-oxoG). However, MtuFpg1 shows a substantially increased opposite base discrimination compared to EcoFpg. MtuFpg2 contains only the C-terminal domain of an Fpg protein and has no detectable DNA binding activity or DNA glycosylase/lyase activity and thus appears to be a pseudogene. MtuNei1 recognizes oxidized pyrimidines on both double-stranded and single-stranded DNA and exhibits uracil DNA glycosylase activity. MtuNth recognizes a variety of oxidized bases, including urea, 5,6-dihydrouracil (DHU), 5-hydroxyuracil (5-OHU), 5-hydroxycytosine (5-OHC) and methylhydantoin (MeHyd). Both MtuNei1 and MtuNth excise thymine glycol (Tg); however, MtuNei1 strongly prefers the (5R) isomers, whereas MtuNth recognizes only the (5S) isomers. MtuNei2 did not demonstrate activity in vitro as a recombinant protein, but like MtuNei1 when expressed in Escherichia coli, it decreased the spontaneous mutation frequency of both the fpg mutY nei triple and nei nth double mutants, suggesting that MtuNei2 is functionally active in vivo recognizing both guanine and cytosine oxidation products. The kinetic parameters of the MtuFpg1, MtuNei1 and MtuNth proteins on selected substrates were also determined and compared to those of their E. coli homologs. PMID:20031487

  14. Titanium Dioxide Nanoparticle-Based Interdigitated Electrodes: A Novel Current to Voltage DNA Biosensor Recognizes E. coli O157:H7.

    PubMed

    Nadzirah, Sh; Azizah, N; Hashim, Uda; Gopinath, Subash C B; Kashif, Mohd

    2015-01-01

    Nanoparticle-mediated bio-sensing promoted the development of novel sensors in the front of medical diagnosis. In the present study, we have generated and examined the potential of titanium dioxide (TiO2) crystalline nanoparticles with aluminium interdigitated electrode biosensor to specifically detect single-stranded E.coli O157:H7 DNA. The performance of this novel DNA biosensor was measured the electrical current response using a picoammeter. The sensor surface was chemically functionalized with (3-aminopropyl) triethoxysilane (APTES) to provide contact between the organic and inorganic surfaces of a single-stranded DNA probe and TiO2 nanoparticles while maintaining the sensing system's physical characteristics. The complement of the target DNA of E. coli O157:H7 to the carboxylate-probe DNA could be translated into electrical signals and confirmed by the increased conductivity in the current-to-voltage curves. The specificity experiments indicate that the biosensor can discriminate between the complementary sequences from the base-mismatched and the non-complementary sequences. After duplex formation, the complementary target sequence can be quantified over a wide range with a detection limit of 1.0 x 10(-13)M. With target DNA from the lysed E. coli O157:H7, we could attain similar sensitivity. Stability of DNA immobilized surface was calculated with the relative standard deviation (4.6%), displayed the retaining with 99% of its original response current until 6 months. This high-performance interdigitated DNA biosensor with high sensitivity, stability and non-fouling on a novel sensing platform is suitable for a wide range of biomolecular interactive analyses.

  15. Highly parallel single-molecule amplification approach based on agarose droplet polymerase chain reaction for efficient and cost-effective aptamer selection.

    PubMed

    Zhang, Wei Yun; Zhang, Wenhua; Liu, Zhiyuan; Li, Cong; Zhu, Zhi; Yang, Chaoyong James

    2012-01-03

    We have developed a novel method for efficiently screening affinity ligands (aptamers) from a complex single-stranded DNA (ssDNA) library by employing single-molecule emulsion polymerase chain reaction (PCR) based on the agarose droplet microfluidic technology. In a typical systematic evolution of ligands by exponential enrichment (SELEX) process, the enriched library is sequenced first, and tens to hundreds of aptamer candidates are analyzed via a bioinformatic approach. Possible candidates are then chemically synthesized, and their binding affinities are measured individually. Such a process is time-consuming, labor-intensive, inefficient, and expensive. To address these problems, we have developed a highly efficient single-molecule approach for aptamer screening using our agarose droplet microfluidic technology. Statistically diluted ssDNA of the pre-enriched library evolved through conventional SELEX against cancer biomarker Shp2 protein was encapsulated into individual uniform agarose droplets for droplet PCR to generate clonal agarose beads. The binding capacity of amplified ssDNA from each clonal bead was then screened via high-throughput fluorescence cytometry. DNA clones with high binding capacity and low K(d) were chosen as the aptamer and can be directly used for downstream biomedical applications. We have identified an ssDNA aptamer that selectively recognizes Shp2 with a K(d) of 24.9 nM. Compared to a conventional sequencing-chemical synthesis-screening work flow, our approach avoids large-scale DNA sequencing and expensive, time-consuming DNA synthesis of large populations of DNA candidates. The agarose droplet microfluidic approach is thus highly efficient and cost-effective for molecular evolution approaches and will find wide application in molecular evolution technologies, including mRNA display, phage display, and so on. © 2011 American Chemical Society

  16. Titanium Dioxide Nanoparticle-Based Interdigitated Electrodes: A Novel Current to Voltage DNA Biosensor Recognizes E. coli O157:H7

    PubMed Central

    Nadzirah, Sh.; Azizah, N.; Hashim, Uda; Gopinath, Subash C. B.; Kashif, Mohd

    2015-01-01

    Nanoparticle-mediated bio-sensing promoted the development of novel sensors in the front of medical diagnosis. In the present study, we have generated and examined the potential of titanium dioxide (TiO2) crystalline nanoparticles with aluminium interdigitated electrode biosensor to specifically detect single-stranded E.coli O157:H7 DNA. The performance of this novel DNA biosensor was measured the electrical current response using a picoammeter. The sensor surface was chemically functionalized with (3-aminopropyl) triethoxysilane (APTES) to provide contact between the organic and inorganic surfaces of a single-stranded DNA probe and TiO2 nanoparticles while maintaining the sensing system’s physical characteristics. The complement of the target DNA of E. coli O157:H7 to the carboxylate-probe DNA could be translated into electrical signals and confirmed by the increased conductivity in the current-to-voltage curves. The specificity experiments indicate that the biosensor can discriminate between the complementary sequences from the base-mismatched and the non-complementary sequences. After duplex formation, the complementary target sequence can be quantified over a wide range with a detection limit of 1.0 x 10-13M. With target DNA from the lysed E. coli O157:H7, we could attain similar sensitivity. Stability of DNA immobilized surface was calculated with the relative standard deviation (4.6%), displayed the retaining with 99% of its original response current until 6 months. This high-performance interdigitated DNA biosensor with high sensitivity, stability and non-fouling on a novel sensing platform is suitable for a wide range of biomolecular interactive analyses. PMID:26445455

  17. Identification and analysis of cytochrome P450IID6 antigenic sites recognized by anti-liver-kidney microsome type-1 antibodies (LKM1).

    PubMed

    Yamamoto, A M; Cresteil, D; Boniface, O; Clerc, F F; Alvarez, F

    1993-05-01

    Anti-liver-kidney microsome type-1 antibodies (LKM1), present in sera from a group of patients with autoimmune hepatitis, are directed against P450IID6. Previous work, using cDNA constructions spanning most of the P450IID6 protein defined the main immunogenic site between the amino acids (aa), 254-271 and predicted the presence of other putative immunogenic sites in the molecule. Fusion proteins from new cDNA constructions, spanning so-far-untested regions between aa 1-125 and 431-522, were not recognized by LKM1-positive sera. Synthetic peptides, representing sequences from putative immunogenic regions or previously untested regions, allowed a precise definition of four antigenic sites located between peptides 257-269, 321-351, 373-389 and 410-429, which were recognized, respectively, by 14, 8, 1 and 2 out of 15 LKM1-positive sera tested. The minimal sequence of the main antigenic site (peptide 257-269) recognized by the autoantibody was established to be WDPAQPPRD (peptide 262-270). In addition, deletion and replacement experiments showed that aa 263 (Asp) was essential for the binding of the autoantibody to peptide 262-270. Analysis of the second most frequently recognized peptide between aa 321-351, was performed using peptides 321-339 and 340-351 in competitive inhibition studies. Complete elimination of antibody binding to peptide 321-351 obtained by absorption of both shorter peptides indicated that peptide 321-351 is a discontinuous antigenic site. LKM1-positive sera reacting against peptide 321-351 recognized either both the shorter peptides or just one of them preferentially. Results of the present study suggest that the production of LKM1 antibodies is an antigen-driven, poly- or oligoclonal B cell response. The identification of antigenic sites will allow: (i) the development of specific diagnostic tests and (ii) further studies on the pathogenic value of LKM1 antibodies in autoimmune hepatitis.

  18. Polymorphism of Paramecium pentaurelia (Ciliophora, Oligohymenophorea) strains revealed by rDNA and mtDNA sequences.

    PubMed

    Przyboś, Ewa; Tarcz, Sebastian; Greczek-Stachura, Magdalena; Surmacz, Marta

    2011-05-01

    Paramecium pentaurelia is one of 15 known sibling species of the Paramecium aurelia complex. It is recognized as a species showing no intra-specific differentiation on the basis of molecular fingerprint analyses, whereas the majority of other species are polymorphic. This study aimed at assessing genetic polymorphism within P. pentaurelia including new strains recently found in Poland (originating from two water bodies, different years, seasons, and clones of one strain) as well as strains collected from distant habitats (USA, Europe, Asia), and strains representing other species of the complex. We compared two DNA fragments: partial sequences (349 bp) of the LSU rDNA and partial sequences (618 bp) of cytochrome B gene. A correlation between the geographical origin of the strains and the genetic characteristics of their genotypes was not observed. Different genotypes were found in Kraków in two types of water bodies (Opatkowice-natural pond; Jordan's Park-artificial pond). Haplotype diversity within a single water body was not recorded. Likewise, seasonal haplotype differences between the strains within the artificial water body, as well as differences between clones originating from one strain, were not detected. The clustering of some strains belonging to different species was observed in the phylogenies. Copyright © 2010 Elsevier GmbH. All rights reserved.

  19. The chemical structure of DNA sequence signals for RNA transcription

    NASA Technical Reports Server (NTRS)

    George, D. G.; Dayhoff, M. O.

    1982-01-01

    The proposed recognition sites for RNA transcription for E. coli NRA polymerase, bacteriophage T7 RNA polymerase, and eukaryotic RNA polymerase Pol II are evaluated in the light of the requirements for efficient recognition. It is shown that although there is good experimental evidence that specific nucleic acid sequence patterns are involved in transcriptional regulation in bacteria and bacterial viruses, among the sequences now available, only in the case of the promoters recognized by bacteriophage T7 polymerase does it seem likely that the pattern is sufficient. It is concluded that the eukaryotic pattern that is investigated is not restrictive enough to serve as a recognition site.

  20. Quantitative characterization of conformational-specific protein-DNA binding using a dual-spectral interferometric imaging biosensor

    NASA Astrophysics Data System (ADS)

    Zhang, Xirui; Daaboul, George G.; Spuhler, Philipp S.; Dröge, Peter; Ünlü, M. Selim

    2016-03-01

    DNA-binding proteins play crucial roles in the maintenance and functions of the genome and yet, their specific binding mechanisms are not fully understood. Recently, it was discovered that DNA-binding proteins recognize specific binding sites to carry out their functions through an indirect readout mechanism by recognizing and capturing DNA conformational flexibility and deformation. High-throughput DNA microarray-based methods that provide large-scale protein-DNA binding information have shown effective and comprehensive analysis of protein-DNA binding affinities, but do not provide information of DNA conformational changes in specific protein-DNA complexes. Building on the high-throughput capability of DNA microarrays, we demonstrate a quantitative approach that simultaneously measures the amount of protein binding to DNA and nanometer-scale DNA conformational change induced by protein binding in a microarray format. Both measurements rely on spectral interferometry on a layered substrate using a single optical instrument in two distinct modalities. In the first modality, we quantitate the amount of binding of protein to surface-immobilized DNA in each DNA spot using a label-free spectral reflectivity technique that accurately measures the surface densities of protein and DNA accumulated on the substrate. In the second modality, for each DNA spot, we simultaneously measure DNA conformational change using a fluorescence vertical sectioning technique that determines average axial height of fluorophores tagged to specific nucleotides of the surface-immobilized DNA. The approach presented in this paper, when combined with current high-throughput DNA microarray-based technologies, has the potential to serve as a rapid and simple method for quantitative and large-scale characterization of conformational specific protein-DNA interactions.DNA-binding proteins play crucial roles in the maintenance and functions of the genome and yet, their specific binding mechanisms are not fully understood. Recently, it was discovered that DNA-binding proteins recognize specific binding sites to carry out their functions through an indirect readout mechanism by recognizing and capturing DNA conformational flexibility and deformation. High-throughput DNA microarray-based methods that provide large-scale protein-DNA binding information have shown effective and comprehensive analysis of protein-DNA binding affinities, but do not provide information of DNA conformational changes in specific protein-DNA complexes. Building on the high-throughput capability of DNA microarrays, we demonstrate a quantitative approach that simultaneously measures the amount of protein binding to DNA and nanometer-scale DNA conformational change induced by protein binding in a microarray format. Both measurements rely on spectral interferometry on a layered substrate using a single optical instrument in two distinct modalities. In the first modality, we quantitate the amount of binding of protein to surface-immobilized DNA in each DNA spot using a label-free spectral reflectivity technique that accurately measures the surface densities of protein and DNA accumulated on the substrate. In the second modality, for each DNA spot, we simultaneously measure DNA conformational change using a fluorescence vertical sectioning technique that determines average axial height of fluorophores tagged to specific nucleotides of the surface-immobilized DNA. The approach presented in this paper, when combined with current high-throughput DNA microarray-based technologies, has the potential to serve as a rapid and simple method for quantitative and large-scale characterization of conformational specific protein-DNA interactions. Electronic supplementary information (ESI) available: DNA sequences and nomenclature (Table 1S); SDS-PAGE assay of IHF stock solution (Fig. 1S); determination of the concentration of IHF stock solution by Bradford assay (Fig. 2S); equilibrium binding isotherm fitting results of other DNA sequences (Table 2S); calculation of dissociation constants (Fig. 3S, 4S; Table 2S); geometric model for quantitation of DNA bending angle induced by specific IHF binding (Fig. 4S); customized flow cell assembly (Fig. 5S); real-time measurement of average fluorophore height change by SSFM (Fig. 6S); summary of binding parameters obtained from additive isotherm model fitting (Table 3S); average surface densities of 10 dsDNA spots and bound IHF at equilibrium (Table 4S); effects of surface densities on the binding and bending of dsDNA (Tables 5S, 6S and Fig. 7S-10S). See DOI: 10.1039/c5nr06785e

  1. Archaeal RNA polymerase arrests transcription at DNA lesions.

    PubMed

    Gehring, Alexandra M; Santangelo, Thomas J

    2017-01-01

    Transcription elongation is not uniform and transcription is often hindered by protein-bound factors or DNA lesions that limit translocation and impair catalysis. Despite the high degree of sequence and structural homology of the multi-subunit RNA polymerases (RNAP), substantial differences in response to DNA lesions have been reported. Archaea encode only a single RNAP with striking structural conservation with eukaryotic RNAP II (Pol II). Here, we demonstrate that the archaeal RNAP from Thermococcus kodakarensis is sensitive to a variety of DNA lesions that pause and arrest RNAP at or adjacent to the site of DNA damage. DNA damage only halts elongation when present in the template strand, and the damage often results in RNAP arresting such that the lesion would be encapsulated with the transcription elongation complex. The strand-specific halt to archaeal transcription elongation on modified templates is supportive of RNAP recognizing DNA damage and potentially initiating DNA repair through a process akin to the well-described transcription-coupled DNA repair (TCR) pathways in Bacteria and Eukarya.

  2. Structural basis of DNA sequence recognition by the response regulator PhoP in Mycobacterium tuberculosis.

    PubMed

    He, Xiaoyuan; Wang, Liqin; Wang, Shuishu

    2016-04-15

    The transcriptional regulator PhoP is an essential virulence factor in Mycobacterium tuberculosis, and it presents a target for the development of new anti-tuberculosis drugs and attenuated tuberculosis vaccine strains. PhoP binds to DNA as a highly cooperative dimer by recognizing direct repeats of 7-bp motifs with a 4-bp spacer. To elucidate the PhoP-DNA binding mechanism, we determined the crystal structure of the PhoP-DNA complex. The structure revealed a tandem PhoP dimer that bound to the direct repeat. The surprising tandem arrangement of the receiver domains allowed the four domains of the PhoP dimer to form a compact structure, accounting for the strict requirement of a 4-bp spacer and the highly cooperative binding of the dimer. The PhoP-DNA interactions exclusively involved the effector domain. The sequence-recognition helix made contact with the bases of the 7-bp motif in the major groove, and the wing interacted with the adjacent minor groove. The structure provides a starting point for the elucidation of the mechanism by which PhoP regulates the virulence of M. tuberculosis and guides the design of screening platforms for PhoP inhibitors.

  3. Sphingomonas azotifigens sp. nov., a nitrogen-fixing bacterium isolated from the roots of Oryza sativa.

    PubMed

    Xie, Cheng-Hui; Yokota, Akira

    2006-04-01

    Three yellow-pigmented strains associated with rice plants were characterized by using a polyphasic approach. The nitrogen-fixing abilities of these strains were confirmed by acetylene reduction assay and nifH gene detection. The three strains were found to be very closely related, with 99.9 % 16S rRNA gene sequence similarity and greater than 70 % DNA-DNA hybridization values, suggesting that the three strains represent a single species. 16S rRNA gene sequence analysis indicated that the strains were closely related to Sphingomonas trueperi, with 99.5 % similarity. The chemotaxonomic characteristics (G+C content of the DNA of 68.0 mol%, ubiquinone Q-10 system, 2-OH as the only hydroxy fatty acid and homospermidine as the sole polyamine) were similar to those of members of the genus Sphingomonas. Based on DNA-DNA hybridization values and physiological characteristics, the three novel strains could be differentiated from other recognized species of the genus Sphingomonas. The name Sphingomonas azotifigens sp. nov. is proposed to accommodate these bacterial strains; the type strain is Y39T (=NBRC 15497T = IAM 15283T = CCTCC AB205007T).

  4. Lysobacter spongiicola sp. nov., isolated from a deep-sea sponge.

    PubMed

    Romanenko, Lyudmila A; Uchino, Masataka; Tanaka, Naoto; Frolova, Galina M; Mikhailov, Valery V

    2008-02-01

    An aerobic, Gram-negative bacterium, strain KMM 329(T), was isolated from a deep-sea sponge specimen from the Philippine Sea and subjected to a polyphasic taxonomic investigation. Comparative 16S rRNA gene sequence analysis showed that strain KMM 329(T) clustered with the species of the genus Lysobacter. The highest level of 16S rRNA gene sequence similarity (97.0 %) was found with respect to Lysobacter concretionis KCTC 12205(T); lower values (96.4-95.2 %) were obtained with respect to the other recognized Lysobacter species. The value for DNA-DNA relatedness between strain KMM 329(T) and L. concretionis KCTC 12205(T) was 47 %. Branched fatty acids 16 : 0 iso, 15 : 0 iso, 11 : 0 iso 3-OH and 17 : 1 iso were found to be predominant. Strain KMM 329(T) had a DNA G+C content of 69.0 mol%. On the basis of the phenotypic, chemotaxonomic, DNA-DNA hybridization and phylogenetic data, strain KMM 329(T) represents a novel species of the genus Lysobacter, for which the name Lysobacter spongiicola sp. nov. is proposed. The type strain is KMM 329(T) (=NRIC 0728(T) =JCM 14760(T)).

  5. Paenibacillus sonchi sp. nov., a nitrogen-fixing species isolated from the rhizosphere of Sonchus oleraceus.

    PubMed

    Hong, Yuan-Yuan; Ma, Yu-Chao; Zhou, Yu-Guang; Gao, Fei; Liu, Hong-Can; Chen, San-Feng

    2009-11-01

    A nitrogen-fixing bacterium, designated strain X19-5(T), was isolated from rhizosphere soil of Sonchus oleraceus. Phylogenetic analysis based on a fragment of the nifH gene and the full-length 16S rRNA gene sequence revealed that strain X19-5(T) was a member of the genus Paenibacillus. Strain X19-5(T) showed the highest 16S rRNA gene sequence similarity (98.8 %) with Paenibacillus graminis RSA19(T) and below 97 % similarity with other recognized members of the genus. The level of DNA-DNA relatedness between strain X19-5(T) and P. graminis RSA19(T) was 45.7 %. The DNA G+C content of strain X19-5(T) was 46.8 mol%. The major fatty acids were anteiso-C(15 : 0), C(16 : 0) and iso-C(16 : 0). On the basis of its phenotypic characteristics and the level of DNA-DNA hybridization, strain X19-5(T) is considered to represent a novel species of the genus Paenibacillus, for which the name Paenibacillus sonchi sp. nov. is proposed. The type strain is X19-5(T) (=CCBAU 83901(T)=LMG 24727(T)).

  6. Novel features of ARS selection in budding yeast Lachancea kluyveri

    PubMed Central

    2011-01-01

    Background The characterization of DNA replication origins in yeast has shed much light on the mechanisms of initiation of DNA replication. However, very little is known about the evolution of origins or the evolution of mechanisms through which origins are recognized by the initiation machinery. This lack of understanding is largely due to the vast evolutionary distances between model organisms in which origins have been examined. Results In this study we have isolated and characterized autonomously replicating sequences (ARSs) in Lachancea kluyveri - a pre-whole genome duplication (WGD) budding yeast. Through a combination of experimental work and rigorous computational analysis, we show that L. kluyveri ARSs require a sequence that is similar but much longer than the ARS Consensus Sequence well defined in Saccharomyces cerevisiae. Moreover, compared with S. cerevisiae and K. lactis, the replication licensing machinery in L. kluyveri seems more tolerant to variations in the ARS sequence composition. It is able to initiate replication from almost all S. cerevisiae ARSs tested and most Kluyveromyces lactis ARSs. In contrast, only about half of the L. kluyveri ARSs function in S. cerevisiae and less than 10% function in K. lactis. Conclusions Our findings demonstrate a replication initiation system with novel features and underscore the functional diversity within the budding yeasts. Furthermore, we have developed new approaches for analyzing biologically functional DNA sequences with ill-defined motifs. PMID:22204614

  7. Novel features of ARS selection in budding yeast Lachancea kluyveri.

    PubMed

    Liachko, Ivan; Tanaka, Emi; Cox, Katherine; Chung, Shau Chee Claire; Yang, Lu; Seher, Arael; Hallas, Lindsay; Cha, Eugene; Kang, Gina; Pace, Heather; Barrow, Jasmine; Inada, Maki; Tye, Bik-Kwoon; Keich, Uri

    2011-12-28

    The characterization of DNA replication origins in yeast has shed much light on the mechanisms of initiation of DNA replication. However, very little is known about the evolution of origins or the evolution of mechanisms through which origins are recognized by the initiation machinery. This lack of understanding is largely due to the vast evolutionary distances between model organisms in which origins have been examined. In this study we have isolated and characterized autonomously replicating sequences (ARSs) in Lachancea kluyveri - a pre-whole genome duplication (WGD) budding yeast. Through a combination of experimental work and rigorous computational analysis, we show that L. kluyveri ARSs require a sequence that is similar but much longer than the ARS Consensus Sequence well defined in Saccharomyces cerevisiae. Moreover, compared with S. cerevisiae and K. lactis, the replication licensing machinery in L. kluyveri seems more tolerant to variations in the ARS sequence composition. It is able to initiate replication from almost all S. cerevisiae ARSs tested and most Kluyveromyces lactis ARSs. In contrast, only about half of the L. kluyveri ARSs function in S. cerevisiae and less than 10% function in K. lactis. Our findings demonstrate a replication initiation system with novel features and underscore the functional diversity within the budding yeasts. Furthermore, we have developed new approaches for analyzing biologically functional DNA sequences with ill-defined motifs.

  8. Three new Lasiodiplodia spp. from the tropics, recognized based on DNA sequence comparisons and morphology.

    PubMed

    Burgess, Treena I; Barber, Paul A; Mohali, Sari; Pegg, Geoff; de Beer, Wilhelm; Wingfield, Michael J

    2006-01-01

    Botryosphaeria rhodina (anamorph Lasiodiplodia theobromae) is a common endophyte and opportunistic pathogen on more than 500 tree species in the tropics and subtropics. During routine disease surveys of plantations in Australia and Venezuela several isolates differing from L. theobromae were identified and subsequently characterized based upon morphology and ITS and EF1-alpha nucleotide sequences. These isolates grouped into three strongly supported clades related to but different from the known taxa, B. rhodina and L. gonubiensis, These have been described here as three new species L. venezuelensis sp. nov., L. crassispora sp. nov. and L. rubropurpurea sp. nov. The three could be distinguished easily from each other and the two described species of Lasiodiplodia, thus confirming phylogenetic separations. Furthermore all five Lasiodiplodia spp. now recognized separated from Diplodia spp. and Dothiorella spp. with 100% bootstrap support.

  9. Candida adriatica sp. nov. and Candida molendinolei sp. nov., two yeast species isolated from olive oil and its by-products.

    PubMed

    Čadež, Neža; Raspor, Peter; Turchetti, Benedetta; Cardinali, Gianluigi; Ciafardini, Gino; Veneziani, Gianluca; Péter, Gábor

    2012-09-01

    Thirteen strains isolated from virgin olive oil or its by-products in several Mediterranean countries were found to be phenotypically and genetically divergent from currently recognized yeast species. Sequence analysis of the large subunit (LSU) rDNA D1/D2 domain and internal transcribed spacer regions/5.8S rDNA revealed that the strains represented two novel species described as Candida adriatica sp. nov. (type strain ZIM 2334(T) = CBS 12504(T) = NCAIM Y.02001(T)) and Candida molendinolei sp. nov. (type strain DBVPG 5508(T) = CBS 12508(T) = NCAIM Y.02000(T)). Phylogenetic analysis based on concatenated sequences of the small subunit rRNA gene, the D1/D2 region of the LSU rDNA and the translation elongation factor-1α gene suggested that C. adriatica sp. nov. and C. molendinolei sp. nov. should be placed within the Lindnera and Nakazawaea clades, respectively.

  10. N6-Methylation Assessment in Escherichia coli 23S rRNA Utilizing a Bulge Loop in an RNA-DNA Hybrid.

    PubMed

    Yoshioka, Kyoko; Kurita, Ryoji

    2018-06-07

    We propose a sequence-selective assay of N6-methyl-adenosine (m6A) in RNA without PCR or reverse transcription, by employing a hybridization assay with a DNA probe designed to form a bulge loop at the position of a target modified nucleotide. The m6A in the bulge in the RNA-DNA hybrid was assumed to be sufficiently mobile to be selectively recognized by an anti-m6A antibody with a high affinity. By employing a surface-plasmon-resonance measurement or using a microtiter-plate immunoassay method, a specific m6A in the Escherichia coli 23S rRNA sequence could be detected at the nanomolar level when synthesized and purified oligo-RNA fragments were used for measurement. We have successfully achieved the first selective detection of m6A 2030 specifically in 23S rRNA from real samples of E. coli total RNA by using our immunochemical approach.

  11. Molecular Cytogenetics Guides Massively Parallel Sequencing of a Radiation-Induced Chromosome Translocation in Human Cells.

    PubMed

    Cornforth, Michael N; Anur, Pavana; Wang, Nicholas; Robinson, Erin; Ray, F Andrew; Bedford, Joel S; Loucas, Bradford D; Williams, Eli S; Peto, Myron; Spellman, Paul; Kollipara, Rahul; Kittler, Ralf; Gray, Joe W; Bailey, Susan M

    2018-05-11

    Chromosome rearrangements are large-scale structural variants that are recognized drivers of oncogenic events in cancers of all types. Cytogenetics allows for their rapid, genome-wide detection, but does not provide gene-level resolution. Massively parallel sequencing (MPS) promises DNA sequence-level characterization of the specific breakpoints involved, but is strongly influenced by bioinformatics filters that affect detection efficiency. We sought to characterize the breakpoint junctions of chromosomal translocations and inversions in the clonal derivatives of human cells exposed to ionizing radiation. Here, we describe the first successful use of DNA paired-end analysis to locate and sequence across the breakpoint junctions of a radiation-induced reciprocal translocation. The analyses employed, with varying degrees of success, several well-known bioinformatics algorithms, a task made difficult by the involvement of repetitive DNA sequences. As for underlying mechanisms, the results of Sanger sequencing suggested that the translocation in question was likely formed via microhomology-mediated non-homologous end joining (mmNHEJ). To our knowledge, this represents the first use of MPS to characterize the breakpoint junctions of a radiation-induced chromosomal translocation in human cells. Curiously, these same approaches were unsuccessful when applied to the analysis of inversions previously identified by directional genomic hybridization (dGH). We conclude that molecular cytogenetics continues to provide critical guidance for structural variant discovery, validation and in "tuning" analysis filters to enable robust breakpoint identification at the base pair level.

  12. Single-stranded DNA Binding by the Helix-Hairpin-Helix Domain of XPF Protein Contributes to the Substrate Specificity of the ERCC1-XPF Protein Complex*

    PubMed Central

    Das, Devashish; Faridounnia, Maryam; Kovacic, Lidija; Kaptein, Robert; Boelens, Rolf; Folkers, Gert E.

    2017-01-01

    The nucleotide excision repair protein complex ERCC1-XPF is required for incision of DNA upstream of DNA damage. Functional studies have provided insights into the binding of ERCC1-XPF to various DNA substrates. However, because no structure for the ERCC1-XPF-DNA complex has been determined, the mechanism of substrate recognition remains elusive. Here we biochemically characterize the substrate preferences of the helix-hairpin-helix (HhH) domains of XPF and ERCC-XPF and show that the binding to single-stranded DNA (ssDNA)/dsDNA junctions is dependent on joint binding to the DNA binding domain of ERCC1 and XPF. We reveal that the homodimeric XPF is able to bind various ssDNA sequences but with a clear preference for guanine-containing substrates. NMR titration experiments and in vitro DNA binding assays also show that, within the heterodimeric ERCC1-XPF complex, XPF specifically recognizes ssDNA. On the other hand, the HhH domain of ERCC1 preferentially binds dsDNA through the hairpin region. The two separate non-overlapping DNA binding domains in the ERCC1-XPF heterodimer jointly bind to an ssDNA/dsDNA substrate and, thereby, at least partially dictate the incision position during damage removal. Based on structural models, NMR titrations, DNA-binding studies, site-directed mutagenesis, charge distribution, and sequence conservation, we propose that the HhH domain of ERCC1 binds to dsDNA upstream of the damage, and XPF binds to the non-damaged strand within a repair bubble. PMID:28028171

  13. A family of cellular proteins related to snake venom disintegrins.

    PubMed

    Weskamp, G; Blobel, C P

    1994-03-29

    Disintegrins are short soluble integrin ligands that were initially identified in snake venom. A previously recognized cellular protein with a disintegrin domain was the guinea pig sperm protein PH-30, a protein implicated in sperm-egg membrane binding and fusion. Here we present peptide sequences that are characteristic for several cellular disintegrin-domain proteins. These peptide sequences were deduced from cDNA sequence tags that were generated by polymerase chain reaction from various mouse tissue and a mouse muscle cell line. Northern blot analysis with four sequence tags revealed distinct mRNA expression patterns. Evidently, cellular proteins containing a disintegrin domain define a superfamily of potential integrin ligands that are likely to function in important cell-cell and cell-matrix interactions.

  14. A Novel Rickettsia Species Detected in Vole Ticks (Ixodes angustus) from Western Canada

    PubMed Central

    Anstead, Clare A.

    2013-01-01

    The genomic DNA of ixodid ticks from western Canada was tested by PCR for the presence of Rickettsia. No rickettsiae were detected in Ixodes sculptus, whereas 18% of the I. angustus and 42% of the Dermacentor andersoni organisms examined were PCR positive for Rickettsia. The rickettsiae from each tick species were characterized genetically using multiple genes. Rickettsiae within the D. andersoni organisms had sequences at four genes that matched those of R. peacockii. In contrast, the Rickettsia present within the larvae, nymphs, and adults of I. angustus had novel DNA sequences at four of the genes characterized compared to the sequences available from GenBank for all recognized species of Rickettsia and all other putative species within the genus. Phylogenetic analyses of the sequence data revealed that the rickettsiae in I. angustus do not belong to the spotted fever, transitional, or typhus groups of rickettsiae but are most closely related to “Candidatus Rickettsia kingi” and belong to a clade that also includes R. canadensis, “Candidatus Rickettsia tarasevichiae,” and “Candidatus Rickettsia monteiroi.” PMID:24077705

  15. Different strategies for the detection of bioagents using electrochemical and photoelectrochemical genosensors

    NASA Astrophysics Data System (ADS)

    Voccia, Diego; Bettazi, Francesca; Palchetti, Ilaria

    2015-10-01

    In recent years various kinds of biosensors for the detection of pathogens have been developed. A genosensor consists in the immobilization, onto the surface of a chosen transducer, of an oligonucleotide with a specific base sequence called capture probe. The complementary sequence (the analytical target, i.e. a specific sequence of the DNA/RNA of the pathogen) present in the sample is recognized and captured by the probe through the hybridization reaction. The evaluation of the extent of the hybridization allows one to confirm whether the sample contains the complementary sequence of the probe or not. Electrochemical transducers have received considerable attention in connection with the detection of DNA hybridization. Moreover, recently, with the emergence of novel photoelectrochemically active species and new detection schemes, photoelectrochemistry has resulted in substantial progress in its analytical performance for biosensing applications. In this paper, some examples of electrochemical genosensors for multiplexed pathogen detection are shown. Moreover, the preliminary experiments towards the development of a photoelectrochemical genosensor using a TiO2 - nanocrystal-modified ITO electrode are discussed.

  16. Phylogeny and classification of bacteria in the genera Clavibacter and Rathayibacter on the basis of 16s rRNA gene sequence analyses.

    PubMed

    Lee, I M; Bartoszyk, I M; Gundersen-Rindal, D E; Davis, R E

    1997-07-01

    A phylogenetic analysis by parsimony of 16S rRNA gene sequences (16S rDNA) revealed that species and subspecies of Clavibacter and Rathayibacter form a discrete monophyletic clade, paraphyletic to Corynebacterium species. Within the Clavibacter-Rathayibacter clade, four major phylogenetic groups (subclades) with a total of 10 distinct taxa were recognized: (I) species C. michiganensis; (II) species C. xyli; (III) species R. iranicus and R. tritici; and (IV) species R. rathayi. The first three groups form a monophyletic cluster, paraphyletic to R. rathayi. On the basis of the phylogeny inferred, reclassification of members of Clavibacter-Rathayibacter group is proposed. A system for classification of taxa in Clavibacter and Rathayibacter was developed based on restriction fragment length polymorphism (RFLP) analysis of the PCR-amplified 16S rDNA sequences. The groups delineated on the basis of RFLP patterns of 16S rDNA coincided well with the subclades delineated on the basis of phylogeny. In contrast to previous classification systems, which are based primarily on phenotypic properties and are laborious, the RFLP analyses allow for rapid differentiation among species and subspecies in the two genera.

  17. Genotyping of ancient Mycobacterium tuberculosis strains reveals historic genetic diversity.

    PubMed

    Müller, Romy; Roberts, Charlotte A; Brown, Terence A

    2014-04-22

    The evolutionary history of the Mycobacterium tuberculosis complex (MTBC) has previously been studied by analysis of sequence diversity in extant strains, but not addressed by direct examination of strain genotypes in archaeological remains. Here, we use ancient DNA sequencing to type 11 single nucleotide polymorphisms and two large sequence polymorphisms in the MTBC strains present in 10 archaeological samples from skeletons from Britain and Europe dating to the second-nineteenth centuries AD. The results enable us to assign the strains to groupings and lineages recognized in the extant MTBC. We show that at least during the eighteenth-nineteenth centuries AD, strains of M. tuberculosis belonging to different genetic groups were present in Britain at the same time, possibly even at a single location, and we present evidence for a mixed infection in at least one individual. Our study shows that ancient DNA typing applied to multiple samples can provide sufficiently detailed information to contribute to both archaeological and evolutionary knowledge of the history of tuberculosis.

  18. Seven new species within western Atlantic Starksia atlantica, S. lepicoelia, and S. sluiteri (Teleostei, Labrisomidae), with comments on congruence of DNA barcodes and species

    PubMed Central

    Baldwin, Carole C.; Castillo, Cristina I.; Weigt, Lee A.; Benjamin C., Victor

    2011-01-01

    Abstract Specimens of Starksia were collected throughout the western Atlantic, and a 650-bp portion of the mitochondrial gene cytochrome oxidase-c subunit I (COl) was sequenced as part of a re-analysis of species diversity of western Central Atlantic shorefishes. A neighbor-joining tree constructed from the sequence data suggests the existence of several cryptic species. Voucher specimens from each genetically distinct lineage and color photographs of vouchers taken prior to dissection and preservation were examined for diagnostic morphological characters. The results suggest that Starksia atlantica, Starksia lepicoelia, and Starksia sluiteri are species complexes, and each comprises three or more species. Seven new species are described. DNA data usually support morphological features, but some incongruence between genetic and morphological data exists. Genetic lineages are only recognized as species if supported by morphology. Genetic lineages within western Atlantic Starksia generally correspond to geography, such that members of each species complex have a very restricted geographical distribution. Increasing geographical coverage of sampling locations will almost certainly increase the number of Starksia species and species complexes recognized in the western Atlantic. Combining molecular and morphological investigations is bringing clarity to the taxonomy of many genera of morphologically similar fishes and increasing the number of currently recognized species. Future phylogenetic studies should help resolve species relationships and shed light on patterns of speciation in western Atlantic Starksia. PMID:21594143

  19. Development of ITS sequence based molecular marker to distinguish, Tribulus terrestris L. (Zygophyllaceae) from its adulterants.

    PubMed

    Balasubramani, Subramani Paranthaman; Murugan, Ramar; Ravikumar, Kaliamoorthy; Venkatasubramanian, Padma

    2010-09-01

    Tribulus terrestris L. (Zygophyllaceae) is one of the highly traded raw drugs and also used as a stimulative food additive in Europe and USA. While, Ayurvedic Pharmacopoeia of India recognizes T. terrestris as Goksura, Tribulus lanuginosus and T. subramanyamii are also traded by the same name raising issues of quality control. The nuclear ribosomal RNA genes and ITS (internal transcribed spacer) sequence were used to develop species-specific DNA markers. The species-specific markers efficiently amplified 295bp for T. terrestris (TT1F and TT1R), 300bp for T. lanuginosus (TL1F and TL1R) and 214bp for T. subramanyamii (TS1F and TS1R). These DNA markers can be used to distinguish T. terrestris from its adulterants. Copyright (c) 2010 Elsevier B.V. All rights reserved.

  20. Plasmonic biosensor for label-free G-quadruplexes detection

    NASA Astrophysics Data System (ADS)

    Qiu, Suyan; Zhao, Fusheng; Santos, Greggy M.; Shih, Wei-Chuan

    2016-03-01

    G-quadruplex, readily formed by the G-rich sequence, potentially distributes in over 40 % of all human genes, such as the telomeric DNA with the G-rich sequence found at the end of the chromosome. The G-quadruplex structure is supposed to possess a diverse set of critical functions in the mammalian genome for transcriptional regulation, DNA replication and genome stability. However, most of the currently available methods for G-quadruplex identification are restricted to fluorescence techniques susceptible to poor sensitivity. It is essential to propose methods with higher sensitivity to specifically recognize the G-quadruplexes. In this study, we demonstrate a label-free plasmonic biosensor for G-quadruplex detection by relying on the advantages of nanoporous gold (NPG) disks that provide high-density plasmonic hot spots, suitable for molecular recognition capability without the requirement for labeling processes.

  1. Structure and Engineering of Francisella novicida Cas9

    PubMed Central

    Hirano, Hisato; Gootenberg, Jonathan S.; Horii, Takuro; Abudayyeh, Omar O.; Kimura, Mika; Hsu, Patrick D.; Nakane, Takanori; Ishitani, Ryuichiro; Hatada, Izuho; Zhang, Feng; Nishimasu, Hiroshi; Nureki, Osamu

    2016-01-01

    Summary The RNA-guided endonuclease Cas9 cleaves double-stranded DNA targets complementary to the guide RNA, and has been applied to programmable genome editing. Cas9-mediated cleavage requires a protospacer adjacent motif (PAM) juxtaposed with the DNA target sequence, thus constricting the range of targetable sites. Here, we report the 1.7 Å resolution crystal structures of Cas9 from Francisella novicida (FnCas9), one of the largest Cas9 orthologs, in complex with a guide RNA and its PAM-containing DNA targets. A structural comparison of FnCas9 with other Cas9 orthologs revealed striking conserved and divergent features among distantly related CRISPR-Cas9 systems. We found that FnCas9 recognizes the 5′-NGG-3′ PAM, and used the structural information to create a variant that can recognize the more relaxed 5′-YG-3′ PAM. Furthermore, we demonstrated that pre-assembled FnCas9 ribonucleoprotein complexes can be microinjected into mouse zygotes to edit endogenous sites with the 5′-YG-3′ PAMs, thus expanding the target space of the CRISPR-Cas9 toolbox. PMID:26875867

  2. Structure and Engineering of Francisella novicida Cas9.

    PubMed

    Hirano, Hisato; Gootenberg, Jonathan S; Horii, Takuro; Abudayyeh, Omar O; Kimura, Mika; Hsu, Patrick D; Nakane, Takanori; Ishitani, Ryuichiro; Hatada, Izuho; Zhang, Feng; Nishimasu, Hiroshi; Nureki, Osamu

    2016-02-25

    The RNA-guided endonuclease Cas9 cleaves double-stranded DNA targets complementary to the guide RNA and has been applied to programmable genome editing. Cas9-mediated cleavage requires a protospacer adjacent motif (PAM) juxtaposed with the DNA target sequence, thus constricting the range of targetable sites. Here, we report the 1.7 Å resolution crystal structures of Cas9 from Francisella novicida (FnCas9), one of the largest Cas9 orthologs, in complex with a guide RNA and its PAM-containing DNA targets. A structural comparison of FnCas9 with other Cas9 orthologs revealed striking conserved and divergent features among distantly related CRISPR-Cas9 systems. We found that FnCas9 recognizes the 5'-NGG-3' PAM, and used the structural information to create a variant that can recognize the more relaxed 5'-YG-3' PAM. Furthermore, we demonstrated that the FnCas9-ribonucleoprotein complex can be microinjected into mouse zygotes to edit endogenous sites with the 5'-YG-3' PAM, thus expanding the target space of the CRISPR-Cas9 toolbox. Copyright © 2016 Elsevier Inc. All rights reserved.

  3. DNA Barcode Analysis of Thrips (Thysanoptera) Diversity in Pakistan Reveals Cryptic Species Complexes.

    PubMed

    Iftikhar, Romana; Ashfaq, Muhammad; Rasool, Akhtar; Hebert, Paul D N

    2016-01-01

    Although thrips are globally important crop pests and vectors of viral disease, species identifications are difficult because of their small size and inconspicuous morphological differences. Sequence variation in the mitochondrial COI-5' (DNA barcode) region has proven effective for the identification of species in many groups of insect pests. We analyzed barcode sequence variation among 471 thrips from various plant hosts in north-central Pakistan. The Barcode Index Number (BIN) system assigned these sequences to 55 BINs, while the Automatic Barcode Gap Discovery detected 56 partitions, a count that coincided with the number of monophyletic lineages recognized by Neighbor-Joining analysis and Bayesian inference. Congeneric species showed an average of 19% sequence divergence (range = 5.6% - 27%) at COI, while intraspecific distances averaged 0.6% (range = 0.0% - 7.6%). BIN analysis suggested that all intraspecific divergence >3.0% actually involved a species complex. In fact, sequences for three major pest species (Haplothrips reuteri, Thrips palmi, Thrips tabaci), and one predatory thrips (Aeolothrips intermedius) showed deep intraspecific divergences, providing evidence that each is a cryptic species complex. The study compiles the first barcode reference library for the thrips of Pakistan, and examines global haplotype diversity in four important pest thrips.

  4. Conflict RNA modification, host-parasite co-evolution, and the origins of DNA and DNA-binding proteins1.

    PubMed

    McLaughlin, Paul J; Keegan, Liam P

    2014-08-01

    Nearly 150 different enzymatically modified forms of the four canonical residues in RNA have been identified. For instance, enzymes of the ADAR (adenosine deaminase acting on RNA) family convert adenosine residues into inosine in cellular dsRNAs. Recent findings show that DNA endonuclease V enzymes have undergone an evolutionary transition from cleaving 3' to deoxyinosine in DNA and ssDNA to cleaving 3' to inosine in dsRNA and ssRNA in humans. Recent work on dsRNA-binding domains of ADARs and other proteins also shows that a degree of sequence specificity is achieved by direct readout in the minor groove. However, the level of sequence specificity observed is much less than that of DNA major groove-binding helix-turn-helix proteins. We suggest that the evolution of DNA-binding proteins following the RNA to DNA genome transition represents the major advantage that DNA genomes have over RNA genomes. We propose that a hypothetical RNA modification, a RRAR (ribose reductase acting on genomic dsRNA) produced the first stretches of DNA in RNA genomes. We discuss why this is the most satisfactory explanation for the origin of DNA. The evolution of this RNA modification and later steps to DNA genomes are likely to have been driven by cellular genome co-evolution with viruses and intragenomic parasites. RNA modifications continue to be involved in host-virus conflicts; in vertebrates, edited cellular dsRNAs with inosine-uracil base pairs appear to be recognized as self RNA and to suppress activation of innate immune sensors that detect viral dsRNA.

  5. Revisiting the taxonomy of the Rattini tribe: a phylogeny-based delimitation of species boundaries

    PubMed Central

    2010-01-01

    Background Rodents are recognized as hosts for at least 60 zoonotic diseases and may represent a serious threat for human health. In the context of global environmental changes and increasing mobility of humans and animals, contacts between pathogens and potential animal hosts and vectors are modified, amplifying the risk of disease emergence. An accurate identification of each rodent at a specific level is needed in order to understand their implications in the transmission of diseases. Among the Muridae, the Rattini tribe encompasses 167 species inhabiting South East Asia, a hotspot of both biodiversity and emerging and re-emerging diseases. The region faces growing economical development that affects habitats, biodiversity and health. Rat species have been demonstrated as significant hosts of pathogens but are still difficult to recognize at a specific level using morphological criteria. DNA-barcoding methods appear as accurate tools for rat species identification but their use is hampered by the need of reliable identification of reference specimens. In this study, we explore and highlight the limits of the current taxonomy of the Rattini tribe. Results We used the DNA sequence information itself as the primary information source to establish group membership and estimate putative species boundaries. We sequenced two mitochondrial and one nuclear genes from 122 rat samples to perform phylogenetic reconstructions. The method of Pons and colleagues (2006) that determines, with no prior expectations, the locations of ancestral nodes defining putative species was then applied to our dataset. To give an appropriate name to each cluster recognized as a putative species, we reviewed information from the literature and obtained sequences from a museum holotype specimen following the ancient DNA criteria. Conclusions Using a recently developed methodology, this study succeeds in refining the taxonomy of one of the most difficult groups of mammals. Most of the species expected within the area were retrieved but new putative species limits were also indicated, in particular within Berylmys and Rattus genera, where future taxonomic studies should be directed. Our study lays the foundations to better investigate rodent-born diseases in South East Asia and illustrates the relevance of evolutionary studies for health and medical sciences. PMID:20565819

  6. Revisiting the taxonomy of the Rattini tribe: a phylogeny-based delimitation of species boundaries.

    PubMed

    Pagès, Marie; Chaval, Yannick; Herbreteau, Vincent; Waengsothorn, Surachit; Cosson, Jean-François; Hugot, Jean-Pierre; Morand, Serge; Michaux, Johan

    2010-06-18

    Rodents are recognized as hosts for at least 60 zoonotic diseases and may represent a serious threat for human health. In the context of global environmental changes and increasing mobility of humans and animals, contacts between pathogens and potential animal hosts and vectors are modified, amplifying the risk of disease emergence. An accurate identification of each rodent at a specific level is needed in order to understand their implications in the transmission of diseases. Among the Muridae, the Rattini tribe encompasses 167 species inhabiting South East Asia, a hotspot of both biodiversity and emerging and re-emerging diseases. The region faces growing economical development that affects habitats, biodiversity and health. Rat species have been demonstrated as significant hosts of pathogens but are still difficult to recognize at a specific level using morphological criteria. DNA-barcoding methods appear as accurate tools for rat species identification but their use is hampered by the need of reliable identification of reference specimens. In this study, we explore and highlight the limits of the current taxonomy of the Rattini tribe. We used the DNA sequence information itself as the primary information source to establish group membership and estimate putative species boundaries. We sequenced two mitochondrial and one nuclear genes from 122 rat samples to perform phylogenetic reconstructions. The method of Pons and colleagues (2006) that determines, with no prior expectations, the locations of ancestral nodes defining putative species was then applied to our dataset. To give an appropriate name to each cluster recognized as a putative species, we reviewed information from the literature and obtained sequences from a museum holotype specimen following the ancient DNA criteria. Using a recently developed methodology, this study succeeds in refining the taxonomy of one of the most difficult groups of mammals. Most of the species expected within the area were retrieved but new putative species limits were also indicated, in particular within Berylmys and Rattus genera, where future taxonomic studies should be directed. Our study lays the foundations to better investigate rodent-born diseases in South East Asia and illustrates the relevance of evolutionary studies for health and medical sciences.

  7. The molecular biology of environmental aromatic hydrocarbons

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Weiss, S.B.

    The induction of mutations in living cells by polycyclic aromatic hydrocarbons (PAH) has been recognized for many years. Although the mechanism for this occurrence has been examined by numerous investigators, the precise nature and type of mutations induced is still unclear. Earlier investigations of DNA damage and repair were primarily examined by the random alkylation of bacterial and mammalian DNAs, in vivo, using a variety of different PAH agents. This procedure is still used today. Though informative, such studies have not offered any explanation of the mechanism by which PAH agents induce carcinogenesis. We have attempted to examine the repairmore » of PAH-damaged DNA using small DNA oligomer constructs as targets for site-specific alkylation. DNA constructs containing a single BPDE alkylated site in each duplex strand were ligated into M13 RF DNA and used to transfect E. coli. Progeny M13 DNA was isolated from E. coli colonies grown on agar plates containing IPTG and Xgal. DNA sequence analysis of the isolated progeny M13 DNA, at the site of construct insertion, was found to contain large deletions and illegitimate recombinants. These sequence rearrangements occurred in either recA{sup +} or recA{sup -} host cells suggesting that SOS processing was not involved in the deletions and the recombinants observed. The mechanism by which BPDE induces illegitimate recombinants has not been resolved, however, it is possible that the closely spaced adducts activate the recombinant machinery in our DNA-damaged cells. 1 ref., 6 figs., 1 tab.« less

  8. Isolation, molecular cloning and in vitro expression of rhesus monkey (Macaca mulatta) prominin-1.s1 complementary DNA encoding a potential hematopoietic stem cell antigen.

    PubMed

    Husain, S M; Shou, Y; Sorrentino, B P; Handgretinger, R

    2006-10-01

    Human prominin-1 (CD133 or AC133) is an important cell surface marker used to isolate primitive hematopoietic stem cells. The commercially available antibody to human prominin-1 does not recognize rhesus prominin-1. Therefore, we isolated, cloned and characterized the complementary DNA (cDNA) of rhesus prominin-1 gene and determined its coding potential. Following the nomenclature of prominin family of genes, we named this cDNA as rhesus prominin-1.s1. The amino acid sequence data of the putative rhesus prominin-1.s1 could be used in designing antigenic peptides to raise antibodies for use in isolation of pure populations of rhesus prominin-1(+) hematopoietic cells. To the best of our knowledge, there has been no previously published report about the isolation of a prominin-1 cDNA from rhesus monkey (Macaca mulatta).

  9. Distinguishing Individual DNA Bases in a Network by Non-Resonant Tip-Enhanced Raman Scattering.

    PubMed

    Zhang, Rui; Zhang, Xianbiao; Wang, Huifang; Zhang, Yao; Jiang, Song; Hu, Chunrui; Zhang, Yang; Luo, Yi; Dong, Zhenchao

    2017-05-08

    The importance of identifying DNA bases at the single-molecule level is well recognized for many biological applications. Although such identification can be achieved by electrical measurements using special setups, it is still not possible to identify single bases in real space by optical means owing to the diffraction limit. Herein, we demonstrate the outstanding ability of scanning tunneling microscope (STM)-controlled non-resonant tip-enhanced Raman scattering (TERS) to unambiguously distinguish two individual complementary DNA bases (adenine and thymine) with a spatial resolution down to 0.9 nm. The distinct Raman fingerprints identified for the two molecules allow to differentiate in real space individual DNA bases in coupled base pairs. The demonstrated ability of non-resonant Raman scattering with super-high spatial resolution will significantly extend the applicability of TERS, opening up new routes for single-molecule DNA sequencing. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Mitochondrial DNA sequence context in the penetrance of mitochondrial t-RNA mutations: A study across multiple lineages with diagnostic implications

    PubMed Central

    Queen, Rachel A.; Steyn, Jannetta S.; Lord, Phillip

    2017-01-01

    Mitochondrial DNA (mtDNA) mutations are well recognized as an important cause of inherited disease. Diseases caused by mtDNA mutations exhibit a high degree of clinical heterogeneity with a complex genotype-phenotype relationship, with many such mutations exhibiting incomplete penetrance. There is evidence that the spectrum of mutations causing mitochondrial disease might differ between different mitochondrial lineages (haplogroups) seen in different global populations. This would point to the importance of sequence context in the expression of mutations. To explore this possibility, we looked for mutations which are known to cause disease in humans, in animals of other species unaffected by mtDNA disease. The mt-tRNA genes are the location of many pathogenic mutations, with the m.3243A>G mutation on the mt-tRNA-Leu(UUR) being the most frequently seen mutation in humans. This study looked for the presence of m.3243A>G in 2784 sequences from 33 species, as well as any of the other mutations reported in association with disease located on mt-tRNA-Leu(UUR). We report a number of disease associated variations found on mt-tRNA-Leu(UUR) in other chordates, as the major population variant, with m.3243A>G being seen in 6 species. In these, we also found a number of mutations which appear compensatory and which could prevent the pathogenicity associated with this change in humans. This work has important implications for the discovery and diagnosis of mtDNA mutations in non-European populations. In addition, it might provide a partial explanation for the conflicting results in the literature that examines the role of mtDNA variants in complex traits. PMID:29161289

  11. A Dynamic Tandem Repeat in Monocotyledons Inferred from a Comparative Analysis of Chloroplast Genomes in Melanthiaceae.

    PubMed

    Do, Hoang Dang Khoa; Kim, Joo-Hwan

    2017-01-01

    Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of rps16 gene in cpDNA, we have firstly reported potential molecular markers for recognizing two sections ( Veratrum and Fuscoveratrum ) of Veratrum . Melathiaceae exhibits a significant change in the junction between large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3 . Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Small conserved sequences exist and flank tandem repeats in further observation of this region across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate newly distinct types of repeats. Occasionally, prior to the SSM process, point mutation event and double strand break repair occurred and induced the formation of initial repeat units which are indispensable in the SSM process. SSM may have likely occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). Collectively, these findings add new evidence of dynamic results from SSM in chloroplast genomes which can be useful for further evolutionary studies in angiosperms. Additionally, genomics events in cpDNA are potential resources for mining molecular markers in Liliales.

  12. Random-breakage mapping method applied to human DNA sequences

    NASA Technical Reports Server (NTRS)

    Lobrich, M.; Rydberg, B.; Cooper, P. K.; Chatterjee, A. (Principal Investigator)

    1996-01-01

    The random-breakage mapping method [Game et al. (1990) Nucleic Acids Res., 18, 4453-4461] was applied to DNA sequences in human fibroblasts. The methodology involves NotI restriction endonuclease digestion of DNA from irradiated calls, followed by pulsed-field gel electrophoresis, Southern blotting and hybridization with DNA probes recognizing the single copy sequences of interest. The Southern blots show a band for the unbroken restriction fragments and a smear below this band due to radiation induced random breaks. This smear pattern contains two discontinuities in intensity at positions that correspond to the distance of the hybridization site to each end of the restriction fragment. By analyzing the positions of those discontinuities we confirmed the previously mapped position of the probe DXS1327 within a NotI fragment on the X chromosome, thus demonstrating the validity of the technique. We were also able to position the probes D21S1 and D21S15 with respect to the ends of their corresponding NotI fragments on chromosome 21. A third chromosome 21 probe, D21S11, has previously been reported to be close to D21S1, although an uncertainty about a second possible location existed. Since both probes D21S1 and D21S11 hybridized to a single NotI fragment and yielded a similar smear pattern, this uncertainty is removed by the random-breakage mapping method.

  13. Mutations on the DNA Binding Surface of TBP Discriminate between Yeast TATA and TATA-Less Gene Transcription

    PubMed Central

    Kamenova, Ivanka; Warfield, Linda

    2014-01-01

    Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. PMID:24865972

  14. Mutations on the DNA binding surface of TBP discriminate between yeast TATA and TATA-less gene transcription.

    PubMed

    Kamenova, Ivanka; Warfield, Linda; Hahn, Steven

    2014-08-01

    Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  15. DNA-Based Taxonomy in Ecologically Versatile Microalgae: A Re-Evaluation of the Species Concept within the Coccoid Green Algal Genus Coccomyxa (Trebouxiophyceae, Chlorophyta)

    PubMed Central

    Rindi, Fabio; Tempesta, Sabrina; Paoletti, Michela; Pasqualetti, Marcella

    2016-01-01

    Coccomyxa is a genus of unicellular green algae of the class Trebouxiophyceae, well known for its cosmopolitan distribution and great ecological amplitude. The taxonomy of this genus has long been problematic, due to reliance on badly-defined and environmentally variable morphological characters. In this study, based on the discovery of a new species from an extreme habitat, we reassess species circumscription in Coccomyxa, a unicellular genus of the class Trebouxiophyceae, using a combination of ecological and DNA sequence data (analyzed with three different methods of algorithmic species delineation). Our results are compared with those of a recent integrative study of Darienko and colleagues that reassessed the taxonomy of Coccomyxa, recognizing 7 species in the genus. Expanding the dataset from 43 to 61 sequences (SSU + ITS rDNA) resulted in a different delimitation, supporting the recognition of a higher number of species (24 to 27 depending on the analysis used, with the 27-species scenario receiving the strongest support). Among these, C. melkonianii sp. nov. is described from material isolated from a river highly polluted by heavy metals (Rio Irvi, Sardinia, Italy). Analyses performed on ecological characters detected a significant phylogenetic signal in six different characters. We conclude that the 27-species scenario is presently the most realistic for Coccomyxa and we suggest that well-supported lineages distinguishable by ecological preferences should be recognized as different species in this genus. We also recommend that for microbial lineages in which the overall diversity is unknown and taxon sampling is sparse, as is often the case for green microalgae, the results of analyses for algorithmic DNA-based species delimitation should be interpreted with extreme caution. PMID:27028195

  16. Prediction of constitutive A-to-I editing sites from human transcriptomes in the absence of genomic sequences

    PubMed Central

    2013-01-01

    Background Adenosine-to-inosine (A-to-I) RNA editing is recognized as a cellular mechanism for generating both RNA and protein diversity. Inosine base pairs with cytidine during reverse transcription and therefore appears as guanosine during sequencing of cDNA. Current approaches of RNA editing identification largely depend on the comparison between transcriptomes and genomic DNA (gDNA) sequencing datasets from the same individuals, and it has been challenging to identify editing candidates from transcriptomes in the absence of gDNA information. Results We have developed a new strategy to accurately predict constitutive RNA editing sites from publicly available human RNA-seq datasets in the absence of relevant genomic sequences. Our approach establishes new parameters to increase the ability to map mismatches and to minimize sequencing/mapping errors and unreported genome variations. We identified 695 novel constitutive A-to-I editing sites that appear in clusters (named “editing boxes”) in multiple samples and which exhibit spatial and dynamic regulation across human tissues. Some of these editing boxes are enriched in non-repetitive regions lacking inverted repeat structures and contain an extremely high conversion frequency of As to Is. We validated a number of editing boxes in multiple human cell lines and confirmed that ADAR1 is responsible for the observed promiscuous editing events in non-repetitive regions, further expanding our knowledge of the catalytic substrate of A-to-I RNA editing by ADAR enzymes. Conclusions The approach we present here provides a novel way of identifying A-to-I RNA editing events by analyzing only RNA-seq datasets. This method has allowed us to gain new insights into RNA editing and should also aid in the identification of more constitutive A-to-I editing sites from additional transcriptomes. PMID:23537002

  17. Abundance of Dioxygenase Genes Similar to Ralstonia sp. Strain U2 nagAc Is Correlated with Naphthalene Concentrations in Coal Tar-Contaminated Freshwater Sediments

    PubMed Central

    Dionisi, Hebe M.; Chewning, Christopher S.; Morgan, Katherine H.; Menn, Fu-Min; Easter, James P.; Sayler, Gary S.

    2004-01-01

    We designed a real-time PCR assay able to recognize dioxygenase large-subunit gene sequences with more than 90% similarity to the Ralstonia sp. strain U2 nagAc gene (nagAc-like gene sequences) in order to study the importance of organisms carrying these genes in the biodegradation of naphthalene. Sequencing of PCR products indicated that this real-time PCR assay was specific and able to detect a variety of nagAc-like gene sequences. One to 100 ng of contaminated-sediment total DNA in 25-μl reaction mixtures produced an amplification efficiency of 0.97 without evident PCR inhibition. The assay was applied to surficial freshwater sediment samples obtained in or in close proximity to a coal tar-contaminated Superfund site. Naphthalene concentrations in the analyzed samples varied between 0.18 and 106 mg/kg of dry weight sediment. The assay for nagAc-like sequences indicated the presence of (4.1 ± 0.7) × 103 to (2.9 ± 0.3) × 105 copies of nagAc-like dioxygenase genes per μg of DNA extracted from sediment samples. These values corresponded to (1.2 ± 0.6) × 105 to (5.4 ± 0.4) × 107 copies of this target per g of dry weight sediment when losses of DNA during extraction were taken into account. There was a positive correlation between naphthalene concentrations and nagAc-like gene copies per microgram of DNA (r = 0.89) and per gram of dry weight sediment (r = 0.77). These results provide evidence of the ecological significance of organisms carrying nagAc-like genes in the biodegradation of naphthalene. PMID:15240274

  18. The consequences of sequence erosion in the evolution of recombination hotspots.

    PubMed

    Tiemann-Boege, Irene; Schwarz, Theresa; Striedner, Yasmin; Heissl, Angelika

    2017-12-19

    Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans -acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion.This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'. © 2017 The Authors.

  19. The consequences of sequence erosion in the evolution of recombination hotspots

    PubMed Central

    Schwarz, Theresa; Heissl, Angelika

    2017-01-01

    Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans-acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro. Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’. PMID:29109225

  20. Population Genetic Structure and Phylogeography of Camellia flavida (Theaceae) Based on Chloroplast and Nuclear DNA Sequences

    PubMed Central

    Wei, Su-Juan; Lu, Yong-Bin; Ye, Quan-Qing; Tang, Shao-Qing

    2017-01-01

    Camellia flavida is an endangered species of yellow camellia growing in limestone mountains in southwest China. The current classification of C. flavida into two varieties, var. flavida and var. patens, is controversial. We conducted a genetic analysis of C. flavida to determine its taxonomic structure. A total of 188 individual plants from 20 populations across the entire distribution range in southwest China were analyzed using two DNA fragments: a chloroplast DNA fragment from the small single copy region and a single-copy nuclear gene called phenylalanine ammonia-lyase (PAL). Sequences from both chloroplast and nuclear DNA were highly diverse; with high levels of genetic differentiation and restricted gene flow. This result can be attributed to the high habitat heterogeneity in limestone karst, which isolates C. flavida populations from each other. Our nuclear DNA results demonstrate that there are three differentiated groups within C. flavida: var. flavida 1, var. flavida 2, and var. patens. These genetic groupings are consistent with the morphological characteristics of the plants. We suggest that the samples included in this study constitute three taxa and the var. flavida 2 group is the genuine C. flavida. The three groups should be recognized as three management units for conservation concerns. PMID:28579991

  1. Chemiluminescent and chemiluminescence resonance energy transfer (CRET) detection of DNA, metal ions, and aptamer-substrate complexes using hemin/G-quadruplexes and CdSe/ZnS quantum dots.

    PubMed

    Freeman, Ronit; Liu, Xiaoqing; Willner, Itamar

    2011-08-03

    Nucleic acid subunits consisting of fragments of the horseradish peroxidase (HRP)-mimicking DNAzyme and aptamer domains against ATP or sequences recognizing Hg(2+) ions self-assemble, in the presence of ATP or Hg(2+), into the active hemin-G-quadruplex DNAzyme structure. The DNAzyme-generated chemiluminescence provides the optical readout for the sensing events. In addition, the DNAzyme-stimulated chemiluminescence resonance energy transfer (CRET) to CdSe/ZnS quantum dots (QDs) is implemented to develop aptamer or DNA sensing platforms. The self-assembly of the ATP-aptamer subunits/hemin-G-quadruplex DNAzyme, where one of the aptamer subunits is functionalized with CdSe/ZnS QDs, leads to the CRET signal. Also, the functionalization of QDs with a hairpin nucleic acid that includes the G-quadruplex sequence in a ''caged'' configuration is used to analyze DNA. The opening of the hairpin structure by the target DNA assembles the hemin-G-quadruplex DNAzyme that stimulates the CRET signal. By the application of three different sized QDs functionalized with different hairpins, the multiplexed analysis of three different DNA targets is demonstrated by the generation of three different CRET luminescence signals.

  2. A phylogenetic hypothesis for passerine birds: taxonomic and biogeographic implications of an analysis of nuclear DNA sequence data.

    PubMed Central

    Barker, F Keith; Barrowclough, George F; Groth, Jeff G

    2002-01-01

    Passerine birds comprise over half of avian diversity, but have proved difficult to classify. Despite a long history of work on this group, no comprehensive hypothesis of passerine family-level relationships was available until recent analyses of DNA-DNA hybridization data. Unfortunately, given the value of such a hypothesis in comparative studies of passerine ecology and behaviour, the DNA-hybridization results have not been well tested using independent data and analytical approaches. Therefore, we analysed nucleotide sequence variation at the nuclear RAG-1 and c-mos genes from 69 passerine taxa, including representatives of most currently recognized families. In contradiction to previous DNA-hybridization studies, our analyses suggest paraphyly of suboscine passerines because the suboscine New Zealand wren Acanthisitta was found to be sister to all other passerines. Additionally, we reconstructed the parvorder Corvida as a basal paraphyletic grade within the oscine passerines. Finally, we found strong evidence that several family-level taxa are misplaced in the hybridization results, including the Alaudidae, Irenidae, and Melanocharitidae. The hypothesis of relationships we present here suggests that the oscine passerines arose on the Australian continental plate while it was isolated by oceanic barriers and that a major northern radiation of oscines (i.e. the parvorder Passerida) originated subsequent to dispersal from the south. PMID:11839199

  3. A phylogenetic hypothesis for passerine birds: taxonomic and biogeographic implications of an analysis of nuclear DNA sequence data.

    PubMed

    Barker, F Keith; Barrowclough, George F; Groth, Jeff G

    2002-02-07

    Passerine birds comprise over half of avian diversity, but have proved difficult to classify. Despite a long history of work on this group, no comprehensive hypothesis of passerine family-level relationships was available until recent analyses of DNA-DNA hybridization data. Unfortunately, given the value of such a hypothesis in comparative studies of passerine ecology and behaviour, the DNA-hybridization results have not been well tested using independent data and analytical approaches. Therefore, we analysed nucleotide sequence variation at the nuclear RAG-1 and c-mos genes from 69 passerine taxa, including representatives of most currently recognized families. In contradiction to previous DNA-hybridization studies, our analyses suggest paraphyly of suboscine passerines because the suboscine New Zealand wren Acanthisitta was found to be sister to all other passerines. Additionally, we reconstructed the parvorder Corvida as a basal paraphyletic grade within the oscine passerines. Finally, we found strong evidence that several family-level taxa are misplaced in the hybridization results, including the Alaudidae, Irenidae, and Melanocharitidae. The hypothesis of relationships we present here suggests that the oscine passerines arose on the Australian continental plate while it was isolated by oceanic barriers and that a major northern radiation of oscines (i.e. the parvorder Passerida) originated subsequent to dispersal from the south.

  4. Genetic Ancestry of the Extinct Javan and Bali Tigers

    PubMed Central

    Xue, Hao-Ran; Yamaguchi, Nobuyuki; Driscoll, Carlos A.; Han, Yu; Bar-Gal, Gila Kahila; Zhuang, Yan; Mazak, Ji H.; Macdonald, David W.; O’Brien, Stephen J.

    2015-01-01

    The Bali (Panthera tigris balica) and Javan (P. t. sondaica) tigers are recognized as distinct tiger subspecies that went extinct in the 1940s and 1980s, respectively. Yet their genetic ancestry and taxonomic status remain controversial. Following ancient DNA procedures, we generated concatenated 1750bp mtDNA sequences from 23 museum samples including 11 voucher specimens from Java and Bali and compared these to diagnostic mtDNA sequences from 122 specimens of living tiger subspecies and the extinct Caspian tiger. The results revealed a close genetic affinity of the 3 groups from the Sunda Islands (Bali, Javan, and Sumatran tigers P. t. sumatrae). Bali and Javan mtDNA haplotypes differ from Sumatran haplotypes by 1–2 nucleotides, and the 3 island populations define a monophyletic assemblage distinctive and equidistant from other mainland subspecies. Despite this close phylogenetic relationship, no mtDNA haplotype was shared between Sumatran and Javan/Bali tigers, indicating little or no matrilineal gene flow among the islands after they were colonized. The close phylogenetic relationship among Sunda tiger subspecies suggests either recent colonization across the islands, or else a once continuous tiger population that had subsequently isolated into different island subspecies. This supports the hypothesis that the Sumatran tiger is the closest living relative to the extinct Javan and Bali tigers. PMID:25754539

  5. Conserved Curvature of RNA Polymerase I Core Promoter Beyond rRNA Genes: The Case of the Tritryps

    PubMed Central

    Smircich, Pablo; Duhagon, María Ana; Garat, Beatriz

    2015-01-01

    In trypanosomatids, the RNA polymerase I (RNAPI)-dependent promoters controlling the ribosomal RNA (rRNA) genes have been well identified. Although the RNAPI transcription machinery recognizes the DNA conformation instead of the DNA sequence of promoters, no conformational study has been reported for these promoters. Here we present the in silico analysis of the intrinsic DNA curvature of the rRNA gene core promoters in Trypanosoma brucei, Trypanosoma cruzi, and Leishmania major. We found that, in spite of the absence of sequence conservation, these promoters hold conformational properties similar to other eukaryotic rRNA promoters. Our results also indicated that the intrinsic DNA curvature pattern is conserved within the Leishmania genus and also among strains of T. cruzi and T. brucei. Furthermore, we analyzed the impact of point mutations on the intrinsic curvature and their impact on the promoter activity. Furthermore, we found that the core promoters of protein-coding genes transcribed by RNAPI in T. brucei show the same conserved conformational characteristics. Overall, our results indicate that DNA intrinsic curvature of the rRNA gene core promoters is conserved in these ancient eukaryotes and such conserved curvature might be a requirement of RNAPI machinery for transcription of not only rRNA genes but also protein-coding genes. PMID:26718450

  6. Bacteroides cellulosilyticus sp. nov., a cellulolytic bacterium from the human gut microbial community.

    PubMed

    Robert, Céline; Chassard, Christophe; Lawson, Paul A; Bernalier-Donadille, Annick

    2007-07-01

    A strictly anaerobic cellulolytic bacterium, strain CRE21(T), was isolated from a human faecal sample. Cells were Gram-negative non-motile rods that were about 1.7 microm in length and 0.9 microm in width. Strain CRE21(T) degraded different types of cellulose and was able to grow on a variety of carbohydrates. Cellulose and sugars were mainly converted to acetate, propionate and succinate. The G+C content of the DNA was 41.1 mol%. 16S rRNA gene sequence analysis revealed that the isolate belonged to the genus Bacteroides with highest sequence similarity to the type strain of Bacteroides intestinalis (98 %). DNA-DNA hybridization results revealed that strain CRE21(T) was distinct from B. intestinalis (40 % DNA-DNA relatedness). Strain CRE21(T) also showed several characteristics distinct from B. intestinalis. In particular, it exhibited different capacity to degrade polysaccharides such as cellulose. On the basis of phylogenetic analysis and the morphological, physiological and biochemical data presented in this study, strain CRE21(T) can be readily differentiated from recognized species of the genus Bacteroides. The name Bacteroides cellulosilyticus sp. nov. is proposed to accommodate this organism. The type strain is CRE21(T) (=DSM 14838(T)=CCUG 44979(T)).

  7. Mitochondrial DNA diversity of the Amerindian populations living in the Andean Piedmont of Bolivia: Chimane, Moseten, Aymara and Quechua.

    PubMed

    Corella, Alfons; Bert, Francesc; Pérez-Pérez, Alejandro; Gené, Manel; Turbón, Daniel

    2007-01-01

    Chimane, Moseten Aymara and Quechua are Amerindian populations living in the Bolivian Piedmont, a characteristic ecoregion between the eastern slope of the Andean mountains and the Amazonian Llanos de Moxos. In both neighbouring areas, dense and complex societies have developed over the centuries. The Piedmont area is especially interesting from a human peopling perspective since there is no clear evidence regarding the genetic influence and peculiarities of these populations. This land has been used extensively as a territory of economic and cultural exchange between the Andes and Amazonia, however Chimane and Moseten populations have been sufficiently isolated from their neighbour groups to be recognized as distinct populations. Genetic information suggests that evolutionary processes, such as genetic drift, natural selection and genetic admixture have formed the history of the Piedmont populations. The objective of this study is to characterize the genetic diversity of the Piedmont populations, analysing the sequence variability of the HVR-I control region in the mitochondrial DNA (mtDNA). Haplogroup mtDNA data available from the whole of Central and South America were utilized to determine the relationship of the Piedmont populations with other Amerindian populations. Hair pulls were obtained in situ, and DNA from non-related individuals was extracted using a standard Chelex 100 method. A 401 bp DNA fragment of HVR-I region was amplified using standard procedures. Two independent 401 and 328 bp DNA fragments were sequenced separately for each sample. The sequence analyses included mismatch distribution and mean pairwise differences, median network analyses, AMOVA and principal component analyses. The genetic diversity of DNA sequences was measured and compared with other South Amerindian populations. The genetic diversity of 401 nucleotide mtDNA sequences, in the hypervariable Control Region, from positions 16 000-16 400, was characterized in a sample of 46 Amerindians living in the Piedmont area in the Beni Department of Bolivia. The results obtained indicate that the genetic diversity in the area is higher than that observed in other American groups living in much larger areas and despite the reduced size of the studied area the human groups analysed show high levels of inter-group variability. In addition, results show that Amerindian populations living in the Piedmont are genetically more related to those in the Andean than in the Amazonian populations.

  8. Characterization of replication and conjugation of plasmid pWTY27 from a widely distributed Streptomyces species

    PubMed Central

    2012-01-01

    Background Streptomyces species are widely distributed in natural habitats, such as soils, lakes, plants and some extreme environments. Replication loci of several Streptomyces theta-type plasmids have been reported, but are not characterized in details. Conjugation loci of some Streptomyces rolling-circle-type plasmids are identified and mechanism of conjugal transferring are described. Results We report the detection of a widely distributed Streptomyces strain Y27 and its indigenous plasmid pWTY27 from fourteen plants and four soil samples cross China by both culturing and nonculturing methods. The complete nucleotide sequence of pWTY27 consisted of 14,288 bp. A basic locus for plasmid replication comprised repAB genes and an adjacent iteron sequence, to a long inverted-repeat (ca. 105 bp) of which the RepA protein bound specifically in vitro, suggesting that RepA may recognize a second structure (e.g. a long stem-loop) of the iteron DNA. A plasmid containing the locus propagated in linear mode when the telomeres of a linear plasmid were attached, indicating a bi-directional replication mode for pWTY27. As for rolling-circle plasmids, a single traA gene and a clt sequence (covering 16 bp within traA and its adjacent 159 bp) on pWTY27 were required for plasmid transfer. TraA recognized and bound specifically to the two regions of the clt sequence, one containing all the four DC1 of 7 bp (TGACACC) and one DC2 (CCCGCCC) and most of IC1, and another covering two DC2 and part of IC1, suggesting formation of a high-ordered DNA-protein complex. Conclusions This work (i) isolates a widespread Streptomyces strain Y27 and sequences its indigenous theta-type plasmid pWTY27; (ii) identifies the replication and conjugation loci of pWTY27 and; (iii) characterizes the binding sequences of the RepA and TraA proteins. PMID:23134842

  9. Novel technique used to treat melanoma and epithelial tumors in new clinical trial | Center for Cancer Research

    Cancer.gov

    Exomic sequencing allows researchers to read the “letters” in the part of your DNA that makes proteins to see where the letters are correct and where the letters are incorrect. This information allows white blood cells engineered from the patient to recognize these tumor-specific mutations and be made into vaccines, called dendritic cell (DC) vaccines, to test effects on

  10. Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

    PubMed

    Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi

    2014-01-01

    A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.

  11. A Children's Oncology Group and TARGET Initiative Exploring the Genetic Landscape of Wilms Tumor

    PubMed Central

    Gadd, Samantha; Huff, Vicki; Walz, Amy L.; Ooms, Ariadne H.A.G.; Armstrong, Amy E.; Gerhard, Daniela S.; Smith, Malcolm A.; Guidry Auvil, Jaime M.; Meerzaman, Daoud; Chen, Qing-Rong; Hsu, Chih Hao; Yan, Chunhua; Nguyen, Cu; Hu, Ying; Hermida, Leandro C.; Davidsen, Tanja; Gesuwan, Patee; Ma, Yussanne; Zong, Zusheng; Mungall, Andrew J.; Moore, Richard A.; Marra, Marco A.; Dome, Jeffrey S.; Mullighan, Charles G.; Ma, Jing; Wheeler, David A.; Hampton, Oliver A.; Ross, Nicole; Gastier-Foster, Julie M.; Arold, Stefan T.; Perlman, Elizabeth J.

    2017-01-01

    Genome-wide sequencing, mRNA and miRNA expression, DNA copy number and methylation analyses were performed on 117 Wilms tumors, followed by targeted sequencing of 651 Wilms tumors. In addition to genes previously implicated in Wilms tumors (WT1, CTNNB1, FAM123B, DROSHA, DGCR8, XPO5, DICER1, SIX1, SIX2, MLLT1, MYCN, and TP53), mutations were identified in genes not previously recognized as recurrently involved in Wilms tumors, the most frequent being BCOR, BCORL1, NONO, MAX, COL6A3, ASXL1, MAP3K4, and ARID1A. DNA copy number changes resulted in recurrent 1q gain, MYCN amplification, LIN28B gain, and let-7a loss. Unexpected germline variants involved PALB2 and CHEK2. Integrated analyses support two major classes of genetic changes that preserve the progenitor state and/or interrupt normal development. PMID:28825729

  12. Genetic and epigenetic mutations affect the DNA binding capability of human ZFP57 in transient neonatal diabetes type 1.

    PubMed

    Baglivo, Ilaria; Esposito, Sabrina; De Cesare, Lucia; Sparago, Angela; Anvar, Zahra; Riso, Vincenzo; Cammisa, Marco; Fattorusso, Roberto; Grimaldi, Giovanna; Riccio, Andrea; Pedone, Paolo V

    2013-05-21

    In the mouse, ZFP57 contains three classical Cys2His2 zinc finger domains (ZF) and recognizes the methylated TGC(met)CGC target sequence using the first and the second ZFs. In this study, we demonstrate that the human ZFP57 (hZFP57) containing six Cys2His2 ZFs, binds the same methylated sequence through the third and the fourth ZFs, and identify the aminoacids critical for DNA interaction. In addition, we present evidences indicating that hZFP57 mutations and hypomethylation of the TNDM1 ICR both associated with Transient Neonatal Diabetes Mellitus type 1 result in loss of hZFP57 binding to the TNDM1 locus, likely causing PLAGL1 activation. Copyright © 2013 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  13. Phylogeny of the owlet-nightjars (Aves: Aegothelidae) based on mitochondrial DNA sequence

    USGS Publications Warehouse

    Dumbacher, J.P.; Pratt, T.K.; Fleischer, R.C.

    2003-01-01

    The avian family Aegothelidae (Owlet-nightjars) comprises nine extant species and one extinct species, all of which are currently classified in a single genus, Aegotheles. Owlet-nightjars are secretive nocturnal birds of the South Pacific. They are relatively poorly studied and some species are known from only a few specimens. Furthermore, their confusing morphological variation has made it difficult to cluster existing specimens unambiguously into hierarchical taxonomic units. Here we sample all extant owlet-nightjar species and all but three currently recognized subspecies. We use DNA extracted primarily from museum specimens to obtain mitochondrial gene sequences and construct a molecular phylogeny. Our phylogeny suggests that most species are reciprocally monophyletic, however A. albertisi appears paraphyletic. Our data also suggest splitting A. bennettii into two species and splitting A. insignis and A. tatei as suggested in another recent paper. ?? 2003 Elsevier Science (USA). All rights reserved.

  14. Sequence-specific DNA binding by MYC/MAX to low-affinity non-E-box motifs.

    PubMed

    Allevato, Michael; Bolotin, Eugene; Grossman, Mark; Mane-Padros, Daniel; Sladek, Frances M; Martinez, Ernest

    2017-01-01

    The MYC oncoprotein regulates transcription of a large fraction of the genome as an obligatory heterodimer with the transcription factor MAX. The MYC:MAX heterodimer and MAX:MAX homodimer (hereafter MYC/MAX) bind Enhancer box (E-box) DNA elements (CANNTG) and have the greatest affinity for the canonical MYC E-box (CME) CACGTG. However, MYC:MAX also recognizes E-box variants and was reported to bind DNA in a "non-specific" fashion in vitro and in vivo. Here, in order to identify potential additional non-canonical binding sites for MYC/MAX, we employed high throughput in vitro protein-binding microarrays, along with electrophoretic mobility-shift assays and bioinformatic analyses of MYC-bound genomic loci in vivo. We identified all hexameric motifs preferentially bound by MYC/MAX in vitro, which include the low-affinity non-E-box sequence AACGTT, and found that the vast majority (87%) of MYC-bound genomic sites in a human B cell line contain at least one of the top 21 motifs bound by MYC:MAX in vitro. We further show that high MYC/MAX concentrations are needed for specific binding to the low-affinity sequence AACGTT in vitro and that elevated MYC levels in vivo more markedly increase the occupancy of AACGTT sites relative to CME sites, especially at distal intergenic and intragenic loci. Hence, MYC binds diverse DNA motifs with a broad range of affinities in a sequence-specific and dose-dependent manner, suggesting that MYC overexpression has more selective effects on the tumor transcriptome than previously thought.

  15. The Reconstruction of Condition-Specific Transcriptional Modules Provides New Insights in the Evolution of Yeast AP-1 Proteins

    PubMed Central

    Goudot, Christel; Etchebest, Catherine

    2011-01-01

    AP-1 proteins are transcription factors (TFs) that belong to the basic leucine zipper family, one of the largest families of TFs in eukaryotic cells. Despite high homology between their DNA binding domains, these proteins are able to recognize diverse DNA motifs. In yeasts, these motifs are referred as YRE (Yap Response Element) and are either seven (YRE-Overlap) or eight (YRE-Adjacent) base pair long. It has been proposed that the AP-1 DNA binding motif preference relies on a single change in the amino acid sequence of the yeast AP-1 TFs (an arginine in the YRE-O binding factors being replaced by a lysine in the YRE-A binding Yaps). We developed a computational approach to infer condition-specific transcriptional modules associated to the orthologous AP-1 protein Yap1p, Cgap1p and Cap1p, in three yeast species: the model yeast Saccharomyces cerevisiae and two pathogenic species Candida glabrata and Candida albicans. Exploitation of these modules in terms of predictions of the protein/DNA regulatory interactions changed our vision of AP-1 protein evolution. Cis-regulatory motif analyses revealed the presence of a conserved adenine in 5′ position of the canonical YRE sites. While Yap1p, Cgap1p and Cap1p shared a remarkably low number of target genes, an impressive conservation was observed in the YRE sequences identified by Yap1p and Cap1p. In Candida glabrata, we found that Cgap1p, unlike Yap1p and Cap1p, recognizes YRE-O and YRE-A motifs. These findings were supported by structural data available for the transcription factor Pap1p (Schizosaccharomyces pombe). Thus, whereas arginine and lysine substitutions in Cgap1p and Yap1p proteins were reported as responsible for a specific YRE-O or YRE-A preference, our analyses rather suggest that the ancestral yeast AP-1 protein could recognize both YRE-O and YRE-A motifs and that the arginine/lysine exchange is not the only determinant of the specialization of modern Yaps for one motif or another. PMID:21695268

  16. Newer Gene Editing Technologies toward HIV Gene Therapy

    PubMed Central

    Manjunath, N.; Yi, Guohua; Dang, Ying; Shankar, Premlata

    2013-01-01

    Despite the great success of highly active antiretroviral therapy (HAART) in ameliorating the course of HIV infection, alternative therapeutic approaches are being pursued because of practical problems associated with life-long therapy. The eradication of HIV in the so-called “Berlin patient” who received a bone marrow transplant from a CCR5-negative donor has rekindled interest in genome engineering strategies to achieve the same effect. Precise gene editing within the cells is now a realistic possibility with recent advances in understanding the DNA repair mechanisms, DNA interaction with transcription factors and bacterial defense mechanisms. Within the past few years, four novel technologies have emerged that can be engineered for recognition of specific DNA target sequences to enable site-specific gene editing: Homing Endonuclease, ZFN, TALEN, and CRISPR/Cas9 system. The most recent CRISPR/Cas9 system uses a short stretch of complementary RNA bound to Cas9 nuclease to recognize and cleave target DNA, as opposed to the previous technologies that use DNA binding motifs of either zinc finger proteins or transcription activator-like effector molecules fused to an endonuclease to mediate sequence-specific DNA cleavage. Unlike RNA interference, which requires the continued presence of effector moieties to maintain gene silencing, the newer technologies allow permanent disruption of the targeted gene after a single treatment. Here, we review the applications, limitations and future prospects of novel gene-editing strategies for use as HIV therapy. PMID:24284874

  17. Biogeography of “Cyprinella lutrensis”: intensive genetic sampling from the Pecos River ‘melting pot’ reveals a dynamic history and phylogenetic complexity

    PubMed Central

    Osborne, Megan J.; Diver, Tracy A.; Hoagstrom, Christopher W.; Turner, Thomas F.

    2015-01-01

    Thorough sampling is necessary to delineate lineage diversity for polytypic “species” such as Cyprinella lutrensis. We conducted extensive mtDNA sampling (cytochrome b and ND4) from the Pecos River, Rio Grande, and South Canadian River, New Mexico. Our study emphasized the Pecos River due to its complex geological history and potential to harbor multiple lineages. We used geometric-morphometric, morphometric, and meristic analyses to test for phenotypic divergence and combined nucDNA with mtDNA to test for cytonuclear disequilibrium and combined our sequences with published data to conduct a phylogenetic re-assessment of the entire C. lutrensis clade. We detected five co-occurring mtDNA lineages in the Pecos River, but no evidence for cytonuclear disequilibrium or phenotypic divergence. Recognized species were interspersed amongst divergent lineages of “C. lutrensis”. Allopatric divergence among drainages isolated in the Late Miocene and Pliocene apparently produced several recognized species and major divisions within “C. lutrensis”. Pleistocene re-expansion and subsequent re-fragmentation of a centralized lineage founded younger, divergent lineages throughout the Rio Grande basin and Edwards Plateau. There is also evidence of recent introductions to the Rio Grande, Pecos and South Canadian Rivers. Nonetheless, deeply divergent lineages have coexisted since the Pleistocene. PMID:26858464

  18. Quantitative analysis of TALE-DNA interactions suggests polarity effects.

    PubMed

    Meckler, Joshua F; Bhakta, Mital S; Kim, Moon-Soo; Ovadia, Robert; Habrian, Chris H; Zykovich, Artem; Yu, Abigail; Lockwood, Sarah H; Morbitzer, Robert; Elsäesser, Janett; Lahaye, Thomas; Segal, David J; Baldwin, Enoch P

    2013-04-01

    Transcription activator-like effectors (TALEs) have revolutionized the field of genome engineering. We present here a systematic assessment of TALE DNA recognition, using quantitative electrophoretic mobility shift assays and reporter gene activation assays. Within TALE proteins, tandem 34-amino acid repeats recognize one base pair each and direct sequence-specific DNA binding through repeat variable di-residues (RVDs). We found that RVD choice can affect affinity by four orders of magnitude, with the relative RVD contribution in the order NG > HD ≈ NN > NI > NK. The NN repeat preferred the base G over A, whereas the NK repeat bound G with 10(3)-fold lower affinity. We compared AvrBs3, a naturally occurring TALE that recognizes its target using some atypical RVD-base combinations, with a designed TALE that precisely matches 'standard' RVDs with the target bases. This comparison revealed unexpected differences in sensitivity to substitutions of the invariant 5'-T. Another surprising observation was that base mismatches at the 5' end of the target site had more disruptive effects on affinity than those at the 3' end, particularly in designed TALEs. These results provide evidence that TALE-DNA recognition exhibits a hitherto un-described polarity effect, in which the N-terminal repeats contribute more to affinity than C-terminal ones.

  19. Wnt-Mediated Repression via Bipartite DNA Recognition by TCF in the Drosophila Hematopoietic System

    PubMed Central

    Zhang, Chen U.; Blauwkamp, Timothy A.; Burby, Peter E.; Cadigan, Ken M.

    2014-01-01

    The Wnt/β-catenin signaling pathway plays many important roles in animal development, tissue homeostasis and human disease. Transcription factors of the TCF family mediate many Wnt transcriptional responses, promoting signal-dependent activation or repression of target gene expression. The mechanism of this specificity is poorly understood. Previously, we demonstrated that for activated targets in Drosophila, TCF/Pangolin (the fly TCF) recognizes regulatory DNA through two DNA binding domains, with the High Mobility Group (HMG) domain binding HMG sites and the adjacent C-clamp domain binding Helper sites. Here, we report that TCF/Pangolin utilizes a similar bipartite mechanism to recognize and regulate several Wnt-repressed targets, but through HMG and Helper sites whose sequences are distinct from those found in activated targets. The type of HMG and Helper sites is sufficient to direct activation or repression of Wnt regulated cis-regulatory modules, and protease digestion studies suggest that TCF/Pangolin adopts distinct conformations when bound to either HMG-Helper site pair. This repressive mechanism occurs in the fly lymph gland, the larval hematopoietic organ, where Wnt/β-catenin signaling controls prohemocytic differentiation. Our study provides a paradigm for direct repression of target gene expression by Wnt/β-catenin signaling and allosteric regulation of a transcription factor by DNA. PMID:25144371

  20. The African buffalo parasite Theileria. sp. (buffalo) can infect and immortalize cattle leukocytes and encodes divergent orthologues of Theileria parva antigen genes

    PubMed Central

    Bishop, R.P.; Hemmink, J.D.; Morrison, W.I.; Weir, W.; Toye, P.G.; Sitt, T.; Spooner, P.R.; Musoke, A.J.; Skilton, R.A.; Odongo, D.O.

    2015-01-01

    African Cape buffalo (Syncerus caffer) is the wildlife reservoir of multiple species within the apicomplexan protozoan genus Theileria, including Theileria parva which causes East coast fever in cattle. A parasite, which has not yet been formally named, known as Theileria sp. (buffalo) has been recognized as a potentially distinct species based on rDNA sequence, since 1993. We demonstrate using reverse line blot (RLB) and sequencing of 18S rDNA genes, that in an area where buffalo and cattle co-graze and there is a heavy tick challenge, T. sp. (buffalo) can frequently be isolated in culture from cattle leukocytes. We also show that T. sp. (buffalo), which is genetically very closely related to T. parva, according to 18s rDNA sequence, has a conserved orthologue of the polymorphic immunodominant molecule (PIM) that forms the basis of the diagnostic ELISA used for T. parva serological detection. Closely related orthologues of several CD8 T cell target antigen genes are also shared with T. parva. By contrast, orthologues of the T. parva p104 and the p67 sporozoite surface antigens could not be amplified by PCR from T. sp. (buffalo), using conserved primers designed from the corresponding T. parva sequences. Collectively the data re-emphasise doubts regarding the value of rDNA sequence data alone for defining apicomplexan species in the absence of additional data. ‘Deep 454 pyrosequencing’ of DNA from two Theileria sporozoite stabilates prepared from Rhipicephalus appendiculatus ticks fed on buffalo failed to detect T. sp. (buffalo). This strongly suggests that R. appendiculatus may not be a vector for T. sp. (buffalo). Collectively, the data provides further evidence that T. sp. (buffalo). is a distinct species from T. parva. PMID:26543804

  1. Molecular Evidence of Bartonella Infection in Domestic Dogs from Algeria, North Africa, by Polymerase Chain Reaction (PCR)

    PubMed Central

    Kernif, Tahar; Aissi, Meriem; Doumandji, Salah-Eddine; Chomel, Bruno B.; Raoult, Didier; Bitam, Idir

    2010-01-01

    Bartonella species are being recognized as important bacterial human and canine pathogens, and are associated with multiple arthropod vectors. Bartonella DNA extracted from blood samples was obtained from domestic dogs in Algiers, Algeria. Polymerase chain reaction (PCR) and DNA sequence analyses of the ftsZ gene and the 16S-23S intergenic spacer region (ITS) were performed. Three Bartonella species: Bartonella vinsonii subsp. berkhoffii, Bartonella clarridgeiae, and Bartonells elizabethae were detected infecting Algerian dogs. To our knowledge, this study is the first report of detection by PCR amplification of Bartonella in dogs in North Africa. PMID:20682871

  2. Spiders (Araneae) of Churchill, Manitoba: DNA barcodes and morphology reveal high species diversity and new Canadian records.

    PubMed

    Blagoev, Gergin A; Nikolova, Nadya I; Sobel, Crystal N; Hebert, Paul D N; Adamowicz, Sarah J

    2013-11-26

    Arctic ecosystems, especially those near transition zones, are expected to be strongly impacted by climate change. Because it is positioned on the ecotone between tundra and boreal forest, the Churchill area is a strategic locality for the analysis of shifts in faunal composition. This fact has motivated the effort to develop a comprehensive biodiversity inventory for the Churchill region by coupling DNA barcoding with morphological studies. The present study represents one element of this effort; it focuses on analysis of the spider fauna at Churchill. 198 species were detected among 2704 spiders analyzed, tripling the count for the Churchill region. Estimates of overall diversity suggest that another 10-20 species await detection. Most species displayed little intraspecific sequence variation (maximum <1%) in the barcode region of the cytochrome c oxidase subunit I (COI) gene, but four species showed considerably higher values (maximum = 4.1-6.2%), suggesting cryptic species. All recognized species possessed a distinct haplotype array at COI with nearest-neighbour interspecific distances averaging 8.57%. Three species new to Canada were detected: Robertus lyrifer (Theridiidae), Baryphyma trifrons (Linyphiidae), and Satilatlas monticola (Linyphiidae). The first two species may represent human-mediated introductions linked to the port in Churchill, but the other species represents a range extension from the USA. The first description of the female of S. monticola was also presented. As well, one probable new species of Alopecosa (Lycosidae) was recognized. This study provides the first comprehensive DNA barcode reference library for the spider fauna of any region. Few cryptic species of spiders were detected, a result contrasting with the prevalence of undescribed species in several other terrestrial arthropod groups at Churchill. Because most (97.5%) sequence clusters at COI corresponded with a named taxon, DNA barcoding reliably identifies spiders in the Churchill fauna. The capacity of DNA barcoding to enable the identification of otherwise taxonomically ambiguous specimens (juveniles, females) also represents a major advance for future monitoring efforts on this group.

  3. Chimeras of human complement C9 reveal the site recognized by complement regulatory protein CD59.

    PubMed

    Hüsler, T; Lockert, D H; Kaufman, K M; Sodetz, J M; Sims, P J

    1995-02-24

    CD59 antigen is a membrane glycoprotein that inhibits the activity of the C9 component of the C5b-9 membrane attack complex, thereby protecting human cells from lysis by human complement. The complement-inhibitory activity of CD59 is species-selective and is most effective toward C9 derived from human or other primate plasma. By contrast, rabbit C9, which can substitute for human C9 in the membrane attack complex, mediates unrestricted lysis of human cells. To identify the peptide segment of human C9 that is recognized by CD59, rabbit C9 cDNA clones were isolated, characterized, and used to construct hybrid cDNAs for expression of full-length human/rabbit C9 chimeras in COS-7 cells. All resulting chimeras were hemolytically active, when tested against chicken erythrocytes bearing C5b-8 complexes. Assays performed in the presence or absence of CD59 revealed that this inhibitor reduced the hemolytic activity of those chimeras containing human C9 sequence between residues 334-415, irrespective of whether the remainder of the protein contained human or rabbit sequence. By contrast, when this segment of C9 contained rabbit sequence, lytic activity was unaffected by CD59. These data establish that human C9 residues 334-415 contain the site recognized by CD59, and they suggest that sequence variability within this segment of C9 is responsible for the observed species-selective inhibitory activity of CD59.

  4. Specific interaction of mutant p53 with regions of matrix attachment region DNA elements (MARs) with a high potential for base-unpairing

    PubMed Central

    Will, Katrin; Warnecke, Gabriele; Wiesmüller, Lisa; Deppert, Wolfgang

    1998-01-01

    Mutant, but not wild-type p53 binds with high affinity to a variety of MAR-DNA elements (MARs), suggesting that MAR-binding of mutant p53 relates to the dominant-oncogenic activities proposed for mutant p53. MARs recognized by mutant p53 share AT richness and contain variations of an AATATATTT “DNA-unwinding motif,” which enhances the structural dynamics of chromatin and promotes regional DNA base-unpairing. Mutant p53 specifically interacted with MAR-derived oligonucleotides carrying such unwinding motifs, catalyzing DNA strand separation when this motif was located within a structurally labile sequence environment. Addition of GC-clamps to the respective MAR-oligonucleotides or introducing mutations into the unwinding motif strongly reduced DNA strand separation, but supported the formation of tight complexes between mutant p53 and such oligonucleotides. We conclude that the specific interaction of mutant p53 with regions of MAR-DNA with a high potential for base-unpairing provides the basis for the high-affinity binding of mutant p53 to MAR-DNA. PMID:9811860

  5. Micromonospora halotolerans sp. nov., isolated from the rhizosphere of a Pisum sativum plant.

    PubMed

    Carro, Lorena; Pukall, Rüdiger; Spröer, Cathrin; Kroppenstedt, Reiner M; Trujillo, Martha E

    2013-06-01

    A filamentous actinomycete strain designated CR18(T) was isolated on humic acid agar from the rhizosphere of a Pisum sativum plant collected in Spain. This isolate was observed to grow optimally at 28 °C, pH 7.0 and in the presence of 5 % NaCl. Phylogenetic analyses based on the 16S rRNA gene sequence indicated a close relationship with the type strains of Micromonospora chersina and Micromonospora endolithica. A further analysis based on a concatenated DNA sequence stretch of 4,523 bp that included partial sequences of the atpD, gyrB, recA, rpoB and 16S rRNA genes clearly differentiated the new strain from recognized Micromonospora species compared. DNA-DNA hybridization studies further supported the taxonomic position of strain CR18(T) as a novel genomic species. Chemotaxonomic analyses which included whole cell sugars, polar lipids, fatty acid profiles and menaquinone composition confirmed the affiliation of the new strain to the genus Micromonospora and also highlighted differences at the species level. These studies were finally complemented with an array of physiological tests to help differentiate between the new strain and its phylogenetic neighbours. Consequently, strain CR18(T) (= CECT 7890(T) = DSM 45598(T)) is proposed as the type strain of a novel species, Micromonospora halotolerans sp. nov.

  6. Drosophila cell cycle under arrest: uncapped telomeres plead guilty.

    PubMed

    Cenci, Giovanni

    2009-04-01

    Telomeres are specialized structures that protect chromosome ends from degradation and fusion events. In most organisms, telomeres consist of short, repetitive G-rich sequences added to chromosome ends by a reverse transcriptase with an internal RNA template, called telomerase. Specific DNA-binding protein complexes associate with telomeric sequences preventing chromosome ends from being recognized as DNA double strand breaks (DSBs). Telomeres that lose their cap activate the DNA damage response (DDR) likewise DSBs and, if inappropriately repaired, generate telomeric fusions, which eventually lead to genome instability. In Drosophila there is not telomerase, and telomere length is maintained by transposition of three specialized retroelements. However, fly telomeres are protected by multi protein complexes like their yeast and vertebrate counterparts; these complexes bind chromosome ends in a sequence-independent fashion and are required to prevent checkpoint activation and end-to-end fusion. Uncapped Drosophila telomeres elicit a DDR just as dysfunctional human telomeres. Most interestingly, uncapped Drosophila telomeres also activate the spindle assembly checkpoint (SAC) by recruiting the SAC kinase BubR1. BubR1 accumulations at chromosome ends trigger the SAC that inhibits the metaphase-to-anaphase transition. These findings, reviewed here, highlight an intriguing and unsuspected connection between telomeres and cell cycle regulation, providing a clue to understand human telomere function.

  7. A generalized global alignment algorithm.

    PubMed

    Huang, Xiaoqiu; Chao, Kun-Mao

    2003-01-22

    Homologous sequences are sometimes similar over some regions but different over other regions. Homologous sequences have a much lower global similarity if the different regions are much longer than the similar regions. We present a generalized global alignment algorithm for comparing sequences with intermittent similarities, an ordered list of similar regions separated by different regions. A generalized global alignment model is defined to handle sequences with intermittent similarities. A dynamic programming algorithm is designed to compute an optimal general alignment in time proportional to the product of sequence lengths and in space proportional to the sum of sequence lengths. The algorithm is implemented as a computer program named GAP3 (Global Alignment Program Version 3). The generalized global alignment model is validated by experimental results produced with GAP3 on both DNA and protein sequences. The GAP3 program extends the ability of standard global alignment programs to recognize homologous sequences of lower similarity. The GAP3 program is freely available for academic use at http://bioinformatics.iastate.edu/aat/align/align.html.

  8. Biodiversity hot spot on a hot spot: novel extremophile diversity in Hawaiian fumaroles.

    PubMed

    Wall, Kate; Cornell, Jennifer; Bizzoco, Richard W; Kelley, Scott T

    2015-01-06

    Fumaroles (steam vents) are the most common, yet least understood, microbial habitat in terrestrial geothermal settings. Long believed too extreme for life, recent advances in sample collection and DNA extraction methods have found that fumarole deposits and subsurface waters harbor a considerable diversity of viable microbes. In this study, we applied culture-independent molecular methods to explore fumarole deposit microbial assemblages in 15 different fumaroles in four geographic locations on the Big Island of Hawai'i. Just over half of the vents yielded sufficient high-quality DNA for the construction of 16S ribosomal RNA gene sequence clone libraries. The bacterial clone libraries contained sequences belonging to 11 recognized bacterial divisions and seven other division-level phylogenetic groups. Archaeal sequences were less numerous, but similarly diverse. The taxonomic composition among fumarole deposits was highly heterogeneous. Phylogenetic analysis found cloned fumarole sequences were related to microbes identified from a broad array of globally distributed ecotypes, including hot springs, terrestrial soils, and industrial waste sites. Our results suggest that fumarole deposits function as an "extremophile collector" and may be a hot spot of novel extremophile biodiversity. © 2015 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.

  9. Biodiversity hot spot on a hot spot: novel extremophile diversity in Hawaiian fumaroles

    PubMed Central

    Wall, Kate; Cornell, Jennifer; Bizzoco, Richard W; Kelley, Scott T

    2015-01-01

    Fumaroles (steam vents) are the most common, yet least understood, microbial habitat in terrestrial geothermal settings. Long believed too extreme for life, recent advances in sample collection and DNA extraction methods have found that fumarole deposits and subsurface waters harbor a considerable diversity of viable microbes. In this study, we applied culture-independent molecular methods to explore fumarole deposit microbial assemblages in 15 different fumaroles in four geographic locations on the Big Island of Hawai'i. Just over half of the vents yielded sufficient high-quality DNA for the construction of 16S ribosomal RNA gene sequence clone libraries. The bacterial clone libraries contained sequences belonging to 11 recognized bacterial divisions and seven other division-level phylogenetic groups. Archaeal sequences were less numerous, but similarly diverse. The taxonomic composition among fumarole deposits was highly heterogeneous. Phylogenetic analysis found cloned fumarole sequences were related to microbes identified from a broad array of globally distributed ecotypes, including hot springs, terrestrial soils, and industrial waste sites. Our results suggest that fumarole deposits function as an “extremophile collector” and may be a hot spot of novel extremophile biodiversity. PMID:25565172

  10. The ectomycorrhizas of Lactarius cuspidoaurantiacus and Lactarius herrerae associated with Alnus acuminata in Central Mexico.

    PubMed

    Montoya, Leticia; Bandala, Victor M; Garay-Serrano, Edith

    2015-08-01

    Two pure Alnus acuminata stands established in a montane forest in central Mexico (Puebla State) were monitored between 2010 and 2013 to confirm and recognize the ectomycorrhizal (EcM) systems of A. acuminata with Lactarius cuspidoaurantiacus and Lactarius herrerae, two recently described species. Through comparison of internal transcribed spacer (ITS) of nuclear ribosomal DNA sequences from basidiomes and ectomycorrhizas sampled in the forest stands, we confirmed their ectomycorrhizal association. The phytobiont was corroborated by comparing ITS sequences obtained from EcM root tips and leaves collected in the study site and from other sequences of A. acuminata available in Genbank. Detailed morphological and anatomical descriptions of the ectomycorrhizal systems are presented and complemented with photographs.

  11. A multi-locus analysis of phylogenetic relationships within grass subfamily Pooideae (Poaceae) inferred from sequences of nuclear single copy gene regions compared with plastid DNA.

    PubMed

    Hochbach, Anne; Schneider, Julia; Röser, Martin

    2015-06-01

    To investigate phylogenetic relationships within the grass subfamily Pooideae we studied about 50 taxa covering all recognized tribes, using one plastid DNA (cpDNA) marker (matK gene-3'trnK exon) and for the first time four nuclear single copy gene loci. DNA sequence information from two parts of the nuclear genes topoisomerase 6 (Topo6) spanning the exons 8-13 and 17-19, the exons 9-13 encoding plastid acetyl-CoA-carboxylase (Acc1) and the partial exon 1 of phytochrome B (PhyB) were generated. Individual and nuclear combined data were evaluated using maximum parsimony, maximum likelihood and Bayesian methods. All of the phylogenetic results show Brachyelytrum and the tribe Nardeae as earliest diverging lineages within the subfamily. The 'core' Pooideae (Hordeeae and the Aveneae/Poeae tribe complex) are also strongly supported, as well as the monophyly of the tribes Brachypodieae, Meliceae and Stipeae (except PhyB). The beak grass tribe Diarrheneae and the tribe Duthieeae are not monophyletic in some of the analyses. However, the combined nuclear DNA (nDNA) tree yields the highest resolution and the best delimitation of the tribes, and provides the following evolutionary hypothesis for the tribes: Brachyelytrum, Nardeae, Duthieeae, Meliceae, Stipeae, Diarrheneae, Brachypodieae and the 'core' Pooideae. Within the individual datasets, the phylogenetic trees obtained from Topo6 exon 8-13 shows the most interesting results. The divergent positions of some clone sequences of Ampelodesmos mauritanicus and Trikeraia pappiformis, for instance, may indicate a hybrid origin of these stipoid taxa. Copyright © 2015 Elsevier Inc. All rights reserved.

  12. Penetration of short fluorescence-labeled peptides into the nucleus in HeLa cells and in vitro specific interaction of the peptides with deoxyribooligonucleotides and DNA.

    PubMed

    Fedoreyeva, L I; Kireev, I I; Khavinson, V Kh; Vanyushin, B F

    2011-11-01

    Marked fluorescence in cytoplasm, nucleus, and nucleolus was observed in HeLa cells after incubation with each of several fluorescein isothiocyanate-labeled peptides (epithalon, Ala-Glu-Asp-Gly; pinealon, Glu-Asp-Arg; testagen, Lys-Glu-Asp-Gly). This means that short biologically active peptides are able to penetrate into an animal cell and its nucleus and, in principle they may interact with various components of cytoplasm and nucleus including DNA and RNA. It was established that various initial (intact) peptides differently affect the fluorescence of the 5,6-carboxyfluorescein-labeled deoxyribooligonucleotides and DNA-ethidium bromide complexes. The Stern-Volmer constants characterizing the degree of fluorescence quenching of various single- and double-stranded fluorescence-labeled deoxyribooligonucleotides with short peptides used were different depending on the peptide primary structures. This indicates the specific interaction between short biologically active peptides and nucleic acid structures. On binding to them, the peptides discriminate between different nucleotide sequences and recognize even their cytosine methylation status. Judging from corresponding constants of the fluorescence quenching, the epithalon, pinealon, and bronchogen (Ala-Glu-Asp-Leu) bind preferentially with deoxyribooligonucleotides containing CNG sequence (CNG sites are targets for cytosine DNA methylation in eukaryotes). Epithalon, testagen, and pinealon seem to preferentially bind with CAG- but bronchogen with CTG-containing sequences. The site-specific interactions of peptides with DNA can control epigenetically the cell genetic functions, and they seem to play an important role in regulation of gene activity even at the earliest stages of life origin and in evolution.

  13. Fluorescence bio-barcode DNA assay based on gold and magnetic nanoparticles for detection of Exotoxin A gene sequence.

    PubMed

    Amini, Bahram; Kamali, Mehdi; Salouti, Mojtaba; Yaghmaei, Parichehreh

    2017-06-15

    Bio-barcode DNA based on gold nanoparticle (bDNA-GNPs) as a new generation of biosensor based detection tools, holds promise for biological science studies. They are of enormous importance in the emergence of rapid and sensitive procedures for detecting toxins of microorganisms. Exotoxin A (ETA) is the most toxic virulence factor of Pseudomonas aeruginosa. ETA has ADP-ribosylation activity and decisively affects the protein synthesis of the host cells. In the present study, we developed a fluorescence bio-barcode technology to trace P. aeruginosa ETA. The GNPs were coated with the first target-specific DNA probe 1 (1pDNA) and bio-barcode DNA, which acted as a signal reporter. The magnetic nanoparticles (MNPs) were coated with the second target-specific DNA probe 2 (2pDNA) that was able to recognize the other end of the target DNA. After binding the nanoparticles with the target DNA, the following sandwich structure was formed: MNP 2pDNA/tDNA/1pDNA-GNP-bDNA. After isolating the sandwiches by a magnetic field, the DNAs of the probes which have been hybridized to their complementary DNA, GNPs and MNPs, via the hydrogen, electrostatic and covalently bonds, were released from the sandwiches after dissolving in dithiothreitol solution (DTT 0.8M). This bio-barcode DNA with known DNA sequence was then detected by fluorescence spectrophotometry. The findings showed that the new method has the advantages of fast, high sensitivity (the detection limit was 1.2ng/ml), good selectivity, and wide linear range of 5-200ng/ml. The regression analysis also showed that there was a good linear relationship (∆F=0.57 [target DNA]+21.31, R 2 =0.9984) between the fluorescent intensity and the target DNA concentration in the samples. Copyright © 2016. Published by Elsevier B.V.

  14. The tick plasma lectin, Dorin M, is a fibrinogen-related molecule.

    PubMed

    Rego, Ryan O M; Kovár, Vojtĕch; Kopácek, Petr; Weise, Christoph; Man, Petr; Sauman, Ivo; Grubhoffer, Libor

    2006-04-01

    A lectin, named Dorin M, previously isolated and characterized from the hemolymph plasma of the soft tick, Ornithodoros moubata, was cloned and sequenced. The immunofluorescence using confocal microscopy revealed that Dorin M is produced in the tick hemocytes. A tryptic cleavage of Dorin M was performed and the resulting peptide fragments were sequenced by Edman degradation and/or mass spectrometry. Two of three internal peptide sequences displayed a significant similarity to the family of fibrinogen-related molecules. Degenerate primers were designed and used for PCR with hemocyte cDNA as a template. The sequence of the whole Dorin M cDNA was completed by the method of RACE. The tissue-specific expression investigated by RT-PCR revealed that Dorin M, in addition to hemocytes, is significantly expressed in salivary glands. The derived amino-acid sequence clearly shows that Dorin M has a fibrinogen-like domain, and exhibited the most significant similarity with tachylectins 5A and 5B from a horseshoe crab, Tachypleus tridentatus. In addition, other protein and binding characteristics suggest that Dorin M is closely related to tachylectins-5. Since these lectins have been reported to function as non-self recognizing molecules, we believe that Dorin M may play a similar role in an innate immunity of the tick and, possibly, also in pathogen transmission by this vector.

  15. Untangling taxonomy: a DNA barcode reference library for Canadian spiders.

    PubMed

    Blagoev, Gergin A; deWaard, Jeremy R; Ratnasingham, Sujeevan; deWaard, Stephanie L; Lu, Liuqiong; Robertson, James; Telfer, Angela C; Hebert, Paul D N

    2016-01-01

    Approximately 1460 species of spiders have been reported from Canada, 3% of the global fauna. This study provides a DNA barcode reference library for 1018 of these species based upon the analysis of more than 30,000 specimens. The sequence results show a clear barcode gap in most cases with a mean intraspecific divergence of 0.78% vs. a minimum nearest-neighbour (NN) distance averaging 7.85%. The sequences were assigned to 1359 Barcode index numbers (BINs) with 1344 of these BINs composed of specimens belonging to a single currently recognized species. There was a perfect correspondence between BIN membership and a known species in 795 cases, while another 197 species were assigned to two or more BINs (556 in total). A few other species (26) were involved in BIN merges or in a combination of merges and splits. There was only a weak relationship between the number of specimens analysed for a species and its BIN count. However, three species were clear outliers with their specimens being placed in 11-22 BINs. Although all BIN splits need further study to clarify the taxonomic status of the entities involved, DNA barcodes discriminated 98% of the 1018 species. The present survey conservatively revealed 16 species new to science, 52 species new to Canada and major range extensions for 426 species. However, if most BIN splits detected in this study reflect cryptic taxa, the true species count for Canadian spiders could be 30-50% higher than currently recognized. © 2015 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.

  16. Application of Sequence-based Methods in Human MicrobialEcology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Weng, Li; Rubin, Edward M.; Bristow, James

    2005-08-29

    Ecologists studying microbial life in the environment have recognized the enormous complexity of microbial diversity for many years, and the development of a variety of culture-independent methods, many of them coupled with high-throughput DNA sequencing, has allowed this diversity to be explored in ever greater detail. Despite the widespread application of these new techniques to the characterization of uncultivated microbes and microbial communities in the environment, their application to human health and disease has lagged behind. Because DNA based-techniques for defining uncultured microbes allow not only cataloging of microbial diversity, but also insight into microbial functions, investigators are beginning tomore » apply these tools to the microbial communities that abound on and within us, in what has aptly been called the second Human Genome Project. In this review we discuss the sequence-based methods for microbial analysis that are currently available and their application to identify novel human pathogens, improve diagnosis of known infectious diseases, and to advance understanding of our relationship with microbial communities that normally reside in and on the human body.« less

  17. Combining use of a panel of ssDNA aptamers in the detection of Staphylococcus aureus

    PubMed Central

    Cao, Xiaoxiao; Li, Shaohua; Chen, Liucun; Ding, Hongmei; Xu, Hua; Huang, Yanping; Li, Jie; Liu, Nongle; Cao, Weihong; Zhu, Yanjun; Shen, Beifen; Shao, Ningsheng

    2009-01-01

    In this article, a panel of ssDNA aptamers specific to Staphylococcus aureus was obtained by a whole bacterium-based SELEX procedure and applied to probing S. aureus. After several rounds of selection with S. aureus as the target and Streptococcus and S. epidermidis as counter targets, the highly enriched oligonucleic acid pool was sequenced and then grouped under different families on the basis of the homology of the primary sequence and the similarity of the secondary structure. Eleven sequences from different families were selected for further characterization by confocal imaging and flow cytometry analysis. Results showed that five aptamers demonstrated high specificity and affinity to S. aureus individually. The five aptamers recognize different molecular targets by competitive experiment. Combining these five aptamers had a much better effect than the individual aptamer in the recognition of different S. aureus strains. In addition, the combined aptamers can probe single S. aureus in pyogenic fluids. Our work demonstrates that a set of aptamers specific to one bacterium can be used in combination for the identification of the bacterium instead of a single aptamer. PMID:19498077

  18. Combining use of a panel of ssDNA aptamers in the detection of Staphylococcus aureus.

    PubMed

    Cao, Xiaoxiao; Li, Shaohua; Chen, Liucun; Ding, Hongmei; Xu, Hua; Huang, Yanping; Li, Jie; Liu, Nongle; Cao, Weihong; Zhu, Yanjun; Shen, Beifen; Shao, Ningsheng

    2009-08-01

    In this article, a panel of ssDNA aptamers specific to Staphylococcus aureus was obtained by a whole bacterium-based SELEX procedure and applied to probing S. aureus. After several rounds of selection with S. aureus as the target and Streptococcus and S. epidermidis as counter targets, the highly enriched oligonucleic acid pool was sequenced and then grouped under different families on the basis of the homology of the primary sequence and the similarity of the secondary structure. Eleven sequences from different families were selected for further characterization by confocal imaging and flow cytometry analysis. Results showed that five aptamers demonstrated high specificity and affinity to S. aureus individually. The five aptamers recognize different molecular targets by competitive experiment. Combining these five aptamers had a much better effect than the individual aptamer in the recognition of different S. aureus strains. In addition, the combined aptamers can probe single S. aureus in pyogenic fluids. Our work demonstrates that a set of aptamers specific to one bacterium can be used in combination for the identification of the bacterium instead of a single aptamer.

  19. Synthetic Biology Parts for the Storage of Increased Genetic Information in Cells.

    PubMed

    Morris, Sydney E; Feldman, Aaron W; Romesberg, Floyd E

    2017-10-20

    To bestow cells with novel forms and functions, the goal of synthetic biology, we have developed the unnatural nucleoside triphosphates dNaMTP and dTPT3TP, which form an unnatural base pair (UBP) and expand the genetic alphabet. While the UBP may be retained in the DNA of a living cell, its retention is sequence-dependent. We now report a steady-state kinetic characterization of the rate with which the Klenow fragment of E. coli DNA polymerase I synthesizes the UBP and its mispairs in a variety of sequence contexts. Correct UBP synthesis is as efficient as for a natural base pair, except in one sequence context, and in vitro performance is correlated with in vivo performance. The data elucidate the determinants of efficient UBP synthesis, show that the dNaM-dTPT3 UBP is the first generally recognized natural-like base pair, and importantly, demonstrate that dNaMTP and dTPT3TP are well optimized and standardized parts for the expansion of the genetic alphabet.

  20. Structure and Genetic Content of the Megaplasmids of Neurotoxigenic Clostridium butyricum Type E Strains from Italy

    PubMed Central

    Iacobino, Angelo; Scalfaro, Concetta; Franciosa, Giovanna

    2013-01-01

    We determined the genetic maps of the megaplasmids of six neutoroxigenic Clostridium butyricum type E strains from Italy using molecular and bioinformatics techniques. The megaplasmids are circular, not linear as we had previously proposed. The differently-sized megaplasmids share a genetic region that includes structural, metabolic and regulatory genes. In addition, we found that a 168 kb genetic region is present only in the larger megaplasmids of two tested strains, whereas it is absent from the smaller megaplasmids of the four remaining strains. The genetic region unique to the larger megaplasmids contains, among other features, a locus for clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR associated (cas) genes, i.e. a bacterial adaptive immune system providing sequence-specific protection from invading genetic elements. Some CRISPR spacer sequences of the neurotoxigenic C. butyricum type E strains showed homology to prophage, phage and plasmid sequences from closely related clostridia species or from distant species, all sharing the intestinal habitat, suggesting that the CRISPR locus might be involved in the microorganism adaptation to the human or animal intestinal environment. Besides, we report here that each of four distinct CRISPR spacers partially matched DNA sequences of different prophages and phages, at identical nucleotide locations. This suggests that, at least in neurotoxigenic C. butyricum type E, the CRISPR locus is potentially able to recognize the same conserved DNA sequence of different invading genetic elements, besides targeting sequences unique to previously encountered invading DNA, as currently predicted for a CRISPR locus. Thus, the results of this study introduce the possibility that CRISPR loci can provide resistance to a wider range of invading DNA elements than previously appreciated. Whether it is more advantageous for the peculiar neurotoxigenic C. butyricum type E strains to maintain or to lose the CRISPR-cas system remains an open question. PMID:23967192

  1. Combination of the immunization with the sequence close to the consensus sequence and two DNA prime plus one VLP boost generate H5 hemagglutinin specific broad neutralizing antibodies

    PubMed Central

    Wang, Guiqin; Yin, Renfu; Zhou, Paul; Ding, Zhuang

    2017-01-01

    Hemagglutinin (HA) head has long been considered to be able to elicit only a narrow, strain-specific antibody response as it undergoes rapid antigenic drift. However, we previously showed that a heterologous prime-boost strategy, in which mice were primed twice with DNA encoding HA and boosted once with virus-like particles (VLP) from an H5N1 strain A/Thailand/1(KAN)-1/2004 (noted as TH DDV), induced anti-head broad cross-H5 neutralizing antibody response. To explain why TH DDV immunization could generate such breadth, we systemically compared the neutralization breadth and potency between TH DDV sera and immune sera elicited by TH DDD (three times of DNA immunizations), TH VVV (three times of VLP immunizations), TH DV (one DNA prime plus one VLP boost) and TK DDV (plasmid DNA and VLP derived from another H5N1 strain, A/Turkey/65596/2006). Then we determined the antigenic sites (AS) on TH HA head and the key residues of the main antigenic site. Through the comparison of different regiments, we found that the combination of the immunization with the sequence close to the consensus sequence and two DNA prime plus one VLP boost caused that TH DDV immunization generate broad neutralizing antibodies. Antigenic analysis showed that TH DDV, TH DV, TH DDD and TH VVV sera recognize the common antigenic site AS1. Antibodies directed to AS1 contribute to the largest proportion of the neutralizing activity of these immune sera. Residues 188 and 193 in AS1 are the key residues which are responsible for neutralization breadth of the immune sera. Interestingly, residues 188 and 193 locate in classical antigen sites but are relatively conserved among the 16 tested strains and 1,663 HA sequences from NCBI database. Thus, our results strongly indicate that it is feasible to develop broad cross-H5 influenza vaccines against HA head. PMID:28542275

  2. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Robbins, P.F.; El-Gamil, M.; Li, Y.F.

    The role of tumor-specific T cells in mediating the regression of metastatic melanoma has been suggested by the clinical response of patients to treatment with tumor-infiltrating lymphocytes (TIL). A number of Ags recognized by class I-restricted melanoma-specific T cells have recently been isolated, raising the hope that this will lead to the development of improved therapies. In this study, we report the cloning of a tumor Ag recognized by T cells from melanoma patient 888. Previously, we reported that TIL 888, grown from the tumor of this patient, recognized tyrosinase in an HLA-A24 -restricted fashion. This line, when infused intomore » the autologous patient, resulted in complete regression of multiple metastases. Three years later, a second TIL line, TIL 1290, was isolated from a recurrent pelvic tumor. Infusion of a mixture of TIL 888 and TIL 1290 cell lines into the patient resulted in complete regression of a residual abdominal mass and the patient remains disease-free 2 yr later. The TIL 1290 cell line, which recognized melanoma in an HLAA-A24-restricted manner, failed to recognize tyrosinase. TIL 1290 was then used to screen an 888 melanoma cDNA library, and an Ag was isolated that did not correspond to any found in sequence databases. This gene, termed p15, was found to be expressed in a variety of normal tissues, and a peptide epitope recognized by TIL 1290 was found to represent the product of an nonmutated gene. Screening of additional cDNA pools resulted in the isolation of a second clone which stimulated TIL 1290. This clone also appeared to represent a transcript of the p15 gene, indicating that this gene may encode the predominant Ag recognized by TIL 1290. 27 refs., 4 figs., 5 tabs.« less

  3. Sphingomonas pituitosa sp. nov., an exopolysaccharide-producing bacterium that secretes an unusual type of sphingan.

    PubMed

    Denner, E B; Paukner, S; Kämpfer, P; Moore, E R; Abraham, W R; Busse, H J; Wanner, G; Lubitz, W

    2001-05-01

    Strain EDIVT, an exopolysaccharide-producing bacterium, was subjected to polyphasic characterization. The bacterium produced copious amounts of an extracellular polysaccharide, forming slimy, viscous, intensely yellow-pigmented colonies on Czapek-Dox (CZD) agar. The culture fluids of the liquid version of CZD medium were highly viscous after cultivation for 5 d. Cells of strain EDIVT were Gram-negative, catalase-positive, oxidase-negative, nonspore-forming, rod-shaped and motile. Comparisons of 16S rDNA gene sequences demonstrated that EDIVT clusters phylogenetically with the species of the genus Sphingomonas sensu stricto. The G+C content of the DNA (64.5 mol%), the presence of ubiquinone Q-10, the presence of 2-hydroxymyristic acid (14:0 2-OH) as the major hydroxylated fatty acid, the absence of 3-hydroxy fatty acids and the detection of sym-homospermidine as the major component in the polyamine pattern, together with the presence of sphingoglycolipid, supported this delineation. 16S rDNA sequence analysis indicated that strain EDIVT is most closely related (99.4% similarity) to Sphingomonas trueperi LMG 2142T. DNA-DNA hybridization showed that the level of relatedness to S. trueperi is only 45.5%. Further differences were apparent in the cellular fatty acid profile, the polar lipid pattern, the Fourier-transform infrared spectrum and whole-cell proteins and in a number of biochemical characteristics. On the basis of the estimated phylogenetic position derived from 16S rDNA sequence data, DNA-DNA reassociation and phenotypic differences, strain EDIVT (= CIP 106154T = DSM 13101T) was recognized as a new species of Sphingomonas, for which the name Sphingomonas pituitosa sp. nov. is proposed. A component analysis of the exopolysaccharide (named PS-EDIV) suggested that it represents a novel type of sphingan composed of glucose, rhamnose and an unidentified sugar. Glucuronic acid, which is commonly found in sphingans, was absent. The mean molecular mass of PS-EDIV was approximately 3 x 10(6) Da.

  4. New CRISPR-Cas systems from uncultivated microbes

    NASA Astrophysics Data System (ADS)

    Burstein, David; Harrington, Lucas B.; Strutt, Steven C.; Probst, Alexander J.; Anantharaman, Karthik; Thomas, Brian C.; Doudna, Jennifer A.; Banfield, Jillian F.

    2017-02-01

    CRISPR-Cas systems provide microbes with adaptive immunity by employing short DNA sequences, termed spacers, that guide Cas proteins to cleave foreign DNA. Class 2 CRISPR-Cas systems are streamlined versions, in which a single RNA-bound Cas protein recognizes and cleaves target sequences. The programmable nature of these minimal systems has enabled researchers to repurpose them into a versatile technology that is broadly revolutionizing biological and clinical research. However, current CRISPR-Cas technologies are based solely on systems from isolated bacteria, leaving the vast majority of enzymes from organisms that have not been cultured untapped. Metagenomics, the sequencing of DNA extracted directly from natural microbial communities, provides access to the genetic material of a huge array of uncultivated organisms. Here, using genome-resolved metagenomics, we identify a number of CRISPR-Cas systems, including the first reported Cas9 in the archaeal domain of life, to our knowledge. This divergent Cas9 protein was found in little-studied nanoarchaea as part of an active CRISPR-Cas system. In bacteria, we discovered two previously unknown systems, CRISPR-CasX and CRISPR-CasY, which are among the most compact systems yet discovered. Notably, all required functional components were identified by metagenomics, enabling validation of robust in vivo RNA-guided DNA interference activity in Escherichia coli. Interrogation of environmental microbial communities combined with in vivo experiments allows us to access an unprecedented diversity of genomes, the content of which will expand the repertoire of microbe-based biotechnologies.

  5. RPA and POT1: friends or foes at telomeres?

    PubMed

    Flynn, Rachel Litman; Chang, Sandy; Zou, Lee

    2012-02-15

    Telomere maintenance in cycling cells relies on both DNA replication and capping by the protein complex shelterin. Two single-stranded DNA (ssDNA)-binding proteins, replication protein A (RPA) and protection of telomere 1 (POT1) play critical roles in DNA replication and telomere capping, respectively. While RPA binds to ssDNA in a non-sequence-specific manner, POT1 specifically recognizes singlestranded TTAGGG telomeric repeats. Loss of POT1 leads to aberrant accumulation of RPA at telomeres and activation of the ataxia telangiectasia and Rad3-related kinase (ATR)-mediated checkpoint response, suggesting that POT1 antagonizes RPA binding to telomeric ssDNA. The requirement for both POT1 and RPA in telomere maintenance and the antagonism between the two proteins raises the important question of how they function in concert on telomeric ssDNA. Two interesting models were proposed by recent studies to explain the regulation of POT1 and RPA at telomeres. Here, we discuss how these models help unravel the coordination, and also the antagonism, between POT1 and RPA during the cell cycle.

  6. Intrinsic Nucleic Acid Dynamics Modulates HIV-1 Nucleocapsid Protein Binding to Its Targets

    PubMed Central

    Bazzi, Ali; Zargarian, Loussiné; Chaminade, Françoise; De Rocquigny, Hugues; René, Brigitte; Mély, Yves; Fossé, Philippe; Mauffret, Olivier

    2012-01-01

    HIV-1 nucleocapsid protein (NC) is involved in the rearrangement of nucleic acids occurring in key steps of reverse transcription. The protein, through its two zinc fingers, interacts preferentially with unpaired guanines in single-stranded sequences. In mini-cTAR stem-loop, which corresponds to the top half of the cDNA copy of the transactivation response element of the HIV-1 genome, NC was found to exhibit a clear preference for the TGG sequence at the bottom of mini-cTAR stem. To further understand how this site was selected among several potential binding sites containing unpaired guanines, we probed the intrinsic dynamics of mini-cTAR using 13C relaxation measurements. Results of spin relaxation time measurements have been analyzed using the model-free formalism and completed by dispersion relaxation measurements. Our data indicate that the preferentially recognized guanine in the lower part of the stem is exempt of conformational exchange and highly mobile. In contrast, the unrecognized unpaired guanines of mini-cTAR are involved in conformational exchange, probably related to transient base-pairs. These findings support the notion that NC preferentially recognizes unpaired guanines exhibiting a high degree of mobility. The ability of NC to discriminate between close sequences through their dynamic properties contributes to understanding how NC recognizes specific sites within the HIV genome. PMID:22745685

  7. Cellulophaga geojensis sp. nov., a member of the family Flavobacteriaceae isolated from marine sand.

    PubMed

    Park, Sooyeon; Oh, Ki-Hoon; Lee, Soo-Young; Oh, Tae-Kwang; Yoon, Jung-Hoon

    2012-06-01

    A Gram-stain-negative, aerobic, non-flagellated, non-spore-forming, motile (by gliding) bacterial strain, designated M-M6(T), was isolated from marine sand of Geoje island, Korea. Strain M-M6(T) grew optimally at 25 °C, at pH 7.0-8.0 and in the presence of 2 % (w/v) NaCl. Phylogenetic analyses based on 16S rRNA gene sequences revealed that strain M-M6(T) fell within the clade comprising Cellulophaga species, forming a coherent cluster with Cellulophaga lytica ATCC 23178(T) and Cellulophaga fucicola NN015860(T), with which it shared 16S rRNA gene sequence similarities of 98.1 and 98.2 %, respectively. Sequence similarities between strain M-M6(T) and the type strains of other recognized Cellulophaga species were in the range 92.4-93.8 %. Strain M-M6(T) contained MK-6 as the predominant menaquinone and iso-C(15:0), iso-C(15:1) G, iso-C(17:0) 3-OH, and C(16:1)ω7c and/or iso-C(15:0) 2-OH as the major fatty acids. The major polar lipids detected in strain M-M6(T) and the type strains of C. lytica and C. fucicola were two unidentified lipids, one unidentified aminolipid and one unidentified aminophospholipid. The DNA G+C content of strain M-M6(T) was 35.4 mol%. Levels of DNA-DNA relatedness between strain M-M6(T) and C. lytica JCM 8516(T) and C. fucicola JCM 21778(T) were 33 and 35 %, respectively. Differential phenotypic properties and phylogenetic and genetic distinctiveness distinguished strain M-M6(T) from all recognized Cellulophaga species. On the basis of the data presented, strain M-M6(T) is considered to represent a novel species of the genus Cellulophaga, for which the name Cellulophaga geojensis sp. nov. is proposed. The type strain is M-M6(T) ( = KCTC 23498(T) = CCUG 60801(T)).

  8. Evolution of helotialean fungi (Leotiomycetes, Pezizomycotina): a nuclear rDNA phylogeny.

    PubMed

    Wang, Zheng; Binder, Manfred; Schoch, Conrad L; Johnston, Peter R; Spatafora, Joseph W; Hibbett, David S

    2006-11-01

    The highly divergent characters of morphology, ecology, and biology in the Helotiales make it one of the most problematic groups in traditional classification and molecular phylogeny. Sequences of three rDNA regions, SSU, LSU, and 5.8S rDNA, were generated for 50 helotialean fungi, representing 11 out of 13 families in the current classification. Data sets with different compositions were assembled, and parsimony and Bayesian analyses were performed. The phylogenetic distribution of lifestyle and ecological factors was assessed. Plant endophytism is distributed across multiple clades in the Leotiomycetes. Our results suggest that (1) the inclusion of LSU rDNA and a wider taxon sampling greatly improves resolution of the Helotiales phylogeny, however, the usefulness of rDNA in resolving the deep relationships within the Leotiomycetes is limited; (2) a new class Geoglossomycetes, including Geoglossum, Trichoglossum, and Sarcoleotia, is the basal lineage of the Leotiomyceta; (3) the Leotiomycetes, including the Helotiales, Erysiphales, Cyttariales, Rhytismatales, and Myxotrichaceae, is monophyletic; and (4) nine clades can be recognized within the Helotiales.

  9. Thermodynamics of DNA target site recognition by homing endonucleases

    PubMed Central

    Eastberg, Jennifer H.; Smith, Audrey McConnell; Zhao, Lei; Ashworth, Justin; Shen, Betty W.; Stoddard, Barry L.

    2007-01-01

    The thermodynamic profiles of target site recognition have been surveyed for homing endonucleases from various structural families. Similar to DNA-binding proteins that recognize shorter target sites, homing endonucleases display a narrow range of binding free energies and affinities, mediated by structural interactions that balance the magnitude of enthalpic and entropic forces. While the balance of ΔH and TΔS are not strongly correlated with the overall extent of DNA bending, unfavorable ΔHbinding is associated with unstacking of individual base steps in the target site. The effects of deleterious basepair substitutions in the optimal target sites of two LAGLIDADG homing endonucleases, and the subsequent effect of redesigning one of those endonucleases to accommodate that DNA sequence change, were also measured. The substitution of base-specific hydrogen bonds in a wild-type endonuclease/DNA complex with hydrophobic van der Waals contacts in a redesigned complex reduced the ability to discriminate between sites, due to nonspecific ΔSbinding. PMID:17947319

  10. The force-dependent mechanism of DnaK-mediated mechanical folding

    PubMed Central

    Perales-Calvo, Judit; Giganti, David; Stirnemann, Guillaume; Garcia-Manyes, Sergi

    2018-01-01

    It is well established that chaperones modulate the protein folding free-energy landscape. However, the molecular determinants underlying chaperone-mediated mechanical folding remain largely elusive, primarily because the force-extended unfolded conformation fundamentally differs from that characterized in biochemistry experiments. We use single-molecule force-clamp spectroscopy, combined with molecular dynamics simulations, to study the effect that the Hsp70 system has on the mechanical folding of three mechanically stiff model proteins. Our results demonstrate that, when working independently, DnaJ (Hsp40) and DnaK (Hsp70) work as holdases, blocking refolding by binding to distinct substrate conformations. Whereas DnaK binds to molten globule–like forms, DnaJ recognizes a cryptic sequence in the extended state in an unanticipated force-dependent manner. By contrast, the synergetic coupling of the Hsp70 system exhibits a marked foldase behavior. Our results offer unprecedented molecular and kinetic insights into the mechanisms by which mechanical force finely regulates chaperone binding, directly affecting protein elasticity. PMID:29487911

  11. Phylogenetic Relationships and Species Delimitation in Pinus Section Trifoliae Inferrred from Plastid DNA

    PubMed Central

    Hernández-León, Sergio; Gernandt, David S.; Pérez de la Rosa, Jorge A.; Jardón-Barbolla, Lev

    2013-01-01

    Recent diversification followed by secondary contact and hybridization may explain complex patterns of intra- and interspecific morphological and genetic variation in the North American hard pines (Pinus section Trifoliae), a group of approximately 49 tree species distributed in North and Central America and the Caribbean islands. We concatenated five plastid DNA markers for an average of 3.9 individuals per putative species and assessed the suitability of the five regions as DNA bar codes for species identification, species delimitation, and phylogenetic reconstruction. The ycf1 gene accounted for the greatest proportion of the alignment (46.9%), the greatest proportion of variable sites (74.9%), and the most unique sequences (75 haplotypes). Phylogenetic analysis recovered clades corresponding to subsections Australes, Contortae, and Ponderosae. Sequences for 23 of the 49 species were monophyletic and sequences for another 9 species were paraphyletic. Morphologically similar species within subsections usually grouped together, but there were exceptions consistent with incomplete lineage sorting or introgression. Bayesian relaxed molecular clock analyses indicated that all three subsections diversified relatively recently during the Miocene. The general mixed Yule-coalescent method gave a mixed model estimate of only 22 or 23 evolutionary entities for the plastid sequences, which corresponds to less than half the 49 species recognized based on morphological species assignments. Including more unique haplotypes per species may result in higher estimates, but low mutation rates, recent diversification, and large effective population sizes may limit the effectiveness of this method to detect evolutionary entities. PMID:23936218

  12. Cloning metallothionein gene in Zacco platypus and its potential as an exposure biomarker against cadmium.

    PubMed

    Lee, Sangwoo; Kim, Cheolmin; Kim, Jungkon; Kim, Woo-Keun; Shin, Hyun Suk; Lim, Eun-Suk; Lee, Jin Wuk; Kim, Sunmi; Kim, Ki-Tae; Lee, Sung-Kyu; Choi, Cheol Young; Choi, Kyungho

    2015-07-01

    Zacco platypus, pale chub, is an indigenous freshwater fish of East Asia including Korea and has many useful characteristics as indicator species for water pollution. While utility of Z. platypus as an experimental species has been recognized, genetic-level information is very limited and warrants extensive research. Metallothionein (MT) is widely used and well-known biomarker for heavy metal exposure in many experimental species. In the present study, we cloned MT in Z. platypus and evaluated its utility as a biomarker for metal exposure. For this purpose, we sequenced complete complementary DNA (cDNA) of MT in Z. platypus and carried out phylogenetic analysis with its sequences. The transcription-level responses of MT gene following the exposure to CdCl2 were also assessed to validate the utility of this gene as an exposure biomarker. Analysis of cDNA sequence of MT gene demonstrated high conformity with those of other fish. MT messenger RNA (mRNA) expression and enzymatic MT content significantly increased following CdCl2 exposure in a concentration-dependent manner. The level of CdCl2 that resulted in significant MT changes in Z. platypus was within the range that was reported from other fish. The MT gene of Z. platypus sequenced in the present study can be used as a useful biomarker for heavy metal exposure in the aquatic environment of Korea and other countries where this freshwater fish species represents the ecosystem.

  13. Phylogenetic relationships and species delimitation in pinus section trifoliae inferrred from plastid DNA.

    PubMed

    Hernández-León, Sergio; Gernandt, David S; Pérez de la Rosa, Jorge A; Jardón-Barbolla, Lev

    2013-01-01

    Recent diversification followed by secondary contact and hybridization may explain complex patterns of intra- and interspecific morphological and genetic variation in the North American hard pines (Pinus section Trifoliae), a group of approximately 49 tree species distributed in North and Central America and the Caribbean islands. We concatenated five plastid DNA markers for an average of 3.9 individuals per putative species and assessed the suitability of the five regions as DNA bar codes for species identification, species delimitation, and phylogenetic reconstruction. The ycf1 gene accounted for the greatest proportion of the alignment (46.9%), the greatest proportion of variable sites (74.9%), and the most unique sequences (75 haplotypes). Phylogenetic analysis recovered clades corresponding to subsections Australes, Contortae, and Ponderosae. Sequences for 23 of the 49 species were monophyletic and sequences for another 9 species were paraphyletic. Morphologically similar species within subsections usually grouped together, but there were exceptions consistent with incomplete lineage sorting or introgression. Bayesian relaxed molecular clock analyses indicated that all three subsections diversified relatively recently during the Miocene. The general mixed Yule-coalescent method gave a mixed model estimate of only 22 or 23 evolutionary entities for the plastid sequences, which corresponds to less than half the 49 species recognized based on morphological species assignments. Including more unique haplotypes per species may result in higher estimates, but low mutation rates, recent diversification, and large effective population sizes may limit the effectiveness of this method to detect evolutionary entities.

  14. Identification of the Quorum-Sensing Target DNA Sequence and N-Acyl Homoserine Lactone Responsiveness of the Brucella abortus virB promoter▿

    PubMed Central

    Arocena, Gastón M.; Sieira, Rodrigo; Comerci, Diego J.; Ugalde, Rodolfo A.

    2010-01-01

    VjbR is a LuxR-type quorum-sensing (QS) regulator that plays an essential role in the virulence of the intracellular facultative pathogen Brucella, the causative agent of brucellosis. It was previously described that VjbR regulates a diverse group of genes, including the virB operon. The latter codes for a type IV secretion system (T4SS) that is central for the pathogenesis of Brucella. Although the regulatory role of VjbR on the virB promoter (PvirB) was extensively studied by different groups, the VjbR-binding site had not been identified so far. Here, we identified the target DNA sequence of VjbR in PvirB by DNase I footprinting analyses. Surprisingly, we observed that VjbR specifically recognizes a sequence that is identical to a half-binding site of the QS-related regulator MrtR of Mesorhizobium tianshanense. As shown by DNase I footprinting and electrophoretic mobility shift assays, generation of a palindromic MrtR-like-binding site in PvirB increased both the affinity and the stability of the VjbR-DNA complex, which confirmed that the QS regulator of Brucella is highly related to that of M. tianshanense. The addition of N-dodecanoyl homoserine lactone dissociated VjbR from the promoter, which confirmed previous reports that indicated a negative effect of this signal on the VjbR-mediated activation of PvirB. Our results provide new molecular evidence for the structure of the virB promoter and reveal unusual features of the QS target DNA sequence of the main regulator of virulence in Brucella. PMID:20400542

  15. Genetic ancestry of the extinct Javan and Bali tigers.

    PubMed

    Xue, Hao-Ran; Yamaguchi, Nobuyuki; Driscoll, Carlos A; Han, Yu; Bar-Gal, Gila Kahila; Zhuang, Yan; Mazak, Ji H; Macdonald, David W; O'Brien, Stephen J; Luo, Shu-Jin

    2015-01-01

    The Bali (Panthera tigris balica) and Javan (P. t. sondaica) tigers are recognized as distinct tiger subspecies that went extinct in the 1940s and 1980s, respectively. Yet their genetic ancestry and taxonomic status remain controversial. Following ancient DNA procedures, we generated concatenated 1750bp mtDNA sequences from 23 museum samples including 11 voucher specimens from Java and Bali and compared these to diagnostic mtDNA sequences from 122 specimens of living tiger subspecies and the extinct Caspian tiger. The results revealed a close genetic affinity of the 3 groups from the Sunda Islands (Bali, Javan, and Sumatran tigers P. t. sumatrae). Bali and Javan mtDNA haplotypes differ from Sumatran haplotypes by 1-2 nucleotides, and the 3 island populations define a monophyletic assemblage distinctive and equidistant from other mainland subspecies. Despite this close phylogenetic relationship, no mtDNA haplotype was shared between Sumatran and Javan/Bali tigers, indicating little or no matrilineal gene flow among the islands after they were colonized. The close phylogenetic relationship among Sunda tiger subspecies suggests either recent colonization across the islands, or else a once continuous tiger population that had subsequently isolated into different island subspecies. This supports the hypothesis that the Sumatran tiger is the closest living relative to the extinct Javan and Bali tigers. © The American Genetic Association 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  16. The Hemiptera (Insecta) of Canada: Constructing a Reference Library of DNA Barcodes

    PubMed Central

    Gwiazdowski, Rodger A.; Foottit, Robert G.; Maw, H. Eric L.; Hebert, Paul D. N.

    2015-01-01

    DNA barcode reference libraries linked to voucher specimens create new opportunities for high-throughput identification and taxonomic re-evaluations. This study provides a DNA barcode library for about 45% of the recognized species of Canadian Hemiptera, and the publically available R workflow used for its generation. The current library is based on the analysis of 20,851 specimens including 1849 species belonging to 628 genera and 64 families. These individuals were assigned to 1867 Barcode Index Numbers (BINs), sequence clusters that often coincide with species recognized through prior taxonomy. Museum collections were a key source for identified specimens, but we also employed high-throughput collection methods that generated large numbers of unidentified specimens. Many of these specimens represented novel BINs that were subsequently identified by taxonomists, adding barcode coverage for additional species. Our analyses based on both approaches includes 94 species not listed in the most recent Canadian checklist, representing a potential 3% increase in the fauna. We discuss the development of our workflow in the context of prior DNA barcode library construction projects, emphasizing the importance of delineating a set of reference specimens to aid investigations in cases of nomenclatural and DNA barcode discordance. The identification for each specimen in the reference set can be annotated on the Barcode of Life Data System (BOLD), allowing experts to highlight questionable identifications; annotations can be added by any registered user of BOLD, and instructions for this are provided. PMID:25923328

  17. DNA is structured as a linear "jigsaw puzzle" in the genomes of Arabidopsis, rice, and budding yeast.

    PubMed

    Liu, Yun-Hua; Zhang, Meiping; Wu, Chengcang; Huang, James J; Zhang, Hong-Bin

    2014-01-01

    Knowledge of how a genome is structured and organized from its constituent elements is crucial to understanding its biology and evolution. Here, we report the genome structuring and organization pattern as revealed by systems analysis of the sequences of three model species, Arabidopsis, rice and yeast, at the whole-genome and chromosome levels. We found that all fundamental function elements (FFE) constituting the genomes, including genes (GEN), DNA transposable elements (DTE), retrotransposable elements (RTE), simple sequence repeats (SSR), and (or) low complexity repeats (LCR), are structured in a nonrandom and correlative manner, thus leading to a hypothesis that the DNA of the species is structured as a linear "jigsaw puzzle". Furthermore, we showed that different FFE differ in their importance in the formation and evolution of the DNA jigsaw puzzle structure between species. DTE and RTE play more important roles than GEN, LCR, and SSR in Arabidopsis, whereas GEN and RTE play more important roles than LCR, SSR, and DTE in rice. The genes having multiple recognized functions play more important roles than those having single functions. These results provide useful knowledge necessary for better understanding genome biology and evolution of the species and for effective molecular breeding of rice.

  18. The mining of pearl formation genes in pearl oyster Pinctada fucata by cDNA suppression subtractive hybridization.

    PubMed

    Wang, Ning; Kinoshita, Shigeharu; Nomura, Naoko; Riho, Chihiro; Maeyama, Kaoru; Nagai, Kiyohito; Watabe, Shugo

    2012-04-01

    Recent researches revealed the regional preference of biomineralization gene transcription in the pearl oyster Pinctada fucata: it transcribed mainly the genes responsible for nacre secretion in mantle pallial, whereas the ones regulating calcite shells expressed in mantle edge. This study took use of this character and constructed the forward and reverse suppression subtractive hybridization (SSH) cDNA libraries. A total of 669 cDNA clones were sequenced and 360 expressed sequence tags (ESTs) greater than 100 bp were generated. Functional annotation associated 95 ESTs with specific functions, and 79 among them were identified from P. fucata at the first time. In the forward SSH cDNA library, it recognized mass amount of nacre protein genes, biomineralization genes dominantly expressed in the mantle pallial, calcium-ion-binding genes, and other biomineralization-related genes important for pearl formation. Real-time PCR showed that all the examined genes were distributed in oyster mantle tissues with a consistence to the SSH design. The detection of their RNA transcripts in pearl sac confirmed that the identified genes were certainly involved in pearl formation. Therefore, the data from this work will initiate a new round of pearl formation gene study and shed new insights into molluscan biomineralization.

  19. Conformational heterogeneity and bubble dynamics in single bacterial transcription initiation complexes

    PubMed Central

    Duchi, Diego; Gryte, Kristofer; Robb, Nicole C; Morichaud, Zakia; Sheppard, Carol; Wigneshweraraj, Sivaramesh

    2018-01-01

    Abstract Transcription initiation is a major step in gene regulation for all organisms. In bacteria, the promoter DNA is first recognized by RNA polymerase (RNAP) to yield an initial closed complex. This complex subsequently undergoes conformational changes resulting in DNA strand separation to form a transcription bubble and an RNAP-promoter open complex; however, the series and sequence of conformational changes, and the factors that influence them are unclear. To address the conformational landscape and transitions in transcription initiation, we applied single-molecule Förster resonance energy transfer (smFRET) on immobilized Escherichia coli transcription open complexes. Our results revealed the existence of two stable states within RNAP–DNA complexes in which the promoter DNA appears to adopt closed and partially open conformations, and we observed large-scale transitions in which the transcription bubble fluctuated between open and closed states; these transitions, which occur roughly on the 0.1 s timescale, are distinct from the millisecond-timescale dynamics previously observed within diffusing open complexes. Mutational studies indicated that the σ70 region 3.2 of the RNAP significantly affected the bubble dynamics. Our results have implications for many steps of transcription initiation, and support a bend-load-open model for the sequence of transitions leading to bubble opening during open complex formation. PMID:29177430

  20. Epigenetics of oropharyngeal squamous cell carcinoma: opportunities for novel chemotherapeutic targets.

    PubMed

    Lindsay, Cameron; Seikaly, Hadi; Biron, Vincent L

    2017-01-31

    Epigenetic modifications are heritable changes in gene expression that do not directly alter DNA sequence. These modifications include DNA methylation, histone post-translational modifications, small and non-coding RNAs. Alterations in epigenetic profiles cause deregulation of fundamental gene expression pathways associated with carcinogenesis. The role of epigenetics in oropharyngeal squamous cell carcinoma (OPSCC) has recently been recognized, with implications for novel biomarkers, molecular diagnostics and chemotherapeutics. In this review, important epigenetic pathways in human papillomavirus (HPV) positive and negative OPSCC are summarized, as well as the potential clinical utility of this knowledge.This material has never been published and is not currently under evaluation in any other peer-reviewed publication.

  1. Selection and Characterization of Single Stranded DNA Aptamers for the Hormone Abscisic Acid

    PubMed Central

    Gonzalez, Victor M.; Millo, Enrico; Sturla, Laura; Vigliarolo, Tiziana; Bagnasco, Luca; Guida, Lucrezia; D'Arrigo, Cristina; De Flora, Antonio; Salis, Annalisa; Martin, Elena M.; Bellotti, Marta; Zocchi, Elena

    2013-01-01

    The hormone abscisic acid (ABA) is a small molecule involved in pivotal physiological functions in higher plants. Recently, ABA has been also identified as an endogenous hormone in mammals, regulating different cell functions including inflammatory processes, stem cell expansion, insulin release, and glucose uptake. Aptamers are short, single-stranded (ss) oligonucleotidesable to recognize target molecules with high affinity. The small size of the ABA molecule represented a challenge for aptamer development and the aim of this study was to develop specific anti-ABA DNA aptamers. Biotinylated abscisic acid (bio-ABA) was immobilized on streptavidin-coated magnetic beads. DNA aptamers against bio-ABA were selected with 7 iterative rounds of the systematic evolution of ligands by exponential enrichment method (SELEX), each round comprising incubation of the ABA-binding beads with the ssDNA sequences, DNA elution, electrophoresis, and polymerase chain reaction (PCR) amplification. The PCR product was cloned and sequenced. The binding affinity of several clones was determined using bio-ABA immobilized on streptavidin-coated plates. Aptamer 2 and aptamer 9 showed the highest binding affinity, with dissociation constants values of 0.98±0.14 μM and 0.80±0.07 μM, respectively. Aptamers 2 and 9 were also able to bind free, unmodified ABA and to discriminate between different ABA enantiomers and isomers. Our findings indicate that ssDNA aptamers can selectively bind ABA and could be used for the development of ABA quantitation assays. PMID:23971905

  2. Chloroplast DNA sequence of the green alga Oedogonium cardiacum (Chlorophyceae): Unique genome architecture, derived characters shared with the Chaetophorales and novel genes acquired through horizontal transfer

    PubMed Central

    Brouard, Jean-Simon; Otis, Christian; Lemieux, Claude; Turmel, Monique

    2008-01-01

    Background To gain insight into the branching order of the five main lineages currently recognized in the green algal class Chlorophyceae and to expand our understanding of chloroplast genome evolution, we have undertaken the sequencing of chloroplast DNA (cpDNA) from representative taxa. The complete cpDNA sequences previously reported for Chlamydomonas (Chlamydomonadales), Scenedesmus (Sphaeropleales), and Stigeoclonium (Chaetophorales) revealed tremendous variability in their architecture, the retention of only few ancestral gene clusters, and derived clusters shared by Chlamydomonas and Scenedesmus. Unexpectedly, our recent phylogenies inferred from these cpDNAs and the partial sequences of three other chlorophycean cpDNAs disclosed two major clades, one uniting the Chlamydomonadales and Sphaeropleales (CS clade) and the other uniting the Oedogoniales, Chaetophorales and Chaetopeltidales (OCC clade). Although molecular signatures provided strong support for this dichotomy and for the branching of the Oedogoniales as the earliest-diverging lineage of the OCC clade, more data are required to validate these phylogenies. We describe here the complete cpDNA sequence of Oedogonium cardiacum (Oedogoniales). Results Like its three chlorophycean homologues, the 196,547-bp Oedogonium chloroplast genome displays a distinctive architecture. This genome is one of the most compact among photosynthetic chlorophytes. It has an atypical quadripartite structure, is intron-rich (17 group I and 4 group II introns), and displays 99 different conserved genes and four long open reading frames (ORFs), three of which are clustered in the spacious inverted repeat of 35,493 bp. Intriguingly, two of these ORFs (int and dpoB) revealed high similarities to genes not usually found in cpDNA. At the gene content and gene order levels, the Oedogonium genome most closely resembles its Stigeoclonium counterpart. Characters shared by these chlorophyceans but missing in members of the CS clade include the retention of psaM, rpl32 and trnL(caa), the loss of petA, the disruption of three ancestral clusters and the presence of five derived gene clusters. Conclusion The Oedogonium chloroplast genome disclosed additional characters that bolster the evidence for a close alliance between the Oedogoniales and Chaetophorales. Our unprecedented finding of int and dpoB in this cpDNA provides a clear example that novel genes were acquired by the chloroplast genome through horizontal transfers, possibly from a mitochondrial genome donor. PMID:18558012

  3. Molecular phylogenetics of subfamily Ornithogaloideae (Hyacinthaceae) based on nuclear and plastid DNA regions, including a new taxonomic arrangement

    PubMed Central

    Martínez-Azorín, Mario; Crespo, Manuel B.; Juan, Ana; Fay, Michael F.

    2011-01-01

    Background and Aims The taxonomic arrangement within subfamily Ornithogaloideae (Hyacinthaceae) has been a matter of controversy in recent decades: several new taxonomic treatments have been proposed, based exclusively on plastid DNA sequences, and these have resulted in classifications which are to a great extent contradictory. Some authors have recognized only a single genus Ornithogalum for the whole subfamily, including 250–300 species of variable morphology, whereas others have recognized many genera. In the latter case, the genera are inevitably much smaller and they are better defined morphologically. However, some are not monophyletic as circumscribed. Methods Phylogenetic analyses of Ornithogaloideae were based on nucleotide sequences of four plastid regions (trnL intron, trnL-F spacer, rbcL and matK) and a nuclear region (ITS). Eighty species covering all relevant taxonomic groups previously recognized in the subfamily were sampled. Parsimony and Bayesian analyses were performed. The molecular data were compared with a matrix of 34 morphological characters. Key Results Combinations of plastid and nuclear data yielded phylogenetic trees which are better resolved than those obtained with any plastid region alone or plastid regions in combination. Three main clades are found, corresponding to the previously recognized tribes Albuceae, Dipcadieae and Ornithogaleae. In these, up to 19 clades are described which are definable by morphology and biogeography. These mostly correspond to previously described taxa, though some need recircumscription. Morphological characters are assessed for their diagnostic value for taxonomy in the subfamily. Conclusions On the basis of the phylogenetic analyses, 19 monophyletic genera are accepted within Ornithogaloideae: Albuca, Avonsera, Battandiera, Cathissa, Coilonox, Dipcadi, Eliokarmos, Elsiea, Ethesia, Galtonia, Honorius, Loncomelos, Melomphis, Neopatersonia, Nicipe, Ornithogalum, Pseudogaltonia, Stellarioides and Trimelopter. Each of these has a particular syndrome of morphological characters. As a result, 105 new combinations are made and two new names are proposed to accommodate the taxa studied in the new arrangement. A short morphological diagnosis, synonymy, details of distribution and an identification key are presented. PMID:21163815

  4. A pan-Theileria FRET-qPCR survey for Theileria spp. in ruminants from nine provinces of China.

    PubMed

    Yang, Yi; Mao, Yongjiang; Kelly, Patrick; Yang, Zhangpin; Luan, Lu; Zhang, Jilei; Li, Jing; El-Mahallawy, Heba S; Wang, Chengming

    2014-08-31

    Theileria spp. are tick transmitted protozoa that can infect large and small ruminants causing disease and economic losses. Diagnosis of infections is often challenging, as parasites can be difficult to detect and identify microscopically and serology is unreliable. While there are PCR assays which can identify certain Theileria spp., there is no one PCR that has been designed to identify all recognized species that occur in ruminants and which will greatly simplify the laboratory diagnoses of infections. Primers and probes for a genus-specific pan-Theileria FRET-qPCR were selected by comparing sequences of recognized Theileria spp. in GenBank and the test validated using reference organisms. The assay was also tested on whole blood samples from large and small ruminants from nine provinces in China. The pan-Theileria FRET-qPCR detected all recognized species but none of the closely related protozoa. In whole blood samples from animals in China, Theileria spp. DNA was detected in 53.2% of the sheep tested (59/111), 44.4% of the goats (120/270) and 30.8% of the cattle (380/1,235). Water buffaloes (n = 29) were negative. Sequencing of some of the PCR products showed cattle in China were infected with T. orientalis/T. sergenti/T. buffeli group while T. ovis and T. luwenshuni were found in sheep and T. luwenshuni in goats. The prevalence of Theileria DNA was significantly higher in Bos p. indicus than in Bos p. taurus (77.7% vs. 18.3%) and copy numbers were also significantly higher (10(4.88) vs. 10(3.00) Theileria 18S rRNA gene copies/per ml whole blood). The pan-Theileria FRET-qPCR can detect all recognized Theileria spp. of ruminants in a single reaction. Large and small ruminants in China are commonly infected with a variety of Theileria spp.

  5. Pathogen profiling for disease management and surveillance.

    PubMed

    Sintchenko, Vitali; Iredell, Jonathan R; Gilbert, Gwendolyn L

    2007-06-01

    The usefulness of rapid pathogen genotyping is widely recognized, but its effective interpretation and application requires integration into clinical and public health decision-making. How can pathogen genotyping data best be translated to inform disease management and surveillance? Pathogen profiling integrates microbial genomics data into communicable disease control by consolidating phenotypic identity-based methods with DNA microarrays, proteomics, metabolomics and sequence-based typing. Sharing data on pathogen profiles should facilitate our understanding of transmission patterns and the dynamics of epidemics.

  6. Novel technique used to treat melanoma and epithelial tumors in new clinical trial | Center for Cancer Research

    Cancer.gov

    Exomic sequencing allows researchers to read the “letters” in the part of your DNA that makes proteins to see where the letters are correct and where the letters are incorrect. This information allows white blood cells engineered from the patient to recognize these tumor-specific mutations and be made into vaccines, called dendritic cell (DC) vaccines, to test effects on melanoma and epithelial tumors. Read more…

  7. Paramyosin from the parasitic mite Sarcoptes scabiei: cDNA cloning and heterologous expression.

    PubMed

    Mattsson, J G; Ljunggren, E L; Bergström, K

    2001-05-01

    The burrowing mite Sarcoptes scabiei is the causative agent of the highly contagious disease sarcoptic mange or scabies. So far, there is no in vitro propagation system for S. scabiei available, and mites used for various purposes must be isolated from infected hosts. Lack of parasite-derived material has limited the possibilities to study several aspects of scabies, including pathogenesis and immunity. It has also hampered the development of high performance serological assays. We have now constructed an S. scabiei cDNA expression library with mRNA purified from mites isolated from red foxes. Immunoscreening of the library enabled us to clone a full-length cDNA coding for a 102.5 kDa protein. Sequence similarity searches identified the protein as a paramyosin. Recombinant S. scabiei paramyosin expressed in Escherichia coli was recognized by sera from dogs and swine infected with S. scabiei. We also designed a small paramyosin construct of about 17 kDa that included the N-terminal part, an evolutionary variable part of the helical core, and the C-terminal part of the molecule. The miniaturized protein was efficiently expressed in E. coli and was recognized by sera from immunized rabbits. These data demonstrate that the cDNA library can assist in the isolation of important S. scabiei antigens and that recombinant proteins can be useful for the study of scabies.

  8. The novel primers for mammal species identification-based mitochondrial cytochrome b sequence: implication for reserved wild animals in Thailand and endangered mammal species in Southeast Asia.

    PubMed

    Muangkram, Yuttamol; Wajjwalku, Worawidh; Amano, Akira; Sukmak, Manakorn

    2018-01-01

    We presented the powerful techniques for species identification using the short amplicon of mitochondrial cytochrome b gene sequence. Two faecal samples and one single hair sample of the Asian tapir were tested using the new cytochrome b primers. The results showed a high sequence similarity with the mainland Asian tapir group. The comparative sequence analysis of the reserved wild mammals in Thailand and the other endangered mammal species from Southeast Asia comprehensibly verified the potential of our novel primers. The forward and reverse primers were 94.2 and 93.2%, respectively, by the average value of the sequence identity among 77 species sequences, and the overall mean distance was 35.9%. This development technique could provide rapid, simple, and reliable tools for species confirmation. Especially, it could recognize the problematic biological specimens contained less DNA material from illegal products and assist with wildlife crime investigation of threatened species and related forensic casework.

  9. Label-Free Sensitive Detection of DNA Methyltransferase by Target-Induced Hyperbranched Amplification with Zero Background Signal.

    PubMed

    Zhang, Yan; Wang, Xin-Yan; Zhang, Qianyi; Zhang, Chun-Yang

    2017-11-21

    DNA methyltransferases (MTases) may specifically recognize the short palindromic sequences and transfer a methyl group from S-adenosyl-l-methionine to target cytosine/adenine. The aberrant DNA methylation is linked to the abnormal DNA MTase activity, and some DNA MTases have become promising targets of anticancer/antimicrobial drugs. However, the reported DNA MTase assays often involve laborious operation, expensive instruments, and radio-labeled substrates. Here, we develop a simple and label-free fluorescent method to sensitively detect DNA adenine methyltransferase (Dam) on the basis of terminal deoxynucleotidyl transferase (TdT)-activated Endonuclease IV (Endo IV)-assisted hyperbranched amplification. We design a hairpin probe with a palindromic sequence in the stem as the substrate and a NH 2 -modified 3' end for the prevention of nonspecific amplification. The substrate may be methylated by Dam and subsequently cleaved by DpnI, producing three single-stranded DNAs, two of which with 3'-OH termini may be amplified by hyperbranched amplification to generate a distinct fluorescence signal. Because high exactitude of TdT enables the amplification only in the presence of free 3'-OH termini and Endo IV only hydrolyzes the intact apurinic/apyrimidinic sites in double-stranded DNAs, zero background signal can be achieved. This method exhibits excellent selectivity and high sensitivity with a limit of detection of 0.003 U/mL for pure Dam and 9.61 × 10 -6 mg/mL for Dam in E. coli cells. Moreover, it can be used to screen the Dam inhibitors, holding great potentials in disease diagnosis and drug development.

  10. Crystal structure of APOBEC3A bound to single-stranded DNA reveals structural basis for cytidine deamination and specificity.

    PubMed

    Kouno, Takahide; Silvas, Tania V; Hilbert, Brendan J; Shandilya, Shivender M D; Bohn, Markus F; Kelch, Brian A; Royer, William E; Somasundaran, Mohan; Kurt Yilmaz, Nese; Matsuo, Hiroshi; Schiffer, Celia A

    2017-04-28

    Nucleic acid editing enzymes are essential components of the immune system that lethally mutate viral pathogens and somatically mutate immunoglobulins, and contribute to the diversification and lethality of cancers. Among these enzymes are the seven human APOBEC3 deoxycytidine deaminases, each with unique target sequence specificity and subcellular localization. While the enzymology and biological consequences have been extensively studied, the mechanism by which APOBEC3s recognize and edit DNA remains elusive. Here we present the crystal structure of a complex of a cytidine deaminase with ssDNA bound in the active site at 2.2 Å. This structure not only visualizes the active site poised for catalysis of APOBEC3A, but pinpoints the residues that confer specificity towards CC/TC motifs. The APOBEC3A-ssDNA complex defines the 5'-3' directionality and subtle conformational changes that clench the ssDNA within the binding groove, revealing the architecture and mechanism of ssDNA recognition that is likely conserved among all polynucleotide deaminases, thereby opening the door for the design of mechanistic-based therapeutics.

  11. Molecular cloning of crustins from the hemocytes of Brazilian penaeid shrimps.

    PubMed

    Rosa, Rafael Diego; Bandeira, Paula Terra; Barracco, Margherita Anna

    2007-09-01

    Crustins are antimicrobial peptides initially identified in the hemocytes of the crab Carcinus maenas (11.5-kDa peptide or carcinin) and recently also recognized in penaeid shrimps and other crustacean species. The aim of this study was to identify sequences encoding for crustins from the hemocytes of four Brazilian penaeid species: Farfantepenaeus paulensis, Farfantepenaeus subtilis, Farfantepenaeus brasiliensis and Litopenaeus schmitti. Using primers based on consensus nucleotide alignment of crustins from different crustaceans, cDNA sequences coding for crustins in all indigenous penaeid species were amplified. The obtained four crustin sequences encoded for peptides containing a hydrophobic N-terminal region rich in glycine repeats and a C-terminal part with 12 cysteine residues and a conserved whey acidic protein domain. All obtained crustin sequences showed high amino acidic similarity among each other and with crustins from litopenaeid shrimps (76-98%). This is the first report of crustins in native Brazilian penaeid shrimps.

  12. In vitro isolation and molecular characterization of an Ehrlichia canis strain from São Paulo, Brazil

    PubMed Central

    Aguiar, Daniel M.; Hagiwara, Mitika K.; Labruna, Marcelo B.

    2008-01-01

    An Ehrlichia canis isolate was obtained from an naturally infected dog exhibiting clinical signs of ehrlichiosis in São Paulo Municipality, state of São Paulo, Brazil. The isolate was characterized by PCR and DNA sequencing of portions of the ehrlichial genes dsb, 16SrRNA, and p28. Partial dsb and 16S rRNA sequences were identical to three and five other E. canis strains, respectively, from different countries and continents (including North America, Africa, Asia and Europe). Conversely, the p28 partial sequence for this E. canis (São Paulo) differed by 1, 2, and 2 nucleotides from the corresponding sequences of the E. canis strains Jake (from USA), Oklahoma (USA), and VHE (Venezuela), respectively. The results in this study indicate that E. canis is the only recognized Ehrlichia species infecting dogs in Brazil. PMID:24031251

  13. Systematics and distribution of Cristaria plicata (Bivalvia, Unionidae) from the Russian Far East

    PubMed Central

    Klishko, Olga K.; Lopes-Lima, Manuel; Froufe, Elsa; Bogan, Arthur E.; Abakumova, Vera Y.

    2016-01-01

    Abstract The number of anodontine bivalve species placed in the genus Cristaria (Bivalvia, Unionidae) from the Russian Far East is still not stable among authors. Some recognize only one valid species Cristaria plicata (Leach, 1815) while others accept two additional species, Cristaria tuberculata Schumacher, 1817 and Cristaria herculea (Middendorff, 1847). In the present study, these taxonomic doubts are addressed using analyses of mitochondrial DNA sequences and shell morphometry. No significant differences have been revealed by the COI DNA sequences or the main statistical morphometric indices from the three Cristaria forms. In the specimens analysed, changes in shell morphometry with age suggest that original descriptions of the different forms may be attributed solely to differences in age and sex. We consider that Cristaria plicata, Cristaria tuberculata and Cristaria herculea from the Russian Far East should be considered as a single species, namely Cristaria plicata (Leach, 1815), with Cristaria tuberculata and Cristaria herculea as junior synonyms. The geographic range of Cristaria plicata and its conservation status are also presented here. PMID:27110206

  14. The octamer-binding proteins form multi-protein--DNA complexes with the HSV alpha TIF regulatory protein.

    PubMed Central

    Kristie, T M; LeBowitz, J H; Sharp, P A

    1989-01-01

    The herpes simplex virus transactivator, alpha TIF, stimulates transcription of the alpha/immediate early genes via a cis-acting site containing an octamer element and a conserved flanking sequence. The alpha TIF protein, produced in a baculovirus expression system, nucleates the formation of at least two DNA--protein complexes on this regulatory element. Both of these complexes contain the ubiquitous Oct-1 protein, whose POU domain alone is sufficient to allow assembly of the alpha TIF-dependent complexes. A second member of the POU domain family, the lymphoid specific Oct-2 protein, can also be assembled into similar complexes at high concentrations of alpha TIF protein. These complexes contain at least two cellular proteins in addition to Oct-1. One of these proteins is present in both insect and HeLa cells and probably recognizes sequences in the cis element. The second cellular protein, only present in HeLa cells, probably binds by protein-protein interactions. Images PMID:2556266

  15. The octamer-binding proteins form multi-protein--DNA complexes with the HSV alpha TIF regulatory protein.

    PubMed

    Kristie, T M; LeBowitz, J H; Sharp, P A

    1989-12-20

    The herpes simplex virus transactivator, alpha TIF, stimulates transcription of the alpha/immediate early genes via a cis-acting site containing an octamer element and a conserved flanking sequence. The alpha TIF protein, produced in a baculovirus expression system, nucleates the formation of at least two DNA--protein complexes on this regulatory element. Both of these complexes contain the ubiquitous Oct-1 protein, whose POU domain alone is sufficient to allow assembly of the alpha TIF-dependent complexes. A second member of the POU domain family, the lymphoid specific Oct-2 protein, can also be assembled into similar complexes at high concentrations of alpha TIF protein. These complexes contain at least two cellular proteins in addition to Oct-1. One of these proteins is present in both insect and HeLa cells and probably recognizes sequences in the cis element. The second cellular protein, only present in HeLa cells, probably binds by protein-protein interactions.

  16. Amazonian phylogeography: mtDNA sequence variation in arboreal echimyid rodents (Caviomorpha).

    PubMed

    da Silva, M N; Patton, J L

    1993-09-01

    Patterns of evolutionary relationships among haplotype clades of sequences of the mitochondrial cytochrome b DNA gene are examined for five genera of arboreal rodents of the Caviomorph family Echimyidae from the Amazon Basin. Data are available for 798 bp of sequence from a total of 24 separate localities in Peru, Venezuela, Bolivia, and Brazil for Mesomys, Isothrix, Makalata, Dactylomys, and Echimys. Sequence divergence, corrected for multiple hits, is extensive, ranging from less than 1% for comparisons within populations of over 20% among geographic units within genera. Both the degree of differentiation and the geographic patterning of the variation suggest that more than one species composes the Amazonian distribution of the currently recognized Mesomys hispidus, Isothrix bistriata, Makalata didelphoides, and Dactylomys dactylinus. There is general concordance in the geographic range of haplotype clades for each of these taxa, and the overall level of differentiation within them is largely equivalent. These observations suggest that a common vicariant history underlies the respective diversification of each genus. However, estimated times of divergence based on the rate of third position transversion substitutions for the major clades within each genus typically range above 1 million years. Thus, allopatric isolation precipitating divergence must have been considerably earlier than the late Pleistocene forest fragmentation events commonly invoked for Amazonian biota.

  17. Type II restriction endonucleases—a historical perspective and more

    PubMed Central

    Pingoud, Alfred; Wilson, Geoffrey G.; Wende, Wolfgang

    2014-01-01

    This article continues the series of Surveys and Summaries on restriction endonucleases (REases) begun this year in Nucleic Acids Research. Here we discuss ‘Type II’ REases, the kind used for DNA analysis and cloning. We focus on their biochemistry: what they are, what they do, and how they do it. Type II REases are produced by prokaryotes to combat bacteriophages. With extreme accuracy, each recognizes a particular sequence in double-stranded DNA and cleaves at a fixed position within or nearby. The discoveries of these enzymes in the 1970s, and of the uses to which they could be put, have since impacted every corner of the life sciences. They became the enabling tools of molecular biology, genetics and biotechnology, and made analysis at the most fundamental levels routine. Hundreds of different REases have been discovered and are available commercially. Their genes have been cloned, sequenced and overexpressed. Most have been characterized to some extent, but few have been studied in depth. Here, we describe the original discoveries in this field, and the properties of the first Type II REases investigated. We discuss the mechanisms of sequence recognition and catalysis, and the varied oligomeric modes in which Type II REases act. We describe the surprising heterogeneity revealed by comparisons of their sequences and structures. PMID:24878924

  18. Barcoding the butterflies of southern South America: Species delimitation efficacy, cryptic diversity and geographic patterns of divergence.

    PubMed

    Lavinia, Pablo D; Núñez Bustos, Ezequiel O; Kopuchian, Cecilia; Lijtmaer, Darío A; García, Natalia C; Hebert, Paul D N; Tubaro, Pablo L

    2017-01-01

    Because the tropical regions of America harbor the highest concentration of butterfly species, its fauna has attracted considerable attention. Much less is known about the butterflies of southern South America, particularly Argentina, where over 1,200 species occur. To advance understanding of this fauna, we assembled a DNA barcode reference library for 417 butterfly species of Argentina, focusing on the Atlantic Forest, a biodiversity hotspot. We tested the efficacy of this library for specimen identification, used it to assess the frequency of cryptic species, and examined geographic patterns of genetic variation, making this study the first large-scale genetic assessment of the butterflies of southern South America. The average sequence divergence to the nearest neighbor (i.e. minimum interspecific distance) was 6.91%, ten times larger than the mean distance to the furthest conspecific (0.69%), with a clear barcode gap present in all but four of the species represented by two or more specimens. As a consequence, the DNA barcode library was extremely effective in the discrimination of these species, allowing a correct identification in more than 95% of the cases. Singletons (i.e. species represented by a single sequence) were also distinguishable in the gene trees since they all had unique DNA barcodes, divergent from those of the closest non-conspecific. The clustering algorithms implemented recognized from 416 to 444 barcode clusters, suggesting that the actual diversity of butterflies in Argentina is 3%-9% higher than currently recognized. Furthermore, our survey added three new records of butterflies for the country (Eurema agave, Mithras hannelore, Melanis hillapana). In summary, this study not only supported the utility of DNA barcoding for the identification of the butterfly species of Argentina, but also highlighted several cases of both deep intraspecific and shallow interspecific divergence that should be studied in more detail.

  19. Barcoding the butterflies of southern South America: Species delimitation efficacy, cryptic diversity and geographic patterns of divergence

    PubMed Central

    Núñez Bustos, Ezequiel O.; Kopuchian, Cecilia; Lijtmaer, Darío A.; García, Natalia C.; Hebert, Paul D. N.; Tubaro, Pablo L.

    2017-01-01

    Because the tropical regions of America harbor the highest concentration of butterfly species, its fauna has attracted considerable attention. Much less is known about the butterflies of southern South America, particularly Argentina, where over 1,200 species occur. To advance understanding of this fauna, we assembled a DNA barcode reference library for 417 butterfly species of Argentina, focusing on the Atlantic Forest, a biodiversity hotspot. We tested the efficacy of this library for specimen identification, used it to assess the frequency of cryptic species, and examined geographic patterns of genetic variation, making this study the first large-scale genetic assessment of the butterflies of southern South America. The average sequence divergence to the nearest neighbor (i.e. minimum interspecific distance) was 6.91%, ten times larger than the mean distance to the furthest conspecific (0.69%), with a clear barcode gap present in all but four of the species represented by two or more specimens. As a consequence, the DNA barcode library was extremely effective in the discrimination of these species, allowing a correct identification in more than 95% of the cases. Singletons (i.e. species represented by a single sequence) were also distinguishable in the gene trees since they all had unique DNA barcodes, divergent from those of the closest non-conspecific. The clustering algorithms implemented recognized from 416 to 444 barcode clusters, suggesting that the actual diversity of butterflies in Argentina is 3%–9% higher than currently recognized. Furthermore, our survey added three new records of butterflies for the country (Eurema agave, Mithras hannelore, Melanis hillapana). In summary, this study not only supported the utility of DNA barcoding for the identification of the butterfly species of Argentina, but also highlighted several cases of both deep intraspecific and shallow interspecific divergence that should be studied in more detail. PMID:29049373

  20. An Engineered Kinetic Amplification Mechanism for Single Nucleotide Variant Discrimination by DNA Hybridization Probes.

    PubMed

    Chen, Sherry Xi; Seelig, Georg

    2016-04-20

    Even a single-nucleotide difference between the sequences of two otherwise identical biological nucleic acids can have dramatic functional consequences. Here, we use model-guided reaction pathway engineering to quantitatively improve the performance of selective hybridization probes in recognizing single nucleotide variants (SNVs). Specifically, we build a detection system that combines discrimination by competition with DNA strand displacement-based catalytic amplification. We show, both mathematically and experimentally, that the single nucleotide selectivity of such a system in binding to single-stranded DNA and RNA is quadratically better than discrimination due to competitive hybridization alone. As an additional benefit the integrated circuit inherits the property of amplification and provides at least 10-fold better sensitivity than standard hybridization probes. Moreover, we demonstrate how the detection mechanism can be tuned such that the detection reaction is agnostic to the position of the SNV within the target sequence. in contrast, prior strand displacement-based probes designed for kinetic discrimination are highly sensitive to position effects. We apply our system to reliably discriminate between different members of the let-7 microRNA family that differ in only a single base position. Our results demonstrate the power of systematic reaction network design to quantitatively improve biotechnology.

  1. Lactobacillus crustorum sp. nov., isolated from two traditional Belgian wheat sourdoughs.

    PubMed

    Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Huys, Geert; Vandamme, Peter; De Vuyst, Luc; Vancanneyt, Marc

    2007-07-01

    A polyphasic taxonomic study of the lactic acid bacteria (LAB) population in three traditional Belgian sourdoughs, sampled between 2002 and 2004, revealed a group of isolates that could not be assigned to any recognized LAB species. Initially, sourdough isolates were screened by means of (GTG)(5)-PCR fingerprinting. Four isolates displaying unique (GTG)(5)-PCR patterns were further investigated by means of phenylalanyl-tRNA synthase (pheS) gene sequence analysis and represented a bifurcated branch that could not be allocated to any LAB species present in the in-house pheS database. Their phylogenetic affiliation was determined using 16S rRNA gene sequence analysis and showed that the four sourdough isolates belong to the Lactobacillus plantarum group with Lactobacillus mindensis, Lactobacillus farciminis and Lactobacillus nantensis as closest relatives. Further genotypic and phenotypic studies, including whole-cell protein analysis (SDS-PAGE), amplified fragment length polymorphism (AFLP) fingerprinting, DNA-DNA hybridization, DNA G+C content analysis, growth characteristics and biochemical features, demonstrated that the new sourdough isolates represent a novel Lactobacillus species for which the name Lactobacillus crustorum sp. nov. is proposed. The type strain of the new species is LMG 23699(T) (=CCUG 53174(T)).

  2. The three-dimensional structure of TrmB, a transcriptional regulator of dual function in the hyperthermophilic archaeon Pyrococcus furiosus in complex with sucrose

    PubMed Central

    Krug, Michael; Lee, Sung-Jae; Boos, Winfried; Diederichs, Kay; Welte, Wolfram

    2013-01-01

    TrmB is a repressor that binds maltose, maltotriose, and sucrose, as well as other α-glucosides. It recognizes two different operator sequences controlling the TM (Trehalose/Maltose) and the MD (Maltodextrin) operon encoding the respective ABC transporters and sugar-degrading enzymes. Binding of maltose to TrmB abrogates repression of the TM operon but maintains the repression of the MD operon. On the other hand, binding of sucrose abrogates repression of the MD operon but maintains repression of the TM operon. The three-dimensional structure of TrmB in complex with sucrose was solved and refined to a resolution of 3.0 Å. The structure shows the N-terminal DNA binding domain containing a winged-helix-turn-helix (wHTH) domain followed by an amphipathic helix with a coiled-coil motif. The latter promotes dimerization and places the symmetry mates of the putative recognition helix in the wHTH motif about 30 Å apart suggesting a canonical binding to two successive major grooves of duplex palindromic DNA. This suggests that the structure resembles the conformation of TrmB recognizing the pseudopalindromic TM promoter but not the conformation recognizing the nonpalindromic MD promoter. PMID:23576322

  3. Electronic hybridization detection in microarray format and DNA genotyping

    NASA Astrophysics Data System (ADS)

    Blin, Antoine; Cissé, Ismaïl; Bockelmann, Ulrich

    2014-02-01

    We describe an approach to substituting a fluorescence microarray with a surface made of an arrangement of electrolyte-gated field effect transistors. This was achieved using a dedicated blocking of non-specific interactions and comparing threshold voltage shifts of transistors exhibiting probe molecules of different base sequence. We apply the approach to detection of the 35delG mutation, which is related to non-syndromic deafness and is one of the most frequent mutations in humans. The process involves barcode sequences that are generated by Tas-PCR, a newly developed replication reaction using polymerase blocking. The barcodes are recognized by hybridization to surface attached probes and are directly detected by the semiconductor device.

  4. Electronic hybridization detection in microarray format and DNA genotyping

    PubMed Central

    Blin, Antoine; Cissé, Ismaïl; Bockelmann, Ulrich

    2014-01-01

    We describe an approach to substituting a fluorescence microarray with a surface made of an arrangement of electrolyte-gated field effect transistors. This was achieved using a dedicated blocking of non-specific interactions and comparing threshold voltage shifts of transistors exhibiting probe molecules of different base sequence. We apply the approach to detection of the 35delG mutation, which is related to non-syndromic deafness and is one of the most frequent mutations in humans. The process involves barcode sequences that are generated by Tas-PCR, a newly developed replication reaction using polymerase blocking. The barcodes are recognized by hybridization to surface attached probes and are directly detected by the semiconductor device. PMID:24569823

  5. Burkholderia symbiotica sp. nov., isolated from root nodules of Mimosa spp. native to north-east Brazil.

    PubMed

    Sheu, Shih-Yi; Chou, Jui-Hsing; Bontemps, Cyril; Elliott, Geoffrey N; Gross, Eduardo; James, Euan K; Sprent, Janet I; Young, J Peter W; Chen, Wen-Ming

    2012-09-01

    Four strains, designated JPY-345(T), JPY-347, JPY-366 and JPY-581, were isolated from nitrogen-fixing nodules on the roots of two species of Mimosa, Mimosa cordistipula and Mimosa misera, that are native to North East Brazil, and their taxonomic positions were investigated by using a polyphasic approach. All four strains grew at 15-43 °C (optimum 35 °C), at pH 4-7 (optimum pH 5) and with 0-2 % (w/v) NaCl (optimum 0 % NaCl). On the basis of 16S rRNA gene sequence analysis, strain JPY-345(T) showed 97.3 % sequence similarity to the closest related species Burkholderia soli GP25-8(T), 97.3 % sequence similarity to Burkholderia caryophylli ATCC25418(T) and 97.1 % sequence similarity to Burkholderia kururiensis KP23(T). The predominant fatty acids of the strains were C(18 : 1)ω7c (36.1 %), C(16 : 0) (19.8 %) and summed feature 3, comprising C(16 : 1)ω7c and/or C(16 : 1)ω6c (11.5 %). The major isoprenoid quinone was Q-8 and the DNA G+C content of the strains was 64.2-65.7 mol%. The polar lipid profile consisted of a mixture of phosphatidylethanolamine, phosphatidylglycerol, diphosphatidylglycerol and several uncharacterized aminophospholipids and phospholipids. DNA-DNA hybridizations between the novel strain and recognized species of the genus Burkholderia yielded relatedness values of <51.8 %. On the basis of 16S rRNA and recA gene sequence similarities and chemotaxonomic and phenotypic data, the four strains represent a novel species in the genus Burkholderia, for which the name Burkholderia symbiotica sp. nov. is proposed. The type strain is JPY-345(T) (= LMG 26032(T) = BCRC 80258(T) = KCTC 23309(T)).

  6. Identification of antigenic regions on VP2 of African horsesickness virus serotype 3 by using phage-displayed epitope libraries.

    PubMed

    Bentley, L; Fehrsen, J; Jordaan, F; Huismans, H; du Plessis, D H

    2000-04-01

    VP2 is an outer capsid protein of African horsesickness virus (AHSV) and is recognized by serotype-discriminatory neutralizing antibodies. With the objective of locating its antigenic regions, a filamentous phage library was constructed that displayed peptides derived from the fragmentation of a cDNA copy of the gene encoding VP2. Peptides ranging in size from approximately 30 to 100 amino acids were fused with pIII, the attachment protein of the display vector, fUSE2. To ensure maximum diversity, the final library consisted of three sub-libraries. The first utilized enzymatically fragmented DNA encoding only the VP2 gene, the second included plasmid sequences, while the third included a PCR step designed to allow different peptide-encoding sequences to recombine before ligation into the vector. The resulting composite library was subjected to immunoaffinity selection with AHSV-specific polyclonal chicken IgY, polyclonal horse immunoglobulins and a monoclonal antibody (MAb) known to neutralize AHSV. Antigenic peptides were located by sequencing the DNA of phages bound by the antibodies. Most antigenic determinants capable of being mapped by this method were located in the N-terminal half of VP2. Important binding areas were mapped with high resolution by identifying the minimum overlapping areas of the selected peptides. The MAb was also used to screen a random 17-mer epitope library. Sequences that may be part of a discontinuous neutralization epitope were identified. The amino acid sequences of the antigenic regions on VP2 of serotype 3 were compared with corresponding regions on three other serotypes, revealing regions with the potential to discriminate AHSV serotypes serologically.

  7. A DNA Barcoding Method to Discriminate between the Model Plant Brachypodium distachyon and Its Close Relatives B. stacei and B. hybridum (Poaceae)

    PubMed Central

    López-Alvarez, Diana; López-Herranz, Maria Luisa; Betekhtin, Alexander; Catalán, Pilar

    2012-01-01

    Background Brachypodium distachyon s. l. has been widely investigated across the world as a model plant for temperate cereals and biofuel grasses. However, this annual plant shows three cytotypes that have been recently recognized as three independent species, the diploids B. distachyon (2n = 10) and B. stacei (2n = 20) and their derived allotetraploid B. hybridum (2n = 30). Methodology/Principal Findings We propose a DNA barcoding approach that consists of a rapid, accurate and automatable species identification method using the standard DNA sequences of complementary plastid (trnLF) and nuclear (ITS, GI) loci. The highly homogenous but largely divergent B. distachyon and B. stacei diploids could be easily distinguished (100% identification success) using direct trnLF (2.4%), ITS (5.5%) or GI (3.8%) sequence divergence. By contrast, B. hybridum could only be unambiguously identified through the use of combined trnLF+ITS sequences (90% of identification success) or by cloned GI sequences (96.7%) that showed 5.4% (ITS) and 4% (GI) rate divergence between the two parental sequences found in the allopolyploid. Conclusion/Significance Our data provide an unbiased and effective barcode to differentiate these three closely-related species from one another. This procedure overcomes the taxonomic uncertainty generated from methods based on morphology or flow cytometry identifications that have resulted in some misclassifications of the model plant and its allies. Our study also demonstrates that the allotetraploid B. hybridum has resulted from bi-directional crosses of B. distachyon and B. stacei plants acting either as maternal or paternal parents. PMID:23240000

  8. Genetic signs of multiple colonization events in Baltic ciscoes with radiation into sympatric spring- and autumn-spawners confined to early postglacial arrival

    PubMed Central

    Delling, Bo; Palm, Stefan; Palkopoulou, Eleftheria; Prestegaard, Tore

    2014-01-01

    Presence of sympatric populations may reflect local diversification or secondary contact of already distinct forms. The Baltic cisco (Coregonus albula) normally spawns in late autumn, but in a few lakes in Northern Europe sympatric autumn and spring- or winter-spawners have been described. So far, the evolutionary relationships and taxonomic status of these main life history forms have remained largely unclear. With microsatellites and mtDNA sequences, we analyzed extant and extinct spring- and autumn-spawners from a total of 23 Swedish localities, including sympatric populations. Published sequences from Baltic ciscoes in Germany and Finland, and Coregonus sardinella from North America were also included together with novel mtDNA sequences from Siberian C. sardinella. A clear genetic structure within Sweden was found that included two population assemblages markedly differentiated at microsatellites and apparently fixed for mtDNA haplotypes from two distinct clades. All sympatric Swedish populations belonged to the same assemblage, suggesting parallel evolution of spring-spawning rather than secondary contact. The pattern observed further suggests that postglacial immigration to Northern Europe occurred from at least two different refugia. Previous results showing that mtDNA in Baltic cisco is paraphyletic with respect to North American C. sardinella were confirmed. However, the inclusion of Siberian C. sardinella revealed a more complicated pattern, as these novel haplotypes were found within one of the two main C. albula clades and were clearly distinct from those in North American C. sardinella. The evolutionary history of Northern Hemisphere ciscoes thus seems to be more complex than previously recognized. PMID:25540695

  9. Genetic signs of multiple colonization events in Baltic ciscoes with radiation into sympatric spring- and autumn-spawners confined to early postglacial arrival.

    PubMed

    Delling, Bo; Palm, Stefan; Palkopoulou, Eleftheria; Prestegaard, Tore

    2014-11-01

    Presence of sympatric populations may reflect local diversification or secondary contact of already distinct forms. The Baltic cisco (Coregonus albula) normally spawns in late autumn, but in a few lakes in Northern Europe sympatric autumn and spring- or winter-spawners have been described. So far, the evolutionary relationships and taxonomic status of these main life history forms have remained largely unclear. With microsatellites and mtDNA sequences, we analyzed extant and extinct spring- and autumn-spawners from a total of 23 Swedish localities, including sympatric populations. Published sequences from Baltic ciscoes in Germany and Finland, and Coregonus sardinella from North America were also included together with novel mtDNA sequences from Siberian C. sardinella. A clear genetic structure within Sweden was found that included two population assemblages markedly differentiated at microsatellites and apparently fixed for mtDNA haplotypes from two distinct clades. All sympatric Swedish populations belonged to the same assemblage, suggesting parallel evolution of spring-spawning rather than secondary contact. The pattern observed further suggests that postglacial immigration to Northern Europe occurred from at least two different refugia. Previous results showing that mtDNA in Baltic cisco is paraphyletic with respect to North American C. sardinella were confirmed. However, the inclusion of Siberian C. sardinella revealed a more complicated pattern, as these novel haplotypes were found within one of the two main C. albula clades and were clearly distinct from those in North American C. sardinella. The evolutionary history of Northern Hemisphere ciscoes thus seems to be more complex than previously recognized.

  10. Xeroderma pigmentosum complementation group C protein (XPC) serves as a general sensor of damaged DNA

    PubMed Central

    Shell, Steven M.; Hawkins, Edward K.; Tsai, Miaw-Sheue; Hlaing, Aye Su; Rizzo, Carmelo J.; Chazin, Walter J.

    2013-01-01

    The xeroderma pigmentosum complementation group C protein (XPC) serves as the primary initiating factor in the global genome nucleotide excision repair pathway (GG-NER). Recent reports suggest XPC also stimulates repair of oxidative lesions by base excision repair. However, whether XPC distinguishes among various types of DNA lesions remains unclear. Although the DNA binding properties of XPC have been studied by several groups, there is a lack of consensus over whether XPC discriminates between DNA damaged by lesions associated with NER activity versus those that are not. In this study we report a high-throughput fluorescence anisotropy assay used to measure the DNA binding affinity of XPC for a panel of DNA substrates containing a range of chemical lesions in a common sequence. Our results demonstrate that while XPC displays a preference for binding damaged DNA, the identity of the lesion has little effect on the binding affinity of XPC. Moreover, XPC was equally capable of binding to DNA substrates containing lesions not repaired by GG-NER. Our results support an indirect read-out model for sensing the presence of lesions by human XPC and suggest XPC may act as a general sensor of damaged DNA capable of recognizing DNA containing lesions not repaired by NER. PMID:24051049

  11. TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocot orders

    PubMed Central

    Tarr, D Ellen K; Alexander, Helen M

    2009-01-01

    Background Plant resistance (R) gene products recognize pathogen effector molecules. Many R genes code for proteins containing nucleotide binding site (NBS) and C-terminal leucine-rich repeat (LRR) domains. NBS-LRR proteins can be divided into two groups, TIR-NBS-LRR and non-TIR-NBS-LRR, based on the structure of the N-terminal domain. Although both classes are clearly present in gymnosperms and eudicots, only non-TIR sequences have been found consistently in monocots. Since most studies in monocots have been limited to agriculturally important grasses, it is difficult to draw conclusions. The purpose of our study was to look for evidence of these sequences in additional monocot orders. Findings Using degenerate PCR, we amplified NBS sequences from four monocot species (C. blanda, D. marginata, S. trifasciata, and Spathiphyllum sp.), a gymnosperm (C. revoluta) and a eudicot (C. canephora). We successfully amplified TIR-NBS-LRR sequences from dicot and gymnosperm DNA, but not from monocot DNA. Using databases, we obtained NBS sequences from additional monocots, magnoliids and basal angiosperms. TIR-type sequences were not present in monocot or magnoliid sequences, but were present in the basal angiosperms. Phylogenetic analysis supported a single TIR clade and multiple non-TIR clades. Conclusion We were unable to find monocot TIR-NBS-LRR sequences by PCR amplification or database searches. In contrast to previous studies, our results represent five monocot orders (Poales, Zingiberales, Arecales, Asparagales, and Alismatales). Our results establish the presence of TIR-NBS-LRR sequences in basal angiosperms and suggest that although these sequences were present in early land plants, they have been reduced significantly in monocots and magnoliids. PMID:19785756

  12. Two high-mobility group box domains act together to underwind and kink DNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sánchez-Giraldo, R.; Acosta-Reyes, F. J.; Malarkey, C. S.

    The crystal structure of HMGB1 box A bound to an unmodified AT-rich DNA fragment is reported at a resolution of 2 Å. A new mode of DNA recognition for HMG box proteins is found in which two box A domains bind in an unusual configuration generating a highly kinked DNA structure. High-mobility group protein 1 (HMGB1) is an essential and ubiquitous DNA architectural factor that influences a myriad of cellular processes. HMGB1 contains two DNA-binding domains, box A and box B, which have little sequence specificity but have remarkable abilities to underwind and bend DNA. Although HMGB1 box A ismore » thought to be responsible for the majority of HMGB1–DNA interactions with pre-bent or kinked DNA, little is known about how it recognizes unmodified DNA. Here, the crystal structure of HMGB1 box A bound to an AT-rich DNA fragment is reported at a resolution of 2 Å. Two box A domains of HMGB1 collaborate in an unusual configuration in which the Phe37 residues of both domains stack together and intercalate the same CG base pair, generating highly kinked DNA. This represents a novel mode of DNA recognition for HMGB proteins and reveals a mechanism by which structure-specific HMG boxes kink linear DNA.« less

  13. Molecular organization and chromosomal localization of 5S rDNA in Amazonian Engystomops (Anura, Leiuperidae)

    PubMed Central

    2012-01-01

    Background For anurans, knowledge of 5S rDNA is scarce. For Engystomops species, chromosomal homeologies are difficult to recognize due to the high level of inter- and intraspecific cytogenetic variation. In an attempt to better compare the karyotypes of the Amazonian species Engystomops freibergi and Engystomops petersi, and to extend the knowledge of 5S rDNA organization in anurans, the 5S rDNA sequences of Amazonian Engystomops species were isolated, characterized, and mapped. Results Two types of 5S rDNA, which were readily differentiated by their NTS (non-transcribed spacer) sizes and compositions, were isolated from specimens of E. freibergi from Brazil and E. petersi from two Ecuadorian localities (Puyo and Yasuní). In the E. freibergi karyotypes, the entire type I 5S rDNA repeating unit hybridized to the pericentromeric region of 3p, whereas the entire type II 5S rDNA repeating unit mapped to the distal region of 6q, suggesting a differential localization of these sequences. The type I NTS probe clearly detected the 3p pericentromeric region in the karyotypes of E. freibergi and E. petersi from Puyo and the 5p pericentromeric region in the karyotype of E. petersi from Yasuní, but no distal or interstitial signals were observed. Interestingly, this probe also detected many centromeric regions in the three karyotypes, suggesting the presence of a satellite DNA family derived from 5S rDNA. The type II NTS probe detected only distal 6q regions in the three karyotypes, corroborating the differential distribution of the two types of 5S rDNA. Conclusions Because the 5S rDNA types found in Engystomops are related to those of Physalaemus with respect to their nucleotide sequences and chromosomal locations, their origin likely preceded the evolutionary divergence of these genera. In addition, our data indicated homeology between Chromosome 5 in E. petersi from Yasuní and Chromosomes 3 in E. freibergi and E. petersi from Puyo. In addition, the chromosomal location of the type II 5S rDNA corroborates the hypothesis that the Chromosomes 6 of E. petersi and E. freibergi are homeologous despite the great differences observed between the karyotypes of the Yasuní specimens and the others. PMID:22433220

  14. Molecular organization and chromosomal localization of 5S rDNA in Amazonian Engystomops (Anura, Leiuperidae).

    PubMed

    Rodrigues, Débora Silva; Rivera, Miryan; Lourenço, Luciana Bolsoni

    2012-03-20

    For anurans, knowledge of 5S rDNA is scarce. For Engystomops species, chromosomal homeologies are difficult to recognize due to the high level of inter- and intraspecific cytogenetic variation. In an attempt to better compare the karyotypes of the Amazonian species Engystomops freibergi and Engystomops petersi, and to extend the knowledge of 5S rDNA organization in anurans, the 5S rDNA sequences of Amazonian Engystomops species were isolated, characterized, and mapped. Two types of 5S rDNA, which were readily differentiated by their NTS (non-transcribed spacer) sizes and compositions, were isolated from specimens of E. freibergi from Brazil and E. petersi from two Ecuadorian localities (Puyo and Yasuní). In the E. freibergi karyotypes, the entire type I 5S rDNA repeating unit hybridized to the pericentromeric region of 3p, whereas the entire type II 5S rDNA repeating unit mapped to the distal region of 6q, suggesting a differential localization of these sequences. The type I NTS probe clearly detected the 3p pericentromeric region in the karyotypes of E. freibergi and E. petersi from Puyo and the 5p pericentromeric region in the karyotype of E. petersi from Yasuní, but no distal or interstitial signals were observed. Interestingly, this probe also detected many centromeric regions in the three karyotypes, suggesting the presence of a satellite DNA family derived from 5S rDNA. The type II NTS probe detected only distal 6q regions in the three karyotypes, corroborating the differential distribution of the two types of 5S rDNA. Because the 5S rDNA types found in Engystomops are related to those of Physalaemus with respect to their nucleotide sequences and chromosomal locations, their origin likely preceded the evolutionary divergence of these genera. In addition, our data indicated homeology between Chromosome 5 in E. petersi from Yasuní and Chromosomes 3 in E. freibergi and E. petersi from Puyo. In addition, the chromosomal location of the type II 5S rDNA corroborates the hypothesis that the Chromosomes 6 of E. petersi and E. freibergi are homeologous despite the great differences observed between the karyotypes of the Yasuní specimens and the others.

  15. Finding the target sites of RNA-binding proteins

    PubMed Central

    Li, Xiao; Kazan, Hilal; Lipshitz, Howard D; Morris, Quaid D

    2014-01-01

    RNA–protein interactions differ from DNA–protein interactions because of the central role of RNA secondary structure. Some RNA-binding domains (RBDs) recognize their target sites mainly by their shape and geometry and others are sequence-specific but are sensitive to secondary structure context. A number of small- and large-scale experimental approaches have been developed to measure RNAs associated in vitro and in vivo with RNA-binding proteins (RBPs). Generalizing outside of the experimental conditions tested by these assays requires computational motif finding. Often RBP motif finding is done by adapting DNA motif finding methods; but modeling secondary structure context leads to better recovery of RBP-binding preferences. Genome-wide assessment of mRNA secondary structure has recently become possible, but these data must be combined with computational predictions of secondary structure before they add value in predicting in vivo binding. There are two main approaches to incorporating structural information into motif models: supplementing primary sequence motif models with preferred secondary structure contexts (e.g., MEMERIS and RNAcontext) and directly modeling secondary structure recognized by the RBP using stochastic context-free grammars (e.g., CMfinder and RNApromo). The former better reconstruct known binding preferences for sequence-specific RBPs but are not suitable for modeling RBPs that recognize shape and geometry of RNAs. Future work in RBP motif finding should incorporate interactions between multiple RBDs and multiple RBPs in binding to RNA. WIREs RNA 2014, 5:111–130. doi: 10.1002/wrna.1201 PMID:24217996

  16. Human XPA and RPA DNA repair proteins participate in specific recognition of triplex-induced helical distortions

    NASA Astrophysics Data System (ADS)

    Vasquez, Karen M.; Christensen, Jesper; Li, Lei; Finch, Rick A.; Glazer, Peter M.

    2002-04-01

    Nucleotide excision repair (NER) plays a central role in maintaining genomic integrity by detecting and repairing a wide variety of DNA lesions. Xeroderma pigmentosum complementation group A protein (XPA) is an essential component of the repair machinery, and it is thought to be involved in the initial step as a DNA damage recognition and/or confirmation factor. Human replication protein A (RPA) and XPA have been reported to interact to form a DNA damage recognition complex with greater specificity for damaged DNA than XPA alone. The mechanism by which these two proteins recognize such a wide array of structures resulting from different types of DNA damage is not known. One possibility is that they recognize a common feature of the lesions, such as distortions of the helical backbone. We have tested this idea by determining whether human XPA and RPA proteins can recognize the helical distortions induced by a DNA triple helix, a noncanonical DNA structure that has been shown to induce DNA repair, mutagenesis, and recombination. We measured binding of XPA and RPA, together or separately, to substrates containing triplexes with three, two, or no strands covalently linked by psoralen conjugation and photoaddition. We found that RPA alone recognizes all covalent triplex structures, but also forms multivalent nonspecific DNA aggregates at higher concentrations. XPA by itself does not recognize the substrates, but it binds them in the presence of RPA. Addition of XPA decreases the nonspecific DNA aggregate formation. These results support the hypothesis that the NER machinery is targeted to helical distortions and demonstrate that RPA can recognize damaged DNA even without XPA.

  17. Phyloscan: locating transcription-regulating binding sites in mixed aligned and unaligned sequence data.

    PubMed

    Palumbo, Michael J; Newberg, Lee A

    2010-07-01

    The transcription of a gene from its DNA template into an mRNA molecule is the first, and most heavily regulated, step in gene expression. Especially in bacteria, regulation is typically achieved via the binding of a transcription factor (protein) or small RNA molecule to the chromosomal region upstream of a regulated gene. The protein or RNA molecule recognizes a short, approximately conserved sequence within a gene's promoter region and, by binding to it, either enhances or represses expression of the nearby gene. Since the sought-for motif (pattern) is short and accommodating to variation, computational approaches that scan for binding sites have trouble distinguishing functional sites from look-alikes. Many computational approaches are unable to find the majority of experimentally verified binding sites without also finding many false positives. Phyloscan overcomes this difficulty by exploiting two key features of functional binding sites: (i) these sites are typically more conserved evolutionarily than are non-functional DNA sequences; and (ii) these sites often occur two or more times in the promoter region of a regulated gene. The website is free and open to all users, and there is no login requirement. Address: (http://bayesweb.wadsworth.org/phyloscan/).

  18. Molecular Barcoding of Aquatic Oligochaetes: Implications for Biomonitoring

    PubMed Central

    Vivien, Régis; Wyler, Sofia; Lafont, Michel; Pawlowski, Jan

    2015-01-01

    Aquatic oligochaetes are well recognized bioindicators of quality of sediments and water in watercourses and lakes. However, the difficult taxonomic determination based on morphological features compromises their more common use in eco-diagnostic analyses. To overcome this limitation, we investigated molecular barcodes as identification tool for broad range of taxa of aquatic oligochaetes. We report 185 COI and 52 ITS2 rDNA sequences for specimens collected in Switzerland and belonging to the families Naididae, Lumbriculidae, Enchytraeidae and Lumbricidae. Phylogenetic analyses allowed distinguishing 41 lineages separated by more than 10 % divergence in COI sequences. The lineage distinction was confirmed by Automatic Barcode Gap Discovery (ABGD) method and by ITS2 data. Our results showed that morphological identification underestimates the oligochaete diversity. Only 26 of the lineages could be assigned to morphospecies, of which seven were sequenced for the first time. Several cryptic species were detected within common morphospecies. Many juvenile specimens that could not be assigned morphologically have found their home after genetic analysis. Our study showed that COI barcodes performed very well as species identifiers in aquatic oligochaetes. Their easy amplification and good taxonomic resolution might help promoting aquatic oligochaetes as bioindicators for next generation environmental DNA biomonitoring of aquatic ecosystems. PMID:25856230

  19. Spiders (Araneae) of Churchill, Manitoba: DNA barcodes and morphology reveal high species diversity and new Canadian records

    PubMed Central

    2013-01-01

    Background Arctic ecosystems, especially those near transition zones, are expected to be strongly impacted by climate change. Because it is positioned on the ecotone between tundra and boreal forest, the Churchill area is a strategic locality for the analysis of shifts in faunal composition. This fact has motivated the effort to develop a comprehensive biodiversity inventory for the Churchill region by coupling DNA barcoding with morphological studies. The present study represents one element of this effort; it focuses on analysis of the spider fauna at Churchill. Results 198 species were detected among 2704 spiders analyzed, tripling the count for the Churchill region. Estimates of overall diversity suggest that another 10–20 species await detection. Most species displayed little intraspecific sequence variation (maximum <1%) in the barcode region of the cytochrome c oxidase subunit I (COI) gene, but four species showed considerably higher values (maximum = 4.1-6.2%), suggesting cryptic species. All recognized species possessed a distinct haplotype array at COI with nearest-neighbour interspecific distances averaging 8.57%. Three species new to Canada were detected: Robertus lyrifer (Theridiidae), Baryphyma trifrons (Linyphiidae), and Satilatlas monticola (Linyphiidae). The first two species may represent human-mediated introductions linked to the port in Churchill, but the other species represents a range extension from the USA. The first description of the female of S. monticola was also presented. As well, one probable new species of Alopecosa (Lycosidae) was recognized. Conclusions This study provides the first comprehensive DNA barcode reference library for the spider fauna of any region. Few cryptic species of spiders were detected, a result contrasting with the prevalence of undescribed species in several other terrestrial arthropod groups at Churchill. Because most (97.5%) sequence clusters at COI corresponded with a named taxon, DNA barcoding reliably identifies spiders in the Churchill fauna. The capacity of DNA barcoding to enable the identification of otherwise taxonomically ambiguous specimens (juveniles, females) also represents a major advance for future monitoring efforts on this group. PMID:24279427

  20. Asymmetric Regulation of Bipolar Single-stranded DNA Translocation by the Two Motors within Escherichia coli RecBCD Helicase*

    PubMed Central

    Xie, Fuqian; Wu, Colin G.; Weiland, Elizabeth; Lohman, Timothy M.

    2013-01-01

    Repair of double-stranded DNA breaks in Escherichia coli is initiated by the RecBCD helicase that possesses two superfamily-1 motors, RecB (3′ to 5′ translocase) and RecD (5′ to 3′ translocase), that operate on the complementary DNA strands to unwind duplex DNA. However, it is not known whether the RecB and RecD motors act independently or are functionally coupled. Here we show by directly monitoring ATP-driven single-stranded DNA translocation of RecBCD that the 5′ to 3′ rate is always faster than the 3′ to 5′ rate on DNA without a crossover hotspot instigator site and that the translocation rates are coupled asymmetrically. That is, RecB regulates both 3′ to 5′ and 5′ to 3′ translocation, whereas RecD only regulates 5′ to 3′ translocation. We show that the recently identified RecBC secondary translocase activity functions within RecBCD and that this contributes to the coupling. This coupling has implications for how RecBCD activity is regulated after it recognizes a crossover hotspot instigator sequence during DNA unwinding. PMID:23192341

  1. The Fanconi anemia associated protein FAAP24 uses two substrate specific binding surfaces for DNA recognition

    PubMed Central

    Wienk, Hans; Slootweg, Jack C.; Speerstra, Sietske; Kaptein, Robert; Boelens, Rolf; Folkers, Gert E.

    2013-01-01

    To maintain the integrity of the genome, multiple DNA repair systems exist to repair damaged DNA. Recognition of altered DNA, including bulky adducts, pyrimidine dimers and interstrand crosslinks (ICL), partially depends on proteins containing helix-hairpin-helix (HhH) domains. To understand how ICL is specifically recognized by the Fanconi anemia proteins FANCM and FAAP24, we determined the structure of the HhH domain of FAAP24. Although it resembles other HhH domains, the FAAP24 domain contains a canonical hairpin motif followed by distorted motif. The HhH domain can bind various DNA substrates; using nuclear magnetic resonance titration experiments, we demonstrate that the canonical HhH motif is required for double-stranded DNA (dsDNA) binding, whereas the unstructured N-terminus can interact with single-stranded DNA. Both DNA binding surfaces are used for binding to ICL-like single/double-strand junction-containing DNA substrates. A structural model for FAAP24 bound to dsDNA has been made based on homology with the translesion polymerase iota. Site-directed mutagenesis, sequence conservation and charge distribution support the dsDNA-binding model. Analogous to other HhH domain-containing proteins, we suggest that multiple FAAP24 regions together contribute to binding to single/double-strand junction, which could contribute to specificity in ICL DNA recognition. PMID:23661679

  2. piRNA pathway targets active LINE1 elements to establish the repressive H3K9me3 mark in germ cells

    PubMed Central

    Pezic, Dubravka; Manakov, Sergei A.; Sachidanandam, Ravi; Aravin, Alexei A.

    2014-01-01

    Transposable elements (TEs) occupy a large fraction of metazoan genomes and pose a constant threat to genomic integrity. This threat is particularly critical in germ cells, as changes in the genome that are induced by TEs will be transmitted to the next generation. Small noncoding piwi-interacting RNAs (piRNAs) recognize and silence a diverse set of TEs in germ cells. In mice, piRNA-guided transposon repression correlates with establishment of CpG DNA methylation on their sequences, yet the mechanism and the spectrum of genomic targets of piRNA silencing are unknown. Here we show that in addition to DNA methylation, the piRNA pathway is required to maintain a high level of the repressive H3K9me3 histone modification on long interspersed nuclear elements (LINEs) in germ cells. piRNA-dependent chromatin repression targets exclusively full-length elements of actively transposing LINE families, demonstrating the remarkable ability of the piRNA pathway to recognize active elements among the large number of genomic transposon fragments. PMID:24939875

  3. Nuclear targeting of viral and non-viral DNA.

    PubMed

    Chowdhury, E H

    2009-07-01

    The nuclear envelope presents a major barrier to transgene delivery and expression using a non-viral vector. Virus is capable of overcoming the barrier to deliver their genetic materials efficiently into the nucleus by virtue of the specialized protein components with the unique amino acid sequences recognizing cellular nuclear transport machinery. However, considering the safety issues in the clinical gene therapy for treating critical human diseases, non-viral systems are highly promising compared with their viral counterparts. This review summarizes the progress on exploring the nuclear traffic mechanisms for the prominent viral vectors and the technological innovations for the nuclear delivery of non-viral DNA by mimicking those natural processes evolved for the viruses as well as for many cellular proteins.

  4. Programming of Essential Hypertension: What Pediatric Cardiologists Need to Know.

    PubMed

    Morgado, Joana; Sanches, Bruno; Anjos, Rui; Coelho, Constança

    2015-10-01

    Hypertension is recognized as one of the major contributing factors to cardiovascular disease, but its etiology remains incompletely understood. Known genetic and environmental influences can only explain a small part of the variability in cardiovascular disease risk. The missing heritability is currently one of the most important challenges in blood pressure and hypertension genetics. Recently, some promising approaches have emerged that move beyond the DNA sequence and focus on identification of blood pressure genes regulated by epigenetic mechanisms such as DNA methylation, histone modification and microRNAs. This review summarizes information on gene-environmental interactions that lead toward the developmental programming of hypertension with specific reference to epigenetics and provides pediatricians and pediatric cardiologists with a more complete understanding of its pathogenesis.

  5. Lactobacillus nantensis sp. nov., isolated from French wheat sourdough.

    PubMed

    Valcheva, Rosica; Ferchichi, Mounir F; Korakli, Maher; Ivanova, Iskra; Gänzle, Michael G; Vogel, Rudi F; Prévost, Hervé; Onno, Bernard; Dousset, Xavier

    2006-03-01

    A polyphasic taxonomic study of the bacterial flora isolated from traditional French wheat sourdough, using phenotypic characterization and phylogenetic as well as genetic methods, revealed a consistent group of isolates that could not be assigned to any recognized species. These results were confirmed by randomly amplified polymorphic DNA and amplified fragment length polymorphism fingerprinting analyses. Cells were Gram-positive, homofermentative rods. Comparative 16S rRNA gene sequence analysis of the representative strain LP33T indicated that these strains belong to the genus Lactobacillus and that they formed a branch distinct from their closest relatives Lactobacillus farciminis, Lactobacillus alimentarius, Lactobacillus paralimentarius and Lactobacillus mindensis. DNA-DNA reassociation experiments with the three phylogenetically closest Lactobacillus species confirmed that LP33T (= DSM 16982T = CIP 108546T = TMW 1.1265T) represents the type strain of a novel species, for which the name Lactobacillus nantensis sp. nov. is proposed.

  6. Improved detection of DNA-binding proteins via compression technology on PSSM information.

    PubMed

    Wang, Yubo; Ding, Yijie; Guo, Fei; Wei, Leyi; Tang, Jijun

    2017-01-01

    Since the importance of DNA-binding proteins in multiple biomolecular functions has been recognized, an increasing number of researchers are attempting to identify DNA-binding proteins. In recent years, the machine learning methods have become more and more compelling in the case of protein sequence data soaring, because of their favorable speed and accuracy. In this paper, we extract three features from the protein sequence, namely NMBAC (Normalized Moreau-Broto Autocorrelation), PSSM-DWT (Position-specific scoring matrix-Discrete Wavelet Transform), and PSSM-DCT (Position-specific scoring matrix-Discrete Cosine Transform). We also employ feature selection algorithm on these feature vectors. Then, these features are fed into the training SVM (support vector machine) model as classifier to predict DNA-binding proteins. Our method applys three datasets, namely PDB1075, PDB594 and PDB186, to evaluate the performance of our approach. The PDB1075 and PDB594 datasets are employed for Jackknife test and the PDB186 dataset is used for the independent test. Our method achieves the best accuracy in the Jacknife test, from 79.20% to 86.23% and 80.5% to 86.20% on PDB1075 and PDB594 datasets, respectively. In the independent test, the accuracy of our method comes to 76.3%. The performance of independent test also shows that our method has a certain ability to be effectively used for DNA-binding protein prediction. The data and source code are at https://doi.org/10.6084/m9.figshare.5104084.

  7. Toward a DNA Taxonomy of Alpine Rhithrogena (Ephemeroptera: Heptageniidae) Using a Mixed Yule-Coalescent Analysis of Mitochondrial and Nuclear DNA

    PubMed Central

    Vuataz, Laurent; Sartori, Michel; Wagner, André; Monaghan, Michael T.

    2011-01-01

    Aquatic larvae of many Rhithrogena mayflies (Ephemeroptera) inhabit sensitive Alpine environments. A number of species are on the IUCN Red List and many recognized species have restricted distributions and are of conservation interest. Despite their ecological and conservation importance, ambiguous morphological differences among closely related species suggest that the current taxonomy may not accurately reflect the evolutionary diversity of the group. Here we examined the species status of nearly 50% of European Rhithrogena diversity using a widespread sampling scheme of Alpine species that included 22 type localities, general mixed Yule-coalescent (GMYC) model analysis of one standard mtDNA marker and one newly developed nDNA marker, and morphological identification where possible. Using sequences from 533 individuals from 144 sampling localities, we observed significant clustering of the mitochondrial (cox1) marker into 31 GMYC species. Twenty-one of these could be identified based on the presence of topotypes (expertly identified specimens from the species' type locality) or unambiguous morphology. These results strongly suggest the presence of both cryptic diversity and taxonomic oversplitting in Rhithrogena. Significant clustering was not detected with protein-coding nuclear PEPCK, although nine GMYC species were congruent with well supported terminal clusters of nDNA. Lack of greater congruence in the two data sets may be the result of incomplete sorting of ancestral polymorphism. Bayesian phylogenetic analyses of both gene regions recovered four of the six recognized Rhithrogena species groups in our samples as monophyletic. Future development of more nuclear markers would facilitate multi-locus analysis of unresolved, closely related species pairs. The DNA taxonomy developed here lays the groundwork for a future revision of the important but cryptic Rhithrogena genus in Europe. PMID:21611178

  8. Schizosaccharomyces pombe MutSα and MutLα Maintain Stability of Tetra-Nucleotide Repeats and Msh3 of Hepta-Nucleotide Repeats

    PubMed Central

    Villahermosa, Desirée; Christensen, Olaf; Knapp, Karen; Fleck, Oliver

    2017-01-01

    Defective mismatch repair (MMR) in humans is associated with colon cancer and instability of microsatellites, that is, DNA sequences with one or several nucleotides repeated. Key factors of eukaryotic MMR are the heterodimers MutSα (Msh2-Msh6), which recognizes base-base mismatches and unpaired nucleotides in DNA, and MutLα (Mlh1-Pms1), which facilitates downstream steps. In addition, MutSβ (Msh2-Msh3) recognizes DNA loops of various sizes, although our previous data and the data presented here suggest that Msh3 of Schizosaccharomyces pombe does not play a role in MMR. To test microsatellite stability in S. pombe and hence DNA loop repair, we have inserted tetra-, penta-, and hepta-nucleotide repeats in the ade6 gene and determined their Ade+ reversion rates and spectra in wild type and various mutants. Our data indicate that loops with four unpaired nucleotides in the nascent and the template strand are the upper limit of MutSα- and MutLα-mediated MMR in S. pombe. Stability of hepta-nucleotide repeats requires Msh3 and Exo1 in MMR-independent processes as well as the DNA repair proteins Rad50, Rad51, and Rad2FEN1. Most strikingly, mutation rates in the double mutants msh3 exo1 and msh3 rad51 were decreased when compared to respective single mutants, indicating that Msh3 prevents error prone processes carried out by Exo1 and Rad51. We conclude that Msh3 has no obvious function in MMR in S. pombe, but contributes to DNA repeat stability in MMR-independent processes. PMID:28341698

  9. Schizosaccharomyces pombe MutSα and MutLα Maintain Stability of Tetra-Nucleotide Repeats and Msh3 of Hepta-Nucleotide Repeats.

    PubMed

    Villahermosa, Desirée; Christensen, Olaf; Knapp, Karen; Fleck, Oliver

    2017-05-05

    Defective mismatch repair (MMR) in humans is associated with colon cancer and instability of microsatellites, that is, DNA sequences with one or several nucleotides repeated. Key factors of eukaryotic MMR are the heterodimers MutSα (Msh2-Msh6), which recognizes base-base mismatches and unpaired nucleotides in DNA, and MutLα (Mlh1-Pms1), which facilitates downstream steps. In addition, MutSβ (Msh2-Msh3) recognizes DNA loops of various sizes, although our previous data and the data presented here suggest that Msh3 of Schizosaccharomyces pombe does not play a role in MMR. To test microsatellite stability in S. pombe and hence DNA loop repair, we have inserted tetra-, penta-, and hepta-nucleotide repeats in the ade6 gene and determined their Ade + reversion rates and spectra in wild type and various mutants. Our data indicate that loops with four unpaired nucleotides in the nascent and the template strand are the upper limit of MutSα- and MutLα-mediated MMR in S. pombe Stability of hepta-nucleotide repeats requires Msh3 and Exo1 in MMR-independent processes as well as the DNA repair proteins Rad50, Rad51, and Rad2 FEN1 Most strikingly, mutation rates in the double mutants msh3 exo1 and msh3 rad51 were decreased when compared to respective single mutants, indicating that Msh3 prevents error prone processes carried out by Exo1 and Rad51. We conclude that Msh3 has no obvious function in MMR in S. pombe , but contributes to DNA repeat stability in MMR-independent processes. Copyright © 2017 Villahermosa et al.

  10. Enzyme-assisted cycling amplification and DNA-templated in-situ deposition of silver nanoparticles for the sensitive electrochemical detection of Hg(2.).

    PubMed

    Xie, Hua; Wang, Qin; Chai, Yaqin; Yuan, Yali; Yuan, Ruo

    2016-12-15

    In this work, a label-free electrochemical biosensor was developed for sensitive and selective detection of mercury (II) ions (Hg(2+)) based on in-situ deposition of silver nanoparticles (AgNPs) on terminal deoxynucleotidyl transferase (TdT) extended ssDNA for signal output and nicking endonuclease for cycling amplification. In the presence of target Hg(2+), the T-rich DNA (HP1) could partly fold into duplex-like structure (termed as output DNA) via T-Hg(2+)-T base pairs and thus exposed its sticky end. The sticky end of output DNA could then hybridize with 3'-PO4 terminated capture DNA (HP2) on electrode surface to form output DNA-HP2 hybridization complex with the sequence 5'-CCTCAGC-3'/3'-GGAGTCG-5' (the sequence could be recognized by nicking endonuclease Nt. BbvCI). With the introduction of Nt. BbvCI, output DNA existed in hybridization complex was released from electrode and participated in the next hybridization process, accompanying with the cleave of HP2 to expose substantial 3'-OH group, which could be extended into a long ssDNA nanotail with the aid of TdT and deoxyadenosine triphosphate (dATP). Since the long negatively charged ssDNA nanotail absorbed the positively charged silver ions on the DNA skeleton, the metallic silver could be in-situ deposited on electrode surface for electrochemical signal output upon addition of reduction regent sodium borohydride. Under optimal conditions, the developed electrochemical biosensor presented a good response to Hg(2+) with a detection limit of 3 pM (S/N=3). Furthermore, the biosensor exhibited good reproducibility and high selectivity towards other interfering ions. The proposed sensing system also showed a promising potential application in real sample analysis. Copyright © 2016 Elsevier B.V. All rights reserved.

  11. Differences in DNA Binding Specificity of Floral Homeotic Protein Complexes Predict Organ-Specific Target Genes.

    PubMed

    Smaczniak, Cezary; Muiño, Jose M; Chen, Dijun; Angenent, Gerco C; Kaufmann, Kerstin

    2017-08-01

    Floral organ identities in plants are specified by the combinatorial action of homeotic master regulatory transcription factors. However, how these factors achieve their regulatory specificities is still largely unclear. Genome-wide in vivo DNA binding data show that homeotic MADS domain proteins recognize partly distinct genomic regions, suggesting that DNA binding specificity contributes to functional differences of homeotic protein complexes. We used in vitro systematic evolution of ligands by exponential enrichment followed by high-throughput DNA sequencing (SELEX-seq) on several floral MADS domain protein homo- and heterodimers to measure their DNA binding specificities. We show that specification of reproductive organs is associated with distinct binding preferences of a complex formed by SEPALLATA3 and AGAMOUS. Binding specificity is further modulated by different binding site spacing preferences. Combination of SELEX-seq and genome-wide DNA binding data allows differentiation between targets in specification of reproductive versus perianth organs in the flower. We validate the importance of DNA binding specificity for organ-specific gene regulation by modulating promoter activity through targeted mutagenesis. Our study shows that intrafamily protein interactions affect DNA binding specificity of floral MADS domain proteins. Differential DNA binding of MADS domain protein complexes plays a role in the specificity of target gene regulation. © 2017 American Society of Plant Biologists. All rights reserved.

  12. TALE-PvuII fusion proteins--novel tools for gene targeting.

    PubMed

    Yanik, Mert; Alzubi, Jamal; Lahaye, Thomas; Cathomen, Toni; Pingoud, Alfred; Wende, Wolfgang

    2013-01-01

    Zinc finger nucleases (ZFNs) consist of zinc fingers as DNA-binding module and the non-specific DNA-cleavage domain of the restriction endonuclease FokI as DNA-cleavage module. This architecture is also used by TALE nucleases (TALENs), in which the DNA-binding modules of the ZFNs have been replaced by DNA-binding domains based on transcription activator like effector (TALE) proteins. Both TALENs and ZFNs are programmable nucleases which rely on the dimerization of FokI to induce double-strand DNA cleavage at the target site after recognition of the target DNA by the respective DNA-binding module. TALENs seem to have an advantage over ZFNs, as the assembly of TALE proteins is easier than that of ZFNs. Here, we present evidence that variant TALENs can be produced by replacing the catalytic domain of FokI with the restriction endonuclease PvuII. These fusion proteins recognize only the composite recognition site consisting of the target site of the TALE protein and the PvuII recognition sequence (addressed site), but not isolated TALE or PvuII recognition sites (unaddressed sites), even at high excess of protein over DNA and long incubation times. In vitro, their preference for an addressed over an unaddressed site is > 34,000-fold. Moreover, TALE-PvuII fusion proteins are active in cellula with minimal cytotoxicity.

  13. Novel structural features drive DNA binding properties of Cmr, a CRP family protein in TB complex mycobacteria.

    PubMed

    Ranganathan, Sridevi; Cheung, Jonah; Cassidy, Michael; Ginter, Christopher; Pata, Janice D; McDonough, Kathleen A

    2018-01-09

    Mycobacterium tuberculosis (Mtb) encodes two CRP/FNR family transcription factors (TF) that contribute to virulence, Cmr (Rv1675c) and CRPMt (Rv3676). Prior studies identified distinct chromosomal binding profiles for each TF despite their recognizing overlapping DNA motifs. The present study shows that Cmr binding specificity is determined by discriminator nucleotides at motif positions 4 and 13. X-ray crystallography and targeted mutational analyses identified an arginine-rich loop that expands Cmr's DNA interactions beyond the classical helix-turn-helix contacts common to all CRP/FNR family members and facilitates binding to imperfect DNA sequences. Cmr binding to DNA results in a pronounced asymmetric bending of the DNA and its high level of cooperativity is consistent with DNA-facilitated dimerization. A unique N-terminal extension inserts between the DNA binding and dimerization domains, partially occluding the site where the canonical cAMP binding pocket is found. However, an unstructured region of this N-terminus may help modulate Cmr activity in response to cellular signals. Cmr's multiple levels of DNA interaction likely enhance its ability to integrate diverse gene regulatory signals, while its novel structural features establish Cmr as an atypical CRP/FNR family member. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. Understanding the recognition mechanisms of Zα domain of human editing enzyme ADAR1 (hZα(ADAR1)) and various Z-DNAs from molecular dynamics simulation.

    PubMed

    Wang, Qianqian; Li, Lanlan; Wang, Xiaoting; Liu, Huanxiang; Yao, Xiaojun

    2014-11-01

    The Z-DNA-binding domain of human double-stranded RNA adenosine deaminase I (hZαADAR1) can specifically recognize the left-handed Z-DNA which preferentially occurs at alternating purine-pyrimidine repeats, especially the CG-repeats. The interactions of hZαADAR1 and Z-DNAs in different sequence contexts can affect many important biological functions including gene regulation and chromatin remodeling. Therefore it is of great necessity to fully understand their recognition mechanisms. However, most existing studies are aimed at the standard CG-repeat Z-DNA rather than the non-CG-repeats, and whether the molecular basis of hZαADAR1 binding to various Z-DNAs are identical or not is still unclear on the atomic level. Here, based on the recently determined crystal structures of three representative non-CG-repeat Z-DNAs (d(CACGTG)2, d(CGTACG)2 and d(CGGCCG)2) in complex with hZαADAR1, 40 ns molecular dynamics simulation together with binding free energy calculation were performed for each system. For comparison, the standard CG-repeat Z-DNA (d(CGCGCG)2) complexed with hZαADAR1 was also simulated. The consistent results demonstrate that nonpolar interaction is the driving force during the protein-DNA binding process, and that polar interaction mainly from helix α3 also provides important contributions. Five common hot-spot residues were identified, namely Lys169, Lys170, Asn173, Arg174 and Tyr177. Hydrogen bond analysis coupled with surface charge distribution further reveal the interfacial information between hZαADAR1 and Z-DNA in detail. All of the analysis illustrate that four complexes share the common key features and the similar binding modes irrespective of Z-DNA sequences, suggesting that Z-DNA recognition by hZαADAR1 is conformation-specific rather than sequence-specific. Additionally, by analyzing the conformational changes of hZαADAR1, we found that the binding of Z-DNA could effectively stabilize hZαADAR1 protein. Our study can provide some valuable information for better understanding the binding mechanism between hZαADAR1 or even other Z-DNA-binding protein and Z-DNA.

  15. Unraveling the Sex Chromosome Heteromorphism of the Paradoxical Frog Pseudis tocantins

    PubMed Central

    Gatto, Kaleb Pretto; Busin, Carmen Silvia; Lourenço, Luciana Bolsoni

    2016-01-01

    The paradoxical frog Pseudis tocantins is the only species in the Hylidae family with known heteromorphic Z and W sex chromosomes. The Z chromosome is metacentric and presents an interstitial nucleolar organizer region (NOR) on the long arm that is adjacent to a pericentromeric heterochromatic band. In contrast, the submetacentric W chromosome carries a pericentromeric NOR on the long arm, which is adjacent to a clearly evident heterochromatic band that is larger than the band found on the Z chromosome and justify the size difference observed between these chromosomes. Here, we provide evidence that the non-centromeric heterochromatic bands in Zq and Wq differ not only in size and location but also in composition, based on comparative genomic hybridization (CGH) and an analysis of the anuran PcP190 satellite DNA. The finding of PcP190 sequences in P. tocantins extends the presence of this satellite DNA, which was previously detected among Leptodactylidae and Hylodidae, suggesting that this family of repetitive DNA is even older than it was formerly considered. Seven groups of PcP190 sequences were recognized in the genome of P. tocantins. PcP190 probes mapped to the heterochromatic band in Wq, and a Southern blot analysis indicated the accumulation of PcP190 in the female genome of P. tocantins, which suggests the involvement of this satellite DNA in the evolution of the sex chromosomes of this species. PMID:27214234

  16. Modularly assembled designer TAL effector nucleases for targeted gene knockout and gene replacement in eukaryotes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, T; Huang, S; Zhao, XF

    Recent studies indicate that the DNA recognition domain of transcription activator-like (TAL) effectors can be combined with the nuclease domain of FokI restriction enzyme to produce TAL effector nucleases (TALENs) that, in pairs, bind adjacent DNA target sites and produce double-strand breaks between the target sequences, stimulating non-homologous end-joining and homologous recombination. Here, we exploit the four prevalent TAL repeats and their DNA recognition cipher to develop a 'modular assembly' method for rapid production of designer TALENs (dTALENs) that recognize unique DNA sequence up to 23 bases in any gene. We have used this approach to engineer 10 dTALENs tomore » target specific loci in native yeast chromosomal genes. All dTALENs produced high rates of site-specific gene disruptions and created strains with expected mutant phenotypes. Moreover, dTALENs stimulated high rates (up to 34%) of gene replacement by homologous recombination. Finally, dTALENs caused no detectable cytotoxicity and minimal levels of undesired genetic mutations in the treated yeast strains. These studies expand the realm of verified TALEN activity from cultured human cells to an intact eukaryotic organism and suggest that low-cost, highly dependable dTALENs can assume a significant role for gene modifications of value in human and animal health, agriculture and industry.« less

  17. Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes.

    PubMed

    Al-Attar, Sinan; Westra, Edze R; van der Oost, John; Brouns, Stan J J

    2011-04-01

    Many prokaryotes contain the recently discovered defense system against mobile genetic elements. This defense system contains a unique type of repetitive DNA stretches, termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs). CRISPRs consist of identical repeated DNA sequences (repeats), interspaced by highly variable sequences referred to as spacers. The spacers originate from either phages or plasmids and comprise the prokaryotes' 'immunological memory'. CRISPR-associated (cas) genes encode conserved proteins that together with CRISPRs make-up the CRISPR/Cas system, responsible for defending the prokaryotic cell against invaders. CRISPR-mediated resistance has been proposed to involve three stages: (i) CRISPR-Adaptation, the invader DNA is encountered by the CRISPR/Cas machinery and an invader-derived short DNA fragment is incorporated in the CRISPR array. (ii) CRISPR-Expression, the CRISPR array is transcribed and the transcript is processed by Cas proteins. (iii) CRISPR-Interference, the invaders' nucleic acid is recognized by complementarity to the crRNA and neutralized. An application of the CRISPR/Cas system is the immunization of industry-relevant prokaryotes (or eukaryotes) against mobile-genetic invasion. In addition, the high variability of the CRISPR spacer content can be exploited for phylogenetic and evolutionary studies. Despite impressive progress during the last couple of years, the elucidation of several fundamental details will be a major challenge in future research.

  18. A new Heraclides swallowtail (Lepidoptera, Papilionidae) from North America is recognized by the pattern on its neck

    PubMed Central

    Shiraiwa, Kojiro; Cong, Qian; Grishin, Nick V.

    2014-01-01

    Abstract Heraclides rumiko Shiraiwa & Grishin, sp. n. is described from southwestern United States, Mexico, and Central America (type locality: USA, Texas, Duval County). It is closely allied to Heraclides cresphontes (Cramer, 1777) and the two species are sympatric in central Texas. The new species is diagnosed by male genitalia and exhibits a nearly 3% difference from Heraclides cresphontes in the COI DNA barcode sequence of mitochondrial DNA. The two Heraclides species can usually be told apart by the shape and size of yellow spots on the neck, by the wing shape, and the details of wing patterns. “Western Giant Swallowtail” is proposed as the English name for Heraclides rumiko. To stabilize nomenclature, neotype for Papilio cresphontes Cramer, 1777, an eastern United States species, is designated from Brooklyn, New York, USA; and lectotype for Papilio thoas Linnaeus, 1771 is designated from Suriname. We sequenced DNA barcodes and ID tags of nearly 400 Papilionini specimens completing coverage of all Heraclides species. Comparative analyses of DNA barcodes, genitalia, and facies suggest that Heraclides oviedo (Gundlach, 1866), reinstated status, is a species-level taxon rather than a subspecies of Heraclides thoas (Linnaeus, 1771); and Heraclides pallas (G. Gray, [1853]), reinstated status, with its subspecies Heraclides Papilio bajaensis (J. Brown & Faulkner, 1992), comb. n., and Heraclides anchicayaensis Constantino, Le Crom & Salazar, 2002, stat. n., are not conspecific with Heraclides astyalus (Godart, 1819). PMID:25610342

  19. Inverted repeats in the promoter as an autoregulatory sequence for TcrX in Mycobacterium tuberculosis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bhattacharya, Monolekha; Das, Amit Kumar, E-mail: amitk@hijli.iitkgp.ernet.in

    Highlights: Black-Right-Pointing-Pointer The regulatory sequences recognized by TcrX have been identified. Black-Right-Pointing-Pointer The regulatory region comprises of inverted repeats segregated by 30 bp region. Black-Right-Pointing-Pointer The mode of binding of TcrX with regulatory sequence is unique. Black-Right-Pointing-Pointer In silico TcrX-DNA docked model binds one of the inverted repeats. Black-Right-Pointing-Pointer Both phosphorylated and unphosphorylated TcrX binds regulatory sequence in vitro. -- Abstract: TcrY, a histidine kinase, and TcrX, a response regulator, constitute a two-component system in Mycobacterium tuberculosis. tcrX, which is expressed during iron scarcity, is instrumental in the survival of iron-dependent M. tuberculosis. However, the regulator of tcrX/Y has notmore » been fully characterized. Crosslinking studies of TcrX reveal that it can form oligomers in vitro. Electrophoretic mobility shift assays (EMSAs) show that TcrX recognizes two regions in the promoter that are comprised of inverted repeats separated by {approx}30 bp. The dimeric in silico model of TcrX predicts binding to one of these inverted repeat regions. Site-directed mutagenesis and radioactive phosphorylation indicate that D54 of TcrX is phosphorylated by H256 of TcrY. However, phosphorylated and unphosphorylated TcrX bind the regulatory sequence with equal efficiency, which was shown with an EMSA using the D54A TcrX mutant.« less

  20. Scar-less multi-part DNA assembly design automation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hillson, Nathan J.

    The present invention provides a method of a method of designing an implementation of a DNA assembly. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding flanking homology sequences to each of the DNA oligos. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which tomore » assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding optimized overhang sequences to each of the DNA oligos.« less

  1. Novel epigenetic determinants of type 2 diabetes in Mexican-American families.

    PubMed

    Kulkarni, Hemant; Kos, Mark Z; Neary, Jennifer; Dyer, Thomas D; Kent, Jack W; Göring, Harald H H; Cole, Shelley A; Comuzzie, Anthony G; Almasy, Laura; Mahaney, Michael C; Curran, Joanne E; Blangero, John; Carless, Melanie A

    2015-09-15

    Although DNA methylation is now recognized as an important mediator of complex diseases, the extent to which the genetic basis of such diseases is accounted for by DNA methylation is unknown. In the setting of large, extended families representing a minority, high-risk population of the USA, we aimed to characterize the role of epigenome-wide DNA methylation in type 2 diabetes (T2D). Using Illumina HumanMethylation450 BeadChip arrays, we tested for association of DNA methylation at 446 356 sites with age, sex and phenotypic traits related to T2D in 850 pedigreed Mexican-American individuals. Robust statistical analyses showed that (i) 15% of the methylome is significantly heritable, with a median heritability of 0.14; (ii) DNA methylation at 14% of CpG sites is associated with nearby sequence variants; (iii) 22% and 3% of the autosomal CpG sites are associated with age and sex, respectively; (iv) 53 CpG sites were significantly associated with liability to T2D, fasting blood glucose and insulin resistance; (v) DNA methylation levels at five CpG sites, mapping to three well-characterized genes (TXNIP, ABCG1 and SAMD12) independently explained 7.8% of the heritability of T2D (vi) methylation at these five sites was unlikely to be influenced by neighboring DNA sequence variation. Our study has identified novel epigenetic indicators of T2D risk in Mexican Americans who have increased risk for this disease. These results provide new insights into potential treatment targets of T2D. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  2. Phosphorylation and cellular function of the human Rpa2 N-terminus in the budding yeast Saccharomyces cerevisiae.

    PubMed

    Ghospurkar, Padmaja L; Wilson, Timothy M; Liu, Shengqin; Herauf, Anna; Steffes, Jenna; Mueller, Erica N; Oakley, Gregory G; Haring, Stuart J

    2015-02-01

    Maintenance of genome integrity is critical for proper cell growth. This occurs through accurate DNA replication and repair of DNA lesions. A key factor involved in both DNA replication and the DNA damage response is the heterotrimeric single-stranded DNA (ssDNA) binding complex Replication Protein A (RPA). Although the RPA complex appears to be structurally conserved throughout eukaryotes, the primary amino acid sequence of each subunit can vary considerably. Examination of sequence differences along with the functional interchangeability of orthologous RPA subunits or regions could provide insight into important regions and their functions. This might also allow for study in simpler systems. We determined that substitution of yeast Replication Factor A (RFA) with human RPA does not support yeast cell viability. Exchange of a single yeast RFA subunit with the corresponding human RPA subunit does not function due to lack of inter-species subunit interactions. Substitution of yeast Rfa2 with domains/regions of human Rpa2 important for Rpa2 function (i.e., the N-terminus and the loop 3-4 region) supports viability in yeast cells, and hybrid proteins containing human Rpa2 N-terminal phospho-mutations result in similar DNA damage phenotypes to analogous yeast Rfa2 N-terminal phospho-mutants. Finally, the human Rpa2 N-terminus (NT) fused to yeast Rfa2 is phosphorylated in a manner similar to human Rpa2 in human cells, indicating that conserved kinases recognize the human domain in yeast. The implication is that budding yeast represents a potential model system for studying not only human Rpa2 N-terminal phosphorylation, but also phosphorylation of Rpa2 N-termini from other eukaryotic organisms. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  3. DHX9 helicase is involved in preventing genomic instability induced by alternatively structured DNA in human cells

    PubMed Central

    Jain, Aklank; Bacolla, Albino; del Mundo, Imee M.; Zhao, Junhua; Wang, Guliang; Vasquez, Karen M.

    2013-01-01

    Sequences that have the capacity to adopt alternative (i.e. non-B) DNA structures in the human genome have been implicated in stimulating genomic instability. Previously, we found that a naturally occurring intra-molecular triplex (H-DNA) caused genetic instability in mammals largely in the form of DNA double-strand breaks. Thus, it is of interest to determine the mechanism(s) involved in processing H-DNA. Recently, we demonstrated that human DHX9 helicase preferentially unwinds inter-molecular triplex DNA in vitro. Herein, we used a mutation-reporter system containing H-DNA to examine the relevance of DHX9 activity on naturally occurring H-DNA structures in human cells. We found that H-DNA significantly increased mutagenesis in small-interfering siRNA-treated, DHX9-depleted cells, affecting mostly deletions. Moreover, DHX9 associated with H-DNA in the context of supercoiled plasmids. To further investigate the role of DHX9 in the recognition/processing of H-DNA, we performed binding assays in vitro and chromatin immunoprecipitation assays in U2OS cells. DHX9 recognized H-DNA, as evidenced by its binding to the H-DNA structure and enrichment at the H-DNA region compared with a control region in human cells. These composite data implicate DHX9 in processing H-DNA structures in vivo and support its role in the overall maintenance of genomic stability at sites of alternatively structured DNA. PMID:24049074

  4. DHX9 helicase is involved in preventing genomic instability induced by alternatively structured DNA in human cells.

    PubMed

    Jain, Aklank; Bacolla, Albino; Del Mundo, Imee M; Zhao, Junhua; Wang, Guliang; Vasquez, Karen M

    2013-12-01

    Sequences that have the capacity to adopt alternative (i.e. non-B) DNA structures in the human genome have been implicated in stimulating genomic instability. Previously, we found that a naturally occurring intra-molecular triplex (H-DNA) caused genetic instability in mammals largely in the form of DNA double-strand breaks. Thus, it is of interest to determine the mechanism(s) involved in processing H-DNA. Recently, we demonstrated that human DHX9 helicase preferentially unwinds inter-molecular triplex DNA in vitro. Herein, we used a mutation-reporter system containing H-DNA to examine the relevance of DHX9 activity on naturally occurring H-DNA structures in human cells. We found that H-DNA significantly increased mutagenesis in small-interfering siRNA-treated, DHX9-depleted cells, affecting mostly deletions. Moreover, DHX9 associated with H-DNA in the context of supercoiled plasmids. To further investigate the role of DHX9 in the recognition/processing of H-DNA, we performed binding assays in vitro and chromatin immunoprecipitation assays in U2OS cells. DHX9 recognized H-DNA, as evidenced by its binding to the H-DNA structure and enrichment at the H-DNA region compared with a control region in human cells. These composite data implicate DHX9 in processing H-DNA structures in vivo and support its role in the overall maintenance of genomic stability at sites of alternatively structured DNA.

  5. Variability and repertoire size of T-cell receptor V alpha gene segments.

    PubMed

    Becker, D M; Pattern, P; Chien, Y; Yokota, T; Eshhar, Z; Giedlin, M; Gascoigne, N R; Goodnow, C; Wolf, R; Arai, K

    The immune system of higher organisms is composed largely of two distinct cell types, B lymphocytes and T lymphocytes, each of which is independently capable of recognizing an enormous number of distinct entities through their antigen receptors; surface immunoglobulin in the case of the former, and the T-cell receptor (TCR) in the case of the latter. In both cell types, the genes encoding the antigen receptors consist of multiple gene segments which recombine during maturation to produce many possible peptides. One striking difference between B- and T-cell recognition that has not yet been resolved by the structural data is the fact that T cells generally require a major histocompatibility determinant together with an antigen whereas, in most cases, antibodies recognize antigen alone. Recently, we and others have found that a series of TCR V beta gene sequences show conservation of many of the same residues that are conserved between heavy- and light-chain immunoglobulin V regions, and these V beta sequences are predicted to have an immunoglobulin-like secondary structure. To extend these studies, we have isolated and sequenced eight additional alpha-chain complementary cDNA clones and compared them with published sequences. Analyses of these sequences, reported here, indicate that V alpha regions have many of the characteristics of V beta gene segments but differ in that they almost always occur as cross-hybridizing gene families. We conclude that there may be very different selective pressures operating on V alpha and V beta sequences and that the V alpha repertoire may be considerably larger than that of V beta.

  6. HLA-B*3531, a hybrid of B35 and B61, implications for diagnostic approaches to alleles with complex ancestral compositions.

    PubMed

    Elsner, H-A; Himmel, A; Steitz, M; Hammer, P; Schmitz, G; Ballas, M; Blasczyk, R

    2002-07-01

    The serological characterization of allelic variants that have been generated by large-scale interallelic recombination events indicates which residues may be involved in the formation of epitopes crucial for serological recognition. The allelic product of HLA-B*3531 is composed of B35 in its alpha1 domain and of B61(40) in its alpha2 domain. Both specificities are only weakly detectable with available sera. Allelic products with 'mixed' serology also represent a challenge to DNA-based HLA typing methods, as only the sequence motif of one ancestral allele may be recognized. In this case the hidden specificity would not be considered in the matching process and might not be recognized as an antigen 'unacceptable' to the recipient.

  7. DNA residence time is a regulatory factor of transcription repression

    PubMed Central

    Clauß, Karen; Popp, Achim P.; Schulze, Lena; Hettich, Johannes; Reisser, Matthias; Escoter Torres, Laura; Uhlenhaut, N. Henriette

    2017-01-01

    Abstract Transcription comprises a highly regulated sequence of intrinsically stochastic processes, resulting in bursts of transcription intermitted by quiescence. In transcription activation or repression, a transcription factor binds dynamically to DNA, with a residence time unique to each factor. Whether the DNA residence time is important in the transcription process is unclear. Here, we designed a series of transcription repressors differing in their DNA residence time by utilizing the modular DNA binding domain of transcription activator-like effectors (TALEs) and varying the number of nucleotide-recognizing repeat domains. We characterized the DNA residence times of our repressors in living cells using single molecule tracking. The residence times depended non-linearly on the number of repeat domains and differed by more than a factor of six. The factors provoked a residence time-dependent decrease in transcript level of the glucocorticoid receptor-activated gene SGK1. Down regulation of transcription was due to a lower burst frequency in the presence of long binding repressors and is in accordance with a model of competitive inhibition of endogenous activator binding. Our single molecule experiments reveal transcription factor DNA residence time as a regulatory factor controlling transcription repression and establish TALE-DNA binding domains as tools for the temporal dissection of transcription regulation. PMID:28977492

  8. Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

    DOEpatents

    Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

    2013-06-25

    A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.

  9. Multiple nucleotide preferences determine cleavage-site recognition by the HIV-1 and M-MuLV RNases H.

    PubMed

    Schultz, Sharon J; Zhang, Miaohua; Champoux, James J

    2010-03-19

    The RNase H activity of reverse transcriptase is required during retroviral replication and represents a potential target in antiviral drug therapies. Sequence features flanking a cleavage site influence the three types of retroviral RNase H activity: internal, DNA 3'-end-directed, and RNA 5'-end-directed. Using the reverse transcriptases of HIV-1 (human immunodeficiency virus type 1) and Moloney murine leukemia virus (M-MuLV), we evaluated how individual base preferences at a cleavage site direct retroviral RNase H specificity. Strong test cleavage sites (designated as between nucleotide positions -1 and +1) for the HIV-1 and M-MuLV enzymes were introduced into model hybrid substrates designed to assay internal or DNA 3'-end-directed cleavage, and base substitutions were tested at specific nucleotide positions. For internal cleavage, positions +1, -2, -4, -5, -10, and -14 for HIV-1 and positions +1, -2, -6, and -7 for M-MuLV significantly affected RNase H cleavage efficiency, while positions -7 and -12 for HIV-1 and positions -4, -9, and -11 for M-MuLV had more modest effects. DNA 3'-end-directed cleavage was influenced substantially by positions +1, -2, -4, and -5 for HIV-1 and positions +1, -2, -6, and -7 for M-MuLV. Cleavage-site distance from the recessed end did not affect sequence preferences for M-MuLV reverse transcriptase. Based on the identified sequence preferences, a cleavage site recognized by both HIV-1 and M-MuLV enzymes was introduced into a sequence that was otherwise resistant to RNase H. The isolated RNase H domain of M-MuLV reverse transcriptase retained sequence preferences at positions +1 and -2 despite prolific cleavage in the absence of the polymerase domain. The sequence preferences of retroviral RNase H likely reflect structural features in the substrate that favor cleavage and represent a novel specificity determinant to consider in drug design. Copyright (c) 2010 Elsevier Ltd. All rights reserved.

  10. Phylogenetics of Pinus (Pinaceae) based on nuclear ribosomal DNA internal transcribed spacer region sequences.

    PubMed

    Liston, A; Robinson, W A; Piñero, D; Alvarez-Buylla, E R

    1999-02-01

    A 650-bp portion of the nuclear ribosomal DNA internal transcribed spacer region was sequenced in 47 species of Pinus, representing all recognized subsections of the genus, and 2 species of Picea and Cathaya as outgroups. Parsimony analyses of these length variable sequences were conducted using a manual alignment, 13 different automated alignments, elision of the automated alignments, and exclusion of all alignment ambiguous sites. High and moderately supported clades were consistently resolved across the different analyses, while poorly supported clades were inconsistently recovered. Comparison of the topologies highlights taxa of particularly problematic placement including Pinus nelsonii and P. aristata. Within subgenus Pinus, there is moderate support for the monophyly of a narrowly circumscribed subsect. Pinus (=subsect. Sylvestres) and strong support for a clade of North and Central American hard pines. The Himalayan P. roxburghii may be sister species to these "New World hard pines," which have two well-supported subgroups, subsect. Ponderosae and a clade of the remaining five subsections. The position of subsect. Contortae conflicts with its placement in a chloroplast DNA restriction site study. Within subgenus Strobus there is consistent support for the monophyly of a broadly circumscribed subsect. Strobi (including P. krempfii and a polyphyletic subsect. Cembrae) derived from a paraphyletic grade of the remaining soft pines. Relationships among subsects. Gerardianae, Cembroides, and Balfourianae are poorly resolved. Support for the monophyly of subgenus Pinus and subgenus Strobus is not consistently obtained. Copyright 1999 Academic Press.

  11. New CRISPR–Cas systems from uncultivated microbes

    DOE PAGES

    Burstein, David; Harrington, Lucas B.; Strutt, Steven C.; ...

    2016-12-22

    We present that CRISPR-Cas systems provide microbes with adaptive immunity by employing short DNA sequences, termed spacers, that guide Cas proteins to cleave foreign DNA. Class 2 CRISPR-Cas systems are streamlined versions, in which a single RNA-bound Cas protein recognizes and cleaves target sequences. The programmable nature of these minimal systems has enabled researchers to repurpose them into a versatile technology that is broadly revolutionizing biological and clinical research. However, current CRISPR-Cas technologies are based solely on systems from isolated bacteria, leaving the vast majority of enzymes from organisms that have not been cultured untapped. Metagenomics, the sequencing of DNAmore » extracted directly from natural microbial communities, provides access to the genetic material of a huge array of uncultivated organisms. Here, using genome-resolved metagenomics, we identify a number of CRISPR-Cas systems, including the first reported Cas9 in the archaeal domain of life, to our knowledge. This divergent Cas9 protein was found in little-studied nanoarchaea as part of an active CRISPR-Cas system. In bacteria, we discovered two previously unknown systems, CRISPR-CasX and CRISPR-CasY, which are among the most compact systems yet discovered. Notably, all required functional components were identified by metagenomics, enabling validation of robust in vivo RNA-guided DNA interference activity in Escherichia coli. Lastly, interrogation of environmental microbial communities combined with in vivo experiments allows us to access an unprecedented diversity of genomes, the content of which will expand the repertoire of microbe-based biotechnologies.« less

  12. New CRISPR–Cas systems from uncultivated microbes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Burstein, David; Harrington, Lucas B.; Strutt, Steven C.

    We present that CRISPR-Cas systems provide microbes with adaptive immunity by employing short DNA sequences, termed spacers, that guide Cas proteins to cleave foreign DNA. Class 2 CRISPR-Cas systems are streamlined versions, in which a single RNA-bound Cas protein recognizes and cleaves target sequences. The programmable nature of these minimal systems has enabled researchers to repurpose them into a versatile technology that is broadly revolutionizing biological and clinical research. However, current CRISPR-Cas technologies are based solely on systems from isolated bacteria, leaving the vast majority of enzymes from organisms that have not been cultured untapped. Metagenomics, the sequencing of DNAmore » extracted directly from natural microbial communities, provides access to the genetic material of a huge array of uncultivated organisms. Here, using genome-resolved metagenomics, we identify a number of CRISPR-Cas systems, including the first reported Cas9 in the archaeal domain of life, to our knowledge. This divergent Cas9 protein was found in little-studied nanoarchaea as part of an active CRISPR-Cas system. In bacteria, we discovered two previously unknown systems, CRISPR-CasX and CRISPR-CasY, which are among the most compact systems yet discovered. Notably, all required functional components were identified by metagenomics, enabling validation of robust in vivo RNA-guided DNA interference activity in Escherichia coli. Lastly, interrogation of environmental microbial communities combined with in vivo experiments allows us to access an unprecedented diversity of genomes, the content of which will expand the repertoire of microbe-based biotechnologies.« less

  13. Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage.

    PubMed

    Brok-Volchanskaya, Vera S; Kadyrov, Farid A; Sivogrivov, Dmitry E; Kolosov, Peter M; Sokolov, Andrey S; Shlyapnikov, Michael G; Kryukov, Valentine M; Granovsky, Igor E

    2008-04-01

    Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. Bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that SegB protein is a site-specific endonuclease, which produces mostly 3' 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains 11-bp conserved sequence, which corresponds to a conserved motif of tRNA TpsiC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack segB gene and several tRNA genes. In co-infections of phages T4 and T2L, segB gene is inherited with nearly 100% of efficiency. The preferred inheritance depends absolutely on the segB gene integrity and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages.

  14. Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

    DOEpatents

    Gardner, Shea N [San Leandro, CA; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA; Young, Jennifer A [Berkeley, CA; Clague, David S [Livermore, CA

    2011-01-18

    A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.

  15. Pyrosequencing as a tool for the identification of common isolates of Mycobacterium sp.

    PubMed

    Tuohy, Marion J; Hall, Gerri S; Sholtis, Mary; Procop, Gary W

    2005-04-01

    Pyrosequencing technology, sequencing by addition, was evaluated for categorization of mycobacterial isolates. One hundred and eighty-nine isolates, including 18 ATCC and Trudeau Mycobacterial Culture Collection (TMC) strains, were studied. There were 38 Mycobacterium tuberculosis complex, 27 M. kansasii, 27 MAI complex, 21 M. marinum, 14 M. gordonae, 20 M. chelonae-abscessus group, 10 M. fortuitum, 5 M. xenopi, 3 M. celatum, 2 M. terrae complex, 20 M. mucogenicum, and 2 M. scrofulaceum. Nucleic acid extracts were prepared from solid media or MGIT broth. Traditional PCR was performed with one of the primers biotinylated; the assay targeted a portion of the 16S rRNA gene that contains a hypervariable region, which has been previously shown to be useful for the identification of mycobacteria. The PSQ Sample Preparation Kit was used, and the biotinylated PCR product was processed to a single-stranded DNA template. The sequencing primer was hybridized to the DNA template in a PSQ96 plate. Incorporation of the complementary nucleotides resulted in light generation peaks, forming a pyrogram, which was evaluated by the instrument software. Thirty basepairs were used for isolate categorization. Manual interpretation of the sequences was performed if the quality of the 30-bp sequence was in doubt or if more than 4 bp homopolymers were recognized. Sequences with more than 5 bp of bad quality were deemed unacceptable. When blasted against GenBank, 179 of 189 sequences (94.7%) assigned isolates to the correct molecular genus or group. Ten M. gordonae isolates had more than 5 bp of bad quality sequence and were not accepted. Pyrosequencing of this hypervariable region afforded rapid and acceptable characterization of common, routinely isolated clinical Mycobacterium sp. Algorithms are recommended for further differentiation with an additional sequencing primer or additional biochemicals.

  16. Fanconi anemia proteins in telomere maintenance.

    PubMed

    Sarkar, Jaya; Liu, Yie

    2016-07-01

    Mammalian chromosome ends are protected by nucleoprotein structures called telomeres. Telomeres ensure genome stability by preventing chromosome termini from being recognized as DNA damage. Telomere length homeostasis is inevitable for telomere maintenance because critical shortening or over-lengthening of telomeres may lead to DNA damage response or delay in DNA replication, and hence genome instability. Due to their repetitive DNA sequence, unique architecture, bound shelterin proteins, and high propensity to form alternate/secondary DNA structures, telomeres are like common fragile sites and pose an inherent challenge to the progression of DNA replication, repair, and recombination apparatus. It is conceivable that longer the telomeres are, greater is the severity of such challenges. Recent studies have linked excessively long telomeres with increased tumorigenesis. Here we discuss telomere abnormalities in a rare recessive chromosomal instability disorder called Fanconi Anemia and the role of the Fanconi Anemia pathway in telomere biology. Reports suggest that Fanconi Anemia proteins play a role in maintaining long telomeres, including processing telomeric joint molecule intermediates. We speculate that ablation of the Fanconi Anemia pathway would lead to inadequate aberrant structural barrier resolution at excessively long telomeres, thereby causing replicative burden on the cell. Published by Elsevier B.V.

  17. Crystal structure of APOBEC3A bound to single-stranded DNA reveals structural basis for cytidine deamination and specificity

    PubMed Central

    Kouno, Takahide; Silvas, Tania V.; Hilbert, Brendan J.; Shandilya, Shivender M. D.; Bohn, Markus F.; Kelch, Brian A.; Royer, William E.; Somasundaran, Mohan; Kurt Yilmaz, Nese; Matsuo, Hiroshi; Schiffer, Celia A.

    2017-01-01

    Nucleic acid editing enzymes are essential components of the immune system that lethally mutate viral pathogens and somatically mutate immunoglobulins, and contribute to the diversification and lethality of cancers. Among these enzymes are the seven human APOBEC3 deoxycytidine deaminases, each with unique target sequence specificity and subcellular localization. While the enzymology and biological consequences have been extensively studied, the mechanism by which APOBEC3s recognize and edit DNA remains elusive. Here we present the crystal structure of a complex of a cytidine deaminase with ssDNA bound in the active site at 2.2 Å. This structure not only visualizes the active site poised for catalysis of APOBEC3A, but pinpoints the residues that confer specificity towards CC/TC motifs. The APOBEC3A–ssDNA complex defines the 5′–3′ directionality and subtle conformational changes that clench the ssDNA within the binding groove, revealing the architecture and mechanism of ssDNA recognition that is likely conserved among all polynucleotide deaminases, thereby opening the door for the design of mechanistic-based therapeutics. PMID:28452355

  18. Evolutionary relationships in Panicoid grasses based on plastome phylogenomics (Panicoideae; Poaceae).

    PubMed

    Burke, Sean V; Wysocki, William P; Zuloaga, Fernando O; Craine, Joseph M; Pires, J Chris; Edger, Patrick P; Mayfield-Jones, Dustin; Clark, Lynn G; Kelchner, Scot A; Duvall, Melvin R

    2016-06-18

    Panicoideae are the second largest subfamily in Poaceae (grass family), with 212 genera and approximately 3316 species. Previous studies have begun to reveal relationships within the subfamily, but largely lack resolution and/or robust support for certain tribal and subtribal groups. This study aims to resolve these relationships, as well as characterize a putative mitochondrial insert in one linage. 35 newly sequenced Panicoideae plastomes were combined in a phylogenomic study with 37 other species: 15 Panicoideae and 22 from outgroups. A robust Panicoideae topology largely congruent with previous studies was obtained, but with some incongruences with previously reported subtribal relationships. A mitochondrial DNA (mtDNA) to plastid DNA (ptDNA) transfer was discovered in the Paspalum lineage. The phylogenomic analysis returned a topology that largely supports previous studies. Five previously recognized subtribes appear on the topology to be non-monophyletic. Additionally, evidence for mtDNA to ptDNA transfer was identified in both Paspalum fimbriatum and P. dilatatum, and suggests a single rare event that took place in a common progenitor. Finally, the framework from this study can guide larger whole plastome sampling to discern the relationships in Cyperochloeae, Steyermarkochloeae, Gynerieae, and other incertae sedis taxa that are weakly supported or unresolved.

  19. Algoriphagus aestuarii sp. nov., a member of the Cyclobacteriaceae isolated from a tidal-flat sediment of the Yellow Sea in Korea.

    PubMed

    Jung, Yong-Taek; Lee, Jung-Sook; Yoon, Jung-Hoon

    2015-10-01

    A Gram-strain-negative, coccoid or oval-shaped, non-motile bacterial strain, designated MDM-1T, was isolated from a tidal-flat sediment on the Korean peninsula. Strain MDM-1T was found to grow optimally at pH 7.0-8.0, at 30 °C and in the presence of 2-3 % (w/v) NaCl. A neighbour-joining phylogenetic tree based on 16S rRNA gene sequences showed that strain MDM-1T falls within the clade comprising species of the genus Algoriphagus, clustering with the type strains of Algoriphagus halophilus, A. lutimaris, A. chungangensis and A. machipongonensis, with which it exhibited 97.2-98.5 % 16S rRNA gene sequence similarity. Sequence similarities to the type strains of the other recognized species of the genus Algoriphagus were 92.8-97.6 %. Strain MDM-1T was found to contain MK-7 as the predominant menaquinone and iso-C15 : 0 and summed feature 3 (C16 : 1ω6c and/or C16 : 1ω7c) as the major fatty acids. The major polar lipids were identified as phosphatidylcholine, phosphatidylethanolamine and two unidentified lipids. The DNA G+C content of strain MDM-1T was determined to be 42.7 mol% and the mean DNA-DNA relatedness with A. halophilus KCTC 12051T, A. lutimaris S1-3T, A. chungangensis KCTC 23759T, A. machipongonensis DSM 24695T and A. ratkowskyi CIP 107452T was 19.7-5.2 %. Differential phenotypic properties, together with the phylogenetic and genetic distinctiveness, revealed that strain MDM-1T is distinguishable from recognized species of the genus Algoriphagus. On the basis of the data presented, strain MDM-1T is proposed to represent a novel species of the genus Algoriphagus, for which the name Algoriphagus aestuarii sp. nov. is proposed. The type strain is MDM-1T ( = KCTC 42199T = NBRC 110552T).

  20. Interactions of DNA binding proteins with G-Quadruplex structures at the single molecule level

    NASA Astrophysics Data System (ADS)

    Ray, Sujay

    Guanine-rich nucleic acid (DNA/RNA) sequences can form non-canonical secondary structures, known as G-quadruplex (GQ). Numerous in vivo and in vitro studies have demonstrated formation of these structures in telomeric and non-telomeric regions of the genome. Telomeric GQs protect the chromosome ends whereas non-telomeric GQs either act as road blocks or recognition sites for DNA metabolic machinery. These observations suggest the significance of these structures in regulation of different metabolic processes, such as replication and repair. GQs are typically thermodynamically more stable than the corresponding Watson-Crick base pairing formed by G-rich and C-rich strands, making protein activity a crucial factor for their destabilization. Inside the cell, GQs interact with different proteins and their enzymatic activity is the determining factor for their stability. We studied interactions of several proteins with GQs to understand the underlying principles of protein-GQ interactions using single-molecule FRET and other biophysical techniques. Replication Protein-A (RPA), a single stranded DNA (ssDNA) binding protein, is known to posses GQ unfolding activity. First, we compared the thermal stability of three potentially GQ-forming DNA sequences (PQS) to their stability against RPA-mediated unfolding. One of these sequences is the human telomeric repeat and the other two, located in the promoter region of tyrosine hydroxylase gene, are highly heterogeneous sequences that better represent PQS in the genome. The thermal stability of these structures do not necessarily correlate with their stability against protein-mediated unfolding. We conclude that thermal stability is not necessarily an adequate criterion for predicting the physiological viability of GQ structures. To determine the critical structural factors that influence protein-GQ interactions we studied two groups of GQ structures that have systematically varying loop lengths and number of G-tetrad layers. We observed a linear increase in the steady-state stability of the GQ against RPA-mediated unfolding with increasing number of layers or decreasing loop length. The stability demonstrated by different GQ structures varied by at least three orders of magnitude. Finally, we studied another protein-GQ system where a protein complex works synergistically with a GQ to suppress DNA damage signals by preventing RPA to bind to telomeric DNA. Human telomeres that terminate with a single-stranded 3' G-overhang can be recognized as a DNA damage site by RPA. The protection of telomere-1 (POT1) and POT1-interacting protein (TPP1) heterodimer, binds specifically to telomeric DNA and protects it against RPA binding. Using model telomeric DNA, we studied the competition between POT1/TPP1 and RPA to access telomeric GQs in vitro. Under physiological salt and pH conditions, POT1/TPP1 stably load to a minimal DNA sequence adjacent to a folded GQ and unfolds the anti-parallel GQ as the parallel conformation remains folded. We showed that GQ formation of telomeres enhances the ability of POT1/TPP1 to block RPA's access to telomeres by two orders of magnitude and contributes to suppress DNA damage signals.

  1. Cloning of the Gene Encoding a 22-Kilodalton Cell Surface Antigen of Mycobacterium bovis BCG and Analysis of Its Potential for DNA Vaccination against Tuberculosis

    PubMed Central

    Lefèvre, Philippe; Denis, Olivier; De Wit, Lucas; Tanghe, Audrey; Vandenbussche, Paul; Content, Jean; Huygen, Kris

    2000-01-01

    Using spleen cells from mice vaccinated with live Mycobacterium bovis BCG, we previously generated three monoclonal antibodies reactive against a 22-kDa protein present in mycobacterial culture filtrate (CF) (K. Huygen et al., Infect. Immun. 61:2687–2693, 1993). These monoclonal antibodies were used to screen an M. bovis BCG genomic library made in phage λgt11. The gene encoding a 233-amino-acid (aa) protein, including a putative 26-aa signal sequence, was isolated, and sequence analysis indicated that the protein was 98% identical with the M. tuberculosis Lppx protein and that it contained a sequence 94% identical with the M. leprae 38-mer polypeptide 13B3 recognized by T cells from killed M. leprae-immunized subjects. Flow cytometry and cell fractionation demonstrated that the 22-kDa CF protein is also highly expressed in the bacterial cell wall and membrane compartment but not in the cytosol. C57BL/6, C3H, and BALB/c mice were vaccinated with plasmid DNA encoding the 22-kDa protein and analyzed for immune response and protection against intravenous M. tuberculosis challenge. Whereas DNA vaccination induced elevated antibody responses in C57BL/6 and particularly in C3H mice, Th1-type cytokine response, as measured by interleukin-2 and gamma interferon secretion, was only modest, and no protection against intravenous M. tuberculosis challenge was observed in any of the three mouse strains tested. Therefore, the 22-kDa antigen seems to have little potential for a DNA vaccine against tuberculosis, but it may be a good candidate for a mycobacterial antigen detection test. PMID:10678905

  2. DNA barcode based wildlife forensics for resolving the origin of claw samples using a novel primer cocktail.

    PubMed

    Khedkar, Gulab D; Abhayankar, Shil Bapurao; Nalage, Dinesh; Ahmed, Shaikh Nadeem; Khedkar, Chandraprakash D

    2016-11-01

    Excessive wildlife hunting for commercial purposes can have negative impacts on biodiversity and may result in species extinction. To ensure compliance with legal statutes, forensic identification approaches relying on molecular markers may be used to identify the species of origin of animal material from hairs, claw, blood, bone, or meat. Using this approach, DNA sequences from the COI "barcoding" gene have been used to identify material from a number of domesticated animal species. However, many wild species of carnivores still present great challenges in generating COI barcodes using standard "universal" primer pairs. In the work presented here, the mitochondrial COI gene was successfully amplified using a novel primer cocktail, and the products were sequenced to determine the species of twenty one unknown samples of claw material collected as part of forensic wildlife case investigations. Sixteen of the unknown samples were recognized to have originated from either Panthera leo or P. pardus individuals. The remaining five samples could be identified only to the family level due to the absence of reference animal sequences. This is the first report on the use of COI sequences for the identification of P. pardus and P. leo from claw samples as part of forensic investigations in India. The study also highlights the need for adequate reference material to aid in the resolution of suspected cases of illegal wildlife harvesting.

  3. High sensitivity 5-hydroxymethylcytosine detection in Balb/C brain tissue.

    PubMed

    Davis, Theodore; Vaisvila, Romualdas

    2011-02-01

    DNA hydroxymethylation is a long known modification of DNA, but has recently become a focus in epigenetic research. Mammalian DNA is enzymatically modified at the 5(th) carbon position of cytosine (C) residues to 5-mC, predominately in the context of CpG dinucleotides. 5-mC is amenable to enzymatic oxidation to 5-hmC by the Tet family of enzymes, which are believed to be involved in development and disease. Currently, the biological role of 5-hmC is not fully understood, but is generating a lot of interest due to its potential as a biomarker. This is due to several groundbreaking studies identifying 5-hydroxymethylcytosine in mouse embryonic stem (ES) and neuronal cells. Research techniques, including bisulfite sequencing methods, are unable to easily distinguish between 5-mC and 5-hmC . A few protocols exist that can measure global amounts of 5-hydroxymethylcytosine in the genome, including liquid chromatography coupled with mass spectrometry analysis or thin layer chromatography of single nucleosides digested from genomic DNA. Antibodies that target 5-hydroxymethylcytosine also exist, which can be used for dot blot analysis, immunofluorescence, or precipitation of hydroxymethylated DNA, but these antibodies do not have single base resolution.In addition, resolution depends on the size of the immunoprecipitated DNA and for microarray experiments, depends on probe design. Since it is unknown exactly where 5-hydroxymethylcytosine exists in the genome or its role in epigenetic regulation, new techniques are required that can identify locus specific hydroxymethylation. The EpiMark 5-hmC and 5-mC Analysis Kit provides a solution for distinguishing between these two modifications at specific loci. The EpiMark 5-hmC and 5-mC Analysis Kit is a simple and robust method for the identification and quantitation of 5-methylcytosine and 5-hydroxymethylcytosine within a specific DNA locus. This enzymatic approach utilizes the differential methylation sensitivity of the isoschizomers MspI and HpaII in a simple 3-step protocol. Genomic DNA of interest is treated with T4-BGT, adding a glucose moeity to 5-hydroxymethylcytosine. This reaction is sequence-independent, therefore all 5-hmC will be glucosylated; unmodified or 5-mC containing DNA will not be affected. This glucosylation is then followed by restriction endonuclease digestion. MspI and HpaII recognize the same sequence (CCGG) but are sensitive to different methylation states. HpaII cleaves only a completely unmodified site: any modification (5-mC, 5-hmC or 5-ghmC) at either cytosine blocks cleavage. MspI recognizes and cleaves 5-mC and 5-hmC, but not 5-ghmC. The third part of the protocol is interrogation of the locus by PCR. As little as 20 ng of input DNA can be used. Amplification of the experimental (glucosylated and digested) and control (mock glucosylated and digested) target DNA with primers flanking a CCGG site of interest (100-200 bp) is performed. If the CpG site contains 5-hydroxymethylcytosine, a band is detected after glucosylation and digestion, but not in the non-glucosylated control reaction. Real time PCR will give an approximation of how much hydroxymethylcytosine is in this particular site. In this experiment, we will analyze the 5-hydroxymethylcytosine amount in a mouse Babl/C brain sample by end point PCR.

  4. DNA barcoding of Bemisia tabaci complex (Hemiptera: Aleyrodidae) reveals southerly expansion of the dominant whitefly species on cotton in Pakistan.

    PubMed

    Ashfaq, Muhammad; Hebert, Paul D N; Mirza, M Sajjad; Khan, Arif M; Mansoor, Shahid; Shah, Ghulam S; Zafar, Yusuf

    2014-01-01

    Although whiteflies (Bemisia tabaci complex) are an important pest of cotton in Pakistan, its taxonomic diversity is poorly understood. As DNA barcoding is an effective tool for resolving species complexes and analyzing species distributions, we used this approach to analyze genetic diversity in the B. tabaci complex and map the distribution of B. tabaci lineages in cotton growing areas of Pakistan. Sequence diversity in the DNA barcode region (mtCOI-5') was examined in 593 whiteflies from Pakistan to determine the number of whitefly species and their distributions in the cotton-growing areas of Punjab and Sindh provinces. These new records were integrated with another 173 barcode sequences for B. tabaci, most from India, to better understand regional whitefly diversity. The Barcode Index Number (BIN) System assigned the 766 sequences to 15 BINs, including nine from Pakistan. Representative specimens of each Pakistan BIN were analyzed for mtCOI-3' to allow their assignment to one of the putative species in the B. tabaci complex recognized on the basis of sequence variation in this gene region. This analysis revealed the presence of Asia II 1, Middle East-Asia Minor 1, Asia 1, Asia II 5, Asia II 7, and a new lineage "Pakistan". The first two taxa were found in both Punjab and Sindh, but Asia 1 was only detected in Sindh, while Asia II 5, Asia II 7 and "Pakistan" were only present in Punjab. The haplotype networks showed that most haplotypes of Asia II 1, a species implicated in transmission of the cotton leaf curl virus, occurred in both India and Pakistan. DNA barcodes successfully discriminated cryptic species in B. tabaci complex. The dominant haplotypes in the B. tabaci complex were shared by India and Pakistan. Asia II 1 was previously restricted to Punjab, but is now the dominant lineage in southern Sindh; its southward spread may have serious implications for cotton plantations in this region.

  5. BuD, a helix–loop–helix DNA-binding domain for genome modification

    PubMed Central

    Stella, Stefano; Molina, Rafael; López-Méndez, Blanca; Juillerat, Alexandre; Bertonati, Claudia; Daboussi, Fayza; Campos-Olivas, Ramon; Duchateau, Phillippe; Montoya, Guillermo

    2014-01-01

    DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-binding domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing. PMID:25004980

  6. Shotgun protein sequencing: assembly of peptide tandem mass spectra from mixtures of modified proteins.

    PubMed

    Bandeira, Nuno; Clauser, Karl R; Pevzner, Pavel A

    2007-07-01

    Despite significant advances in the identification of known proteins, the analysis of unknown proteins by MS/MS still remains a challenging open problem. Although Klaus Biemann recognized the potential of MS/MS for sequencing of unknown proteins in the 1980s, low throughput Edman degradation followed by cloning still remains the main method to sequence unknown proteins. The automated interpretation of MS/MS spectra has been limited by a focus on individual spectra and has not capitalized on the information contained in spectra of overlapping peptides. Indeed the powerful shotgun DNA sequencing strategies have not been extended to automated protein sequencing. We demonstrate, for the first time, the feasibility of automated shotgun protein sequencing of protein mixtures by utilizing MS/MS spectra of overlapping and possibly modified peptides generated via multiple proteases of different specificities. We validate this approach by generating highly accurate de novo reconstructions of multiple regions of various proteins in western diamondback rattlesnake venom. We further argue that shotgun protein sequencing has the potential to overcome the limitations of current protein sequencing approaches and thus catalyze the otherwise impractical applications of proteomics methodologies in studies of unknown proteins.

  7. Rapid rate of control-region evolution in Pacific butterflyfishes (Chaetodontidae).

    PubMed

    McMillan, W O; Palumbi, S R

    1997-11-01

    Sequence differences in the tRNA-proline (tRNApro) end of the mitochondrial control-region of three species of Pacific butterflyfishes accumulated 33-43 times more rapidly than did changes within the mitochondrial cytochrome b gene (cytb). Rapid evolution in this region was accompanied by strong transition/transversion bias and large variation in the probability of a DNA substitution among sites. These substitution constraints placed an absolute ceiling on the magnitude of sequence divergence that could be detected between individuals. This divergence "ceiling" was reached rapidly and led to a decay in the relative rate of control-region/cytb b evolution. A high rate of evolution in this section of the control-region of butterflyfishes stands in marked contrast to the patterns reported in some other fish lineages. Although the mechanism underlying rate variation remains unclear, all taxa with rapid evolution in the 5'-end of the control-region showed extreme transition biases. By contrast, in taxa with slower control-region evolution, transitions accumulated at nearly the same rate as transversions. More information is needed to understand the relationship between nucleotide bias and the rate of evolution in the 5'-end of the control-region. Despite strong constraints on sequence change, phylogenetic information was preserved in the group of recently differentiated species and supported the clustering of sequences into three major mtDNA groupings. Within these groups, very similar control-region sequences were widely distributed across the Pacific Ocean and were shared between recognized species, indicating a lack of mitochondrial sequence monophyly among species.

  8. A Gibbs sampler for motif detection in phylogenetically close sequences

    NASA Astrophysics Data System (ADS)

    Siddharthan, Rahul; van Nimwegen, Erik; Siggia, Eric

    2004-03-01

    Genes are regulated by transcription factors that bind to DNA upstream of genes and recognize short conserved ``motifs'' in a random intergenic ``background''. Motif-finders such as the Gibbs sampler compare the probability of these short sequences being represented by ``weight matrices'' to the probability of their arising from the background ``null model'', and explore this space (analogous to a free-energy landscape). But closely related species may show conservation not because of functional sites but simply because they have not had sufficient time to diverge, so conventional methods will fail. We introduce a new Gibbs sampler algorithm that accounts for common ancestry when searching for motifs, while requiring minimal ``prior'' assumptions on the number and types of motifs, assessing the significance of detected motifs by ``tracking'' clusters that stay together. We apply this scheme to motif detection in sporulation-cycle genes in the yeast S. cerevisiae, using recent sequences of other closely-related Saccharomyces species.

  9. The 86-kilodalton antigen from Schistosoma mansoni is a heat-shock protein homologous to yeast HSP-90.

    PubMed

    Johnson, K S; Wells, K; Bock, J V; Nene, V; Taylor, D W; Cordingley, J S

    1989-08-01

    We report the sequence of a cDNA clone encoding an 86-kDa polypeptide antigen (p86) from Schistosoma mansoni. Fusion proteins made in Escherichia coli are recognized by human infection sera. The reading frame of this antigen is highly homologous to those of the large heat-shock proteins of Saccharomyces cerevisiae (HSP90) and Drosophila melanogaster (HSP83). mRNA encoding p86 increases in response to heat shock of adult worms, as does HSP70. Comparisons of the sequences of HSP70 and HSP83 homologues show that these two families of heat-shock proteins are not significantly related except for the last four amino acid residues, which are Glu-Glu-Val-Asp in every case. This sequence is not found at the carboxy terminus of any other protein in the current databases.

  10. Molecular and Physiological Analysis of a Heat-Shock Response in Wheat 1

    PubMed Central

    McElwain, Elizabeth F.; Spiker, Steven

    1992-01-01

    We have isolated two cDNA clones from wheat (Triticum aestivum L. var Stephens), designated WHSP16.8 and WHSP16.9, that are highly similar in sequence to the low molecular weight heat-shock protein genes previously isolated from soybean. RNA blot analysis confirms that these sequences are present in heat-shocked wheat seedlings, but not in control tissues. The WHSP16.8 and WHSP16.9 cDNAs were isolated by screening a lambda gt11 expression library with antibodies to HMGc (a chromosomal protein of wheat). Immunoblot analysis has demonstrated that the antibodies raised against HMGc also recognize a group of proteins that are induced by heat shock and have molecular weights (estimated by sodium dodecyl sulfate electrophoresis) consistent with the molecular weights of the proteins deduced from the sequences of the cDNAs. ImagesFigure 3Figure 4Figure 5 PMID:16669058

  11. Roseovarius aestuarii sp. nov., isolated from a tidal flat of the Yellow Sea in Korea.

    PubMed

    Yoon, Jung-Hoon; Kang, So-Jung; Oh, Tae-Kwang

    2008-05-01

    A Gram-negative, motile, ovoid to rod-shaped bacterial strain, designated strain SMK-122T, was isolated from a Yellow Sea tidal flat located on the coast of Korea. Strain SMK-122T grew optimally at pH 7.0-8.0 and 30 degrees C. It contained Q-10 as the predominant ubiquinone and possessed C18 : 1omega7c and C16 : 0 as the major fatty acids. The DNA G+C content was 58.6 mol%. A phylogenetic analysis based on 16S rRNA gene sequences showed that strain SMK-122T fell within the genus Roseovarius, being closest to Roseovarius nubinhibens ISM(T); the sequence similarities with respect to Roseovarius species ranged from 94.9 to 97.3 %. The mean value for DNA-DNA relatedness between strain SMK-122T and Rva. nubinhibens DSM 15170T was 13 %. Differential phenotypic properties of SMK-122T, together with its phylogenetic and genetic distinctiveness, revealed that this strain is distinct from recognized Roseovarius species. On this basis, strain SMK-122T represents a novel species of the genus Roseovarius, for which the name Roseovarius aestuarii sp. nov. is proposed. The type strain is SMK-122T (=KCTC 22174T =CCUG 55325T).

  12. Phylogeography of Canada Geese (Branta canadensis) in western North America

    USGS Publications Warehouse

    Scribner, K.T.; Talbot, S.L.; Pearce, J.M.; Pierson, Barbara J.; Bollinger, K.S.; Derksen, D.V.

    2003-01-01

    Using molecular genetic markers that differ in mode of inheritance and rate of evolution, we examined levels and partitioning of genetic variation for seven nominal subspecies (11 breeding populations) of Canada Geese (Branta canadensis) in western North America. Gene trees constructed from mtDNA control region sequence data show that subspecies of Canada Geese do not have distinct mtDNA. Large- and small-bodied forms of Canada Geese were highly diverged (0. 077 average sequence divergence) and represent monophyletic groups. A majority (65%) of 20 haplotypes resolved were observed in single breeding locales. However, within both large- and small-bodied forms certain haplotypes occurred across multiple subspecies. Population trees for both nuclear (microsatellites) and mitochondrial markers were generally concordant and provide resolution of population and subspecific relationships indicating incomplete lineage sorting. All populations and subspecies were genetically diverged, but to varying degrees. Analyses of molecular variance, nested-clade and coalescence-based analyses of mtDNA suggest that both historical (past fragmentation) and contemporary forces have been important in shaping current spatial genetic distributions. Gene flow appears to be ongoing though at different rates, even among currently recognized subspecies. The efficacy of current subspecific taxonomy is discussed in light of hypothesized historical vicariance and current demographic trends of management and conservation concern.

  13. Molecular identification of Taenia spp. in the Eurasian lynx (Lynx lynx) from Finland.

    PubMed

    Lavikainen, A; Haukisalmi, V; Deksne, G; Holmala, K; Lejeune, M; Isomursu, M; Jokelainen, P; Näreaho, A; Laakkonen, J; Hoberg, E P; Sukura, A

    2013-04-01

    Cestodes of the genus Taenia are parasites of mammals, with mainly carnivores as definitive and herbivores as intermediate hosts. Various medium-sized cats, Lynx spp., are involved in the life cycles of several species of Taenia. The aim of the present study was to identify Taenia tapeworms in the Eurasian lynx (Lynx lynx) from Finland. In total, 135 tapeworms from 72 lynx were subjected to molecular identification based on sequences of 2 mtDNA regions, the cytochrome c oxidase subunit 1 and the NADH dehydrogenase subunit 1 genes. Available morphological characters of the rostellar hooks and strobila were compared. Two species of Taenia were found: T. laticollis (127 samples) and an unknown Taenia sp. (5 samples). The latter could not be identified to species based on mtDNA, and the rostellar hooks were short relative to those described among other Taenia spp. recorded in felids from the Holarctic region. In the phylogenetic analyses of mtDNA sequences, T. laticollis was placed as a sister species of T. macrocystis, and the unknown Taenia sp. was closely related to T. hydatigena and T. regis. Our analyses suggest that these distinct taeniid tapeworms represent a putative new species of Taenia. The only currently recognized definitive host is L. lynx and the intermediate host is unknown.

  14. Nuclear and cpDNA sequences combined provide strong inference of higher phylogenetic relationships in the phlox family (Polemoniaceae).

    PubMed

    Johnson, Leigh A; Chan, Lauren M; Weese, Terri L; Busby, Lisa D; McMurry, Samuel

    2008-09-01

    Members of the phlox family (Polemoniaceae) serve as useful models for studying various evolutionary and biological processes. Despite its biological importance, no family-wide phylogenetic estimate based on multiple DNA regions with complete generic sampling is available. Here, we analyze one nuclear and five chloroplast DNA sequence regions (nuclear ITS, chloroplast matK, trnL intron plus trnL-trnF intergeneric spacer, and the trnS-trnG, trnD-trnT, and psbM-trnD intergenic spacers) using parsimony and Bayesian methods, as well as assessments of congruence and long branch attraction, to explore phylogenetic relationships among 84 ingroup species representing all currently recognized Polemoniaceae genera. Relationships inferred from the ITS and concatenated chloroplast regions are similar overall. A combined analysis provides strong support for the monophyly of Polemoniaceae and subfamilies Acanthogilioideae, Cobaeoideae, and Polemonioideae. Relationships among subfamilies, and thus for the precise root of Polemoniaceae, remain poorly supported. Within the largest subfamily, Polemonioideae, four clades corresponding to tribes Polemonieae, Phlocideae, Gilieae, and Loeselieae receive strong support. The monogeneric Polemonieae appears sister to Phlocideae. Relationships within Polemonieae, Phlocideae, and Gilieae are mostly consistent between analyses and data permutations. Many relationships within Loeselieae remain uncertain. Overall, inferred phylogenetic relationships support a higher-level classification for Polemoniaceae proposed in 2000.

  15. Engineering synthetic TAL effectors with orthogonal target sites

    PubMed Central

    Garg, Abhishek; Lohmueller, Jason J.; Silver, Pamela A.; Armel, Thomas Z.

    2012-01-01

    The ability to engineer biological circuits that process and respond to complex cellular signals has the potential to impact many areas of biology and medicine. Transcriptional activator-like effectors (TALEs) have emerged as an attractive component for engineering these circuits, as TALEs can be designed de novo to target a given DNA sequence. Currently, however, the use of TALEs is limited by degeneracy in the site-specific manner by which they recognize DNA. Here, we propose an algorithm to computationally address this problem. We apply our algorithm to design 180 TALEs targeting 20 bp cognate binding sites that are at least 3 nt mismatches away from all 20 bp sequences in putative 2 kb human promoter regions. We generated eight of these synthetic TALE activators and showed that each is able to activate transcription from a targeted reporter. Importantly, we show that these proteins do not activate synthetic reporters containing mismatches similar to those present in the genome nor a set of endogenous genes predicted to be the most likely targets in vivo. Finally, we generated and characterized TALE repressors comprised of our orthogonal DNA binding domains and further combined them with shRNAs to accomplish near complete repression of target gene expression. PMID:22581776

  16. Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

    NASA Astrophysics Data System (ADS)

    Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

    2017-07-01

    DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.

  17. Detection of a typhus group Rickettsia in Amblyomma ticks in the state of Nuevo Leon, Mexico.

    PubMed

    Medina-Sanchez, Aaron; Bouyer, Donald H; Alcantara-Rodriguez, Virginia; Mafra, Claudio; Zavala-Castro, Jorge; Whitworth, Ted; Popov, Vsevolod L; Fernandez-Salas, Ildefonso; Walker, David H

    2005-12-01

    The state of Nuevo Leon, Mexico has had outbreaks of typhus group rickettsiosis, most recently recognized in 1997. Evaluation of the sera of 345 patients with a dengue-like illness revealed that 25.5% had antibodies reactive with typhus group rickettsiae and 16% had antibodies to Rickettsia parkeri. Rickettsiae were detected by PCR and shell-vial isolations in the field-collected Amblyomma ticks. Molecular characterization by DNA sequence analysis of the gltA, ompB, and 17-kDa gene identified the organisms to be R. prowazekii.

  18. The colocalization transition of homologous chromosomes at meiosis

    NASA Astrophysics Data System (ADS)

    Nicodemi, Mario; Panning, Barbara; Prisco, Antonella

    2008-06-01

    Meiosis is the specialized cell division required in sexual reproduction. During its early stages, in the mother cell nucleus, homologous chromosomes recognize each other and colocalize in a crucial step that remains one of the most mysterious of meiosis. Starting from recent discoveries on the system molecular components and interactions, we discuss a statistical mechanics model of chromosome early pairing. Binding molecules mediate long-distance interaction of special DNA recognition sequences and, if their concentration exceeds a critical threshold, they induce a spontaneous colocalization transition of chromosomes, otherwise independently diffusing.

  19. Mycobacterial lesions in fish, amphibians, reptiles, rodents, lagomorphs, and ferrets with reference to animal models.

    PubMed

    Reavill, Drury R; Schmidt, Robert E

    2012-01-01

    Mycobacteriosis is a serious disease across many animal species. Approximately more than 120 species are currently recognized in the genus Mycobacterium. This article describes the zoonotic potential of mycobacteria and mycobacteriosis in fish, amphibians, rodents, rabbits, and ferrets. It considers clinical signs; histology; molecular methods of identification, such as polymerase chain reaction and DNA sequencing; routes of infection; and disease progression. Studying the disease in animals may aid in understanding the pathogenesis of mycobacterial infections in humans and identify better therapy and preventative options such as vaccines.

  20. [Screening specific recognition motif of RNA-binding proteins by SELEX in combination with next-generation sequencing technique].

    PubMed

    Zhang, Lu; Xu, Jinhao; Ma, Jinbiao

    2016-07-25

    RNA-binding protein exerts important biological function by specifically recognizing RNA motif. SELEX (Systematic evolution of ligands by exponential enrichment), an in vitro selection method, can obtain consensus motif with high-affinity and specificity for many target molecules from DNA or RNA libraries. Here, we combined SELEX with next-generation sequencing to study the protein-RNA interaction in vitro. A pool of RNAs with 20 bp random sequences were transcribed by T7 promoter, and target protein was inserted into plasmid containing SBP-tag, which can be captured by streptavidin beads. Through only one cycle, the specific RNA motif can be obtained, which dramatically improved the selection efficiency. Using this method, we found that human hnRNP A1 RRMs domain (UP1 domain) bound RNA motifs containing AGG and AG sequences. The EMSA experiment indicated that hnRNP A1 RRMs could bind the obtained RNA motif. Taken together, this method provides a rapid and effective method to study the RNA binding specificity of proteins.

  1. MToolBox: a highly automated pipeline for heteroplasmy annotation and prioritization analysis of human mitochondrial variants in high-throughput sequencing

    PubMed Central

    Diroma, Maria Angela; Santorsola, Mariangela; Guttà, Cristiano; Gasparre, Giuseppe; Picardi, Ernesto; Pesole, Graziano; Attimonelli, Marcella

    2014-01-01

    Motivation: The increasing availability of mitochondria-targeted and off-target sequencing data in whole-exome and whole-genome sequencing studies (WXS and WGS) has risen the demand of effective pipelines to accurately measure heteroplasmy and to easily recognize the most functionally important mitochondrial variants among a huge number of candidates. To this purpose, we developed MToolBox, a highly automated pipeline to reconstruct and analyze human mitochondrial DNA from high-throughput sequencing data. Results: MToolBox implements an effective computational strategy for mitochondrial genomes assembling and haplogroup assignment also including a prioritization analysis of detected variants. MToolBox provides a Variant Call Format file featuring, for the first time, allele-specific heteroplasmy and annotation files with prioritized variants. MToolBox was tested on simulated samples and applied on 1000 Genomes WXS datasets. Availability and implementation: MToolBox package is available at https://sourceforge.net/projects/mtoolbox/. Contact: marcella.attimonelli@uniba.it Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25028726

  2. CNV-seq, a new method to detect copy number variation using high-throughput sequencing.

    PubMed

    Xie, Chao; Tammi, Martti T

    2009-03-06

    DNA copy number variation (CNV) has been recognized as an important source of genetic variation. Array comparative genomic hybridization (aCGH) is commonly used for CNV detection, but the microarray platform has a number of inherent limitations. Here, we describe a method to detect copy number variation using shotgun sequencing, CNV-seq. The method is based on a robust statistical model that describes the complete analysis procedure and allows the computation of essential confidence values for detection of CNV. Our results show that the number of reads, not the length of the reads is the key factor determining the resolution of detection. This favors the next-generation sequencing methods that rapidly produce large amount of short reads. Simulation of various sequencing methods with coverage between 0.1x to 8x show overall specificity between 91.7 - 99.9%, and sensitivity between 72.2 - 96.5%. We also show the results for assessment of CNV between two individual human genomes.

  3. Analysis of xylem formation in pine by cDNA sequencing

    NASA Technical Reports Server (NTRS)

    Allona, I.; Quinn, M.; Shoop, E.; Swope, K.; St Cyr, S.; Carlis, J.; Riedl, J.; Retzel, E.; Campbell, M. M.; Sederoff, R.; hide

    1998-01-01

    Secondary xylem (wood) formation is likely to involve some genes expressed rarely or not at all in herbaceous plants. Moreover, environmental and developmental stimuli influence secondary xylem differentiation, producing morphological and chemical changes in wood. To increase our understanding of xylem formation, and to provide material for comparative analysis of gymnosperm and angiosperm sequences, ESTs were obtained from immature xylem of loblolly pine (Pinus taeda L.). A total of 1,097 single-pass sequences were obtained from 5' ends of cDNAs made from gravistimulated tissue from bent trees. Cluster analysis detected 107 groups of similar sequences, ranging in size from 2 to 20 sequences. A total of 361 sequences fell into these groups, whereas 736 sequences were unique. About 55% of the pine EST sequences show similarity to previously described sequences in public databases. About 10% of the recognized genes encode factors involved in cell wall formation. Sequences similar to cell wall proteins, most known lignin biosynthetic enzymes, and several enzymes of carbohydrate metabolism were found. A number of putative regulatory proteins also are represented. Expression patterns of several of these genes were studied in various tissues and organs of pine. Sequencing novel genes expressed during xylem formation will provide a powerful means of identifying mechanisms controlling this important differentiation pathway.

  4. Large-Scale Concatenation cDNA Sequencing

    PubMed Central

    Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

    1997-01-01

    A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174

  5. Synthesis of DNA

    DOEpatents

    Mariella, Jr., Raymond P.

    2008-11-18

    A method of synthesizing a desired double-stranded DNA of a predetermined length and of a predetermined sequence. Preselected sequence segments that will complete the desired double-stranded DNA are determined. Preselected segment sequences of DNA that will be used to complete the desired double-stranded DNA are provided. The preselected segment sequences of DNA are assembled to produce the desired double-stranded DNA.

  6. Roseomonas tokyonensis sp. nov. isolated from a biofilm sample obtained from a cooling tower in Tokyo, Japan.

    PubMed

    Furuhata, Katsunori; Ishizaki, Naoto; Edagawa, Akiko; Fukuyama, Masafumi

    2013-01-01

    Strain K-20(T), a Gram-negative, nonmotile, nonspore-forming and strictly aerobic coccobacillus, which produces a pale pink pigment (R2A agar medium, 30℃, seven days) was isolated from a sample of biofilm obtained from a cooling tower in Tokyo, Japan. A phylogenetic analysis of the 16S rRNA partial gene sequences (1,439 bp) showed that the strain (accession number: AB297501) was related to Roseomonas frigidaquae CW67(T) and Roseomonas stagni HS-69(T) with 97.4% and 96.9% sequence similarity, respectively. Strain K-20(T) formed a distinct cluster with Roseomonas frigidaquae CW67(T) in the phylogenetic tree at a high bootstrap value (93%); however, distance was recognized between the strains. In addition, the DNA-DNA hybridization level between strain K-20(T) and Roseomonas frigidaquae JCM 15073(T) was 33%. The taxonomic data indicate that K-20(T) (=JCM 14634(T) =KCTC 32152(T)) should be classified in the genus Roseomonas as the type strain of a novel species, Roseomonas tokyonensis sp. nov.

  7. Nanopore Technology: A Simple, Inexpensive, Futuristic Technology for DNA Sequencing.

    PubMed

    Gupta, P D

    2016-10-01

    In health care, importance of DNA sequencing has been fully established. Sanger's Capillary Electrophoresis DNA sequencing methodology is time consuming, cumbersome, hence become more expensive. Lately, because of its versatility DNA sequencing became house hold name, and therefore, there is an urgent need of simple, fast, inexpensive, DNA sequencing technology. In the beginning of this century efforts were made, and Nanopore DNA sequencing technology was developed; still it is infancy, nevertheless, it is the futuristic technology.

  8. The genome-wide DNA sequence specificity of the anti-tumour drug bleomycin in human cells.

    PubMed

    Murray, Vincent; Chen, Jon K; Tanaka, Mark M

    2016-07-01

    The cancer chemotherapeutic agent, bleomycin, cleaves DNA at specific sites. For the first time, the genome-wide DNA sequence specificity of bleomycin breakage was determined in human cells. Utilising Illumina next-generation DNA sequencing techniques, over 200 million bleomycin cleavage sites were examined to elucidate the bleomycin genome-wide DNA selectivity. The genome-wide bleomycin cleavage data were analysed by four different methods to determine the cellular DNA sequence specificity of bleomycin strand breakage. For the most highly cleaved DNA sequences, the preferred site of bleomycin breakage was at 5'-GT* dinucleotide sequences (where the asterisk indicates the bleomycin cleavage site), with lesser cleavage at 5'-GC* dinucleotides. This investigation also determined longer bleomycin cleavage sequences, with preferred cleavage at 5'-GT*A and 5'- TGT* trinucleotide sequences, and 5'-TGT*A tetranucleotides. For cellular DNA, the hexanucleotide DNA sequence 5'-RTGT*AY (where R is a purine and Y is a pyrimidine) was the most highly cleaved DNA sequence. It was striking that alternating purine-pyrimidine sequences were highly cleaved by bleomycin. The highest intensity cleavage sites in cellular and purified DNA were very similar although there were some minor differences. Statistical nucleotide frequency analysis indicated a G nucleotide was present at the -3 position (relative to the cleavage site) in cellular DNA but was absent in purified DNA.

  9. A major lineage of non-tailed dsDNA viruses as unrecognized killers of marine bacteria

    NASA Astrophysics Data System (ADS)

    Kauffman, Kathryn M.; Hussain, Fatima A.; Yang, Joy; Arevalo, Philip; Brown, Julia M.; Chang, William K.; Vaninsberghe, David; Elsherbini, Joseph; Sharma, Radhey S.; Cutler, Michael B.; Kelly, Libusha; Polz, Martin F.

    2018-02-01

    The most abundant viruses on Earth are thought to be double-stranded DNA (dsDNA) viruses that infect bacteria. However, tailed bacterial dsDNA viruses (Caudovirales), which dominate sequence and culture collections, are not representative of the environmental diversity of viruses. In fact, non-tailed viruses often dominate ocean samples numerically, raising the fundamental question of the nature of these viruses. Here we characterize a group of marine dsDNA non-tailed viruses with short 10-kb genomes isolated during a study that quantified the diversity of viruses infecting Vibrionaceae bacteria. These viruses, which we propose to name the Autolykiviridae, represent a novel family within the ancient lineage of double jelly roll (DJR) capsid viruses. Ecologically, members of the Autolykiviridae have a broad host range, killing on average 34 hosts in four Vibrio species, in contrast to tailed viruses which kill on average only two hosts in one species. Biochemical and physical characterization of autolykiviruses reveals multiple virion features that cause systematic loss of DJR viruses in sequencing and culture-based studies, and we describe simple procedural adjustments to recover them. We identify DJR viruses in the genomes of diverse major bacterial and archaeal phyla, and in marine water column and sediment metagenomes, and find that their diversity greatly exceeds the diversity that is currently captured by the three recognized families of such viruses. Overall, these data suggest that viruses of the non-tailed dsDNA DJR lineage are important but often overlooked predators of bacteria and archaea that impose fundamentally different predation and gene transfer regimes on microbial systems than on tailed viruses, which form the basis of all environmental models of bacteria-virus interactions.

  10. Molecular mechanisms of conformational specificity: A study of Hox in vivo target DNA binding specificities and the structure of a Ure2p mutation that affects fibril formation rates

    NASA Astrophysics Data System (ADS)

    Bauer, William Joseph, Jr.

    The fate of an individual cell, or even an entire organism, is often determined by minute, yet very specific differences in the conformation of a single protein species. Very often, proteins take on alternate folds or even side chain conformations to deal with different situations present within the cell. These differences can be as large as a whole domain or as subtle as the alteration of a single amino acid side chain. Yet, even these seemingly minor side chain conformational differences can determine the development of a cell type during differentiation or even dictate whether a cell will live or die. Two examples of situations where minor conformational differences within a specific protein could lead to major differences in the life cycle of a cell are described herein. The first example describes the variations seen in DNA conformations which can lead to slightly different Hox protein binding conformations responsible for recognizing biologically relevant regulatory sites. These specific differences occur in the minor groove of the bound DNA and are limited to the conformation of only two side chains. The conformation of the bound DNA, however, is not solely determined by the sequence of the DNA, as multiple sequences can result in the same DNA conformation. The second example takes place in the context of a yeast prion protein which contains a mutation that decreases the frequency at which fibrils form. While the specific interactions leading to this physiological change were not directly detected, it can be ascertained from the crystal structure that the structural changes are subtle and most likely involve another binding partner. In both cases, these conformational changes are very slight but have a profound effect on the downstream processes.

  11. A DNA barcode library for Germany's mayflies, stoneflies and caddisflies (Ephemeroptera, Plecoptera and Trichoptera).

    PubMed

    Morinière, Jérôme; Hendrich, Lars; Balke, Michael; Beermann, Arne J; König, Tobias; Hess, Monika; Koch, Stefan; Müller, Reinhard; Leese, Florian; Hebert, Paul D N; Hausmann, Axel; Schubart, Christoph D; Haszprunar, Gerhard

    2017-11-01

    Mayflies, stoneflies and caddisflies (Ephemeroptera, Plecoptera and Trichoptera) are prominent representatives of aquatic macroinvertebrates, commonly used as indicator organisms for water quality and ecosystem assessments. However, unambiguous morphological identification of EPT species, especially their immature life stages, is a challenging, yet fundamental task. A comprehensive DNA barcode library based upon taxonomically well-curated specimens is needed to overcome the problematic identification. Once available, this library will support the implementation of fast, cost-efficient and reliable DNA-based identifications and assessments of ecological status. This study represents a major step towards a DNA barcode reference library as it covers for two-thirds of Germany's EPT species including 2,613 individuals belonging to 363 identified species. As such, it provides coverage for 38 of 44 families (86%) and practically all major bioindicator species. DNA barcode compliant sequences (≥500 bp) were recovered from 98.74% of the analysed specimens. Whereas most species (325, i.e., 89.53%) were unambiguously assigned to a single Barcode Index Number (BIN) by its COI sequence, 38 species (18 Ephemeroptera, nine Plecoptera and 11 Trichoptera) were assigned to a total of 89 BINs. Most of these additional BINs formed nearest neighbour clusters, reflecting the discrimination of geographical subclades of a currently recognized species. BIN sharing was uncommon, involving only two species pairs of Ephemeroptera. Interestingly, both maximum pairwise and nearest neighbour distances were substantially higher for Ephemeroptera compared to Plecoptera and Trichoptera, possibly indicating older speciation events, stronger positive selection or faster rate of molecular evolution. © 2017 John Wiley & Sons Ltd.

  12. Intrinsic DNA curvature in trypanosomes.

    PubMed

    Smircich, Pablo; El-Sayed, Najib M; Garat, Beatriz

    2017-11-09

    Trypanosoma cruzi and Trypanosoma brucei are protozoan parasites causing Chagas disease and African sleeping sickness, displaying unique features of cellular and molecular biology. Remarkably, no canonical signals for RNA polymerase II promoters, which drive protein coding genes transcription, have been identified so far. The secondary structure of DNA has long been recognized as a signal in biological processes and more recently, its involvement in transcription initiation in Leishmania was proposed. In order to study whether this feature is conserved in trypanosomatids, we undertook a genome wide search for intrinsic DNA curvature in T. cruzi and T. brucei. Using a region integrated intrinsic curvature (RIIC) scoring that we previously developed, a non-random distribution of sequence-dependent curvature was observed. High RIIC scores were found to be significantly correlated with transcription start sites in T. cruzi, which have been mapped in divergent switch regions, whereas in T. brucei, the high RIIC scores correlated with sites that have been involved not only in RNA polymerase II initiation but also in termination. In addition, we observed regions with high RIIC score presenting in-phase tracts of Adenines, in the subtelomeric regions of the T. brucei chromosomes that harbor the variable surface glycoproteins genes. In both T. cruzi and T. brucei genomes, a link between DNA conformational signals and gene expression was found. High sequence dependent curvature is associated with transcriptional regulation regions. High intrinsic curvature also occurs at the T. brucei chromosome subtelomeric regions where the recombination processes involved in the evasion of the immune host system take place. These findings underscore the relevance of indirect DNA readout in these ancient eukaryotes.

  13. TALE: a tale of genome editing.

    PubMed

    Zhang, Mingjie; Wang, Feng; Li, Shifei; Wang, Yan; Bai, Yun; Xu, Xueqing

    2014-01-01

    Transcription activator-like effectors (TALEs), first identified in Xanthomonas bacteria, are naturally occurring or artificially designed proteins that modulate gene transcription. These proteins recognize and bind DNA sequences based on a variable numbers of tandem repeats. Each repeat is comprised of a set of ∼ 34 conserved amino acids; within this conserved domain, there are usually two amino acids that distinguish one TALE from another. Interestingly, TALEs have revealed a simple cipher for the one-to-one recognition of proteins for DNA bases. Synthetic TALEs have been used to successfully target genes in a variety of species, including humans. Depending on the type of functional domain that is fused to the TALE of interest, these proteins can have diverse biological effects. For example, after binding DNA, TALEs fused to transcriptional activation domains can function as robust transcription factors (TALE-TFs), while fused to restriction endonucleases (TALENs) can cut DNA. Targeted genome editing, in theory, is capable of modifying any endogenous gene sequence of interest; this can be performed in cells or organisms, and may be applied to clinical gene-based therapies in the future. With current technologies, highly accurate, specific, and reliable gene editing cannot be achieved. Thus, recognition and binding mechanisms governing TALE biology are currently hot research areas. In this review, we summarize the major advances in TALE technology over the past several years with a focus on the interaction between TALEs and DNA, TALE design and construction, potential applications for this technology, and unique characteristics that make TALEs superior to zinc finger endonucleases. Copyright © 2013 Elsevier Ltd. All rights reserved.

  14. Ancestor of land plants acquired the DNA-3-methyladenine glycosylase (MAG) gene from bacteria through horizontal gene transfer.

    PubMed

    Fang, Huimin; Huangfu, Liexiang; Chen, Rujia; Li, Pengcheng; Xu, Shuhui; Zhang, Enying; Cao, Wei; Liu, Li; Yao, Youli; Liang, Guohua; Xu, Chenwu; Zhou, Yong; Yang, Zefeng

    2017-08-24

    The origin and evolution of land plants was an important event in the history of life and initiated the establishment of modern terrestrial ecosystems. From water to terrestrial environments, plants needed to overcome the enhanced ultraviolet (UV) radiation and many other DNA-damaging agents. Evolving new genes with the function of DNA repair is critical for the origin and radiation of land plants. In bacteria, the DNA-3-methyladenine glycosylase (MAG) recognizes of a variety of base lesions and initiates the process of the base excision repair for damaged DNA. The homologs of MAG gene are present in all major lineages of streptophytes, and both the phylogenic and sequence similarity analyses revealed that green plant MAG gene originated through an ancient horizontal gene transfer (HGT) event from bacteria. Experimental evidence demonstrated that the expression of the maize ZmMAG gene was induced by UV and zeocin, both of which are known as DNA-damaging agents. Further investigation revealed that Streptophyta MAG genes had undergone positive selection during the initial evolutionary period in the ancestor of land plants. Our findings demonstrated that the ancient HGT of MAG to the ancestor of land plants probably played an important role in preadaptation to DNA-damaging agents in terrestrial environments.

  15. Microsatellite marker development by partial sequencing of the sour passion fruit genome (Passiflora edulis Sims).

    PubMed

    Araya, Susan; Martins, Alexandre M; Junqueira, Nilton T V; Costa, Ana Maria; Faleiro, Fábio G; Ferreira, Márcio E

    2017-07-21

    The Passiflora genus comprises hundreds of wild and cultivated species of passion fruit used for food, industrial, ornamental and medicinal purposes. Efforts to develop genomic tools for genetic analysis of P. edulis, the most important commercial Passiflora species, are still incipient. In spite of many recognized applications of microsatellite markers in genetics and breeding, their availability for passion fruit research remains restricted. Microsatellite markers in P. edulis are usually limited in number, show reduced polymorphism, and are mostly based on compound or imperfect repeats. Furthermore, they are confined to only a few Passiflora species. We describe the use of NGS technology to partially assemble the P. edulis genome in order to develop hundreds of new microsatellite markers. A total of 14.11 Gbp of Illumina paired-end sequence reads were analyzed to detect simple sequence repeat sites in the sour passion fruit genome. A sample of 1300 contigs containing perfect repeat microsatellite sequences was selected for PCR primer development. Panels of di- and tri-nucleotide repeat markers were then tested in P. edulis germplasm accessions for validation. DNA polymorphism was detected in 74% of the markers (PIC = 0.16 to 0.77; number of alleles/locus = 2 to 7). A core panel of highly polymorphic markers (PIC = 0.46 to 0.77) was used to cross-amplify PCR products in 79 species of Passiflora (including P. edulis), belonging to four subgenera (Astrophea, Decaloba, Distephana and Passiflora). Approximately 71% of the marker/species combinations resulted in positive amplicons in all species tested. DNA polymorphism was detected in germplasm accessions of six closely related Passiflora species (P. edulis, P. alata, P. maliformis, P. nitida, P. quadrangularis and P. setacea) and the data used for accession discrimination and species assignment. A database of P. edulis DNA sequences obtained by NGS technology was examined to identify microsatellite repeats in the sour passion fruit genome. Markers were submitted to evaluation using accessions of cultivated and wild Passiflora species. The new microsatellite markers detected high levels of DNA polymorphism in sour passion fruit and can potentially be used in genetic analysis of P. edulis and other Passiflora species.

  16. Altererythrobacter xiamenensis sp. nov., an algicidal bacterium isolated from red tide seawater.

    PubMed

    Lei, Xueqian; Li, Yi; Chen, Zhangran; Zheng, Wei; Lai, Qiliang; Zhang, Huajun; Guan, Chengwei; Cai, Guanjing; Yang, Xujun; Tian, Yun; Zheng, Tianling

    2014-02-01

    A Gram-stain-negative, yellow-pigmented, aerobic bacterial strain, designated LY02(T), was isolated from red tide seawater in Xiamen, Fujian Province, China. Growth was observed at temperatures from 4 to 44 °C, at salinities from 0 to 9% and at pH from 6 to 10. Phylogenetic analysis based on 16S rRNA gene sequencing revealed that the isolate was a member of the genus Altererythrobacter, which belongs to the family Erythrobacteraceae. Strain LY02(T) was related most closely to Altererythrobacter marensis MSW-14(T) (97.2% 16S rRNA gene sequence similarity), followed by Altererythrobacter ishigakiensis JPCCMB0017(T) (97.1%), Altererythrobacter epoxidivorans JCS350(T) (97.1%) and Altererythrobacter luteolus SW-109(T) (97.0%). The dominant fatty acids were C(18 : 1)ω7c, C(17 : 1)ω6c and summed feature 3 (comprising C(16 : 1)ω7c and/or C(16 : 1)ω6c). DNA-DNA hybridization showed that strain LY02(T) possessed low DNA-DNA relatedness to A. marensis MSW-14(T), A. ishigakiensis JPCCMB0017(T), A. epoxidivorans JCS350(T) and A. luteolus SW-109(T) (mean ± SD of 33.2 ± 1.3, 32.1 ± 1.0, 26.7 ± 0.7 and 25.2 ± 1.1 %, respectively). The G+C content of the chromosomal DNA was 61.2 mol%. The predominant respiratory quinone was ubiquinone-10 (Q-10). According to its morphology, physiology, fatty acid composition and 16S rRNA gene sequence data, the novel strain most appropriately belongs to the genus Altererythrobacter, but can readily be distinguished from recognized species. The name Altererythrobacter xiamenensis sp. nov. is proposed (type strain LY02(T) = CGMCC 1.12494(T) = KCTC 32398(T) = NBRC 109638(T)).

  17. Structures of apo IRF-3 and IRF-7 DNA binding domains: effect of loop L1 on DNA binding

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    De Ioannes, Pablo; Escalante, Carlos R.; Aggarwal, Aneel K.

    2013-11-20

    Interferon regulatory factors IRF-3 and IRF-7 are transcription factors essential in the activation of interferon-{beta} (IFN-{beta}) gene in response to viral infections. Although, both proteins recognize the same consensus IRF binding site AANNGAAA, they have distinct DNA binding preferences for sites in vivo. The X-ray structures of IRF-3 and IRF-7 DNA binding domains (DBDs) bound to IFN-{beta} promoter elements revealed flexibility in the loops (L1-L3) and the residues that make contacts with the target sequence. To characterize the conformational changes that occur on DNA binding and how they differ between IRF family members, we have solved the X-ray structures ofmore » IRF-3 and IRF-7 DBDs in the absence of DNA. We found that loop L1, carrying the conserved histidine that interacts with the DNA minor groove, is disordered in apo IRF-3 but is ordered in apo IRF-7. This is reflected in differences in DNA binding affinities when the conserved histidine in loop L1 is mutated to alanine in the two proteins. The stability of loop L1 in IRF-7 derives from a unique combination of hydrophobic residues that pack against the protein core. Together, our data show that differences in flexibility of loop L1 are an important determinant of differential IRF-DNA binding.« less

  18. TALE-PvuII Fusion Proteins – Novel Tools for Gene Targeting

    PubMed Central

    Yanik, Mert; Alzubi, Jamal; Lahaye, Thomas; Cathomen, Toni; Pingoud, Alfred; Wende, Wolfgang

    2013-01-01

    Zinc finger nucleases (ZFNs) consist of zinc fingers as DNA-binding module and the non-specific DNA-cleavage domain of the restriction endonuclease FokI as DNA-cleavage module. This architecture is also used by TALE nucleases (TALENs), in which the DNA-binding modules of the ZFNs have been replaced by DNA-binding domains based on transcription activator like effector (TALE) proteins. Both TALENs and ZFNs are programmable nucleases which rely on the dimerization of FokI to induce double-strand DNA cleavage at the target site after recognition of the target DNA by the respective DNA-binding module. TALENs seem to have an advantage over ZFNs, as the assembly of TALE proteins is easier than that of ZFNs. Here, we present evidence that variant TALENs can be produced by replacing the catalytic domain of FokI with the restriction endonuclease PvuII. These fusion proteins recognize only the composite recognition site consisting of the target site of the TALE protein and the PvuII recognition sequence (addressed site), but not isolated TALE or PvuII recognition sites (unaddressed sites), even at high excess of protein over DNA and long incubation times. In vitro, their preference for an addressed over an unaddressed site is > 34,000-fold. Moreover, TALE-PvuII fusion proteins are active in cellula with minimal cytotoxicity. PMID:24349308

  19. RNAi drives nonreciprocal translocations at eroding chromosome ends to establish telomere-free linear chromosomes.

    PubMed

    Begnis, Martina; Apte, Manasi S; Masuda, Hirohisa; Jain, Devanshi; Wheeler, David Lee; Cooper, Julia Promisel

    2018-04-01

    The identification of telomerase-negative HAATI (heterochromatin amplification-mediated and telomerase-independent) cells, in which telomeres are superseded by nontelomeric heterochromatin tracts, challenged the idea that canonical telomeres are essential for chromosome linearity and raised crucial questions as to how such tracts translocate to eroding chromosome ends and confer end protection. Here we show that HAATI arises when telomere loss triggers a newly recognized illegitimate translocation pathway that requires RNAi factors. While RNAi is necessary for the translocation events that mobilize ribosomal DNA (rDNA) tracts to all chromosome ends (forming "HAATI rDNA " chromosomes), it is dispensable for HAATI rDNA maintenance. Surprisingly, Dicer (Dcr1) plays a separate, RNAi-independent role in preventing formation of the rare HAATI subtype in which a different repetitive element (the subtelomeric element) replaces telomeres. Using genetics and fusions between shelterin components and rDNA-binding proteins, we mapped the mechanism by which rDNA loci engage crucial end protection factors-despite the absence of telomere repeats-and secure end protection. Sequence analysis of HAATI rDNA genomes allowed us to propose RNA and DNA polymerase template-switching models for the mechanism of RNAi-triggered rDNA translocations. Collectively, our results reveal unforeseen roles for noncoding RNAs (ncRNAs) in assembling a telomere-free chromosome end protection device. © 2018 Begnis et al.; Published by Cold Spring Harbor Laboratory Press.

  20. Noncoding transcripts in sense and antisense orientation regulate the epigenetic state of ribosomal RNA genes.

    PubMed

    Bierhoff, H; Schmitz, K; Maass, F; Ye, J; Grummt, I

    2010-01-01

    Alternative transcription of the same gene in sense and antisense orientation regulates expression of protein-coding genes. Here we show that noncoding RNA (ncRNA) in sense and antisense orientation also controls transcription of rRNA genes (rDNA). rDNA exists in two types of chromatin--a euchromatic conformation that is permissive to transcription and a heterochromatic conformation that is transcriptionally silent. Silencing of rDNA is mediated by NoRC, a chromatin-remodeling complex that triggers heterochromatin formation. NoRC function requires RNA that is complementary to the rDNA promoter (pRNA). pRNA forms a DNA:RNA triplex with a regulatory element in the rDNA promoter, and this triplex structure is recognized by DNMT3b. The results imply that triplex-mediated targeting of DNMT3b to specific sequences may be a common pathway in epigenetic regulation. We also show that rDNA is transcribed in antisense orientation. The level of antisense RNA (asRNA) is down-regulated in cancer cells and up-regulated in senescent cells. Ectopic asRNA triggers trimethylation of histone H4 at lysine 20 (H4K20me3), suggesting that antisense transcripts guide the histone methyltransferase Suv4-20 to rDNA. The results reveal that noncoding RNAs in sense and antisense orientation are important determinants of the epigenetic state of rDNA.

  1. Morphological characters and DNA barcoding of Syngnathus schlegeli in the coastal waters of China

    NASA Astrophysics Data System (ADS)

    Chen, Zhi; Zhang, Yan; Han, Zhiqiang; Song, Na; Gao, Tianxiang

    2018-03-01

    A Syngnathus species widely distributed in Chinese seas was permanently identified as Syngnathus acus by native ichthyologists, but the taxonomic description about this species was inadequate and lacking conclusively molecular evidence. To identify this species, 357 individuals of this species from the coastal waters of Dandong, Yantai, Qingdao and Zhoushan were collected and measured. Morphological results showed that these slender specimens were mainly brownish, usually mottled with pale. Standard length ranged from 117 mm to 213 mm with an average length of 180.3 mm. The above characters were consistent with S. schlegeli distributed in Japan but colored differently from and much smaller than typical S. acus reported in Europe. Thus, morphological studies revealed that this species was previously misidentified as S. acus and might be S. schlegeli in reality. In addition, a fragment of cytochrome oxidase subunit I ( COI) gene of mitochondrial DNA was also sequenced for species identification, and 15 COI sequences belonging to different Syngnathus species were also used for the molecular identification. COI sequences of our specimens had the minimum genetic distance from recognized S. schlegeli from Japan and clustered with it firstly. The phylogenetic analysis similarly suggested that the species previously identified as S. acus in the coastal waters of China was S. schlegeli actually.

  2. Localization of Action of the Is50-Encoded Transposase Protein

    PubMed Central

    Phadnis, Suhas H.; Sasakawa, Chihiro; Berg, Douglas E.

    1986-01-01

    The movement of the bacterial insertion sequence IS50 and of composite elements containing direct terminal repeats of IS50 involves the two ends of IS50, designated O (outside) and I (inside), which are weakly matched in DNA sequence, and an IS50 encoded protein, transposase, which recognizes the O and I ends and acts preferentially in cis. Previous data had suggested that, initially, transposase interacts preferentially with the O end sequence and then, in a second step, with either an O or an I end. To better understand the cis action of transposase and how IS50 ends are selected, we generated a series of composite transposons which contain direct repeats of IS50 elements. In each transposon, one IS50 element encoded transposase (tnp +), and the other contained a null (tnp-) allele. In each of the five sets of composite transposons studied, the transposon for which the tnp+ IS50 element contained its O end was more active than a complementary transposon for which the tnp - IS50 element contained its O end. This pattern of O end use suggests models in which the cis action of transposase and its choice of ends is determined by protein tracking along DNA molecules. PMID:3007274

  3. Sequence and Structure Dependent DNA-DNA Interactions

    NASA Astrophysics Data System (ADS)

    Kopchick, Benjamin; Qiu, Xiangyun

    Molecular forces between dsDNA strands are largely dominated by electrostatics and have been extensively studied. Quantitative knowledge has been accumulated on how DNA-DNA interactions are modulated by varied biological constituents such as ions, cationic ligands, and proteins. Despite its central role in biology, the sequence of DNA has not received substantial attention and ``random'' DNA sequences are typically used in biophysical studies. However, ~50% of human genome is composed of non-random-sequence DNAs, particularly repetitive sequences. Furthermore, covalent modifications of DNA such as methylation play key roles in gene functions. Such DNAs with specific sequences or modifications often take on structures other than the canonical B-form. Here we present series of quantitative measurements of the DNA-DNA forces with the osmotic stress method on different DNA sequences, from short repeats to the most frequent sequences in genome, and to modifications such as bromination and methylation. We observe peculiar behaviors that appear to be strongly correlated with the incurred structural changes. We speculate the causalities in terms of the differences in hydration shell and DNA surface structures.

  4. Corruption of genomic databases with anomalous sequence.

    PubMed

    Lamperti, E D; Kittelberger, J M; Smith, T F; Villa-Komaroff, L

    1992-06-11

    We describe evidence that DNA sequences from vectors used for cloning and sequencing have been incorporated accidentally into eukaryotic entries in the GenBank database. These incorporations were not restricted to one type of vector or to a single mechanism. Many minor instances may have been the result of simple editing errors, but some entries contained large blocks of vector sequence that had been incorporated by contamination or other accidents during cloning. Some cases involved unusual rearrangements and areas of vector distant from the normal insertion sites. Matches to vector were found in 0.23% of 20,000 sequences analyzed in GenBank Release 63. Although the possibility of anomalous sequence incorporation has been recognized since the inception of GenBank and should be easy to avoid, recent evidence suggests that this problem is increasing more quickly than the database itself. The presence of anomalous sequence may have serious consequences for the interpretation and use of database entries, and will have an impact on issues of database management. The incorporated vector fragments described here may also be useful for a crude estimate of the fidelity of sequence information in the database. In alignments with well-defined ends, the matching sequences showed 96.8% identity to vector; when poorer matches with arbitrary limits were included, the aggregate identity to vector sequence was 94.8%.

  5. Polyomavirus BK non-coding control region rearrangements in health and disease.

    PubMed

    Sharma, Preety M; Gupta, Gaurav; Vats, Abhay; Shapiro, Ron; Randhawa, Parmjeet S

    2007-08-01

    BK virus is an increasingly recognized pathogen in transplanted patients. DNA sequencing of this virus shows considerable genomic variability. To understand the clinical significance of rearrangements in the non-coding control region (NCCR) of BK virus (BKV), we report a meta-analysis of 507 sequences, including 40 sequences generated in our own laboratory, for associations between rearrangements and disease, tissue tropism, geographic origin, and viral genotype. NCCR rearrangements were less frequent in (a) asymptomatic BKV viruria compared to patients viral nephropathy (1.7% vs. 22.5%), and (b) viral genotype 1 compared to other genotypes (2.4% vs. 11.2%). Rearrangements were commoner in malignancy (78.6%), and Norwegians (45.7%), and less common in East Indians (0%), and Japanese (4.3%). A surprising number of rearranged sequences were reported from mononuclear cells of healthy subjects, whereas most plasma sequences were archetypal. This difference could not be related to potential recombinase activity in lymphocytes, as consensus recombination signal sequences could not be found in the NCCR region. NCCR rearrangements are neither required nor a sufficient condition to produce clinical disease. BKV nephropathy and hemorrhagic cystitis are not associated with any unique NCCR configuration or nucleotide sequence.

  6. Verification of Frequency in Species of Nontuberculous Mycobacteria in Kermanshah Drinking Water Supplies Using the PCR-Sequencing Method.

    PubMed

    Mohajeri, Parviz; Yazdani, Laya; Shahraki, Abdolrazagh Hashemi; Alvandi, Amirhoshang; Atashi, Sara; Farahani, Abbas; Almasi, Ali; Rezaei, Mansour

    2017-04-01

    Nontuberculous mycobacteria are habitants of environment, especially in aquatic systems. Some of them cause problems in immunodeficient patients. Over the last decade, 16S rRNA gene sequencing was established in 45 novel species of nontuberculous mycobacteria. Experiences revealed that this method underestimates the diversity, but does not distinguish between some of mycobacterium subsp. To recognize emerging rapidly growing mycobacteria and identify their subsp, rpoB gene sequencing has been developed. To better understand the transmission of nontuberculous mycobacterial species from drinking water and preventing the spread of illness with these bacteria, the aim of this study was to detect the presence of bacteria by PCR-sequencing techniques. Drinking water samples were collected from different areas of Kermanshah city in west of IRAN. After decontamination with cetylpyridinium chloride, samples were filtered with 0.45-micron filters, the filter transferred directly on growth medium waiting to appear in colonies, then DNA extraction and PCR were performed, and products were sent to sequencing. We found 35/110 (32%) nontuberculous mycobacterial species in drinking water samples, isolates included Mycobacterium goodii, Mycobacterium aurum, and Mycobacterium gastri with the most abundance (11.5%), followed by Mycobacterium smegmatis, Mycobacterium porcinum, Mycobacterium peregrinum, Mycobacterium mucogenicum, and Mycobacterium chelonae (8%). In this study, we recognized the evidence of contamination by nontuberculous mycobacteria in corroded water pipes. As a result of the high prevalence of these bacteria in drinking water in Kermanshah, this is important evidence of transmission through drinking water. This finding can also help public health policy makers control these isolates in drinking water supplies in Kermanshah.

  7. Conformational plasticity of RepB, the replication initiator protein of promiscuous streptococcal plasmid pMV158

    PubMed Central

    Boer, D. Roeland; Ruiz-Masó, José Angel; Rueda, Manuel; Petoukhov, Maxim V.; Machón, Cristina; Svergun, Dmitri I.; Orozco, Modesto; del Solar, Gloria; Coll, Miquel

    2016-01-01

    DNA replication initiation is a vital and tightly regulated step in all replicons and requires an initiator factor that specifically recognizes the DNA replication origin and starts replication. RepB from the promiscuous streptococcal plasmid pMV158 is a hexameric ring protein evolutionary related to viral initiators. Here we explore the conformational plasticity of the RepB hexamer by i) SAXS, ii) sedimentation experiments, iii) molecular simulations and iv) X-ray crystallography. Combining these techniques, we derive an estimate of the conformational ensemble in solution showing that the C-terminal oligomerisation domains of the protein form a rigid cylindrical scaffold to which the N-terminal DNA-binding/catalytic domains are attached as highly flexible appendages, featuring multiple orientations. In addition, we show that the hinge region connecting both domains plays a pivotal role in the observed plasticity. Sequence comparisons and a literature survey show that this hinge region could exists in other initiators, suggesting that it is a common, crucial structural element for DNA binding and manipulation. PMID:26875695

  8. Engineering a Cell-surface Aptamer Circuit for Targeted and Amplified Photodynamic Cancer Therapy

    PubMed Central

    Han, Da; Zhu, Guizhi; Wu, Cuichen; Zhu, Zhi; Chen, Tao; Zhang, Xiaobing

    2013-01-01

    Photodynamic therapy (PDT) is one of the most promising and noninvasive methods for clinical treatment of different malignant diseases. Here, we present a novel strategy of designing an aptamer-based DNA nanocircuit capable of the selective recognition of cancer cells, controllable activation of photosensitizer and amplification of photodynamic therapeutic effect. The aptamers can selectively recognize target cancer cells and bind to the specific proteins on cell membranes. Then the overhanging catalyst sequence on aptamer can trigger a toehold-mediated catalytic strand displacement to activate photosensitizer and achieve amplified therapeutic effect. The specific binding-induced activation allows the DNA circuit to distinguish diseased cells from healthy cells, reducing damage to nearby healthy cells. Moreover, the catalytic amplification reaction will only take place close to the target cancer cells, resulting in a high local concentration of singlet oxygen to selectively kill the target cells. The principle employed in this study demonstrated the feasibility of assembling a DNA circuit on cell membranes and could further broaden the utility of DNA circuits for applications in biology, biotechnology, and biomedicine. PMID:23397942

  9. Cytochrome C oxidase subunit I barcodes provide an efficient tool for Jinqian Baihua She (Bungarus parvus) authentication

    PubMed Central

    Chao, Zhi; Liao, Jing; Liang, Zhenbiao; Huang, Suhua; Zhang, Liang; Li, Junde

    2014-01-01

    Objective: To test the feasibility of DNA barcoding for accurate identification of Jinqian Baihua She and its adulterants. Materials and Methods: Standard cytochrome C oxidase subunit I (COI) gene fragments were sequenced for DNA barcoding of 39 samples from 9 snake species, including Bungarus multicinctus, the officially recognized origin animal by Chinese Pharmacopoeia, and other 8 adulterate species. The aligned sequences, 658 base pairs in length, were analyzed for divergence using the Kimura-2-parameter (K2P) distance model with MEGA5.0. Results: The mean intraspecific K2P distance was 0.0103 and the average interspecific genetic distance was 0.2178 in B. multicinctus, far greater than the minimal interspecific genetic distance of 0.027 recommended for species identification. A neighbor-joining (NJ) tree was constructed, in which each species formed a monophyletic clade with bootstrap supports of 100%. All the data were submitted to Barcode of Life Data system version 3.0 (BOLD, http://www.barcodinglife.org) under the project title “DNA barcoding Bungarus multicinctus and its adulterants”. Ten samples of commercially available crude drugs of JBS were identified using the identification engine provided by BOLD. All the samples were clearly identified at the species level, among which five were found to be the adulterants and identified as Dinodon rufozonatum. Conclusion: DNA barcoding using the standard COI gene fragments provides an effective and accurate means for JBS identification and authentication. PMID:25422545

  10. DNA barcoding of Rhodiola (crassulaceae): a case study on a group of recently diversified medicinal plants from the Qinghai-Tibetan Plateau.

    PubMed

    Zhang, Jian-Qiang; Meng, Shi-Yong; Wen, Jun; Rao, Guang-Yuan

    2015-01-01

    DNA barcoding, the identification of species using one or a few short standardized DNA sequences, is an important complement to traditional taxonomy. However, there are particular challenges for barcoding plants, especially for species with complex evolutionary histories. We herein evaluated the utility of five candidate sequences - rbcL, matK, trnH-psbA, trnL-F and the internal transcribed spacer (ITS) - for barcoding Rhodiola species, a group of high-altitude plants frequently used as adaptogens, hemostatics and tonics in traditional Tibetan medicine. Rhodiola was suggested to have diversified rapidly recently. The genus is thus a good model for testing DNA barcoding strategies for recently diversified medicinal plants. This study analyzed 189 accessions, representing 47 of the 55 recognized Rhodiola species in the Flora of China treatment. Based on intraspecific and interspecific divergence and degree of monophyly statistics, ITS was the best single-locus barcode, resolving 66% of the Rhodiola species. The core combination rbcL+matK resolved only 40.4% of them. Unsurprisingly, the combined use of all five loci provided the highest discrimination power, resolving 80.9% of the species. However, this is weaker than the discrimination power generally reported in barcoding studies of other plant taxa. The observed complications may be due to the recent diversification, incomplete lineage sorting and reticulate evolution of the genus. These processes are common features of numerous plant groups in the high-altitude regions of the Qinghai-Tibetan Plateau.

  11. Use of Fe(III) as an electron acceptor to recover previously uncultured hyperthermophiles: isolation and characterization of Geothermobacterium ferrireducens gen. nov., sp. nov.

    PubMed

    Kashefi, Kazem; Holmes, Dawn E; Reysenbach, Anna-Louise; Lovley, Derek R

    2002-04-01

    It has recently been recognized that the ability to use Fe(III) as a terminal electron acceptor is a highly conserved characteristic in hyperthermophilic microorganisms. This suggests that it may be possible to recover as-yet-uncultured hyperthermophiles in pure culture if Fe(III) is used as an electron acceptor. As part of a study of the microbial diversity of the Obsidian Pool area in Yellowstone National Park, Wyo., hot sediment samples were used as the inoculum for enrichment cultures in media containing hydrogen as the sole electron donor and poorly crystalline Fe(III) oxide as the electron acceptor. A pure culture was recovered on solidified, Fe(III) oxide medium. The isolate, designated FW-1a, is a hyperthermophilic anaerobe that grows exclusively by coupling hydrogen oxidation to the reduction of poorly crystalline Fe(III) oxide. Organic carbon is not required for growth. Magnetite is the end product of Fe(III) oxide reduction under the culture conditions evaluated. The cells are rod shaped, about 0.5 microm by 1.0 to 1.2 microm, and motile and have a single flagellum. Strain FW-1a grows at circumneutral pH, at freshwater salinities, and at temperatures of between 65 and 100 degrees C with an optimum of 85 to 90 degrees C. To our knowledge this is the highest temperature optimum of any organism in the Bacteria. Analysis of the 16S ribosomal DNA (rDNA) sequence of strain FW-1a places it within the Bacteria, most closely related to abundant but uncultured microorganisms whose 16S rDNA sequences have been previously recovered from Obsidian Pool and a terrestrial hot spring in Iceland. While previous studies inferred that the uncultured microorganisms with these 16S rDNA sequences were sulfate-reducing organisms, the physiology of the strain FW-1a, which does not reduce sulfate, indicates that these organisms are just as likely to be Fe(III) reducers. These results further demonstrate that Fe(III) may be helpful for recovering as-yet-uncultured microorganisms from hydrothermal environments and illustrate that caution must be used in inferring the physiological characteristics of at least some thermophilic microorganisms solely from 16S rDNA sequences. Based on both its 16S rDNA sequence and physiological characteristics, strain FW-1a represents a new genus among the Bacteria. The name Geothermobacterium ferrireducens gen. nov., sp. nov., is proposed (ATCC BAA-426).

  12. Physical model of the immune response of bacteria against bacteriophage through the adaptive CRISPR-Cas immune system

    NASA Astrophysics Data System (ADS)

    Han, Pu; Niestemski, Liang Ren; Barrick, Jeffrey E.; Deem, Michael W.

    2013-04-01

    Bacteria and archaea have evolved an adaptive, heritable immune system that recognizes and protects against viruses or plasmids. This system, known as the CRISPR-Cas system, allows the host to recognize and incorporate short foreign DNA or RNA sequences, called ‘spacers’ into its CRISPR system. Spacers in the CRISPR system provide a record of the history of bacteria and phage coevolution. We use a physical model to study the dynamics of this coevolution as it evolves stochastically over time. We focus on the impact of mutation and recombination on bacteria and phage evolution and evasion. We discuss the effect of different spacer deletion mechanisms on the coevolutionary dynamics. We make predictions about bacteria and phage population growth, spacer diversity within the CRISPR locus, and spacer protection against the phage population.

  13. A dimer of the lymphoid protein RAG1 recognizes the recombination signal sequence and the complex stably incorporates the high mobility group protein HMG2.

    PubMed

    Rodgers, K K; Villey, I J; Ptaszek, L; Corbett, E; Schatz, D G; Coleman, J E

    1999-07-15

    RAG1 and RAG2 are the two lymphoid-specific proteins required for the cleavage of DNA sequences known as the recombination signal sequences (RSSs) flanking V, D or J regions of the antigen-binding genes. Previous studies have shown that RAG1 alone is capable of binding to the RSS, whereas RAG2 only binds as a RAG1/RAG2 complex. We have expressed recombinant core RAG1 (amino acids 384-1008) in Escherichia coli and demonstrated catalytic activity when combined with RAG2. This protein was then used to determine its oligomeric forms and the dissociation constant of binding to the RSS. Electrophoretic mobility shift assays show that up to three oligomeric complexes of core RAG1 form with a single RSS. Core RAG1 was found to exist as a dimer both when free in solution and as the minimal species bound to the RSS. Competition assays show that RAG1 recognizes both the conserved nonamer and heptamer sequences of the RSS. Zinc analysis shows the core to contain two zinc ions. The purified RAG1 protein overexpressed in E.coli exhibited the expected cleavage activity when combined with RAG2 purified from transfected 293T cells. The high mobility group protein HMG2 is stably incorporated into the recombinant RAG1/RSS complex and can increase the affinity of RAG1 for the RSS in the absence of RAG2.

  14. Dynamic distribution patterns of ribosomal DNA and chromosomal evolution in Paphiopedilum, a lady's slipper orchid

    PubMed Central

    2011-01-01

    Background Paphiopedilum is a horticulturally and ecologically important genus of ca. 80 species of lady's slipper orchids native to Southeast Asia. These plants have long been of interest regarding their chromosomal evolution, which involves a progressive aneuploid series based on either fission or fusion of centromeres. Chromosome number is positively correlated with genome size, so rearrangement processes must include either insertion or deletion of DNA segments. We have conducted Fluorescence In Situ Hybridization (FISH) studies using 5S and 25S ribosomal DNA (rDNA) probes to survey for rearrangements, duplications, and phylogenetically-correlated variation within Paphiopedilum. We further studied sequence variation of the non-transcribed spacers of 5S rDNA (5S-NTS) to examine their complex duplication history, including the possibility that concerted evolutionary forces may homogenize diversity. Results 5S and 25S rDNA loci among Paphiopedilum species, representing all key phylogenetic lineages, exhibit a considerable diversity that correlates well with recognized evolutionary groups. 25S rDNA signals range from 2 (representing 1 locus) to 9, the latter representing hemizygosity. 5S loci display extensive structural variation, and show from 2 specific signals to many, both major and minor and highly dispersed. The dispersed signals mainly occur at centromeric and subtelomeric positions, which are hotspots for chromosomal breakpoints. Phylogenetic analysis of cloned 5S rDNA non-transcribed spacer (5S-NTS) sequences showed evidence for both ancient and recent post-speciation duplication events, as well as interlocus and intralocus diversity. Conclusions Paphiopedilum species display many chromosomal rearrangements - for example, duplications, translocations, and inversions - but only weak concerted evolutionary forces among highly duplicated 5S arrays, which suggests that double-strand break repair processes are dynamic and ongoing. These results make the genus a model system for the study of complex chromosomal evolution in plants. PMID:21910890

  15. Dynamic distribution patterns of ribosomal DNA and chromosomal evolution in Paphiopedilum, a lady's slipper orchid.

    PubMed

    Lan, Tianying; Albert, Victor A

    2011-09-12

    Paphiopedilum is a horticulturally and ecologically important genus of ca. 80 species of lady's slipper orchids native to Southeast Asia. These plants have long been of interest regarding their chromosomal evolution, which involves a progressive aneuploid series based on either fission or fusion of centromeres. Chromosome number is positively correlated with genome size, so rearrangement processes must include either insertion or deletion of DNA segments. We have conducted Fluorescence In Situ Hybridization (FISH) studies using 5S and 25S ribosomal DNA (rDNA) probes to survey for rearrangements, duplications, and phylogenetically-correlated variation within Paphiopedilum. We further studied sequence variation of the non-transcribed spacers of 5S rDNA (5S-NTS) to examine their complex duplication history, including the possibility that concerted evolutionary forces may homogenize diversity. 5S and 25S rDNA loci among Paphiopedilum species, representing all key phylogenetic lineages, exhibit a considerable diversity that correlates well with recognized evolutionary groups. 25S rDNA signals range from 2 (representing 1 locus) to 9, the latter representing hemizygosity. 5S loci display extensive structural variation, and show from 2 specific signals to many, both major and minor and highly dispersed. The dispersed signals mainly occur at centromeric and subtelomeric positions, which are hotspots for chromosomal breakpoints. Phylogenetic analysis of cloned 5S rDNA non-transcribed spacer (5S-NTS) sequences showed evidence for both ancient and recent post-speciation duplication events, as well as interlocus and intralocus diversity. Paphiopedilum species display many chromosomal rearrangements--for example, duplications, translocations, and inversions--but only weak concerted evolutionary forces among highly duplicated 5S arrays, which suggests that double-strand break repair processes are dynamic and ongoing. These results make the genus a model system for the study of complex chromosomal evolution in plants.

  16. A High-Throughput Process for the Solid-Phase Purification of Synthetic DNA Sequences

    PubMed Central

    Grajkowski, Andrzej; Cieślak, Jacek; Beaucage, Serge L.

    2017-01-01

    An efficient process for the purification of synthetic phosphorothioate and native DNA sequences is presented. The process is based on the use of an aminopropylated silica gel support functionalized with aminooxyalkyl functions to enable capture of DNA sequences through an oximation reaction with the keto function of a linker conjugated to the 5′-terminus of DNA sequences. Deoxyribonucleoside phosphoramidites carrying this linker, as a 5′-hydroxyl protecting group, have been synthesized for incorporation into DNA sequences during the last coupling step of a standard solid-phase synthesis protocol executed on a controlled pore glass (CPG) support. Solid-phase capture of the nucleobase- and phosphate-deprotected DNA sequences released from the CPG support is demonstrated to proceed near quantitatively. Shorter than full-length DNA sequences are first washed away from the capture support; the solid-phase purified DNA sequences are then released from this support upon reaction with tetra-n-butylammonium fluoride in dry dimethylsulfoxide (DMSO) and precipitated in tetrahydrofuran (THF). The purity of solid-phase-purified DNA sequences exceeds 98%. The simulated high-throughput and scalability features of the solid-phase purification process are demonstrated without sacrificing purity of the DNA sequences. PMID:28628204

  17. Utility of DNA barcoding for rapid and accurate assessment of bat diversity in Malaysia in the absence of formally described species.

    PubMed

    Wilson, J-J; Sing, K-W; Halim, M R A; Ramli, R; Hashim, R; Sofian-Azirun, M

    2014-02-19

    Bats are important flagship species for biodiversity research; however, diversity in Southeast Asia is considerably underestimated in the current checklists and field guides. Incorporation of DNA barcoding into surveys has revealed numerous species-level taxa overlooked by conventional methods. Inclusion of these taxa in inventories provides a more informative record of diversity, but is problematic as these species lack formal description. We investigated how frequently documented, but undescribed, bat taxa are encountered in Peninsular Malaysia. We discuss whether a barcode library provides a means of recognizing and recording these taxa across biodiversity inventories. Tissue was sampled from bats trapped at Pasir Raja, Dungun Terengganu, Peninsular Malaysia. The DNA was extracted and the COI barcode region amplified and sequenced. We identified 9 species-level taxa within our samples, based on analysis of the DNA barcodes. Six specimens matched to four previously documented taxa considered candidate species but currently lacking formal taxonomic status. This study confirms the high diversity of bats within Peninsular Malaysia (9 species in 13 samples) and demonstrates how DNA barcoding allows for inventory and documentation of known taxa lacking formal taxonomic status.

  18. Serratia ureilytica sp. nov., a novel urea-utilizing species.

    PubMed

    Bhadra, Bhaskar; Roy, Pradosh; Chakraborty, Ranadhir

    2005-09-01

    A Gram-negative, rod-shaped, urea-dissolving and non-spore-forming bacterium, designated strain NiVa 51(T), was isolated from water of the River Torsa in Hasimara, Jalpaiguri district, West Bengal, India. On the basis of 16S rRNA gene sequence similarity, strain NiVa 51(T) was shown to belong to the gamma-Proteobacteria and to be related to Serratia marcescens subsp. sakuensis (98.35%) and S. marcescens subsp. marcescens (98.30%); however, strain NiVa 51(T) exhibited only 43.7% similarity to S. marcescens by DNA-DNA hybridization. The G+C content of the genomic DNA of the isolate was 60 mol%. Both biochemical characteristics and fatty acid analysis data supported the affiliation of strain NiVa 51(T) to the genus Serratia. Furthermore, strain NiVa 51(T) was found to utilize urea as nitrogen source. The results of DNA-DNA hybridization as well as physiological and biochemical tests allowed genotypic and phenotypic differentiation of strain NiVa 51(T) from recognized Serratia species. Strain NiVa 51(T) therefore represents a novel species, for which the name Serratia ureilytica sp. nov. is proposed, with type strain NiVa 51(T) (=LMG 22860(T)=CCUG 50595(T)).

  19. An improved model for whole genome phylogenetic analysis by Fourier transform.

    PubMed

    Yin, Changchuan; Yau, Stephen S-T

    2015-10-07

    DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences undergone rearrangements as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, and thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.

    PubMed

    Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene

    2017-02-01

    Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  1. Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

    PubMed

    Yin, Changchuan

    2015-04-01

    To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.

  2. Single-cell genomic sequencing using Multiple Displacement Amplification.

    PubMed

    Lasken, Roger S

    2007-10-01

    Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).

  3. Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae).

    PubMed

    Brouard, Jean-Simon; Turmel, Monique; Otis, Christian; Lemieux, Claude

    2016-01-01

    The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum . The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium , it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold longer and dispersed repeats are more abundant, but a smaller fraction of the Oedocladium genome is occupied by introns. Six additional group II introns are present, five of which lack ORFs and carry highly similar sequences to that of the ORF-less IIA intron shared with Oedogonium . Secondary structure analysis of the group IIA introns disclosed marked differences in the exon-binding sites; however, each intron showed perfect or nearly perfect base pairing interactions with its target site. Our results suggest that chloroplast genes rearrange more slowly in the Oedogoniales than in the Chaetophorales and raise questions as to what was the nature of the foreign coding sequences in the IR of the common ancestor of the Oedogoniales. They provide the first evidence for intragenomic proliferation of group IIA introns in the Viridiplantae, revealing that intron spread in the Oedocladium lineage likely occurred by retrohoming after sequence divergence of the exon-binding sites.

  4. If the cap fits, wear it: an overview of telomeric structures over evolution.

    PubMed

    Fulcher, Nick; Derboven, Elisa; Valuchova, Sona; Riha, Karel

    2014-03-01

    Genome organization into linear chromosomes likely represents an important evolutionary innovation that has permitted the development of the sexual life cycle; this process has consequently advanced nuclear expansion and increased complexity of eukaryotic genomes. Chromosome linearity, however, poses a major challenge to the internal cellular machinery. The need to efficiently recognize and repair DNA double-strand breaks that occur as a consequence of DNA damage presents a constant threat to native chromosome ends known as telomeres. In this review, we present a comparative survey of various solutions to the end protection problem, maintaining an emphasis on DNA structure. This begins with telomeric structures derived from a subset of prokaryotes, mitochondria, and viruses, and will progress into the typical telomere structure exhibited by higher organisms containing TTAGG-like tandem sequences. We next examine non-canonical telomeres from Drosophila melanogaster, which comprise arrays of retrotransposons. Finally, we discuss telomeric structures in evolution and possible switches between canonical and non-canonical solutions to chromosome end protection.

  5. Broad and Cross-Clade CD4+ T-Cell Responses Elicited by a DNA Vaccine Encoding Highly Conserved and Promiscuous HIV-1 M-Group Consensus Peptides

    PubMed Central

    Almeida, Rafael Ribeiro; Rosa, Daniela Santoro; Ribeiro, Susan Pereira; Santana, Vinicius Canato; Kallás, Esper Georges; Sidney, John; Sette, Alessandro; Kalil, Jorge; Cunha-Neto, Edecio

    2012-01-01

    T-cell based vaccine approaches have emerged to counteract HIV-1/AIDS. Broad, polyfunctional and cytotoxic CD4+ T-cell responses have been associated with control of HIV-1 replication, which supports the inclusion of CD4+ T-cell epitopes in vaccines. A successful HIV-1 vaccine should also be designed to overcome viral genetic diversity and be able to confer immunity in a high proportion of immunized individuals from a diverse HLA-bearing population. In this study, we rationally designed a multiepitopic DNA vaccine in order to elicit broad and cross-clade CD4+ T-cell responses against highly conserved and promiscuous peptides from the HIV-1 M-group consensus sequence. We identified 27 conserved, multiple HLA-DR-binding peptides in the HIV-1 M-group consensus sequences of Gag, Pol, Nef, Vif, Vpr, Rev and Vpu using the TEPITOPE algorithm. The peptides bound in vitro to an average of 12 out of the 17 tested HLA-DR molecules and also to several molecules such as HLA-DP, -DQ and murine IAb and IAd. Sixteen out of the 27 peptides were recognized by PBMC from patients infected with different HIV-1 variants and 72% of such patients recognized at least 1 peptide. Immunization with a DNA vaccine (HIVBr27) encoding the identified peptides elicited IFN-γ secretion against 11 out of the 27 peptides in BALB/c mice; CD4+ and CD8+ T-cell proliferation was observed against 8 and 6 peptides, respectively. HIVBr27 immunization elicited cross-clade T-cell responses against several HIV-1 peptide variants. Polyfunctional CD4+ and CD8+ T cells, able to simultaneously proliferate and produce IFN-γ and TNF-α, were also observed. This vaccine concept may cope with HIV-1 genetic diversity as well as provide increased population coverage, which are desirable features for an efficacious strategy against HIV-1/AIDS. PMID:23028895

  6. Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage

    PubMed Central

    Brok-Volchanskaya, Vera S.; Kadyrov, Farid A.; Sivogrivov, Dmitry E.; Kolosov, Peter M.; Sokolov, Andrey S.; Shlyapnikov, Michael G.; Kryukov, Valentine M.; Granovsky, Igor E.

    2008-01-01

    Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. Bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that SegB protein is a site-specific endonuclease, which produces mostly 3′ 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains 11-bp conserved sequence, which corresponds to a conserved motif of tRNA TψC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack segB gene and several tRNA genes. In co-infections of phages T4 and T2L, segB gene is inherited with nearly 100% of efficiency. The preferred inheritance depends absolutely on the segB gene integrity and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages. PMID:18281701

  7. Analysis of p53 gene mutations in human gliomas by polymerase chain reaction-based single-strand conformation polymorphism and DNA sequencing.

    PubMed

    Sarkar, F H; Kupsky, W J; Li, Y W; Sreepathi, P

    1994-03-01

    Mutations in the p53 gene have been recognized in brain tumors, and clonal expansion of p53 mutant cells has been shown to be associated with glioma progression. However, studies on the p53 gene have been limited by the need for frozen tissues. We have developed a method utilizing polymerase chain reaction (PCR) for the direct analysis of p53 mutation by single-strand conformation polymorphism (SSCP) and by direct DNA sequencing of the p53 gene using a single 10-microns paraffin-embedded tissue section. We applied this method to screen for p53 gene mutations in exons 5-8 in human gliomas utilizing paraffin-embedded tissues. Twenty paraffin blocks containing tumor were selected from surgical specimens from 17 different adult patients. Tumors included six anaplastic astrocytomas (AAs), nine glioblastomas (GBs), and two mixed malignant gliomas (MMGs). The tissue section on the stained glass slide was used to guide microdissection of an unstained adjacent tissue section to ensure > 90% of the tumor cell population for p53 mutational analysis. Simultaneously, microdissection of the tissue was also carried out to obtain normal tissue from adjacent areas as a control. Mutations in the p53 gene were identified in 3 of 17 (18%) patients by PCR-SSCP analysis and subsequently confirmed by PCR-based DNA sequencing. Mutations in exon 5 resulting in amino acid substitution were found in one thalamic AA (codon 158, CGC > CTT: Arg > Leu) and one cerebral hemispheric GB (codon 151, CCG > CTG: Pro > Leu).(ABSTRACT TRUNCATED AT 250 WORDS)

  8. Acquisition of New DNA Sequences After Infection of Chicken Cells with Avian Myeloblastosis Virus

    PubMed Central

    Shoyab, M.; Baluda, M. A.; Evans, R.

    1974-01-01

    DNA-RNA hybridization studies between 70S RNA from avian myeloblastosis virus (AMV) and an excess of DNA from (i) AMV-induced leukemic chicken myeloblasts or (ii) a mixture of normal and of congenitally infected K-137 chicken embryos producing avian leukosis viruses revealed the presence of fast- and slow-hybridizing virus-specific DNA sequences. However, the leukemic cells contained twice the level of AMV-specific DNA sequences observed in normal chicken embryonic cells. The fast-reacting sequences were two to three times more numerous in leukemic DNA than in DNA from the mixed embryos. The slow-reacting sequences had a reiteration frequency of approximately 9 and 6, in the two respective systems. Both the fast- and the slow-reacting DNA sequences in leukemic cells exhibited a higher Tm (2 C) than the respective DNA sequences in normal cells. In normal and leukemic cells the slow hybrid sequences appeared to have a Tm which was 2 C higher than that of the fast hybrid sequences. Individual non-virus-producing chicken embryos, either group-specific antigen positive or negative, contained 40 to 100 copies of the fast sequences and 2 to 6 copies of the slowly hybridizing sequences per cell genome. Normal rat cells did not contain DNA that hybridized with AMV RNA, whereas non-virus-producing rat cells transformed by B-77 avian sarcoma virus contained only the slowly reacting sequences. The results demonstrate that leukemic cells transformed by AMV contain new AMV-specific DNA sequences which were not present before infection. PMID:16789139

  9. Homogeneity of the 16S rDNA sequence among geographically disparate isolates of Taylorella equigenitalis

    PubMed Central

    Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC

    2006-01-01

    Background At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, whose sequence differences occur at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. When cloning, sequencing and comparison of the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France, nucleotide sequence differences were demonstrated at the six loci in the 1,469 nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935

  10. DNA barcode identification of Podocarpaceae--the second largest conifer family.

    PubMed

    Little, Damon P; Knopf, Patrick; Schulz, Christian

    2013-01-01

    We have generated matK, rbcL, and nrITS2 DNA barcodes for 320 specimens representing all 18 extant genera of the conifer family Podocarpaceae. The sample includes 145 of the 198 recognized species. Comparative analyses of sequence quality and species discrimination were conducted on the 159 individuals from which all three markers were recovered (representing 15 genera and 97 species). The vast majority of sequences were of high quality (B 30 = 0.596-0.989). Even the lowest quality sequences exceeded the minimum requirements of the BARCODE data standard. In the few instances that low quality sequences were generated, the responsible mechanism could not be discerned. There were no statistically significant differences in the discriminatory power of markers or marker combinations (p = 0.05). The discriminatory power of the barcode markers individually and in combination is low (56.7% of species at maximum). In some instances, species discrimination failed in spite of ostensibly useful variation being present (genotypes were shared among species), but in many cases there was simply an absence of sequence variation. Barcode gaps (maximum intraspecific p-distance > minimum interspecific p-distance) were observed in 50.5% of species when all three markers were considered simultaneously. The presence of a barcode gap was not predictive of discrimination success (p = 0.02) and there was no statistically significant difference in the frequency of barcode gaps among markers (p = 0.05). In addition, there was no correlation between number of individuals sampled per species and the presence of a barcode gap (p = 0.27).

  11. DNA Barcode Identification of Podocarpaceae—The Second Largest Conifer Family

    PubMed Central

    Little, Damon P.; Knopf, Patrick; Schulz, Christian

    2013-01-01

    We have generated matK, rbcL, and nrITS2 DNA barcodes for 320 specimens representing all 18 extant genera of the conifer family Podocarpaceae. The sample includes 145 of the 198 recognized species. Comparative analyses of sequence quality and species discrimination were conducted on the 159 individuals from which all three markers were recovered (representing 15 genera and 97 species). The vast majority of sequences were of high quality (B 30 = 0.596–0.989). Even the lowest quality sequences exceeded the minimum requirements of the BARCODE data standard. In the few instances that low quality sequences were generated, the responsible mechanism could not be discerned. There were no statistically significant differences in the discriminatory power of markers or marker combinations (p = 0.05). The discriminatory power of the barcode markers individually and in combination is low (56.7% of species at maximum). In some instances, species discrimination failed in spite of ostensibly useful variation being present (genotypes were shared among species), but in many cases there was simply an absence of sequence variation. Barcode gaps (maximum intraspecific p–distance > minimum interspecific p–distance) were observed in 50.5% of species when all three markers were considered simultaneously. The presence of a barcode gap was not predictive of discrimination success (p = 0.02) and there was no statistically significant difference in the frequency of barcode gaps among markers (p = 0.05). In addition, there was no correlation between number of individuals sampled per species and the presence of a barcode gap (p = 0.27). PMID:24312258

  12. Molecular phylogenetics of the family Cyprinidae (Actinopterygii: Cypriniformes) as evidenced by sequence variation in the first intron of S7 ribosomal protein-coding gene: further evidence from a nuclear gene of the systematic chaos in the family.

    PubMed

    He, Shunping; Mayden, Richard L; Wang, Xuzheng; Wang, Wei; Tang, Kevin L; Chen, Wei-Jen; Chen, Yiyu

    2008-03-01

    The family Cyprinidae is the largest freshwater fish group in the world, including over 200 genera and 2100 species. The phylogenetic relationships of major clades within this family are simply poorly understood, largely because of the overwhelming diversity of the group; however, several investigators have advanced different hypotheses of relationships that pre- and post-date the use of shared-derived characters as advocated through phylogenetic systematics. As expected, most previous investigations used morphological characters. Recently, mitochondrial DNA (mtDNA) sequences and combined morphological and mtDNA investigations have been used to explore and advance our understanding of species relationships and test monophyletic groupings. Limitations of these studies include limited taxon sampling and a strict reliance upon maternally inherited mtDNA variation. The present study is the first endeavor to recover the phylogenetic relationships of the 12 previously recognized monophyletic subfamilies within the Cyprinidae using newly sequenced nuclear DNA (nDNA) for over 50 species representing members of the different previously hypothesized subfamily and family groupings within the Cyprinidae and from other cypriniform families as outgroup taxa. Hypothesized phylogenetic relationships are constructed using maximum parsimony and Basyesian analyses of 1042 sites, of which 971 sites were variable and 790 were phylogenetically informative. Using other appropriate cypriniform taxa of the families Catostomidae (Myxocyprinus asiaticus), Gyrinocheilidae (Gyrinocheilus aymonieri), and Balitoridae (Nemacheilus sp. and Beaufortia kweichowensis) as outgroups, the Cyprinidae is resolved as a monophyletic group. Within the family the genera Raiamas, Barilius, Danio, and Rasbora, representing many of the tropical cyprinids, represent basal members of the family. All other species can be classified into variably supported and resolved monophyletic lineages, depending upon analysis, that are consistent with or correspond to Barbini and Leuciscini. The Barbini includes taxa traditionally aligned with the subfamily Cyprininae sensu previous morphological revisionary studies by Howes (Barbinae, Labeoninae, Cyprininae and Schizothoracinae). The Leuciscini includes six other subfamilies that are mainly divided into three separate lineages. The relationships among genera and subfamilies are discussed as well as the possible origins of major lineages.

  13. Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

    DOEpatents

    McCutchen-Maloney, Sandra L.

    2002-01-01

    DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.

  14. Hybridization chain reaction amplification for highly sensitive fluorescence detection of DNA with dextran coated microarrays.

    PubMed

    Chao, Jie; Li, Zhenhua; Li, Jing; Peng, Hongzhen; Su, Shao; Li, Qian; Zhu, Changfeng; Zuo, Xiaolei; Song, Shiping; Wang, Lianhui; Wang, Lihua

    2016-07-15

    Microarrays of biomolecules hold great promise in the fields of genomics, proteomics, and clinical assays on account of their remarkably parallel and high-throughput assay capability. However, the fluorescence detection used in most conventional DNA microarrays is still limited by sensitivity. In this study, we have demonstrated a novel universal and highly sensitive platform for fluorescent detection of sequence specific DNA at the femtomolar level by combining dextran-coated microarrays with hybridization chain reaction (HCR) signal amplification. Three-dimensional dextran matrix was covalently coated on glass surface as the scaffold to immobilize DNA recognition probes to increase the surface binding capacity and accessibility. DNA nanowire tentacles were formed on the matrix surface for efficient signal amplification by capturing multiple fluorescent molecules in a highly ordered way. By quantifying microscopic fluorescent signals, the synergetic effects of dextran and HCR greatly improved sensitivity of DNA microarrays, with a detection limit of 10fM (1×10(5) molecules). This detection assay could recognize one-base mismatch with fluorescence signals dropped down to ~20%. This cost-effective microarray platform also worked well with samples in serum and thus shows great potential for clinical diagnosis. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Stella, Stefano; University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen; Molina, Rafael

    Crystal structures of BurrH and the BurrH–DNA complex are reported. DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-bindingmore » domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing.« less

  16. A Nonconventional Approach to Patterned Nanoarrays of DNA Strands for Template-Assisted Assembly of Polyfluorene Nanowires.

    PubMed

    Bae, Dong Geun; Jeong, Ji-Eun; Kang, Seok Hee; Byun, Myunghwan; Han, Dong-Wook; Lin, Zhiqun; Woo, Han Young; Hong, Suck Won

    2016-08-01

    DNA molecules have been widely recognized as promising building blocks for constructing functional nanostructures with two main features, that is, self-assembly and rich chemical functionality. The intrinsic feature size of DNA makes it attractive for creating versatile nanostructures. Moreover, the ease of access to tune the surface of DNA by chemical functionalization offers numerous opportunities for many applications. Herein, a simple yet robust strategy is developed to yield the self-assembly of DNA by exploiting controlled evaporative assembly of DNA solution in a unique confined geometry. Intriguingly, depending on the concentration of DNA solution, highly aligned nanostructured fibrillar-like arrays and well-positioned concentric ring-like superstructures composed of DNAs are formed. Subsequently, the ring-like negatively charged DNA superstructures are employed as template to produce conductive organic nanowires on a silicon substrate by complexing with a positively charged conjugated polyelectrolyte poly[9,9-bis(6'-N,N,N-trimethylammoniumhexyl)fluorene dibromide] (PF2) through the strong electrostatic interaction. Finally, a monolithic integration of aligned arrays of DNA-templated PF2 nanowires to yield two DNA/PF2-based devices is demonstrated. It is envisioned that this strategy can be readily extended to pattern other biomolecules and may render a broad range of potential applications from the nucleotide sequence and hybridization as recognition events to transducing elements in chemical sensors. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Formation and processing of DNA damage substrates for the hNEIL enzymes.

    PubMed

    Fleming, Aaron M; Burrows, Cynthia J

    2017-06-01

    Reactive oxygen species (ROS) are harnessed by the cell for signaling at the same time as being detrimental to cellular components such as DNA. The genome and transcriptome contain instructions that can alter cellular processes when oxidized. The guanine (G) heterocycle in the nucleotide pool, DNA, or RNA is the base most prone to oxidation. The oxidatively-derived products of G consistently observed in high yields from hydroxyl radical, carbonate radical, or singlet oxygen oxidations under conditions modeling the cellular reducing environment are discussed. The major G base oxidation products are 8-oxo-7,8-dihydroguanine (OG), 5-carboxamido-5-formamido-2-iminohydantoin (2Ih), spiroiminodihydantoin (Sp), and 5-guanidinohydantoin (Gh). The yields of these products show dependency on the oxidant and the reaction context that includes nucleoside, single-stranded DNA (ssDNA), double-stranded DNA (dsDNA), and G-quadruplex DNA (G4-DNA) structures. Upon formation of these products in cells, they are recognized by the DNA glycosylases in the base excision repair (BER) pathway. This review focuses on initiation of BER by the mammalian Nei-like1-3 (NEIL1-3) glycosylases for removal of 2Ih, Sp, and Gh. The unique ability of the human NEILs to initiate removal of the hydantoins in ssDNA, bulge-DNA, bubble-DNA, dsDNA, and G4-DNA is outlined. Additionally, when Gh exists in a G4 DNA found in a gene promoter, NEIL-mediated repair is modulated by the plasticity of the G4-DNA structure provided by additional G-runs flanking the sequence. On the basis of these observations and cellular studies from the literature, the interplay between DNA oxidation and BER to alter gene expression is discussed. Copyright © 2017 Elsevier Inc. All rights reserved.

  18. TE-Tracker: systematic identification of transposition events through whole-genome resequencing.

    PubMed

    Gilly, Arthur; Etcheverry, Mathilde; Madoui, Mohammed-Amin; Guy, Julie; Quadrana, Leandro; Alberti, Adriana; Martin, Antoine; Heitkam, Tony; Engelen, Stefan; Labadie, Karine; Le Pen, Jeremie; Wincker, Patrick; Colot, Vincent; Aury, Jean-Marc

    2014-11-19

    Transposable elements (TEs) are DNA sequences that are able to move from their location in the genome by cutting or copying themselves to another locus. As such, they are increasingly recognized as impacting all aspects of genome function. With the dramatic reduction in cost of DNA sequencing, it is now possible to resequence whole genomes in order to systematically characterize novel TE mobilization in a particular individual. However, this task is made difficult by the inherently repetitive nature of TE sequences, which in some eukaryotes compose over half of the genome sequence. Currently, only a few software tools dedicated to the detection of TE mobilization using next-generation-sequencing are described in the literature. They often target specific TEs for which annotation is available, and are only able to identify families of closely related TEs, rather than individual elements. We present TE-Tracker, a general and accurate computational method for the de-novo detection of germ line TE mobilization from re-sequenced genomes, as well as the identification of both their source and destination sequences. We compare our method with the two classes of existing software: specialized TE-detection tools and generic structural variant (SV) detection tools. We show that TE-Tracker, while working independently of any prior annotation, bridges the gap between these two approaches in terms of detection power. Indeed, its positive predictive value (PPV) is comparable to that of dedicated TE software while its sensitivity is typical of a generic SV detection tool. TE-Tracker demonstrates the benefit of adopting an annotation-independent, de novo approach for the detection of TE mobilization events. We use TE-Tracker to provide a comprehensive view of transposition events induced by loss of DNA methylation in Arabidopsis. TE-Tracker is freely available at http://www.genoscope.cns.fr/TE-Tracker . We show that TE-Tracker accurately detects both the source and destination of novel transposition events in re-sequenced genomes. Moreover, TE-Tracker is able to detect all potential donor sequences for a given insertion, and can identify the correct one among them. Furthermore, TE-Tracker produces significantly fewer false positives than common SV detection programs, thus greatly facilitating the detection and analysis of TE mobilization events.

  19. Transcriptome and target DNA enrichment sequence data provide new insights into the phylogeny of vespid wasps (Hymenoptera: Aculeata: Vespidae).

    PubMed

    Bank, Sarah; Sann, Manuela; Mayer, Christoph; Meusemann, Karen; Donath, Alexander; Podsiadlowski, Lars; Kozlov, Alexey; Petersen, Malte; Krogmann, Lars; Meier, Rudolf; Rosa, Paolo; Schmitt, Thomas; Wurdack, Mareike; Liu, Shanlin; Zhou, Xin; Misof, Bernhard; Peters, Ralph S; Niehuis, Oliver

    2017-11-01

    The wasp family Vespidae comprises more than 5000 described species which represent life history strategies ranging from solitary and presocial to eusocial and socially parasitic. The phylogenetic relationships of the major vespid wasp lineages (i.e., subfamilies and tribes) have been investigated repeatedly by analyzing behavioral and morphological traits as well as nucleotide sequences of few selected genes with largely incongruent results. Here we reconstruct their phylogenetic relationships using a phylogenomic approach. We sequenced the transcriptomes of 24 vespid wasp and eight outgroup species and exploited the transcript sequences for design of probes for enriching 913 single-copy protein-coding genes to complement the transcriptome data with nucleotide sequence data from additional 25 ethanol-preserved vespid species. Results from phylogenetic analyses of the combined sequence data revealed the eusocial subfamily Stenogastrinae to be the sister group of all remaining Vespidae, while the subfamily Eumeninae turned out to be paraphyletic. Of the three currently recognized eumenine tribes, Odynerini is paraphyletic with respect to Eumenini, and Zethini is paraphyletic with respect to Polistinae and Vespinae. Our results are in conflict with the current tribal subdivision of Eumeninae and thus, we suggest granting subfamily rank to the two major clades of "Zethini": Raphiglossinae and Zethinae. Overall, our findings corroborate the hypothesis of two independent origins of eusociality in vespid wasps and suggest a single origin of using masticated and salivated plant material for building nests by Raphiglossinae, Zethinae, Polistinae, and Vespinae. The inferred phylogenetic relationships and the open access vespid wasp target DNA enrichment probes will provide a valuable tool for future comparative studies on species of the family Vespidae, including their genomes, life styles, evolution of sociality, and co-evolution with other organisms. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. Methylation patterns of repetitive DNA sequences in germ cells of Mus musculus.

    PubMed

    Sanford, J; Forrester, L; Chapman, V; Chandley, A; Hastie, N

    1984-03-26

    The major and the minor satellite sequences of Mus musculus were undermethylated in both sperm and oocyte DNAs relative to the amount of undermethylation observed in adult somatic tissue DNA. This hypomethylation was specific for satellite sequences in sperm DNA. Dispersed repetitive and low copy sequences show a high degree of methylation in sperm DNA; however, a dispersed repetitive sequence was undermethylated in oocyte DNA. This finding suggests a difference in the amount of total genomic DNA methylation between sperm and oocyte DNA. The methylation levels of the minor satellite sequences did not change during spermiogenesis, and were not associated with the onset of meiosis or a specific stage in sperm development.

  1. Process of labeling specific chromosomes using recombinant repetitive DNA

    DOEpatents

    Moyzis, R.K.; Meyne, J.

    1988-02-12

    Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.

  2. Genotypic and phenotypic diversity of Alicyclobacillus acidocaldarius isolates.

    PubMed

    Félix-Valenzuela, L; Guardiola-Avila, I; Burgara-Estrella, A; Ibarra-Zavala, M; Mata-Haro, V

    2015-10-01

    The fruit juice industry recognizes Alicyclobacillus as a major quality control target micro-organism. In this study, we analysed 19 bacterial isolates to identify Alicyclobacillus species by polymerase chain reaction (PCR) and sequencing analyses. Phenotypic and genomic diversity among isolates were investigated by API 50CHB system and ERIC-PCR (enterobacterial repetitive intergenic consensus-PCR) respectively. All bacterial isolates were identified as Alicyclobacillus acidocaldarius, and almost all showed identical DNA sequences according to their 16S rRNA (rDNA) gene partial sequences. Only few carbohydrates were fermented by A. acidocaldarius isolates, and there was little variability in the biochemical profile. Genotypic fingerprinting of the A. acidocaldarius isolates showed high diversity, and clusters by ERIC-PCR were distinct to those obtained from the 16S rRNA gene phylogenetic tree. There was no correlation between phenotypic and genotypic variability in the A. acidocaldarius isolates analysed in this study. Detection of Alicyclobacillus strains is imperative in fruit concentrates and juices due to the production of guaiacol. Identification of the genera originates rejection of the product by processing industry. However, not all the Alicyclobacillus species are deteriorative and hence the importance to differentiate among them. In this study, partial 16S ribosomal RNA sequence alignment allowed the differentiation of species. In addition, ERIC-PCR was introduced for the genotypic characterization of Alicyclobacillus, as an alternative for differentiation among isolates from the same species. © 2015 The Society for Applied Microbiology.

  3. Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

    PubMed

    Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

    2014-11-01

    As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  4. Enlightenment of Yeast Mitochondrial Homoplasmy: Diversified Roles of Gene Conversion

    PubMed Central

    Ling, Feng; Mikawa, Tsutomu; Shibata, Takehiko

    2011-01-01

    Mitochondria have their own genomic DNA. Unlike the nuclear genome, each cell contains hundreds to thousands of copies of mitochondrial DNA (mtDNA). The copies of mtDNA tend to have heterogeneous sequences, due to the high frequency of mutagenesis, but are quickly homogenized within a cell (“homoplasmy”) during vegetative cell growth or through a few sexual generations. Heteroplasmy is strongly associated with mitochondrial diseases, diabetes and aging. Recent studies revealed that the yeast cell has the machinery to homogenize mtDNA, using a common DNA processing pathway with gene conversion; i.e., both genetic events are initiated by a double-stranded break, which is processed into 3′ single-stranded tails. One of the tails is base-paired with the complementary sequence of the recipient double-stranded DNA to form a D-loop (homologous pairing), in which repair DNA synthesis is initiated to restore the sequence lost by the breakage. Gene conversion generates sequence diversity, depending on the divergence between the donor and recipient sequences, especially when it occurs among a number of copies of a DNA sequence family with some sequence variations, such as in immunoglobulin diversification in chicken. MtDNA can be regarded as a sequence family, in which the members tend to be diversified by a high frequency of spontaneous mutagenesis. Thus, it would be interesting to determine why and how double-stranded breakage and D-loop formation induce sequence homogenization in mitochondria and sequence diversification in nuclear DNA. We will review the mechanisms and roles of mtDNA homoplasmy, in contrast to nuclear gene conversion, which diversifies gene and genome sequences, to provide clues toward understanding how the common DNA processing pathway results in such divergent outcomes. PMID:24710143

  5. "First generation" automated DNA sequencing technology.

    PubMed

    Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M

    2011-10-01

    Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.

  6. Influence of DNA sequence on the structure of minicircles under torsional stress

    PubMed Central

    Wang, Qian; Irobalieva, Rossitza N.; Chiu, Wah; Schmid, Michael F.; Fogg, Jonathan M.; Zechiedrich, Lynn

    2017-01-01

    Abstract The sequence dependence of the conformational distribution of DNA under various levels of torsional stress is an important unsolved problem. Combining theory and coarse-grained simulations shows that the DNA sequence and a structural correlation due to topology constraints of a circle are the main factors that dictate the 3D structure of a 336 bp DNA minicircle under torsional stress. We found that DNA minicircle topoisomers can have multiple bend locations under high torsional stress and that the positions of these sharp bends are determined by the sequence, and by a positive mechanical correlation along the sequence. We showed that simulations and theory are able to provide sequence-specific information about individual DNA minicircles observed by cryo-electron tomography (cryo-ET). We provided a sequence-specific cryo-ET tomogram fitting of DNA minicircles, registering the sequence within the geometric features. Our results indicate that the conformational distribution of minicircles under torsional stress can be designed, which has important implications for using minicircle DNA for gene therapy. PMID:28609782

  7. Analysis of DNA Sequences by an Optical Time-Integrating Correlator: Proof-of-Concept Experiments.

    DTIC Science & Technology

    1992-05-01

    DNA ANALYSIS STRATEGY 4 2.1 Representation of DNA Bases 4 2.2 DNA Analysis Strategy 6 3.0 CUSTOM GENERATORS FOR DNA SEQUENCES 10 3.1 Hardware Design 10...of the DNA bases where each base is represented by a 7-bits long pseudorandom sequence. 5 Figure 4: Coarse analysis of a DNA sequence. 7 Figure 5: Fine...a 20-bases long database. 32 xiii LIST OF TABLES PAGE Table 1: Short representations of the DNA bases where each base is represented by 7-bits long

  8. Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting

    NASA Astrophysics Data System (ADS)

    Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.

    1997-05-01

    Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possible be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic disease can be a very efficient step to reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.

  9. Micronuclear DNA of Oxytricha nova contains sequences with autonomously replicating activity in Saccharomyces cerevisiae.

    PubMed Central

    Colombo, M M; Swanton, M T; Donini, P; Prescott, D M

    1984-01-01

    Oxytricha nova is a hypotrichous ciliate with micronuclei and macronuclei. Micronuclei, which contain large, chromosomal-sized DNA, are genetically inert but undergo meiosis and exchange during cell mating. Macronuclei, which contain only small, gene-sized DNA molecules, provide all of the nuclear RNA needed to run the cell. After cell mating the macronucleus is derived from a micronucleus, a derivation that includes excision of the genes from chromosomes and elimination of the remaining DNA. The eliminated DNA includes all of the repetitious sequences and approximately 95% of the unique sequences. We cloned large restriction fragments from the micronucleus that confer replication ability on a replication-deficient plasmid in Saccharomyces cerevisiae. Sequences that confer replication ability are called autonomously replicating sequences. The frequency and effectiveness of autonomously replicating sequences in micronuclear DNA are similar to those reported for DNAs of other organisms introduced into yeast cells. Of the 12 micronuclear fragments with autonomously replicating sequence activity, 9 also showed homology to macronuclear DNA, indicating that they contain a macronuclear gene sequence. We conclude from this that autonomously replicating sequence activity is nonrandomly distributed throughout micronuclear DNA and is preferentially associated with those regions of micronuclear DNA that contain genes. Images PMID:6092934

  10. DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

    PubMed Central

    Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

    2014-01-01

    As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252

  11. Single-molecule nanopore enzymology

    PubMed Central

    Wloka, Carsten; Maglia, Giovanni

    2017-01-01

    Biological nanopores are a class of membrane proteins that open nanoscale water-conduits in biological membranes. When they are reconstituted in artificial membranes and a bias voltage is applied across the membrane, the ionic current passing through individual nanopores can be used to monitor chemical reactions, to recognize individual molecules and, of most interest, to sequence DNA. More recently, proteins and enzymes have started being analysed with nanopores. Monitoring enzymatic reactions with nanopores, i.e. nanopore enzymology, has the unique advantage that it allows long-timescale observations of native proteins at the single-molecule level. Here we describe the approaches and challenges in nanopore enzymology. PMID:28630164

  12. Cryptococcus neoformans var. grubii: Separate Varietal Status for Cryptococcus neoformans Serotype A Isolates

    PubMed Central

    Franzot, Sarah P.; Salkin, Ira F.; Casadevall, Arturo

    1999-01-01

    Cryptococcus neoformans var. neoformans presently includes isolates which have been determined by the immunologic reactivity of their capsular polysaccharides to be serotype A and those which have been determined to be serotype D. However, recent analyses of the URA5 sequences and DNA fingerprinting patterns suggest significant genetic differences between the two serotypes. Therefore, we propose to recognize these genotypic distinctions, as well as previously reported phenotypic differences, by restricting C. neoformans var. neoformans to isolates which are serotype D and describing a new variety, C. neoformans var. grubii, for serotype A isolates. PMID:9986871

  13. Initiation at closely spaced replication origins in a yeast chromosome.

    PubMed

    Brewer, B J; Fangman, W L

    1993-12-10

    Replication of eukaryotic chromosomes involves initiation at origins spaced an average of 50 to 100 kilobase pairs. In yeast, potential origins can be recognized as autonomous replication sequences (ARSs) that allow maintenance of plasmids. However, there are more ARS elements than active chromosomal origins. The possibility was examined that close spacing of ARSs can lead to inactive origins. Two ARSs located 6.5 kilobase pairs apart can indeed interfere with each other. Replication is initiated from one or the other ARS with equal probability, but rarely (< 5%) from both ARSs on the same DNA molecule.

  14. Synthesis, Physicochemical Properties, and Hydrogen Bonding of 4(5)-Substituted 1-H-Imidazole-2-carboxamide, A Potential Universal Reader for DNA Sequencing by Recognition Tunneling

    PubMed Central

    Liang, Feng; Li, Shengqing

    2012-01-01

    We have developed a chemical reagent that recognizes all naturally occurring DNA bases, a so called universal reader, for DNA sequencing by recognition tunnelling in nanopores.[1] The primary requirements for this type of molecules are the ability to form non-covalent complexes with individual DNA bases and to generate recognizable electronic signatures under an electrical bias. 1-H-imidazole-2-carboxamide was designed as such a recognition moiety to interact with the DNA bases through hydrogen bonding. In the present study, we first furnished a synthetic route to 1-H-imidazole-2-carboxamide containing a short ω-functionalized alkyl chain at its 4(5) position for its attachment to metal and carbon electrodes. The acid dissociation constants of the imidazole-2-carboxamide were then determined by UV spectroscopy. The data show that the 1-H-imidazole-2-carboxamide exists in a neutral form between pH 6–10. Density functional theory (DFT) and NMR studies indicate that the imidazole ring exists in prototropic tautomers. We propose an intramolecular mechanism for tautomerization of 1-H-imidazole-2-carboxamide. In addition, the imidazole-2-carboxamide can self-associate to form hydrogen bonded dimers. NMR titration found that naturally occurring nucleosides interacted with 1-H-imidazole-2-carboxamide through hydrogen bonding in a tendency of dG>dC≫dT> dA. These studies are indispensable to assisting us in understanding the molecular recognition that takes place in the nanopore where routinely used analytical tools such as NMR and FTIR cannot be conveniently applied. PMID:22461259

  15. Escherichia marmotae sp. nov., isolated from faeces of Marmota himalayana.

    PubMed

    Liu, Sha; Jin, Dong; Lan, Ruiting; Wang, Yiting; Meng, Qiong; Dai, Hang; Lu, Shan; Hu, Shoukui; Xu, Jianguo

    2015-07-01

    The taxonomic position of a group of seven closely related lactose-negative enterobacterial strains, which were isolated from fresh faecal samples of Marmota himalayana collected from the Qinghai-Tibetan plateau, China, was determined by using a polyphasic approach. Cells were Gram-reaction-negative, non-sporulating, non-motile, short rods (0.5-1 × 1-2.5 μm). By 16S rRNA gene sequences, the representative strain, HT073016(T), showed highest similarity values with Escherichia fergusonii ATCC 35469(T) at 99.3%, Escherichia coli ATCC 11775(T) at 99.2%, Escherichia albertii LMG 20976(T) at 98.9%, Escherichia hermannii CIP 103176(T) at 98.4%, and Escherichia vulneris ATCC 33821(T) at 97.7%. Phylogenetic analysis based on the 16S rRNA gene sequences showed that the seven strains formed a monophyletic group with five other species of the genus Escherichia. Digital DNA-DNA hybridization studies between strain HT073016(T) and five other species of the genus Escherichia showed that it shared less than 70% DNA-DNA relatedness with all known species of the genus Escherichia, supporting the novel species status of the strain. The DNA G+C content of strain HT073016(T) was 53.8 mol%. On the basis of phenotypic and phylogenetic characteristics, strain HT073016(T) and the six other HT073016(T)-like strains were clearly distinct from the type strains of other recognized species of the genus Escherichia and represent a novel species of the genus Escherichia, for which the name Escherichia marmotae sp. nov. is proposed, with HT073016(T) ( = CGMCC 1.12862(T) = DSM 28771(T)) as the type strain.

  16. Geographical and genospecies distribution of Borrelia burgdorferi sensu lato DNA detected in humans in the USA.

    PubMed

    Clark, Kerry L; Leydet, Brian F; Threlkeld, Clifford

    2014-05-01

    The present study investigated the cause of illness in human patients primarily in the southern USA with suspected Lyme disease based on erythema migrans-like skin lesions and/or symptoms consistent with early localized or late disseminated Lyme borreliosis. The study also included some patients from other states throughout the USA. Several PCR assays specific for either members of the genus Borrelia or only for Lyme group Borrelia spp. (Borrelia burgdorferi sensu lato), and DNA sequence analysis, were used to identify Borrelia spp. DNA in blood and skin biopsy samples from human patients. B. burgdorferi sensu lato DNA was found in both blood and skin biopsy samples from patients residing in the southern states and elsewhere in the USA, but no evidence of DNA from other Borrelia spp. was detected. Based on phylogenetic analysis of partial flagellin (flaB) gene sequences, strains that clustered separately with B. burgdorferi sensu stricto, Borrelia americana or Borrelia andersonii were associated with Lyme disease-like signs and symptoms in patients from the southern states, as well as from some other areas of the country. Strains most similar to B. burgdorferi sensu stricto and B. americana were found most commonly and appeared to be widely distributed among patients residing throughout the USA. The study findings suggest that human cases of Lyme disease in the southern USA may be more common than previously recognized and may also be caused by more than one species of B. burgdorferi sensu lato. This study provides further evidence that B. burgdorferi sensu stricto is not the only species associated with signs and/or symptoms consistent with Lyme borreliosis in the USA.

  17. Development of Solid-State Nanopore Technology for Life Detection

    NASA Technical Reports Server (NTRS)

    Bywaters, K. B.; Schmidt, H.; Vercoutere, W.; Deamer, D.; Hawkins, A. R.; Quinn, R. C.; Burton, A. S.; Mckay, C. P.

    2017-01-01

    Biomarkers for life on Earth are an important starting point to guide the search for life elsewhere. However, the search for life beyond Earth should incorporate technologies capable of recognizing an array of potential biomarkers beyond what we see on Earth, in order to minimize the risk of false negatives from life detection missions. With this in mind, charged linear polymers may be a universal signature for life, due to their ability to store information while also inherently reducing the tendency of complex tertiary structure formation that significantly inhibit replication. Thus, these molecules are attractive targets for biosignature detection as potential "self-sustaining chemical signatures." Examples of charged linear polymers, or polyelectrolytes, include deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) as well as synthetic polyelectrolytes that could potentially support life, including threose nucleic acid (TNA) and other xenonucleic acids (XNAs). Nanopore analysis is a novel technology that has been developed for singlemolecule sequencing with exquisite single nucleotide resolution which is also well-suited for analysis of polyelectrolyte molecules. Nanopore analysis has the ability to detect repeating sequences of electrical charges in organic linear polymers, and it is not molecule- specific (i.e. it is not restricted to only DNA or RNA). In this sense, it is a better life detection technique than approaches that are based on specific molecules, such as the polymerase chain reaction (PCR), which requires that the molecule being detected be composed of DNA.

  18. Caldithrix abyssi gen. nov., sp. nov., a nitrate-reducing, thermophilic, anaerobic bacterium isolated from a Mid-Atlantic Ridge hydrothermal vent, represents a novel bacterial lineage.

    PubMed

    Miroshnichenko, Margarita L; Kostrikina, Nadezhda A; Chernyh, Nikolai A; Pimenov, Nikolai V; Tourova, Tatyana P; Antipov, Alexei N; Spring, Stefan; Stackebrandt, Erko; Bonch-Osmolovskaya, Elizaveta A

    2003-01-01

    A novel, moderately thermophilic, strictly anaerobic, mixotrophic bacterium, designated strain LF13T, was isolated from a deep-sea hydrothermal chimney sample that was collected at a vent site at 14 degrees 45' N, 44 degrees 59' W on the Mid-Atlantic Ridge. Cells were Gram-negative, thin, non-motile rods of variable length. Strain LF13T grew optimally at pH 6.8-7.0 and 60 degrees C with 2.5% (w/v) NaCl. It grew chemo-organoheterotrophically, fermenting proteinaceous substrates, pyruvate and Casamino acids. The strain was able to grow by respiration, utilizing molecular hydrogen (chemolithoheterotrophically) or acetate as electron donors and nitrate as an electron acceptor. Ammonium was formed in the course of denitrification. One-hundred milligrams of yeast extract per litre were required for growth of the strain. The G + C content of the genomic DNA of strain LF13T was 42.5 mol%. Neither 16S rDNA sequence similarity values nor phylogenetic analysis unambiguously related strain LF13T with members of any recognized bacterial phyla. On the basis of 16S rDNA sequence comparisons, and in combination with physiological and morphological traits, a novel genus, Caldithrix, is proposed, with strain LF13T (= DSM 13497T =VKM B-2286T) representing the type species, Caldithrix abyssi.

  19. Tenacibaculum aestuarii sp. nov., isolated from a tidal flat sediment in Korea.

    PubMed

    Jung, Seo-Youn; Oh, Tae-Kwang; Yoon, Jung-Hoon

    2006-07-01

    A novel Tenacibaculum-like bacterial strain, SMK-4(T), was isolated from a tidal flat sediment in Korea. Strain SMK-4(T) was Gram-negative, pale yellow-pigmented and rod-shaped. It grew optimally at 30-37 degrees C and in the presence of 2-3 % (w/v) NaCl. It contained MK-6 as the predominant menaquinone and iso-C(15 : 0), iso-C(16 : 0) 3-OH and C(16 : 1)omega7c and/or iso-C(15 : 0) 2-OH as the major fatty acids (>10 % of total fatty acids). The DNA G+C content was 33.6 mol%. Phylogenetic trees based on 16S rRNA gene sequences showed that strain SMK-4(T) fell within the evolutionary radiation encompassed by the genus Tenacibaculum. Strain SMK-4(T) exhibited 16S rRNA gene sequence similarity levels of 95.2-98.6 % with respect to the type strains of recognized Tenacibaculum species. DNA-DNA relatedness levels and differential phenotypic properties made it possible to categorize strain SMK-4(T) as a species that is separate from previously described Tenacibaculum species. On the basis of phenotypic properties and phylogenetic and genetic distinctiveness, strain SMK-4(T) (=KCTC 12569(T)=JCM 13491(T)) should be classified as a novel Tenacibaculum species, for which the name Tenacibaculum aestuarii sp. nov. is proposed.

  20. Orchestration of Molecular Information through Higher Order Chemical Recognition

    NASA Astrophysics Data System (ADS)

    Frezza, Brian M.

    Broadly defined, higher order chemical recognition is the process whereby discrete chemical building blocks capable of specifically binding to cognate moieties are covalently linked into oligomeric chains. These chains, or sequences, are then able to recognize and bind to their cognate sequences with a high degree of cooperativity. Principally speaking, DNA and RNA are the most readily obtained examples of this chemical phenomenon, and function via Watson-Crick cognate pairing: guanine pairs with cytosine and adenine with thymine (DNA) or uracil (RNA), in an anti-parallel manner. While the theoretical principles, techniques, and equations derived herein apply generally to any higher-order chemical recognition system, in practice we utilize DNA oligomers as a model-building material to experimentally investigate and validate our hypotheses. Historically, general purpose information processing has been a task limited to semiconductor electronics. Molecular computing on the other hand has been limited to ad hoc approaches designed to solve highly specific and unique computation problems, often involving components or techniques that cannot be applied generally in a manner suitable for precise and predictable engineering. Herein, we provide a fundamental framework for harnessing high-order recognition in a modular and programmable fashion to synthesize molecular information process networks of arbitrary construction and complexity. This document provides a solid foundation for routinely embedding computational capability into chemical and biological systems where semiconductor electronics are unsuitable for practical application.

Top