dna sequence encoding: Topics by Science.gov

Sample records for dna sequence encoding

Multiplexed Sequence Encoding: A Framework for DNA Communication.

PubMed

Zakeri, Bijan; Carr, Peter A; Lu, Timothy K

2016-01-01

Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication-data encoding, data transfer & data extraction-and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system-Multiplexed Sequence Encoding (MuSE)-that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA.
Toward a Better Compression for DNA Sequences Using Huffman Encoding

PubMed Central

Almarri, Badar; Al Yami, Sultan; Huang, Chun-Hsi

2017-01-01

Abstract Due to the significant amount of DNA data that are being generated by next-generation sequencing machines for genomes of lengths ranging from megabases to gigabases, there is an increasing need to compress such data to a less space and a faster transmission. Different implementations of Huffman encoding incorporating the characteristics of DNA sequences prove to better compress DNA data. These implementations center on the concepts of selecting frequent repeats so as to force a skewed Huffman tree, as well as the construction of multiple Huffman trees when encoding. The implementations demonstrate improvements on the compression ratios for five genomes with lengths ranging from 5 to 50 Mbp, compared with the standard Huffman tree algorithm. The research hence suggests an improvement on all such DNA sequence compression algorithms that use the conventional Huffman encoding. The research suggests an improvement on all DNA sequence compression algorithms that use the conventional Huffman encoding. Accompanying software is publicly available (AL-Okaily, 2016). PMID:27960065
Toward a Better Compression for DNA Sequences Using Huffman Encoding.

PubMed

Al-Okaily, Anas; Almarri, Badar; Al Yami, Sultan; Huang, Chun-Hsi

2017-04-01

Due to the significant amount of DNA data that are being generated by next-generation sequencing machines for genomes of lengths ranging from megabases to gigabases, there is an increasing need to compress such data to a less space and a faster transmission. Different implementations of Huffman encoding incorporating the characteristics of DNA sequences prove to better compress DNA data. These implementations center on the concepts of selecting frequent repeats so as to force a skewed Huffman tree, as well as the construction of multiple Huffman trees when encoding. The implementations demonstrate improvements on the compression ratios for five genomes with lengths ranging from 5 to 50 Mbp, compared with the standard Huffman tree algorithm. The research hence suggests an improvement on all such DNA sequence compression algorithms that use the conventional Huffman encoding. The research suggests an improvement on all DNA sequence compression algorithms that use the conventional Huffman encoding. Accompanying software is publicly available (AL-Okaily, 2016 ).
Multiplexed Sequence Encoding: A Framework for DNA Communication

PubMed Central

Zakeri, Bijan; Carr, Peter A.; Lu, Timothy K.

2016-01-01

Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication—data encoding, data transfer & data extraction—and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system—Multiplexed Sequence Encoding (MuSE)—that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA. PMID:27050646
Local alignment of two-base encoded DNA sequence

PubMed Central

Homer, Nils; Merriman, Barry; Nelson, Stanley F

2009-01-01

Background DNA sequence comparison is based on optimal local alignment of two sequences using a similarity score. However, some new DNA sequencing technologies do not directly measure the base sequence, but rather an encoded form, such as the two-base encoding considered here. In order to compare such data to a reference sequence, the data must be decoded into sequence. The decoding is deterministic, but the possibility of measurement errors requires searching among all possible error modes and resulting alignments to achieve an optimal balance of fewer errors versus greater sequence similarity. Results We present an extension of the standard dynamic programming method for local alignment, which simultaneously decodes the data and performs the alignment, maximizing a similarity score based on a weighted combination of errors and edits, and allowing an affine gap penalty. We also present simulations that demonstrate the performance characteristics of our two base encoded alignment method and contrast those with standard DNA sequence alignment under the same conditions. Conclusion The new local alignment algorithm for two-base encoded data has substantial power to properly detect and correct measurement errors while identifying underlying sequence variants, and facilitating genome re-sequencing efforts based on this form of sequence data. PMID:19508732
Designing universal primers for the isolation of DNA sequences encoding Proanthocyanidins biosynthetic enzymes in Crataegus aronia

PubMed Central

2012-01-01

Background Hawthorn is the common name of all plant species in the genus Crataegus, which belongs to the Rosaceae family. Crataegus are considered useful medicinal plants because of their high content of proanthocyanidins (PAs) and other related compounds. To improve PAs production in Crataegus tissues, the sequences of genes encoding PAs biosynthetic enzymes are required. Findings Different bioinformatics tools, including BLAST, multiple sequence alignment and alignment PCR analysis were used to design primers suitable for the amplification of DNA fragments from 10 candidate genes encoding enzymes involved in PAs biosynthesis in C. aronia. DNA sequencing results proved the utility of the designed primers. The primers were used successfully to amplify DNA fragments of different PAs biosynthesis genes in different Rosaceae plants. Conclusion To the best of our knowledge, this is the first use of the alignment PCR approach to isolate DNA sequences encoding PAs biosynthetic enzymes in Rosaceae plants. PMID:22883984
Designing universal primers for the isolation of DNA sequences encoding Proanthocyanidins biosynthetic enzymes in Crataegus aronia.

PubMed

Zuiter, Afnan Saeid; Sawwan, Jammal; Al Abdallat, Ayed

2012-08-10

Hawthorn is the common name of all plant species in the genus Crataegus, which belongs to the Rosaceae family. Crataegus are considered useful medicinal plants because of their high content of proanthocyanidins (PAs) and other related compounds. To improve PAs production in Crataegus tissues, the sequences of genes encoding PAs biosynthetic enzymes are required. Different bioinformatics tools, including BLAST, multiple sequence alignment and alignment PCR analysis were used to design primers suitable for the amplification of DNA fragments from 10 candidate genes encoding enzymes involved in PAs biosynthesis in C. aronia. DNA sequencing results proved the utility of the designed primers. The primers were used successfully to amplify DNA fragments of different PAs biosynthesis genes in different Rosaceae plants. To the best of our knowledge, this is the first use of the alignment PCR approach to isolate DNA sequences encoding PAs biosynthetic enzymes in Rosaceae plants.
A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

PubMed Central

Walker, M D; Park, C W; Rosen, A; Aronheim, A

1990-01-01

Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401
DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis.

PubMed

MacConnell, Andrew B; McEnaney, Patrick J; Cavett, Valerie J; Paegel, Brian M

2015-09-14

The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the "structure elucidation problem": the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very molecular features that confer binding potency and specificity, such as stereochemistry, regiochemistry, and scaffold rigidity, are conspicuously absent from most libraries because isomerism introduces mass redundancy and diverse scaffolds yield uninterpretable MS fragmentation. Here we present DNA-encoded solid-phase synthesis (DESPS), comprising parallel compound synthesis in organic solvent and aqueous enzymatic ligation of unprotected encoding dsDNA oligonucleotides. Computational encoding language design yielded 148 thermodynamically optimized sequences with Hamming string distance ≥ 3 and total read length <100 bases for facile sequencing. Ligation is efficient (70% yield), specific, and directional over 6 encoding positions. A series of isomers served as a testbed for DESPS's utility in split-and-pool diversification. Single-bead quantitative PCR detected 9 × 10(4) molecules/bead and sequencing allowed for elucidation of each compound's synthetic history. We applied DESPS to the combinatorial synthesis of a 75,645-member OBOC library containing scaffold, stereochemical and regiochemical diversity using mixed-scale resin (160-μm quality control beads and 10-μm screening beads). Tandem DNA sequencing/MALDI-TOF MS analysis of 19 quality control beads showed excellent agreement (<1 ppt) between DNA sequence-predicted mass and the observed mass. DESPS synergistically unites the advantages of solid-phase synthesis and DNA encoding, enabling single-bead structural elucidation of complex compounds and synthesis using reactions normally considered incompatible with unprotected DNA. The widespread availability of inexpensive oligonucleotide synthesis, enzymes, DNA sequencing, and PCR make implementation of DESPS straightforward, and may prompt the chemistry community to revisit the synthesis of more complex and diverse libraries.
Cloning and sequence analysis of complementary DNA encoding an aberrantly rearranged human T-cell gamma chain.

PubMed Central

Dialynas, D P; Murre, C; Quertermous, T; Boss, J M; Leiden, J M; Seidman, J G; Strominger, J L

1986-01-01

Complementary DNA (cDNA) encoding a human T-cell gamma chain has been cloned and sequenced. At the junction of the variable and joining regions, there is an apparent deletion of two nucleotides in the human cDNA sequence relative to the murine gamma-chain cDNA sequence, resulting simultaneously in the generation of an in-frame stop codon and in a translational frameshift. For this reason, the sequence presented here encodes an aberrantly rearranged human T-cell gamma chain. There are several surprising differences between the deduced human and murine gamma-chain amino acid sequences. These include poor homology in the variable region, poor homology in a discrete segment of the constant region precisely bounded by the expected junctions of exon CII, and the presence in the human sequence of five potential sites for N-linked glycosylation. Images PMID:3458221
Novel encoding methods for DNA-templated chemical libraries.

PubMed

Li, Gang; Zheng, Wenlu; Liu, Ying; Li, Xiaoyu

2015-06-01

Among various types of DNA-encoded chemical libraries, DNA-templated library takes advantage of the sequence-specificity of DNA hybridization, enabling not only highly effective DNA-templated chemical reactions, but also high fidelity in library encoding. This brief review summarizes recent advances that have been made on the encoding strategies for DNA-templated libraries, and it also highlights their respective advantages and limitations for the preparation of DNA-encoded libraries. Copyright © 2015 Elsevier Ltd. All rights reserved.
Multi-modulus algorithm based on global artificial fish swarm intelligent optimization of DNA encoding sequences.

PubMed

Guo, Y C; Wang, H; Wu, H P; Zhang, M Q

2015-12-21

Aimed to address the defects of the large mean square error (MSE), and the slow convergence speed in equalizing the multi-modulus signals of the constant modulus algorithm (CMA), a multi-modulus algorithm (MMA) based on global artificial fish swarm (GAFS) intelligent optimization of DNA encoding sequences (GAFS-DNA-MMA) was proposed. To improve the convergence rate and reduce the MSE, this proposed algorithm adopted an encoding method based on DNA nucleotide chains to provide a possible solution to the problem. Furthermore, the GAFS algorithm, with its fast convergence and global search ability, was used to find the best sequence. The real and imaginary parts of the initial optimal weight vector of MMA were obtained through DNA coding of the best sequence. The simulation results show that the proposed algorithm has a faster convergence speed and smaller MSE in comparison with the CMA, the MMA, and the AFS-DNA-MMA.
Complementary DNA sequences encoding the multimammate rat MHC class II DQ alpha and beta chains and cross-species sequence comparison in rodents.

PubMed

de Bellocq, J Goüy; Leirs, H

2009-09-01

Sequences of the complete open reading frame (ORF) for rodents major histocompatibility complex (MHC) class II genes are rare. Multimammate rat (Mastomys natalensis) complementary DNA (cDNA) encoding the alpha and beta chains of MHC class II DQ gene was cloned from a rapid amplifications of cDNA Emds (RACE) cDNA library. The ORFs consist of 801 and 771 bp encoding 266 and 256 amino acid residues for DQB and DQA, respectively. The genomic structure of Mana-DQ genes is globally analogous to that described for other rodents except for the insertion of a serine residue in the signal peptide of Mana-DQB, which is unique among known rodents.
DNA-encoded chemistry: enabling the deeper sampling of chemical space.

PubMed

Goodnow, Robert A; Dumelin, Christoph E; Keefe, Anthony D

2017-02-01

DNA-encoded chemical library technologies are increasingly being adopted in drug discovery for hit and lead generation. DNA-encoded chemistry enables the exploration of chemical spaces four to five orders of magnitude more deeply than is achievable by traditional high-throughput screening methods. Operation of this technology requires developing a range of capabilities including aqueous synthetic chemistry, building block acquisition, oligonucleotide conjugation, large-scale molecular biological transformations, selection methodologies, PCR, sequencing, sequence data analysis and the analysis of large chemistry spaces. This Review provides an overview of the development and applications of DNA-encoded chemistry, highlighting the challenges and future directions for the use of this technology.
Hiding message into DNA sequence through DNA coding and chaotic maps.

PubMed

Liu, Guoyan; Liu, Hongjun; Kadir, Abdurahman

2014-09-01

The paper proposes an improved reversible substitution method to hide data into deoxyribonucleic acid (DNA) sequence, and four measures have been taken to enhance the robustness and enlarge the hiding capacity, such as encode the secret message by DNA coding, encrypt it by pseudo-random sequence, generate the relative hiding locations by piecewise linear chaotic map, and embed the encoded and encrypted message into a randomly selected DNA sequence using the complementary rule. The key space and the hiding capacity are analyzed. Experimental results indicate that the proposed method has a better performance compared with the competing methods with respect to robustness and capacity.
Cloning and sequencing of a gene encoding a novel extracellular neutral proteinase from Streptomyces sp. strain C5 and expression of the gene in Streptomyces lividans 1326.

PubMed Central

Lampel, J S; Aphale, J S; Lampel, K A; Strohl, W R

1992-01-01

The gene encoding a novel milk protein-hydrolyzing proteinase was cloned on a 6.56-kb SstI fragment from Streptomyces sp. strain C5 genomic DNA into Streptomyces lividans 1326 by using the plasmid vector pIJ702. The gene encoding the small neutral proteinase (snpA) was located within a 2.6-kb BamHI-SstI restriction fragment that was partially sequenced. The molecular mass of the deduced amino acid sequence of the mature protein was determined to be 15,740, which corresponds very closely with the relative molecular mass of the purified protein (15,500) determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The N-terminal amino acid sequence of the purified neutral proteinase was determined, and the DNA encoding this sequence was found to be located within the sequenced DNA. The deduced amino acid sequence contains a conserved zinc binding site, although secondary ligand binding and active sites typical of thermolysinlike metalloproteinases are absent. The combination of its small size, deduced amino acid sequence, and substrate and inhibition profile indicate that snpA encodes a novel neutral proteinase. Images PMID:1569011
An Integrated Microfluidic Processor for DNA-Encoded Combinatorial Library Functional Screening

PubMed Central

2017-01-01

DNA-encoded synthesis is rekindling interest in combinatorial compound libraries for drug discovery and in technology for automated and quantitative library screening. Here, we disclose a microfluidic circuit that enables functional screens of DNA-encoded compound beads. The device carries out library bead distribution into picoliter-scale assay reagent droplets, photochemical cleavage of compound from the bead, assay incubation, laser-induced fluorescence-based assay detection, and fluorescence-activated droplet sorting to isolate hits. DNA-encoded compound beads (10-μm diameter) displaying a photocleavable positive control inhibitor pepstatin A were mixed (1920 beads, 729 encoding sequences) with negative control beads (58 000 beads, 1728 encoding sequences) and screened for cathepsin D inhibition using a biochemical enzyme activity assay. The circuit sorted 1518 hit droplets for collection following 18 min incubation over a 240 min analysis. Visual inspection of a subset of droplets (1188 droplets) yielded a 24% false discovery rate (1166 pepstatin A beads; 366 negative control beads). Using template barcoding strategies, it was possible to count hit collection beads (1863) using next-generation sequencing data. Bead-specific barcodes enabled replicate counting, and the false discovery rate was reduced to 2.6% by only considering hit-encoding sequences that were observed on >2 beads. This work represents a complete distributable small molecule discovery platform, from microfluidic miniaturized automation to ultrahigh-throughput hit deconvolution by sequencing. PMID:28199790
An Integrated Microfluidic Processor for DNA-Encoded Combinatorial Library Functional Screening.

PubMed

MacConnell, Andrew B; Price, Alexander K; Paegel, Brian M

2017-03-13

DNA-encoded synthesis is rekindling interest in combinatorial compound libraries for drug discovery and in technology for automated and quantitative library screening. Here, we disclose a microfluidic circuit that enables functional screens of DNA-encoded compound beads. The device carries out library bead distribution into picoliter-scale assay reagent droplets, photochemical cleavage of compound from the bead, assay incubation, laser-induced fluorescence-based assay detection, and fluorescence-activated droplet sorting to isolate hits. DNA-encoded compound beads (10-μm diameter) displaying a photocleavable positive control inhibitor pepstatin A were mixed (1920 beads, 729 encoding sequences) with negative control beads (58 000 beads, 1728 encoding sequences) and screened for cathepsin D inhibition using a biochemical enzyme activity assay. The circuit sorted 1518 hit droplets for collection following 18 min incubation over a 240 min analysis. Visual inspection of a subset of droplets (1188 droplets) yielded a 24% false discovery rate (1166 pepstatin A beads; 366 negative control beads). Using template barcoding strategies, it was possible to count hit collection beads (1863) using next-generation sequencing data. Bead-specific barcodes enabled replicate counting, and the false discovery rate was reduced to 2.6% by only considering hit-encoding sequences that were observed on >2 beads. This work represents a complete distributable small molecule discovery platform, from microfluidic miniaturized automation to ultrahigh-throughput hit deconvolution by sequencing.
Methods and materials relating to IMPDH and GMP production

DOEpatents

Collart, Frank R.; Huberman, Eliezer

1997-01-01

Disclosed are purified and isolated DNA sequences encoding eukaryotic proteins possessing biological properties of inosine 5'-monophosphate dehydrogenase ("IMPDH"). Illustratively, mammalian (e.g., human) IMPDH-encoding DNA sequences are useful in transformation or transfection of host cells for the large scale recombinant production of the enzymatically active expression products and/or products (e.g., GMP) resulting from IMPDH catalyzed synthesis in cells. Vectors including IMPDH-encoding DNA sequences are useful in gene amplification procedures. Recombinant proteins and synthetic peptides provided by the invention are useful as immunological reagents and in the preparation of antibodies (including polyclonal and monoclonal antibodies) for quantitative detection of IMPDH.
Identification of a novel circular DNA virus in pig feces

USDA-ARS?s Scientific Manuscript database

Metagenomic analysis of fecal samples collected from a swine with diarrhea detected sequences encoding a replicase (Rep) protein typically found in small circular Rep-encoding ssDNA (CRESS-DNA) viruses. The complete 3,062 nucleotide genome was generated and found to encode two bi-directionally trans...

DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

NASA Astrophysics Data System (ADS)

Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

1984-08-01

A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.
The cDNA sequence of a neutral horseradish peroxidase.

PubMed

Bartonek-Roxå, E; Eriksson, H; Mattiasson, B

1991-02-16

A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail and the deduced protein contains 327 amino acids which includes a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C) and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence. Compared to the earlier described HRP-C this is three glycosylation sites less. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight of several thousands less than the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, not earlier described, horseradish peroxidase group.
Cloning of novel cellulases from cellulolytic fungi: heterologous expression of a family 5 glycoside hydrolase from Trametes versicolor in Pichia pastoris.

PubMed

Salinas, Alejandro; Vega, Marcela; Lienqueo, María Elena; Garcia, Alejandro; Carmona, Rene; Salazar, Oriana

2011-12-10

Total cDNA isolated from cellulolytic fungi cultured in cellulose was examined for the presence of sequences encoding for endoglucanases. Novel sequences encoding for glycoside hydrolases (GHs) were identified in Fusarium oxysporum, Ganoderma applanatum and Trametes versicolor. The cDNA encoding for partial sequences of GH family 61 cellulases from F. oxysporum and G. applanatum shares 58 and 68% identity with endoglucanases from Glomerella graminicola and Laccaria bicolor, respectively. A new GH family 5 endoglucanase from T. versicolor was also identified. The cDNA encoding for the mature protein was completely sequenced. This enzyme shares 96% identity with Trametes hirsuta endoglucanase and 22% with Trichoderma reesei endoglucanase II (EGII). The enzyme, named TvEG, has N-terminal family 1 carbohydrate binding module (CBM1). The full length cDNA was cloned into the pPICZαB vector and expressed as an active, extracellular enzyme in the methylotrophic yeast Pichia pastoris. Preliminary studies suggest that T. versicolor could be useful for lignocellulose degradation. Copyright © 2011 Elsevier Inc. All rights reserved.
Cloning, sequencing and expression in MEL cells of a cDNA encoding the mouse ribosomal protein S5.

PubMed

Vanegas, N; Castañeda, V; Santamaría, D; Hernández, P; Schvartzman, J B; Krimer, D B

1997-06-05

We describe the isolation and characterization of a cDNA encoding the mouse S5 ribosomal protein. It was isolated from a MEL (murine erythroleukemia) cell cDNA library by differential hybridization as a down regulated sequence during HMBA-induced differentiation. Northern series analysis showed that S5 mRNA expression is reduced 5-fold throughout the differentiation process. The mouse S5 mRNA is 760 bp long and encodes for a 204 amino acid protein with 94% homology with the human and rat S5.
Acetylcholinesterase of the Sand Fly, Phlebotomus papatasi (Scopoli): cDNA Sequence, Baculovirus Expression, and Biochemical Properties

DTIC Science & Technology

2013-01-01

identity to acetylcholinesterase mRNA sequences of Culex tritaeniorhynchus and Lutzomyia longipalpis, respectively. The P. papatasi cDNA ORF encoded a...tritaeniorhynchus and Lutzomyia longipalpis, respectively. The P. papatasi cDNA ORF encoded a 710-amino acid protein [GenBank: AFP20868] exhibiting 85...improve effectiveness of pesticide application for control of the new world sand fly Lutzomyia longipalpis in chicken sheds [13]. Attempts to control
An Efficient Method for Electroporation of Small Interfering RNAs into ENCODE Project Tier 1 GM12878 and K562 Cell Lines.

PubMed

Muller, Ryan Y; Hammond, Ming C; Rio, Donald C; Lee, Yeon J

2015-12-01

The Encyclopedia of DNA Elements (ENCODE) Project aims to identify all functional sequence elements in the human genome sequence by use of high-throughput DNA/cDNA sequencing approaches. To aid the standardization, comparison, and integration of data sets produced from different technologies and platforms, the ENCODE Consortium selected several standard human cell lines to be used by the ENCODE Projects. The Tier 1 ENCODE cell lines include GM12878, K562, and H1 human embryonic stem cell lines. GM12878 is a lymphoblastoid cell line, transformed with the Epstein-Barr virus, that was selected by the International HapMap Project for whole genome and transcriptome sequencing by use of the Illumina platform. K562 is an immortalized myelogenous leukemia cell line. The GM12878 cell line is attractive for the ENCODE Projects, as it offers potential synergy with the International HapMap Project. Despite the vast amount of sequencing data available on the GM12878 cell line through the ENCODE Project, including transcriptome, chromatin immunoprecipitation-sequencing for histone marks, and transcription factors, no small interfering siRNA-mediated knockdown studies have been performed in the GM12878 cell line, as cationic lipid-mediated transfection methods are inefficient for lymphoid cell lines. Here, we present an efficient and reproducible method for transfection of a variety of siRNAs into the GM12878 and K562 cell lines, which subsequently results in targeted protein depletion.
Characterization of Urtica dioica agglutinin isolectins and the encoding gene family.

PubMed

Does, M P; Ng, D K; Dekker, H L; Peumans, W J; Houterman, P M; Van Damme, E J; Cornelissen, B J

1999-01-01

Urtica dioica agglutinin (UDA) has previously been found in roots and rhizomes of stinging nettles as a mixture of UDA-isolectins. Protein and cDNA sequencing have shown that mature UDA is composed of two hevein domains and is processed from a precursor protein. The precursor contains a signal peptide, two in-tandem hevein domains, a hinge region and a carboxyl-terminal chitinase domain. Genomic fragments encoding precursors for UDA-isolectins have been amplified by five independent polymerase chain reactions on genomic DNA from stinging nettle ecotype Weerselo. One amplified gene was completely sequenced. As compared to the published cDNA sequence, the genomic sequence contains, besides two basepair substitutions, two introns located at the same positions as in other plant chitinases. By partial sequence analysis of 40 amplified genes, 16 different genes were identified which encode seven putative UDA-isolectins. The deduced amino acid sequences share 78.9-98.9% identity. In extracts of roots and rhizomes of stinging nettle ecotype Weerselo six out of these seven isolectins were detected by mass spectrometry. One of them is an acidic form, which has not been identified before. Our results demonstrate that UDA is encoded by a large gene family.
Horse cDNA clones encoding two MHC class I genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Barbis, D.P.; Maher, J.K.; Stanek, J.

1994-12-31

Two full-length clones encoding MHC class I genes were isolated by screening a horse cDNA library, using a probe encoding in human HLA-A2.2Y allele. The library was made in the pcDNA1 vector (Invitrogen, San Diego, CA), using mRNA from peripheral blood lymphocytes obtained from a Thoroughbred stallion (No. 0834) homozygous for a common horse MHC haplotype (ELA-A2, -B2, -D2; Antczak et al. 1984; Donaldson et al. 1988). The clones were sequenced, using SP6 and T7 universal primers and horse-specific oligonucleotides designed to extend previously determined sequences.
The intrinsic combinatorial organization and information theoretic content of a sequence are correlated to the DNA encoded nucleosome organization of eukaryotic genomes.

PubMed

Utro, Filippo; Di Benedetto, Valeria; Corona, Davide F V; Giancarlo, Raffaele

2016-03-15

Thanks to research spanning nearly 30 years, two major models have emerged that account for nucleosome organization in chromatin: statistical and sequence specific. The first is based on elegant, easy to compute, closed-form mathematical formulas that make no assumptions of the physical and chemical properties of the underlying DNA sequence. Moreover, they need no training on the data for their computation. The latter is based on some sequence regularities but, as opposed to the statistical model, it lacks the same type of closed-form formulas that, in this case, should be based on the DNA sequence only. We contribute to close this important methodological gap between the two models by providing three very simple formulas for the sequence specific one. They are all based on well-known formulas in Computer Science and Bioinformatics, and they give different quantifications of how complex a sequence is. In view of how remarkably well they perform, it is very surprising that measures of sequence complexity have not even been considered as candidates to close the mentioned gap. We provide experimental evidence that the intrinsic level of combinatorial organization and information-theoretic content of subsequences within a genome are strongly correlated to the level of DNA encoded nucleosome organization discovered by Kaplan et al Our results establish an important connection between the intrinsic complexity of subsequences in a genome and the intrinsic, i.e. DNA encoded, nucleosome organization of eukaryotic genomes. It is a first step towards a mathematical characterization of this latter 'encoding'. Supplementary data are available at Bioinformatics online. futro@us.ibm.com. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
The bglA Gene of Aspergillus kawachii Encodes Both Extracellular and Cell Wall-Bound β-Glucosidases

PubMed Central

Iwashita, Kazuhiro; Nagahara, Tatsuya; Kimura, Hitoshi; Takano, Makoto; Shimoi, Hitoshi; Ito, Kiyoshi

1999-01-01

We cloned the genomic DNA and cDNA of bglA, which encodes β-glucosidase in Aspergillus kawachii, based on a partial amino acid sequence of purified cell wall-bound β-glucosidase CB-1. The nucleotide sequence of the cloned bglA gene revealed a 2,933-bp open reading frame with six introns that encodes an 860-amino-acid protein. Based on the deduced amino acid sequence, we concluded that the bglA gene encodes cell wall-bound β-glucosidase CB-1. The amino acid sequence exhibited high levels of homology with the amino acid sequences of fungal β-glucosidases classified in subfamily B. We expressed the bglA cDNA in Saccharomyces cerevisiae and detected the recombinant β-glucosidase in the periplasm fraction of the recombinant yeast. A. kawachii can produce two extracellular β-glucosidases (EX-1 and EX-2) in addition to the cell wall-bound β-glucosidase. A. kawachii in which the bglA gene was disrupted produced none of the three β-glucosidases, as determined by enzyme assays and a Western blot analysis. Thus, we concluded that the bglA gene encodes both extracellular and cell wall-bound β-glucosidases in A. kawachii. PMID:10584016
Characterization and mapping of cDNA encoding aspartate aminotransferase in rice, Oryza sativa L.

PubMed

Song, J; Yamamoto, K; Shomura, A; Yano, M; Minobe, Y; Sasaki, T

1996-10-31

Fifteen cDNA clones, putatively identified as encoding aspartate aminotransferase (AST, EC 2.6.1.1.), were isolated and partially sequenced. Together with six previously isolated clones putatively identified to encode ASTs (Sasaki, et al. 1994, Plant Journal 6, 615-624), their sequences were characterized and classified into 4 cDNA species. Two of the isolated clones, C60213 and C2079, were full-length cDNAs, and their complete nucleotide sequences were determined. C60213 was 1612 bp long and its deduced amino acid sequence showed 88% homology with that of Panicum miliaceum L. mitochondrial AST. The C60213-encoded protein had an N-terminal amino acid sequence that was characteristic of a mitochondrial transit peptide. On the other hand, C2079 was 1546 bp long and had 91% amino acid sequence homology with P. miliaceum L. cytosolic AST but lacked in the transit peptide sequence. The homologies of nucleotide sequences and deduced amino acid sequences of C2079 and C60213 were 54% and 52%, respectively. C2079 and C60213 were mapped on chromosomes 1 and 6, respectively, by restriction fragment length polymorphism linkage analysis. Northern blot analysis using C2079 as a probe revealed much higher transcript levels in callus and root than in green and etiolated shoots, suggesting tissue-specific variations of AST gene expression.
Characterization of a cDNA encoding a protein involved in formation of the skeleton during development of the sea urchin Lytechinus pictus.

PubMed

Livingston, B T; Shaw, R; Bailey, A; Wilt, F

1991-12-01

In order to investigate the role of proteins in the formation of mineralized tissues during development, we have isolated a cDNA that encodes a protein that is a component of the organic matrix of the skeletal spicule of the sea urchin, Lytechinus pictus. The expression of the RNA encoding this protein is regulated over development and is localized to the descendents of the micromere lineage. Comparison of the sequence of this cDNA to homologous cDNAs from other species of urchin reveal that the protein is basic and contains three conserved structural motifs: a signal peptide, a proline-rich region, and an unusual region composed of a series of direct repeats. Studies on the protein encoded by this cDNA confirm the predicted reading frame deduced from the nucleotide sequence and show that the protein is secreted and not glycosylated. Comparison of the amino acid sequence to databases reveal that the repeat domain is similar to proteins that form a unique beta-spiral supersecondary structure.
Storing data encoded DNA in living organisms

DOEpatents

Wong,; Pak C. , Wong; Kwong K. , Foote; Harlan, P [Richland, WA

2006-06-06

Current technologies allow the generation of artificial DNA molecules and/or the ability to alter the DNA sequences of existing DNA molecules. With a careful coding scheme and arrangement, it is possible to encode important information as an artificial DNA strand and store it in a living host safely and permanently. This inventive technology can be used to identify origins and protect R&D investments. It can also be used in environmental research to track generations of organisms and observe the ecological impact of pollutants. Today, there are microorganisms that can survive under extreme conditions. As well, it is advantageous to consider multicellular organisms as hosts for stored information. These living organisms can provide as memory housing and protection for stored data or information. The present invention provides well for data storage in a living organism wherein at least one DNA sequence is encoded to represent data and incorporated into a living organism.
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-02-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed Central

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-01-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators. Images PMID:3257578
Cloning and sequence analysis of a cDNA encoding the alpha-subunit of mouse beta-N-acetylhexosaminidase and comparison with the human enzyme.

PubMed Central

Beccari, T; Hoade, J; Orlacchio, A; Stirling, J L

1992-01-01

cDNAs encoding the mouse beta-N-acetylhexosaminidase alpha-subunit were isolated from a mouse testis library. The longest of these (1.7 kb) was sequenced and showed 83% similarity with the human alpha-subunit cDNA sequence. The 5' end of the coding sequence was obtained from a genomic DNA clone. Alignment of the human and mouse sequences showed that all three putative N-glycosylation sites are conserved, but that the mouse alpha-subunit has an additional site towards the C-terminus. All eight cysteines in the human sequence are conserved in the mouse. There are an additional two cysteines in the mouse alpha-subunit signal peptide. All amino acids affected in Tay-Sachs-disease mutations are conserved in the mouse. Images Fig. 1. PMID:1379046
cDNA encoding a polypeptide including a hevein sequence

DOEpatents

Raikhel, N.V.; Broekaert, W.F.; Namhai Chua; Kush, A.

1993-02-16

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids.
A novel chaotic image encryption scheme using DNA sequence operations

NASA Astrophysics Data System (ADS)

Wang, Xing-Yuan; Zhang, Ying-Qian; Bao, Xue-Mei

2015-10-01

In this paper, we propose a novel image encryption scheme based on DNA (Deoxyribonucleic acid) sequence operations and chaotic system. Firstly, we perform bitwise exclusive OR operation on the pixels of the plain image using the pseudorandom sequences produced by the spatiotemporal chaos system, i.e., CML (coupled map lattice). Secondly, a DNA matrix is obtained by encoding the confused image using a kind of DNA encoding rule. Then we generate the new initial conditions of the CML according to this DNA matrix and the previous initial conditions, which can make the encryption result closely depend on every pixel of the plain image. Thirdly, the rows and columns of the DNA matrix are permuted. Then, the permuted DNA matrix is confused once again. At last, after decoding the confused DNA matrix using a kind of DNA decoding rule, we obtain the ciphered image. Experimental results and theoretical analysis show that the scheme is able to resist various attacks, so it has extraordinarily high security.
cDNA encoding a polypeptide including a hevein sequence

DOEpatents

Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

1993-02-16

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a pu GOVERNMENT RIGHTS This application was funded under Department of Energy Contract DE-AC02-76ER01338. The U.S. Government has certain rights under this application and any patent issuing thereon.
DNA Sequence Analysis of a Complementary DNA for Cold-Regulated Arabidopsis Gene cor15 and Characterization of the COR 15 Polypeptide 1

PubMed Central

Lin, Chentao; Thomashow, Michael F.

1992-01-01

Previous studies have indicated that changes in gene expression occur in Arabidopsis thaliana L. (Heyn) during cold acclimation and that certain of the cor (cold-regulated) genes encode polypeptides that share the unusual property of remaining soluble upon boiling in aqueous solution. Here, we identify a cDNA clone for a cold-regulated gene encoding one of the “boiling-stable” polypeptides, COR15. DNA sequence analysis indicated that the gene, designated cor15, encodes a 14.7-kilodalton hydrophilic polypeptide having an N-terminal amino acid sequence that closely resembles transit peptides that target proteins to the stromal compartment of chloroplasts. Immunological studies indicated that COR15 is processed in vivo and that the mature polypeptide, COR 15m, is present in the soluble fraction of chloroplasts. Possible functions of COR 15m are discussed. ImagesFigure 1Figure 4Figure 5Figure 6Figure 7 PMID:16668917

Random access in large-scale DNA data storage.

PubMed

Organick, Lee; Ang, Siena Dumas; Chen, Yuan-Jyue; Lopez, Randolph; Yekhanin, Sergey; Makarychev, Konstantin; Racz, Miklos Z; Kamath, Govinda; Gopalan, Parikshit; Nguyen, Bichlien; Takahashi, Christopher N; Newman, Sharon; Parker, Hsing-Yeh; Rashtchian, Cyrus; Stewart, Kendall; Gupta, Gagan; Carlson, Robert; Mulligan, John; Carmean, Douglas; Seelig, Georg; Ceze, Luis; Strauss, Karin

2018-03-01

Synthetic DNA is durable and can encode digital data with high density, making it an attractive medium for data storage. However, recovering stored data on a large-scale currently requires all the DNA in a pool to be sequenced, even if only a subset of the information needs to be extracted. Here, we encode and store 35 distinct files (over 200 MB of data), in more than 13 million DNA oligonucleotides, and show that we can recover each file individually and with no errors, using a random access approach. We design and validate a large library of primers that enable individual recovery of all files stored within the DNA. We also develop an algorithm that greatly reduces the sequencing read coverage required for error-free decoding by maximizing information from all sequence reads. These advances demonstrate a viable, large-scale system for DNA data storage and retrieval.
Synchronous detection of ebolavirus conserved RNA sequences and ebolavirus-encoded miRNA-like fragment based on a zwitterionic copper (II) metal-organic framework.

PubMed

Qiu, Gui-Hua; Weng, Zi-Hua; Hu, Pei-Pei; Duan, Wen-Jun; Xie, Bao-Ping; Sun, Bin; Tang, Xiao-Yan; Chen, Jin-Xiang

2018-04-01

From a three-dimensional (3D) metal-organic framework (MOF) of {[Cu(Cmdcp)(phen)(H 2 O)] 2 ·9H 2 O} n (1, H 3 CmdcpBr = N-carboxymethyl-(3,5-dicarboxyl)pyridinium bromide, phen = phenanthroline), a sensitive and selective fluorescence sensor has been developed for the simultaneous detection of ebolavirus conserved RNA sequences and ebolavirus-encoded microRNA-like (miRNA-like) fragment. The results from molecular dynamics simulation confirmed that MOF 1 absorbs carboxyfluorescein (FAM)-tagged and 5(6)-carboxyrhodamine, triethylammonium salt (ROX)-tagged probe ss-DNA (probe DNA, P-DNA) by π … π stacking and hydrogen bonding, as well as additional electrostatic interactions to form a sensing platform of P-DNAs@1 with quenched FAM and ROX fluorescence. In the presence of targeted ebolavirus conserved RNA sequences or ebolavirus-encoded miRNA-like fragment, the fluorophore-labeled P-DNA hybridizes with the analyte to give a P-DNA@RNA duplex and released from MOF 1, triggering a fluorescence recovery. Simultaneous detection of two target RNAs has also been realized by single and synchronous fluorescence analysis. The formed sensing platform shows high sensitivity for ebolavirus conserved RNA sequences and ebolavirus-encoded miRNA-like fragment with detection limits at the picomolar level and high selectivity without cross-reaction between the two probes. MOF 1 thus shows the potential as an effective fluorescent sensing platform for the synchronous detection of two ebolavirus-related sequences, and offer improved diagnostic accuracy of Ebola virus disease. Copyright © 2017 Elsevier B.V. All rights reserved.
Isolation and characterization of the chicken trypsinogen gene family.

PubMed Central

Wang, K; Gan, L; Lee, I; Hood, L

1995-01-01

Based on genomic Southern hybridizations and cDNA sequence analyses, the chicken trypsinogen gene family can be divided into two multi-member subfamilies, a six-member trypsinogen I subfamily which encodes the cationic trypsin isoenzymes and a three-member trypsinogen II subfamily which encodes the anionic trypsin isoenzymes. The chicken cDNA and genomic clones containing these two subfamilies were isolated and characterized by DNA sequence analysis. The results indicated that the chicken trypsinogen genes encoded a signal peptide of 15 to 16 amino acid residues, an activation peptide of 9 to 10 residues and a trypsin of 223 amino acid residues. The chicken trypsinogens contain all the common catalytic and structural features for trypsins, including the catalytic triad His, Asp and Ser and the six disulphide bonds. The trypsinogen I and II subfamilies share approximately 70% sequence identity at the nucleotide and amino acid level. The sequence comparison among chicken trypsinogen subfamily members and trypsin sequences from other species suggested that the chicken trypsinogen genes may have evolved in coincidental or concerted fashion. Images Figure 6 Figure 7 PMID:7733885
Cloning and expression of cDNA coding for bouganin.

PubMed

den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo

2002-03-01

Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding for bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and had comparable toxicity on living cells as compared to the isolated native bouganin.
Molecular cloning and expression of the gene encoding the kinetoplast-associated type II DNA topoisomerase of Crithidia fasciculata.

PubMed

Pasion, S G; Hines, J C; Aebersold, R; Ray, D S

1992-01-01

A type II DNA topoisomerase, topoIImt, was shown previously to be associated with the kinetoplast DNA of the trypanosomatid Crithidia fasciculata. The gene encoding this kinetoplast-associated topoisomerase has been cloned by immunological screening of a Crithidia genomic expression library with monoclonal antibodies raised against the purified enzyme. The gene CfaTOP2 is a single copy gene and is expressed as a 4.8-kb polyadenylated transcript. The nucleotide sequence of CfaTOP2 has been determined and encodes a predicted polypeptide of 1239 amino acids with a molecular mass of 138,445. The identification of the cloned gene is supported by immunoblot analysis of the beta-galactosidase-CfaTOP2 fusion protein expressed in Escherichia coli and by analysis of tryptic peptide sequences derived from purified topoIImt. CfaTOP2 shares significant homology with nuclear type II DNA topoisomerases of other eukaryotes suggesting that in Crithidia both nuclear and mitochondrial forms of topoisomerase II are encoded by the same gene.
Characterization of the cDNA coding for rat brain cysteine sulfinate decarboxylase: brain and liver enzymes are identical proteins encoded by two distinct mRNAs.

PubMed

Tappaz, M; Bitoun, M; Reymond, I; Sergeant, A

1999-09-01

Cysteine sulfinate decarboxylase (CSD) is considered as the rate-limiting enzyme in the biosynthesis of taurine, a possible osmoregulator in brain. Through cloning and sequencing of RT-PCR and RACE-PCR products of rat brain mRNAs, a 2,396-bp cDNA sequence was obtained encoding a protein of 493 amino acids (calculated molecular mass, 55.2 kDa). The corresponding fusion protein showed a substrate specificity similar to that of the endogenous enzyme. The sequence of the encoded protein is identical to that encoded by liver CSD cDNA. Among other characterized amino acid decarboxylases, CSD shows the highest homology (54%) with either isoform of glutamic acid decarboxylase (GAD65 and GAD67). A single mRNA band, approximately 2.5 kb, was detected by northern blot in RNA extracts of brain, liver, and kidney. However, brain and liver CSD cDNA sequences differed in the 5' untranslated region. This indicates two forms of CSD mRNA. Analysis of PCR-amplified products of genomic DNA suggests that the brain form results from the use of a 3' alternative internal splicing site within an exon specifically found in liver CSD mRNA. Through selective RT-PCR the brain form was detected in brain only, whereas the liver form was found in liver and kidney. These results indicate a tissue-specific regulation of CSD genomic expression.
Cloning and sequencing of a gene encoding a 21-kilodalton outer membrane protein from Bordetella avium and expression of the gene in Salmonella typhimurium.

PubMed Central

Gentry-Weeks, C R; Hultsch, A L; Kelly, S M; Keith, J M; Curtiss, R

1992-01-01

Three gene libraries of Bordetella avium 197 DNA were prepared in Escherichia coli LE392 by using the cosmid vectors pCP13 and pYA2329, a derivative of pCP13 specifying spectinomycin resistance. The cosmid libraries were screened with convalescent-phase anti-B. avium turkey sera and polyclonal rabbit antisera against B. avium 197 outer membrane proteins. One E. coli recombinant clone produced a 56-kDa protein which reacted with convalescent-phase serum from a turkey infected with B. avium 197. In addition, five E. coli recombinant clones were identified which produced B. avium outer membrane proteins with molecular masses of 21, 38, 40, 43, and 48 kDa. At least one of these E. coli clones, which encoded the 21-kDa protein, reacted with both convalescent-phase turkey sera and antibody against B. avium 197 outer membrane proteins. The gene for the 21-kDa outer membrane protein was localized by Tn5seq1 mutagenesis, and the nucleotide sequence was determined by dideoxy sequencing. DNA sequence analysis of the 21-kDa protein revealed an open reading frame of 582 bases that resulted in a predicted protein of 194 amino acids. Comparison of the predicted amino acid sequence of the gene encoding the 21-kDa outer membrane protein with protein sequences in the National Biomedical Research Foundation protein sequence data base indicated significant homology to the OmpA proteins of Shigella dysenteriae, Enterobacter aerogenes, E. coli, and Salmonella typhimurium and to Neisseria gonorrhoeae outer membrane protein III, Haemophilus influenzae protein P6, and Pseudomonas aeruginosa porin protein F. The gene (ompA) encoding the B. avium 21-kDa protein hybridized with 4.1-kb DNA fragments from EcoRI-digested, chromosomal DNA of Bordetella pertussis and Bordetella bronchiseptica and with 6.0- and 3.2-kb DNA fragments from EcoRI-digested, chromosomal DNA of B. avium and B. avium-like DNA, respectively. A 6.75-kb DNA fragment encoding the B. avium 21-kDa protein was subcloned into the Asd+ vector pYA292, and the construct was introduced into the avirulent delta cya delta crp delta asd S. typhimurium chi 3987 for oral immunization of birds. The gene encoding the 21-kDa protein was expressed equivalently in B. avium 197, delta asd E. coli chi 6097, and S. typhimurium chi 3987 and was localized primarily in the cytoplasmic membrane and outer membrane. In preliminary studies on oral inoculation of turkey poults with S. typhimurium chi 3987 expressing the gene encoding the B. avium 21-kDa protein, it was determined that a single dose of the recombinant Salmonella vaccine failed to elicit serum antibodies against the 21-kDa protein and challenge with wild-type B. avium 197 resulted in colonization of the trachea and thymus with B. avium 197. Images PMID:1447140
Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

PubMed Central

Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

1986-01-01

A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461
Next-generation digital information storage in DNA.

PubMed

Church, George M; Gao, Yuan; Kosuri, Sriram

2012-09-28

Digital information is accumulating at an astounding rate, straining our ability to store and archive it. DNA is among the most dense and stable information media known. The development of new technologies in both DNA synthesis and sequencing make DNA an increasingly feasible digital storage medium. We developed a strategy to encode arbitrary digital information in DNA, wrote a 5.27-megabit book using DNA microchips, and read the book by using next-generation DNA sequencing.
Molecular Cloning and Characterization of cDNA Encoding a Putative Stress-Induced Heat-Shock Protein from Camelus dromedarius

PubMed Central

Elrobh, Mohamed S.; Alanazi, Mohammad S.; Khan, Wajahatullah; Abduljaleel, Zainularifeen; Al-Amri, Abdullah; Bazzi, Mohammad D.

2011-01-01

Heat shock proteins are ubiquitous, induced under a number of environmental and metabolic stresses, with highly conserved DNA sequences among mammalian species. Camelus dromedaries (the Arabian camel) domesticated under semi-desert environments, is well adapted to tolerate and survive against severe drought and high temperatures for extended periods. This is the first report of molecular cloning and characterization of full length cDNA of encoding a putative stress-induced heat shock HSPA6 protein (also called HSP70B′) from Arabian camel. A full-length cDNA (2417 bp) was obtained by rapid amplification of cDNA ends (RACE) and cloned in pET-b expression vector. The sequence analysis of HSPA6 gene showed 1932 bp-long open reading frame encoding 643 amino acids. The complete cDNA sequence of the Arabian camel HSPA6 gene was submitted to NCBI GeneBank (accession number HQ214118.1). The BLAST analysis indicated that C. dromedaries HSPA6 gene nucleotides shared high similarity (77–91%) with heat shock gene nucleotide of other mammals. The deduced 643 amino acid sequences (accession number ADO12067.1) showed that the predicted protein has an estimated molecular weight of 70.5 kDa with a predicted isoelectric point (pI) of 6.0. The comparative analyses of camel HSPA6 protein sequences with other mammalian heat shock proteins (HSPs) showed high identity (80–94%). Predicted camel HSPA6 protein structure using Protein 3D structural analysis high similarities with human and mouse HSPs. Taken together, this study indicates that the cDNA sequences of HSPA6 gene and its amino acid and protein structure from the Arabian camel are highly conserved and have similarities with other mammalian species. PMID:21845074
Nucleic acid molecules encoding isopentenyl monophosphate kinase, and methods of use

DOEpatents

Croteau, Rodney B.; Lange, Bernd M.

2001-01-01

A cDNA encoding isopentenyl monophosphate kinase (IPK) from peppermint (Mentha x piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of isopentenyl monophosphate kinase (SEQ ID NO:2), from peppermint (Mentha x piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for isopentenyl monophosphate kinase, or for a base sequence sufficiently complementary to at least a portion of isopentenyl monophosphate kinase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding isopentenyl monophosphate kinase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant isopentenyl monophosphate kinase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant isopentenyl monophosphate kinase may be used to obtain expression or enhanced expression of isopentenyl monophosphate kinase in plants in order to enhance the production of isopentenyl monophosphate kinase, or isoprenoids derived therefrom, or may be otherwise employed for the regulation or expression of isopentenyl monophosphate kinase, or the production of its products.
DNA polymerase having modified nucleotide binding site for DNA sequencing

DOEpatents

Tabor, Stanley; Richardson, Charles

1997-01-01

Modified gene encoding a modified DNA polymerase wherein the modified polymerase incorporates dideoxynucleotides at least 20-fold better compared to the corresponding deoxynucleotides as compared with the corresponding naturally-occurring DNA polymerase.
Isolation and characterization of a cDNA clone for the complete protein coding region of the delta subunit of the mouse acetylcholine receptor.

PubMed Central

LaPolla, R J; Mayne, K M; Davidson, N

1984-01-01

A mouse cDNA clone has been isolated that contains the complete coding region of a protein highly homologous to the delta subunit of the Torpedo acetylcholine receptor (AcChoR). The cDNA library was constructed in the vector lambda 10 from membrane-associated poly(A)+ RNA from BC3H-1 mouse cells. Surprisingly, the delta clone was selected by hybridization with cDNA encoding the gamma subunit of the Torpedo AcChoR. The nucleotide sequence of the mouse cDNA clone contains an open reading frame of 520 amino acids. This amino acid sequence exhibits 59% and 50% sequence homology to the Torpedo AcChoR delta and gamma subunits, respectively. However, the mouse nucleotide sequence has several stretches of high homology with the Torpedo gamma subunit cDNA, but not with delta. The mouse protein has the same general structural features as do the Torpedo subunits. It is encoded by a 3.3-kilobase mRNA. There is probably only one, but at most two, chromosomal genes coding for this or closely related sequences. Images PMID:6096870
Nucleotide sequence analysis establishes the role of endogenous murine leukemia virus DNA segments in formation of recombinant mink cell focus-forming murine leukemia viruses.

PubMed Central

Khan, A S

1984-01-01

The sequence of 363 nucleotides near the 3' end of the pol gene and 564 nucleotides from the 5' terminus of the env gene in an endogenous murine leukemia viral (MuLV) DNA segment, cloned from AKR/J mouse DNA and designated as A-12, was obtained. For comparison, the nucleotide sequence in an analogous portion of AKR mink cell focus-forming (MCF) 247 MuLV provirus was also determined. Sequence features unique to MCF247 MuLV DNA in the 3' pol and 5' env regions were identified by comparison with nucleotide sequences in analogous regions of NFS -Th-1 xenotropic and AKR ecotropic MuLV proviruses. These included (i) an insertion of 12 base pairs encoding four amino acids located 60 base pairs from the 3' terminus of the pol gene and immediately preceding the env gene, (ii) the deletion of 12 base pairs (encoding four amino acids) and the insertion of 3 base pairs (encoding one amino acid) in the 5' portion of the env gene, and (iii) single base substitutions resulting in 2 MCF247 -specific amino acids in the 3' pol and 23 in the 5' env regions. Nucleotide sequence comparison involving the 3' pol and 5' env regions of AKR MCF247 , NFS xenotropic, and AKR ecotropic MuLV proviruses with the cloned endogenous MuLV DNA indicated that MCF247 proviral DNA sequences were conserved in the cloned endogenous MuLV proviral segment. In fact, total nucleotide sequence identity existed between the endogenous MuLV DNA and the MCF247 MuLV provirus in the 3' portion of the pol gene. In the 5' env region, only 4 of 564 nucleotides were different, resulting in three amino acid changes between AKR MCF247 MuLV DNA and the endogenous MuLV DNA present in clone A-12. In addition, nucleotide sequence comparison indicated that Moloney-and Friend-MCF MuLVs were also highly related in the 3' pol and 5' env regions to the cloned endogenous MuLV DNA. These results establish the role of endogenous MuLV DNA segments in generation of recombinant MCF viruses. PMID:6328017
Sequence and transcriptional analysis of the barley ctDNA region upstream of psbD-psbC encoding trnK(UUU), rps16, trnQ(UUG), psbK, psbI, and trnS(GCU).

PubMed

Berends Sexton, T; Jones, J T; Mullet, J E

1990-05-01

A 6.25 kbp barley plastid DNA region located between psbA and psbD-psbC were sequenced and RNAs produced from this DNA were analyzed. TrnK(UUU), rps16 and trnQ(UUG) were located upstream of psbA. These genes were transcribed from the same DNA strand as psbA and multiple RNAs hybridized to them. TrnK and rsp16 contained introns; a 504 amino acid open reading frame (ORF504) was located within the trnK intron. Between trnQ and psbD-psbC was a 2.24 kbp region encoding psbK, psbI and trnS(GCU). PsbK and psbI are encoded on the same DNA strand as psbD-psbC whereas trnS(GCU) is transcribed from the opposite strand. Two large RNAs accumulate in barley etioplasts which contain psbK, psbI, anti-sense trnS(GCU) and psbD-psbC sequences. Other RNAs encode psbK and psbI only, or psbK only. The divergent trnS(GCU) located upstream of psbD-psbC and a second divergent trnS(UGA) located downstream of psbD-psbC were both expressed. Furthermore, RNA complementary to psbK and psbI mRNA was detected, suggesting that transcription from divergent overlapping transcription units may modulate expression from this DNA region.
Mollusk genes encoding lysine tRNA (UUU) contain introns.

PubMed

Matsuo, M; Abe, Y; Saruta, Y; Okada, N

1995-11-20

New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(CUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.
Chaotic Image Encryption Algorithm Based on Bit Permutation and Dynamic DNA Encoding.

PubMed

Zhang, Xuncai; Han, Feng; Niu, Ying

2017-01-01

With the help of the fact that chaos is sensitive to initial conditions and pseudorandomness, combined with the spatial configurations in the DNA molecule's inherent and unique information processing ability, a novel image encryption algorithm based on bit permutation and dynamic DNA encoding is proposed here. The algorithm first uses Keccak to calculate the hash value for a given DNA sequence as the initial value of a chaotic map; second, it uses a chaotic sequence to scramble the image pixel locations, and the butterfly network is used to implement the bit permutation. Then, the image is coded into a DNA matrix dynamic, and an algebraic operation is performed with the DNA sequence to realize the substitution of the pixels, which further improves the security of the encryption. Finally, the confusion and diffusion properties of the algorithm are further enhanced by the operation of the DNA sequence and the ciphertext feedback. The results of the experiment and security analysis show that the algorithm not only has a large key space and strong sensitivity to the key but can also effectively resist attack operations such as statistical analysis and exhaustive analysis.
Chaotic Image Encryption Algorithm Based on Bit Permutation and Dynamic DNA Encoding

PubMed Central

2017-01-01

With the help of the fact that chaos is sensitive to initial conditions and pseudorandomness, combined with the spatial configurations in the DNA molecule's inherent and unique information processing ability, a novel image encryption algorithm based on bit permutation and dynamic DNA encoding is proposed here. The algorithm first uses Keccak to calculate the hash value for a given DNA sequence as the initial value of a chaotic map; second, it uses a chaotic sequence to scramble the image pixel locations, and the butterfly network is used to implement the bit permutation. Then, the image is coded into a DNA matrix dynamic, and an algebraic operation is performed with the DNA sequence to realize the substitution of the pixels, which further improves the security of the encryption. Finally, the confusion and diffusion properties of the algorithm are further enhanced by the operation of the DNA sequence and the ciphertext feedback. The results of the experiment and security analysis show that the algorithm not only has a large key space and strong sensitivity to the key but can also effectively resist attack operations such as statistical analysis and exhaustive analysis. PMID:28912802
DNA sequence analysis of a 10 624 bp fragment of the left arm of chromosome XV from Saccharomyces cerevisiae reveals a RNA binding protein, a mitochondrial protein, two ribosomal proteins and two new open reading frames.

PubMed

Lafuente, M J; Gamo, F J; Gancedo, C

1996-09-01

We have determined the sequence of a 10624 bp DNA segment located in the left arm of chromosome XV of Saccharomyces cerevisiae. The sequence contains eight open reading frames (ORFs) longer than 100 amino acids. Two of them do not present significant homology with sequences found in the databases. The product of ORF o0553 is identical to the protein encoded by the gene SMF1. Internal to it there is another ORF, o0555 that is apparently expressed. The proteins encoded by ORFs o0559 and o0565 are identical to ribosomal proteins S19.e and L18 respectively. ORF o0550 encodes a protein with an RNA binding signature including RNP motifs and stretches rich in asparagine, glutamine and arginine.
DNA polymerase having modified nucleotide binding site for DNA sequencing

DOEpatents

Tabor, S.; Richardson, C.

1997-03-25

A modified gene encoding a modified DNA polymerase is disclosed. The modified polymerase incorporates dideoxynucleotides at least 20-fold better compared to the corresponding deoxynucleotides as compared with the corresponding naturally-occurring DNA polymerase. 6 figs.

Pea chloroplast DNA encodes homologues of Escherichia coli ribosomal subunit S2 and the beta'-subunit of RNA polymerase.

PubMed Central

Cozens, A L; Walker, J E

1986-01-01

The nucleotide sequence has been determined of a segment of 4680 bases of the pea chloroplast genome. It adjoins a sequence described elsewhere that encodes subunits of the F0 membrane domain of the ATP-synthase complex. The sequence contains a potential gene encoding a protein which is strongly related to the S2 polypeptide of Escherichia coli ribosomes. It also encodes an incomplete protein which contains segments that are homologous to the beta'-subunit of E. coli RNA polymerase and to yeast RNA polymerases II and III. PMID:3530249
Continuous in vitro evolution of bacteriophage RNA polymerase promoters

NASA Technical Reports Server (NTRS)

Breaker, R. R.; Banerji, A.; Joyce, G. F.

1994-01-01

Rapid in vitro evolution of bacteriophage T7, T3, and SP6 RNA polymerase promoters was achieved by a method that allows continuous enrichment of DNAs that contain functional promoter elements. This method exploits the ability of a special class of nucleic acid molecules to replicate continuously in the presence of both a reverse transcriptase and a DNA-dependent RNA polymerase. Replication involves the synthesis of both RNA and cDNA intermediates. The cDNA strand contains an embedded promoter sequence, which becomes converted to a functional double-stranded promoter element, leading to the production of RNA transcripts. Synthetic cDNAs, including those that contain randomized promoter sequences, can be used to initiate the amplification cycle. However, only those cDNAs that contain functional promoter sequences are able to produce RNA transcripts. Furthermore, each RNA transcript encodes the RNA polymerase promoter sequence that was responsible for initiation of its own transcription. Thus, the population of amplifying molecules quickly becomes enriched for those templates that encode functional promoters. Optimal promoter sequences for phage T7, T3, and SP6 RNA polymerase were identified after a 2-h amplification reaction, initiated in each case with a pool of synthetic cDNAs encoding greater than 10(10) promoter sequence variants.
Sequence characterization of cDNA sequence of encoding of an antimicrobial Peptide with no disulfide bridge from the Iranian mesobuthus eupeus venomous glands.

PubMed

Farajzadeh-Sheikh, Ahmad; Jolodar, Abbas; Ghaemmaghami, Shamsedin

2013-01-01

Scorpion venom glands produce some antimicrobial peptides (AMP) that can rapidly kill a broad range of microbes and have additional activities that impact on the quality and effectiveness of innate responses and inflammation. In this study, we reported the identification of a cDNA sequence encoding cysteine-free antimicrobial peptides isolated from venomous glands of this species. Total RNA was extracted from the Iranian mesobuthus eupeus venom glands, and cDNA was synthesized by using the modified oligo (dT). The cDNA was used as the template for applying Semi-nested RT- PCR technique. PCR Products were used for direct nucleotide sequencing and the results were compared with Gen Bank database. A 213 BP cDNA fragment encoding the entire coding region of an antimicrobial toxin from the Iranian scorpion M. Eupeus venom glands were isolated. The full-length sequence of the coding region was 210 BP contained an open reading frame of 70 amino with a predicted molecular mass of 7970.48 Da and theoretical Pi of 9.10. The open reading frame consists of 210 BP encoding a precursor of 70 amino acid residues, including a signal peptide of 23 residues a propertied of 7 residues, and a mature peptide of 34 residues with no disulfide bridge. The peptide has detectable sequence identity to the Lesser Asian mesobuthus eupeus MeVAMP-2 (98%), MeVAMP-9 (60%) and several previously described AMPs from other scorpion venoms including mesobuthus martensii (94%) and buthus occitanus Israelis (82%). The secondary structure of the peptide mainly consisted of α-helical structure which was generally conserved by previously reported scorpion counterparts. The phylogenetic analysis showed that the Iranian MeAMP-like toxin was similar but not identical with that of venom antimicrobial peptides from lesser Asian scorpion mesobuthus eupeus.
Novel selection methods for DNA-encoded chemical libraries

PubMed Central

Chan, Alix I.; McGregor, Lynn M.; Liu, David R.

2015-01-01

Driven by the need for new compounds to serve as biological probes and leads for therapeutic development and the growing accessibility of DNA technologies including high-throughput sequencing, many academic and industrial groups have begun to use DNA-encoded chemical libraries as a source of bioactive small molecules. In this review, we describe the technologies that have enabled the selection of compounds with desired activities from these libraries. These methods exploit the sensitivity of in vitro selection coupled with DNA amplification to overcome some of the limitations and costs associated with conventional screening methods. In addition, we highlight newer techniques with the potential to be applied to the high-throughput evaluation of DNA-encoded chemical libraries. PMID:25723146
Ubiquitous and gene-specific regulatory 5' sequences in a sea urchin histone DNA clone coding for histone protein variants.

PubMed Central

Busslinger, M; Portmann, R; Irminger, J C; Birnstiel, M L

1980-01-01

The DNA sequences of the entire structural H4, H3, H2A and H2B genes and of their 5' flanking regions have been determined in the histone DNA clone h19 of the sea urchin Psammechinus miliaris. In clone h19 the polarity of transcription and the relative arrangement of the histone genes is identical to that in clone h22 of the same species. The histone proteins encoded by h19 DNA differ in their primary structure from those encoded by clone h22 and have been compared to histone protein sequences of other sea urchin species as well as other eukaryotes. A comparative analysis of the 5' flanking DNA sequences of the structural histone genes in both clones revealed four ubiquitous sequence motifs; a pentameric element GATCC, followed at short distance by the Hogness box GTATAAATAG, a conserved sequence PyCATTCPu, in or near which the 5' ends of the mRNAs map in h22 DNA and lastly a sequence A, containing the initiation codon. These sequences are also found, sometimes in modified version, in front of other eukaryotic genes transcribed by polymerase II. When prelude sequences of isocoding histone genes in clone h19 and h22 are compared areas of homology are seen to extend beyond the ubiquitous sequence motifs towards the divergent AT-rich spacer and terminate between approximately 140 and 240 nucleotides away from the structural gene. These prelude regions contain quite large conservative sequence blocks which are specific for each type of histone genes. Images PMID:7443547
Molecular cloning of a cDNA encoding the glycoprotein of hen oviduct microsomal signal peptidase.

PubMed Central

Newsome, A L; McLean, J W; Lively, M O

1992-01-01

Detergent-solubilized hen oviduct signal peptidase has been characterized previously as an apparent complex of a 19 kDa protein and a 23 kDa glycoprotein (GP23) [Baker & Lively (1987) Biochemistry 26, 8561-8567]. A cDNA clone encoding GP23 from a chicken oviduct lambda gt11 cDNA library has now been characterized. The cDNA encodes a protein of 180 amino acid residues with a single site for asparagine-linked glycosylation that has been directly identified by amino acid sequence analysis of a tryptic-digest peptide containing the glycosylated site. Immunoblot analysis reveals cross-reactivity with a dog pancreas protein. Comparison of the deduced amino acid sequence of GP23 with the 22/23 kDa glycoprotein of dog microsomal signal peptidase [Shelness, Kanwar & Blobel (1988) J. Biol. Chem. 263, 17063-17070], one of five proteins associated with this enzyme, reveals that the amino acid sequences are 90% identical. Thus the signal peptidase glycoprotein is as highly conserved as the sequences of cytochromes c and b from these same species and is likely to be found in a similar form in many, if not all, vertebrate species. The data also show conclusively that the dog and avian signal peptidases have at least one protein subunit in common. Images Fig. 1. PMID:1546959
[The ENCODE project and functional genomics studies].

PubMed

Ding, Nan; Qu, Hongzhu; Fang, Xiangdong

2014-03-01

Upon the completion of the Human Genome Project, scientists have been trying to interpret the underlying genomic code for human biology. Since 2003, National Human Genome Research Institute (NHGRI) has invested nearly $0.3 billion and gathered over 440 scientists from more than 32 institutions in the United States, China, United Kingdom, Japan, Spain and Singapore to initiate the Encyclopedia of DNA Elements (ENCODE) project, aiming to identify and analyze all regulatory elements in the human genome. Taking advantage of the development of next-generation sequencing technologies and continuous improvement of experimental methods, ENCODE had made remarkable achievements: identified methylation and histone modification of DNA sequences and their regulatory effects on gene expression through altering chromatin structures, categorized binding sites of various transcription factors and constructed their regulatory networks, further revised and updated database for pseudogenes and non-coding RNA, and identified SNPs in regulatory sequences associated with diseases. These findings help to comprehensively understand information embedded in gene and genome sequences, the function of regulatory elements as well as the molecular mechanism underlying the transcriptional regulation by noncoding regions, and provide extensive data resource for life sciences, particularly for translational medicine. We re-viewed the contributions of high-throughput sequencing platform development and bioinformatical technology improve-ment to the ENCODE project, the association between epigenetics studies and the ENCODE project, and the major achievement of the ENCODE project. We also provided our prospective on the role of the ENCODE project in promoting the development of basic and clinical medicine.
NUCLEOTIDE SEQUENCING AND TRANSCRIPTIONAL MAPPING OF THE GENES ENCODING BIPHENYL DIOXYGENASE, A MULTICOMPONENT POLYCHLORINATED-BIPHENYL-DEGRADING ENZYME IN PSEUDOMONAS STRAIN LB400

EPA Science Inventory

The DNA region encoding biphenyl dioxygenase, the first enzyme in the biphenyl-polychlorinated biphenyl degradation pathway of Pseudomonas species strain LB400, was sequenced. ix open reading frames were identified, four of which are, homologous to the components of toluene dioxy...
The cDNA sequence of mouse Pgp-1 and homology to human CD44 cell surface antigen and proteoglycan core/link proteins.

PubMed

Wolffe, E J; Gause, W C; Pelfrey, C M; Holland, S M; Steinberg, A D; August, J T

1990-01-05

We describe the isolation and sequencing of a cDNA encoding mouse Pgp-1. An oligonucleotide probe corresponding to the NH2-terminal sequence of the purified protein was synthesized by the polymerase chain reaction and used to screen a mouse macrophage lambda gt11 library. A cDNA clone with an insert of 1.2 kilobases was selected and sequenced. In Northern blot analysis, only cells expressing Pgp-1 contained mRNA species that hybridized with this Pgp-1 cDNA. The nucleotide sequence of the cDNA has a single open reading frame that yields a protein-coding sequence of 1076 base pairs followed by a 132-base pair 3'-untranslated sequence that includes a putative polyadenylation signal but no poly(A) tail. The translated sequence comprises a 13-amino acid signal peptide followed by a polypeptide core of 345 residues corresponding to an Mr of 37,800. Portions of the deduced amino acid sequence were identical to those obtained by amino acid sequence analysis from the purified glycoprotein, confirming that the cDNA encodes Pgp-1. The predicted structure of Pgp-1 includes an NH2-terminal extracellular domain (residues 14-265), a transmembrane domain (residues 266-286), and a cytoplasmic tail (residues 287-358). Portions of the mouse Pgp-1 sequence are highly similar to that of the human CD44 cell surface glycoprotein implicated in cell adhesion. The protein also shows sequence similarity to the proteoglycan tandem repeat sequences found in cartilage link protein and cartilage proteoglycan core protein which are thought to be involved in binding to hyaluronic acid.
DNA encoding for plant digalactosyldiacylglycerol galactosyltransferase and methods of use

DOEpatents

Benning, Christoph; Doermann, Peter

2003-11-04

The cDNA encoding digalactosyldiacylglycerol galactosyltransferase (DGD1) is provided. The deduced amino acid sequence is also provided. Methods of making and using DGD1 to screen for new herbicides and alter a plant's leaf lipid composition are also provided, as well as expression vectors, transgenic plants or other organisms transfected with said vectors.
Cloning of an avilamycin biosynthetic gene cluster from Streptomyces viridochromogenes Tü57.

PubMed Central

Gaisser, S; Trefzer, A; Stockert, S; Kirschning, A; Bechthold, A

1997-01-01

A 65-kb region of DNA from Streptomyces viridochromogenes Tü57, containing genes encoding proteins involved in the biosynthesis of avilamycins, was isolated. The DNA sequence of a 6.4-kb fragment from this region revealed four open reading frames (ORF1 to ORF4), three of which are fully contained within the sequenced fragment. The deduced amino acid sequence of AviM, encoded by ORF2, shows 37% identity to a 6-methylsalicylic acid synthase from Penicillium patulum. Cultures of S. lividans TK24 and S. coelicolor CH999 containing plasmids with ORF2 on a 5.5-kb PstI fragment were able to produce orsellinic acid, an unreduced version of 6-methylsalicylic acid. The amino acid sequence encoded by ORF3 (AviD) is 62% identical to that of StrD, a dTDP-glucose synthase from S. griseus. The deduced amino acid sequence of AviE, encoded by ORF4, shows 55% identity to a dTDP-glucose dehydratase (StrE) from S. griseus. Gene insertional inactivation experiments of aviE abolished avilamycin production, indicating the involvement of aviE in the biosynthesis of avilamycins. PMID:9335272
Universal strategies for the DNA-encoding of libraries of small molecules using the chemical ligation of oligonucleotide tags

PubMed Central

Litovchick, Alexander; Clark, Matthew A; Keefe, Anthony D

2014-01-01

The affinity-mediated selection of large libraries of DNA-encoded small molecules is increasingly being used to initiate drug discovery programs. We present universal methods for the encoding of such libraries using the chemical ligation of oligonucleotides. These methods may be used to record the chemical history of individual library members during combinatorial synthesis processes. We demonstrate three different chemical ligation methods as examples of information recording processes (writing) for such libraries and two different cDNA-generation methods as examples of information retrieval processes (reading) from such libraries. The example writing methods include uncatalyzed and Cu(I)-catalyzed alkyne-azide cycloadditions and a novel photochemical thymidine-psoralen cycloaddition. The first reading method “relay primer-dependent bypass” utilizes a relay primer that hybridizes across a chemical ligation junction embedded in a fixed-sequence and is extended at its 3′-terminus prior to ligation to adjacent oligonucleotides. The second reading method “repeat-dependent bypass” utilizes chemical ligation junctions that are flanked by repeated sequences. The upstream repeat is copied prior to a rearrangement event during which the 3′-terminus of the cDNA hybridizes to the downstream repeat and polymerization continues. In principle these reading methods may be used with any ligation chemistry and offer universal strategies for the encoding (writing) and interpretation (reading) of DNA-encoded chemical libraries. PMID:25483841
Targeted next-generation sequencing helps to decipher the genetic and phenotypic heterogeneity of hypertrophic cardiomyopathy

PubMed Central

Cecconi, Massimiliano; Parodi, Maria I.; Formisano, Francesco; Spirito, Paolo; Autore, Camillo; Musumeci, Maria B.; Favale, Stefano; Forleo, Cinzia; Rapezzi, Claudio; Biagini, Elena; Davì, Sabrina; Canepa, Elisabetta; Pennese, Loredana; Castagnetta, Mauro; Degiorgio, Dario; Coviello, Domenico A.

2016-01-01

Hypertrophic cardiomyopathy (HCM) is mainly associated with myosin, heavy chain 7 (MYH7) and myosin binding protein C, cardiac (MYBPC3) mutations. In order to better explain the clinical and genetic heterogeneity in HCM patients, in this study, we implemented a target-next generation sequencing (NGS) assay. An Ion AmpliSeq™ Custom Panel for the enrichment of 19 genes, of which 9 of these did not encode thick/intermediate and thin myofilament (TTm) proteins and, among them, 3 responsible of HCM phenocopy, was created. Ninety-two DNA samples were analyzed by the Ion Personal Genome Machine: 73 DNA samples (training set), previously genotyped in some of the genes by Sanger sequencing, were used to optimize the NGS strategy, whereas 19 DNA samples (discovery set) allowed the evaluation of NGS performance. In the training set, we identified 72 out of 73 expected mutations and 15 additional mutations: the molecular diagnosis was achieved in one patient with a previously wild-type status and the pre-excitation syndrome was explained in another. In the discovery set, we identified 20 mutations, 5 of which were in genes encoding non-TTm proteins, increasing the diagnostic yield by approximately 20%: a single mutation in genes encoding non-TTm proteins was identified in 2 out of 3 borderline HCM patients, whereas co-occuring mutations in genes encoding TTm and galactosidase alpha (GLA) altered proteins were characterized in a male with HCM and multiorgan dysfunction. Our combined targeted NGS-Sanger sequencing-based strategy allowed the molecular diagnosis of HCM with greater efficiency than using the conventional (Sanger) sequencing alone. Mutant alleles encoding non-TTm proteins may aid in the complete understanding of the genetic and phenotypic heterogeneity of HCM: co-occuring mutations of genes encoding TTm and non-TTm proteins could explain the wide variability of the HCM phenotype, whereas mutations in genes encoding only the non-TTm proteins are identifiable in patients with a milder HCM status. PMID:27600940
A deep learning method for lincRNA detection using auto-encoder algorithm.

PubMed

Yu, Ning; Yu, Zeng; Pan, Yi

2017-12-06

RNA sequencing technique (RNA-seq) enables scientists to develop novel data-driven methods for discovering more unidentified lincRNAs. Meantime, knowledge-based technologies are experiencing a potential revolution ignited by the new deep learning methods. By scanning the newly found data set from RNA-seq, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, the relevance exists along the DNA sequences; (2) lincRNAs contain some conversed patterns/motifs tethered together by non-conserved regions. The two evidences give the reasoning for adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than message RNAs are generated. That is, the transcribed RNAs participate the biological process as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of predictive deep neural network on the lincRNA data sets compared with support vector machine and traditional neural network. In addition, it is validated through the newly discovered lincRNA data set and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that deep learning method has the extensive ability for lincRNA prediction. The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for the lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data. Driven by those newly annotated lincRNA data, deep learning methods based on auto-encoder algorithm can exert their capability in knowledge learning in order to capture the useful features and the information correlation along DNA genome sequences for lincRNA detection. As our knowledge, this is the first application to adopt the deep learning techniques for identifying lincRNA transcription sequences.
Isolation, nucleotide sequence and expression of a cDNA encoding feline granulocyte colony-stimulating factor.

PubMed

Dunham, S P; Onions, D E

2001-06-21

A cDNA encoding feline granulocyte colony stimulating factor (fG-CSF) was cloned from alveolar macrophages using the reverse transcriptase-polymerase chain reaction. The cDNA is 949 bp in length and encodes a predicted mature protein of 174 amino acids. Recombinant fG-CSF was expressed as a glutathione S-transferase fusion and purified by affinity chromatography. Biological activity of the recombinant protein was demonstrated using the murine myeloblastic cell line GNFS-60, which showed an ED50 for fG-CSF of approximately 2 ng/ml. Copyright 2001 Academic Press.
Novel selection methods for DNA-encoded chemical libraries.

PubMed

Chan, Alix I; McGregor, Lynn M; Liu, David R

2015-06-01

Driven by the need for new compounds to serve as biological probes and leads for therapeutic development and the growing accessibility of DNA technologies including high-throughput sequencing, many academic and industrial groups have begun to use DNA-encoded chemical libraries as a source of bioactive small molecules. In this review, we describe the technologies that have enabled the selection of compounds with desired activities from these libraries. These methods exploit the sensitivity of in vitro selection coupled with DNA amplification to overcome some of the limitations and costs associated with conventional screening methods. In addition, we highlight newer techniques with the potential to be applied to the high-throughput evaluation of DNA-encoded chemical libraries. Copyright © 2015 Elsevier Ltd. All rights reserved.
Recombinant pinoresinol/lariciresinol reductase, recombinant dirigent protein, and methods of use

DOEpatents

Lewis, Norman G.; Davin, Laurence B.; Dinkova-Kostova, Albena T.; Fujita, Masayuki; Gang, David R.; Sarkanen, Simo; Ford, Joshua D.

2001-04-03

Dirigent proteins and pinoresinol/lariciresinol reductases have been isolated, together with cDNAs encoding dirigent proteins and pinoresinol/lariciresinol reductases. Accordingly, isolated DNA sequences are provided which code for the expression of dirigent proteins and pinoresinol/lariciresinol reductases. In other aspects, replicable recombinant cloning vehicles are provided which code for dirigent proteins or pinoresinol/lariciresinol reductases or for a base sequence sufficiently complementary to at least a portion of dirigent protein or pinoresinol/lariciresinol reductase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding dirigent protein or pinoresinol/lariciresinol reductase. Thus, systems and methods are provided for the recombinant expression of dirigent proteins and/or pinoresinol/lariciresinol reductases.
Environmental Control Of A Genetic Process

NASA Technical Reports Server (NTRS)

Khosla, Chaitan; Bailey, James E.

1991-01-01

E. coli bacteria altered to contain DNA sequence encoding production of hemoglobin made to produce hemoglobin at rates decreasing with increases in concentration of oxygen in culture media. Represents amplification of part of method described in "Cloned Hemoglobin Genes Enhance Growth Of Cells" (NPO-17517). Manipulation of promoter/regulator DNA sequences opens promising new subfield of recombinant-DNA technology for environmental control of expression of selected DNA sequences. New recombinant-DNA fusion gene products, expression vectors, and nucleotide-base sequences will emerge. Likely applications include such aerobic processes as manufacture of cloned proteins and synthesis of metabolites, production of chemicals by fermentation, enzymatic degradation, treatment of wastes, brewing, and variety of oxidative chemical reactions.
NUCLEOTIDE SEQUENCING AND TRANSCRIPTIONAL MAPPING OF THE GENES ENCODING BIPHENYL DIOXYGENASE, A MULTICOM- PONENT POLYCHLORINATED-BIPHENYL-DEGRADING ENZYME IN PSEUDOMONAS STRAIN LB400

EPA Science Inventory

The DNA region encoding biphenyl dioxygenase, the first enzyme in the biphenyl-polychlorinated biphenyl degradation pathway of Pseudomonas species strain LB400, was sequenced. Six open reading frames were identified, four of which are homologous to the components of toluene dioxy...
Genetic programs can be compressed and autonomously decompressed in live cells

NASA Astrophysics Data System (ADS)

Lapique, Nicolas; Benenson, Yaakov

2018-04-01

Fundamental computer science concepts have inspired novel information-processing molecular systems in test tubes1-13 and genetically encoded circuits in live cells14-21. Recent research has shown that digital information storage in DNA, implemented using deep sequencing and conventional software, can approach the maximum Shannon information capacity22 of two bits per nucleotide23. In nature, DNA is used to store genetic programs, but the information content of the encoding rarely approaches this maximum24. We hypothesize that the biological function of a genetic program can be preserved while reducing the length of its DNA encoding and increasing the information content per nucleotide. Here we support this hypothesis by describing an experimental procedure for compressing a genetic program and its subsequent autonomous decompression and execution in human cells. As a test-bed we choose an RNAi cell classifier circuit25 that comprises redundant DNA sequences and is therefore amenable for compression, as are many other complex gene circuits15,18,26-28. In one example, we implement a compressed encoding of a ten-gene four-input AND gate circuit using only four genetic constructs. The compression principles applied to gene circuits can enable fitting complex genetic programs into DNA delivery vehicles with limited cargo capacity, and storing compressed and biologically inert programs in vivo for on-demand activation.

Characterization and chromosomal mapping of the human TFG gene involved in thyroid carcinoma

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mencinger, M.; Panagopoulos, I.; Andreasson, P.

1997-05-01

Homology searches in the Expressed Sequence Tag Database were performed using SPYGQ-rich regions as query sequences to find genes encoding protein regions similar to the N-terminal parts of the sarcoma-associated EWS and FUS proteins. Clone 22911 (T74973), encoding a SPYGQ-rich region in its 5{prime} end, and several other clones that overlapped 22911 were selected. The combined data made it possible to assemble a full-length cDNA sequence. This cDNA sequence is 1677 bp, containing an initiation codon ATG, an open reading frame of 400 amino acids, a poly(A) signal, and a poly(A) tail. We found 100% identity between the 5{prime} partmore » of the consensus sequence and the 598-bp-long sequence named TFG. The TFG sequence is fused to the 3{prime} end of NTRK1, generating the TRK-T3 fusion transcript found in papillary thyroid carcinoma. The cDNA therefore represents the full-length transcript of the TFG gene. TFG was localized to 3q11-q12 by fluorescence in situ hybridization. The 3{prime} and the 5{prime} ends of the TFG cDNA probe hybridized to a 2.2-kb band on Northern blot filters in all tissues examined. 28 refs., 5 figs., 1 tab.« less
Isolation and characterization of two cDNA clones encoding for glutamate dehydrogenase in Nicotiana plumbaginifolia.

PubMed

Ficarelli, A; Tassi, F; Restivo, F M

1999-03-01

We have isolated two full length cDNA clones encoding Nicotiana plumbaginifolia NADH-glutamate dehydrogenase. Both clones share amino acid boxes of homology corresponding to conserved GDH catalytic domains and putative mitochondrial targeting sequence. One clone shows a putative EF-hand loop. The level of the two transcripts is affected differently by carbon source.
Open reading frames in a 4556 nucleotide sequence within MDV-1 BamHI-D DNA fragment: evidence for splicing of mRNA from a new viral glycoprotein gene.

PubMed

Becker, Y; Asher, Y; Tabor, E; Davidson, I; Malkinson, M

1994-01-01

A DNA segment of the MDV-1 BamHI-D fragment was sequenced, and the open reading frames (ORFs) present in the 4556 nucleotide fragment were analyzed by computer programs. Computer analysis identified 19 putative ORFs in the sequence ranging from a coding capacity of 37 amino acids (aa) (ORF-1a) to 684aa (ORF-1). The special properties of four ORFs (1a, 1, 2, and 3) were investigated. Two adjacent ORFs, ORF-1a and ORF-1, were found by computer analysis to have the properties of two introns encoding a glycoprotein: ORF-1a encodes an aa sequence with the properties of a signal peptide, and ORF-1 encodes a polypeptide with a membrane anchor domain and putative N-glycosylation sites in the aa sequence. ORF-1a and ORF-1 were found to be transcribed in MDV-1-infected cells. Two RNA transcripts were detected: a precursor RNA and its spliced form. Both are transcribed from a promoter located 5' to ORF-1a, and splice donor and acceptor sites are used to splice the mRNA after cleavage of a 71-nucleotide sequence. This finding suggest that ORF-1a and ORF-1 are two introns of a new MDV-1 glycoprotein gene. The DNA sequence containing ORF-1 was transiently expressed in COS-1 cells, and the viral protein produced in these cells was found to react with anti-MDV serotype-1 Antigen B-specific monoclonal antibodies. These studies indicate that the protein encoded by ORF-1 has antigenic properties resembling Antigen B of MDV-1. A gene homologous to ORF-1 was detected in the genome of both MDV-2(SB1) and MDV-3(HVT), which serve as commercial vaccine strains. Two additional ORFs were noted in the 4556 nucleotide sequence: ORF-2, which encodes a 333 aa polypeptide initiating in the UL and terminating in the TRL prior to the putative origin of replication, and ORF-3, which encodes a 155 aa polypeptide that is partly homologous to the phosphoprotein pp38 encoded by the BamHI-H sequence. The 65 N-terminal aa of the two gene products are identical, both being derived from the nucleotide sequences in the TRL and IRL, respectively. Additional homologous aa sequences are the hydrophobic aa domain in the middle of both proteins. The functions of ORF-2, ORF-3, and additional ORFs are under study.
Identification and Characterization of Putative Integron-Like Elements of the Heavy-Metal-Hypertolerant Strains of Pseudomonas spp.

PubMed

Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz

2016-11-28

Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas .
Cloning and expression of UDP-glucose: flavonoid 7-O-glucosyltransferase from hairy root cultures of Scutellaria baicalensis.

PubMed

Hirotani, M; Kuroda, R; Suzuki, H; Yoshikawa, T

2000-05-01

A cDNA encoding UDP-glucose: baicalein 7-O-glucosyltransferase (UBGT) was isolated from a cDNA library from hairy root cultures of Scutellaria baicalensis Georgi probed with a partial-length cDNA clone of a UDP-glucose: flavonoid 3-O-glucosyltransferase (UFGT) from grape (Vitis vinifera L.). The heterologous probe contained a glucosyltransferase consensus amino acid sequence which was also present in the Scutellaria cDNA clones. The complete nucleotide sequence of the 1688-bp cDNA insert was determined and the deduced amino acid sequences are presented. The nucleotide sequence analysis of UBGT revealed an open reading frame encoding a polypeptide of 476 amino acids with a calculated molecular mass of 53,094 Da. The reaction product for baicalein and UDP-glucose catalyzed by recombinant UBGT in Escherichia coli was identified as authentic baicalein 7-O-glucoside using high-performance liquid chromatography and proton nuclear magnetic resonance spectroscopy. The enzyme activities of recombinant UBGT expressed in E. coli were also detected towards flavonoids such as baicalein, wogonin, apigenin, scutellarein, 7,4'-dihydroxyflavone and kaempferol, and phenolic compounds. The accumulation of UBGT mRNA in hairy roots was in response to wounding or salicylic acid treatments.
The mitochondrial gene encoding ribosomal protein S12 has been translocated to the nuclear genome in Oenothera.

PubMed Central

Grohmann, L; Brennicke, A; Schuster, W

1992-01-01

The Oenothera mitochondrial genome contains only a gene fragment for ribosomal protein S12 (rps12), while other plants encode a functional gene in the mitochondrion. The complete Oenothera rps12 gene is located in the nucleus. The transit sequence necessary to target this protein to the mitochondrion is encoded by a 5'-extension of the open reading frame. Comparison of the amino acid sequence encoded by the nuclear gene with the polypeptides encoded by edited mitochondrial cDNA and genomic sequences of other plants suggests that gene transfer between mitochondrion and nucleus started from edited mitochondrial RNA molecules. Mechanisms and requirements of gene transfer and activation are discussed. Images PMID:1454526
Extraordinary Structured Noncoding RNAs Revealed by Bacterial Metagenome Analysis

PubMed Central

Weinberg, Zasha; Perreault, Jonathan; Meyer, Michelle M.; Breaker, Ronald R.

2012-01-01

Estimates of the total number of bacterial species1-3 suggest that existing DNA sequence databases carry only a tiny fraction of the total amount of DNA sequence space represented by this division of life. Indeed, environmental DNA samples have been shown to encode many previously unknown classes of proteins4 and RNAs5. Bioinformatics searches6-10 of genomic DNA from bacteria commonly identify novel noncoding RNAs (ncRNAs)10-12 such as riboswitches13,14. In rare instances, RNAs that exhibit more extensive sequence and structural conservation across a wide range of bacteria are encountered15,16. Given that large structured RNAs are known to carry out complex biochemical functions such as protein synthesis and RNA processing reactions, identifying more RNAs of great size and intricate structure is likely to reveal additional biochemical functions that can be achieved by RNA. We applied an updated computational pipeline17 to discover ncRNAs that rival the known large ribozymes in size and structural complexity or that are among the most abundant RNAs in bacteria that encode them. These RNAs would have been difficult or impossible to detect without examining environmental DNA sequences, suggesting that numerous RNAs with extraordinary size, structural complexity, or other exceptional characteristics remain to be discovered in unexplored sequence space. PMID:19956260
Molecular cloning of actin genes in Trichomonas vaginalis and phylogeny inferred from actin sequences.

PubMed

Bricheux, G; Brugerolle, G

1997-08-01

The parasitic protozoan Trichomonas vaginalis is known to contain the ubiquitous and highly conserved protein actin. A genomic library and a cDNA library have been screened to identify and clone the actin gene(s) of T. vaginalis. The nucleotide sequence of one gene and its flanking regions have been determined. The open reading frame encodes a protein of 376 amino acids. The sequence is not interrupted by any introns and the promoter could be represented by a 10 bp motif close to a consensus motif also found upstream of most sequenced T. vaginalis genes. The five different clones isolated from the cDNA library have similar sequences and encode three actin proteins differing only by one or two amino acids. A phylogenetic analysis of 31 actin sequences by distance matrix and parsimony methods, using centractin as outgroup, gives congruent trees with Parabasala branching above Diplomonadida.
Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.

PubMed

Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G

2002-11-01

The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.
A novel chaos-based image encryption algorithm using DNA sequence operations

NASA Astrophysics Data System (ADS)

Chai, Xiuli; Chen, Yiran; Broyde, Lucie

2017-01-01

An image encryption algorithm based on chaotic system and deoxyribonucleic acid (DNA) sequence operations is proposed in this paper. First, the plain image is encoded into a DNA matrix, and then a new wave-based permutation scheme is performed on it. The chaotic sequences produced by 2D Logistic chaotic map are employed for row circular permutation (RCP) and column circular permutation (CCP). Initial values and parameters of the chaotic system are calculated by the SHA 256 hash of the plain image and the given values. Then, a row-by-row image diffusion method at DNA level is applied. A key matrix generated from the chaotic map is used to fuse the confused DNA matrix; also the initial values and system parameters of the chaotic system are renewed by the hamming distance of the plain image. Finally, after decoding the diffused DNA matrix, we obtain the cipher image. The DNA encoding/decoding rules of the plain image and the key matrix are determined by the plain image. Experimental results and security analyses both confirm that the proposed algorithm has not only an excellent encryption result but also resists various typical attacks.
Circular replication-associated protein encoding DNA viruses identified in the faecal matter of various animals in New Zealand.

PubMed

Steel, Olivia; Kraberger, Simona; Sikorski, Alyssa; Young, Laura M; Catchpole, Ryan J; Stevens, Aaron J; Ladley, Jenny J; Coray, Dorien S; Stainton, Daisy; Dayaram, Anisha; Julian, Laurel; van Bysterveldt, Katherine; Varsani, Arvind

2016-09-01

In recent years, innovations in molecular techniques and sequencing technologies have resulted in a rapid expansion in the number of known viral sequences, in particular those with circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA genomes. CRESS DNA viruses are present in the virome of many ecosystems and are known to infect a wide range of organisms. A large number of the recently identified CRESS DNA viruses cannot be classified into any known viral families, indicating that the current view of CRESS DNA viral sequence space is greatly underestimated. Animal faecal matter has proven to be a particularly useful source for sampling CRESS DNA viruses in an ecosystem, as it is cost-effective and non-invasive. In this study a viral metagenomic approach was used to explore the diversity of CRESS DNA viruses present in the faeces of domesticated and wild animals in New Zealand. Thirty-eight complete CRESS DNA viral genomes and two circular molecules (that may be defective molecules or single components of multicomponent genomes) were identified from forty-nine individual animal faecal samples. Based on shared genome organisations and sequence similarities, eighteen of the isolates were classified as gemycircularviruses and twelve isolates were classified as smacoviruses. The remaining eight isolates lack significant sequence similarity with any members of known CRESS DNA virus groups. This research adds significantly to our knowledge of CRESS DNA viral diversity in New Zealand, emphasising the prevalence of CRESS DNA viruses in nature, and reinforcing the suggestion that a large proportion of CRESS DNA viruses are yet to be identified. Copyright © 2016 Elsevier B.V. All rights reserved.
Sequence and pattern of expression of a bovine homologue of a human mitochondrial transport protein associated with Grave's disease.

PubMed

Fiermonte, G; Runswick, M J; Walker, J E; Palmieri, F

1992-01-01

A human cDNA has been isolated previously from a thyroid library with the aid of serum from a patient with Grave's disease. It encodes a protein belonging to the mitochondrial metabolite carrier family, referred to as the Grave's disease carrier protein (GDC). Using primers based on this sequence, overlapping cDNAs encoding the bovine homologue of the GDC have been isolated from total bovine heart poly(A)+ cDNA. The bovine protein is 18 amino acids shorter than the published human sequence, but if a frame shift requiring the removal of one nucleotide is introduced into the human cDNA sequence, the human and bovine proteins become identical in their C-terminal regions, and 308 out of 330 amino acids are conserved over their entire sequences. The bovine cDNA has been used to investigate the expression of the GDC in various bovine tissues. In the tissues that were examined, the GDC is most strongly expressed in the thyroid, but substantial amounts of its mRNA were also detected in liver, lung and kidney, and lesser amounts in heart and skeletal muscle.
The DNA-encoded nucleosome organization of a eukaryotic genome.

PubMed

Kaplan, Noam; Moore, Irene K; Fondufe-Mittendorf, Yvonne; Gossett, Andrea J; Tillo, Desiree; Field, Yair; LeProust, Emily M; Hughes, Timothy R; Lieb, Jason D; Widom, Jonathan; Segal, Eran

2009-03-19

Nucleosome organization is critical for gene regulation. In living cells this organization is determined by multiple factors, including the action of chromatin remodellers, competition with site-specific DNA-binding proteins, and the DNA sequence preferences of the nucleosomes themselves. However, it has been difficult to estimate the relative importance of each of these mechanisms in vivo, because in vivo nucleosome maps reflect the combined action of all influencing factors. Here we determine the importance of nucleosome DNA sequence preferences experimentally by measuring the genome-wide occupancy of nucleosomes assembled on purified yeast genomic DNA. The resulting map, in which nucleosome occupancy is governed only by the intrinsic sequence preferences of nucleosomes, is similar to in vivo nucleosome maps generated in three different growth conditions. In vitro, nucleosome depletion is evident at many transcription factor binding sites and around gene start and end sites, indicating that nucleosome depletion at these sites in vivo is partly encoded in the genome. We confirm these results with a micrococcal nuclease-independent experiment that measures the relative affinity of nucleosomes for approximately 40,000 double-stranded 150-base-pair oligonucleotides. Using our in vitro data, we devise a computational model of nucleosome sequence preferences that is significantly correlated with in vivo nucleosome occupancy in Caenorhabditis elegans. Our results indicate that the intrinsic DNA sequence preferences of nucleosomes have a central role in determining the organization of nucleosomes in vivo.
Identification and cloning of a gamma 3 subunit splice variant of the human GABA(A) receptor.

PubMed

Poulsen, C F; Christjansen, K N; Hastrup, S; Hartvig, L

2000-05-31

cDNA sequences encoding two forms of the GABA(A) gamma 3 receptor subunit were cloned from human hippocampus. The nucleotide sequences differ by the absence (gamma 3S) or presence (gamma 3L) of 18 bp located in the presumed intracellular loop between transmembrane region (TM) III and IV. The extra 18 bp in the gamma 3L subunit generates a consensus site for phosphorylation by protein kinase C (PKC). Analysis of human genomic DNA encoding the gamma 3 subunit reveals that the 18 bp insert is contiguous with the upstream proximal exon.
Recominant Pinoresino-Lariciresinol Reductase, Recombinant Dirigent Protein And Methods Of Use

DOEpatents

Lewis, Norman G.; Davin, Laurence B.; Dinkova-Kostova, Albena T.; Fujita, Masayuki , Gang; David R. , Sarkanen; Simo , Ford; Joshua D.

2003-10-21

Dirigent proteins and pinoresinol/lariciresinol reductases have been isolated, together with cDNAs encoding dirigent proteins and pinoresinol/lariciresinol reductases. Accordingly, isolated DNA sequences are provided from source species Forsythia intermedia, Thuja plicata, Tsuga heterophylla, Eucommia ulmoides, Linum usitatissimum, and Schisandra chinensis, which code for the expression of dirigent proteins and pinoresinol/lariciresinol reductases. In other aspects, replicable recombinant cloning vehicles are provided which code for dirigent proteins or pinoresinol/lariciresinol reductases or for a base sequence sufficiently complementary to at least a portion of dirigent protein or pinoresinol/lariciresinol reductase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding dirigent protein or pinoresinol/lariciresinol reductase. Thus, systems and methods are provided for the recombinant expression of dirigent proteins and/or pinoresinol/lariciresinol reductases.
Conditional poliovirus mutants made by random deletion mutagenesis of infectious cDNA.

PubMed Central

Kirkegaard, K; Nelsen, B

1990-01-01

Small deletions were introduced into DNA plasmids bearing cDNA copies of Mahoney type 1 poliovirus RNA. The procedure used was similar to that of P. Hearing and T. Shenk (J. Mol. Biol. 167:809-822, 1983), with modifications designed to introduce only one lesion randomly into each DNA molecule. Methods to map small deletions in either large DNA or RNA molecules were employed. Two poliovirus mutants, VP1-101 and VP1-102, were selected from mutagenized populations on the basis of their host range phenotype, showing a large reduction in the relative numbers of plaques on CV1 and HeLa cells compared with wild-type virus. The deletions borne by the mutant genomes were mapped to the region encoding the amino terminus of VP1. That these lesions were responsible for the mutant phenotypes was substantiated by reintroduction of the sequenced lesions into a wild-type poliovirus cDNA by deoxyoligonucleotide-directed mutagenesis. The deletion of nucleotides encoding amino acids 8 and 9 of VP1 was responsible for the VP1-101 phenotype; the VP1-102 defect was caused by the deletion of the sequences encoding the first four amino acids of VP1. The peptide sequence at the VP1-VP3 proteolytic cleavage site was altered from glutamine-glycine to glutamine-methionine in VP1-102; this apparently did not alter the proteolytic cleavage pattern. The biochemical defects resulting from these mutations are discussed in the accompanying report. Images PMID:2152811
Characterization of the gene encoding component C3 of the complement system from the spider Loxosceles laeta venom glands: Phylogenetic implications.

PubMed

Myamoto, D T; Pidde-Queiroz, G; Pedroso, A; Gonçalves-de-Andrade, R M; van den Berg, C W; Tambourgi, D V

2016-09-01

A transcriptome analysis of the venom glands of the spider Loxosceles laeta, performed by our group, in a previous study (Fernandes-Pedrosa et al., 2008), revealed a transcript with a sequence similar to the human complement component C3. Here we present the analysis of this transcript. cDNA fragments encoding the C3 homologue (Lox-C3) were amplified from total RNA isolated from the venom glands of L. laeta by RACE-PCR. Lox-C3 is a 5178 bps cDNA sequence encoding a 190kDa protein, with a domain configuration similar to human C3. Multiple alignments of C3-like proteins revealed two processing sites, suggesting that Lox-C3 is composed of three chains. Furthermore, the amino acids consensus sequences for the thioester was found, in addition to putative sequences responsible for FB binding. The phylogenetic analysis showed that Lox-C3 belongs to the same group as two C3 isoforms from the spider Hasarius adansoni (Family Salcitidae), showing 53% homology with these. This is the first characterization of a Loxosceles cDNA sequence encoding a human C3 homologue, and this finding, together with our previous finding of the expression of a FB-like molecule, suggests that this spider species also has a complement system. This work will help to improve our understanding of the innate immune system in these spiders and the ancestral structure of C3. Copyright © 2016 Elsevier GmbH. All rights reserved.
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.

PubMed Central

Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E

1982-01-01

We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673
Three copies of a single protein II-encoding sequence in the genome of Neisseria gonorrhoeae JS3: evidence for gene conversion and gene duplication.

PubMed

van der Ley, P

1988-11-01

Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
Structure of CARB-4 and AER-1 CarbenicillinHydrolyzing β-Lactamases

PubMed Central

Sanschagrin, François; Bejaoui, Noureddine; Levesque, Roger C.

1998-01-01

We determined the nucleotide sequences of blaCARB-4 encoding CARB-4 and deduced a polypeptide of 288 amino acids. The gene was characterized as a variant of group 2c carbenicillin-hydrolyzing β-lactamases such as PSE-4, PSE-1, and CARB-3. The level of DNA homology between the bla genes for these β-lactamases varied from 98.7 to 99.9%, while that between these genes and blaCARB-4 encoding CARB-4 was 86.3%. The blaCARB-4 gene was acquired from some other source because it has a G+C content of 39.1%, compared to a G+C content of 67% for typical Pseudomonas aeruginosa genes. DNA sequencing revealed that blaAER-1 shared 60.8% DNA identity with blaPSE-3 encoding PSE-3. The deduced AER-1 β-lactamase peptide was compared to class A, B, C, and D enzymes and had 57.6% identity with PSE-3, including an STHK tetrad at the active site. For CARB-4 and AER-1, conserved canonical amino acid boxes typical of class A β-lactamases were identified in a multiple alignment. Analysis of the DNA sequences flanking blaCARB-4 and blaAER-1 confirmed the importance of gene cassettes acquired via integrons in bla gene distribution. PMID:9687391

Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prody, C.A.; Zevin-Sonkin, D.; Gnatt, A.

1987-06-01

To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase and Torpedo electric organ true acetylcholinesterase. Using these probes, the authors isolated several cDNA clones from lambdagt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. Inmore » RNA blots of poly(A)/sup +/ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These finding demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species.« less
DNA Shape Dominates Sequence Affinity in Nucleosome Formation

NASA Astrophysics Data System (ADS)

Freeman, Gordon S.; Lequieu, Joshua P.; Hinckley, Daniel M.; Whitmer, Jonathan K.; de Pablo, Juan J.

2014-10-01

Nucleosomes provide the basic unit of compaction in eukaryotic genomes, and the mechanisms that dictate their position at specific locations along a DNA sequence are of central importance to genetics. In this Letter, we employ molecular models of DNA and proteins to elucidate various aspects of nucleosome positioning. In particular, we show how DNA's histone affinity is encoded in its sequence-dependent shape, including subtle deviations from the ideal straight B-DNA form and local variations of minor groove width. By relying on high-precision simulations of the free energy of nucleosome complexes, we also demonstrate that, depending on DNA's intrinsic curvature, histone binding can be dominated by bending interactions or electrostatic interactions. More generally, the results presented here explain how sequence, manifested as the shape of the DNA molecule, dominates molecular recognition in the problem of nucleosome positioning.
DNA-encoded libraries - an efficient small molecule discovery technology for the biomedical sciences.

PubMed

Kunig, Verena; Potowski, Marco; Gohla, Anne; Brunschweiger, Andreas

2018-06-27

DNA-encoded compound libraries are a highly attractive technology for the discovery of small molecule protein ligands. These compound collections consist of small molecules covalently connected to individual DNA sequences carrying readable information about the compound structure. DNA-tagging allows for efficient synthesis, handling and interrogation of vast numbers of chemically synthesized, drug-like compounds. They are screened on proteins by an efficient, generic assay based on Darwinian principles of selection. To date, selection of DNA-encoded libraries allowed for the identification of numerous bioactive compounds. Some of these compounds uncovered hitherto unknown allosteric binding sites on target proteins; several compounds proved their value as chemical biology probes unraveling complex biology; and the first examples of clinical candidates that trace their ancestry to a DNA-encoded library were reported. Thus, DNA-encoded libraries proved their value for the biomedical sciences as a generic technology for the identification of bioactive drug-like molecules numerous times. However, large scale experiments showed that even the selection of billions of compounds failed to deliver bioactive compounds for the majority of proteins in an unbiased panel of target proteins. This raises the question of compound library design.
A User's Guide to the Encyclopedia of DNA Elements (ENCODE)

PubMed Central

2011-01-01

The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome. PMID:21526222
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

PubMed Central

Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

2014-01-01

As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Isolation and characterization of cDNA clones for carrot extensin and a proline-rich 33-kDa protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, J.; Varner, J.E.

1985-07-01

Extensins are hydroxyproline-rich glycoproteins associated with most dicotyledonous plant cell walls. To isolate cDNA clones encoding extensin, the authors started by isolating poly(A) RNA from carrot root tissue, and then translating the RNA in vitro, in the presence of tritiated leucine or proline. A 33-kDa peptide was identified in the translation products as a putative extensin precursor. From a cDNA library constructed with poly(A) RNA from wounded carrots, one cDNA clone (pDC5) was identified that specifically hybridized to poly(A) RNA encoding this 33-kDa peptide. They isolated three cDNA clones (pDC11, pDC12, and pDC16) from another cDNA library using pCD5 asmore » a probe. DNA sequence data, RNA hybridization analysis, and hybrid released in vitro translation indicate that the cDNA clones pDC11 encodes extensin and that cDNA clones pDC12 and pDC16 encode the 33-kDa peptide, which as yet has an unknown identity and function. The assumption that the 33-kDa peptide was an extensin precursor was invalid. RNA hybridization analysis showed that RNA encoded by both clone types is accumulated upon wounding.« less
Geranyl diphosphate synthase large subunit, and methods of use

DOEpatents

Croteau, Rodney B.; Burke, Charles C.; Wildung, Mark R.

2001-10-16

A cDNA encoding geranyl diphosphate synthase large subunit from peppermint has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Replicable recombinant cloning vehicles are provided which code for geranyl diphosphate synthase large subunit). In another aspect, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding geranyl diphosphate synthase large subunit. In yet another aspect, the present invention provides isolated, recombinant geranyl diphosphate synthase protein comprising an isolated, recombinant geranyl diphosphate synthase large subunit protein and an isolated, recombinant geranyl diphosphate synthase small subunit protein. Thus, systems and methods are provided for the recombinant expression of geranyl diphosphate synthase.
Molecular cloning and nucleotide sequence of CYP6BF1 from the diamondback moth, Plutella xylostella

PubMed Central

Li, Hongshan; Dai, Huaguo; Wei, Hui

2005-01-01

A novel cDNA clong encoding a cytochrome P450 was screened from the insecticide-susceptible strain of Plutella xylostella (L.) (Lepidoptera:Yponomeutidae). The nucleotide sequence of the clone, designated CYP6BF1, was determined. This is the first full-length sequence of the CYP6 family from Plutella xylostella (L.). The cDNA is 1661bp in length and contains an open reading frame from base pairs 26 to 1570, encoding a protein of 514 amino acid residues. It is similar to the other insect P450s in gene family 6, including CYP6AE1 from Depressaria pastinacella, (46%). The GenBank accession number is AY971374. PMID:17119627
Analysis of the DNA sequence of a 15,500 bp fragment near the left telomere of chromosome XV from Saccharomyces cerevisiae reveals a putative sugar transporter, a carboxypeptidase homologue and two new open reading frames.

PubMed

Gamo, F J; Lafuente, M J; Casamayor, A; Ariño, J; Aldea, M; Casas, C; Herrero, E; Gancedo, C

1996-06-15

We report the sequence of a 15.5 kb DNA segment located near the left telomere of chromosome XV of Saccharomyces cerevisiae. The sequence contains nine open reading frames (ORFs) longer than 300 bp. Three of them are internal to other ones. One corresponds to the gene LGT3 that encodes a putative sugar transporter. Three adjacent ORFs were separated by two stop codons in frame. These ORFs presented homology with the gene CPS1 that encodes carboxypeptidase S. The stop codons were not found in the same sequence derived from another yeast strain. Two other ORFs without significant homology in databases were also found. One of them, O0420, is very rich in serine and threonine and presents a series of repeated or similar amino acid stretches along the sequence.
Biochemical Characterization of a Mycobacteriophage Derived DnaB Ortholog Reveals New Insight into the Evolutionary Origin of DnaB Helicases

PubMed Central

Bhowmik, Priyanka; Das Gupta, Sujoy K.

2015-01-01

The bacterial replicative helicases known as DnaB are considered to be members of the RecA superfamily. All members of this superfamily, including DnaB, have a conserved C- terminal domain, known as the RecA core. We unearthed a series of mycobacteriophage encoded proteins in which the RecA core domain alone was present. These proteins were phylogenetically related to each other and formed a distinct clade within the RecA superfamily. A mycobacteriophage encoded protein, Wildcat Gp80 that roots deep in the DnaB family, was found to possess a core domain having significant sequence homology (Expect value < 10-5) with members of this novel cluster. This indicated that Wildcat Gp80, and by extrapolation, other members of the DnaB helicase family, may have evolved from a single domain RecA core polypeptide belonging to this novel group. Biochemical investigations confirmed that Wildcat Gp80 was a helicase. Surprisingly, our investigations also revealed that a thioredoxin tagged truncated version of the protein in which the N-terminal sequences were removed was fully capable of supporting helicase activity, although its ATP dependence properties were different. DnaB helicase activity is thus, primarily a function of the RecA core although additional N-terminal sequences may be necessary for fine tuning its activity and stability. Based on sequence comparison and biochemical studies we propose that DnaB helicases may have evolved from single domain RecA core proteins having helicase activities of their own, through the incorporation of additional N-terminal sequences. PMID:26237048
Cloning and characterisation of cDNA sequences encoding for anti-lipopolysaccharide factors (ALFs) in Brazilian palaemonid and penaeid shrimps.

PubMed

Rosa, Rafael Diego; Stoco, Patricia Hermes; Barracco, Margherita Anna

2008-11-01

Anti-lipopolysaccharide factors (ALFs) are antimicrobial peptides found in limulids and crustaceans that have a potent and broad range of antimicrobial activity. We report here the identification and molecular characterisation of new sequences encoding for ALFs in the haemocytes of the freshwater prawn Macrobrachium olfersi and also in two Brazilian penaeid species, Farfantepenaeus paulensis and Litopenaeus schmitti. All obtained sequences encoded for highly cationic peptides containing two conserved cysteine residues flanking a putative LPS-binding domain. They exhibited a significant amino acid similarity with crustacean and limulid ALF sequences, especially with those of penaeid shrimps. This is the first identification of ALF in a freshwater prawn.
A complete Neandertal mitochondrial genome sequence determined by high-throughput sequencing

PubMed Central

Green, Richard E.; Malaspinas, Anna-Sapfo; Krause, Johannes; Briggs, Adrian W.; Johnson, Philip L. F.; Uhler, Caroline; Meyer, Matthias; Good, Jeffrey M.; Maricic, Tomislav; Stenzel, Udo; Prüfer, Kay; Siebauer, Michael; Burbano, Hernán A.; Ronan, Michael; Rothberg, Jonathan M.; Egholm, Michael; Rudan, Pavao; Brajković, Dejana; Kućan, Željko; Gušić, Ivan; Wikström, Mårten; Laakkonen, Liisa; Kelso, Janet; Slatkin, Montgomery; Pääbo, Svante

2008-01-01

Summary A complete mitochondrial (mt) genome sequence was reconstructed from a 38,000-year-old Neandertal individual using 8,341 mtDNA sequences identified among 4.8 Gb of DNA generated from ~0.3 grams of bone. Analysis of the assembled sequence unequivocally establishes that the Neandertal mtDNA falls outside the variation of extant human mtDNAs and allows an estimate of the divergence date between the two mtDNA lineages of 660,000±140,000 years. Of the 13 proteins encoded in the mtDNA, subunit 2 of cytochrome c oxidase of the mitochondrial electron transport chain has experienced the largest number of amino acid substitutions in human ancestors since the separation from Neandertals. There is evidence that purifying selection in the Neandertal mtDNA was reduced compared to other primate lineages suggesting that the effective population size of Neandertals was small. PMID:18692465
Cloning of a cDNA encoding 1-aminocyclopropane-1-carboxylate synthase and expression of its mRNA in ripening apple fruit.

PubMed

Dong, J G; Kim, W T; Yip, W K; Thompson, G A; Li, L; Bennett, A B; Yang, S F

1991-08-01

1-Aminocyclopropane-1-carboxylate (ACC) synthase (EC 4.4.1.14) purified from apple (Malus sylvestris Mill.) fruit was subjected to trypsin digestion. Following separation by reversed-phase high-pressure liquid chromatography, ten tryptic peptides were sequenced. Based on the sequences of three tryptic peptides, three sets of mixed oligonucleotide probes were synthesized and used to screen a plasmid cDNA library prepared from poly(A)(+) RNA of ripe apple fruit. A 1.5-kb (kilobase) cDNA clone which hybridized to all three probes were isolated. The clone contained an open reading frame of 1214 base pairs (bp) encoding a sequence of 404 amino acids. While the polyadenine tail at the 3'-end was intact, it lacked a portion of sequence at the 5'-end. Using the RNA-based polymerase chain reaction, an additional sequence of 148 bp was obtained at the 5'-end. Thus, 1362 bp were sequenced and they encode 454 amino acids. The deduced amino-acid sequence contained peptide sequences corresponding to all ten tryptic fragments, confirming the identity of the cDNA clone. Comparison of the deduced amino-acid sequence between ACC synthase from apple fruit and those from tomato (Lycopersicon esculentum Mill.) and winter squash (Cucurbita maxima Duch.) fruits demonstrated the presence of seven highly conserved regions, including the previously identified region for the active site. The size of the translation product of ACC-synthase mRNA was similar to that of the mature protein on sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), indicating that apple ACC-synthase undergoes only minor, if any, post-translational proteolytic processing. Analysis of ACC-synthase mRNA by in-vitro translation-immunoprecipitation, and by Northern blotting indicates that the ACC-synthase mRNA was undetectable in unripe fruit, but was accumulated massively during the ripening proccess. These data demonstrate that the expression of the ACC-synthase gene is developmentally regulated.
Nucleic acids encoding human trithorax protein

DOEpatents

Evans, Glen A.; Djabali, Malek; Selleri, Licia; Parry, Pauline

2001-01-01

In accordance with the present invention, there is provided an isolated peptide having the characteristics of human trithorax protein (as well as DNA encoding same, antisense DNA derived therefrom and antagonists therefor). The invention peptide is characterized by having a DNA binding domain comprising multiple zinc fingers and at least 40% amino acid identity with respect to the DNA binding domain of Drosophila trithorax protein and at least 70% conserved sequence with respect to the DNA binding domain of Drosophila trithorax protein, and wherein said peptide is encoded by a gene located at chromosome 11 of the human genome at q23. Also provided are methods for the treatment of subject(s) suffering from immunodeficiency, developmental abnormality, inherited disease, or cancer by administering to said subject a therapeutically effective amount of one of the above-described agents (i.e., peptide, antagonist therefor, DNA encoding said peptide or antisense DNA derived therefrom). Also provided is a method for the diagnosis, in a subject, of immunodeficiency, developmental abnormality, inherited disease, or cancer associated with disruption of chromosome 11 at q23.
Ancient dna from pleistocene fossils: Preservation, recovery, and utility of ancient genetic information for quaternary research

NASA Astrophysics Data System (ADS)

Yang, Hong

Until recently, recovery and analysis of genetic information encoded in ancient DNA sequences from Pleistocene fossils were impossible. Recent advances in molecular biology offered technical tools to obtain ancient DNA sequences from well-preserved Quaternary fossils and opened the possibilities to directly study genetic changes in fossil species to address various biological and paleontological questions. Ancient DNA studies involving Pleistocene fossil material and ancient DNA degradation and preservation in Quaternary deposits are reviewed. The molecular technology applied to isolate, amplify, and sequence ancient DNA is also presented. Authentication of ancient DNA sequences and technical problems associated with modern and ancient DNA contamination are discussed. As illustrated in recent studies on ancient DNA from proboscideans, it is apparent that fossil DNA sequence data can shed light on many aspects of Quaternary research such as systematics and phylogeny. conservation biology, evolutionary theory, molecular taphonomy, and forensic sciences. Improvement of molecular techniques and a better understanding of DNA degradation during fossilization are likely to build on current strengths and to overcome existing problems, making fossil DNA data a unique source of information for Quaternary scientists.
Quantitation of heteroplasmy of mtDNA sequence variants identified in a population of AD patients and controls by array-based resequencing.

PubMed

Coon, Keith D; Valla, Jon; Szelinger, Szabolics; Schneider, Lonnie E; Niedzielko, Tracy L; Brown, Kevin M; Pearson, John V; Halperin, Rebecca; Dunckley, Travis; Papassotiropoulos, Andreas; Caselli, Richard J; Reiman, Eric M; Stephan, Dietrich A

2006-08-01

The role of mitochondrial dysfunction in the pathogenesis of Alzheimer's disease (AD) has been well documented. Though evidence for the role of mitochondria in AD seems incontrovertible, the impact of mitochondrial DNA (mtDNA) mutations in AD etiology remains controversial. Though mutations in mitochondrially encoded genes have repeatedly been implicated in the pathogenesis of AD, many of these studies have been plagued by lack of replication as well as potential contamination of nuclear-encoded mitochondrial pseudogenes. To assess the role of mtDNA mutations in the pathogenesis of AD, while avoiding the pitfalls of nuclear-encoded mitochondrial pseudogenes encountered in previous investigations and showcasing the benefits of a novel resequencing technology, we sequenced the entire coding region (15,452 bp) of mtDNA from 19 extremely well-characterized AD patients and 18 age-matched, unaffected controls utilizing a new, reliable, high-throughput array-based resequencing technique, the Human MitoChip. High-throughput, array-based DNA resequencing of the entire mtDNA coding region from platelets of 37 subjects revealed the presence of 208 loci displaying a total of 917 sequence variants. There were no statistically significant differences in overall mutational burden between cases and controls, however, 265 independent sites of statistically significant change between cases and controls were identified. Changed sites were found in genes associated with complexes I (30.2%), III (3.0%), IV (33.2%), and V (9.1%) as well as tRNA (10.6%) and rRNA (14.0%). Despite their statistical significance, the subtle nature of the observed changes makes it difficult to determine whether they represent true functional variants involved in AD etiology or merely naturally occurring dissimilarity. Regardless, this study demonstrates the tremendous value of this novel mtDNA resequencing platform, which avoids the pitfalls of erroneously amplifying nuclear-encoded mtDNA pseudogenes, and our proposed analysis paradigm, which utilizes the availability of raw signal intensity values for each of the four potential alleles to facilitate quantitative estimates of mtDNA heteroplasmy. This information provides a potential new target for burgeoning diagnostics and therapeutics that could truly assist those suffering from this devastating disorder.
Rhipicephalus (Boophilus) microplus strain Deutsch, 5 BAC clone sequencing, including two encoding Cytochrome P450s and one encoding CzEst9 carboxylesterase

USDA-ARS?s Scientific Manuscript database

The cattle tick, Rhipicephalus (Boophilus) microplus, has a genome over 2.4 times the size of the human genome, and with over 70% of repetitive DNA, this genome would prove very costly to sequence at today's prices and difficult to assemble and analyze. BAC clones give insight into the genome struct...
Protection of CpG islands from DNA methylation is DNA-encoded and evolutionarily conserved

PubMed Central

Long, Hannah K.; King, Hamish W.; Patient, Roger K.; Odom, Duncan T.; Klose, Robert J.

2016-01-01

DNA methylation is a repressive epigenetic modification that covers vertebrate genomes. Regions known as CpG islands (CGIs), which are refractory to DNA methylation, are often associated with gene promoters and play central roles in gene regulation. Yet how CGIs in their normal genomic context evade the DNA methylation machinery and whether these mechanisms are evolutionarily conserved remains enigmatic. To address these fundamental questions we exploited a transchromosomic animal model and genomic approaches to understand how the hypomethylated state is formed in vivo and to discover whether mechanisms governing CGI formation are evolutionarily conserved. Strikingly, insertion of a human chromosome into mouse revealed that promoter-associated CGIs are refractory to DNA methylation regardless of host species, demonstrating that DNA sequence plays a central role in specifying the hypomethylated state through evolutionarily conserved mechanisms. In contrast, elements distal to gene promoters exhibited more variable methylation between host species, uncovering a widespread dependence on nucleotide frequency and occupancy of DNA-binding transcription factors in shaping the DNA methylation landscape away from gene promoters. This was exemplified by young CpG rich lineage-restricted repeat sequences that evaded DNA methylation in the absence of co-evolved mechanisms targeting methylation to these sequences, and species specific DNA binding events that protected against DNA methylation in CpG poor regions. Finally, transplantation of mouse chromosomal fragments into the evolutionarily distant zebrafish uncovered the existence of a mechanistically conserved and DNA-encoded logic which shapes CGI formation across vertebrate species. PMID:27084945
Isolation and bacterial expression of a sesquiterpene synthase cDNA clone from peppermint (Mentha x piperita, L.) that produces the aphid alarm pheromone (E)-.beta.-farnesene

DOEpatents

Croteau, Rodney Bruce; Crock, John E.

2005-01-25

A cDNA encoding (E)-.beta.-farnesene synthase from peppermint (Mentha piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of (E)-.beta.-farnesene synthase (SEQ ID NO:2), from peppermint (Mentha piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for (E)-.beta.-farnesene synthase, or for a base sequence sufficiently complementary to at least a portion of (E)-.beta.-farnesene synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding (E)-.beta.-farnesene synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant (E)-.beta.-famesene synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant (E)-.beta.-farnesene synthase may be used to obtain expression or enhanced expression of (E)-.beta.-famesene synthase in plants in order to enhance the production of (E)-.beta.-farnesene, or may be otherwise employed for the regulation or expression of (E)-.beta.-farnesene synthase, or the production of its product.
Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.

PubMed

Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M

1991-02-15

The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.

Characterization and distribution of a maize cDNA encoding a peptide similar to the catalytic region of second messenger dependent protein kinases

NASA Technical Reports Server (NTRS)

Biermann, B.; Johnson, E. M.; Feldman, L. J.

1990-01-01

Maize (Zea mays) roots respond to a variety of environmental stimuli which are perceived by a specialized group of cells, the root cap. We are studying the transduction of extracellular signals by roots, particularly the role of protein kinases. Protein phosphorylation by kinases is an important step in many eukaryotic signal transduction pathways. As a first phase of this research we have isolated a cDNA encoding a maize protein similar to fungal and animal protein kinases known to be involved in the transduction of extracellular signals. The deduced sequence of this cDNA encodes a polypeptide containing amino acids corresponding to 33 out of 34 invariant or nearly invariant sequence features characteristic of protein kinase catalytic domains. The maize cDNA gene product is more closely related to the branch of serine/threonine protein kinase catalytic domains composed of the cyclic-nucleotide- and calcium-phospholipid-dependent subfamilies than to other protein kinases. Sequence identity is 35% or more between the deduced maize polypeptide and all members of this branch. The high structural similarity strongly suggests that catalytic activity of the encoded maize protein kinase may be regulated by second messengers, like that of all members of this branch whose regulation has been characterized. Northern hybridization with the maize cDNA clone shows a single 2400 base transcript at roughly similar levels in maize coleoptiles, root meristems, and the zone of root elongation, but the transcript is less abundant in mature leaves. In situ hybridization confirms the presence of the transcript in all regions of primary maize root tissue.
The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.

PubMed

Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo

2016-05-01

The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicine was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes encoded one intron and three genes have two introns.
Purification, characterization, and cDNA cloning of a novel acidic endoglycoceramidase from the jellyfish, Cyanea nozakii.

PubMed

Horibata, Y; Okino, N; Ichinose, S; Omori, A; Ito, M

2000-10-06

Endoglycoceramidase (EC ) is an enzyme capable of cleaving the glycosidic linkage between oligosaccharides and ceramides in various glycosphingolipids. We report here the purification, characterization, and cDNA cloning of a novel endoglycoceramidase from the jellyfish, Cyanea nozakii. The purified enzyme showed a single protein band estimated to be 51 kDa on SDS-polyacrylamide gel electrophoresis. The enzyme showed a pH optimum of 3.0 and was activated by Triton X-100 and Lubrol PX but not by sodium taurodeoxycholate. This enzyme preferentially hydrolyzed gangliosides, especially GT1b and GQ1b, whereas neutral glycosphingolipids were somewhat resistant to hydrolysis by the enzyme. A full-length cDNA encoding the enzyme was cloned by 5'- and 3'-rapid amplification of cDNA ends using a partial amino acid sequence of the purified enzyme. The open reading frame of 1509 nucleotides encoded a polypeptide of 503 amino acids including a signal sequence of 25 residues and six potential N-glycosylation sites. Interestingly, the Asn-Glu-Pro sequence, which is the putative active site of Rhodococcus endoglycoceramidase, was conserved in the deduced amino acid sequences. This is the first report of the cloning of an endoglycoceramidase from a eukaryote.
Molecular cloning of a putative gene encoding isopentenyltransferase from pingyitiancha (Malus hupehensis) and characterization of its response to nitrate.

PubMed

Peng, Jing; Peng, Futian; Zhu, Chunfu; Wei, Shaochong

2008-06-01

A putative isopentenyltransferase (IPT) encoding gene was identified from a pingyitiancha (Malus hupehensis Rehd.) expressed sequence tag database, and the full-length gene was cloned by RACE. Based on expression profile and sequence alignment, the nucleotide sequence of the clone, named MhIPT3, was most similar to AtIPT3, an IPT gene in Arabidopsis. The full-length cDNA contained a 963-bp open reading frame encoding a protein of 321 amino acids with a molecular mass of 37.3 kDa. Sequence analysis of genomic DNA revealed the absence of introns in the frame. Quantitative real-time PCR analysis demonstrated that the gene was expressed in roots, stems and leaves. Application of nitrate to roots of nitrogen-deprived seedlings strongly induced expression of MhIPT3 and was accompanied by the accumulation of cytokinins, whereas MhIPT3 expression was little affected by ammonium application to roots of nitrogen-deprived seedlings. Application of nitrate to leaves also up-regulated the expression of MhIPT3 and corresponded closely with the accumulation of isopentyladenine and isopentyladenosine in leaves.
A putative peroxidase cDNA from turnip and analysis of the encoded protein sequence.

PubMed

Romero-Gómez, S; Duarte-Vázquez, M A; García-Almendárez, B E; Mayorga-Martínez, L; Cervantes-Avilés, O; Regalado, C

2008-12-01

A putative peroxidase cDNA was isolated from turnip roots (Brassica napus L. var. purple top white globe) by reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE). Total RNA extracted from mature turnip roots was used as a template for RT-PCR, using a degenerated primer designed to amplify the highly conserved distal motif of plant peroxidases. The resulting partial sequence was used to design the rest of the specific primers for 5' and 3' RACE. Two cDNA fragments were purified, sequenced, and aligned with the partial sequence from RT-PCR, and a complete overlapping sequence was obtained and labeled as BbPA (Genbank Accession No. AY423440, named as podC). The full length cDNA is 1167bp long and contains a 1077bp open reading frame (ORF) encoding a 358 deduced amino acid peroxidase polypeptide. The putative peroxidase (BnPA) showed a calculated Mr of 34kDa, and isoelectric point (pI) of 4.5, with no significant identity with other reported turnip peroxidases. Sequence alignment showed that only three peroxidases have a significant identity with BnPA namely AtP29a (84%), and AtPA2 (81%) from Arabidopsis thaliana, and HRPA2 (82%) from horseradish (Armoracia rusticana). Work is in progress to clone this gene into an adequate host to study the specific role and possible biotechnological applications of this alternative peroxidase source.
Highly sensitive detection of mutations in CHO cell recombinant DNA using multi-parallel single molecule real-time DNA sequencing.

PubMed

Cartwright, Joseph F; Anderson, Karin; Longworth, Joseph; Lobb, Philip; James, David C

2018-06-01

High-fidelity replication of biologic-encoding recombinant DNA sequences by engineered mammalian cell cultures is an essential pre-requisite for the development of stable cell lines for the production of biotherapeutics. However, immortalized mammalian cells characteristically exhibit an increased point mutation frequency compared to mammalian cells in vivo, both across their genomes and at specific loci (hotspots). Thus unforeseen mutations in recombinant DNA sequences can arise and be maintained within producer cell populations. These may affect both the stability of recombinant gene expression and give rise to protein sequence variants with variable bioactivity and immunogenicity. Rigorous quantitative assessment of recombinant DNA integrity should therefore form part of the cell line development process and be an essential quality assurance metric for instances where synthetic/multi-component assemblies are utilized to engineer mammalian cells, such as the assessment of recombinant DNA fidelity or the mutability of single-site integration target loci. Based on Pacific Biosciences (Menlo Park, CA) single molecule real-time (SMRT™) circular consensus sequencing (CCS) technology we developed a rDNA sequence analysis tool to process the multi-parallel sequencing of ∼40,000 single recombinant DNA molecules. After statistical filtering of raw sequencing data, we show that this analytical method is capable of detecting single point mutations in rDNA to a minimum single mutation frequency of 0.0042% (<1/24,000 bases). Using a stable CHO transfectant pool harboring a randomly integrated 5 kB plasmid construct encoding GFP we found that 28% of recombinant plasmid copies contained at least one low frequency (<0.3%) point mutation. These mutations were predominantly found in GC base pairs (85%) and that there was no positional bias in mutation across the plasmid sequence. There was no discernable difference between the mutation frequencies of coding and non-coding DNA. The putative ratio of non-synonymous and synonymous changes within the open reading frames (ORFs) in the plasmid sequence indicates that natural selection does not impact upon the prevalence of these mutations. Here we have demonstrated the abundance of mutations that fall outside of the reported range of detection of next generation sequencing (NGS) and second generation sequencing (SGS) platforms, providing a methodology capable of being utilized in cell line development platforms to identify the fidelity of recombinant genes throughout the production process. © 2018 Wiley Periodicals, Inc.
Molecular cloning of a gene encoding translation initiation factor (TIF) from Candida albicans.

PubMed

Mirbod, F; Nakashima, S; Kitajima, Y; Ghannoum, M A; Cannon, R D; Nozawa, Y

1996-01-01

The differential display technique was applied to compare mRNAs from two clinical isolates of Candida albicans with different virulence; high (potent strain, 16240) and low (weak strain, 18084) extracellular phospholipase activities. Complementary DNA fragments corresponding to several apparently differentially expressed mRNAs were recovered and sequenced. A complementary DNA fragment seen distinctly in the potent phospholipase producing strain was highly homologous to the yeast translation initiation factor (TIF). The selected DNA fragment was then used as a probe to isolate its corresponding complementary DNA clone from a library of C. albicans genomic DNA. The sequence of isolated gene revealed an open reading frame of 1194 nucleotides with the potential to encode a protein of 397 amino acids with a predicted molecular weight of 43 kDa. Over its entire length, the amino acid sequence showed strong homology (78-89%) to Saccharomyces cerevisiae TIF and (63-80%) to mouse eIF-4A proteins. Therefore, our C. albicans gene was identified to be TIF (Ca TIF). Northern blot analysis in the two strains of C. albicans revealed that Ca TIF expression is 1.5-fold higher in the potent phospholipase producing strain. The restriction endonuclease digestion of genomic DNA from this potent strain revealed at least two hybridized bands in Southern blot analysis, suggesting two or more closely related sequences in the C. albicans genome.
Recombination of polynucleotide sequences using random or defined primers

DOEpatents

Arnold, Frances H.; Shao, Zhixin; Affholter, Joseph A.; Zhao, Huimin H; Giver, Lorraine J.

2000-01-01

A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.
Recombination of polynucleotide sequences using random or defined primers

DOEpatents

Arnold, Frances H.; Shao, Zhixin; Affholter, Joseph A.; Zhao, Huimin; Giver, Lorraine J.

2001-01-01

A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.
Cloning and characterization of the nagA gene that encodes beta-n-acetylglucosaminidase from Aspergillus nidulans and its expression in Aspergillus oryzae.

PubMed

Kim, Sunhwa; Matsuo, Ichiro; Ajisaka, Katsumi; Nakajima, Harushi; Kitamoto, Katsuhiko

2002-10-01

We isolated a beta-N-acetylglucosaminidase encoding gene and its cDNA from the filamentous fungus Aspergillus nidulans, and designated it nagA. The nagA gene contained no intron and encoded a polypeptide of 603 amino acids with a putative 19-amino acid signal sequence. The deduced amino acid sequence was very similar to the sequence of Candida albicans Hex1 and Trichoderma harzianum Nag1. Yeast cells containing the nagA cDNA under the control of the GAL1 promoter expressed beta-N-acetylglucosaminidase activity. The chromosomal nagA gene of A. nidulans was disrupted by replacement with the argB marker gene. The disruptant strains expressed low levels of beta-N-acetylglucosaminidase activity and showed poor growth on a medium containing chitobiose as a carbon source. Aspergillus oryzae strain carrying the nagA gene under the control of the improved glaA promoter produced large amounts of beta-N-acetylglucosaminidase in a wheat bran solid culture.
Expression of the Caulobacter heat shock gene dnaK is developmentally controlled during growth at normal temperatures.

PubMed Central

Gomes, S L; Gober, J W; Shapiro, L

1990-01-01

Caulobacter crescentus has a single dnaK gene that is highly homologous to the hsp70 family of heat shock genes. Analysis of the cloned and sequenced dnaK gene has shown that the deduced amino acid sequence could encode a protein of 67.6 kilodaltons that is 68% identical to the DnaK protein of Escherichia coli and 49% identical to the Drosophila and human hsp70 protein family. A partial open reading frame 165 base pairs 3' to the end of dnaK encodes a peptide of 190 amino acids that is 59% identical to DnaJ of E. coli. Northern blot analysis revealed a single 4.0-kilobase mRNA homologous to the cloned fragment. Since the dnaK coding region is 1.89 kilobases, dnaK and dnaJ may be transcribed as a polycistronic message. S1 mapping and primer extension experiments showed that transcription initiated at two sites 5' to the dnaK coding sequence. A single start site of transcription was identified during heat shock at 42 degrees C, and the predicted promoter sequence conformed to the consensus heat shock promoters of E. coli. At normal growth temperature (30 degrees C), a different start site was identified 3' to the heat shock start site that conformed to the E. coli sigma 70 promoter consensus sequence. S1 protection assays and analysis of expression of the dnaK gene fused to the lux transcription reporter gene showed that expression of dnaK is temporally controlled under normal physiological conditions and that transcription occurs just before the initiation of DNA replication. Thus, in both human cells (I. K. L. Milarski and R. I. Morimoto, Proc. Natl. Acad. Sci. USA 83:9517-9521, 1986) and in a simple bacterium, the transcription of a hsp70 gene is temporally controlled as a function of the cell cycle under normal growth conditions. Images PMID:2345134
Molecular characterization of a gene POLR2H encoded an essential subunit for RNA polymerase II from the Giant Panda (Ailuropoda Melanoleuca).

PubMed

Du, Yu-Jie; Hou, Yi-Ling; Hou, Wan-Ru

2013-02-01

The Giant Panda is an endangered and valuable gene pool in genetic, its important functional gene POLR2H encodes an essential shared peptide H of RNA polymerases. The genomic DNA and cDNA sequences were cloned successfully for the first time from the Giant Panda (Ailuropoda melanoleuca) adopting touchdown-PCR and reverse transcription polymerase chain reaction (RT-PCR), respectively. The length of the genomic sequence of the Giant Panda is 3,285 bp, including five exons and four introns. The cDNA fragment cloned is 509 bp in length, containing an open reading frame of 453 bp encoding 150 amino acids. Alignment analysis indicated that both the cDNA and its deduced amino acid sequence were highly conserved. Protein structure prediction showed that there was one protein kinase C phosphorylation site, four casein kinase II phosphorylation sites and one amidation site in the POLR2H protein, further shaping advanced protein structure. The cDNA cloned was expressed in Escherichia coli, which indicated that POLR2H fusion with the N-terminally His-tagged form brought about the accumulation of an expected 20.5 kDa polypeptide in line with the predicted protein. On the basis of what has already been achieved in this study, further deep-in research will be conducted, which has great value in theory and practical significance.
ANN modeling of DNA sequences: new strategies using DNA shape code.

PubMed

Parbhane, R V; Tambe, S S; Kulkarni, B D

2000-09-01

Two new encoding strategies, namely, wedge and twist codes, which are based on the DNA helical parameters, are introduced to represent DNA sequences in artificial neural network (ANN)-based modeling of biological systems. The performance of the new coding strategies has been evaluated by conducting three case studies involving mapping (modeling) and classification applications of ANNs. The proposed coding schemes have been compared rigorously and shown to outperform the existing coding strategies especially in situations wherein limited data are available for building the ANN models.
Identification of Delta5-fatty acid desaturase from the cellular slime mold dictyostelium discoideum.

PubMed

Saito, T; Ochiai, H

1999-10-01

cDNA fragments putatively encoding amino acid sequences characteristic of the fatty acid desaturase were obtained using expressed sequence tag (EST) information of the Dictyostelium cDNA project. Using this sequence, we have determined the cDNA sequence and genomic sequence of a desaturase. The cloned cDNA is 1489 nucleotides long and the deduced amino acid sequence comprised 464 amino acid residues containing an N-terminal cytochrome b5 domain. The whole sequence was 38.6% identical to the initially identified Delta5-desaturase of Mortierella alpina. We have confirmed its function as Delta5-desaturase by over expression mutation in D. discoideum and also the gain of function mutation in the yeast Saccharomyces cerevisiae. Analysis of the lipids from transformed D. discoideum and yeast demonstrated the accumulation of Delta5-desaturated products. This is the first report concering fatty acid desaturase in cellular slime molds.
Noncoding sequence classification based on wavelet transform analysis: part I

NASA Astrophysics Data System (ADS)

Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

2017-09-01

DNA sequences in human genome can be divided into the coding and noncoding ones. Coding sequences are those that are read during the transcription. The identification of coding sequences has been widely reported in literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA elements) and Epigenomic Roadmap Project projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.
Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues.

PubMed Central

Prody, C A; Zevin-Sonkin, D; Gnatt, A; Goldberg, O; Soreq, H

1987-01-01

To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase (BtChoEase; EC 3.1.1.8) and Torpedo electric organ "true" acetylcholinesterase (AcChoEase; EC 3.1.1.7). Using these probes, we isolated several cDNA clones from lambda gt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. In RNA blots of poly(A)+ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These findings demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species. Images PMID:3035536
Survey of Navy Funded Marine Mammal Research and Studies FY 00-01

DTIC Science & Technology

2001-05-10

protein of canine distemper virus as a reporter system in order to evaluate 103 the humoral response to DNA-mediated vaccination in cetaceans. If...PCR/ RT PCR, DNA cloning and sequencing, etc. Efforts are ongoing to design and clone a vector encoding Canine Distemper Virus, a virus closely...alternative plasmid as our reporter gene delivery vector. This alternate plasmid will encode for Canine Distemper virus genes, closely related to
Isolation and characterization of a cDNA from Cuphea lanceolata encoding a beta-ketoacyl-ACP reductase.

PubMed

Klein, B; Pawlowski, K; Höricke-Grandpierre, C; Schell, J; Töpfer, R

1992-05-01

A cDNA encoding beta-ketoacyl-ACP reductase (EC 1.1.1.100), an integral part of the fatty acid synthase type II, was cloned from Cuphea lanceolata. This cDNA of 1276 bp codes for a polypeptide of 320 amino acids with 63 N-terminal residues presumably representing a transit peptide and 257 residues corresponding to the mature protein of 27 kDa. The encoded protein shows strong homology with the amino-terminal sequence and two tryptic peptides from avocado mesocarp beta-ketoacyl-ACP reductase, and its total amino acid composition is highly similar to those of the beta-ketoacyl-ACP reductases of avocado and spinach. Amino acid sequence homologies to polyketide synthase, beta-ketoreductases and short-chain alcohol dehydrogenases are discussed. An engineered fusion protein lacking most of the transit peptide, which was produced in Escherichia coli, was isolated and proved to possess beta-ketoacyl-ACP reductase activity. Hybridization studies revealed that in C. lanceolata beta-ketoacyl-ACP reductase is encoded by a small family of at least two genes and that members of this family are expressed in roots, leaves, flowers and seeds.
Isolation of Onchocerca lupi in Dogs and Black Flies, California, USA

PubMed Central

Hassan, Hassan K.; Bolcen, Shanna; Kubofcik, Joseph; Nutman, Thomas B.; Eberhard, Mark L.; Middleton, Kelly; Wekesa, Joseph Wakoli; Ruedas, Gimena; Nelson, Kimberly J.; Dubielzig, Richard; De Lombaert, Melissa; Silverman, Bruce; Schorling, Jamie J.; Adler, Peter H.; Beeler, Emily S.

2015-01-01

In southern California, ocular infections caused by Onchocerca lupi were diagnosed in 3 dogs (1 in 2006, 2 in 2012). The infectious agent was confirmed through morphologic analysis of fixed parasites in tissues and by PCR and sequencing of amplicons derived from 2 mitochondrially encoded genes and 1 nuclear-encoded gene. A nested PCR based on the sequence of the cytochrome oxidase subunit 1 gene of the parasite was developed and used to screen Simulium black flies collected from southern California for O. lupi DNA. Six (2.8%; 95% CI 0.6%–5.0%) of 213 black flies contained O. lupi DNA. Partial mitochondrial16S rRNA gene sequences from the infected flies matched sequences derived from black fly larvae cytotaxonomically identified as Simulium tribulatum. These data implicate S. tribulatum flies as a putative vector for O. lupi in southern California. PMID:25897954
Massive Collection of Full-Length Complementary DNA Clones and Microarray Analyses:. Keys to Rice Transcriptome Analysis

NASA Astrophysics Data System (ADS)

Kikuchi, Shoshi

2009-02-01

Completion of the high-precision genome sequence analysis of rice led to the collection of about 35,000 full-length cDNA clones and the determination of their complete sequences. Mapping of these full-length cDNA sequences has given us information on (1) the number of genes expressed in the rice genome; (2) the start and end positions and exon-intron structures of rice genes; (3) alternative transcripts; (4) possible encoded proteins; (5) non-protein-coding (np) RNAs; (6) the density of gene localization on the chromosome; (7) setting the parameters of gene prediction programs; and (8) the construction of a microarray system that monitors global gene expression. Manual curation for rice gene annotation by using mapping information on full-length cDNA and EST assemblies has revealed about 32,000 expressed genes in the rice genome. Analysis of major gene families, such as those encoding membrane transport proteins (pumps, ion channels, and secondary transporters), along with the evolution from bacteria to higher animals and plants, reveals how gene numbers have increased through adaptation to circumstances. Family-based gene annotation also gives us a new way of comparing organisms. Massive amounts of data on gene expression under many kinds of physiological conditions are being accumulated in rice oligoarrays (22K and 44K) based on full-length cDNA sequences. Cluster analyses of genes that have the same promoter cis-elements, that have similar expression profiles, or that encode enzymes in the same metabolic pathways or signal transduction cascades give us clues to understanding the networks of gene expression in rice. As a tool for that purpose, we recently developed "RiCES", a tool for searching for cis-elements in the promoter regions of clustered genes.

Signal sequence and keyword trap in silico for selection of full-length human cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries.

PubMed

Otsuki, Tetsuji; Ota, Toshio; Nishikawa, Tetsuo; Hayashi, Koji; Suzuki, Yutaka; Yamamoto, Jun-ichi; Wakamatsu, Ai; Kimura, Kouichi; Sakamoto, Katsuhiko; Hatano, Naoto; Kawai, Yuri; Ishii, Shizuko; Saito, Kaoru; Kojima, Shin-ichi; Sugiyama, Tomoyasu; Ono, Tetsuyoshi; Okano, Kazunori; Yoshikawa, Yoko; Aotsuka, Satoshi; Sasaki, Naokazu; Hattori, Atsushi; Okumura, Koji; Nagai, Keiichi; Sugano, Sumio; Isogai, Takao

2005-01-01

We have developed an in silico method of selection of human full-length cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries. Fullness rates were increased to about 80% by combination of the oligo-capping method and ATGpr, software for prediction of translation start point and the coding potential. Then, using 5'-end single-pass sequences, cDNAs having the signal sequence were selected by PSORT ('signal sequence trap'). We also applied 'secretion or membrane protein-related keyword trap' based on the result of BLAST search against the SWISS-PROT database for the cDNAs which could not be selected by PSORT. Using the above procedures, 789 cDNAs were primarily selected and subjected to full-length sequencing, and 334 of these cDNAs were finally selected as novel. Most of the cDNAs (295 cDNAs: 88.3%) were predicted to encode secretion or membrane proteins. In particular, 165(80.5%) of the 205 cDNAs selected by PSORT were predicted to have signal sequences, while 70 (54.2%) of the 129 cDNAs selected by 'keyword trap' preserved the secretion or membrane protein-related keywords. Many important cDNAs were obtained, including transporters, receptors, and ligands, involved in significant cellular functions. Thus, an efficient method of selecting secretion or membrane protein-encoding cDNAs was developed by combining the above four procedures.
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

PubMed

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
Porcine parvovirus: DNA sequence and genome organization.

PubMed

Ranz, A I; Manclús, J J; Díaz-Aroca, E; Casal, J I

1989-10-01

We have determined the nucleotide sequence of an almost full-length clone of porcine parvovirus (PPV). The sequence is 4973 nucleotides (nt) long. The 3' end of virion DNA shows a Y-shaped configuration homologous to rodent parvoviruses. The 5' end of virion DNA shows a repetition of 127 nt at the carboxy terminus of the capsid proteins. The overall organization of the PPV genome is similar to those of other autonomous parvoviruses. There are two large open reading frames (ORFs) that almost entirely cover the genome, both located in the same frame of the complementary strand. The left ORF encodes the non-structural protein NS1 and the right ORF encodes the capsid proteins (VP1, VP2 and VP3). Promoter analysis, location of splicing sites and putative amino acid sequences for the viral proteins show a high homology of PPV with feline panleukopenia virus and canine parvoviruses (FPV and CPV) and rodent parvovirus. Therefore we conclude that PPV is related to the Kilham rat virus (KRV) group of autonomous parvoviruses formed by KRV, minute virus of mice, Lu III, H-1, FPV and CPV.
Deep-sea vent phage DNA polymerase specifically initiates DNA synthesis in the absence of primers.

PubMed

Zhu, Bin; Wang, Longfei; Mitsunobu, Hitoshi; Lu, Xueling; Hernandez, Alfredo J; Yoshida-Takashima, Yukari; Nunoura, Takuro; Tabor, Stanley; Richardson, Charles C

2017-03-21

A DNA polymerase is encoded by the deep-sea vent phage NrS-1. NrS-1 has a unique genome organization containing genes that are predicted to encode a helicase and a single-stranded DNA (ssDNA)-binding protein. The gene for an unknown protein shares weak homology with the bifunctional primase-polymerases (prim-pols) from archaeal plasmids but is missing the zinc-binding domain typically found in primases. We show that this gene product has efficient DNA polymerase activity and is processive in DNA synthesis in the presence of the NrS-1 helicase and ssDNA-binding protein. Remarkably, this NrS-1 DNA polymerase initiates DNA synthesis from a specific template DNA sequence in the absence of any primer. The de novo DNA polymerase activity resides in the N-terminal domain of the protein, whereas the C-terminal domain enhances DNA binding.
Characterization of GM-CSF-inhibitory factor and Uracil DNA glycosylase encoding genes from camel pseudocowpoxvirus.

PubMed

Nagarajan, G; Swami, Shelesh Kumar; Dahiya, Shyam Singh; Narnaware, S D; Mehta, S C; Singh, P K; Singh, Raghvendar; Tuteja, F C; Patil, N V

2015-06-01

The present study describes the PCR amplification of GM-CSF-inhibitory factor (GIF) and Uracil DNA glycosylase (UDG) encoding genes of pseudocowpoxvirus (PCPV) from the Indian Dromedaries (Camelus dromedarius) infected with contagious ecthyma using the primers based on the corresponding gene sequences of human PCPV and reindeer PCPV, respectively. The length of GIF gene of PCPV obtained from camel is 795 bp and due to the addition of one cytosine residue at position 374 and one adenine residue at position 516, the open reading frame (ORF) got altered, resulting in the production of truncated polypeptide. The ORF of UDG encoding gene of camel PCPV is 696 bp encoding a polypeptide of 26.0 kDa. Comparison of amino acid sequence homologies of GIF and UDG of camel PCPV revealed that the camel PCPV is closer to ORFV and PCPV (reference stains of both human and reindeer), respectively. Copyright © 2015 Elsevier Ltd. All rights reserved.
Horizontal gene transfer of chromosomal Type II toxin-antitoxin systems of Escherichia coli.

PubMed

Ramisetty, Bhaskar Chandra Mohan; Santhosh, Ramachandran Sarojini

2016-02-01

Type II toxin-antitoxin systems (TAs) are small autoregulated bicistronic operons that encode a toxin protein with the potential to inhibit metabolic processes and an antitoxin protein to neutralize the toxin. Most of the bacterial genomes encode multiple TAs. However, the diversity and accumulation of TAs on bacterial genomes and its physiological implications are highly debated. Here we provide evidence that Escherichia coli chromosomal TAs (encoding RNase toxins) are 'acquired' DNA likely originated from heterologous DNA and are the smallest known autoregulated operons with the potential for horizontal propagation. Sequence analyses revealed that integration of TAs into the bacterial genome is unique and contributes to variations in the coding and/or regulatory regions of flanking host genome sequences. Plasmids and genomes encoding identical TAs of natural isolates are mutually exclusive. Chromosomal TAs might play significant roles in the evolution and ecology of bacteria by contributing to host genome variation and by moderation of plasmid maintenance. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
[Structural organization of 5S ribosomal DNA of Rosa rugosa].

PubMed

Tynkevych, Iu O; Volkov, R A

2014-01-01

In order to clarify molecular organization of the genomic region encoding 5S rRNA in diploid species Rosa rugosa several 5S rDNA repeated units were cloned and sequenced. Analysis of the obtained sequences revealed that only one length variant of 5S rDNA repeated units, which contains intact promoter elements in the intergenic spacer region (IGS) and appears to be transcriptionally active is present in the genome. Additionally, a limited number of 5S rDNA pseudogenes lacking a portion of coding sequence and the complete IGS was detected. A high level of sequence similarity (from 93.7 to 97.5%) between the IGS of major 5S rDNA variants of East Asian R. rugosa and North American R. nitida was found indicating comparatively recent divergence of these species.
Protection of CpG islands from DNA methylation is DNA-encoded and evolutionarily conserved.

PubMed

Long, Hannah K; King, Hamish W; Patient, Roger K; Odom, Duncan T; Klose, Robert J

2016-08-19

DNA methylation is a repressive epigenetic modification that covers vertebrate genomes. Regions known as CpG islands (CGIs), which are refractory to DNA methylation, are often associated with gene promoters and play central roles in gene regulation. Yet how CGIs in their normal genomic context evade the DNA methylation machinery and whether these mechanisms are evolutionarily conserved remains enigmatic. To address these fundamental questions we exploited a transchromosomic animal model and genomic approaches to understand how the hypomethylated state is formed in vivo and to discover whether mechanisms governing CGI formation are evolutionarily conserved. Strikingly, insertion of a human chromosome into mouse revealed that promoter-associated CGIs are refractory to DNA methylation regardless of host species, demonstrating that DNA sequence plays a central role in specifying the hypomethylated state through evolutionarily conserved mechanisms. In contrast, elements distal to gene promoters exhibited more variable methylation between host species, uncovering a widespread dependence on nucleotide frequency and occupancy of DNA-binding transcription factors in shaping the DNA methylation landscape away from gene promoters. This was exemplified by young CpG rich lineage-restricted repeat sequences that evaded DNA methylation in the absence of co-evolved mechanisms targeting methylation to these sequences, and species specific DNA binding events that protected against DNA methylation in CpG poor regions. Finally, transplantation of mouse chromosomal fragments into the evolutionarily distant zebrafish uncovered the existence of a mechanistically conserved and DNA-encoded logic which shapes CGI formation across vertebrate species. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
cDNA isolated from a human T-cell library encodes a member of the protein-tyrosine-phosphatase family

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cool, D.E.; Tonks, N.K.; Charbonneau, H.

1989-07-01

A human peripheral T-cell cDNA library was screened with two labeled synthetic oligonucleotides encoding regions of a human placenta protein-tyrosine-phosphatase. One positive clone was isolated and the nucleotide sequence was determined. It contained 1,305 base pairs of open reading frame followed by a TAA stop codon and 978 base pairs of 3{prime} untranslated end, although a poly(A){sup +} tail was not found. An initiator methionine residue was predicted at position 61, which would result in a protein of 415 amino acid residues. This was supported by the synthesis of a M{sub r} 48,000 protein in an in vitro reticulocyte lysatemore » translation system using RNA transcribed from the cloned cDNA and T7 RNA polymerase. The deduced amino acid sequence was compared to other known proteins revealing 65% identity to the low M{sub r} PTPase 1B isolated from placenta. In view of the high degree of similarity, the T-cell cDNA likely encodes a newly discovered protein-tyrosine-phosphatase, thus expanding this family of genes.« less
Isolation of complementary DNA clones encoding pathogenesis-related proteins P and Q, two acidic chitinases from tobacco.

PubMed Central

Payne, G; Ahl, P; Moyer, M; Harper, A; Beck, J; Meins, F; Ryals, J

1990-01-01

Complementary DNA clones encoding two isoforms of the acidic endochitinase (chitinase, EC 3.2.1.14) from tobacco were isolated. Comparison of amino acid sequences deduced from the cDNA clones and the sequence of peptides derived from purified proteins show that these clones encode the pathogenesis-related proteins PR-P and PR-Q. The cDNA inserts were not homologous to either the bacterial form of chitinase or the form from cucumber but shared significant homology to the basic form of chitinase from tobacco and bean. The acidic isoforms of tobacco chitinase did not contain the amino-terminal, cysteine-rich "hevein" domain found in the basic isoforms, indicating that this domain, which binds chitin, is not essential for chitinolytic activity. The accumulation of mRNA for the pathogenesis-related proteins PR-1, PR-R, PR-P, and PR-Q in Xanthi.nc tobacco leaves following infection with tobacco mosaic virus was measured by primer extension. The results indicate that the induction of these proteins during the local necrotic lesion response to the virus is coordinated at the mRNA level. Images PMID:2296608
Cloning and sequencing of a laccase gene from the lignin-degrading basidiomycete Pleurotus ostreatus.

PubMed Central

Giardina, P; Cannio, R; Martirani, L; Marzullo, L; Palmieri, G; Sannia, G

1995-01-01

The gene (pox1) encoding a phenol oxidase from Pleurotus ostreatus, a lignin-degrading basidiomycete, was cloned and sequenced, and the corresponding pox1 cDNA was also synthesized and sequenced. The isolated gene consists of 2,592 bp, with the coding sequence being interrupted by 19 introns and flanked by an upstream region in which putative CAAT and TATA consensus sequences could be identified at positions -174 and -84, respectively. The isolation of a second cDNA (pox2 cDNA), showing 84% similarity, and of the corresponding truncated genomic clones demonstrated the existence of a multigene family coding for isoforms of laccase in P. ostreatus. PCR amplifications of specific regions on the DNA of isolated monokaryons proved that the two genes are not allelic forms. The POX1 amino acid sequence deduced was compared with those of other known laccases from different fungi. PMID:7793961
Variation in DNA methylation is not consistently reflected by CpG depletion or sociality in Hymenoptera

USDA-ARS?s Scientific Manuscript database

Changes in gene regulation that underlie phenotypic evolution can be encoded directly in the DNA sequence or mediated by chromatin modifications such as DNA methylation. It has been hypothesized that the evolution of social behavior is associated with enhanced gene regulatory potential, which may in...
Biological sequence compression algorithms.

PubMed

Matsumoto, T; Sadakane, K; Imai, H

2000-01-01

Today, more and more DNA sequences are becoming available. The information about DNA sequences are stored in molecular biology databases. The size and importance of these databases will be bigger and bigger in the future, therefore this information must be stored or communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. The standard compression algorithms such as gzip or compress cannot compress DNA sequences, but only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known. One is called palindromes or reverse complements and the other structure is approximate repeats. Several specific algorithms for DNA sequences that use these structures can compress them less than two bits per symbol. In this paper, we improve the CTW so that characteristic structures of DNA sequences are available. Before encoding the next symbol, the algorithm searches an approximate repeat and palindrome using hash and dynamic programming. If there is a palindrome or an approximate repeat with enough length then our algorithm represents it with length and distance. By using this preprocessing, a new program achieves a little higher compression ratio than that of existing DNA-oriented compression algorithms. We also describe new compression algorithm for protein sequences.
Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase.

PubMed

Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V

2006-10-15

The 16,937-nuceotide sequence of the linear mitochondrial DNA (mt-DNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scypozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities; transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.
DNA as a Binary Code: How the Physical Structure of Nucleotide Bases Carries Information

ERIC Educational Resources Information Center

McCallister, Gary

2005-01-01

The DNA triplet code also functions as a binary code. Because double-ring compounds cannot bind to double-ring compounds in the DNA code, the sequence of bases classified simply as purines or pyrimidines can encode for smaller groups of possible amino acids. This is an intuitive approach to teaching the DNA code. (Contains 6 figures.)
Geranyl diphosphate synthase from mint

DOEpatents

Croteau, Rodney Bruce; Wildung, Mark Raymond; Burke, Charles Cullen; Gershenzon, Jonathan

1999-01-01

A cDNA encoding geranyl diphosphate synthase from peppermint has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID No:1) is provided which codes for the expression of geranyl diphosphate synthase (SEQ ID No:2) from peppermint (Mentha piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for geranyl diphosphate synthase or for a base sequence sufficiently complementary to at least a portion of the geranyl diphosphate synthase DNA or RNA to enable hybridization therewith (e.g., antisense geranyl diphosphate synthase RNA or fragments of complementary geranyl diphosphate synthase DNA which are useful as polymerase chain reaction primers or as probes for geranyl diphosphate synthase or related genes). In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding geranyl diphosphate synthase. Thus, systems and methods are provided for the recombinant expression of geranyl diphosphate synthase that may be used to facilitate the production, isolation and purification of significant quantities of recombinant geranyl diphosphate synthase for subsequent use, to obtain expression or enhanced expression of geranyl diphosphate synthase in plants in order to enhance the production of monoterpenoids, to produce geranyl diphosphate in cancerous cells as a precursor to monoterpenoids having anti-cancer properties or may be otherwise employed for the regulation or expression of geranyl diphosphate synthase or the production of geranyl diphosphate.
Geranyl diphosphate synthase from mint

DOEpatents

Croteau, R.B.; Wildung, M.R.; Burke, C.C.; Gershenzon, J.

1999-03-02

A cDNA encoding geranyl diphosphate synthase from peppermint has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID No:1) is provided which codes for the expression of geranyl diphosphate synthase (SEQ ID No:2) from peppermint (Mentha piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for geranyl diphosphate synthase or for a base sequence sufficiently complementary to at least a portion of the geranyl diphosphate synthase DNA or RNA to enable hybridization therewith (e.g., antisense geranyl diphosphate synthase RNA or fragments of complementary geranyl diphosphate synthase DNA which are useful as polymerase chain reaction primers or as probes for geranyl diphosphate synthase or related genes). In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding geranyl diphosphate synthase. Thus, systems and methods are provided for the recombinant expression of geranyl diphosphate synthase that may be used to facilitate the production, isolation and purification of significant quantities of recombinant geranyl diphosphate synthase for subsequent use, to obtain expression or enhanced expression of geranyl diphosphate synthase in plants in order to enhance the production of monoterpenoids, to produce geranyl diphosphate in cancerous cells as a precursor to monoterpenoids having anti-cancer properties or may be otherwise employed for the regulation or expression of geranyl diphosphate synthase or the production of geranyl diphosphate. 5 figs.
Expression of an Atriplex nummularia gene encoding a protein homologous to the bacterial molecular chaperone DnaJ.

PubMed

Zhu, J K; Shi, J; Bressan, R A; Hasegawa, P M

1993-03-01

DnaJ is a 36-kD heat shock protein that functions together with Dnak (Hsp70) as a molecular chaperone in Escherichia coli. We have obtained a cDNA clone from the higher plant Atriplex nummularia that encodes a 46.6-kD polypeptide (ANJ1) with an overall 35.2% amino acid sequence identity with the E. coli DnaJ. ANJ1 has 43.4% overall sequence identity with the Saccharomyces cerevisiae cytoplasmic DnaJ homolog YDJ1/MAS5. Complementation of the yeast mas5 mutation indicated that ANJ1 is a functional homolog of YDJ1/MAS5. The presence of other DnaJ homologs in A. nummularia was demonstrated by the detection of proteins that are antigenically related to the yeast mitochondrial DnaJ homolog SCJ1 and the yeast DnaJ-related protein Sec63. Expression of the ANJ1 gene was compared with that of an A. nummularia Hsp70 gene. Expression of both ANJ1 and Hsp70 transcripts was coordinately induced by heat shock. However, noncoordinate accumulation of ANJ1 and Hsp70 mRNAs occurred during the cell growth cycle and in response to NaCl stress.
Cloning and baculovirus expression of a desiccation stress gene from the beetle, Tenebrio molitor.

PubMed

Graham, L A; Bendena, W G; Walker, V K

1996-02-01

The cDNA sequence encoding a novel desiccation stress protein (dsp28) found in the hemolymph of the common yellow mealworm beetle, Tenebrio molitor, has been determined. The sequence encodes a 225 amino acid protein containing a 20 amino acid signal peptide. Dsp28 shows no significant similarity to any known nucleic acid or protein sequence. Levels of dsp28 mRNA were found to increase approx 5-fold following desiccation. Dsp28 cDNA has been cloned into a baculovirus expression vector and the expressed protein was compared to native dsp28. Both dsp28 expressed by recombinant baculovirus and native dsp28 are glycosylated and N-terminally processed. Although dsp28 is induced by cold in addition to desiccation stress, it does not contribute to the freezing point depression (thermal hysteresis) observed in Tenebrio hemolymph.
Cloning and characterization of the major histone H2A genes completes the cloning and sequencing of known histone genes of Tetrahymena thermophila.

PubMed Central

Liu, X; Gorovsky, M A

1996-01-01

A truncated cDNA clone encoding Tetrahymena thermophila histone H2A2 was isolated using synthetic degenerate oligonucleotide probes derived from H2A protein sequences of Tetrahymena pyriformis. The cDNA clone was used as a homologous probe to isolate a truncated genomic clone encoding H2A1. The remaining regions of the genes for H2A1 (HTA1) and H2A2 (HTA2) were then isolated using inverse PCR on circularized genomic DNA fragments. These partial clones were assembled into intact HTA1 and HTA2 clones. Nucleotide sequences of the two genes were highly homologous within the coding region but not in the noncoding regions. Comparison of the deduced amino acid sequences with protein sequences of T. pyriformis H2As showed only two and three differences respectively, in a total of 137 amino acids for H2A1, and 132 amino acids for H2A2, indicating the two genes arose before the divergence of these two species. The HTA2 gene contains a TAA triplet within the coding region, encoding a glutamine residue. In contrast with the T. thermophila HHO and HTA3 genes, no introns were identified within the two genes. The 5'- and 3'-ends of the histone H2A mRNAs; were determined by RNase protection and by PCR mapping using RACE and RLM-RACE methods. Both genes encode polyadenylated mRNAs and are highly expressed in vegetatively growing cells but only weakly expressed in starved cultures. With the inclusion of these two genes, T. thermophila is the first organism whose entire complement of known core and linker histones, including replication-dependent and basal variants, has been cloned and sequenced. PMID:8760889

Isolation and bacterial expression of a sesquiterpene synthase CDNA clone from peppermint(mentha .chi. piperita, L.) that produces the aphid alarm pheromone (E)-.beta.-farnesene

DOEpatents

Croteau, Rodney Bruce; Wildung, Mark Raymond; Crock, John E.

1999-01-01

A cDNA encoding (E)-.beta.-farnesene synthase from peppermint (Mentha piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of (E)-.beta.-farnesene synthase (SEQ ID NO:2), from peppermint (Mentha piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for (E)-.beta.-farnesene synthase, or for a base sequence sufficiently complementary to at least a portion of (E)-.beta.-farnesene synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding (E)-.beta.-farnesene synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant (E)-.beta.-farnesene synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant (E)-.beta.-farnesene synthase may be used to obtain expression or enhanced expression of (E)-.beta.-farnesene synthase in plants in order to enhance the production of (E)-.beta.-farnesene, or may be otherwise employed for the regulation or expression of (E)-.beta.-farnesene synthase, or the production of its product.
Cloning of a coconut endosperm cDNA encoding a 1-acyl-sn-glycerol-3-phosphate acyltransferase that accepts medium-chain-length substrates.

PubMed Central

Knutzon, D S; Lardizabal, K D; Nelsen, J S; Bleibaum, J L; Davies, H M; Metz, J G

1995-01-01

Immature coconut (Cocos nucifera) endosperm contains a 1-acyl-sn-glycerol-3-phosphate acyltransferase (LPAAT) activity that shows a preference for medium-chain-length fatty acyl-coenzyme A substrates (H.M. Davies, D.J. Hawkins, J.S. Nelsen [1995] Phytochemistry 39:989-996). Beginning with solubilized membrane preparations, we have used chromatographic separations to identify a polypeptide with an apparent molecular mass of 29 kD, whose presence in various column fractions correlates with the acyltransferase activity detected in those same fractions. Amino acid sequence data obtained from several peptides generated from this protein were used to isolate a full-length clone from a coconut endosperm cDNA library. Clone pCGN5503 contains a 1325-bp cDNA insert with an open reading frame encoding a 308-amino acid protein with a calculated molecular mass of 34.8 kD. Comparison of the deduced amino acid sequence of pCGN5503 to sequences in the data banks revealed significant homology to other putative LPAAT sequences. Expression of the coconut cDNA in Escherichia coli conferred upon those cells a novel LPAAT activity whose substrate activity profile matched that of the coconut enzyme. PMID:8552723
1,4-Benzoquinone reductase from Phanerochaete chrysosporium: cDNA cloning and regulation of expression

DOE Office of Scientific and Technical Information (OSTI.GOV)

Akileswaran, L.; Brock, B.J.; Cereghino, J.L.

1999-02-01

A cDNA clone encoding a quinone reductase (QR) from the white rot basidiomycete Phanerochaete chrysosporium was isolated and sequenced. The cDNA consisted of 1,007 nucleotides and a poly(A) tail and encoded a deduced protein containing 271 amino acids. The experimentally determined eight-amino-acid N-germinal sequence of the purified QR protein from P. chrysosporium matched amino acids 72 to 79 of the predicted translation product of the cDNA. The M{sub r} of the predicted translation product, beginning with Pro-72, was essentially identical to the experimentally determined M{sub r} of one monomer of the QR dimer, and this finding suggested that QR ismore » synthesized as a proenzyme. The results of in vitro transcription-translation experiments suggested that QR is synthesized as a proenzyme with a 71-amino-acid leader sequence. This leader sequence contains two potential KEX2 cleavage sites and numerous potential cleavage sites for dipeptidyl aminopeptidase. The QR activity in cultures of P. chrysosporium increased following the addition of 2-dimethoxybenzoquinone, vanillic acid, or several other aromatic compounds. An immunoblot analysis indicated that induction resulted in an increase in the amount of QR protein, and a Northern blot analysis indicated that this regulation occurs at the level of the qr mRNA.« less
[Cloning and sequence analysis of full-length cDNA of secoisolariciresinol dehydrogenase of Dysosma versipellis].

PubMed

Xu, Li; Ding, Zhi-Shan; Zhou, Yun-Kai; Tao, Xue-Fen

2009-06-01

To obtain the full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis by RACE PCR,then investigate the character of Secoisolariciresinol Dehydrogenase gene. The full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene was obtained by 3'-RACE and 5'-RACE from Dysosma versipellis. We first reported the full cDNA sequences of Secoisolariciresinol Dehydrogenase in Dysosma versipellis. The acquired gene was 991bp in full length, including 5' untranslated region of 42bp, 3' untranslated region of 112bp with Poly (A). The open reading frame (ORF) encoding 278 amino acid with molecular weight 29253.3 Daltons and isolectric point 6.328. The gene accession nucleotide sequence number in GeneBank was EU573789. Semi-quantitative RT-PCR analysis revealed that the Secoisolariciresinol Dehydrogenase gene was highly expressed in stem. Alignment of the amino acid sequence of Secoisolariciresinol Dehydrogenase indicated there may be some significant amino acid sequence difference among different species. Obtain the full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis.
Light regulation of the abundance of mRNA encoding a nucleolin-like protein localized in the nucleoli of pea nuclei.

PubMed Central

Tong, C G; Reichler, S; Blumenthal, S; Balk, J; Hsieh, H L; Roux, S J

1997-01-01

A cDNA encoding a nucleolar protein was selected from a pea (Pisum sativum) plumule library, cloned, and sequenced. The translated sequence of the cDNA has significant percent identity to Xenopus laevis nucleolin (31%), the alfalfa (Medicago sativa) nucleolin homolog (66%), and the yeast (Saccharomyces cerevisiae) nucleolin homolog (NSR1) (28%). It also has sequence patterns in its primary structure that are characteristic of all nucleolins, including an N-terminal acidic motif, RNA recognition motifs, and a C-terminal Gly- and Arg-rich domain. By immunoblot analysis, the polyclonal antibodies used to select the cDNA bind selectively to a 90-kD protein in purified pea nuclei and nucleoli and to an 88-kD protein in extracts of Escherichia coli expressing the cDNA. In immunolocalization assays of pea plumule cells, the antibodies stained primarily a region surrounding the fibrillar center of nucleoli, where animal nucleolins are typically found. Southern analysis indicated that the pea nucleolin-like protein is encoded by a single gene, and northern analysis showed that the labeled cDNA binds to a single band of RNA, approximately the same size and the cDNA. After irradiation of etiolated pea seedlings by red light, the mRNA level in plumules decreased during the 1st hour and then increased to a peak of six times the 0-h level at 12 h. Far-red light reversed this effect of red light, and the mRNA accumulation from red/far-red light irradiation was equal to that found in the dark control. This indicates that phytochrome may regulate the expression of this gene. PMID:9193096
Transcriptionally active PCR for antigen identification and vaccine development: in vitro genome-wide screening and in vivo immunogenicity

PubMed Central

Regis, David P.; Dobaño, Carlota; Quiñones-Olson, Paola; Liang, Xiaowu; Graber, Norma L.; Stefaniak, Maureen E.; Campo, Joseph J.; Carucci, Daniel J.; Roth, David A.; He, Huaping; Felgner, Philip L.; Doolan, Denise L.

2009-01-01

We have evaluated a technology called Transcriptionally Active PCR (TAP) for high throughput identification and prioritization of novel target antigens from genomic sequence data using the Plasmodium parasite, the causative agent of malaria, as a model. First, we adapted the TAP technology for the highly AT-rich Plasmodium genome, using well-characterized P. falciparum and P. yoelii antigens and a small panel of uncharacterized open reading frames from the P. falciparum genome sequence database. We demonstrated that TAP fragments encoding six well-characterized P. falciparum antigens and five well-characterized P. yoelii antigens could be amplified in an equivalent manner from both plasmid DNA and genomic DNA templates, and that uncharacterized open reading frames could also be amplified from genomic DNA template. Second, we showed that the in vitro expression of the TAP fragments was equivalent or superior to that of supercoiled plasmid DNA encoding the same antigen. Third, we evaluated the in vivo immunogenicity of TAP fragments encoding a subset of the model P. falciparum and P. yoelii antigens. We found that antigen-specific antibody and cellular immune responses induced by the TAP fragments in mice were equivalent or superior to those induced by the corresponding plasmid DNA vaccines. Finally, we developed and demonstrated proof-of-principle for an in vitro humoral immunoscreening assay for down-selection of novel target antigens. These data support the potential of a TAP approach for rapid high throughput functional screening and identification of potential candidate vaccine antigens from genomic sequence data. PMID:18164079
Transcriptionally active PCR for antigen identification and vaccine development: in vitro genome-wide screening and in vivo immunogenicity.

PubMed

Regis, David P; Dobaño, Carlota; Quiñones-Olson, Paola; Liang, Xiaowu; Graber, Norma L; Stefaniak, Maureen E; Campo, Joseph J; Carucci, Daniel J; Roth, David A; He, Huaping; Felgner, Philip L; Doolan, Denise L

2008-03-01

We have evaluated a technology called transcriptionally active PCR (TAP) for high throughput identification and prioritization of novel target antigens from genomic sequence data using the Plasmodium parasite, the causative agent of malaria, as a model. First, we adapted the TAP technology for the highly AT-rich Plasmodium genome, using well-characterized P. falciparum and P. yoelii antigens and a small panel of uncharacterized open reading frames from the P. falciparum genome sequence database. We demonstrated that TAP fragments encoding six well-characterized P. falciparum antigens and five well-characterized P. yoelii antigens could be amplified in an equivalent manner from both plasmid DNA and genomic DNA templates, and that uncharacterized open reading frames could also be amplified from genomic DNA template. Second, we showed that the in vitro expression of the TAP fragments was equivalent or superior to that of supercoiled plasmid DNA encoding the same antigen. Third, we evaluated the in vivo immunogenicity of TAP fragments encoding a subset of the model P. falciparum and P. yoelii antigens. We found that antigen-specific antibody and cellular immune responses induced by the TAP fragments in mice were equivalent or superior to those induced by the corresponding plasmid DNA vaccines. Finally, we developed and demonstrated proof-of-principle for an in vitro humoral immunoscreening assay for down-selection of novel target antigens. These data support the potential of a TAP approach for rapid high throughput functional screening and identification of potential candidate vaccine antigens from genomic sequence data.
Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jackson, P.J.; Walthers, E.A.; Richmond, K.L.

1997-04-01

PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less
ECB deacylase mutants

DOEpatents

Arnold, Frances H.; Shao, Zhixin; Zhao, Huimin; Giver, Lorraine J.

2002-01-01

A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.
Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family

PubMed Central

Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H

2005-01-01

Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476
Artificial Intelligence, DNA Mimicry, and Human Health.

PubMed

Stefano, George B; Kream, Richard M

2017-08-14

The molecular evolution of genomic DNA across diverse plant and animal phyla involved dynamic registrations of sequence modifications to maintain existential homeostasis to increasingly complex patterns of environmental stressors. As an essential corollary, driver effects of positive evolutionary pressure are hypothesized to effect concerted modifications of genomic DNA sequences to meet expanded platforms of regulatory controls for successful implementation of advanced physiological requirements. It is also clearly apparent that preservation of updated registries of advantageous modifications of genomic DNA sequences requires coordinate expansion of convergent cellular proofreading/error correction mechanisms that are encoded by reciprocally modified genomic DNA. Computational expansion of operationally defined DNA memory extends to coordinate modification of coding and previously under-emphasized noncoding regions that now appear to represent essential reservoirs of untapped genetic information amenable to evolutionary driven recruitment into the realm of biologically active domains. Additionally, expansion of DNA memory potential via chemical modification and activation of noncoding sequences is targeted to vertical augmentation and integration of an expanded cadre of transcriptional and epigenetic regulatory factors affecting linear coding of protein amino acid sequences within open reading frames.
Twenty-seven nonoverlapping zinc finger cDNAs from human T cells map to nine different chromosomes with apparent clustering.

PubMed Central

Huebner, K; Druck, T; Croce, C M; Thiesen, H J

1991-01-01

cDNA clones encoding zinc finger structures were isolated by screening Molt4 and Jurkat cDNA libraries with zinc finger consensus sequences. Candidate clones were partially sequenced to verify the presence of zinc finger-encoding regions; nonoverlapping cDNA clones were chosen on the basis of sequences and genomic hybridization pattern. Zinc finger structure-encoding clones, which were designated by the term "Kox" and a number from 1 to 32 and which were apparently unique (i.e., distinct from each other and distinct from those isolated by other laboratories), were chosen for mapping in the human genome. DNAs from rodent-human somatic cell hybrids retaining defined complements of human chromosomes were analyzed for the presence of each of the Kox genes. Correlation between the presence of specific human chromosome regions and specific Kox genes established the chromosomal locations. Multiple Kox loci were mapped to 7q (Kox 18 and 25 and a locus detected by both Kox 8 cDNA and Kox 27 cDNA), 8q24 5' to the myc locus (Kox 9 and 32), 10cen----q24 (Kox 2, 15, 19, 21, 30, and 31), 12q13-qter (Kox 1 and 20), 17p13 (Kox 11 and 26), and 19q (Kox 5, 6, 10, 22, 24, and 28). Single Kox loci were mapped to 7p22 (Kox 3), 18q12 (Kox 17), 19p (Kox 13), 22q11 between IG lambda and BCR-1 (locus detected by both Kox 8 cDNA and Kox 27 cDNA), and Xp (Kox 14). Several of the Kox loci map to regions in which other zinc finger structure-encoding loci have already been localized, indicating possible zinc finger gene clusters. In addition, Kox genes at 8q24, 17p13, and 22q11--and perhaps other Kox genes--are located near recurrent chromosomal translocation breakpoints. Others, such as those on 7p and 7q, may be near regions specifically active in T cells. Images Figure 4 Figure 5 Figure 2 Figure 3 PMID:2014798
[Replication of Streptomyces plasmids: the DNA nucleotide sequence of plasmid pSB 24.2].

PubMed

Bolotin, A P; Sorokin, A V; Aleksandrov, N N; Danilenko, V N; Kozlov, Iu I

1985-11-01

The nucleotide sequence of DNA in plasmid pSB 24.2, a natural deletion derivative of plasmid pSB 24.1 isolated from S. cyanogenus was studied. The plasmid amounted by its size to 3706 nucleotide pairs. The G-C composition was equal to 73 per cent. The analysis of the DNA structure in plasmid pSB 24.2 revealed the protein-encoding sequence of DNA, the continuity of which was significant for replication of the plasmid containing more than 1300 nucleotide pairs. The analysis also revealed two A-T-rich areas of DNA, the G-C composition of which was less than 55 per cent and a DNA area with a branched pin structure. The results may be of value in investigation of plasmid replication in actinomycetes and experimental cloning of DNA with this plasmid as a vector.
Research on Image Encryption Based on DNA Sequence and Chaos Theory

NASA Astrophysics Data System (ADS)

Tian Zhang, Tian; Yan, Shan Jun; Gu, Cheng Yan; Ren, Ran; Liao, Kai Xin

2018-04-01

Nowadays encryption is a common technique to protect image data from unauthorized access. In recent years, many scientists have proposed various encryption algorithms based on DNA sequence to provide a new idea for the design of image encryption algorithm. Therefore, a new method of image encryption based on DNA computing technology is proposed in this paper, whose original image is encrypted by DNA coding and 1-D logistic chaotic mapping. First, the algorithm uses two modules as the encryption key. The first module uses the real DNA sequence, and the second module is made by one-dimensional logistic chaos mapping. Secondly, the algorithm uses DNA complementary rules to encode original image, and uses the key and DNA computing technology to compute each pixel value of the original image, so as to realize the encryption of the whole image. Simulation results show that the algorithm has good encryption effect and security.
Cloning and sequence analysis of a full-length cDNA of SmPP1cb encoding turbot protein phosphatase 1 beta catalytic subunit

NASA Astrophysics Data System (ADS)

Qi, Fei; Guo, Huarong; Wang, Jian

2008-02-01

Reversible protein phosphorylation, catalyzed by protein kinases and phosphatases, is an important and versatile mechanism by which eukaryotic cells regulate almost all the signaling processes. Protein phosphatase 1 (PP1) is the first and well-characterized member of the protein serine/threonine phosphatase family. In the present study, a full-length cDNA encoding the beta isoform of the catalytic subunit of protein phosphatase 1(PP1cb), was for the first time isolated and sequenced from the skin tissue of flatfish turbot Scophthalmus maximus, designated SmPP1cb, by the rapid amplification of cDNA ends (RACE) technique. The cDNA sequence of SmPP1cb we obtained contains a 984 bp open reading frame (ORF), flanked by a complete 39 bp 5' untranslated region and 462 bp 3' untranslated region. The ORF encodes a putative 327 amino acid protein, and the N-terminal section of this protein is highly acidic, Met-Ala-Glu-Gly-Glu-Leu-Asp-Val-Asp, a common feature for PP1 catalytic subunit but absent in protein phosphatase 2B (PP2B). And its calculated molecular mass is 37 193 Da and pI 5.8. Sequence analysis indicated that, SmPP1cb is extremely conserved in both amino acid and nucleotide acid levels compared with the PP1cb of other vertebrates and invertebrates, and its Kozak motif contained in the 5'UTR around ATG start codon is GXXAXXGXX ATGG, which is different from mammalian in two positions A-6 and G-3, indicating the possibility of different initiation of translation in turbot, and also the 3'UTR of SmPP1cb is highly diverse in the sequence similarity and length compared with other animals, especially zebrafish. The cloning and sequencing of SmPP1cb gene lays a good foundation for the future work on the biological functions of PP1 in the flatfish turbot.
A fully decompressed synthetic bacteriophage øX174 genome assembled and archived in yeast.

PubMed

Jaschke, Paul R; Lieberman, Erica K; Rodriguez, Jon; Sierra, Adrian; Endy, Drew

2012-12-20

The 5386 nucleotide bacteriophage øX174 genome has a complicated architecture that encodes 11 gene products via overlapping protein coding sequences spanning multiple reading frames. We designed a 6302 nucleotide synthetic surrogate, øX174.1, that fully separates all primary phage protein coding sequences along with cognate translation control elements. To specify øX174.1f, a decompressed genome the same length as wild type, we truncated the gene F coding sequence. We synthesized DNA encoding fragments of øX174.1f and used a combination of in vitro- and yeast-based assembly to produce yeast vectors encoding natural or designer bacteriophage genomes. We isolated clonal preparations of yeast plasmid DNA and transfected E. coli C strains. We recovered viable øX174 particles containing the øX174.1f genome from E. coli C strains that independently express full-length gene F. We expect that yeast can serve as a genomic 'drydock' within which to maintain and manipulate clonal lineages of other obligate lytic phage. Copyright © 2012 Elsevier Inc. All rights reserved.
Development of PCR primers specific for the amplification and direct sequencing of gyrB genes from microbacteria, order Actinomycetales.

PubMed

Richert, Kathrin; Brambilla, Evelyne; Stackebrandt, Erko

2005-01-01

PCR primer sets were developed for the specific amplification and sequence analyses encoding the gyrase subunit B (gyrB) of members of the family Microbacteriaceae, class Actinobacteria. The family contains species highly related by 16S rRNA gene sequence analyses. In order to test if the gene sequence analysis of gyrB is appropriate to discriminate between closely related species, we evaluate the 16S rRNA gene phylogeny of its members. As the published universal primer set for gyrB failed to amplify the responding gene of the majority of the 80 type strains of the family, three new primer sets were identified that generated fragments with a composite sequence length of about 900 nt. However, the amplification of all three fragments was successful only in 25% of the 80 type strains. In this study, the substitution frequencies in genes encoding gyrase and 16S rDNA were compared for 10 strains of nine genera. The frequency of gyrB nucleotide substitution is significantly higher than that of the 16S rDNA, and no linear correlation exists between the similarities of both molecules among members of the Microbacteriaceae. The phylogenetic analyses using the gyrB sequences provide higher resolution than using 16S rDNA sequences and seem able to discriminate between closely related species.
Role of sequence encoded κB DNA geometry in gene regulation by Dorsal

PubMed Central

Mrinal, Nirotpal; Tomar, Archana; Nagaraju, Javaregowda

2011-01-01

Many proteins of the Rel family can act as both transcriptional activators and repressors. However, mechanism that discerns the ‘activator/repressor’ functions of Rel-proteins such as Dorsal (Drosophila homologue of mammalian NFκB) is not understood. Using genomic, biophysical and biochemical approaches, we demonstrate that the underlying principle of this functional specificity lies in the ‘sequence-encoded structure’ of the κB-DNA. We show that Dorsal-binding motifs exist in distinct activator and repressor conformations. Molecular dynamics of DNA-Dorsal complexes revealed that repressor κB-motifs typically have A-tract and flexible conformation that facilitates interaction with co-repressors. Deformable structure of repressor motifs, is due to changes in the hydrogen bonding in A:T pair in the ‘A-tract’ core. The sixth nucleotide in the nonameric κB-motif, ‘A’ (A6) in the repressor motifs and ‘T’ (T6) in the activator motifs, is critical to confer this functional specificity as A6 → T6 mutation transformed flexible repressor conformation into a rigid activator conformation. These results highlight that ‘sequence encoded κB DNA-geometry’ regulates gene expression by exerting allosteric effect on binding of Rel proteins which in turn regulates interaction with co-regulators. Further, we identified and characterized putative repressor motifs in Dl-target genes, which can potentially aid in functional annotation of Dorsal gene regulatory network. PMID:21890896
Characterization of the complete mitochondrial genomes of Nematodirus oiratianus and Nematodirus spathiger of small ruminants

PubMed Central

2014-01-01

Background Nematodirus spp. are among the most common nematodes of ruminants worldwide. N. oiratianus and N. spathiger are distributed worldwide as highly prevalent gastrointestinal nematodes, which cause emerging health problems and economic losses. Accurate identification of Nematodirus species is essential to develop effective control strategies for Nematodirus infection in ruminants. Mitochondrial DNA (mtDNA) could provide powerful genetic markers for identifying these closely related species and resolving phylogenetic relationships at different taxonomic levels. Methods In the present study, the complete mitochondrial (mt) genomes of N. oiratianus and N. spathiger from small ruminants in China were obtained using Long-range PCR and sequencing. Results The complete mt genomes of N. oiratianus and N. spathiger were 13,765 bp and 13,519 bp in length, respectively. Both mt genomes were circular and consisted of 36 genes, including 12 genes encoding proteins, 2 genes encoding rRNA, and 22 genes encoding tRNA. Phylogenetic analyses based on the concatenated amino acid sequence data of all 12 protein-coding genes by Bayesian inference (BI), Maximum likelihood (ML) and Maximum parsimony (MP) showed that the two Nematodirus species (Molineidae) were closely related to Dictyocaulidae. Conclusions The availability of the complete mtDNA sequences of N. oiratianus and N. spathiger not only provides new mtDNA sources for a better understanding of nematode mt genomics and phylogeny, but also provides novel and useful genetic markers for studying diagnosis, population genetics and molecular epidemiology of Nematodirus spp. in small ruminants. PMID:25015379
Characterization of the complete mitochondrial genomes of Nematodirus oiratianus and Nematodirus spathiger of small ruminants.

PubMed

Zhao, Guang-Hui; Jia, Yan-Qing; Cheng, Wen-Yu; Zhao, Wen; Bian, Qing-Qing; Liu, Guo-Hua

2014-07-11

Nematodirus spp. are among the most common nematodes of ruminants worldwide. N. oiratianus and N. spathiger are distributed worldwide as highly prevalent gastrointestinal nematodes, which cause emerging health problems and economic losses. Accurate identification of Nematodirus species is essential to develop effective control strategies for Nematodirus infection in ruminants. Mitochondrial DNA (mtDNA) could provide powerful genetic markers for identifying these closely related species and resolving phylogenetic relationships at different taxonomic levels. In the present study, the complete mitochondrial (mt) genomes of N. oiratianus and N. spathiger from small ruminants in China were obtained using Long-range PCR and sequencing. The complete mt genomes of N. oiratianus and N. spathiger were 13,765 bp and 13,519 bp in length, respectively. Both mt genomes were circular and consisted of 36 genes, including 12 genes encoding proteins, 2 genes encoding rRNA, and 22 genes encoding tRNA. Phylogenetic analyses based on the concatenated amino acid sequence data of all 12 protein-coding genes by Bayesian inference (BI), Maximum likelihood (ML) and Maximum parsimony (MP) showed that the two Nematodirus species (Molineidae) were closely related to Dictyocaulidae. The availability of the complete mtDNA sequences of N. oiratianus and N. spathiger not only provides new mtDNA sources for a better understanding of nematode mt genomics and phylogeny, but also provides novel and useful genetic markers for studying diagnosis, population genetics and molecular epidemiology of Nematodirus spp. in small ruminants.

Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing.

PubMed

Hargreaves, Adam D; Mulley, John F

2015-01-01

Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0-2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5' and 3' UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species.
Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing

PubMed Central

Hargreaves, Adam D.

2015-01-01

Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0–2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5′ and 3′ UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species. PMID:26623194
cDNA encoding a polypeptide including a hevein sequence

DOEpatents

Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

1999-05-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
cDNA encoding a polypeptide including a hev ein sequence

DOEpatents

Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

2000-07-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
cDNA encoding a polypeptide including a hevein sequence

DOEpatents

Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

1999-05-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74--79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli. 12 figs.
cDNA encoding a polypeptide including a hevein sequence

DOEpatents

Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

1995-03-21

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74--79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli. 11 figures.
Pseudomonas aeruginosa phage PaP1 DNA polymerase is an A-family DNA polymerase demonstrating ssDNA and dsDNA 3'-5' exonuclease activity.

PubMed

Liu, Binyan; Gu, Shiling; Liang, Nengsong; Xiong, Mei; Xue, Qizhen; Lu, Shuguang; Hu, Fuquan; Zhang, Huidong

2016-08-01

Most phages contain DNA polymerases, which are essential for DNA replication and propagation in infected host bacteria. However, our knowledge on phage-encoded DNA polymerases remains limited. This study investigated the function of a novel DNA polymerase of PaP1, which is the lytic phage of Pseudomonas aeruginosa. PaP1 encodes its sole DNA polymerase called Gp90 that was predicted as an A-family DNA polymerase with polymerase and 3'-5' exonuclease activities. The sequence of Gp90 is homologous but not identical to that of other A-family DNA polymerases, such as T7 DNA polymerases (Pol) and DNA Pol I. The purified Gp90 demonstrated a polymerase activity. The processivity of Gp90 in DNA replication and its efficiency in single-dNTP incorporation are similar to those of T7 Pol with processive thioredoxin (T7 Pol/trx). Gp90 can degrade ssDNA and dsDNA in 3'-5' direction at a similar rate, which is considerably lower than that of T7 Pol/trx. The optimized conditions for polymerization were a temperature of 37 °C and a buffer consisting of 40 mM Tris-HCl (pH 8.0), 30 mM MgCl2, and 200 mM NaCl. These studies on DNA polymerase encoded by PaP1 help advance our knowledge on phage-encoded DNA polymerases and elucidate PaP1 propagation in infected P. aeruginosa.
Functional metagenomics reveals novel β-galactosidases not predictable from gene sequences.

PubMed

Cheng, Jiujun; Romantsov, Tatyana; Engel, Katja; Doxey, Andrew C; Rose, David R; Neufeld, Josh D; Charles, Trevor C

2017-01-01

The techniques of metagenomics have allowed researchers to access the genomic potential of uncultivated microbes, but there remain significant barriers to determination of gene function based on DNA sequence alone. Functional metagenomics, in which DNA is cloned and expressed in surrogate hosts, can overcome these barriers, and make important contributions to the discovery of novel enzymes. In this study, a soil metagenomic library carried in an IncP cosmid was used for functional complementation for β-galactosidase activity in both Sinorhizobium meliloti (α-Proteobacteria) and Escherichia coli (γ-Proteobacteria) backgrounds. One β-galactosidase, encoded by six overlapping clones that were selected in both hosts, was identified as a member of glycoside hydrolase family 2. We could not identify ORFs obviously encoding possible β-galactosidases in 19 other sequenced clones that were only able to complement S. meliloti. Based on low sequence identity to other known glycoside hydrolases, yet not β-galactosidases, three of these ORFs were examined further. Biochemical analysis confirmed that all three encoded β-galactosidase activity. Lac36W_ORF11 and Lac161_ORF7 had conserved domains, but lacked similarities to known glycoside hydrolases. Lac161_ORF10 had neither conserved domains nor similarity to known glycoside hydrolases. Bioinformatic and structural modeling implied that Lac161_ORF10 protein represented a novel enzyme family with a five-bladed propeller glycoside hydrolase domain. By discovering founding members of three novel β-galactosidase families, we have reinforced the value of functional metagenomics for isolating novel genes that could not have been predicted from DNA sequence analysis alone.
Light-modulated abundance of an mRNA encoding a calmodulin-regulated, chromatin-associated NTPase in pea

NASA Technical Reports Server (NTRS)

Hsieh, H. L.; Tong, C. G.; Thomas, C.; Roux, S. J.

1996-01-01

A CDNA encoding a 47 kDa nucleoside triphosphatase (NTPase) that is associated with the chromatin of pea nuclei has been cloned and sequenced. The translated sequence of the cDNA includes several domains predicted by known biochemical properties of the enzyme, including five motifs characteristic of the ATP-binding domain of many proteins, several potential casein kinase II phosphorylation sites, a helix-turn-helix region characteristic of DNA-binding proteins, and a potential calmodulin-binding domain. The deduced primary structure also includes an N-terminal sequence that is a predicted signal peptide and an internal sequence that could serve as a bipartite-type nuclear localization signal. Both in situ immunocytochemistry of pea plumules and immunoblots of purified cell fractions indicate that most of the immunodetectable NTPase is within the nucleus, a compartment proteins typically reach through nuclear pores rather than through the endoplasmic reticulum pathway. The translated sequence has some similarity to that of human lamin C, but not high enough to account for the earlier observation that IgG against human lamin C binds to the NTPase in immunoblots. Northern blot analysis shows that the NTPase MRNA is strongly expressed in etiolated plumules, but only poorly or not at all in the leaf and stem tissues of light-grown plants. Accumulation of NTPase mRNA in etiolated seedlings is stimulated by brief treatments with both red and far-red light, as is characteristic of very low-fluence phytochrome responses. Southern blotting with pea genomic DNA indicates the NTPase is likely to be encoded by a single gene.
Complementary DNA characterization and chromosomal localization of a human gene related to the poliovirus receptor-encoding gene.

PubMed

Lopez, M; Eberlé, F; Mattei, M G; Gabert, J; Birg, F; Bardin, F; Maroc, C; Dubreuil, P

1995-04-03

The human poliovirus (PV) receptor (PVR) is a member of the immunoglobulin (Ig) superfamily with unknown cellular function. We have isolated a human PVR-related (PRR) cDNA. The deduced amino acid (aa) sequence of PRR showed, in the extracellular region, 51.7 and 54.3% similarity with human PVR and with the murine PVR homolog, respectively. The cDNA coding sequence is 1.6-kb long and encodes a deduced 57-kDa protein; this protein has a structural organization analogous to that of PVR, that is, one V- and two C-set Ig domains, with a conserved number of aa. Northern blot analysis indicated that a major 5.9-kb transcript is present in all normal human tissues tested. In situ hybridization showed that the PRR gene is located at bands q23-q24 of human chromosome 11.
cDNA cloning of Brassica napus malonyl-CoA:ACP transacylase (MCAT) (fab D) and complementation of an E. coli MCAT mutant.

PubMed

Simon, J W; Slabas, A R

1998-09-18

The GenBank database was searched using the E. coli malonyl CoA:ACP transacylase (MCAT) sequence, for plant protein/cDNA sequences corresponding to MCAT, a component of plant fatty acid synthetase (FAS), for which the plant cDNA has not been isolated. A 272-bp Zea mays EST sequence (GenBank accession number: AA030706) was identified which has strong homology to the E. coli MCAT. A PCR derived cDNA probe from Zea mays was used to screen a Brassica napus (rape) cDNA library. This resulted in the isolation of a 1200-bp cDNA clone which encodes an open reading frame corresponding to a protein of 351 amino acids. The protein shows 47% homology to the E. coli MCAT amino acid sequence in the coding region for the mature protein. Expression of a plasmid (pMCATrap2) containing the plant cDNA sequence in Fab D89, an E. coli mutant, in MCAT activity restores growth demonstrating functional complementation and direct function of the cloned cDNA. This is the first functional evidence supporting the identification of a plant cDNA for MCAT.
Non-essential MCM-related proteins mediate a response to DNA damage in the archaeon Methanococcus maripaludis.

PubMed

Walters, Alison D; Chong, James P J

2017-05-01

The single minichromosome maintenance (MCM) protein found in most archaea has been widely studied as a simplified model for the MCM complex that forms the catalytic core of the eukaryotic replicative helicase. Organisms of the order Methanococcales are unusual in possessing multiple MCM homologues. The Methanococcus maripaludis S2 genome encodes four MCM homologues, McmA-McmD. DNA helicase assays reveal that the unwinding activity of the three MCM-like proteins is highly variable despite sequence similarities and suggests additional motifs that influence MCM function are yet to be identified. While the gene encoding McmA could not be deleted, strains harbouring individual deletions of genes encoding each of the other MCMs display phenotypes consistent with these proteins modulating DNA damage responses. M. maripaludis S2 is the first archaeon in which MCM proteins have been shown to influence the DNA damage response.
Two circular chromosomes of unequal copy number make up the mitochondrial genome of the rotifer Brachionus plicatilis.

PubMed

Suga, Koushirou; Mark Welch, David B; Tanaka, Yukari; Sakakura, Yoshitaka; Hagiwara, Atsushi

2008-06-01

The monogonont rotifer Brachionus plicatilis is an emerging model system for a diverse array of questions in limnological ecosystem dynamics, the evolution of sexual recombination, cryptic speciation, and the phylogeny of basal metazoans. We sequenced the complete mitochondrial genome of B. plicatilis sensu strictu NH1L and found that it is composed of 2 circular chromosomes, designated mtDNA-I (11,153 bp) and mtDNA-II (12,672 bp). Hybridization to DNA isolated from mitochondria demonstrated that mtDNA-I is present at 4 times the copy number of mtDNA-II. The only nucleotide similarity between the 2 chromosomes is a 4.9-kbp region of 99.5% identity including a transfer RNA (tRNA) gene and an extensive noncoding region that contains putative D-loop and control sequence. The mtDNA-I chromosome encodes 4 proteins (ATP6, COB, NAD1, and NAD2), 13 tRNAs, and the large and small subunit ribosomal RNAs; mtDNA-II encodes 8 proteins (COX1-3, NAD3-6, and NAD4L) and 9 tRNAs. Gene order is not conserved between B. plicatilis and its closest relative with a sequenced mitochondrial genome, the acanthocephalan Leptorhynchoides thecatus, or other sequenced mitochondrial genomes. Polymerase chain reaction assays and Southern hybridization to DNA from 18 strains of Brachionus suggest that the 2-chromosome structure has been stable for millions of years. The novel organization of the B. plicatilis mitochondrial genome into 2 nearly equal chromosomes of 4-fold different copy number may provide insight into the evolution of metazoan mitochondria and the phylogenetics of rotifers and other basal animal phyla.
Small gene family encoding an eggshell (chorion) protein of the human parasite Schistosoma mansoni

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bobek, L.A.; Rekosh, D.M.; Lo Verde, P.T.

1988-08-01

The authors isolated six independent genomic clones encoding schistosome chorion or eggshell proteins from a Schistosoma mansoni genomic library. A linkage map of five of the clones spanning 35 kilobase pairs (kbp) of the S. mansoni genome was constructed. The region contained two eggshell protein genes closely linked, separated by 7.5 kbp of intergenic DNA. The two genes of the cluster were arranged in the same orientation, that is, they were transcribed from the same strand. The sixth clone probably represents a third copy of the eggshell gene that is not contained within the 35-kbp region. The 5- end ofmore » the mRNA transcribed from these genes was defined by primer extension directly off the RNA. The ATCAT cap site sequence was homologous to a silkmoth chorion PuTCATT cap site sequence, where Pu indicates any purine. DNA sequence analysis showed that there were no introns in these genes. The DNA sequences of the three genes were very homologous to each other and to a cDNA clone, pSMf61-46, differing only in three or four nucleotices. A multiple TATA box was located at positions -23 to -31, and a CAAAT sequence was located at -52 upstream of the eggshell transcription unit. Comparison of sequences in regions further upstream with silkmoth and Drosophila sequences revealed very short elements that were shared. One such element, TCACGT, recently shown to be an essential cis-regulatory element for silkmoth chorion gene promoter function, was found at a similar position in all three organisms.« less
The utility of DNA sequences of an intron from the beta-fibrinogen gene in phylogenetic analysis of woodpeckers (Aves: Picidae).

PubMed

Prychitko, T M; Moore, W S

1997-10-01

Estimating phylogenies from DNA sequence data has become the major methodology of molecular phylogenetics. To date, molecular phylogenetics of the vertebrates has been very dependent on mtDNA, but studies involving mtDNA are limited because the several genes comprising the mt-genome are inherited as a single linkage group. The only apparent solution to this problem is to sequence additional genes, each representing a distinct linkage group, so that the resultant gene trees provide independent estimates of the species tree. There exists the need to find novel gene sequences which contain enough phylogenetic information to resolve relationships between closely related species. A possible source is the nuclear-encoded introns, because they evolve more rapidly than exons. We designed primers to amplify and sequence the 7 intron from the beta-fibrinogen gene for a recently evolved group, the woodpeckers. We sequenced the entire intron for 10 specimens representing five species. Nucleotide substitutions are randomly distributed along the length of the intron, suggesting selective neutrality. A preliminary analysis indicates that the phylogenetic signal in the intron is as strong as that in the mitochondrial encoded cytochrome b (cyt b) gene. The topology of the beta-fibrinogen tree is identical to that of the cyt b tree. This analysis demonstrates the ability of the 7 intron of beta-fibrinogen to provide well resolved, independent gene trees for recently evolved groups and establishes it as a source of sequences to be used in other phylogenetic studies. Copyright 1997 Academic Press
Identification of a maize nucleic acid-binding protein (NBP) belonging to a family of nuclear-encoded chloroplast proteins.

PubMed Central

Cook, W B; Walker, J C

1992-01-01

A cDNA encoding a nuclear-encoded chloroplast nucleic acid-binding protein (NBP) has been isolated from maize. Identified as an in vitro DNA-binding activity, NBP belongs to a family of nuclear-encoded chloroplast proteins which share a common domain structure and are thought to be involved in posttranscriptional regulation of chloroplast gene expression. NBP contains an N-terminal chloroplast transit peptide, a highly acidic domain and a pair of ribonucleoprotein consensus sequence domains. NBP is expressed in a light-dependent, organ-specific manner which is consistent with its involvement in chloroplast biogenesis. The relationship of NBP to the other members of this protein family and their possible regulatory functions are discussed. Images PMID:1346929
The organisation and interviral homologies of genes at the 3' end of tobacco rattle virus RNA1

PubMed Central

Boccara, Martine; Hamilton, William D. O.; Baulcombe, David C.

1986-01-01

The RNA1 of tobacco rattle virus (TRV) has been cloned as cDNA and the nucleotide sequence determined of 2 kb from the 3'-terminal region. The sequence contains three long open reading frames. One of these starts 5' of the cDNA and probably corresponds to the carboxy-terminal sequence of a 170-K protein encoded on RNA1. The deduced protein sequence from this reading frame shows homology with the putative replicases of tobacco mosaic virus (TMV) and tricornaviruses. The location of the second open reading frame, which encodes a 29-K polypeptide, was shown by Northern blot analysis to coincide with a 1.6-kb subgenomic RNA. The validity of this reading frame was confirmed by showing that the cDNA extending over this region could be transcribed and translated in vitro to produce a polypeptide of the predicted size which co-migrates in electrophoresis with a translation product of authentic viral RNA. The sequence of this 29-K polypeptide showed homology with two regions in the 30-K protein of TMV. This homology includes positions in the TMV 30-K protein where mutations have been identified which affect the transport of virus between cells. The third open reading frame encodes a potential 16-K protein and was shown by Northern blot hybridisation to be contained within the region of a 0.7-kb subgenomic RNA which is found in cellular RNA of infected cells but not virus particles. The many similarities between TRV and TMV in viral morphology, gene organisation and sequence suggest that these two viral groups may share a common viral ancestor. ImagesFig. 2.Fig. 3. PMID:16453668
cDNA sequence and expression of a cold-responsive gene in Citrus unshiu.

PubMed

Hara, M; Wakasugi, Y; Ikoma, Y; Yano, M; Ogawa, K; Kuboi, T

1999-02-01

A cDNA clone encoding a protein (CuCOR19), the sequence of which is similar to Poncirus COR19, of the dehydrin family was isolated from the epicarp of Citrus unshiu. The molecular mass of the predicted protein was 18,980 daltons. CuCOR19 was highly hydrophilic and contained three repeating elements including Lys-rich motifs. The gene expression in leaves increased by cold stress.
Molecular cloning and characterization of RGA1 encoding a G protein alpha subunit from rice (Oryza sativa L. IR-36).

PubMed

Seo, H S; Kim, H Y; Jeong, J Y; Lee, S Y; Cho, M J; Bahk, J D

1995-03-01

A cDNA clone, RGA1, was isolated by using a GPA1 cDNA clone of Arabidopsis thaliana G protein alpha subunit as a probe from a rice (Oryza sativa L. IR-36) seedling cDNA library from roots and leaves. Sequence analysis of genomic clone reveals that the RGA1 gene has 14 exons and 13 introns, and encodes a polypeptide of 380 amino acid residues with a calculated molecular weight of 44.5 kDa. The encoded protein exhibits a considerable degree of amino acid sequence similarity to all the other known G protein alpha subunits. A putative TATA sequence (ATATGA), a potential CAAT box sequence (AGCAATAC), and a cis-acting element, CCACGTGG (ABRE), known to be involved in ABA induction are found in the promoter region. The RGA1 protein contains all the consensus regions of G protein alpha subunits except the cysteine residue near the C-terminus for ADP-ribosylation by pertussis toxin. The RGA1 polypeptide expressed in Escherichia coli was, however, ADP-ribosylated by 10 microM [adenylate-32P] NAD and activated cholera toxin. Southern analysis indicates that there are no other genes similar to the RGA1 gene in the rice genome. Northern analysis reveals that the RGA1 mRNA is 1.85 kb long and expressed in vegetative tissues, including leaves and roots, and that its expression is regulated by light.
cap alpha. /sub i/-3 cDNA encodes the. cap alpha. subunit of G/sub k/, the stimulatory G protein of receptor-regulated K/sup +/ channels

DOE Office of Scientific and Technical Information (OSTI.GOV)

Codina, J.; Olate, J.; Abramowitz, J.

1988-05-15

cDNA cloning has identified the presence in the human genome of three genes encoding ..cap alpha.. subunits of pertussis toxin substrates, generically called G/sub i/. They are named ..cap alpha../sub i/-1, ..cap alpha../sub i/-2 and ..cap alpha../sub i/-3. However, none of these genes has been functionally identified with any of the ..cap alpha.. subunits of several possible G proteins, including pertussis toxin-sensitive G/sub p/'s, stimulatory to phospholipase C or A/sub 2/, G/sub i/, inhibitory to adenylyl cyclase, or G/sub k/, stimulatory to a type of K/sup +/ channels. The authors now report the nucleotide sequence and the complete predicted aminomore » acid sequence of human liver ..cap alpha../sub i/-3 and the partial amino acid sequence of proteolytic fragments of the ..cap alpha.. subunit of human erythrocyte G/sub k/. The amino acid sequence of the proteolytic fragment is uniquely encoded by the cDNA of ..cap alpha../sub i/-3, thus identifying it as ..cap alpha../sub k/. The probable identity of ..cap alpha../sub i/-1 with ..cap alpha../sub p/ and possible roles for ..cap alpha../sub i/-2, as well as additional roles for ..cap alpha../sub i/-1 and ..cap alpha../sub i/-3 (..cap alpha../sub k/) are discussed.« less

Mavericks, a novel class of giant transposable elements widespread in eukaryotes and related to DNA viruses.

PubMed

Pritham, Ellen J; Putliwala, Tasneem; Feschotte, Cédric

2007-04-01

We previously identified a group of atypical mobile elements designated Mavericks from the nematodes Caenorhabditis elegans and C. briggsae and the zebrafish Danio rerio. Here we present the results of comprehensive database searches of the genome sequences available, which reveal that Mavericks are widespread in invertebrates and non-mammalian vertebrates but show a patchy distribution in non-animal species, being present in the fungi Glomus intraradices and Phakopsora pachyrhizi and in several single-celled eukaryotes such as the ciliate Tetrahymena thermophila, the stramenopile Phytophthora infestans and the trichomonad Trichomonas vaginalis, but not detectable in plants. This distribution, together with comparative and phylogenetic analyses of Maverick-encoded proteins, is suggestive of an ancient origin of these elements in eukaryotes followed by lineage-specific losses and/or recurrent episodes of horizontal transmission. In addition, we report that Maverick elements have amplified recently to high copy numbers in T. vaginalis where they now occupy as much as 30% of the genome. Sequence analysis confirms that most Mavericks encode a retroviral-like integrase, but lack other open reading frames typically found in retroelements. Nevertheless, the length and conservation of the target site duplication created upon Maverick insertion (5- or 6-bp) is consistent with a role of the integrase-like protein in the integration of a double-stranded DNA transposition intermediate. Mavericks also display long terminal-inverted repeats but do not contain ORFs similar to proteins encoded by DNA transposons. Instead, Mavericks encode a conserved set of 5 to 9 genes (in addition to the integrase) that are predicted to encode proteins with homology to replication and packaging proteins of some bacteriophages and diverse eukaryotic double-stranded DNA viruses, including a DNA polymerase B homolog and putative capsid proteins. Based on these and other structural similarities, we speculate that Mavericks represent an evolutionary missing link between seemingly disparate invasive DNA elements that include bacteriophages, adenoviruses and eukaryotic linear plasmids.
Overexpression of Nrp/b (nuclear restrict protein in brain) suppresses the malignant phenotype in the C6/ST1 glioma cell line.

PubMed

Degaki, Theri Leica; Demasi, Marcos Angelo Almeida; Sogayar, Mari Cleide

2009-11-01

Upon searching for glucocorticoid-regulated cDNA sequences associated with the transformed to normal phenotypic reversion of C6/ST1 rat glioma cells, we identified Nrp/b (nuclear restrict protein in brain) as a novel rat gene. Here we report on the identification and functional characterization of the complete sequence encoding the rat NRP/B protein. The cloned cDNA presented a 1767 nucleotides open-reading frame encoding a 589 amino acids residues sequence containing a BTB/POZ (broad complex Tramtrack bric-a-brac/Pox virus and zinc finger) domain in its N-terminal region and kelch motifs in its C-terminal region. Sequence analysis indicates that the rat Nrp/b displays a high level of identity with the equivalent gene orthologs from other organisms. Among rat tissues, Nrp/b expression is more pronounced in brain tissue. We show that overexpression of the Nrp/b cDNA in C6/ST1 cells suppresses anchorage independence in vitro and tumorigenicity in vivo, altering their malignant nature towards a more benign phenotype. Therefore, Nrp/b may be postulated as a novel tumor suppressor gene, with possible relevance for glioblastoma therapy.
Molecular cloning and nucleotide sequence of a transforming gene detected by transfection of chicken B-cell lymphoma DNA

NASA Astrophysics Data System (ADS)

Goubin, Gerard; Goldman, Debra S.; Luce, Judith; Neiman, Paul E.; Cooper, Geoffrey M.

1983-03-01

A transforming gene detected by transfection of chicken B-cell lymphoma DNA has been isolated by molecular cloning. It is homologous to a conserved family of sequences present in normal chicken and human DNAs but is not related to transforming genes of acutely transforming retroviruses. The nucleotide sequence of the cloned transforming gene suggests that it encodes a protein that is partially homologous to the amino terminus of transferrin and related proteins although only about one tenth the size of transferrin.
Frame-Insensitive Expression Cloning of Fluorescent Protein from Scolionema suvaense.

PubMed

Horiuchi, Yuki; Laskaratou, Danai; Sliwa, Michel; Ruckebusch, Cyril; Hatori, Kuniyuki; Mizuno, Hideaki; Hotta, Jun-Ichi

2018-01-26

Expression cloning from cDNA is an important technique for acquiring genes encoding novel fluorescent proteins. However, the probability of in-frame cDNA insertion following the first start codon of the vector is normally only 1/3, which is a cause of low cloning efficiency. To overcome this issue, we developed a new expression plasmid vector, pRSET-TriEX, in which transcriptional slippage was induced by introducing a DNA sequence of (dT) 14 next to the first start codon of pRSET. The effectiveness of frame-insensitive cloning was validated by inserting the gene encoding eGFP with all three possible frames to the vector. After transformation with one of these plasmids, E. coli cells expressed eGFP with no significant difference in the expression level. The pRSET-TriEX vector was then used for expression cloning of a novel fluorescent protein from Scolionema suvaense . We screened 3658 E. coli colonies transformed with pRSET-TriEX containing Scolionema suvaense cDNA, and found one colony expressing a novel green fluorescent protein, ScSuFP. The highest score in protein sequence similarity was 42% with the chain c of multi-domain green fluorescent protein like protein "ember" from Anthoathecata sp. Variations in the N- and/or C-terminal sequence of ScSuFP compared to other fluorescent proteins indicate that the expression cloning, rather than the sequence similarity-based methods, was crucial for acquiring the gene encoding ScSuFP. The absorption maximum was at 498 nm, with an extinction efficiency of 1.17 × 10⁵ M -1 ·cm -1 . The emission maximum was at 511 nm and the fluorescence quantum yield was determined to be 0.6. Pseudo-native gel electrophoresis showed that the protein forms obligatory homodimers.
Molecular cloning and expression of rat brain endopeptidase 3.4.24.16.

PubMed

Dauch, P; Vincent, J P; Checler, F

1995-11-10

We have isolated by immunological screening of a lambda ZAPII cDNA library constructed from rat brain mRNAs a cDNA clone encoding endopeptidase 3.4.24.16. The longest open reading frame encodes a 704-amino acid protein with a theoretical molecular mass of 80,202 daltons and bears the consensus sequence of the zinc metalloprotease family. The sequence exhibits a 60.2% homology with those of another zinc metallopeptidase, endopeptidase 3.4.24.15. Northern blot analysis reveals two mRNA species of about 3 and 5 kilobases in rat brain, ileum, kidney, and testis. We have transiently transfected COS-7 cells with pcDNA3 containing the cloned cDNA and established the overexpression of a 70-75-kDa immunoreactive protein. This protein hydrolyzes QFS, a quenched fluorimetric substrate of endopeptidase 3.4.24.16, and cleaves neurotensin at a single peptide bond, leading to the formation of neurotensin (1-10) and neurotensin (11-13). QFS and neurotensin hydrolysis are potently inhibited by the selective endopeptidase 3.4.24.16 dipeptide blocker Pro-Ile and by dithiothreitol, while the enzymatic activity remains unaffected by phosphoramidon and captopril, the specific inhibitors of endopeptidase 3.4.24.11 and angiotensin-converting enzyme, respectively. Altogether, these physicochemical, biochemical, and immunological properties unambiguously identify endopeptidase 3.4.24.16 as the protein encoded by the isolated cDNA clone.
Manipulation of oligonucleotides immobilized on solid supports - DNA computations on surfaces

NASA Astrophysics Data System (ADS)

Liu, Qinghua

The manipulation of DNA oligonucleotides immobilized on various solid supports has been studied intensively, especially in the area of surface hybridization. Recently, surface-based biotechnology has been applied to the area of molecular computing. These surface-based methods have advantages with regard to ease of handling, facile purification, and less interference when compared to solution methodologies. This dissertation describes the investigation of molecular approaches to DNA computing. The feasibility of encoding a bit (0 or 1) of information for DNA-based computations at the single nucleotide level was studied, particularly with regard to the efficiency and specificity of hybridization discrimination. Both gold and glass surfaces, with addressed arrays of 32 oligonucleotides, were employed with similar hybridization results. Although single-base discrimination may be achieved in the system, it is at the cost of a severe decrease in the efficiency of hybridization to perfectly matched sequences. This compromises the utility of single nucleotide encoding for DNA computing applications in the absence of some additional mechanism for increasing specificity. Several methods are suggested including a multiple-base encoding strategy. The multiple-base encoding strategy was employed to develop a prototype DNA computer. The approach was demonstrated by solving a small example of the Satisfiability (SAT) problem, an NP-complete problem in Boolean logic. 16 distinct DNA oligonucleotides, encoding all candidate solutions to the 4-variable-4-clause-3-SAT problem, were immobilized on a gold surface in the non-addressed format. Four cycles of MARK (hybridization), DESTROY (enzymatic destruction) and UNMARK (denaturation) were performed, which identified and eliminated members of the set which were not solutions to the problem. Determination of the answer was accomplished in the READOUT (sequence identification) operation by PCR amplification of the remaining molecules and hybridization to an addressed array. Four answers were determined and the S/N ratio between correct and incorrect solutions ranged from 10 to 777, making discrimination between correct and incorrect solutions to the problem straightforward. Additionally, studies of enzymatic manipulations of DNA molecules on surfaces suggested the use of E. coli Exonuclease I (Exo I) and perhaps EarI in the DESTROY operation.
Phylogenetic characterization of a biogas plant microbial community integrating clone library 16S-rDNA sequences and metagenome sequence data obtained by 454-pyrosequencing.

PubMed

Kröber, Magdalena; Bekel, Thomas; Diaz, Naryttza N; Goesmann, Alexander; Jaenicke, Sebastian; Krause, Lutz; Miller, Dimitri; Runte, Kai J; Viehöver, Prisca; Pühler, Alfred; Schlüter, Andreas

2009-06-01

The phylogenetic structure of the microbial community residing in a fermentation sample from a production-scale biogas plant fed with maize silage, green rye and liquid manure was analysed by an integrated approach using clone library sequences and metagenome sequence data obtained by 454-pyrosequencing. Sequencing of 109 clones from a bacterial and an archaeal 16S-rDNA amplicon library revealed that the obtained nucleotide sequences are similar but not identical to 16S-rDNA database sequences derived from different anaerobic environments including digestors and bioreactors. Most of the bacterial 16S-rDNA sequences could be assigned to the phylum Firmicutes with the most abundant class Clostridia and to the class Bacteroidetes, whereas most archaeal 16S-rDNA sequences cluster close to the methanogen Methanoculleus bourgensis. Further sequences of the archaeal library most probably represent so far non-characterised species within the genus Methanoculleus. A similar result derived from phylogenetic analysis of mcrA clone sequences. The mcrA gene product encodes the alpha-subunit of methyl-coenzyme-M reductase involved in the final step of methanogenesis. BLASTn analysis applying stringent settings resulted in assignment of 16S-rDNA metagenome sequence reads to 62 16S-rDNA amplicon sequences thus enabling frequency of abundance estimations for 16S-rDNA clone library sequences. Ribosomal Database Project (RDP) Classifier processing of metagenome 16S-rDNA reads revealed abundance of the phyla Firmicutes, Bacteroidetes and Euryarchaeota and the orders Clostridiales, Bacteroidales and Methanomicrobiales. Moreover, a large fraction of 16S-rDNA metagenome reads could not be assigned to lower taxonomic ranks, demonstrating that numerous microorganisms in the analysed fermentation sample of the biogas plant are still unclassified or unknown.
Amino acid sequence of a trypsin inhibitor from a Spirometra (Spirometra erinaceieuropaei).

PubMed

Sanda, A; Uchida, A; Itagaki, T; Kobayashi, H; Inokuchi, N; Koyama, T; Iwama, M; Ohgi, K; Irie, M

2001-12-01

A trypsin inhibitor that is highly homologous with bovine pancreatic trypsin inhibitor (BPTI) was co-purified along with RNase from Spirometra (Spirometra erinaceieuropaei). The amino acid sequence of this inhibitor (SETI) and the nucleotide sequence of the cDNA encoding this protein were determined by protein chemistry and gene technology. SETI contains 68 amino acid residues and has a molecular mass of 7,798 Da. SETI has 31 amino acid residues that are identical with BPTI's sequence, including 6 half-cystine and 5 aromatic amino acid residues. The active site Lys residue in BPTI is replaced by an Arg residue in SETI. SETI is an effective inhibitor of trypsin and moderately inhibits a-chymotrypsin, but less inhibits elastase or subtilisin. SETI was expressed by E. coli containing a PelB vector carrying the SETI encoding cDNA; an expression yield of 0.68 mg/l was obtained. The phylogenetic relationship of SETI and the other BPTI-like trypsin inhibitors was analyzed using most likelihood inference methods.
Cloning and expression of a nuclear encoded plastid specific 33 kDa ribonucleoprotein gene (33RNP) from pea that is light stimulated.

PubMed

Reddy, M K; Nair, S; Singh, B N; Mudgil, Y; Tewari, K K; Sopory, S K

2001-01-24

We report the cloning and sequencing of both cDNA and genomic DNA of a 33 kDa chloroplast ribonucleoprotein (33RNP) from pea. The analysis of the predicted amino acid sequence of the cDNA clone revealed that the encoded protein contains two RNA binding domains, including the conserved consensus ribonucleoprotein sequences CS-RNP1 and CS-RNP2, on the C-terminus half and the presence of a putative transit peptide sequence in the N-terminus region. The phylogenetic and multiple sequence alignment analysis of pea chloroplast RNP along with RNPs reported from the other plant sources revealed that the pea 33RNP is very closely related to Nicotiana sylvestris 31RNP and 28RNP and also to 31RNP and 28RNP of Arabidopsis and spinach, respectively. The pea 33RNP was expressed in Escherichia coli and purified to homogeneity. The in vitro import of precursor protein into chloroplasts confirmed that the N-terminus putative transit peptide is a bona fide transit peptide and 33RNP is localized in the chloroplast. The nucleic acid-binding properties of the recombinant protein, as revealed by South-Western analysis, showed that 33RNP has higher binding affinity for poly (U) and oligo dT than for ssDNA and dsDNA. The steady state transcript level was higher in leaves than in roots and the expression of this gene is light stimulated. Sequence analysis of the genomic clone revealed that the gene contains four exons and three introns. We have also isolated and analyzed the 5' flanking region of the pea 33RNP gene.
Nucleotide sequencing and characterization of the genes encoding benzene oxidation enzymes of Pseudomonas putida.

PubMed Central

Irie, S; Doi, S; Yorifuji, T; Takagi, M; Yano, K

1987-01-01

The nucleotide sequence of the genes from Pseudomonas putida encoding oxidation of benzene to catechol was determined. Five open reading frames were found in the sequence. Four corresponding protein molecules were detected by a DNA-directed in vitro translation system. Escherichia coli cells containing the fragment with the four open reading frames transformed benzene to cis-benzene glycol, which is an intermediate of the oxidation of benzene to catechol. The relation between the product of each cistron and the components of the benzene oxidation enzyme system is discussed. Images PMID:3667527
Genome organization and DNA methylation patterns of B chromosomes in the red fox and Chinese raccoon dogs.

PubMed

Bugno-Poniewierska, Monika; Solek, Przemysław; Wronski, Mariusz; Potocki, Leszek; Jezewska-Witkowska, Grażyna; Wnuk, Maciej

2014-12-01

The molecular structure of B chromosomes (Bs) is relatively well studied. Previous research demonstrates that Bs of various species usually contain two types of repetitive DNA sequences, satellite DNA and ribosomal DNA, but Bs also contain genes encoding histone proteins and many others. However, many questions remain regarding the origin and function of these chromosomes. Here, we focused on the comparative cytogenetic characteristics of the red fox and Chinese raccoon dog B chromosomes with particular attention to the distribution of repetitive DNA sequences and their methylation status. We confirmed that the small Bs of the red fox show a typical fluorescent telomeric distal signal, whereas medium-sized Bs of the Chinese raccoon dog were characterized by clusters of telomeric sequences along their length. We also found different DNA methylation patterns for the B chromosomes of both species. Therefore, we concluded that DNA methylation may maintain the transcriptional inactivation of DNA sequences localized to B chromosomes and may prevent genetic unbalancing and several negative phenotypic effects. © 2014 The Authors.
Detection with synthetic oligonucleotide probes of nucleotide sequence variations in the genes encoding enterotoxins of Escherichia coli.

PubMed Central

Nishibuchi, M; Murakami, A; Arita, M; Jikuya, H; Takano, J; Honda, T; Miwatani, T

1989-01-01

We examined variations in the genes encoding heat-stable enterotoxin (ST) and heat-labile enterotoxin (LT) in 88 strains of Escherichia coli isolated from individuals with traveler's diarrhea to find suitable sequences for use as oligonucleotide probes. Four oligonucleotide probes of the gene encoding ST of human origin (STIb or STh), one oligonucleotide probe of the gene encoding ST of porcine origin (STIa or STp), and three oligonucleotide probes of the gene encoding LT of human origin (LTIh) were used in DNA colony hybridization tests. In 15 of 22 strains possessing the STh gene and 28 of 42 strains producing LT, the sequences of all regions tested were identical to the published sequences. One region in the STh gene examined with a 18-mer probe was relatively well conserved and was shown to be closely associated with the enterotoxicity of the E. coli strains in suckling mice. This oligonucleotide, however, hybridized with strains of Vibrio cholerae O1, V. parahaemolyticus, and Yersinia enterocolitica that gave negative results in the suckling mouse assay. PMID:2685027
GDP-L-fucose: {beta}-D-galactoside 2-{alpha}-Lfucosyltransferases, DNA sequences encoding the same, method for producing the same and a method of genotyping a person

DOEpatents

Lowe, J.B.; Lennon, G.; Rouquier, S.; Giorgi, D.; Kelly, R.J.

1998-09-15

The gene encoding GDP-L-fucose: {beta}-D-Galactoside 2-{alpha}-Lfucosyltransferase has been cloned, and a mutation in this gene has been found to be responsible for an individual being a non-secretor. 30 figs.
GDP-L-fucose: .beta.-D-galactoside 2-.alpha.-L-fucosyltransferases, DNA sequences encoding the same, method for producing the same and a method of genotyping a person

DOEpatents

Lowe, John B.; Lennon, Gregory; Rouquier, Sylvie; Giorgi, Dominique; Kelly, Robert J.

1998-01-01

The gene encoding GDP-L-fucose: .beta.-D-Galactoside 2-.alpha.-L-fucosyltransferase has been cloned, and a mutation in this gene has been found to be responsible for an individual being a non-secretor.
Molecular cloning of a cDNA encoding the precursor of adenoregulin from frog skin. Relationships with the vertebrate defensive peptides, dermaseptins.

PubMed

Amiche, M; Ducancel, F; Lajeunesse, E; Boulain, J C; Ménez, A; Nicolas, P

1993-03-31

Adenoregulin has recently been isolated from Phyllomedusa skin as a 33 amino acid residues peptide which enhanced binding of agonists to the A1 adenosine receptor. In order to study the structure of the precursor of adenoregulin we constructed a cDNA library from mRNAs extracted from the skin of Phyllomedusa bicolor. We detected the complete nucleotide sequence of a cDNA encoding the adenoregulin biosynthetic precursor. The deduced sequence of the precursor is 81 amino acids long, exhibits a putative signal sequence at the NH2 terminus and contains a single copy of the biologically active peptide at the COOH terminus. Structural and conformational homologies that are observed between adenoregulin and the dermaseptins, antimicrobial peptides exhibiting strong membranolytic activities against various pathogenic agents, suggest that adenoregulin is an additional member of the growing family of cytotropic antimicrobial peptides that allow vertebrate animals to defend themselves against microorganisms. As such, the adenosine receptor regulating activity of adenoregulin could be due to its ability to interact with and disrupt membranes lipid bilayers.
Cloning and expression of Bartonella henselae sucB gene encoding an immunogenic dihydrolipoamide succinyltransferase homologous protein.

PubMed

Kabeya, Hidenori; Maruyama, Soichi; Hirano, Kouji; Mikami, Takeshi

2003-01-01

Immunoscreening of a ZAP genomic library of Bartonella henselae strain Houston-1 expressed in Escherichia coli resulted in the isolation of a clone containing 3.5 kb BamHI genomic DNA fragment. This 3.5 kb DNA fragment was found to contain a sequence of a gene encoding a protein with significant homology to the dihydrolipoamide succinyltransferase of Brucella melitensis (sucB). Subsequent cloning and DNA sequence analysis revealed that the deduced amino acid sequence from the cloned gene showed 66.5% identity to SucB protein of B. melitensis, and 43.4 and 47.2% identities to those of Coxiella burnetii and E. coli, respectively. The gene was expressed as a His-Nus A-tagged fusion protein. The recombinant SucB protein (rSucB) was shown to be an immunoreactive protein of about 115 kDa by Western blot analysis with sera from B. henselae-immunized mice. Therefore the rSucB may be a candidate antigen for a specific serological diagnosis of B. henselae infection.
Saponin Biosynthesis in Saponaria vaccaria. cDNAs Encoding β-Amyrin Synthase and a Triterpene Carboxylic Acid Glucosyltransferase1[OA

PubMed Central

Meesapyodsuk, Dauenpen; Balsevich, John; Reed, Darwin W.; Covello, Patrick S.

2007-01-01

Saponaria vaccaria (Caryophyllaceae), a soapwort, known in western Canada as cowcockle, contains bioactive oleanane-type saponins similar to those found in soapbark tree (Quillaja saponaria; Rosaceae). To improve our understanding of the biosynthesis of these saponins, a combined polymerase chain reaction and expressed sequence tag approach was taken to identify the genes involved. A cDNA encoding a β-amyrin synthase (SvBS) was isolated by reverse transcription-polymerase chain reaction and characterized by expression in yeast (Saccharomyces cerevisiae). The SvBS gene is predominantly expressed in leaves. A S. vaccaria developing seed expressed sequence tag collection was developed and used for the isolation of a full-length cDNA bearing sequence similarity to ester-forming glycosyltransferases. The gene product of the cDNA, classified as UGT74M1, was expressed in Escherichia coli, purified, and identified as a triterpene carboxylic acid glucosyltransferase. UGT74M1 is expressed in roots and leaves and appears to be involved in monodesmoside biosynthesis in S. vaccaria. PMID:17172290
Characterization and expression of the calpastatin gene in Cyprinus carpio.

PubMed

Chen, W X; Ma, Y

2015-07-03

Calpastatin, an important protein used to regulate meat quality traits in animals, is encoded by the CAST gene. The aim of the present study was to clone the cDNA sequence of the CAST gene and detect the expression of CAST in the tissues of Cyprinus carpio. The cDNA of the C. carpio CAST gene, amplified using rapid amplification of cDNA ends PCR, is 2834 bp in length (accession No. JX275386), contains a 2634-bp open reading frame, and encodes a protein with 877 amino acid residues. The amino acid sequence of the C. carpio CAST gene was 88, 80, and 59% identical to the sequences observed in grass carp, zebrafish, and other fish, respectively. The C. carpio CAST was observed to contain four conserved domains with 54 serine phosphorylation loci, 28 threonine phosphorylation loci, 1 tyrosine phosphorylation loci, and 6 specific protein kinase C phosphorylation loci. The CAST gene showed widespread expression in different tissues of C. carpio. Surprisingly, the relative expression of the CAST transcript in the muscle and heart tissues of C. carpio was significantly higher than in other tissues (P < 0.01).
Expression of an Atriplex nummularia gene encoding a protein homologous to the bacterial molecular chaperone DnaJ.

PubMed Central

Zhu, J K; Shi, J; Bressan, R A; Hasegawa, P M

1993-01-01

DnaJ is a 36-kD heat shock protein that functions together with Dnak (Hsp70) as a molecular chaperone in Escherichia coli. We have obtained a cDNA clone from the higher plant Atriplex nummularia that encodes a 46.6-kD polypeptide (ANJ1) with an overall 35.2% amino acid sequence identity with the E. coli DnaJ. ANJ1 has 43.4% overall sequence identity with the Saccharomyces cerevisiae cytoplasmic DnaJ homolog YDJ1/MAS5. Complementation of the yeast mas5 mutation indicated that ANJ1 is a functional homolog of YDJ1/MAS5. The presence of other DnaJ homologs in A. nummularia was demonstrated by the detection of proteins that are antigenically related to the yeast mitochondrial DnaJ homolog SCJ1 and the yeast DnaJ-related protein Sec63. Expression of the ANJ1 gene was compared with that of an A. nummularia Hsp70 gene. Expression of both ANJ1 and Hsp70 transcripts was coordinately induced by heat shock. However, noncoordinate accumulation of ANJ1 and Hsp70 mRNAs occurred during the cell growth cycle and in response to NaCl stress. PMID:8467224
A novel gene, RSD-3/HSD-3.1, encodes a meiotic-related protein expressed in rat and human testis.

PubMed

Zhang, Xiaodong; Liu, Huixian; Zhang, Yan; Qiao, Yuan; Miao, Shiying; Wang, Linfang; Zhang, Jianchao; Zong, Shudong; Koide, S S

2003-06-01

The expression of stage-specific genes during spermatogenesis was determined by isolating two segments of rat seminiferous tubule at different stages of the germinal epithelium cycle delineated by transillumination-delineated microdissection, combined with differential display polymerase chain reaction to identify the differential transcripts formed. A total of 22 cDNAs were identified and accepted by GenBank as new expressed sequence tags. One of the expressed sequence tags was radiolabeled and used as a probe to screen a rat testis cDNA library. A novel full-length cDNA composed of 2228 bp, designated as RSD-3 (rat sperm DNA no.3, GenBank accession no. AF094609) was isolated and characterized. The reading frame encodes a polypeptide consisting of 526 amino acid residues, containing a number of DNA binding motifs and phosphorylation sites for PKC, CK-II, and p34cdc2. Northern blot of mRNA prepared from various tissues of adult rats showed that RSD-3 is expressed only in the testis. The initial expression of the RSD-3 gene was detected in the testis on the 30th postnatal day and attained adult level on the 60th postnatal day. Immunolocalization of RSD-3 in germ cells of rat testis showed that its expression is restricted to primary spermatocytes, undergoing meiosis division I. A human testis homologue of RSD-3 cDNA, designated as HSD-3.1 (GenBank accession no. AF144487) was isolated by screening the Human Testis Rapid-Screen arrayed cDNA library panels by RT-PCR. The exon-intron boundaries of HSD-3.1 gene were determined by aligning the cDNA sequence with the corresponding genome sequence. The cDNA consisted of 12 exons that span approximately 52.8 kb of the genome sequence and was mapped to chromosome 14q31.3.

Improved efficiency in amplification of Escherichia coli o-antigen gene clusters using genome-wide sequence comparison

USDA-ARS?s Scientific Manuscript database

Background: In many bacteria including E. coli, genes encoding O-antigens are clustered in the chromosome, with a 39-bp JUMPstart sequence and gnd gene located upstream and downstream of the cluster, respectively. For determining the DNA sequence of the E. coli O-antigen gene cluster, one set of P...
Genetic engineering of syringyl-enriched lignin in plants

DOEpatents

Chiang, Vincent Lee; Li, Laigeng

2004-11-02

The present invention relates to a novel DNA sequence, which encodes a previously unidentified lignin biosynthetic pathway enzyme, sinapyl alcohol dehydrogenase (SAD) that regulates the biosynthesis of syringyl lignin in plants. Also provided are methods for incorporating this novel SAD gene sequence or substantially similar sequences into a plant genome for genetic engineering of syringyl-enriched lignin in plants.
Sequence of the cDNA of a human dihydrodiol dehydrogenase isoform (AKR1C2) and tissue distribution of its mRNA.

PubMed Central

Shiraishi, H; Ishikura, S; Matsuura, K; Deyashiki, Y; Ninomiya, M; Sakai, S; Hara, A

1998-01-01

Human liver contains three isoforms (DD1, DD2 and DD4) of dihydrodiol dehydrogenase with 20alpha- or 3alpha-hydroxysteroid dehydrogenase activity; the dehydrogenases belong to the aldo-oxo reductase (AKR) superfamily. cDNA species encoding DD1 and DD4 have been identified. However, four cDNA species with more than 99% sequence identity have been cloned and are compatible with a partial amino acid sequence of DD2. In this study we have isolated a cDNA clone encoding DD2, which was confirmed by comparison of the properties of the recombinant and hepatic enzymes. This cDNA showed differences of one, two, four and five nucleotides from the previously reported four cDNA species for a dehydrogenase of human colon carcinoma HT29 cells, human prostatic 3alpha-hydroxysteroid dehydrogenase, a human liver 3alpha-hydroxysteroid dehydrogenase-like protein and chlordecone reductase-like protein respectively. Expression of mRNA species for the five similar cDNA species in 20 liver samples and 10 other different tissue samples was examined by reverse transcriptase-mediated PCR with specific primers followed by diagnostic restriction with endonucleases. All the tissues expressed only one mRNA species corresponding to the newly identified cDNA for DD2: mRNA transcripts corresponding to the other cDNA species were not detected. We suggest that the new cDNA is derived from the principal gene for DD2, which has been named AKR1C2 by a new nomenclature for the AKR superfamily. It is possible that some of the other cDNA species previously reported are rare allelic variants of this gene. PMID:9716498
Molecular cloning and characterization of alpha - galactosidase gene from Glaciozyma antarctica

NASA Astrophysics Data System (ADS)

Moheer, Reyad Qaed Al; Bakar, Farah Diba Abu; Murad, Abdul Munir Abdul

2015-09-01

Psychrophilic enzymes are proteins produced by psychrophilic organisms which recently are the limelight for industrial applications. A gene encoding α-galactosidase from a psychrophilic yeast, Glaciozyma antarctica PI12 which belongs to glycoside hydrolase family 27, was isolated and analyzed using several bioinformatic tools. The cDNA of the gene with the size of 1,404-bp encodes a protein with 467 amino acid residues. Predicted molecular weight of protein was 48.59 kDa and hence we name the gene encoding α-galactosidase as GAL48. We found that the predicted protein sequences possessed signal peptide sequence and are highly conserved among other fungal α-galactosidase.
Sequences within the 5' untranslated region regulate the levels of a kinetoplast DNA topoisomerase mRNA during the cell cycle.

PubMed Central

Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S

1996-01-01

Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA. PMID:8943327
Sequences within the 5' untranslated region regulate the levels of a kinetoplast DNA topoisomerase mRNA during the cell cycle.

PubMed

Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S

1996-12-01

Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA.
The delta-subunit of murine guanine nucleotide exchange factor eIF-2B. Characterization of cDNAs predicts isoforms differing at the amino-terminal end.

PubMed

Henderson, R A; Krissansen, G W; Yong, R Y; Leung, E; Watson, J D; Dholakia, J N

1994-12-02

Protein synthesis in mammalian cells is regulated at the level of the guanine nucleotide exchange factor, eIF-2B, which catalyzes the exchange of eukaryotic initiation factor 2-bound GDP for GTP. We have isolated and sequenced cDNA clones encoding the delta-subunit of murine eIF-2B. The cDNA sequence encodes a polypeptide of 544 amino acids with molecular mass of 60 kDa. Antibodies against a synthetic polypeptide of 30 amino acids deduced from the cDNA sequence specifically react with the delta-subunit of mammalian eIF-2B. The cDNA-derived amino acid sequence shows significant homology with the yeast translational regulator Gcd2, supporting the hypothesis that Gcd2 may be the yeast homolog of the delta-subunit of mammalian eIF-2B. Primer extension studies and anchor polymerase chain reaction analysis were performed to determine the 5'-end of the transcript for the delta-subunit of eIF-2B. Results of these experiments demonstrate two different mRNAs for the delta-subunit of eIF-2B in murine cells. The isolation and characterization of two different full-length cDNAs also predicts the presence of two alternate forms of the delta-subunit of eIF-2B in murine cells. These differ at their amino-terminal end but have identical nucleotide sequences coding for amino acids 31-544.
Heat-shock response in a molluscan cell line: characterization of the response and cloning of an inducible HSP70 cDNA.

PubMed

Laursen, J R; di Liu, H; Wu, X J; Yoshino, T P

1997-11-01

Sublethal heat-shock of cells of the Bge (Biomphalaria glabrata embryonic) snail cell line resulted in increased or new expression of metabolically labeled polypeptides of approximately 21.5, 41, 70, and 74 kDa molecular mass. Regulation of this response appeared to be at the transcriptional level since a similar protein banding pattern was seen upon SDS-PAGE/fluorographic analysis of polypeptides produced by in vitro translation of total RNA from cells subjected to heat shock. Using a yeast (Saccharomyces cerevisiae) 70-kDa heat-shock protein (HSP70) probe to screen a cDNA library from heat-treated Bge cells, we isolated a full-length cDNA clone encoding a putative Bge HSP70. The cDNA was 2453 bp in length and contained an open reading frame of 1908 bp encoding a 636-amino-acid polypeptide with calculated molecular mass of 70,740 Da. Comparison of a conserved region of 209 amino acid residues revealed > 80% identity between the deduced amino acid sequence of Bge HSP70 and that of yeast (81%), the human blood fluke Schistosoma mansoni (for which B. glabrata serves as intermediate host) (81%), Drosophila (81%), human (84%), and the marine gastropod Aplysia californica (88%, 90%). In addition to the extensive sharing of sequence homology, the identification of several eukaryotic HSP70 signature sequences and an N-linked glycosylation site characteristic of cytoplasmic HSPs strongly support the identity of the Bge cDNA as encoding an authentic HSP70. Results of a Northern blot analysis, using Bge HSP70 clone-specific probes, indicated that gene expression was heat inducible and not constitutively expressed. This is the first reported sequence of an inducible HSP70 from cells originating from a freshwater gastropod and provides a first step in the development of a genetic transformation system for molluscs of medical importance.
Identification and characterization of a DnaJ gene from red alga Pyropia yezoensis (Bangiales, Rhodophyta)

NASA Astrophysics Data System (ADS)

Liu, Jiao; Li, Xianchao; Tang, Xuexi; Zhou, Bin

2016-03-01

Members of the DnaJ family are proteins that play a pivotal role in various cellular processes, such as protein folding, protein transport and cellular responses to stress. In the present study, we identified and characterized the full-length DnaJ cDNA sequence from expressed sequence tags of Pyropia yezoensis ( PyDnaJ) via rapid identification of cDNA ends. This cDNA encoded a protein of 429 amino acids, which shared high sequence similarity with other identified DnaJ proteins, such as a heat shock protein 40/DnaJ from Pyropia haitanensis. The relative mRNA expression level of PyDnaJ was investigated using real-time PCR to determine its specific expression during the algal life cycle and during desiccation. The relative mRNA expression level in sporophytes was higher than that in gametophytes and significantly increased during the whole desiccation process. These results indicate that PyDnaJ is an authentic member of the DnaJ family in plants and red algae and might play a pivotal role in mitigating damage to P. yezoensis during desiccation.
Nucleotide sequence of the gene encoding the nitrogenase iron protein of Thiobacillus ferrooxidans

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pretorius, I.M.; Rawlings, D.E.; O'Neill, E.G.

1987-01-01

The DNA sequence was determined for the cloned Thiobacillus ferrooxidans nifH and part of the nifD genes. The DNA chains were radiolabeled with (..cap alpha..-/sup 32/P)dCTP (3000 Ci/mmol) or (..cap alpha..-/sup 35/S)dCTP (400 Ci/mmol). A putative T. ferrooxidans nifH promoter was identified whose sequences showed perfect consensus with those of the Klebsiella pneumoniae nif promoter. Two putative consensus upstream activator sequences were also identified. The amino acid sequence was deduced from the DNA sequence. In a comparison of nifH DNA sequences from T. ferrooxidans and eight other nitrogen-fixing microbes, a Rhizobium sp. isolated from Parasponia andersonii showed the greatest homologymore » (74%) and Clostridium pasteurianum (nifH1) showed the least homology (54%). In the comparison of the amino acid sequences of the Fe proteins, the Rhizobium sp. and Rhizobium japonicum showed the greatest homology (both 86%) and C. pasteurianum (nifH1 gene product) demonstrated the least homology (56%) to the T. ferrooxidans Fe protein.« less
DNA-Encoded Raman-Active Anisotropic Nanoparticles for microRNA Detection.

PubMed

Qi, Lin; Xiao, Mingshu; Wang, Xiwei; Wang, Cheng; Wang, Lihua; Song, Shiping; Qu, Xiangmeng; Li, Li; Shi, Jiye; Pei, Hao

2017-09-19

The development of highly sensitive and selective methods for the detection of microRNA (miRNA) has attracted tremendous attention because of its importance in fundamental biological studies and diagnostic applications. In this work, we develop DNA-encoded Raman-active anisotropic nanoparticles modified origami paper analytical devices (oPADs) for rapid, highly sensitive, and specific miRNA detection. The Raman-active anisotropic nanoparticles were prepared using 10-mer oligo-A, -T, -C, and -G to mediate the growth of Ag cubic seeds into Ag nanoparticles (AgNPs) with different morphologies. The resulting AgNPs were further encoded with DNA probes to serve as effective surface-enhanced Raman scattering (SERS) probes. The analytical device was then fabricated on a single piece of SERS probes loaded paper-based substrate and assembled based on the principles of origami. The addition of the target analyte amplifies the Raman signals on DNA-encoded AgNPs through a target-dependent, sequence specific DNA hybridization assembly. This simple and low-cost analytical device is generic and applicable to a variety of miRNAs, allowing detection sensitivity down to 1 pM and assay time within 15 min, and therefore holds promising applications in point-of-care diagnostics.
Complementary DNA cloning and molecular evolution of opine dehydrogenases in some marine invertebrates.

PubMed

Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K

2004-01-01

The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the 4 mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequence of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as for cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.
Alternative polyadenylation of the gene transcripts encoding a rat DNA polymerase beta.

PubMed

Konopiński, R; Nowak, R; Siedlecki, J A

1996-10-17

Rat cells produce two different transcripts of DNA polymerase beta (beta-Pol). The low-molecular-weight transcript (1.4 kb) was already sequenced. We report here the cloning and sequencing of the full-length cDNA, corresponding to the high-molecular-weight (HMW) transcript (4.0 kb) of beta-Pol. Sequence data strongly suggest that both transcripts are produced from a single gene by alternative polyadenylation. The HMW transcript contains the entire 1.4 kb transcript sequence and additional 2.2 kb on the 3' end. The 3' UTR of the HMW transcript contains some regulatory sequences which are not present in the 1.4-kb transcript. The A + U-rich fragment and (GU)21 sequence are believed to influence the stability of the mRNA. The functional significance of the A-rich region locally destabilizing double-stranded secondary structure remains unknown.
Evolution of EF-hand calcium-modulated proteins. III. Exon sequences confirm most dendrograms based on protein sequences: calmodulin dendrograms show significant lack of parallelism

NASA Technical Reports Server (NTRS)

Nakayama, S.; Kretsinger, R. H.

1993-01-01

In the first report in this series we presented dendrograms based on 152 individual proteins of the EF-hand family. In the second we used sequences from 228 proteins, containing 835 domains, and showed that eight of the 29 subfamilies are congruent and that the EF-hand domains of the remaining 21 subfamilies have diverse evolutionary histories. In this study we have computed dendrograms within and among the EF-hand subfamilies using the encoding DNA sequences. In most instances the dendrograms based on protein and on DNA sequences are very similar. Significant differences between protein and DNA trees for calmodulin remain unexplained. In our fourth report we evaluate the sequences and the distribution of introns within the EF-hand family and conclude that exon shuffling did not play a significant role in its evolution.
A modular DNA signal translator for the controlled release of a protein by an aptamer.

PubMed

Beyer, Stefan; Simmel, Friedrich C

2006-01-01

Owing to the intimate linkage of sequence and structure in nucleic acids, DNA is an extremely attractive molecule for the development of molecular devices, in particular when a combination of information processing and chemomechanical tasks is desired. Many of the previously demonstrated devices are driven by hybridization between DNA 'effector' strands and specific recognition sequences on the device. For applications it is of great interest to link several of such molecular devices together within artificial reaction cascades. Often it will not be possible to choose DNA sequences freely, e.g. when functional nucleic acids such as aptamers are used. In such cases translation of an arbitrary 'input' sequence into a desired effector sequence may be required. Here we demonstrate a molecular 'translator' for information encoded in DNA and show how it can be used to control the release of a protein by an aptamer using an arbitrarily chosen DNA input strand. The function of the translator is based on branch migration and the action of the endonuclease FokI. The modular design of the translator facilitates the adaptation of the device to various input or output sequences.
A modular DNA signal translator for the controlled release of a protein by an aptamer

PubMed Central

Beyer, Stefan; Simmel, Friedrich C.

2006-01-01

Owing to the intimate linkage of sequence and structure in nucleic acids, DNA is an extremely attractive molecule for the development of molecular devices, in particular when a combination of information processing and chemomechanical tasks is desired. Many of the previously demonstrated devices are driven by hybridization between DNA ‘effector’ strands and specific recognition sequences on the device. For applications it is of great interest to link several of such molecular devices together within artificial reaction cascades. Often it will not be possible to choose DNA sequences freely, e.g. when functional nucleic acids such as aptamers are used. In such cases translation of an arbitrary ‘input’ sequence into a desired effector sequence may be required. Here we demonstrate a molecular ‘translator’ for information encoded in DNA and show how it can be used to control the release of a protein by an aptamer using an arbitrarily chosen DNA input strand. The function of the translator is based on branch migration and the action of the endonuclease FokI. The modular design of the translator facilitates the adaptation of the device to various input or output sequences. PMID:16547201
The Arabidopsis At1g30680 gene encodes a homologue to the phage T7 gp4 protein that has both DNA primase and DNA helicase activities.

PubMed

Diray-Arce, Joann; Liu, Bin; Cupp, John D; Hunt, Travis; Nielsen, Brent L

2013-03-04

The Arabidopsis thaliana genome encodes a homologue of the full-length bacteriophage T7 gp4 protein, which is also homologous to the eukaryotic Twinkle protein. While the phage protein has both DNA primase and DNA helicase activities, in animal cells Twinkle is localized to mitochondria and has only DNA helicase activity due to sequence changes in the DNA primase domain. However, Arabidopsis and other plant Twinkle homologues retain sequence homology for both functional domains of the phage protein. The Arabidopsis Twinkle homologue has been shown by others to be dual targeted to mitochondria and chloroplasts. To determine the functional activity of the Arabidopsis protein we obtained the gene for the full-length Arabidopsis protein and expressed it in bacteria. The purified protein was shown to have both DNA primase and DNA helicase activities. Western blot and qRT-PCR analysis indicated that the Arabidopsis gene is expressed most abundantly in young leaves and shoot apex tissue, as expected if this protein plays a role in organelle DNA replication. This expression is closely correlated with the expression of organelle-localized DNA polymerase in the same tissues. Homologues from other plant species show close similarity by phylogenetic analysis. The results presented here indicate that the Arabidopsis phage T7 gp4/Twinkle homologue has both DNA primase and DNA helicase activities and may provide these functions for organelle DNA replication.
Cloning and characterization of transferrin cDNA and rapid detection of transferrin gene polymorphism in rainbow trout (Oncorhynchus mykiss).

PubMed

Tange, N; Jong-Young, L; Mikawa, N; Hirono, I; Aoki, T

1997-12-01

A cDNA clone of rainbow trout (Oncorhynchus mykiss) transferrin was obtained from a liver cDNA library. The 2537-bp cDNA sequence contained an open reading frame encoding 691 amino acids and the 5' and 3' noncoding regions. The amino acid sequences at the iron-binding sites and the two N-linked glycosylation sites, and the cysteine residues were consistent with known, conserved vertebrate transferrin cDNA sequences. Single N-linked glycosylation sites existed on the N- and C-lobe. The deduced amino acid sequence of the rainbow trout transferrin cDNA had 92.9% identities with transferrin of coho salmon (Oncorhynchus kisutch); 85%, Atlantic salmon (Salmo salar); 67.3%, medaka (Oryzias latipes); 61.3% Atlantic cod (Gadus morhua); and 59.7%, Japanese flounder (Paralichthys olivaceus). The long and accurate polymerase chain reaction (LA-PCR) was used to amplify approximately 6.5 kb of the transferrin gene from rainbow trout genomic DNA. Restriction fragment length polymorphisms (RFLPs) of the LA-PCR products revealed three digestion patterns in 22 samples.
Ostertagia circumcincta: isolation of a partial cDNA encoding an unusual member of the mitochondrial processing peptidase subfamily of M16 metallopeptidases.

PubMed

Walker, J; Tait, A

1997-11-01

A reverse-transcriptase polymerase chain reaction (PCR) procedure was used to isolate an Ostertagia circumcincta partial cDNA encoding a protein with general primary sequence features characteristic of members of the mitochondrial processing peptidase (MPP) subfamily of M16 metallopeptidases. The structural relationships of the predicted protein (Oc MPPX) with MPP subfamily proteins from other species (including the model free-living nematode Caenorhabditis elegans) were examined, and Northern analysis confirmed the expression of the Oc mppx gene in adult nematodes.
Glutathione-S-conjugate transport in plants

DOEpatents

Rea, Philip A.; Lu, Yu-Ping; Li, Ze-Sheng

2000-01-01

The invention includes an isolated DNA encoding a plant GS-X pump polypeptide and an isolated preparation of a plant GS-X pump polypeptide. Also included is an isolated preparation of a nucleic acid which is antisense in orientation to a portion or all of a plant GS-X pump gene. The invention also includes a cells, vectors and transgenic plants having an isolated DNA encoding a plant GS-X pump and methods of use thereof. In addition, the invention relates to plant GS-X pump promoter sequences and the uses thereof.

LAMP-1-chimeric DNA vaccines enhance the antibody response in Japanese flounder, Paralichthys olivaceus.

PubMed

Rondón-Barragán, Iang; Nozaki, Reiko; Hirono, Ikuo; Kondo, Hidehiro

2017-08-01

DNA vaccination is one method to protect farmed fish from viral and bacterial diseases. Chimeric antigens encoded by DNA vaccines have been shown to increase the resistance to viral diseases. Here, we sequenced the gene encoding lysosome-associated membrane protein-1 from Japanese flounder, Paralichthys olivaceus, (JfLAMP-1) and assessed its use in a chimeric DNA vaccine fused with the major capsule protein (MCP) from red seabream iridovirus (RSIV). JfLAMP-1 cDNA has a length of 1248 bp encoding 415 aa, which contains transmembrane and cytoplasmic domains. JfLAMP-1 is constitutively expressed in several tissues and its expression in spleen was upregulated following injection of formalin-killed cells (FKC) of Edwardsiella tarda. Immunofluorescence analysis showed that JfLAMP-1 is distributed in the small and large granules in the cytoplasm and groups close to the nucleus. The DNA encoding the luminal domain of JfLAMP-1 was replaced with the gene for the RSIV MCP, and the construct was cloned in an expression vector (pCIneo). Fish vaccinated with pCLAMP-MCP had significantly higher antibody levels than fish vaccinated with pCIneo vector harboring the MCP gene (p < 0.05) at day 30 post-vaccination. Copyright © 2017 Elsevier Ltd. All rights reserved.
Identification of antigenic regions on VP2 of African horsesickness virus serotype 3 by using phage-displayed epitope libraries.

PubMed

Bentley, L; Fehrsen, J; Jordaan, F; Huismans, H; du Plessis, D H

2000-04-01

VP2 is an outer capsid protein of African horsesickness virus (AHSV) and is recognized by serotype-discriminatory neutralizing antibodies. With the objective of locating its antigenic regions, a filamentous phage library was constructed that displayed peptides derived from the fragmentation of a cDNA copy of the gene encoding VP2. Peptides ranging in size from approximately 30 to 100 amino acids were fused with pIII, the attachment protein of the display vector, fUSE2. To ensure maximum diversity, the final library consisted of three sub-libraries. The first utilized enzymatically fragmented DNA encoding only the VP2 gene, the second included plasmid sequences, while the third included a PCR step designed to allow different peptide-encoding sequences to recombine before ligation into the vector. The resulting composite library was subjected to immunoaffinity selection with AHSV-specific polyclonal chicken IgY, polyclonal horse immunoglobulins and a monoclonal antibody (MAb) known to neutralize AHSV. Antigenic peptides were located by sequencing the DNA of phages bound by the antibodies. Most antigenic determinants capable of being mapped by this method were located in the N-terminal half of VP2. Important binding areas were mapped with high resolution by identifying the minimum overlapping areas of the selected peptides. The MAb was also used to screen a random 17-mer epitope library. Sequences that may be part of a discontinuous neutralization epitope were identified. The amino acid sequences of the antigenic regions on VP2 of serotype 3 were compared with corresponding regions on three other serotypes, revealing regions with the potential to discriminate AHSV serotypes serologically.
Characterization of a digestive carboxypeptidase from the insect pest corn earworm (Helicoverpa armigera) with novel specificity towards C-terminal glutamate residues.

PubMed

Bown, David P; Gatehouse, John A

2004-05-01

Carboxypeptidases were purified from guts of larvae of corn earworm (Helicoverpa armigera), a lepidopteran crop pest, by affinity chromatography on immobilized potato carboxypeptidase inhibitor, and characterized by N-terminal sequencing. A larval gut cDNA library was screened using probes based on these protein sequences. cDNA HaCA42 encoded a carboxypeptidase with sequence similarity to enzymes of clan MC [Barrett, A. J., Rawlings, N. D. & Woessner, J. F. (1998) Handbook of Proteolytic Enzymes. Academic Press, London.], but with a novel predicted specificity towards C-terminal acidic residues. This carboxypeptidase was expressed as a recombinant proprotein in the yeast Pichia pastoris. The expressed protein could be activated by treatment with bovine trypsin; degradation of bound pro-region, rather than cleavage of pro-region from mature protein, was the rate-limiting step in activation. Activated HaCA42 carboxypeptidase hydrolysed a synthetic substrate for glutamate carboxypeptidases (FAEE, C-terminal Glu), but did not hydrolyse substrates for carboxypeptidase A or B (FAPP or FAAK, C-terminal Phe or Lys) or methotrexate, cleaved by clan MH glutamate carboxypeptidases. The enzyme was highly specific for C-terminal glutamate in peptide substrates, with slow hydrolysis of C-terminal aspartate also observed. Glutamate carboxypeptidase activity was present in larval gut extract from H. armigera. The HaCA42 protein is the first glutamate-specific metallocarboxypeptidase from clan MC to be identified and characterized. The genome of Drosophila melanogaster contains genes encoding enzymes with similar sequences and predicted specificity, and a cDNA encoding a similar enzyme has been isolated from gut tissue in tsetse fly. We suggest that digestive carboxypeptidases with sequence similarity to the classical mammalian enzymes, but with specificity towards C-terminal glutamate, are widely distributed in insects.
Cloning and bioinformatic analysis of lovastatin biosynthesis regulatory gene lovE.

PubMed

Huang, Xin; Li, Hao-ming

2009-08-05

Lovastatin is an effective drug for treatment of hyperlipidemia. This study aimed to clone lovastatin biosynthesis regulatory gene lovE and analyze the structure and function of its encoding protein. According to the lovastatin synthase gene sequence from genebank, primers were designed to amplify and clone the lovastatin biosynthesis regulatory gene lovE from Aspergillus terrus genomic DNA. Bioinformatic analysis of lovE and its encoding animo acid sequence was performed through internet resources and software like DNAMAN. Target fragment lovE, almost 1500 bp in length, was amplified from Aspergillus terrus genomic DNA and the secondary and three-dimensional structures of LovE protein were predicted. In the lovastatin biosynthesis process lovE is a regulatory gene and LovE protein is a GAL4-like transcriptional factor.
Differential mitochondrial DNA and gene expression in inherited retinal dysplasia in miniature Schnauzer dogs.

PubMed

Appleyard, Greg D; Forsyth, George W; Kiehlbauch, Laura M; Sigfrid, Kristen N; Hanik, Heather L J; Quon, Anita; Loewen, Matthew E; Grahn, Bruce H

2006-05-01

To investigate the molecular basis of inherited retinal dysplasia in miniature Schnauzers. Retina and retinal pigment epithelial tissues were collected from canine subjects at the age of 3 weeks. Total RNA isolated from these tissues was reverse transcribed to make representative cDNA pools that were compared for differences in gene expression by using a subtractive hybridization technique referred to as representational difference analysis (RDA). Expression differences identified by RDA were confirmed and quantified by real-time reverse-transcription PCR. Mitochondrial morphology from leukocytes and skeletal muscle of normal and affected miniature Schnauzers was examined by transmission electron microscopy. RDA screening of retinal pigment epithelial cDNA identified differences in mRNA transcript coding for two mitochondrial (mt) proteins--cytochrome oxidase subunit 1 and NADH dehydrogenase subunit 6--in affected dogs. Contrary to expectations, these identified sequences did not contain mutations. Based on the implication of mt-DNA-encoded proteins by the RDA experiments we used real-time PCR to compare the relative amounts of mt-DNA template in white blood cells from normal and affected dogs. White blood cells of affected dogs contained less than 30% of the normal amount of two specific mtDNA sequences, compared with the content of the nuclear-encoded glyceraldehyde-3-phosphate dehydrogenase (GA-3-PDH) reference gene. Retina and RPE tissue from affected dogs had reduced mRNA transcript levels for the two mitochondrial genes detected in the RDA experiment. Transcript levels for another mtDNA-encoded gene as well as the nuclear-encoded mitochondrial Tfam transcription factor were reduced in these tissues in affected dogs. Mitochondria from affected dogs were reduced in number and size and were unusually electron dense. Reduced levels of nuclear and mitochondrial transcripts in the retina and RPE of miniature Schnauzers affected with retinal dysplasia suggest that the pathogenesis of the disorder may arise from a lowered energy supply to the retina and RPE.
Previously unknown and highly divergent ssDNA viruses populate the oceans.

PubMed

Labonté, Jessica M; Suttle, Curtis A

2013-11-01

Single-stranded DNA (ssDNA) viruses are economically important pathogens of plants and animals, and are widespread in oceans; yet, the diversity and evolutionary relationships among marine ssDNA viruses remain largely unknown. Here we present the results from a metagenomic study of composite samples from temperate (Saanich Inlet, 11 samples; Strait of Georgia, 85 samples) and subtropical (46 samples, Gulf of Mexico) seawater. Most sequences (84%) had no evident similarity to sequenced viruses. In total, 608 putative complete genomes of ssDNA viruses were assembled, almost doubling the number of ssDNA viral genomes in databases. These comprised 129 genetically distinct groups, each represented by at least one complete genome that had no recognizable similarity to each other or to other virus sequences. Given that the seven recognized families of ssDNA viruses have considerable sequence homology within them, this suggests that many of these genetic groups may represent new viral families. Moreover, nearly 70% of the sequences were similar to one of these genomes, indicating that most of the sequences could be assigned to a genetically distinct group. Most sequences fell within 11 well-defined gene groups, each sharing a common gene. Some of these encoded putative replication and coat proteins that had similarity to sequences from viruses infecting eukaryotes, suggesting that these were likely from viruses infecting eukaryotic phytoplankton and zooplankton.
Stachyose synthesis in seeds of adzuki bean (Vigna angularis): molecular cloning and functional expression of stachyose synthase.

PubMed

Peterbauer, T; Mucha, J; Mayer, U; Popp, M; Glössl, J; Richter, A

1999-12-01

Stachyose is the major soluble carbohydrate in seeds of a number of important crop species. It is synthesized from raffinose and galactinol by the action of stachyose synthase (EC 2.4.1.67). We report here on the identification of a cDNA encoding stachyose synthase from seeds of adzuki bean (Vigna angularis Ohwi et Ohashi). Based on internal amino acid sequences of the enzyme purified from adzuki bean, oligonucleotides were designed and used to amplify corresponding sequences from adzuki bean cDNA by RT-PCR, followed by rapid amplification of cDNA ends (RACE-PCR). The complete cDNA sequence comprised 3046 nucleotides and included an open reading frame which encoded a polypeptide of 857 amino acid residues. The entire coding region was amplified by PCR, engineered into the baculovirus expression vector pVL1393 and introduced into Spodoptera frugiperda (Sf21) insect cells for heterologous expression. The recombinant protein was immunologically reactive with polyclonal antibodies raised against stachyose synthase purified from adzuki bean and was shown to be a functional stachyose synthase with the same catalytic properties as its native counterpart. High levels of stachyose synthase mRNA were transiently accumulated midway through seed development, and the enzyme was also present in mature seeds and during germination.
Design and construction of a double inversion recombination switch for heritable sequential genetic memory.

PubMed

Ham, Timothy S; Lee, Sung K; Keasling, Jay D; Arkin, Adam P

2008-07-30

Inversion recombination elements present unique opportunities for computing and information encoding in biological systems. They provide distinct binary states that are encoded into the DNA sequence itself, allowing us to overcome limitations posed by other biological memory or logic gate systems. Further, it is in theory possible to create complex sequential logics by careful positioning of recombinase recognition sites in the sequence. In this work, we describe the design and synthesis of an inversion switch using the fim and hin inversion recombination systems to create a heritable sequential memory switch. We have integrated the two inversion systems in an overlapping manner, creating a switch that can have multiple states. The switch is capable of transitioning from state to state in a manner analogous to a finite state machine, while encoding the state information into DNA. This switch does not require protein expression to maintain its state, and "remembers" its state even upon cell death. We were able to demonstrate transition into three out of the five possible states showing the feasibility of such a switch. We demonstrate that a heritable memory system that encodes its state into DNA is possible, and that inversion recombination system could be a starting point for more complex memory circuits. Although the circuit did not fully behave as expected, we showed that a multi-state, temporal memory is achievable.
Design and Construction of a Double Inversion Recombination Switch for Heritable Sequential Genetic Memory

PubMed Central

Ham, Timothy S.; Lee, Sung K.; Keasling, Jay D.; Arkin, Adam P.

2008-01-01

Background Inversion recombination elements present unique opportunities for computing and information encoding in biological systems. They provide distinct binary states that are encoded into the DNA sequence itself, allowing us to overcome limitations posed by other biological memory or logic gate systems. Further, it is in theory possible to create complex sequential logics by careful positioning of recombinase recognition sites in the sequence. Methodology/Principal Findings In this work, we describe the design and synthesis of an inversion switch using the fim and hin inversion recombination systems to create a heritable sequential memory switch. We have integrated the two inversion systems in an overlapping manner, creating a switch that can have multiple states. The switch is capable of transitioning from state to state in a manner analogous to a finite state machine, while encoding the state information into DNA. This switch does not require protein expression to maintain its state, and “remembers” its state even upon cell death. We were able to demonstrate transition into three out of the five possible states showing the feasibility of such a switch. Conclusions/Significance We demonstrate that a heritable memory system that encodes its state into DNA is possible, and that inversion recombination system could be a starting point for more complex memory circuits. Although the circuit did not fully behave as expected, we showed that a multi-state, temporal memory is achievable. PMID:18665232
The gene for stinging nettle lectin (Urtica dioica agglutinin) encodes both a lectin and a chitinase.

PubMed

Lerner, D R; Raikhel, N V

1992-06-05

Chitin-binding proteins are present in a wide range of plant species, including both monocots and dicots, even though these plants contain no chitin. To investigate the relationship between in vitro antifungal and insecticidal activities of chitin-binding proteins and their unknown endogenous functions, the stinging nettle lectin (Urtica dioica agglutinin, UDA) cDNA was cloned using a synthetic gene as the probe. The nettle lectin cDNA clone contained an open reading frame encoding 374 amino acids. Analysis of the deduced amino acid sequence revealed a 21-amino acid putative signal sequence and the 86 amino acids encoding the two chitin-binding domains of nettle lectin. These domains were fused to a 19-amino acid "spacer" domain and a 244-amino acid carboxyl extension with partial identity to a chitinase catalytic domain. The authenticity of the cDNA clone was confirmed by deduced amino acid sequence identity with sequence data obtained from tryptic digests, RNA gel blot, and polymerase chain reaction analyses. RNA gel blot analysis also showed the nettle lectin message was present primarily in rhizomes and inflorescence (with immature seeds) but not in leaves or stems. Chitinase enzymatic activity was found when the chitinase-like domain alone or the chitinase-like domain with the chitin-binding domains were expressed in Escherichia coli. This is the first example of a chitin-binding protein with both a duplication of the 43-amino acid chitin-binding domain and a fusion of the chitin-binding domains to a structurally unrelated domain, the chitinase domain.
SEQUENCE SIMILARITIES IN THE GENES ENCODING POLY- CHLORINATED BIPHENYL DEGRADATION BY PSEUDOMONAS STRAIN LB400 AND ALCALIGENES EUTROPHUS H850

EPA Science Inventory

DNA-DNA hybridization was used to compare the Pseudomonas strain LB400 genes for polychlorinated biphenyl (PCB) degradation with those from seven other PCB-degrading strains. Significant hybridization was detected to the genome of Alcaligenes eutrophus H850, a strain similar to L...
Run-length encoding graphic rules, biochemically editable designs and steganographical numeric data embedment for DNA-based cryptographical coding system.

PubMed

Kawano, Tomonori

2013-03-01

There have been a wide variety of approaches for handling the pieces of DNA as the "unplugged" tools for digital information storage and processing, including a series of studies applied to the security-related area, such as DNA-based digital barcodes, water marks and cryptography. In the present article, novel designs of artificial genes as the media for storing the digitally compressed data for images are proposed for bio-computing purpose while natural genes principally encode for proteins. Furthermore, the proposed system allows cryptographical application of DNA through biochemically editable designs with capacity for steganographical numeric data embedment. As a model case of image-coding DNA technique application, numerically and biochemically combined protocols are employed for ciphering the given "passwords" and/or secret numbers using DNA sequences. The "passwords" of interest were decomposed into single letters and translated into the font image coded on the separate DNA chains with both the coding regions in which the images are encoded based on the novel run-length encoding rule, and the non-coding regions designed for biochemical editing and the remodeling processes revealing the hidden orientation of letters composing the original "passwords." The latter processes require the molecular biological tools for digestion and ligation of the fragmented DNA molecules targeting at the polymerase chain reaction-engineered termini of the chains. Lastly, additional protocols for steganographical overwriting of the numeric data of interests over the image-coding DNA are also discussed.
Efficacy of vaccination with plasmid DNA encoding for HER2/neu or HER2/neu-eGFP fusion protein against prostate cancer in rats.

PubMed

Bhattachary, R; Bukkapatnam, R; Prawoko, I; Soto, J; Morgan, M; Salup, R R

2002-05-01

Despite early diagnosis and improved therapy, 31,500 men will die from prostate cancer (PC) this year. The HER2/neu oncoprotein is an important effector of cell growth found in the majority of high-grade prostatic tumors and is capable of rendering immunogenicity. The antigenicity of this oncoprotein might prove useful in the development of PC vaccines. Our goal is to prove the principle that a single DNA vaccine can provide reliable immunity against PC in the MatLyLu (MLL) translational tumor model. The parental rat MatLyLu PC cell line expresses low to moderate levels of the rat neu protein. To simulate in vivo human PC, MatLyLu cells were transfected with a truncated sequence of human HER2/neu cDNA cloned into the pCI-neo vector. This HER2/neu cDNA sequence encodes the first 433 amino acids of the extracellular domain (ECD). MatLyLu cells were also transfected with the same HER2/neu cDNA sequence cloned into the N1-terminal sequence of EGFP reporter gene to produce a fusion protein. The partial ECD sequence of HER2/neu includes five rat major histocompatibility (MHC)-II-restricted peptides with complete human-to-rat cross-species homology. The HER2/neu protein overexpression was documented by Western Blot analysis, and the expression of fusion protein was monitored by confocal microscopy and fluorimetry. Vaccination with a single injection of HER2/neu cDNA protected 50% of animals against HER2/neu-MatLyLu tumors (P < 0.01). When the tumor cells were engineered to express HER2/neu-EGFP fusion protein, the antitumor immunity was enhanced, as following vaccination with HER2/neu-EGFP cDNA, 80% of these rats rejected HER2/neu-EGFP-MatLyLu (P<0.001). Both vaccines induced HER2/neu-specific antibody titers. Rats vaccinated with EGFP-cDNA rejected 80% of EGFP-MatLyLu tumors and, interestingly, 40% of HER2/neu-MatLyLu tumors. None of the cDNA vaccines induced immunity against parental MatLyLu cells. Our data clearly demonstrate that a single injection of HER2/neu-EGFP cDNA is a very effective vaccine against PC tumors expressing the cognate tumor-associated antigen (TA). The antitumor immunity is significantly more pronounced if the tumors express xenogeneic HER2/neu-EGFP fusion protein as opposed to only the syngeneic HER2/neu oncoprotein. Our data suggests that the HER2/neu-EGFP-MatLyLu tumor is a potential animal tumor model for investigating therapeutic vaccine strategies against PC in vivo and demonstrates the limitations of a cDNA vaccine only encoding for MHC-II-restricted HER2/neu-ECD sequence peptides.
Identification and Characterization of a Cis-Encoded Antisense RNA Associated with the Replication Process of Salmonella enterica Serovar Typhi

PubMed Central

Dadzie, Isaac; Xu, Shungao; Ni, Bin; Zhang, Xiaolei; Zhang, Haifang; Sheng, Xiumei; Xu, Huaxi; Huang, Xinxiang

2013-01-01

Antisense RNAs that originate from the complementary strand of protein coding genes are involved in the regulation of gene expression in all domains of life. In bacteria, some of these antisense RNAs are transcriptional noise whiles others play a vital role to adapt the cell to changing environmental conditions. By deep sequencing analysis of transcriptome of Salmonella enterica serovar Typhi, a partial RNA sequence encoded in-cis to the dnaA gene was revealed. Northern blot and RACE analysis confirmed the transcription of this antisense RNA which was expressed mostly in the stationary phase of the bacterial growth and also under iron limitation and osmotic stress. Pulse expression analysis showed that overexpression of the antisense RNA resulted in a significant increase in the mRNA levels of dnaA, which will ultimately enhance their translation. Our findings have revealed that antisense RNA of dnaA is indeed transcribed not merely as a by-product of the cell's transcription machinery but plays a vital role as far as stability of dnaA mRNA is concerned. PMID:23637809
Sequences of heavy and light chain variable regions from four bovine immunoglobulins.

PubMed

Armour, K L; Tempest, P R; Fawcett, P H; Fernie, M L; King, S I; White, P; Taylor, G; Harris, W J

1994-12-01

Oligodeoxyribonucleotide primers based on the 5' ends of bovine IgG1/2 and lambda constant (C) region genes, together with primers encoding conserved amino acids at the N-terminus of mature variable (V) regions from other species, have been used in cDNA and polymerase chain reactions (PCRs) to amplify heavy and light chain V region cDNA from bovine heterohybridomas. The amino acid sequences of VH and V lambda from four bovine immunoglobulins of different specificities are presented.
Comparative sequence analysis revealed altered chromosomal organization and a novel insertion sequence encoding DNA modification and potentially stress-related functions in an Escherichia coli O157:H7 foodborne isolate

USDA-ARS?s Scientific Manuscript database

We recently described the complete genome of enterohemorrhagic Escherichia coli (EHEC) O157:H7 strain NADC 6564, an isolate of strain 86-24 linked to the 1986 disease outbreak. In the current study, we compared the chromosomal sequence of NADC 6564 to the well-characterized chromosomal sequences of ...
Isolation of Nicotiana plumbaginifolia cDNAs encoding isoforms of serine acetyltransferase and O-acetylserine (thiol) lyase in a yeast two-hybrid system with Escherichia coli cysE and cysK genes as baits.

PubMed

Liszewska, Frantz; Gaganidze, Dali; Sirko, Agnieszka

2005-01-01

We applied the yeast two-hybrid system for screening of a cDNA library of Nicotiana plumbaginifolia for clones encoding plant proteins interacting with two proteins of Escherichia coli: serine acetyltransferase (SAT, the product of cysE gene) and O-acetylserine (thiol)lyase A, also termed cysteine synthase (OASTL-A, the product of cysK gene). Two plant cDNA clones were identified when using the cysE gene as a bait. These clones encode a probable cytosolic isoform of OASTL and an organellar isoform of SAT, respectively, as indicated by evolutionary trees. The second clone, encoding SAT, was identified independently also as a "prey" when using cysK as a bait. Our results reveal the possibility of applying the two-hybrid system for cloning of plant cDNAs encoding enzymes of the cysteine synthase complex in the two-hybrid system. Additionally, using genome walking sequences located upstream of the sat1 cDNA were identified. Subsequently, in silico analyses were performed aiming towards identification of the potential signal peptide and possible location of the deduced mature protein encoded by sat1.
Portable and Error-Free DNA-Based Data Storage.

PubMed

Yazdi, S M Hossein Tabatabaei; Gabrys, Ryan; Milenkovic, Olgica

2017-07-10

DNA-based data storage is an emerging nonvolatile memory technology of potentially unprecedented density, durability, and replication efficiency. The basic system implementation steps include synthesizing DNA strings that contain user information and subsequently retrieving them via high-throughput sequencing technologies. Existing architectures enable reading and writing but do not offer random-access and error-free data recovery from low-cost, portable devices, which is crucial for making the storage technology competitive with classical recorders. Here we show for the first time that a portable, random-access platform may be implemented in practice using nanopore sequencers. The novelty of our approach is to design an integrated processing pipeline that encodes data to avoid costly synthesis and sequencing errors, enables random access through addressing, and leverages efficient portable sequencing via new iterative alignment and deletion error-correcting codes. Our work represents the only known random access DNA-based data storage system that uses error-prone nanopore sequencers, while still producing error-free readouts with the highest reported information rate/density. As such, it represents a crucial step towards practical employment of DNA molecules as storage media.
Cloning, expression and activity analysis of a novel fibrinolytic serine protease from Arenicola cristata

NASA Astrophysics Data System (ADS)

Zhao, Chunling; Ju, Jiyu

2015-06-01

The full-length cDNA of a protease gene from a marine annelid Arenicola cristata was amplified through rapid amplification of cDNA ends technique and sequenced. The size of the cDNA was 936 bp in length, including an open reading frame encoding a polypeptide of 270 amino acid residues. The deduced amino acid sequnce consisted of pro- and mature sequences. The protease belonged to the serine protease family because it contained the highly conserved sequence GDSGGP. This protease was novel as it showed a low amino acid sequence similarity (< 40%) to other serine proteases. The gene encoding the active form of A. cristata serine protease was cloned and expressed in E. coli. Purified recombinant protease in a supernatant could dissolve an artificial fibrin plate with plasminogen-rich fibrin, whereas the plasminogen-free fibrin showed no clear zone caused by hydrolysis. This result suggested that the recombinant protease showed an indirect fibrinolytic activity of dissolving fibrin, and was probably a plasminogen activator. A rat model with venous thrombosis was established to demonstrate that the recombinant protease could also hydrolyze blood clot in vivo. Therefore, this recombinant protease may be used as a thrombolytic agent for thrombosis treatment. To our knowledge, this study is the first of reporting the fibrinolytic serine protease gene in A. cristata.
Ecological niche modelling and nDNA sequencing support a new, morphologically cryptic beetle species unveiled by DNA barcoding.

PubMed

Hawlitschek, Oliver; Porch, Nick; Hendrich, Lars; Balke, Michael

2011-02-09

DNA sequencing techniques used to estimate biodiversity, such as DNA barcoding, may reveal cryptic species. However, disagreements between barcoding and morphological data have already led to controversy. Species delimitation should therefore not be based on mtDNA alone. Here, we explore the use of nDNA and bioclimatic modelling in a new species of aquatic beetle revealed by mtDNA sequence data. The aquatic beetle fauna of Australia is characterised by high degrees of endemism, including local radiations such as the genus Antiporus. Antiporus femoralis was previously considered to exist in two disjunct, but morphologically indistinguishable populations in south-western and south-eastern Australia. We constructed a phylogeny of Antiporus and detected a deep split between these populations. Diagnostic characters from the highly variable nuclear protein encoding arginine kinase gene confirmed the presence of two isolated populations. We then used ecological niche modelling to examine the climatic niche characteristics of the two populations. All results support the status of the two populations as distinct species. We describe the south-western species as Antiporus occidentalis sp.n. In addition to nDNA sequence data and extended use of mitochondrial sequences, ecological niche modelling has great potential for delineating morphologically cryptic species.

Ecological Niche Modelling and nDNA Sequencing Support a New, Morphologically Cryptic Beetle Species Unveiled by DNA Barcoding

PubMed Central

Hawlitschek, Oliver; Porch, Nick; Hendrich, Lars; Balke, Michael

2011-01-01

Background DNA sequencing techniques used to estimate biodiversity, such as DNA barcoding, may reveal cryptic species. However, disagreements between barcoding and morphological data have already led to controversy. Species delimitation should therefore not be based on mtDNA alone. Here, we explore the use of nDNA and bioclimatic modelling in a new species of aquatic beetle revealed by mtDNA sequence data. Methodology/Principal Findings The aquatic beetle fauna of Australia is characterised by high degrees of endemism, including local radiations such as the genus Antiporus. Antiporus femoralis was previously considered to exist in two disjunct, but morphologically indistinguishable populations in south-western and south-eastern Australia. We constructed a phylogeny of Antiporus and detected a deep split between these populations. Diagnostic characters from the highly variable nuclear protein encoding arginine kinase gene confirmed the presence of two isolated populations. We then used ecological niche modelling to examine the climatic niche characteristics of the two populations. All results support the status of the two populations as distinct species. We describe the south-western species as Antiporus occidentalis sp.n. Conclusion/Significance In addition to nDNA sequence data and extended use of mitochondrial sequences, ecological niche modelling has great potential for delineating morphologically cryptic species. PMID:21347370
Metal resistant plants and phytoremediation of environmental contamination

DOEpatents

Meagher, Richard B.; Li, Yujing; Dhankher, Om P.

2010-04-20

The present disclosure provides a method of producing transgenic plants which are resistant to at least one metal ion by transforming the plant with a recombinant DNA comprising a nucleic acid encoding a bacterial arsenic reductase under the control of a plant expressible promoter, and a nucleic acid encoding a nucleotide sequence encoding a phytochelatin biosynthetic enzyme under the control of a plant expressible promoter. The invention also relates a method of phytoremediation of a contaminated site by growing in the site a transgenic plant expressing a nucleic acid encoding a bacterial arsenate reductase and a nucleic acid encoding a phytochelatin biosynthetic enzyme.
Cloning and sequencing the genes encoding goldfish and carp ependymin.

PubMed

Adams, D S; Shashoua, V E

1994-04-20

Ependymins (EPNs) are brain glycoproteins thought to function in optic nerve regeneration and long-term memory consolidation. To date, epn genes have been characterized in two orders of teleost fish. In this study, polymerase chain reactions (PCR) were used to amplify the complete 1.6-kb epn genes, gf-I and cc-I, from genomic DNA of Cypriniformes, goldfish and carp, respectively. Amplified bands were cloned and sequenced. Each gene consists of six exons and five introns. The exon portion of gf-I encodes a predicted 215-amino-acid (aa) protein previously characterized as GF-I, while cc-I encodes a predicted 215-aa protein 95% homologous to GF-I.
Purification, cDNA cloning, and characterization of LysM-containing plant chitinase from horsetail (Equisetum arvense).

PubMed

Inamine, Saki; Onaga, Shoko; Ohnuma, Takayuki; Fukamizo, Tamo; Taira, Toki

2015-01-01

Chitinase-A (EaChiA), molecular mass 36 kDa, was purified from the vegetative stems of a horsetail (Equisetum arvense) using a series of column chromatography. The N-terminal amino acid sequence of EaChiA was similar to the lysin motif (LysM). A cDNA encoding EaChiA was cloned by rapid amplification of cDNA ends and polymerase chain reaction. It consisted of 1320 nucleotides and encoded an open reading frame of 361 amino acid residues. The deduced amino acid sequence indicated that EaChiA is composed of a N-terminal LysM domain and a C-terminal plant class IIIb chitinase catalytic domain, belonging to the glycoside hydrolase family 18, linked by proline-rich regions. EaChiA has strong chitin-binding activity, however, no antifungal activity. This is the first report of a chitinase from Equisetopsida, a class of fern plants, and the second report of a LysM-containing chitinase from a plant.
Three closely related herpesviruses are associated with fibropapillomatosis in marine turtles

USGS Publications Warehouse

Quackenbush, S.L.; Work, Thierry M.; Balazs, George H.; Casey, Rufina N.; Rovnak, J.; Chaves, A.; duToit, L.; Baines, J.D.; Parrish, C.R.; Bowser, Paul R.; Casey, James W.

1998-01-01

Green turtle fibropapillomatosis is a neoplastic disease of increasingly significant threat to the survivability of this species. Degenerate PCR primers that target highly conserved regions of genes encoding herpesvirus DNA polymerases were used to amplify a DNA sequence from fibropapillomas and fibromas from Hawaiian and Florida green turtles. All of the tumors tested (n= 23) were found to harbor viral DNA, whereas no viral DNA was detected in skin biopsies from tumor-negative turtles. The tissue distribution of the green turtle herpesvirus appears to be generally limited to tumors where viral DNA was found to accumulate at approximately two to five copies per cell and is occasionally detected, only by PCR, in some tissues normally associated with tumor development. In addition, herpesviral DNA was detected in fibropapillomas from two loggerhead and four olive ridley turtles. Nucleotide sequencing of a 483-bp fragment of the turtle herpesvirus DNA polymerase gene determined that the Florida green turtle and loggerhead turtle sequences are identical and differ from the Hawaiian green turtle sequence by five nucleotide changes, which results in two amino acid substitutions. The olive ridley sequence differs from the Florida and Hawaiian green turtle sequences by 15 and 16 nucleotide changes, respectively, resulting in four amino acid substitutions, three of which are unique to the olive ridley sequence. Our data suggest that these closely related turtle herpesviruses are intimately involved in the genesis of fibropapillomatosis.
Genomic instability--an evolving hallmark of cancer.

PubMed

Negrini, Simona; Gorgoulis, Vassilis G; Halazonetis, Thanos D

2010-03-01

Genomic instability is a characteristic of most cancers. In hereditary cancers, genomic instability results from mutations in DNA repair genes and drives cancer development, as predicted by the mutator hypothesis. In sporadic (non-hereditary) cancers the molecular basis of genomic instability remains unclear, but recent high-throughput sequencing studies suggest that mutations in DNA repair genes are infrequent before therapy, arguing against the mutator hypothesis for these cancers. Instead, the mutation patterns of the tumour suppressor TP53 (which encodes p53), ataxia telangiectasia mutated (ATM) and cyclin-dependent kinase inhibitor 2A (CDKN2A; which encodes p16INK4A and p14ARF) support the oncogene-induced DNA replication stress model, which attributes genomic instability and TP53 and ATM mutations to oncogene-induced DNA damage.
The gene therapy of collagen-induced arthritis in rats by intramuscular administration of the plasmid encoding TNF-binding domain of variola virus CrmB protein.

PubMed

Shchelkunov, S N; Taranov, O S; Tregubchak, T V; Maksyutov, R A; Silkov, A N; Nesterov, A E; Sennikov, S V

2016-07-01

Wistar rats with collagen-induced arthritis were intramuscularly injected with the recombinant plasmid pcDNA/sTNF-BD encoding the sequence of the TNF-binding protein domain of variola virus CrmB protein (VARV sTNF-BD) or the pcDNA3.1 vector. Quantitative analysis showed that the histopathological changes in the hind-limb joints of rats were most severe in the animals injected with pcDNA3.1 and much less severe in the group of rats injected with pcDNA/sTNF-BD, which indicates that gene therapy of rheumatoid arthritis is promising in the case of local administration of plasmids governing the synthesis of VARV immunomodulatory proteins.
BLAST and FASTA similarity searching for multiple sequence alignment.

PubMed

Pearson, William R

2014-01-01

BLAST, FASTA, and other similarity searching programs seek to identify homologous proteins and DNA sequences based on excess sequence similarity. If two sequences share much more similarity than expected by chance, the simplest explanation for the excess similarity is common ancestry-homology. The most effective similarity searches compare protein sequences, rather than DNA sequences, for sequences that encode proteins, and use expectation values, rather than percent identity, to infer homology. The BLAST and FASTA packages of sequence comparison programs provide programs for comparing protein and DNA sequences to protein databases (the most sensitive searches). Protein and translated-DNA comparisons to protein databases routinely allow evolutionary look back times from 1 to 2 billion years; DNA:DNA searches are 5-10-fold less sensitive. BLAST and FASTA can be run on popular web sites, but can also be downloaded and installed on local computers. With local installation, target databases can be customized for the sequence data being characterized. With today's very large protein databases, search sensitivity can also be improved by searching smaller comprehensive databases, for example, a complete protein set from an evolutionarily neighboring model organism. By default, BLAST and FASTA use scoring strategies target for distant evolutionary relationships; for comparisons involving short domains or queries, or searches that seek relatively close homologs (e.g. mouse-human), shallower scoring matrices will be more effective. Both BLAST and FASTA provide very accurate statistical estimates, which can be used to reliably identify protein sequences that diverged more than 2 billion years ago.
Seafood delicacy makes great adhesive

ScienceCinema

Idaho National Laboratory - Frank Roberto, Heather Silverman

2017-12-09

Technology from Mother Nature is often hard to beat, so Idaho National Laboratory scientistsgenetically analyzed the adhesive proteins produced by blue mussels, a seafood delicacy. Afterobtaining full-length DNA sequences encoding these proteins, reprod
JVM: Java Visual Mapping tool for next generation sequencing read.

PubMed

Yang, Ye; Liu, Juan

2015-01-01

We developed a program JVM (Java Visual Mapping) for mapping next generation sequencing read to reference sequence. The program is implemented in Java and is designed to deal with millions of short read generated by sequence alignment using the Illumina sequencing technology. It employs seed index strategy and octal encoding operations for sequence alignments. JVM is useful for DNA-Seq, RNA-Seq when dealing with single-end resequencing. JVM is a desktop application, which supports reads capacity from 1 MB to 10 GB.
The product of the Saccharomyces cerevisiae cell cycle gene DBF2 has homology with protein kinases and is periodically expressed in the cell cycle.

PubMed Central

Johnston, L H; Eberly, S L; Chapman, J W; Araki, H; Sugino, A

1990-01-01

Several Saccharomyces cerevisiae dbf mutants defective in DNA synthesis have been described previously. In this paper, one of them, dbf2, is characterized in detail. The DBF2 gene has been cloned and mapped, and its nucleotide sequence has been determined. This process has identified an open reading frame capable of encoding a protein of molecular weight 64,883 (561 amino acids). The deduced amino acid sequence contains all 11 conserved domains found in various protein kinases. DBF2 was periodically expressed in the cell cycle at a time that clearly differed from the time of expression of either the histone H2A or DNA polymerase I gene. Its first function was completed very near to initiation of DNA synthesis. However, DNA synthesis in the mutant was only delayed at 37 degrees C, and the cells blocked in nuclear division. Consistent with this finding, the execution point occurred about 1 h after DNA synthesis, and the nuclear morphology of the mutant at the restrictive temperature was that of cells blocked in late nuclear division. DBF2 is therefore likely to encode a protein kinase that may function in initiation of DNA synthesis and also in late nuclear division. Images PMID:2181271
In silico design of a DNA-based HIV-1 multi-epitope vaccine for Chinese populations

PubMed Central

Yang, Yi; Sun, Weilai; Guo, Jingjing; Zhao, Guangyu; Sun, Shihui; Yu, Hong; Guo, Yan; Li, Jungfeng; Jin, Xia; Du, Lanying; Jiang, Shibo; Kou, Zhihua; Zhou, Yusen

2015-01-01

The development of an HIV-1 vaccine that is capable of inducing effective and broadly cross-reactive humoral and cellular immune responses remains a challenging task because of the extensive diversity of HIV-1, the difference of virus subtypes (clades) in different geographical regions, and the polymorphism of human leukocyte antigens (HLA). We performed an in silico design of 3 DNA vaccines, designated pJW4303-MEG1, pJW4303-MEG2 and pJW4303-MEG3, encoding multi-epitopes that are highly conserved within the HIV-1 subtypes most prevalent in China and can be recognized through HLA alleles dominant in China. The pJW4303-MEG1-encoded protein consisted of one Th epitope in Env, and one, 2, and 6 epitopes in Pol, Env, and Gag proteins, respectively, with a GGGS linker sequence between epitopes. The pJW4303-MEG2-encoded protein contained similar epitopes in a different order, but with the same linker as pJW4303-MEG1. The pJW4303-MEG3-encoded protein contained the same epitopes in the same order as that of pJW4303-MEG2, but with a different linker sequence (AAY). To evaluate immunogenicity, mice were immunized intramuscularly with these DNA vaccines. Both pJW4303-MEG1 and pJW4303-MEG2 vaccines induced equally potent humoral and cellular immune responses in the vaccinated mice, while pJW4303-MEG3 did not induce immune responses. These results indicate that both epitope and linker sequences are important in designing effective epitope-based vaccines against HIV-1 and other viruses. PMID:25839222
FoxP3 as a Missing Link Between Inflammation and Breast Cancer

DTIC Science & Technology

2011-09-01

CONTRACTING ORGANIZATION : The...PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) 8. PERFORMING ORGANIZATION REPORT NUMBER 9. SPONSORING / MONITORING AGENCY NAME(S...sequenced exons 2 and 6 of human YAP, which encodes amino acid sequence encompassing S127 and S347 (S381) sites, respectively. DNA were prepared from
Cloning and characterization of cDNAs encoding human gastrin-releasing peptide.

PubMed Central

Spindel, E R; Chin, W W; Price, J; Rees, L H; Besser, G M; Habener, J F

1984-01-01

We have prepared and cloned cDNAs derived from poly(A)+ RNA from a human pulmonary carcinoid tumor rich in immunoreactivity to gastrin-releasing peptide, a peptide closely related in structure to amphibian bombesin. Mixtures of synthetic oligodeoxyribonucleotides corresponding to amphibian bombesin were used as hybridization probes to screen a cDNA library prepared from the tumor RNA. Sequencing of the recombinant plasmids shows that human gastrin-releasing peptide (hGRP) mRNA encodes a precursor of 148 amino acids containing a typical signal sequence, hGRP consisting of 27 or 28 amino acids, and a carboxyl-terminal extension peptide. hGRP is flanked at its carboxyl terminus by two basic amino acids, following a glycine used for amidation of the carboxyl-terminal methionine. RNA blot analyses of tumor RNA show a major mRNA of 900 bases and a minor mRNA of 850 bases. Blot hybridization analyses using human genomic DNA are consistent with a single hGRP-encoding gene. The presence of two mRNAs encoding the hGRP precursor protein in the face of a single hGRP gene raises the possibility of alternative processing of the single RNA transcript. Images PMID:6207529
Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

PubMed

Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

2015-01-01

The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.
CDNA encoding a polypeptide including a hevein sequence

DOEpatents

Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

1995-03-21

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
BAC sequencing using pooled methods.

PubMed

Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

2015-01-01

Shotgun sequencing and assembly of a large, complex genome can be both expensive and challenging to accurately reconstruct the true genome sequence. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are main factors that plague de novo genome sequencing projects that typically result in highly fragmented assemblies and are difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.
Expression of glutathione peroxidase I gene in selenium-deficient rats.

PubMed Central

Reddy, A P; Hsu, B L; Reddy, P S; Li, N Q; Thyagaraju, K; Reddy, C C; Tam, M F; Tu, C P

1988-01-01

We have characterized a cDNA pGPX1211 encoding rat glutathione peroxidase I. The selenocysteine in the protein corresponded to a TGA codon in the coding region of the cDNA, similar to earlier findings in mouse and human genes, and a gene encoding the formate dehydrogenase from E. coli, another selenoenzyme. The rat GSH peroxidase I has a calculated subunit molecular weight of 22,155 daltons and shares 95% and 86% sequence homology with the mouse and human subunits, respectively. The 3'-noncoding sequence (greater than 930 bp) in pGPX1211 is much longer than that of the human sequences. We found that glutathione peroxidase I mRNA, but not the polypeptide, was expressed under nutritional stress of selenium deficiency where no glutathione peroxidase I activity can be detected. The failure of detecting any apoprotein for the glutathione peroxidase I under selenium deficiency and results published from other laboratories supports the proposal that selenium may be incorporated into the glutathione peroxidase I co-translationally. Images PMID:2838821
Quantum-Sequencing: Fast electronic single DNA molecule sequencing

NASA Astrophysics Data System (ADS)

Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

2014-03-01

A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.
Novel Structure of Ty3 Reverse Transcriptase | Center for Cancer Research

Cancer.gov

Retrotransposons are mobile genetic elements that self amplify via a single-stranded RNA intermediate, which is converted to double-stranded DNA by an encoded reverse transcriptase (RT) with both DNA polymerase (pol) and ribonuclease H (RNase) activities. Categorized by whether they contain flanking long terminal repeat (LTR) sequences, retrotransposons play a critical role in

[Forced Oscillations of DNA Bases].

PubMed

Yakushevich, L V; Krasnobaeva, L A

2016-01-01

This paper presents the results of the studying of forced angular oscillations of the DNA bases with the help of the mathematical model consisting of two coupled nonlinear differential equations that take into account the effects of dissipation and the influence of an external periodic field. The calculation results are illustrated for sequence of gene encoding interferon alpha 17 (IFNA 17).
Cloning and characterization of a cDNA encoding a novel extracellular peroxidase from Trametes versicolor.

PubMed

Collins, P J; O'Brien, M M; Dobson, A D

1999-03-01

The white rot basidiomycete Trametes versicolor secretes a large number of peroxidases which are believed to be involved in the degradation of polymeric lignin. These peroxidases have been classified previously as lignin peroxidases or manganese peroxidases (MnP). We have isolated a novel extracellular peroxidase-encoding cDNA sequence from T. versicolor CU1, the transcript levels of which are repressed by low concentrations of Mn2+ and induced by nitrogen and carbon but not induced in response to a range of stresses which have been reported to induce MnP expression.
Cloning and Characterization of a cDNA Encoding a Novel Extracellular Peroxidase from Trametes versicolor

PubMed Central

Collins, Patrick J.; O’Brien, Margaret M.; Dobson, Alan D. W.

1999-01-01

The white rot basidiomycete Trametes versicolor secretes a large number of peroxidases which are believed to be involved in the degradation of polymeric lignin. These peroxidases have been classified previously as lignin peroxidases or manganese peroxidases (MnP). We have isolated a novel extracellular peroxidase-encoding cDNA sequence from T. versicolor CU1, the transcript levels of which are repressed by low concentrations of Mn2+ and induced by nitrogen and carbon but not induced in response to a range of stresses which have been reported to induce MnP expression. PMID:10049906
The spectrum and clinical impact of epigenetic modifier mutations in myeloma

PubMed Central

Pawlyn, Charlotte; Kaiser, Martin F; Heuck, Christoph; Melchor, Lorenzo; Wardell, Christopher P; Murison, Alex; Chavan, Shweta; Johnson, David C; Begum, Dil; Dahir, Nasrin; Proszek, Paula; Cairns, David A; Boyle, Eileen M; Jones, John R; Cook, Gordon; Drayson, Mark T; Owen, Roger G; Gregory, Walter M; Jackson, Graham H; Barlogie, Bart; Davies, Faith E; Walker, Brian A; Morgan, Gareth J

2016-01-01

Purpose Epigenetic dysregulation is known to be an important contributor to myeloma pathogenesis but, unlike in other B cell malignancies, the full spectrum of somatic mutations in epigenetic modifiers has not been previously reported. We sought to address this using results from whole-exome sequencing in the context of a large prospective clinical trial of newly diagnosed patients and targeted sequencing in a cohort of previously treated patients for comparison. Experimental Design Whole-exome sequencing analysis of 463 presenting myeloma cases entered in the UK NCRI Myeloma XI study and targeted sequencing analysis of 156 previously treated cases from the University of Arkansas for Medical Sciences. We correlated the presence of mutations with clinical outcome from diagnosis and compared the mutations found at diagnosis with later stages of disease. Results In diagnostic myeloma patient samples we identify significant mutations in genes encoding the histone 1 linker protein, previously identified in other B-cell malignancies. Our data suggest an adverse prognostic impact from the presence of lesions in genes encoding DNA methylation modifiers and the histone demethylase KDM6A/UTX. The frequency of mutations in epigenetic modifiers appears to increase following treatment most notably in genes encoding histone methyltransferases and DNA methylation modifiers. Conclusions Numerous mutations identified raise the possibility of targeted treatment strategies for patients either at diagnosis or relapse supporting the use of sequencing-based diagnostics in myeloma to help guide therapy as more epigenetic targeted agents become available. PMID:27235425
Molecular Cloning of an Immunogenic Protein of Baylisascaris procyonis and Expression in Escherichia coli for Use in Developing Improved Serodiagnostic Assays▿

PubMed Central

Dangoudoubiyam, Sriveny; Vemulapalli, Ramesh; Hancock, Kathy; Kazacos, Kevin R.

2010-01-01

Larva migrans caused by Baylisascaris procyonis is an important zoonotic disease. Current serological diagnostic assays for this disease depend on the use of the parasite's larval excretory-secretory (ES) antigens. In order to identify genes encoding ES antigens and to generate recombinant antigens for use in diagnostic assays, construction and immunoscreening of a B. procyonis third-stage larva cDNA expression library was performed and resulted in identification of a partial-length cDNA clone encoding an ES antigen, designated repeat antigen 1 (RAG1). The full-length rag1 cDNA contained a 753-bp open reading frame that encoded a protein of 250 amino acids with 12 tandem repeats of a 12-amino-acid long sequence. The rag1 genomic DNA revealed a single intron of 837 bp that separated the 753-bp coding sequence into two exons delimited by canonical splice sites. No nucleotide or amino acid sequences present in the GenBank databases had significant similarity with those of RAG1. We have cloned, expressed, and purified the recombinant RAG1 (rRAG1) and analyzed its diagnostic potential by enzyme-linked immunosorbent assay. Anti-Baylisascaris species-specific rabbit serum showed strong reactivity to rRAG1, while only minimal to no reactivity was observed with sera against the related ascarids Toxocara canis and Ascaris suum, strongly suggesting the specificity of rRAG1. On the basis of these results, the identified RAG1 appears to be a promising diagnostic antigen for the development of serological assays for specific detection of B. procyonis larva migrans. PMID:20926699
Cloning of the Gene Encoding a 22-Kilodalton Cell Surface Antigen of Mycobacterium bovis BCG and Analysis of Its Potential for DNA Vaccination against Tuberculosis

PubMed Central

Lefèvre, Philippe; Denis, Olivier; De Wit, Lucas; Tanghe, Audrey; Vandenbussche, Paul; Content, Jean; Huygen, Kris

2000-01-01

Using spleen cells from mice vaccinated with live Mycobacterium bovis BCG, we previously generated three monoclonal antibodies reactive against a 22-kDa protein present in mycobacterial culture filtrate (CF) (K. Huygen et al., Infect. Immun. 61:2687–2693, 1993). These monoclonal antibodies were used to screen an M. bovis BCG genomic library made in phage λgt11. The gene encoding a 233-amino-acid (aa) protein, including a putative 26-aa signal sequence, was isolated, and sequence analysis indicated that the protein was 98% identical with the M. tuberculosis Lppx protein and that it contained a sequence 94% identical with the M. leprae 38-mer polypeptide 13B3 recognized by T cells from killed M. leprae-immunized subjects. Flow cytometry and cell fractionation demonstrated that the 22-kDa CF protein is also highly expressed in the bacterial cell wall and membrane compartment but not in the cytosol. C57BL/6, C3H, and BALB/c mice were vaccinated with plasmid DNA encoding the 22-kDa protein and analyzed for immune response and protection against intravenous M. tuberculosis challenge. Whereas DNA vaccination induced elevated antibody responses in C57BL/6 and particularly in C3H mice, Th1-type cytokine response, as measured by interleukin-2 and gamma interferon secretion, was only modest, and no protection against intravenous M. tuberculosis challenge was observed in any of the three mouse strains tested. Therefore, the 22-kDa antigen seems to have little potential for a DNA vaccine against tuberculosis, but it may be a good candidate for a mycobacterial antigen detection test. PMID:10678905
Cloning and sequence analysis of the Antheraea pernyi nucleopolyhedrovirus gp64 gene.

PubMed

Wang, Wenbing; Zhu, Shanying; Wang, Liqun; Yu, Feng; Shen, Weide

2005-12-01

Frequent outbreaks of the purulence disease of Chinese oak silkworm are reported in Middle and Northeast China. The disease is produced by the pathogen Antheraea pernyi nucleopolyhedrovirus (AnpeNPV). To obtain molecular information of the virus, the polyhedra of AnpeNPV were purified and characterized. The genomic DNA of AnpeNPV was extracted and digested with HindIII. The genome size of AnpeNPV is estimated at 128 kb. Based on the analysis of DNA fragments digested with HindIII, 23 fragments were bigger than 564 bp. A genomic library was generated using HindIII and the positive clones were sequenced and analysed. The gp64 gene, encoding the baculovirus envelope protein GP64, was found in an insert. The nucleotide sequence analysis indicated that the AnpeNPV gp64 gene consists of a 1,530 nucleotide open reading frame (ORF), encoding a protein of 509 amino acids. Of the eight gp64 homologues, the AnpeNPV gp64 ORF shared the most sequence similarity with the gp64 gene of Anticarsia gemmatalis NPV, but not Bombyx mori NPV. The upstream region of the AnpeNPV gp64 ORF encoded the conserved transcriptional elements for early and late stage of the viral infection cycle. These results indicated that AnpeNPV belongs to group I NPV and was far removed in molecular phylogeny from the BmNPV.
[Construction and functional identification of eukaryotic expression vector carrying Sprague-Dawley rat MSX-2 gene].

PubMed

Yang, Xian-Xian; Zhang, Mei; Yan, Zhao-Wen; Zhang, Ru-Hong; Mu, Xiong-Zheng

2008-01-01

To construct a high effective eukaryotic expressing plasmid PcDNA 3.1-MSX-2 encoding Sprague-Dawley rat MSX-2 gene for the further study of MSX-2 gene function. The full length SD rat MSX-2 gene was amplified by PCR, and the full length DNA was inserted in the PMD1 8-T vector. It was isolated by restriction enzyme digest with BamHI and Xhol, then ligated into the cloning site of the PcDNA3.1 expression plasmid. The positive recombinant was identified by PCR analysis, restriction endonudease analysis and sequence analysis. Expression of RNA and protein was detected by RT-PCR and Western blot analysis in PcDNA3.1-MSX-2 transfected HEK293 cells. Sequence analysis and restriction endonudease analysis of PcDNA3.1-MSX-2 demonstrated that the position and size of MSX-2 cDNA insertion were consistent with the design. RT-PCR and Western blot analysis showed specific expression of mRNA and protein of MSX-2 in the transfected HEK293 cells. The high effective eukaryotic expression plasmid PcDNA3.1-MSX-2 encoding Sprague-Dawley Rat MSX-2 gene which is related to craniofacial development can be successfully reconstructed. It may serve as the basis for the further study of MSX-2 gene function.
Bridging two scholarly islands enriches both: COI DNA barcodes for species identification versus human mitochondrial variation for the study of migrations and pathologies.

PubMed

Thaler, David S; Stoeckle, Mark Y

2016-10-01

DNA barcodes for species identification and the analysis of human mitochondrial variation have developed as independent fields even though both are based on sequences from animal mitochondria. This study finds questions within each field that can be addressed by reference to the other. DNA barcodes are based on a 648-bp segment of the mitochondrially encoded cytochrome oxidase I. From most species, this segment is the only sequence available. It is impossible to know whether it fairly represents overall mitochondrial variation. For modern humans, the entire mitochondrial genome is available from thousands of healthy individuals. SNPs in the human mitochondrial genome are evenly distributed across all protein-encoding regions arguing that COI DNA barcode is representative. Barcode variation among related species is largely based on synonymous codons. Data on human mitochondrial variation support the interpretation that most - possibly all - synonymous substitutions in mitochondria are selectively neutral. DNA barcodes confirm reports of a low variance in modern humans compared to nonhuman primates. In addition, DNA barcodes allow the comparison of modern human variance to many other extant animal species. Birds are a well-curated group in which DNA barcodes are coupled with census and geographic data. Putting modern human variation in the context of intraspecies variation among birds shows humans to be a single breeding population of average variance.
Cloning and expression of the cDNA encoding human fumarylacetoacetate hydrolase, the enzyme deficient in hereditary tyrosinemia: assignment of the gene to chromosome 15.

PubMed Central

Phaneuf, D; Labelle, Y; Bérubé, D; Arden, K; Cavenee, W; Gagné, R; Tanguay, R M

1991-01-01

Type 1 hereditary tyrosinemia (HT) is an autosomal recessive disease characterized by a deficiency of the enzyme fumarylacetoacetate hydrolase (FAH; E.C.3.7.1.2). We have isolated human FAH cDNA clones by screening a liver cDNA expression library using specific antibodies and plaque hybridization with a rat FAH cDNA probe. A 1,477-bp cDNA was sequenced and shown to code for FAH by an in vitro transcription-translation assay and sequence homology with tryptic fragments of purified FAH. Transient expression of this FAH cDNA in transfected CV-1 mammalian cells resulted in the synthesis of an immunoreactive protein comigrating with purified human liver FAH on SDS-PAGE and having enzymatic activity as shown by the hydrolysis of the natural substrate fumarylacetoacetate. This indicates that the single polypeptide chain encoded by the FAH gene contains all the genetic information required for functional activity, suggesting that the dimer found in vivo is a homodimer. The human FAH cDNA was used as a probe to determine the gene's chromosomal localization using somatic cell hybrids and in situ hybridization. The human FAH gene maps to the long arm of chromosome 15 in the region q23-q25. Images Figure 1 Figure 3 Figure 4 Figure 6 Figure 8 PMID:1998338
Run-length encoding graphic rules, biochemically editable designs and steganographical numeric data embedment for DNA-based cryptographical coding system

PubMed Central

Kawano, Tomonori

2013-01-01

There have been a wide variety of approaches for handling the pieces of DNA as the “unplugged” tools for digital information storage and processing, including a series of studies applied to the security-related area, such as DNA-based digital barcodes, water marks and cryptography. In the present article, novel designs of artificial genes as the media for storing the digitally compressed data for images are proposed for bio-computing purpose while natural genes principally encode for proteins. Furthermore, the proposed system allows cryptographical application of DNA through biochemically editable designs with capacity for steganographical numeric data embedment. As a model case of image-coding DNA technique application, numerically and biochemically combined protocols are employed for ciphering the given “passwords” and/or secret numbers using DNA sequences. The “passwords” of interest were decomposed into single letters and translated into the font image coded on the separate DNA chains with both the coding regions in which the images are encoded based on the novel run-length encoding rule, and the non-coding regions designed for biochemical editing and the remodeling processes revealing the hidden orientation of letters composing the original “passwords.” The latter processes require the molecular biological tools for digestion and ligation of the fragmented DNA molecules targeting at the polymerase chain reaction-engineered termini of the chains. Lastly, additional protocols for steganographical overwriting of the numeric data of interests over the image-coding DNA are also discussed. PMID:23750303
The ENCODE Project at UC Santa Cruz.

PubMed

Thomas, Daryl J; Rosenbloom, Kate R; Clawson, Hiram; Hinrichs, Angie S; Trumbower, Heather; Raney, Brian J; Karolchik, Donna; Barber, Galt P; Harte, Rachel A; Hillman-Jackson, Jennifer; Kuhn, Robert M; Rhead, Brooke L; Smith, Kayla E; Thakkapallayil, Archana; Zweig, Ann S; Haussler, David; Kent, W James

2007-01-01

The goal of the Encyclopedia Of DNA Elements (ENCODE) Project is to identify all functional elements in the human genome. The pilot phase is for comparison of existing methods and for the development of new methods to rigorously analyze a defined 1% of the human genome sequence. Experimental datasets are focused on the origin of replication, DNase I hypersensitivity, chromatin immunoprecipitation, promoter function, gene structure, pseudogenes, non-protein-coding RNAs, transcribed RNAs, multiple sequence alignment and evolutionarily constrained elements. The ENCODE project at UCSC website (http://genome.ucsc.edu/ENCODE) is the primary portal for the sequence-based data produced as part of the ENCODE project. In the pilot phase of the project, over 30 labs provided experimental results for a total of 56 browser tracks supported by 385 database tables. The site provides researchers with a number of tools that allow them to visualize and analyze the data as well as download data for local analyses. This paper describes the portal to the data, highlights the data that has been made available, and presents the tools that have been developed within the ENCODE project. Access to the data and types of interactive analysis that are possible are illustrated through supplemental examples.
Rapid and efficient cDNA library screening by self-ligation of inverse PCR products (SLIP).

PubMed

Hoskins, Roger A; Stapleton, Mark; George, Reed A; Yu, Charles; Wan, Kenneth H; Carlson, Joseph W; Celniker, Susan E

2005-12-02

cDNA cloning is a central technology in molecular biology. cDNA sequences are used to determine mRNA transcript structures, including splice junctions, open reading frames (ORFs) and 5'- and 3'-untranslated regions (UTRs). cDNA clones are valuable reagents for functional studies of genes and proteins. Expressed Sequence Tag (EST) sequencing is the method of choice for recovering cDNAs representing many of the transcripts encoded in a eukaryotic genome. However, EST sequencing samples a cDNA library at random, and it recovers transcripts with low expression levels inefficiently. We describe a PCR-based method for directed screening of plasmid cDNA libraries. We demonstrate its utility in a screen of libraries used in our Drosophila EST projects for 153 transcription factor genes that were not represented by full-length cDNA clones in our Drosophila Gene Collection. We recovered high-quality, full-length cDNAs for 72 genes and variously compromised clones for an additional 32 genes. The method can be used at any scale, from the isolation of cDNA clones for a particular gene of interest, to the improvement of large gene collections in model organisms and the human. Finally, we discuss the relative merits of directed cDNA library screening and RT-PCR approaches.
Structure, inheritance, and expression of hybrid poplar (Populus trichocarpa x Populus deltoides) phenylalanine ammonia-lyase genes.

PubMed Central

Subramaniam, R; Reinold, S; Molitor, E K; Douglas, C J

1993-01-01

A heterologous probe encoding phenylalanine ammonia-lyase (PAL) was used to identify PAL clones in cDNA libraries made with RNA from young leaf tissue of two Populus deltoides x P. trichocarpa F1 hybrid clones. Sequence analysis of a 2.4-kb cDNA confirmed its identity as a full-length PAl clone. The predicted amino acid sequence is conserved in comparison with that of PAL genes from several other plants. Southern blot analysis of popular genomic DNA from parental and hybrid individuals, restriction site polymorphism in PAL cDNA clones, and sequence heterogeneity in the 3' ends of several cDNA clones suggested that PAL is encoded by at least two genes that can be distinguished by HindIII restriction site polymorphisms. Clones containing each type of PAL gene were isolated from a poplar genomic library. Analysis of the segregation of PAL-specific HindIII restriction fragment-length polymorphisms demonstrated the existence of two independently segregating PAL loci, one of which was mapped to a linkage group of the poplar genetic map. Developmentally regulated PAL expression in poplar was analyzed using RNA blots. Highest expression was observed in young stems, apical buds, and young leaves. Expression was lower in older stems and undetectable in mature leaves. Cellular localization of PAL expression by in situ hybridization showed very high levels of expression in subepidermal cells of leaves early during leaf development. In stems and petioles, expression was associated with subepidermal cells and vascular tissues. PMID:8108506
Aptaligner: automated software for aligning pseudorandom DNA X-aptamers from next-generation sequencing data.

PubMed

Lu, Emily; Elizondo-Riojas, Miguel-Angel; Chang, Jeffrey T; Volk, David E

2014-06-10

Next-generation sequencing results from bead-based aptamer libraries have demonstrated that traditional DNA/RNA alignment software is insufficient. This is particularly true for X-aptamers containing specialty bases (W, X, Y, Z, ...) that are identified by special encoding. Thus, we sought an automated program that uses the inherent design scheme of bead-based X-aptamers to create a hypothetical reference library and Markov modeling techniques to provide improved alignments. Aptaligner provides this feature as well as length error and noise level cutoff features, is parallelized to run on multiple central processing units (cores), and sorts sequences from a single chip into projects and subprojects.
Molecular cloning and analysis of Schizosaccharomyces pombe Reb1p: sequence-specific recognition of two sites in the far upstream rDNA intergenic spacer.

PubMed Central

Zhao, A; Guo, A; Liu, Z; Pape, L

1997-01-01

The coding sequences for a Schizosaccharomyces pombe sequence-specific DNA binding protein, Reb1p, have been cloned. The predicted S. pombe Reb1p is 24-29% identical to mouse TTF-1 (transcription termination factor-1) and Saccharomyces cerevisiae REB1 protein, both of which direct termination of RNA polymerase I catalyzed transcripts. The S.pombe Reb1 cDNA encodes a predicted polypeptide of 504 amino acids with a predicted molecular weight of 58.4 kDa. The S. pombe Reb1p is unusual in that the bipartite DNA binding motif identified originally in S.cerevisiae and Klyveromyces lactis REB1 proteins is uninterrupted and thus S.pombe Reb1p may contain the smallest natural REB1 homologous DNA binding domain. Its genomic coding sequences were shown to be interrupted by two introns. A recombinant histidine-tagged Reb1 protein bearing the rDNA binding domain has two homologous, sequence-specific binding sites in the S. pomber DNA intergenic spacer, located between 289 and 480 nt downstream of the end of the approximately 25S rRNA coding sequences. Each binding site is 13-14 bp downstream of two of the three proposed in vivo termination sites. The core of this 17 bp site, AGGTAAGGGTAATGCAC, is specifically protected by Reb1p in footprinting analysis. PMID:9016645
[Construction of plant expression plasmid of chimera SBR-CT delta A1].

PubMed

Mai, Sui; Ling, Junqi

2003-08-01

The purpose of this study is to construct plant expression plasmid containing the gene encoding chimera SBR-CT delta A1. The target gene fragment P2, including the gene-encoded chimera SBR-CT delta A1 (3,498-5,378 bp), was obtained by standard PCR amplification. The PCR products were ligated with pGEM-easy vector through TA clone to form plasmid pTSC. The plasmid pTSC and plasmid pPOKII were digested by restricted endonuclease BamHI and KpnI, and the digested products were extracted and purified for recombination. Then the purified P2 and plasmid pPOKII were recombined by T4 DNA ligase to form recombinant plasmid pROSC; inserting bar gene into the plasmid and form pROSB plasmid. The recombined plasmids were isolated and identified by restricted endonuclease cutting and Sanger dideoxy DNA sequencing. P2 gene was linked to pPOKII plasmid and formed recombinant plasmid pROSC. The DNA sequence and orientation were corrected. And bar gene was inserted into pPOSC and form recombinant plasmid pROSB. Plant expression vector pROSC and pROSB containing the gene encoding chimera SBR-CT delta A1, which may provide useful experiment foundation for further study on edible vaccine against caries have been successfully constructed.
Production of a full-length infectious GFP-tagged cDNA clone of Beet mild yellowing virus for the study of plant-polerovirus interactions.

PubMed

Stevens, Mark; Viganó, Felicita

2007-04-01

The full-length cDNA of Beet mild yellowing virus (Broom's Barn isolate) was sequenced and cloned into the vector pLitmus 29 (pBMYV-BBfl). The sequence of BMYV-BBfl (5721 bases) shared 96% and 98% nucleotide identity with the other complete sequences of BMYV (BMYV-2ITB, France and BMYV-IPP, Germany respectively). Full-length capped RNA transcripts of pBMYV-BBfl were synthesised and found to be biologically active in Arabidopsis thaliana protoplasts following electroporation or PEG inoculation when the protoplasts were subsequently analysed using serological and molecular methods. The BMYV sequence was modified by inserting DNA that encoded the jellyfish green fluorescent protein (GFP) into the P5 gene close to its 3' end. A. thaliana protoplasts electroporated with these RNA transcripts were biologically active and up to 2% of transfected protoplasts showed GFP-specific fluorescence. The exploitation of these cDNA clones for the study of the biology of beet poleroviruses is discussed.
High-density fiber-optic DNA random microsphere array.

PubMed

Ferguson, J A; Steemers, F J; Walt, D R

2000-11-15

A high-density fiber-optic DNA microarray sensor was developed to monitor multiple DNA sequences in parallel. Microarrays were prepared by randomly distributing DNA probe-functionalized 3.1-microm-diameter microspheres in an array of wells etched in a 500-microm-diameter optical imaging fiber. Registration of the microspheres was performed using an optical encoding scheme and a custom-built imaging system. Hybridization was visualized using fluorescent-labeled DNA targets with a detection limit of 10 fM. Hybridization times of seconds are required for nanomolar target concentrations, and analysis is performed in minutes.
Molecular characterization and functional analysis of chalcone synthase from Syringa oblata Lindl. in the flavonoid biosynthetic pathway.

PubMed

Wang, Yu; Dou, Ying; Wang, Rui; Guan, Xuelian; Hu, Zenghui; Zheng, Jian

2017-11-30

The flower color of Syringa oblata Lindl., which is often modulated by the flavonoid content, varies and is an important ornamental feature. Chalcone synthase (CHS) catalyzes the first key step in the flavonoid biosynthetic pathway. However, little is known about the role of S. oblata CHS (SoCHS) in flavonoid biosynthesis in this species. Here, we isolate and analyze the cDNA (SoCHS1) that encodes CHS in S. oblata. We also sought to analyzed the molecular characteristics and function of flavonoid metabolism by SoCHS1. We successfully isolated the CHS-encoding genomic DNA (gDNA) in S. oblata (SoCHS1), and the gene structural analysis indicated it had no intron. The opening reading frame (ORF) sequence of SoCHS1 was 1170bp long and encoded a 389-amino acid polypeptide. Multiple sequence alignment revealed that both the conserved CHS active site residues and CHS signature sequence were in the deduced amino acid sequence of SoCHS1. Crystallographic analysis revealed that the protein structure of SoCHS1 is highly similar to that of FnCHS1 in Freesia hybrida. The quantitative real-time polymerase chain reaction (PCR) performed to detect the SoCHS1 transcript expression levels in flowers, and other tissues revealed the expression was significantly correlated with anthocyanin accumulation during flower development. The ectopic expression results of Nicotiana tabacum showed that SoCHS1 overexpression in transgenic tobacco changed the flower color from pale pink to pink. In conclusion, these results suggest that SoCHS1 plays an essential role in flavonoid biosynthesis in S. oblata, and could be used to modify flavonoid components in other plant species. Copyright © 2017. Published by Elsevier B.V.

Characterization of the gene encoding the polymorphic immunodominant molecule, a neutralizing antigen of Theileria parva

DOE Office of Scientific and Technical Information (OSTI.GOV)

Toye, P.G.; Metzelaar, M.J.; Wijngaard, P.L.J.

1995-08-01

Theileria parva, a tick-transmitted protozoan parasite related to Plasmodium spp., causes the disease East Coast fever, an acute and usually fatal lymphoproliferative disorder of cattle in Africa. Previous studies using sera from cattle that have survived infection identified a polymorphic immunodominant molecule (PIM) that is expressed by both the infective sporozoite stage of the parasite and the intracellular schizont. Here we show that mAb specific for the PIM Ag can inhibit sporozoite invasion of lymphocytes in vitro. A cDNA clone encoding the PIM Ag of the T. parva (Muguga) stock was obtained by using these mAb in a novel eukaryoticmore » expression cloning system that allows isolation of cDNA encoding cytoplasmic or surface Ags. To establish the molecular basis of the polymorphism of PIM, the cDNA of the PIM Ag from a buffalo-derived T. parva stock was isolated and its sequence was compared with that of the cattle-derived Muguga PIM. The two cDNAs showed considerable identity in both the 5{prime} and 3{prime} regions, but there was substantial sequence divergence in the central regions. Several types of repeated sequences were identified in the variant regions. In the Muguga form of the molecule, there were five tandem repeats of the tetrapeptide, QPEP, that were shown, by transfection of a deleted version of the PIM gene, not to react with several anti-PIM mAbs. By isolating and sequencing the genomic version of the gene, we identified two small introns in the 3{prime} region of the gene. Finally, we showed that polyclonal rat Abs against recombinant PIM neutralize sporozoite infectivity in vitro, suggesting that the PIM Ag should be evaluated for its capacity to immunize cattle against East Coast Fever.« less
Molecular cloning of MSSP-2, a c-myc gene single-strand binding protein: characterization of binding specificity and DNA replication activity.

PubMed Central

Takai, T; Nishita, Y; Iguchi-Ariga, S M; Ariga, H

1994-01-01

We have previously reported the human cDNA encoding MSSP-1, a sequence-specific double- and single-stranded DNA binding protein [Negishi, Nishita, Saëgusa, Kakizaki, Galli, Kihara, Tamai, Miyajima, Iguchi-Ariga and Ariga (1994) Oncogene, 9, 1133-1143]. MSSP-1 binds to a DNA replication origin/transcriptional enhancer of the human c-myc gene and has turned out to be identical with Scr2, a human protein which complements the defect of cdc2 kinase in S.pombe [Kataoka and Nojima (1994) Nucleic Acid Res., 22, 2687-2693]. We have cloned the cDNA for MSSP-2, another member of the MSSP family of proteins. The MSSP-2 cDNA shares highly homologous sequences with MSSP-1 cDNA, except for the insertion of 48 bp coding 16 amino acids near the C-terminus. Like MSSP-1, MSSP-2 has RNP-1 consensus sequences. The results of the experiments using bacterially expressed MSSP-2, and its deletion mutants, as histidine fusion proteins suggested that the binding specificity of MSSP-2 to double- and single-stranded DNA is the same as that of MSSP-1, and that the RNP consensus sequences are required for the DNA binding of the protein. MSSP-2 stimulated the DNA replication of an SV40-derived plasmid containing the binding sequence for MSSP-1 or -2. MSSP-2 is hence suggested to play an important role in regulation of DNA replication. Images PMID:7838710
Molecular cloning and characterization of a new basic peroxidase cDNA from soybean hypocotyls infected with Phytophthora sojae f.sp. glycines.

PubMed

Yi, S Y; Hwang, B K

1998-10-31

Differential display techniques were used to isolate cDNA clones corresponding to genes which were expressed in soybean hypocotyls by Phytophthora sojae f.sp. glycines infection. With a partial cDNA clone C20CI4 from the differential display PCR as a probe, a new basic peroxidase cDNA clone, designated GMIPER1, was isolated from a cDNA library of soybean hypocotyls infected with P. sojae f.sp. glycines. Sequence analysis revealed that the peroxidase clone encodes a mature protein of 35,813 Da with a putative signal peptide of 27 amino acids in its N-terminus. The amino acid sequence of the soybean peroxidase GMIPER1 is between 54-75% identical to other plant peroxidases including a soybean seed coat peroxidase. Southern blot analysis indicated that multiple copies of sequences related to GMIPER1 exist in the soybean genome. The mRNAs corresponding to the GMIPER1 cDNA accumulated predominantly in the soybean hypocotyls infected with the incompatible race of P. sojae f.sp. glycines, but were expressed at low levels in the compatible interaction. Soybean GMIPER1 mRNAs were not expressed in hypocotyls, leaves, stems, and roots of soybean seedlings. However, treatments with ethephon, salicylic acid or methyl jasmonate induced the accumulation of the GMIPER1 mRNAs in the different organs of soybean. These results suggest that the GMIPER1 gene encoding a putative pathogen-induced peroxidase may play an important role in induced resistance of soybean to P. sojae f.sp. glycines and in response to various external stresses.
Cloning of a cDNA encoding bovine mitochondrial NADP(+)-specific isocitrate dehydrogenase and structural comparison with its isoenzymes from different species.

PubMed Central

Huh, T L; Ryu, J H; Huh, J W; Sung, H C; Oh, I U; Song, B J; Veech, R L

1993-01-01

Mitochondrial NADP(+)-specific isocitrate dehydrogenase (IDP) was co-purified with the pyruvate dehydrogenase complex from bovine kidney mitochondria. The determination of its N-terminal 16-amino-acid sequence revealed that it is highly similar to the IDP from yeast. A cDNA clone (1.8 kb long) encoding this protein was isolated from a bovine kidney lambda gt11 cDNA library using a synthetic oligodeoxynucleotide. The deduced protein sequence of this cDNA clone rendered a precursor protein of 452 amino-acid residues (50,830 Da) and a mature protein of 413 amino-acid residues (46,519 Da). It is 100% identical to the internal tryptic peptide sequences of the autologous form from pig heart and 62% similar to that from yeast. However, it shares little similarity with the mitochondrial NAD(+)-specific isoenzyme from yeast. Structural analyses of the deduced proteins of IDP isoenzymes from different species indicated that similarity exists in certain regions, which may represent the common domains for the active sites or coenzyme-binding sites. In Northern-blot analysis, one species of mRNA (about 2.2 kb for both bovine and human) was hybridized with a 32P-labelled cDNA probe. Southern-blot analysis of genomic DNAs verified simple patterns of hybridization with this cDNA. These results strongly indicate that the mitochondrial IDP may be derived from a single gene family which does not appear to be closely related to that of the NAD(+)-specific isoenzyme. Images Figure 1 Figure 3 Figure 4 Figure 5 PMID:8318002
Partial structure of the phylloxin gene from the giant monkey frog, Phyllomedusa bicolor: parallel cloning of precursor cDNA and genomic DNA from lyophilized skin secretion.

PubMed

Chen, Tianbao; Gagliardo, Ron; Walker, Brian; Zhou, Mei; Shaw, Chris

2005-12-01

Phylloxin is a novel prototype antimicrobial peptide from the skin of Phyllomedusa bicolor. Here, we describe parallel identification and sequencing of phylloxin precursor transcript (mRNA) and partial gene structure (genomic DNA) from the same sample of lyophilized skin secretion using our recently-described cloning technique. The open-reading frame of the phylloxin precursor was identical in nucleotide sequence to that previously reported and alignment with the nucleotide sequence derived from genomic DNA indicated the presence of a 175 bp intron located in a near identical position to that found in the dermaseptins. The highly-conserved structural organization of skin secretion peptide genes in P. bicolor can thus be extended to include that encoding phylloxin (plx). These data further reinforce our assertion that application of the described methodology can provide robust genomic/transcriptomic/peptidomic data without the need for specimen sacrifice.
The primary structure of L37--a rat ribosomal protein with a zinc finger-like motif.

PubMed

Chan, Y L; Paz, V; Olvera, J; Wool, I G

1993-04-30

The amino acid sequence of the rat 60S ribosomal subunit protein L37 was deduced from the sequence of nucleotides in a recombinant cDNA. Ribosomal protein L37 has 96 amino acids, the NH2-terminal methionine is removed after translation of the mRNA, and has a molecular weight of 10,939. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type. Hybridization of the cDNA to digests of nuclear DNA suggests that there are 13 or 14 copies of the L37 gene. The mRNA for the protein is about 500 nucleotides in length. Rat L37 is related to Saccharomyces cerevisiae ribosomal protein YL35 and to Caenorhabditis elegans L37. We have identified in the data base a DNA sequence that encodes the chicken homolog of rat L37.
Isolation and characterization of a cDNA encoding a membrane bound acyl-CoA binding protein from Agave americana L. epidermis.

PubMed

Guerrero, Consuelo; Martín-Rufián, M; Reina, José J; Heredia, Antonio

2006-01-01

A cDNA encoding an acyl-CoA binding protein (ACBP) homologue has been cloned from a cDNA library made from mRNA isolated from epidermis of young leaves of Agave americana L. The derived amino acid sequence reveals a protein corresponding to the membrane-associated form of ACBPs only previously described in Arabidopsis and rice. Northern blot analysis showed that the A. americana ACBP gene is mainly expressed in the epidermis of mature zone of the leaves. The epidermis of A. americana leaves have a well developed cuticle with the highest amounts of the cuticular components waxes, cutin and cutan suggesting a potential role of the protein in cuticle formation.
Cloning and characterization of the human 5,10-methenyltetrahydrofolate synthetase-encoding cDNA.

PubMed

Dayan, A; Bertrand, R; Beauchemin, M; Chahla, D; Mamo, A; Filion, M; Skup, D; Massie, B; Jolivet, J

1995-11-20

Methenyltetrahydrofolate synthetase (MTHFS) catalyses the obligatory initial metabolic step in the intracellular conversion of 5-formyltetrahydrofolate to other reduced folates. We have isolated and sequenced a human MTHFS cDNA which is 872-bp long and codes for a 203-amino-acid protein of 23,229 Da. Escherichia coli BL21(DE3), transfected with pET11c plasmids containing an open reading frame encoding MTHFS, showed a 100-fold increase in MTHFS activity in bacterial extracts after IPTG induction. Northern blot studies of human tissues determined that the MTHFS mRNA was expressed preferentially in the liver and Southern blot analysis of human genomic DNA suggested the presence of a single-copy gene.
Wound induced Beta vulgaris polygalacturonase-inhibiting protein genes encode a longer leucine-rich repeat domain and inhibit fungal polygalacturonases

USDA-ARS?s Scientific Manuscript database

Polygalacturonase-inhibiting proteins (PGIPs) are leucine-rich repeat (LRR) proteins involved in plant defense. Sugar beet (Beta vulgaris L.) PGIP genes, BvPGIP1, BvPGIP2 and BvPGIP3, were isolated from two breeding lines, F1016 and F1010. Full-length cDNA sequences of the three BvPGIP genes encod...
The nop gene from Phanerochaete chrysosporium encodes a peroxidase with novel structural features

Treesearch

Luis F. Larrondo; Angel Gonzalez; Tomas Perez-Acle; Dan Cullen; Rafael Vicuna

2005-01-01

Inspection of the genome of the ligninolytic basidiomycete Phanerochaete chrysosporium revealed an unusual peroxidase-like sequence. The corresponding full length cDNA was sequenced and an archetypal secretion signal predicted. The deduced mature protein (NoP, novel peroxidase) contains 295 aa residues and is therefore considerably shorter than other Class II (fungal)...
Tyrosine Recombinase Retrotransposons and Transposons.

PubMed

Poulter, Russell T M; Butler, Margi I

2015-04-01

Retrotransposons carrying tyrosine recombinases (YR) are widespread in eukaryotes. The first described tyrosine recombinase mobile element, DIRS1, is a retroelement from the slime mold Dictyostelium discoideum. The YR elements are bordered by terminal repeats related to their replication via free circular dsDNA intermediates. Site-specific recombination is believed to integrate the circle without creating duplications of the target sites. Recently a large number of YR retrotransposons have been described, including elements from fungi (mucorales and basidiomycetes), plants (green algae) and a wide range of animals including nematodes, insects, sea urchins, fish, amphibia and reptiles. YR retrotransposons can be divided into three major groups: the DIRS elements, PAT-like and the Ngaro elements. The three groups form distinct clades on phylogenetic trees based on alignments of reverse transcriptase/ribonuclease H (RT/RH) and YR sequences, and also having some structural distinctions. A group of eukaryote DNA transposons, cryptons, also carry tyrosine recombinases. These DNA transposons do not encode a reverse transcriptase. They have been detected in several pathogenic fungi and oomycetes. Sequence comparisons suggest that the crypton YRs are related to those of the YR retrotransposons. We suggest that the YR retrotransposons arose from the combination of a crypton-like YR DNA transposon and the RT/RH encoding sequence of a retrotransposon. This acquisition must have occurred at a very early point in the evolution of eukaryotes.
Complementary DNA sequencing and identification of mRNAs from the venomous gland of Agkistrodon piscivorus leucostoma.

PubMed

Jia, Ying; Cantu, Bruno A; Sánchez, Elda E; Pérez, John C

2008-06-15

To advance our knowledge on the snake venom composition and transcripts expressed in venom gland at the molecular level, we constructed a cDNA library from the venom gland of Agkistrodon piscivorus leucostoma for the generation of expressed sequence tags (ESTs) database. From the randomly sequenced 2112 independent clones, we have obtained ESTs for 1309 (62%) cDNAs, which showed significant deduced amino acid sequence similarity (scores >80) to previously characterized proteins in National Center for Biotechnology Information (NCBI) database. Ribosomal proteins make up 47 clones (2%) and the remaining 756 (36%) cDNAs represent either unknown identity or show BLASTX sequence identity scores of <80 with known GenBank accessions. The most highly expressed gene encoding phospholipase A(2) (PLA(2)) accounting for 35% of A. p. leucostoma venom gland cDNAs was identified and further confirmed by crude venom applied to sodium dodecyl sulfate/polyacrylamide gel electrophoresis (SDS-PAGE) electrophoresis and protein sequencing. A total of 180 representative genes were obtained from the sequence assemblies and deposited to EST database. Clones showing sequence identity to disintegrins, thrombin-like enzymes, hemorrhagic toxins, fibrinogen clotting inhibitors and plasminogen activators were also identified in our EST database. These data can be used to develop a research program that will help us identify genes encoding proteins that are of medical importance or proteins involved in the mechanisms of the toxin venom.
Identification of cDNAs encoding viper venom hyaluronidases: cross-generic sequence conservation of full-length and unusually short variant transcripts.

PubMed

Harrison, Robert A; Ibison, Frances; Wilbraham, Davina; Wagstaff, Simon C

2007-05-01

The immobilisation of prey by snakes is most efficiently achieved by the rapid dissemination of venom from its site of injection into the blood stream. Hyaluronidase is a common component of snake venoms and has been termed the "venom spreading factor". In the absence of nucleotide or protein sequence data to confirm the functional identity of this venom component, we interrogated a venom gland EST database for the saw-scaled viper, Echis ocellatus (Nigeria), using the gene ontology (GO) term "carbohydrate metabolism". A single hyalurononglucosaminadase-activity matching sequence (EOC00242) was found and used to design PCR primers to acquire the full-length cDNA sequence. Although very different from the bee venom and mammalian hyaluronidase sequences, the E. ocellatus sequence retained all the catalytic, positional and structural residues that characterise this class of carbohydrate metabolising hydrolases. An extraordinarily high level of sequence identity (>95%) was observed in analogous venom gland cDNA sequences isolated (by PCR) from another saw-scaled viper species, E. pyramidum leakeyi (Kenya), and from the sahara horned viper, Cerastes cerastes cerastes (Egypt) and the puff adder, Bitis arietans (Nigeria). Smaller amplicons, lacking hyaluronidase catalytic residues because of 768 bp or 855 bp central deletions, appear to encode either truncated peptides without hyaluronidase activity, or are non-translated transcripts because they lack consensus translation initiating motifs.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Leong, JoAnn Ching

The nucleotide sequence of the IHNV glycoprotein gene has been determined from a cDNA clone containing the entire coding region. The glycoprotein cDNA clone contained a leader sequence of 48 bases, a coding region of 1524 nucleotides, and 39 bases at the 3 foot end. The entire cDNA clone contains 1609 nucleodites and encodes a protein of 508 amino acids. The deduced amino acid sequence gave a translated molecular weight of 56,795 daltons. A hydropathicity profile of the deduced amino acid sequence indicated that there were two major hydrophobic domains: one,at the N-terminus,delineating a signal peptide of 18 amino acidsmore » and the other, at the C-terminus,delineating the region of the transmembrane. Five possible sites of N-linked glyscoylation were identified. Although no nucleic acid homology existed between the IHNV glycoprotein gene and the glycoprotein genes of rabies and VSV, there was significant homology at the amino acid level between all three rhabdovirus glycoproteins.« less
Identification of Arcanobacterium pyogenes isolated by post mortem examinations of a bearded dragon and a gecko by phenotypic and genotypic properties.

PubMed

Ulbegi-Mohyla, H; Hijazin, M; Alber, J; Lämmler, C; Hassan, A A; Abdulmawjood, A; Prenger-Berninghoff, E; Weiss, R; Zschöck, M

2010-09-01

The present study was designed to identify phenotypically and genotypically two Arcanobacterium (A.) pyogenes strains isolated by post mortem examinations of a bearded dragon and a gecko. The A. pyogenes strains showed the typical biochemical properties and displayed CAMP-like synergistic hemolytic activities with various indicator strains. The species identity could be confirmed genotypically by amplification and sequencing of the 16S rDNA gene and, as novel target gene, by sequencing of the beta subunit of RNA polymerase encoding gene rpoB, of both strains and of reference strains representing nine species of the genus Arcanobacterium. The species identity of the two A. pyogenes strains could additionally be confirmed by PCR mediated amplification of species specific parts of the 16S-23S rDNA intergenic spacer region, the pyolysin encoding gene plo and by amplification of the collagen-binding protein encoding gene cbpA. All these molecular targets might help to improve the future identification and further characterization of A. pyogenes which, as demonstrated in the present study, could also be isolated from reptile specimens.
African swine fever virus encodes two genes which share significant homology with the two largest subunits of DNA-dependent RNA polymerases.

PubMed Central

Yáñez, R J; Boursnell, M; Nogal, M L; Yuste, L; Viñuela, E

1993-01-01

A random sequencing strategy applied to two large SalI restriction fragments (SB and SD) of the African swine fever virus (ASFV) genome revealed that they might encode proteins similar to the two largest RNA polymerase subunits of eukaryotes, poxviruses and Escherichia coli. After further mapping by dot-blot hybridization, two large open reading frames (ORFs) were completely sequenced. The first ORF (NP1450L) encodes a protein of 1450 amino acids with extensive similarity to the largest subunit of RNA polymerases. The second one (EP1242L) codes for a protein of 1242 amino acids similar to the second largest RNA polymerase subunit. Proteins NP1450L and EP1242L are more similar to the corresponding subunits of eukaryotic RNA polymerase II than to those of vaccinia virus, the prototype poxvirus, which shares many functional characteristics with ASFV. ORFs NP1450L and EP1242L are mainly expressed late in ASFV infection, after the onset of DNA replication. Images PMID:8506138
A novel chlorophyll a/b binding (Cab) protein gene from petunia which encodes the lower molecular weight Cab precursor protein.

PubMed

Stayton, M M; Black, M; Bedbrook, J; Dunsmuir, P

1986-12-22

The 16 petunia Cab genes which have been characterized are all closely related at the nucleotide sequence level and they encode Cab precursor polypeptides which are similar in sequence and length. Here we describe a novel petunia Cab gene which encodes a unique Cab precursor protein. This protein is a member of the smallest class of Cab precursor proteins for which no gene has previously been assigned in petunia or any other species. The features of this Cab precursor protein are that it is shorter by 2-3 amino acids than the formerly characterized Cab precursors, its transit peptide sequence is unrelated, and the mature polypeptide is significantly diverged at the functionally important N terminus from other petunia Cab proteins. Gene structure also discriminates this gene which is the only intron containing Cab gene in petunia genomic DNA.
Organization of the murine Cd22 locus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Law, Che-Leung; Torres, R.M.; Sundeberg, H.A.

1993-07-01

Murine CD22 (mCD22) is a B cell-associated adhesion protein with seven extracellular Ig-like domains that has 62% amino acid identify to its human homologue. Southern analysis on genomic DNA isolated from tissues and cell lines from several mouse strains using mCD22 cDNA demonstrated that the Cd22 locus encoding mCD22 is a single copy gene of [le]30 kb. Digestion of genomic DNA preparations with four restriction endonucleases revealed the presence of restriction fragment length polymorphisms (RFLP) in BALB/c, C57BL/6, and C3H strains vs DBA/2j, NZB, and NZC strains, suggesting the presence of two or more Cd22 alleles. Using a mCD22 cDNAmore » clone derived from the BALB/c strain, the authors isolated genomic clones from a DBA/2 genomic library that contained all the exons necessary to encode the full length mCD22 cDNA. Fifteen exons, including exon 3 that encodes the translation start codon, were identified. Each extracellular Ig-like domain of mCD22 is encoded by a single exon. A comparison between the nucleotide sequences of the BALB/c CD22 cDNA and the exons of the DBA/2j CD22 genomic clones revealed an 18-nucleotide deletion in exon 4 (encoding the most distal Ig-like domain 1 of mCD22) of the DBA/2j genomic sequence in addition to a number of substitutions, insertions, and deletions in other exons. These nucleotide differences were also present in a cDNA clone isolated from total RNA of LPS-activated DBA/2j splenocytes mosome 7, a region sytenic to human chromosome 19q, close to the previously reported loci, Lyb-8 and Mag (a homologue of Cd22). An antibody (CY34) against the Lyb-8.2 B cell marker reacted with a BHK transfectant expressing the full length mCd22 cDNA, thus demonstrating that Lyb-8 and Cd22 loci are identical. Furthermore, a rat anti-mCD22 mAb, NIM-R6, bound to slgM[sup +] DBA/2j B cells, confirming the expression of a CD22 protein by the Cd22[sup a]/lyb-8[sup a] allele. 63 refs., 7 figs., 1 tab.« less
Identification of DNA gyrase inhibitor (GyrI) in Escherichia coli.

PubMed

Nakanishi, A; Oshida, T; Matsushita, T; Imajoh-Ohmi, S; Ohnuki, T

1998-01-23

DNA gyrase is an essential enzyme in DNA replication in Escherichia coli. It mediates the introduction of negative supercoils near oriC, removal of positive supercoils ahead of the growing DNA fork, and separation of the two daughter duplexes. In the course of purifying DNA gyrase from E. coli KL16, we found an 18-kDa protein that inhibited the supercoiling activity of DNA gyrase, and we coined it DNA gyrase inhibitory protein (GyrI). Its NH2-terminal amino acid sequence of 16 residues was determined to be identical to that of a putative gene product (a polypeptide of 157 amino acids) encoded by yeeB (EMBL accession no. U00009) and sbmC (Baquero, M. R., Bouzon, M., Varea, J., and Moreno, F. (1995) Mol. Microbiol. 18, 301-311) of E. coli. Assuming the identity of the gene (gyrI) encoding GyrI with the previously reported genes yeeB and sbmC, we cloned the gene after amplification by polymerase chain reaction and purified the 18-kDa protein from an E. coli strain overexpressing it. The purified 18-kDa protein was confirmed to inhibit the supercoiling activity of DNA gyrase in vitro. In vivo, both overexpression and antisense expression of the gyrI gene induced filamentous growth of cells and suppressed cell proliferation. GyrI protein is the first identified chromosomally nucleoid-encoded regulatory factor of DNA gyrase in E. coli.
Identification and properties of the largest subunit of the DNA-dependent RNA polymerase of fish lymphocystis disease virus: dramatic difference in the domain organization in the family Iridoviridae.

PubMed

Müller, M; Schnitzler, P; Koonin, E V; Darai, G

1995-05-01

Cytoplasmic DNA viruses encode a DNA-dependent RNA polymerase (DdRP) that is essential for transcription of viral genes. The amino acid sequences of the known largest subunits of DdRPs from different species contain highly conserved regions. Oligonucleotide primers, deduced from two conserved domains (RQP[T/S]LH and NADFDGDE) were used for detecting the corresponding gene of fish lymphocystis disease virus (FLCDV), a member of the family Iridoviridae, which replicates in the cytoplasm of infected cells of flatfish. The gene coding for the largest subunit of the DdRP was identified using a PCR-derived probe. The screening of the complete EcoRI gene library of the viral genome led to the identification of the gene locus of the largest subunit of the DdRP within the EcoRI DNA fragment B (12.4 kbp, 0.034 to 0.165 map units). The nucleotide sequence of a part (8334 bp) of the EcoRI DNA fragment B was determined and a large ORF on the lower strand (ATG = 5787; TAA = 2190) was detected which encodes a protein of 1199 amino acids. Comparison of the amino acid sequences of the largest subunits of the DdRP (RPO1) of FLCDV and Chilo iridescent virus (CIV) revealed a dramatic difference in their domain organization. Unlike the 1051 aa RPO1 of CIV, which lacks the C-terminal domain conserved in eukaryotic, eubacterial and other viral RNA polymerases, the 1199 aa RPO1 of FLCDV is fully collinear with its cellular and viral homologues. Despite this difference, comparative analysis of the amino acid sequences of viral and cellular RNA polymerases suggests a common origin for the largest RNA polymerase subunits of FLCDV and CIV.

An Evolutionary Classification of Genomic Function

PubMed Central

Graur, Dan; Zheng, Yichen; Azevedo, Ricardo B.R.

2015-01-01

The pronouncements of the ENCODE Project Consortium regarding “junk DNA” exposed the need for an evolutionary classification of genomic elements according to their selected-effect function. In the classification scheme presented here, we divide the genome into “functional DNA,” that is, DNA sequences that have a selected-effect function, and “rubbish DNA,” that is, sequences that do not. Functional DNA is further subdivided into “literal DNA” and “indifferent DNA.” In literal DNA, the order of nucleotides is under selection; in indifferent DNA, only the presence or absence of the sequence is under selection. Rubbish DNA is further subdivided into “junk DNA” and “garbage DNA.” Junk DNA neither contributes to nor detracts from the fitness of the organism and, hence, evolves under selective neutrality. Garbage DNA, on the other hand, decreases the fitness of its carriers. Garbage DNA exists in the genome only because natural selection is neither omnipotent nor instantaneous. Each of these four functional categories can be 1) transcribed and translated, 2) transcribed but not translated, or 3) not transcribed. The affiliation of a DNA segment to a particular functional category may change during evolution: Functional DNA may become junk DNA, junk DNA may become garbage DNA, rubbish DNA may become functional DNA, and so on; however, determining the functionality or nonfunctionality of a genomic sequence must be based on its present status rather than on its potential to change (or not to change) in the future. Changes in functional affiliation are divided into pseudogenes, Lazarus DNA, zombie DNA, and Jekyll-to-Hyde DNA. PMID:25635041
Isolation of a polyphenol oxidase (PPO) cDNA from artichoke and expression analysis in wounded artichoke heads.

PubMed

Quarta, Angela; Mita, Giovanni; Durante, Miriana; Arlorio, Marco; De Paolis, Angelo

2013-07-01

The polyphenol oxidase (PPO) enzyme, which can catalyze the oxidation of phenolics to quinones, has been reported to be involved in undesirable browning in many plant foods. This phenomenon is particularly severe in artichoke heads wounded during the manufacturing process. A full-length cDNA encoding for a putative polyphenol oxidase (designated as CsPPO) along with a 1432 bp sequence upstream of the starting ATG codon was characterized for the first time from [Cynara cardunculus var. scolymus (L.) Fiori]. The 1764 bp CsPPO sequence encodes a putative protein of 587 amino acids with a calculated molecular mass of 65,327 Da and an isoelectric point of 5.50. Analysis of the promoter region revealed the presence of cis-acting elements, some of which are putatively involved in the response to light and wounds. Expression analysis of the gene in wounded capitula indicated that CsPPO was significantly induced after 48 h, even though the browning process had started earlier. This suggests that the early browning event observed in artichoke heads was not directly related to de novo mRNA synthesis. Finally, we provide the complete gene sequence encoding for polyphenol oxidase and the upstream regulative region in artichoke. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Localization of Action of the Is50-Encoded Transposase Protein

PubMed Central

Phadnis, Suhas H.; Sasakawa, Chihiro; Berg, Douglas E.

1986-01-01

The movement of the bacterial insertion sequence IS50 and of composite elements containing direct terminal repeats of IS50 involves the two ends of IS50, designated O (outside) and I (inside), which are weakly matched in DNA sequence, and an IS50 encoded protein, transposase, which recognizes the O and I ends and acts preferentially in cis. Previous data had suggested that, initially, transposase interacts preferentially with the O end sequence and then, in a second step, with either an O or an I end. To better understand the cis action of transposase and how IS50 ends are selected, we generated a series of composite transposons which contain direct repeats of IS50 elements. In each transposon, one IS50 element encoded transposase (tnp +), and the other contained a null (tnp-) allele. In each of the five sets of composite transposons studied, the transposon for which the tnp+ IS50 element contained its O end was more active than a complementary transposon for which the tnp - IS50 element contained its O end. This pattern of O end use suggests models in which the cis action of transposase and its choice of ends is determined by protein tracking along DNA molecules. PMID:3007274
DOE Office of Scientific and Technical Information (OSTI.GOV)

Feyereisen-Koener, J.M.

Double-stranded cDNA was prepared from infectious hematopoietic necrosis virus mRNA and cloned into the plasmid vector pUC8. A coprotein (G-protein) of infectious hematopoietic necrosis virus was selected by hybridization to a /sup 32/P-labeled probe. The restriction map and nucleotide sequence of the mRNA encoding the glycoprotein of infectious hematopoietic necrosis virus was determined using this full-length cDNA clone.
Glutathione S-transferase-encoding gene as a potential probe for environmental bacterial isolates capable of degrading polycyclic aromatic hydrocarbons.

PubMed Central

Lloyd-Jones, G; Lau, P C

1997-01-01

Homologs of the glutathione S-transferase (GST)-encoding gene were identified in a collection of aromatic hydrocarbon-degrading Sphingomonas spp. isolated from New Zealand, Antarctica, and the United States by using PCR primers designed from the GST-encoding gene of Sphingomonas paucimobilis EPA505. Sequence analysis of PCR fragments generated from these isolates and of the GST gene amplified from DNA extracted from polycyclic aromatic hydrocarbon (PAH)-contaminated soil revealed a high degree of conservation, which may make the GST-encoding gene a potentially useful marker for PAH-degrading bacteria. PMID:9251217
The First Chameleon Transcriptome: Comparative Genomic Analysis of the OXPHOS System Reveals Loss of COX8 in Iguanian Lizards

PubMed Central

Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

2013-01-01

Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system. PMID:24009133
The first Chameleon transcriptome: comparative genomic analysis of the OXPHOS system reveals loss of COX8 in Iguanian lizards.

PubMed

Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

2013-01-01

Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system.
Molecular cloning of chitinase 33 (chit33) gene from Trichoderma atroviride

PubMed Central

Matroudi, S.; Zamani, M.R.; Motallebi, M.

2008-01-01

In this study Trichoderma atroviride was selected as over producer of chitinase enzyme among 30 different isolates of Trichoderma sp. on the basis of chitinase specific activity. From this isolate the genomic and cDNA clones encoding chit33 have been isolated and sequenced. Comparison of genomic and cDNA sequences for defining gene structure indicates that this gene contains three short introns and also an open reading frame coding for a protein of 321 amino acids. The deduced amino acid sequence includes a 19 aa putative signal peptide. Homology between this sequence and other reported Trichoderma Chit33 proteins are discussed. The coding sequence of chit33 gene was cloned in pEt26b(+) expression vector and expressed in E. coli. PMID:24031242
A novel class of plant-specific zinc-dependent DNA-binding protein that binds to A/T-rich DNA sequences

PubMed Central

Nagano, Yukio; Furuhashi, Hirofumi; Inaba, Takehito; Sasaki, Yukiko

2001-01-01

Complementary DNA encoding a DNA-binding protein, designated PLATZ1 (plant AT-rich sequence- and zinc-binding protein 1), was isolated from peas. The amino acid sequence of the protein is similar to those of other uncharacterized proteins predicted from the genome sequences of higher plants. However, no paralogous sequences have been found outside the plant kingdom. Multiple alignments among these paralogous proteins show that several cysteine and histidine residues are invariant, suggesting that these proteins are a novel class of zinc-dependent DNA-binding proteins with two distantly located regions, C-x2-H-x11-C-x2-C-x(4–5)-C-x2-C-x(3–7)-H-x2-H and C-x2-C-x(10–11)-C-x3-C. In an electrophoretic mobility shift assay, the zinc chelator 1,10-o-phenanthroline inhibited DNA binding, and two distant zinc-binding regions were required for DNA binding. A protein blot with 65ZnCl2 showed that both regions are required for zinc-binding activity. The PLATZ1 protein non-specifically binds to A/T-rich sequences, including the upstream region of the pea GTPase pra2 and plastocyanin petE genes. Expression of the PLATZ1 repressed those of the reporter constructs containing the coding sequence of luciferase gene driven by the cauliflower mosaic virus (CaMV) 35S90 promoter fused to the tandem repeat of the A/T-rich sequences. These results indicate that PLATZ1 is a novel class of plant-specific zinc-dependent DNA-binding protein responsible for A/T-rich sequence-mediated transcriptional repression. PMID:11600698
Precursors of vertebrate peptide antibiotics dermaseptin b and adenoregulin have extensive sequence identities with precursors of opioid peptides dermorphin, dermenkephalin, and deltorphins.

PubMed

Amiche, M; Ducancel, F; Mor, A; Boulain, J C; Menez, A; Nicolas, P

1994-07-08

The dermaseptins are a family of broad spectrum antimicrobial peptides, 27-34 amino acids long, involved in the defense of the naked skin of frogs against microbial invasion. They are the first vertebrate peptides to show lethal effects against the filamentous fungi responsible for severe opportunistic infections accompanying immunodeficiency syndrome and the use of immunosuppressive agents. A cDNA library was constructed from skin poly(A+) RNA of the arboreal frog Phyllomedusa bicolor and screened with an oligonucleotide probe complementary to the COOH terminus of dermaseptin b. Several clones contained a full-length DNA copy of a 443-nucleotide mRNA that encoded a 78-residue dermaseptin b precursor protein. The deduced precursor contained a putative signal sequence at the NH2 terminus, a 20-residue spacer sequence extremely rich (60%) in glutamic and aspartic acids, and a single copy of a dermaseptin b progenitor sequence at the COOH terminus. One clone contained a complete copy of adenoregulin, a 33-residue peptide reported to enhance the binding of agonists to the A1 adenosine receptor. The mRNAs encoding adenoregulin and dermaseptin b were very similar: 70 and 75% nucleotide identities between the 5'- and 3'-untranslated regions, respectively; 91% amino acid identity between the signal peptides; 82% identity between the acidic spacer sequences; and 38% identity between adenoregulin and dermaseptin b. Because adenoregulin and dermaseptin b have similar precursor designs and antimicrobial spectra, adenoregulin should be considered as a new member of the dermaseptin family and alternatively named dermaseptin b II. Preprodermaseptin b and preproadenoregulin have considerable sequence identities to the precursors encoding the opioid heptapeptides dermorphin, dermenkephalin, and deltorphins. This similarity extended into the 5'-untranslated regions of the mRNAs. These findings suggest that the genes encoding the four preproproteins are all members of the same family despite the fact that they encode end products having very different biological activities. These genes might contain a homologous export exon comprising the 5'-untranslated region, the 22-residue signal peptide, the 20-24-residue acidic spacer, and the basic pair Lys-Arg.
A Plastidial Lysophosphatidic Acid Acyltransferase from Oilseed Rape1

PubMed Central

Bourgis, Fabienne; Kader, Jean-Claude; Barret, Pierre; Renard, Michel; Robinson, David; Robinson, Colin; Delseny, Michel; Roscoe, Thomas J.

1999-01-01

The biosynthesis of phosphatidic acid, a key intermediate in the biosynthesis of lipids, is controlled by lysophosphatidic acid (LPA, or 1-acyl-glycerol-3-P) acyltransferase (LPAAT, EC 2.3.1.51). We have isolated a cDNA encoding a novel LPAAT by functional complementation of the Escherichia coli mutant plsC with an immature embryo cDNA library of oilseed rape (Brassica napus). Transformation of the acyltransferase-deficient E. coli strain JC201 with the cDNA sequence BAT2 alleviated the temperature-sensitive phenotype of the plsC mutant and conferred a palmitoyl-coenzyme A-preferring acyltransferase activity to membrane fractions. The BAT2 cDNA encoded a protein of 351 amino acids with a predicted molecular mass of 38 kD and an isoelectric point of 9.7. Chloroplast-import experiments showed processing of a BAT2 precursor protein to a mature protein of approximately 32 kD, which was localized in the membrane fraction. BAT2 is encoded by a minimum of two genes that may be expressed ubiquitously. These data are consistent with the identity of BAT2 as the plastidial enzyme of the prokaryotic glycerol-3-P pathway that uses a palmitoyl-ACP to produce phosphatidic acid with a prokaryotic-type acyl composition. The homologies between the deduced protein sequence of BAT2 with prokaryotic and eukaryotic microsomal LAP acytransferases suggest that seed microsomal forms may have evolved from the plastidial enzyme. PMID:10398728
Nectinepsin: a new extracellular matrix protein of the pexin family. Characterization of a novel cDNA encoding a protein with an RGD cell binding motif.

PubMed

Blancher, C; Omri, B; Bidou, L; Pessac, B; Crisanti, P

1996-10-18

We report the isolation and characterization of a novel cDNA from quail neuroretina encoding a putative protein named nectinepsin. The nectinepsin cDNA identifies a major 2.2-kilobase mRNA that is detected from ED 5 in neuroretina and is increasingly abundant during embryonic development. A nectinepsin mRNA is also found in quail liver, brain, and intestine and in mouse retina. The deduced nectinepsin amino acid sequence contains the RGD cell binding motif of integrin ligands. Furthermore, nectinepsin shares substantial homologies with vitronectin and structural protein similarities with most of the matricial metalloproteases. However, the presence of a specific sequence and the lack of heparin and collagen binding domains of the vitronectin indicate that nectinepsin is a new extracellular matrix protein. Furthermore, genomic Southern blot studies suggest that nectinepsin and vitronectin are encoded by different genes. Western blot analysis with an anti-human vitronectin antiserum revealed, in addition to the 65- and 70-kDa vitronectin bands, an immunoreactive protein of about 54 kDa in all tissues containing nectinepsin mRNA. It seems likely that the form of vitronectin found in chick egg yolk plasma by Nagano et al. ((1992) J. Biol. Chem. 267, 24863-24870) is the protein that corresponds to the nectinepsin cDNA. This new protein could be an important molecule involved in the early steps of the development.
1-deoxy-d-xylulose-5-phosphate reductoisomerases and method of use

DOEpatents

Croteau, Rodney B.; Lange, Bernd M.

2001-01-01

The present invention relates to isolated DNA sequences which code for the expression of plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein, such as the sequence presented in SEQ ID NO:1 which encodes a 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein from peppermint (Mentha x piperita). Additionally, the present invention relates to isolated plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein. In other aspects, the present invention is directed to replicable recombinant cloning vehicles comprising a nucleic acid sequence which codes for a plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase, to modified host cells transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence of the invention.
1-deoxy-D-xylulose-5-phosphate reductoisomerases, and methods of use

DOEpatents

Croteau, Rodney B.; Lange, Bernd M.

2002-07-16

The present invention relates to isolated DNA sequences which code for the expression of plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein, such as the sequence presented in SEQ ID NO:1 which encodes a 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein from peppermint (Mentha x piperita). Additionally, the present invention relates to isolated plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein. In other aspects, the present invention is directed to replicable recombinant cloning vehicles comprising a nucleic acid sequence which codes for a plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase, to modified host cells transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence of the invention.
Control of transcriptional pausing by biased thermal fluctuations on repetitive genomic sequences

PubMed Central

Imashimizu, Masahiko; Afek, Ariel; Takahashi, Hiroki; Lubkowska, Lucyna; Lukatsky, David B.

2016-01-01

In the process of transcription elongation, RNA polymerase (RNAP) pauses at highly nonrandom positions across genomic DNA, broadly regulating transcription; however, molecular mechanisms responsible for the recognition of such pausing positions remain poorly understood. Here, using a combination of statistical mechanical modeling and high-throughput sequencing and biochemical data, we evaluate the effect of thermal fluctuations on the regulation of RNAP pausing. We demonstrate that diffusive backtracking of RNAP, which is biased by repetitive DNA sequence elements, causes transcriptional pausing. This effect stems from the increased microscopic heterogeneity of an elongation complex, and thus is entropy-dominated. This report shows a linkage between repetitive sequence elements encoded in the genome and regulation of RNAP pausing driven by thermal fluctuations. PMID:27830653
Cloning and characterization of a novel zinc finger gene in Xp11.2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Derry, J.M.J.; Jess, U.; Francke, U.

1995-11-20

During a systematic search for open reading frames in chromosome band Xp11.2, a novel gene (ZNF157) that encodes a putative 506-amino-acid protein with the sequence characteristics of a zinc-finger-containing transcription factor was isolated. ZNF157 is encoded by four exons distributed over >20 kb of genomic DNA. The second and third exons contain sequences similar to those of the previously described KRAB-A and KRAB-B domains, motifs that have been shown to mediate transcriptional repression in other members of the protein family. A fourth exon contains 12 zinc finger DNA binding motifs and finger linking regions characteristic of ZNF proteins of themore » Krueppel family. ZNF157 maps to the telomeric end of a cluster of ZNF genes that includes ZNF21, ZNF41, and ZNF81. 19 refs., 2 figs.« less
Localization of HTLV-I tax proviral DNA in mononuclear cells.

PubMed

Zucker-Franklin, Dorothea; Pancake, Bette A; Najfeld, Vesna

2003-01-01

The tax sequence of HTLV-I is demonstrable in the skin and blood mononuclear cells of patients with mycosis fungoides, as well as in the mononuclear leukocytes of some healthy blood donors, but was not demonstrable when PCR/Southern analyses were carried out on preparations of high-molecular-weight genomic DNA. Therefore, it was postulated that tax DNA may not be integrated. To investigate this possibility fluorescence in situ hybridization was carried out on cells arrested in metaphase, using a probe containing the HTLV-I tax proviral DNA full-length open reading frame coding sequence. While metaphases prepared from C91PL cells, a cell line infected with HTLV-I, showed an abundance of chromosome-associated as well as extra-chromosomal signals, metaphases prepared with blood mononuclear cells from healthy tax sequence positive donors did not reveal any tax DNA associated with chromosomes. Such signals were readily detected extra-chromosomally. Although it has been demonstrated that transactivation of genes by gene products encoded by extra-chromosomal DNA may have nosocomial implications, whether transactivation by p40 tax generated from extra-chromosomal tax sequences is responsible for the development of neoplasia remains to be investigated.
Mitochondrial DNA sequence data reveals association of haplogroup U with psychosis in bipolar disorder.

PubMed

Frye, Mark A; Ryu, Euijung; Nassan, Malik; Jenkins, Gregory D; Andreazza, Ana C; Evans, Jared M; McElroy, Susan L; Oglesbee, Devin; Highsmith, W Edward; Biernacka, Joanna M

2017-01-01

Converging genetic, postmortem gene-expression, cellular, and neuroimaging data implicate mitochondrial dysfunction in bipolar disorder. This study was conducted to investigate whether mitochondrial DNA (mtDNA) haplogroups and single nucleotide variants (SNVs) are associated with sub-phenotypes of bipolar disorder. MtDNA from 224 patients with Bipolar I disorder (BPI) was sequenced, and association of sequence variations with 3 sub-phenotypes (psychosis, rapid cycling, and adolescent illness onset) was evaluated. Gene-level tests were performed to evaluate overall burden of minor alleles for each phenotype. The haplogroup U was associated with a higher risk of psychosis. Secondary analyses of SNVs provided nominal evidence for association of psychosis with variants in the tRNA, ND4 and ND5 genes. The association of psychosis with ND4 (gene that encodes NADH dehydrogenase 4) was further supported by gene-level analysis. Preliminary analysis of mtDNA sequence data suggests a higher risk of psychosis with the U haplogroup and variation in the ND4 gene implicated in electron transport chain energy regulation. Further investigation of the functional consequences of this mtDNA variation is encouraged. Copyright Â© 2016. Published by Elsevier Ltd.
ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia.

PubMed

Landt, Stephen G; Marinov, Georgi K; Kundaje, Anshul; Kheradpour, Pouya; Pauli, Florencia; Batzoglou, Serafim; Bernstein, Bradley E; Bickel, Peter; Brown, James B; Cayting, Philip; Chen, Yiwen; DeSalvo, Gilberto; Epstein, Charles; Fisher-Aylor, Katherine I; Euskirchen, Ghia; Gerstein, Mark; Gertz, Jason; Hartemink, Alexander J; Hoffman, Michael M; Iyer, Vishwanath R; Jung, Youngsook L; Karmakar, Subhradip; Kellis, Manolis; Kharchenko, Peter V; Li, Qunhua; Liu, Tao; Liu, X Shirley; Ma, Lijia; Milosavljevic, Aleksandar; Myers, Richard M; Park, Peter J; Pazin, Michael J; Perry, Marc D; Raha, Debasish; Reddy, Timothy E; Rozowsky, Joel; Shoresh, Noam; Sidow, Arend; Slattery, Matthew; Stamatoyannopoulos, John A; Tolstorukov, Michael Y; White, Kevin P; Xi, Simon; Farnham, Peggy J; Lieb, Jason D; Wold, Barbara J; Snyder, Michael

2012-09-01

Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcription-factor binding and histone modifications in living cells. Despite its widespread use, there are considerable differences in how these experiments are conducted, how the results are scored and evaluated for quality, and how the data and metadata are archived for public use. These practices affect the quality and utility of any global ChIP experiment. Through our experience in performing ChIP-seq experiments, the ENCODE and modENCODE consortia have developed a set of working standards and guidelines for ChIP experiments that are updated routinely. The current guidelines address antibody validation, experimental replication, sequencing depth, data and metadata reporting, and data quality assessment. We discuss how ChIP quality, assessed in these ways, affects different uses of ChIP-seq data. All data sets used in the analysis have been deposited for public viewing and downloading at the ENCODE (http://encodeproject.org/ENCODE/) and modENCODE (http://www.modencode.org/) portals.
ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia

PubMed Central

Landt, Stephen G.; Marinov, Georgi K.; Kundaje, Anshul; Kheradpour, Pouya; Pauli, Florencia; Batzoglou, Serafim; Bernstein, Bradley E.; Bickel, Peter; Brown, James B.; Cayting, Philip; Chen, Yiwen; DeSalvo, Gilberto; Epstein, Charles; Fisher-Aylor, Katherine I.; Euskirchen, Ghia; Gerstein, Mark; Gertz, Jason; Hartemink, Alexander J.; Hoffman, Michael M.; Iyer, Vishwanath R.; Jung, Youngsook L.; Karmakar, Subhradip; Kellis, Manolis; Kharchenko, Peter V.; Li, Qunhua; Liu, Tao; Liu, X. Shirley; Ma, Lijia; Milosavljevic, Aleksandar; Myers, Richard M.; Park, Peter J.; Pazin, Michael J.; Perry, Marc D.; Raha, Debasish; Reddy, Timothy E.; Rozowsky, Joel; Shoresh, Noam; Sidow, Arend; Slattery, Matthew; Stamatoyannopoulos, John A.; Tolstorukov, Michael Y.; White, Kevin P.; Xi, Simon; Farnham, Peggy J.; Lieb, Jason D.; Wold, Barbara J.; Snyder, Michael

2012-01-01

Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcription-factor binding and histone modifications in living cells. Despite its widespread use, there are considerable differences in how these experiments are conducted, how the results are scored and evaluated for quality, and how the data and metadata are archived for public use. These practices affect the quality and utility of any global ChIP experiment. Through our experience in performing ChIP-seq experiments, the ENCODE and modENCODE consortia have developed a set of working standards and guidelines for ChIP experiments that are updated routinely. The current guidelines address antibody validation, experimental replication, sequencing depth, data and metadata reporting, and data quality assessment. We discuss how ChIP quality, assessed in these ways, affects different uses of ChIP-seq data. All data sets used in the analysis have been deposited for public viewing and downloading at the ENCODE (http://encodeproject.org/ENCODE/) and modENCODE (http://www.modencode.org/) portals. PMID:22955991

Informational structure of genetic sequences and nature of gene splicing

NASA Astrophysics Data System (ADS)

Trifonov, E. N.

1991-10-01

Only about 1/20 of DNA of higher organisms codes for proteins, by means of classical triplet code. The rest of DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence ``talks'' to various bimolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus, being a composite structure in which one had the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of the Gnomic, language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their ``ontogenetic'' way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications is to resolve such conflicts. New data are presented strongly indicating that the gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.
Analysis of the enzymatic formation of citral in the glands of sweet basil.

PubMed

Iijima, Yoko; Wang, Guodong; Fridman, Eyal; Pichersky, Eran

2006-04-15

Basil glands of the Sweet Dani cultivar contain high levels of citral, a mixture of geranial and its cis-isomer neral, as well as low levels of geraniol and nerol. We have previously reported the identification of a cDNA from Sweet Dani that encodes an enzyme responsible for the formation of geraniol from geranyl diphosphate in the glands, and that these glands cannot synthesize nerol directly from geranyl diphosphate. Here, we report the identification of two basil cDNAs encoding NADP+-dependent dehydrogenases that can use geraniol as the substrate. One cDNA, designated CAD1, represents a gene whose expression is highly specific to gland cells of all three basil cultivars examined, regardless of their citral content, and encodes an enzyme with high sequence similarity to known cinnamyl alcohol dehydrogenases (CADs). The enzyme encoded by CAD1 reversibly oxidizes geraniol to produce geranial (which reversibly isomerizes to neral via keto-enol tautomerization) at half the efficiency compared with its activity with cinnamyl alcohol. CAD1 does not use nerol and neral as substrates. A second cDNA, designated GEDH1, encodes an enzyme with sequence similarity to CAD1 that is capable of reversibly oxidizing geraniol and nerol in equal efficiency, and prolonged incubation of geraniol with GEDH1 in vitro produces not only geranial and neral, but also nerol. GEDH1 is also active, although at a lower efficiency, with cinnamyl alcohol. However, GEDH1 is expressed at low levels in glands of all cultivars compared with its expression in leaves. These and additional data presented indicate that basil glands may contain additional dehydrogenases capable of oxidizing geraniol.
High-resolution characterization of sequence signatures due to non-random cleavage of cell-free DNA.

PubMed

Chandrananda, Dineika; Thorne, Natalie P; Bahlo, Melanie

2015-06-17

High-throughput sequencing of cell-free DNA fragments found in human plasma has been used to non-invasively detect fetal aneuploidy, monitor organ transplants and investigate tumor DNA. However, many biological properties of this extracellular genetic material remain unknown. Research that further characterizes circulating DNA could substantially increase its diagnostic value by allowing the application of more sophisticated bioinformatics tools that lead to an improved signal to noise ratio in the sequencing data. In this study, we investigate various features of cell-free DNA in plasma using deep-sequencing data from two pregnant women (>70X, >50X) and compare them with matched cellular DNA. We utilize a descriptive approach to examine how the biological cleavage of cell-free DNA affects different sequence signatures such as fragment lengths, sequence motifs at fragment ends and the distribution of cleavage sites along the genome. We show that the size distributions of these cell-free DNA molecules are dependent on their autosomal and mitochondrial origin as well as the genomic location within chromosomes. DNA mapping to particular microsatellites and alpha repeat elements display unique size signatures. We show how cell-free fragments occur in clusters along the genome, localizing to nucleosomal arrays and are preferentially cleaved at linker regions by correlating the mapping locations of these fragments with ENCODE annotation of chromatin organization. Our work further demonstrates that cell-free autosomal DNA cleavage is sequence dependent. The region spanning up to 10 positions on either side of the DNA cleavage site show a consistent pattern of preference for specific nucleotides. This sequence motif is present in cleavage sites localized to nucleosomal cores and linker regions but is absent in nucleosome-free mitochondrial DNA. These background signals in cell-free DNA sequencing data stem from the non-random biological cleavage of these fragments. This sequence structure can be harnessed to improve bioinformatics algorithms, in particular for CNV and structural variant detection. Descriptive measures for cell-free DNA features developed here could also be used in biomarker analysis to monitor the changes that occur during different pathological conditions.
Re-entrant DNA gels

PubMed Central

Bomboi, Francesca; Romano, Flavio; Leo, Manuela; Fernandez-Castanon, Javier; Cerbino, Roberto; Bellini, Tommaso; Bordi, Federico; Filetici, Patrizia; Sciortino, Francesco

2016-01-01

DNA is acquiring a primary role in material development, self-assembling by design into complex supramolecular aggregates, the building block of a new-materials world. Using DNA nanoconstructs to translate sophisticated theoretical intuitions into experimental realizations by closely matching idealized models of colloidal particles is a much less explored avenue. Here we experimentally show that an appropriate selection of competing interactions enciphered in multiple DNA sequences results into the successful design of a one-pot DNA hydrogel that melts both on heating and on cooling. The relaxation time, measured by light scattering, slows down dramatically in a limited window of temperatures. The phase diagram displays a peculiar re-entrant shape, the hallmark of the competition between different bonding patterns. Our study shows that it is possible to rationally design biocompatible bulk materials with unconventional phase diagrams and tuneable properties by encoding into DNA sequences both the particle shape and the physics of the collective response. PMID:27767029
Human Ro60 (SSA2) genomic organization and sequence alterations, examined in cutaneous lupus erythematosus.

PubMed

Millard, T P; Ashton, G H S; Kondeatis, E; Vaughan, R W; Hughes, G R V; Khamashta, M A; Hawk, J L M; McGregor, J M; McGrath, J A

2002-02-01

The Ro 60 kDa protein (Ro60 or SSA2) is the major component of the Ro ribonucleoprotein (Ro RNP) complex, to which an immune response is a specific feature of several autoimmune diseases. The genomic organization and any sequence variation within the DNA encoding Ro60 are unknown. To characterize the Ro60 gene structure and to assess whether any sequence alterations might be associated with serum anti-Ro antibody in subacute cutaneous lupus erythematosus (SCLE), thus potentially providing new insight into disease pathogenesis. The cDNA sequence for Ro60 was obtained from the NCBI database and used for a BLAST search for a clone containing the entire genomic sequence. The intron-exon borders were confirmed by designing intronic primer pairs to flank each exon, which were then used to amplify genomic DNA for automated sequencing from 36 caucasian patients with SCLE (anti-Ro positive) and 49 with discoid LE (DLE, anti-Ro negative), in addition to 36 healthy caucasian controls. Heteroduplex analysis of polymerase chain reaction (PCR) products from patients and controls spanning all Ro60 exons (1-8) revealed a common bandshift in the PCR products spanning exon 7. Sequencing of the corresponding PCR products demonstrated an A > G substitution at nucleotide position 1318-7, within the consensus acceptor splice site of exon 7 (GenBank XM001901). The allele frequencies were major allele A (0.71) and minor allele G (0.29) in 72 control chromosomes, with no significant differences found between SCLE patients, DLE patients and controls. The genomic organization of the DNA encoding the Ro60 protein is described, including a common polymorphism within the consensus acceptor splice site of exon 7. Our delineation of a strategy for the genomic amplification of Ro60 forms a basis for further examination of the pathological functions of the Ro RNP in autoimmune disease.
Rapidly expanding genetic diversity and host range of the Circoviridae viral family and other Rep encoding small circular ssDNA genomes.

PubMed

Delwart, Eric; Li, Linlin

2012-03-01

The genomes of numerous circoviruses and distantly related circular ssDNA viruses encoding a rolling circle replication initiator protein (Rep) have been characterized from the tissues of mammals, fish, insects, plants (geminivirus and nanovirus), in human and animal feces, in an algae cell, and in diverse environmental samples. We review the genome organization, phylogenetic relationships and initial prevalence studies of cycloviruses, a proposed new genus in the Circoviridae family. Viral fossil rep sequences were also recently identified integrated on the chromosomes of mammals, frogs, lancelets, crustaceans, mites, gastropods, roundworms, placozoans, hydrozoans, protozoans, land plants, fungi, algae, and phytoplasma bacterias and their plasmids, reflecting the very wide past host range of rep bearing viruses. An ancient origin for viruses with Rep-encoding small circular ssDNA genomes, predating the diversification of eukaryotes, is discussed. The cellular hosts and pathogenicity of many recently described rep-containing circular ssDNA genomes remain to be determined. Future studies of the virome of single cell and multi-cellular eukaryotes are likely to further extend the known diversity and host-range of small rep-containing circular ssDNA viral genomes. Copyright © 2011 Elsevier B.V. All rights reserved.
Microaspiration of esophageal gland cells and cDNA library construction for identifying parasitism genes of plant-parasitic nematodes.

PubMed

Hussey, Richard S; Huang, Guozhong; Allen, Rex

2011-01-01

Identifying parasitism genes encoding proteins secreted from a plant-parasitic nematode's esophageal gland cells and injected through its stylet into plant tissue is the key to understanding the molecular basis of nematode parasitism of plants. Parasitism genes have been cloned by directly microaspirating the cytoplasm from the esophageal gland cells of different parasitic stages of cyst or root-knot nematodes to provide mRNA to create a gland cell-specific cDNA library by long-distance reverse-transcriptase polymerase chain reaction. cDNA clones are sequenced and deduced protein sequences with a signal peptide for secretion are identified for high-throughput in situ hybridization to confirm gland-specific expression.
Characterization of an In Vivo Z-DNA Detection Probe Based on a Cell Nucleus Accumulating Intrabody.

PubMed

Gulis, Galina; Silva, Izabel Cristina Rodrigues; Sousa, Herdson Renney; Sousa, Isabel Garcia; Bezerra, Maryani Andressa Gomes; Quilici, Luana Salgado; Maranhao, Andrea Queiroz; Brigido, Marcelo Macedo

2016-09-01

Left-handed Z-DNA is a physiologically unstable DNA conformation, and its existence in vivo can be attributed to localized torsional distress. Despite evidence for the existence of Z-DNA in vivo, its precise role in the control of gene expression is not fully understood. Here, an in vivo probe based on an anti-Z-DNA intrabody is proposed for native Z-DNA detection. The probe was used for chromatin immunoprecipitation of potential Z-DNA-forming sequences in the human genome. One of the isolated putative Z-DNA-forming sequences was cloned upstream of a reporter gene expression cassette under control of the CMV promoter. The reporter gene encoded an antibody fragment fused to GFP. Transient co-transfection of this vector along with the Z-probe coding vector improved reporter gene expression. This improvement was demonstrated by measuring reporter gene mRNA and protein levels and the amount of fluorescence in co-transfected CHO-K1 cells. These results suggest that the presence of the anti-Z-DNA intrabody can interfere with a Z-DNA-containing reporter gene expression. Therefore, this in vivo probe for the detection of Z-DNA could be used for global correlation of Z-DNA-forming sequences and gene expression regulation.
Novel Immune Modulating Cellular Vaccine for Prostate Cancer

DTIC Science & Technology

2014-10-01

restriction sites. Murine PSMA : The cDNA encoding mPSMA was purchased from Sino Biologicals and was cloned into the HindIII and BamHI sites of pSP73-Sph/A64...sequence) and reverse primer 5’-TATATAGAGCTCTCAGATGTTCCGATACACATCTC-3’ Murine PSMA no signal sequence (mPSMA-SS): Murine PSMA minus the signal sequence...contains a HindIII site for cloning and utilizes an ATG that lies downstream of the signal sequence as the start codon in PSMA -SS ( PSMA without signal
Primary structure of the Aequorea victoria green-fluorescent protein.

PubMed

Prasher, D C; Eckenrode, V K; Ward, W W; Prendergast, F G; Cormier, M J

1992-02-15

Many cnidarians utilize green-fluorescent proteins (GFPs) as energy-transfer acceptors in bioluminescence. GFPs fluoresce in vivo upon receiving energy from either a luciferase-oxyluciferin excited-state complex or a Ca(2+)-activated phosphoprotein. These highly fluorescent proteins are unique due to the chemical nature of their chromophore, which is comprised of modified amino acid (aa) residues within the polypeptide. This report describes the cloning and sequencing of both cDNA and genomic clones of GFP from the cnidarian, Aequorea victoria. The gfp10 cDNA encodes a 238-aa-residue polypeptide with a calculated Mr of 26,888. Comparison of A. victoria GFP genomic clones shows three different restriction enzyme patterns which suggests that at least three different genes are present in the A. victoria population at Friday Harbor, Washington. The gfp gene encoded by the lambda GFP2 genomic clone is comprised of at least three exons spread over 2.6 kb. The nucleotide sequences of the cDNA and the gene will aid in the elucidation of structure-function relationships in this unique class of proteins.
Cell-Free Expression and In Situ Immobilization of Parasite Proteins from Clonorchis sinensis for Rapid Identification of Antigenic Candidates

PubMed Central

Ju, Jung Won; Kim, Ho-Cheol; Shin, Hyun-Il; Kim, Yu Jung; Kim, Dong-Myung

2015-01-01

Progress towards genetic sequencing of human parasites has provided the groundwork for a post-genomic approach to develop novel antigens for the diagnosis and treatment of parasite infections. To fully utilize the genomic data, however, high-throughput methodologies are required for functional analysis of the proteins encoded in the genomic sequences. In this study, we investigated cell-free expression and in situ immobilization of parasite proteins as a novel platform for the discovery of antigenic proteins. PCR-amplified parasite DNA was immobilized on microbeads that were also functionalized to capture synthesized proteins. When the microbeads were incubated in a reaction mixture for cell-free synthesis, proteins expressed from the microbead-immobilized DNA were instantly immobilized on the same microbeads, providing a physical linkage between the genetic information and encoded proteins. This approach of in situ expression and isolation enables streamlined recovery and analysis of cell-free synthesized proteins and also allows facile identification of the genes coding antigenic proteins through direct PCR of the microbead-bound DNA. PMID:26599101
DOE Office of Scientific and Technical Information (OSTI.GOV)

Schlagnhaufer, C.D.; Arteca, R.N.; Pell, E.J.

When potato plants (Solanum tuberosum L. cv Norland) are subjected to oxone stress ethylene is emitted. Increases in ethylene production are often the result of increased expression of the enzyme ACC synthase. We used the polymerase chain reaction (PCR) to clone a cDNA encoding an ozone-induced ACC synthase. After treating potato plants with 300 ppb ozone for 4 h, RNA was extracted using a guanidinium isothiocyanate method. Using degenerate oligonucleotides corresponding to several conserved regions of ACC synthase sequences reported from different plant tissues as primers, we were able to reverse transcribe the RNA and amplify a cDNA for ACCmore » synthase. The clone is 1098 bp in length encoding for 386 amino acids comprising [approximately]80% of the protein. Computer analysis of the deduced amino acid sequence showed that our clone is 50-70% homologous with ACC synthase genes cloned from other plant tissues. Using the cDNA as a probe in northern analysis we found that there is little or no expression in control tissue: however there is a large increase in the expression of the ACC synthase message in response to ozone treatment.« less
Nucleotide sequence of a complementary DNA encoding pea cytosolic copper/zinc superoxide dismutase. [Pisum sativum L

DOE Office of Scientific and Technical Information (OSTI.GOV)

White, D.A.; Zilinskas, B.A.

1991-08-01

The authors now report the nucleotide sequence of the cytosolic Cu/Zn SOD cloned from a {lambda}gt11 cDNA library constructed from mRNA extracted from leaves of 7- to 10-d pea seedlings (Pisum sativum L.). The clone was isolated using a 22-base synthetic oligonucleotide complementary to the amino acid sequence CGIIGLQG. This sequence, found at the protein's carboxy terminus, is highly conserved among plant cytosolic Cu/Zn SODs but not chloroplastic Cu/Zn SODs. The 738-base pair sequence contains an open reading frame specifying 152 codons and a predicted M{sub r} of 18,024 D. The deduced amino acid sequence is highly homologous (79-82% identity)more » with the sequences of other known plant cytosolic Cu/Zn SODs but less highly conserved (63-65%) when compared with several chloroplastic Cu/Zn SODs including pea (10).« less
Plant fatty acid hydroxylases

DOEpatents

Somerville, Chris; Broun, Pierre; van de Loo, Frank

2001-01-01

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.
Feasibility of Screening for Antibiotic Resistance - Part I

DTIC Science & Technology

2005-09-01

Klebsiella pneumonia (AF052258), Providencia stuartii (AF052259), Serratia marcescens (AF052260). After alignment of the sequences of the various...intracellular targets of fluoroquinolones , the type II DNA topoisomerases gyrase and topoisomerase IV [Balas et al., 1998; Weigel et al., 1998]. Mutations in...the quinolones resistant determining region (QRDR) of the parC gene, which encodes the A subunit of topoisomerase , and the gyrA gene, which encodes
Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome.

PubMed

Wu, Jia Qian; Du, Jiang; Rozowsky, Joel; Zhang, Zhengdong; Urban, Alexander E; Euskirchen, Ghia; Weissman, Sherman; Gerstein, Mark; Snyder, Michael

2008-01-03

Recent studies of the mammalian transcriptome have revealed a large number of additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced. We have interrogated the transcribed loci in 420 selected ENCyclopedia Of DNA Elements (ENCODE) regions using rapid amplification of cDNA ends (RACE) sequencing. We analyzed annotated known gene regions, but primarily we focused on novel transcriptionally active regions (TARs), which were previously identified by high-density oligonucleotide tiling arrays and on random regions that were not believed to be transcribed. We found RACE sequencing to be very sensitive and were able to detect low levels of transcripts in specific cell types that were not detectable by microarrays. We also observed many instances of sense-antisense transcripts; further analysis suggests that many of the antisense transcripts (but not all) may be artifacts generated from the reverse transcription reaction. Our results show that the majority of the novel TARs analyzed (60%) are connected to other novel TARs or known exons. Of previously unannotated random regions, 17% were shown to produce overlapping transcripts. Furthermore, it is estimated that 9% of the novel transcripts encode proteins. We conclude that RACE sequencing is an efficient, sensitive, and highly accurate method for characterization of the transcriptome of specific cell/tissue types. Using this method, it appears that much of the genome is represented in polyA+ RNA. Moreover, a fraction of the novel RNAs can encode protein and are likely to be functional.
Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

DOEpatents

Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

2001-01-01

cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.
Novel mutation in the replication focus targeting sequence domain of DNMT1 causes hereditary sensory and autonomic neuropathy IE.

PubMed

Yuan, Junhui; Higuchi, Yujiro; Nagado, Tatsui; Nozuma, Satoshi; Nakamura, Tomonori; Matsuura, Eiji; Hashiguchi, Akihiro; Sakiyama, Yusuke; Yoshimura, Akiko; Takashima, Hiroshi

2013-03-01

DNMT1, encoding DNA methyltransferase 1 (Dnmt1), is a critical enzyme which is mainly responsible for conversion of unmethylated DNA into hemimethylated DNA. To date, two phenotypes produced by DNMT1 mutations have been reported, including hereditary sensory and autonomic neuropathy (HSAN) type IE with mutations in exon 20, and autosomal dominant cerebellar ataxia, deafness, and narcolepsy caused by mutations in exon 21. We report a sporadic case in a Japanese patient with loss of pain and vibration sense, chronic osteomyelitis, autonomic system dysfunctions, hearing loss, and mild dementia, but without definite cerebellar ataxia. Electrophysiological studies revealed absent sensory nerve action potential with nearly normal motor nerve conduction studies. Brain magnetic resonance imaging revealed mild diffuse cerebral and cerebellar atrophy. Using a next-generation sequencing system, 16 candidate genes were analyzed and a novel missense mutation, c.1706A>G (p.His569Arg), was identified in exon 21 of DNMT1. Our findings suggest that mutation in exon 21 of DNMT1 may also produce a HSAN phenotype. Because all reported mutations of DNMT1 are concentrated in exons 20 and 21, which encode the replication focus targeting sequence (RFTS) domain of Dnmt1, the RFTS domain could be a mutation hot spot. © 2013 Peripheral Nerve Society.
Molecular cloning and characterization of novel phytocystatin gene from turmeric, Curcuma longa.

PubMed

Chan, Seow-Neng; Abu Bakar, Norliza; Mahmood, Maziah; Ho, Chai-Ling; Shaharuddin, Noor Azmi

2014-01-01

Phytocystatin, a type of protease inhibitor (PI), plays major roles in plant defense mechanisms and has been reported to show antipathogenic properties and plant stress tolerance. Recombinant plant PIs are gaining popularity as potential candidates in engineering of crop protection and in synthesizing medicine. It is therefore crucial to identify PI from novel sources like Curcuma longa as it is more effective in combating against pathogens due to its novelty. In this study, a novel cDNA fragment encoding phytocystatin was isolated using degenerate PCR primers, designed from consensus regions of phytocystatin from other plant species. A full-length cDNA of the phytocystatin gene, designated CypCl, was acquired using 5'/3' rapid amplification of cDNA ends method and it has been deposited in NCBI database (accession number KF545954.1). It has a 687 bp long open reading frame (ORF) which encodes 228 amino acids. BLAST result indicated that CypCl is similar to cystatin protease inhibitor from Cucumis sativus with 74% max identity. Sequence analysis showed that CypCl contains most of the motifs found in a cystatin, including a G residue, LARFAV-, QxVxG sequence, PW dipeptide, and SNSL sequence at C-terminal extension. Phylogenetic studies also showed that CypCl is related to phytocystatin from Elaeis guineensis.
Molecular Cloning and Characterization of Novel Phytocystatin Gene from Turmeric, Curcuma longa

PubMed Central

Chan, Seow-Neng; Abu Bakar, Norliza; Mahmood, Maziah; Ho, Chai-Ling

2014-01-01

Phytocystatin, a type of protease inhibitor (PI), plays major roles in plant defense mechanisms and has been reported to show antipathogenic properties and plant stress tolerance. Recombinant plant PIs are gaining popularity as potential candidates in engineering of crop protection and in synthesizing medicine. It is therefore crucial to identify PI from novel sources like Curcuma longa as it is more effective in combating against pathogens due to its novelty. In this study, a novel cDNA fragment encoding phytocystatin was isolated using degenerate PCR primers, designed from consensus regions of phytocystatin from other plant species. A full-length cDNA of the phytocystatin gene, designated CypCl, was acquired using 5′/3′ rapid amplification of cDNA ends method and it has been deposited in NCBI database (accession number KF545954.1). It has a 687 bp long open reading frame (ORF) which encodes 228 amino acids. BLAST result indicated that CypCl is similar to cystatin protease inhibitor from Cucumis sativus with 74% max identity. Sequence analysis showed that CypCl contains most of the motifs found in a cystatin, including a G residue, LARFAV-, QxVxG sequence, PW dipeptide, and SNSL sequence at C-terminal extension. Phylogenetic studies also showed that CypCl is related to phytocystatin from Elaeis guineensis. PMID:25853138

Identification and transcription profiling of NDUFS8 in Aedes taeniorhynchus (Diptera:Culididae): developmental regulation and environmental response

USDA-ARS?s Scientific Manuscript database

The cDNA of a NADH dehydrogenase -ubiquinone Fe-S protein 8 subunit (NDUFS8) gene from Aedes (Ochlerotatus) taeniorhynchus Wiedemann has been cloned and sequenced. The full-length mRNA sequence (824 bp) of AetNDUFS8 encodes an open reading region of 651 bp (i.e., 217 amino acids). To detect whether ...
CAPRRESI: Chimera Assembly by Plasmid Recovery and Restriction Enzyme Site Insertion.

PubMed

Santillán, Orlando; Ramírez-Romero, Miguel A; Dávila, Guillermo

2017-06-25

Here, we present chimera assembly by plasmid recovery and restriction enzyme site insertion (CAPRRESI). CAPRRESI benefits from many strengths of the original plasmid recovery method and introduces restriction enzyme digestion to ease DNA ligation reactions (required for chimera assembly). For this protocol, users clone wildtype genes into the same plasmid (pUC18 or pUC19). After the in silico selection of amino acid sequence regions where chimeras should be assembled, users obtain all the synonym DNA sequences that encode them. Ad hoc Perl scripts enable users to determine all synonym DNA sequences. After this step, another Perl script searches for restriction enzyme sites on all synonym DNA sequences. This in silico analysis is also performed using the ampicillin resistance gene (ampR) found on pUC18/19 plasmids. Users design oligonucleotides inside synonym regions to disrupt wildtype and ampR genes by PCR. After obtaining and purifying complementary DNA fragments, restriction enzyme digestion is accomplished. Chimera assembly is achieved by ligating appropriate complementary DNA fragments. pUC18/19 vectors are selected for CAPRRESI because they offer technical advantages, such as small size (2,686 base pairs), high copy number, advantageous sequencing reaction features, and commercial availability. The usage of restriction enzymes for chimera assembly eliminates the need for DNA polymerases yielding blunt-ended products. CAPRRESI is a fast and low-cost method for fusing protein-coding genes.
What Information is Stored in DNA: Does it Contain Digital Error Correcting Codes?

NASA Astrophysics Data System (ADS)

Liebovitch, Larry

1998-03-01

The longest term correlations in living systems are the information stored in DNA which reflects the evolutionary history of an organism. The 4 bases (A,T,G,C) encode sequences of amino acids as well as locations of binding sites for proteins that regulate DNA. The fidelity of this important information is maintained by ANALOG error check mechanisms. When a single strand of DNA is replicated the complementary base is inserted in the new strand. Sometimes the wrong base is inserted that sticks out disrupting the phosphate backbone. The new base is not yet methylated, so repair enzymes, that slide along the DNA, can tear out the wrong base and replace it with the right one. The bases in DNA form a sequence of 4 different symbols and so the information is encoded in a DIGITAL form. All the digital codes in our society (ISBN book numbers, UPC product codes, bank account numbers, airline ticket numbers) use error checking code, where some digits are functions of other digits to maintain the fidelity of transmitted informaiton. Does DNA also utitlize a DIGITAL error chekcing code to maintain the fidelity of its information and increase the accuracy of replication? That is, are some bases in DNA functions of other bases upstream or downstream? This raises the interesting mathematical problem: How does one determine whether some symbols in a sequence of symbols are a function of other symbols. It also bears on the issue of determining algorithmic complexity: What is the function that generates the shortest algorithm for reproducing the symbol sequence. The error checking codes most used in our technology are linear block codes. We developed an efficient method to test for the presence of such codes in DNA. We coded the 4 bases as (0,1,2,3) and used Gaussian elimination, modified for modulus 4, to test if some bases are linear combinations of other bases. We used this method to analyze the base sequence in the genes from the lac operon and cytochrome C. We did not find evidence for such error correcting codes in these genes. However, we analyzed only a small amount of DNA and if digitial error correcting schemes are present in DNA, they may be more subtle than such simple linear block codes. The basic issue we raise here, is how information is stored in DNA and an appreciation that digital symbol sequences, such as DNA, admit of interesting schemes to store and protect the fidelity of their information content. Liebovitch, Tao, Todorov, Levine. 1996. Biophys. J. 71:1539-1544. Supported by NIH grant EY6234.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

PubMed

Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

2012-02-01

The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes

PubMed Central

Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

2012-01-01

The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404
Aureobasidium pullulans xylanase, gene and signal sequence

DOEpatents

Xin-Liang, Li; Ljungdahl, Lars G.

1997-01-01

A xylanase from Aureobasidium pullulans having a high specific activity is provided as well as a signal protein for controlling excretion into cell culture medium of proteins to which it is attached. DNA encoding these proteins is also provided.
Sequence of a cDNA and expression of the gene encoding a putative epidermal chitin synthase of Manduca sexta.

PubMed

Zhu, Yu-Cheng; Specht, Charles A; Dittmer, Neal T; Muthukrishnan, Subbaratnam; Kanost, Michael R; Kramer, Karl J

2002-11-01

Glycosyltransferases are enzymes that synthesize oligosaccharides, polysaccharides and glycoconjugates. One type of glycosyltransferase is chitin synthase, a very important enzyme in biology, which is utilized by insects, fungi, and other invertebrates to produce chitin, a polysaccharide of beta-1,4-linked N-acetylglucosamine. Chitin is an important component of the insect's exoskeletal cuticle and gut lining. To identify and characterize a chitin synthase gene of the tobacco hornworm, Manduca sexta, degenerate primers were designed from two highly conserved regions in fungal and nematode chitin synthase protein sequences and then used to amplify a similar region from Manduca cDNA. A full-length cDNA of 5152 nucleotides was assembled for the putative Manduca chitin synthase gene, MsCHS1, and sequencing of genomic DNA verified the contiguity of the sequence. The MsCHS1 cDNA has an ORF of 4692 nucleotides that encodes a transmembrane protein of 1564 amino acid residues with a mass of approximately 179 kDa (GenBank no. AY062175). It is most similar, over its entire length of protein sequence, to putative chitin synthases from other insects and nematodes, with 68% identity to enzymes from both the blow fly, Lucilia cuprina, and the fruit fly, Drosophila melanogaster. The similarity with fungal chitin synthases is restricted to the putative catalytic domain, and the MsCHS1 protein has, at equivalent positions, several amino acids that are essential for activity as revealed by mutagenesis of the fungal enzymes. A 5.3-kb transcript of MsCHS1 was identified by northern blot hybridization of RNA from larval epidermis, suggesting that the enzyme functions to make chitin deposited in the cuticle. Further examination by RT-PCR showed that MsCHS1 expression is regulated in the epidermis, with the amount of transcript increasing during phases of cuticle deposition.
The role of DNA repair in herpesvirus pathogenesis.

PubMed

Brown, Jay C

2014-10-01

In cells latently infected with a herpesvirus, the viral DNA is present in the cell nucleus, but it is not extensively replicated or transcribed. In this suppressed state the virus DNA is vulnerable to mutagenic events that affect the host cell and have the potential to destroy the virus' genetic integrity. Despite the potential for genetic damage, however, herpesvirus sequences are well conserved after reactivation from latency. To account for this apparent paradox, I have tested the idea that host cell-encoded mechanisms of DNA repair are able to control genetic damage to latent herpesviruses. Studies were focused on homologous recombination-dependent DNA repair (HR). Methods of DNA sequence analysis were employed to scan herpesvirus genomes for DNA features able to activate HR. Analyses were carried out with a total of 39 herpesvirus DNA sequences, a group that included viruses from the alpha-, beta- and gamma-subfamilies. The results showed that all 39 genome sequences were enriched in two or more of the eight recombination-initiating features examined. The results were interpreted to indicate that HR can stabilize latent herpesvirus genomes. The results also showed, unexpectedly, that repair-initiating DNA features differed in alpha- compared to gamma-herpesviruses. Whereas inverted and tandem repeats predominated in alpha-herpesviruses, gamma-herpesviruses were enriched in short, GC-rich initiation sequences such as CCCAG and depleted in repeats. In alpha-herpesviruses, repair-initiating repeat sequences were found to be concentrated in a specific region (the S segment) of the genome while repair-initiating short sequences were distributed more uniformly in gamma-herpesviruses. The results suggest that repair pathways are activated differently in alpha- compared to gamma-herpesviruses. Copyright © 2014. Published by Elsevier Inc.
Structure of genes and an insertion element in the methane producing archaebacterium Methanobrevibacter smithii.

PubMed

Hamilton, P T; Reeve, J N

1985-01-01

DNA fragments cloned from the methanogenic archaebacterium Methanobrevibacter smithii which complement mutations in the purE and proC genes of E. coli have been sequenced. Sequence analyses, transposon mutagenesis and expression in E. coli minicells indicate that purE and proC complementations result from the synthesis of M. smithii polypeptides with molecular weights of 36,697 and 27,836 respectively. The encoding genes appear to be located in operons. The M. smithii genome contains 69% A/T basepairs (bp) which is reflected in unusual codon usages and intergenic regions containing approximately 85% A/T bp. An insertion element, designated ISM1, was found within the cloned M. smithii DNA located adjacent to the proC complementing region. ISM1 is 1381 bp in length, has 29 bp terminal inverted repeat sequences and contains one major ORF encoded in 87% of the ISM1 sequence. ISM1 is mobile, present in approximately 10 copies per genome and integration duplicates 8 bp at the site of insertion. The duplicated sequences show homology with sequences within the 29 bp terminal repeat sequence of ISM1. Comparison of our data with sequences from halophilic archaebacteria suggests that 5'GAANTTTCA and 5'TTTTAATATAAA may be consensus promoter sequences for archaebacteria. These sequences closely resemble the consensus sequences which precede Drosophila heat-shock genes (Pelham 1982; Davidson et al. 1983). Methanogens appear to employ the eubacterial system of mRNA: 16SrRNA hybridization to ensure initiation of translation; the consensus ribosome binding sequence is 5'AGGTGA.
Hit-Validation Methodologies for Ligands Isolated from DNA-Encoded Chemical Libraries.

PubMed

Zimmermann, Gunther; Li, Yizhou; Rieder, Ulrike; Mattarella, Martin; Neri, Dario; Scheuermann, Jörg

2017-05-04

DNA-encoded chemical libraries (DECLs) are large collections of compounds linked to DNA fragments, serving as amplifiable barcodes, which can be screened on target proteins of interest. In typical DECL selections, preferential binders are identified by high-throughput DNA sequencing, by comparing their frequency before and after the affinity capture step. Hits identified in this procedure need to be confirmed, by resynthesis and by performing affinity measurements. In this article we present new methods based on hybridization of oligonucleotide conjugates with fluorescently labeled complementary oligonucleotides; these facilitate the determination of affinity constants and kinetic dissociation constants. The experimental procedures were demonstrated with acetazolamide, a binder to carbonic anhydrase IX with a dissociation constant in the nanomolar range. The detection of binding events was compatible not only with fluorescence polarization methodologies, but also with Alphascreen technology and with microscale thermophoresis. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Molecular cloning and characterization of ADP-glucose pyrophosphorylase cDNA clones isolated from pea cotyledons.

PubMed

Burgess, D; Penton, A; Dunsmuir, P; Dooner, H

1997-02-01

Three ADP-glucose pyrophosphorylase (ADPG-PPase) cDNA clones have been isolated and characterized from a pea cotyledon cDNA library. Two of these clones (Psagps1 and Psagps2) encode the small subunit of ADPG-PPase. The deduced amino acid sequences for these two clones are 95% identical. Expression of these two genes differs in that the Psagps2 gene shows comparatively higher expression in seeds relative to its expression in other tissues. Psagps2 expression also peaks midway through seed development at a time in which Psagps1 transcripts are still accumulating. The third cDNA isolated (Psagp11) encodes the large subunit of ADPG-PPase. It shows greater selectivity in expression than either of the small subunit clones. It is highly expressed in sink organs (seed, pod, and seed coat) and undetectable in leaves.
Bombyx mori Nucleopolyhedrovirus Encodes a DNA-Binding Protein Capable of Destabilizing Duplex DNA

PubMed Central

Mikhailov, Victor S.; Mikhailova, Alla L.; Iwanaga, Masashi; Gomi, Sumiko; Maeda, Susumu

1998-01-01

A DNA-binding protein (designated DBP) with an apparent molecular mass of 38 kDa was purified to homogeneity from BmN cells (derived from Bombyx mori) infected with the B. mori nucleopolyhedrovirus (BmNPV). Six peptides obtained after digestion of the isolated protein with Achromobacter protease I were partially or completely sequenced. The determined amino acid sequences indicated that DBP was encoded by an open reading frame (ORF16) located at nucleotides (nt) 16189 to 17139 in the BmNPV genome (GenBank accession no. L33180). This ORF (designated dbp) is a homolog of Autographa californica multicapsid NPV ORF25, whose product has not been identified. BmNPV DBP is predicted to contain 317 amino acids (calculated molecular mass of 36.7 kDa) and to have an isoelectric point of 7.8. DBP showed a tendency to multimerization in the course of purification and was found to bind preferentially to single-stranded DNA. When bound to oligonucleotides, DBP protected them from hydrolysis by phage T4 DNA polymerase-associated 3′→5′ exonuclease. The sizes of the protected fragments indicated that a binding site size for DBP is about 30 nt per protein monomer. DBP, but not BmNPV LEF-3, was capable of unwinding partial DNA duplexes in an in vitro system. This helix-destabilizing ability is consistent with the prediction that DBP functions as a single-stranded DNA binding protein in virus replication. PMID:9525636
Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

PubMed

Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

1998-10-20

Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92. 2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.
Compressing DNA sequence databases with coil.

PubMed

White, W Timothy J; Hendy, Michael D

2008-05-20

Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression - an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression - the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work.
Compressing DNA sequence databases with coil

PubMed Central

White, W Timothy J; Hendy, Michael D

2008-01-01

Background Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. Results We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression – the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. Conclusion coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work. PMID:18489794
A novel begomovirus isolated from sida contains putative cis- and trans-acting replication specificity determinants that have evolved independently in several geographical lineages.

PubMed

Mauricio-Castillo, J A; Torres-Herrera, S I; Cárdenas-Conejo, Y; Pastor-Palacios, G; Méndez-Lozano, J; Argüello-Astorga, G R

2014-09-01

A novel begomovirus isolated from a Sida rhombifolia plant collected in Sinaloa, Mexico, was characterized. The genomic components of sida mosaic Sinaloa virus (SiMSinV) shared highest sequence identity with DNA-A and DNA-B components of chino del tomate virus (CdTV), suggesting a vertical evolutionary relationship between these viruses. However, recombination analysis indicated that a short segment of SiMSinV DNA-A encompassing the plus-strand replication origin and the 5´-proximal 43 codons of the Rep gene was derived from tomato mottle Taino virus (ToMoTV). Accordingly, the putative cis- and trans-acting replication specificity determinants of SiMSinV were identical to those of ToMoTV but differed from those of CdTV. Modeling of the SiMSinV and CdTV Rep proteins revealed significant differences in the region comprising the small β1/β5 sheet element, where five putative DNA-binding specificity determinants (SPDs) of Rep (i.e., amino acid residues 5, 8, 10, 69 and 71) were previously identified. Computer-assisted searches of public databases led to identification of 33 begomoviruses from three continents encoding proteins with SPDs identical to those of the Rep encoded by SiMSinV. Sequence analysis of the replication origins demonstrated that all 33 begomoviruses harbor potential Rep-binding sites identical to those of SiMSinV. These data support the hypothesis that the Rep β1/β5 sheet region determines specificity of this protein for DNA replication origin sequences.
Pea chloroplast tRNA(Lys) (UUU) gene: transcription and analysis of an intron-containing gene.

PubMed

Boyer, S K; Mullet, J E

1988-07-01

The pea chloroplast trnK gene which encodes tRNA(Lys) (UUU) was sequenced. TrnK is located 210 bp upstream from the promoter of psbA and immediately downstream from the 3'-end of rbcL. The gene is transcribed from the same DNA strand as psbA and rbcL. A 2447 bp intron with class II features is located in the trnK anticodon loop. The intron contains a 506 amino acid open reading frame which could encode an RNA maturase. The primary transcript of trnK is 2.9 kb long; its 5'-end was identified as a site of transcription initiation by in vitro transcription experiments. The 5'-terminus is adjacent to DNA sequences previously identified as transcription promoter elements. The most abundant trnK transcript is 2.5 kb long with termini corresponding to the 5' and 3' ends of the trnK exons. Intron specific RNAs were not detected. This suggests that RNA processing which produces tRNA(Lys) leads to rapid degradation of intron sequences.
The genome sequence of sweet cherry (Prunus avium) for use in genomics-assisted breeding.

PubMed

Shirasawa, Kenta; Isuzugawa, Kanji; Ikenaga, Mitsunobu; Saito, Yutaro; Yamamoto, Toshiya; Hirakawa, Hideki; Isobe, Sachiko

2017-10-01

We determined the genome sequence of sweet cherry (Prunus avium) using next-generation sequencing technology. The total length of the assembled sequences was 272.4 Mb, consisting of 10,148 scaffold sequences with an N50 length of 219.6 kb. The sequences covered 77.8% of the 352.9 Mb sweet cherry genome, as estimated by k-mer analysis, and included >96.0% of the core eukaryotic genes. We predicted 43,349 complete and partial protein-encoding genes. A high-density consensus map with 2,382 loci was constructed using double-digest restriction site-associated DNA sequencing. Comparing the genetic maps of sweet cherry and peach revealed high synteny between the two genomes; thus the scaffolds were integrated into pseudomolecules using map- and synteny-based strategies. Whole-genome resequencing of six modern cultivars found 1,016,866 SNPs and 162,402 insertions/deletions, out of which 0.7% were deleterious. The sequence variants, as well as simple sequence repeats, can be used as DNA markers. The genomic information helps us to identify agronomically important genes and will accelerate genetic studies and breeding programs for sweet cherries. Further information on the genomic sequences and DNA markers is available in DBcherry (http://cherry.kazusa.or.jp (8 May 2017, date last accessed)). © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Molecular analysis of two cDNA clones encoding acidic class I chitinase in maize.

PubMed Central

Wu, S; Kriz, A L; Widholm, J M

1994-01-01

The cloning and analysis of two different cDNA clones encoding putative maize (Zea mays L.) chitinases obtained by polymerase chain reaction (PCR) and cDNA library screening is described. The cDNA library was made from poly(A)+ RNA from leaves challenged with mercuric chloride for 2 d. The two clones, pCh2 and pCh11, appear to encode class I chitinase isoforms with cysteine-rich domains (not found in pCh11 due to the incomplete sequence) and proline-/glycine-rich or proline-rich hinge domains, respectively. The pCh11 clone resembles a previously reported maize seed chitinase; however, the deduced proteins were found to have acidic isoelectric points. Analysis of all monocot chitinase sequences available to date shows that not all class I chitinases possess the basic isoelectric points usually found in dicotyledonous plants and that monocot class II chitinases do not necessarily exhibit acidic isoelectric points. Based on sequence analysis, the pCh2 protein is apparently synthesized as a precursor polypeptide with a signal peptide. Although these two clones belong to class I chitinases, they share only about 70% amino acid homology in the catalytic domain region. Southern blot analysis showed that pCh2 may be encoded by a small gene family, whereas pCh11 was single copy. Northern blot analysis demonstrated that these genes are differentially regulated by mercuric chloride treatment. Mercuric chloride treatment caused rapid induction of pCh2 from 6 to 48 h, whereas pCh11 responded only slightly to the same treatment. During seed germination, embryos constitutively expressed both chitinase genes and the phytohormone abscisic acid had no effect on the expression. The fungus Aspergillus flavus was able to induce both genes to comparable levels in aleurone layers and embryos but not in endosperm tissue. Maize callus growth on the same plate with A. flavus for 1 week showed induction of the transcripts corresponding to pCh2 but not to pCh11. These studies indicate that the different chitinase isoforms in maize might have different functions in the plant, since they show differential expression patterns under different conditions. PMID:7972490
Selfish DNA in protein-coding genes of Rickettsia.

PubMed

Ogata, H; Audic, S; Barbe, V; Artiguenave, F; Fournier, P E; Raoult, D; Claverie, J M

2000-10-13

Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.

Identification, sequencing and expression of an integral membrane protein of the trans-Golgi network (TGN38).

PubMed Central

Luzio, J P; Brake, B; Banting, G; Howell, K E; Braghetta, P; Stanley, K K

1990-01-01

Organelle-specific integral membrane proteins were identified by a novel strategy which gives rise to monospecific antibodies to these proteins as well as to the cDNA clones encoding them. A cDNA expression library was screened with a polyclonal antiserum raised against Triton X-114-extracted organelle proteins and clones were then grouped using antibodies affinity-purified on individual fusion proteins. The identification, molecular cloning and sequencing are described of a type 1 membrane protein (TGN38) which is located specifically in the trans-Golgi network. Images Fig. 1. Fig. 3. PMID:2204342
Rolling Circle Transcription of Ribozymes Targeted to ras and mdr-1

DTIC Science & Technology

2001-09-01

ssDNA) to direct transcription of an tion-PCR, and recyclization were carried out to optimize active hammerhead ribozyme in E. coli cells. transcription...transcription I hammerhead ribozyme I in vitro selection and 12.5 units/ml RNase inhibitor (Promega), in a total reaction volume of 15 tk1. After a...sequence encoding a ssDNA, and splint ssDNA were ethanol-precipitated and used as hammerhead ribozyme . templates to begin the next round of in vitro
Markov chains: computing limit existence and approximations with DNA.

PubMed

Cardona, M; Colomer, M A; Conde, J; Miret, J M; Miró, J; Zaragoza, A

2005-09-01

We present two algorithms to perform computations over Markov chains. The first one determines whether the sequence of powers of the transition matrix of a Markov chain converges or not to a limit matrix. If it does converge, the second algorithm enables us to estimate this limit. The combination of these algorithms allows the computation of a limit using DNA computing. In this sense, we have encoded the states and the transition probabilities using strands of DNA for generating paths of the Markov chain.
In silico analysis of subtilisin from Glaciozyma antarctica PI12

NASA Astrophysics Data System (ADS)

Mustafha, Siti Mardhiah; Murad, Abdul Munir Abdul; Mahadi, Nor Muhammad; Kamaruddin, Shazilah; Bakar, Farah Diba Abu

2015-09-01

Subtilisin constitute as a major player in industrial enzymes that has a wide range of application especially in the detergent industry. In this study, a cDNA encoding for subtilisin (GaSUBT) was extracted from the psychrophilic yeast, Glaciozyma antarctica PI12, PCR amplified and sequenced. Various bioinformatics tools were used to characterize the GaSUBT. GaSUBT contains 1587 bp nucleotides encoding for 529 amino acids. The predicted molecular weight of the deduced protein is 55.34 kDa with an isoelectric point of 6.25. GaSUBT was predicted to possess a signal peptide and pro-peptide consisting of a peptidase inhibitor I9 sequence. From the sequence alignment analysis of deduced amino acids with other subtilisins in the NCBI database showed that the sequences surrounding the catalytic triad that forms the catalytic domain are well conserved.
Three SRA-Domain Methylcytosine-Binding Proteins Cooperate to Maintain Global CpG Methylation and Epigenetic Silencing in Arabidopsis

PubMed Central

Woo, Hye Ryun; Dittmer, Travis A.; Richards, Eric J.

2008-01-01

Methylcytosine-binding proteins decipher the epigenetic information encoded by DNA methylation and provide a link between DNA methylation, modification of chromatin structure, and gene silencing. VARIANT IN METHYLATION 1 (VIM1) encodes an SRA (SET- and RING-associated) domain methylcytosine-binding protein in Arabidopsis thaliana, and loss of VIM1 function causes centromere DNA hypomethylation and centromeric heterochromatin decondensation in interphase. In the Arabidopsis genome, there are five VIM genes that share very high sequence similarity and encode proteins containing a PHD domain, two RING domains, and an SRA domain. To gain further insight into the function and potential redundancy among the VIM proteins, we investigated strains combining different vim mutations and transgenic vim knock-down lines that down-regulate multiple VIM family genes. The vim1 vim3 double mutant and the transgenic vim knock-down lines showed decreased DNA methylation primarily at CpG sites in genic regions, as well as repeated sequences in heterochromatic regions. In addition, transcriptional silencing was released in these plants at most heterochromatin regions examined. Interestingly, the vim1 vim3 mutant and vim knock-down lines gained ectopic CpHpH methylation in the 5S rRNA genes against a background of CpG hypomethylation. The vim1 vim2 vim3 triple mutant displayed abnormal morphological phenotypes including late flowering, which is associated with DNA hypomethylation of the 5′ region of FWA and release of FWA gene silencing. Our findings demonstrate that VIM1, VIM2, and VIM3 have overlapping functions in maintenance of global CpG methylation and epigenetic transcriptional silencing. PMID:18704160
Cloning and analysis of DnaJ family members in the silkworm, Bombyx mori.

PubMed

Li, Yinü; Bu, Cuiyu; Li, Tiantian; Wang, Shibao; Jiang, Feng; Yi, Yongzhu; Yang, Huipeng; Zhang, Zhifang

2016-01-15

Heat shock proteins (Hsps) are involved in a variety of critical biological functions, including protein folding, degradation, and translocation and macromolecule assembly, act as molecular chaperones during periods of stress by binding to other proteins. Using expressed sequence tag (EST) and silkworm (Bombyx mori) transcriptome databases, we identified 27 cDNA sequences encoding the conserved J domain, which is found in DnaJ-type Hsps. Of the 27 J domain-containing sequences, 25 were complete cDNA sequences. We divided them into three types according to the number and presence of conserved domains. By analyzing the gene structures, intron numbers, and conserved domains and constructing a phylogenetic tree, we found that the DnaJ family had undergone convergent evolution, obtaining new domains to expand the diversity of its family members. The acquisition of the new DnaJ domains most likely occurred prior to the evolutionary divergence of prokaryotes and eukaryotes. The expression of DnaJ genes in the silkworm was generally higher in the fat body. The tissue distribution of DnaJ1 proteins was detected by western blotting, demonstrating that in the fifth-instar larvae, the DnaJ1 proteins were expressed at their highest levels in hemocytes, followed by the fat body and head. We also found that the DnaJ1 transcripts were likely differentially translated in different tissues. Using immunofluorescence cytochemistry, we revealed that in the blood cells, DnaJ1 was mainly localized in the cytoplasm. Copyright © 2015 Elsevier B.V. All rights reserved.
BiRen: predicting enhancers with a deep-learning-based model using the DNA sequence alone.

PubMed

Yang, Bite; Liu, Feng; Ren, Chao; Ouyang, Zhangyi; Xie, Ziwei; Bo, Xiaochen; Shu, Wenjie

2017-07-01

Enhancer elements are noncoding stretches of DNA that play key roles in controlling gene expression programmes. Despite major efforts to develop accurate enhancer prediction methods, identifying enhancer sequences continues to be a challenge in the annotation of mammalian genomes. One of the major issues is the lack of large, sufficiently comprehensive and experimentally validated enhancers for humans or other species. Thus, the development of computational methods based on limited experimentally validated enhancers and deciphering the transcriptional regulatory code encoded in the enhancer sequences is urgent. We present a deep-learning-based hybrid architecture, BiRen, which predicts enhancers using the DNA sequence alone. Our results demonstrate that BiRen can learn common enhancer patterns directly from the DNA sequence and exhibits superior accuracy, robustness and generalizability in enhancer prediction relative to other state-of-the-art enhancer predictors based on sequence characteristics. Our BiRen will enable researchers to acquire a deeper understanding of the regulatory code of enhancer sequences. Our BiRen method can be freely accessed at https://github.com/wenjiegroup/BiRen . shuwj@bmi.ac.cn or boxc@bmi.ac.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Pea chloroplast DnaJ-J8 and Toc12 are encoded by the same gene and localized in the stroma.

PubMed

Chiu, Chi-Chou; Chen, Lih-Jen; Li, Hsou-min

2010-11-01

Toc12 is a novel J domain-containing protein identified in pea (Pisum sativum) chloroplasts. It was shown to be an integral outer membrane protein localizing in the intermembrane space of the chloroplast envelope. Furthermore, Toc12 was shown to associate with an intermembrane space Hsp70, suggesting that Toc12 is important for protein translocation across the chloroplast envelope. Toc12 shares a high degree of sequence similarity with Arabidopsis (Arabidopsis thaliana) DnaJ-J8, which has been suggested to be a soluble protein of the chloroplast stroma. Here, we isolated genes encoding DnaJ-J8 from pea and found that Toc12 is a truncated clone of one of the pea DnaJ-J8s. Protein import analyses indicate that Toc12 and DnaJ-J8s possess a cleavable transit peptide and are localized in the stroma. Arabidopsis mutants with T-DNA insertions in the DnaJ-J8 gene show no defect in chloroplast protein import. Implications of these results in the energetics and mechanisms of chloroplast protein import are discussed.
Aureobasidium pullulans xylanase, gene and signal sequence

DOEpatents

Li Xinliang; Ljungdahl, L.G.

1997-01-07

A xylanase from Aureobasidium pullulans having a high specific activity is provided, as well as a signal protein for controlling excretion into cell culture medium of proteins to which it is attached. DNA encoding these proteins is also provided. 4 figs.
A diverse family of serine proteinase genes expressed in cotton boll weevil (Anthonomus grandis): implications for the design of pest-resistant transgenic cotton plants.

PubMed

Oliveira-Neto, Osmundo B; Batista, João A N; Rigden, Daniel J; Fragoso, Rodrigo R; Silva, Rodrigo O; Gomes, Eliane A; Franco, Octávio L; Dias, Simoni C; Cordeiro, Célia M T; Monnerat, Rose G; Grossi-De-Sá, Maria F

2004-09-01

Fourteen different cDNA fragments encoding serine proteinases were isolated by reverse transcription-PCR from cotton boll weevil (Anthonomus grandis) larvae. A large diversity between the sequences was observed, with a mean pairwise identity of 22% in the amino acid sequence. The cDNAs encompassed 11 trypsin-like sequences classifiable into three families and three chymotrypsin-like sequences belonging to a single family. Using a combination of 5' and 3' RACE, the full-length sequence was obtained for five of the cDNAs, named Agser2, Agser5, Agser6, Agser10 and Agser21. The encoded proteins included amino acid sequence motifs of serine proteinase active sites, conserved cysteine residues, and both zymogen activation and signal peptides. Southern blotting analysis suggested that one or two copies of these serine proteinase genes exist in the A. grandis genome. Northern blotting analysis of Agser2 and Agser5 showed that for both genes, expression is induced upon feeding and is concentrated in the gut of larvae and adult insects. Reverse northern analysis of the 14 cDNA fragments showed that only two trypsin-like and two chymotrypsin-like were expressed at detectable levels. Under the effect of the serine proteinase inhibitors soybean Kunitz trypsin inhibitor and black-eyed pea trypsin/chymotrypsin inhibitor, expression of one of the trypsin-like sequences was upregulated while expression of the two chymotrypsin-like sequences was downregulated. Copyright 2004 Elsevier Ltd.
The ENCODE project: implications for psychiatric genetics.

PubMed

Kavanagh, D H; Dwyer, S; O'Donovan, M C; Owen, M J

2013-05-01

The ENCyclopedia Of DNA Elements (ENCODE) project is a public research consortium that aims to identify all functional elements of the human genome sequence. The project comprised 1640 data sets, from 147 different cell type and the findings were released in a coordinated set of 34 publications across several journals. The ENCODE publications report that 80.4% of the human genome displays some functionality. These data have important implications for interpreting results from large-scale genetics studies. We reviewed some of the key findings from the ENCODE publications and discuss how they can influence or inform further investigations into the genetic factors contributing to neuropsychiatric disorders.
Heterogeneous RNA-binding protein M4 is a receptor for carcinoembryonic antigen in Kupffer cells.

PubMed

Bajenova, O V; Zimmer, R; Stolper, E; Salisbury-Rowswell, J; Nanji, A; Thomas, P

2001-08-17

Here we report the isolation of the recombinant cDNA clone from rat macrophages, Kupffer cells (KC) that encodes a protein interacting with carcinoembryonic antigen (CEA). To isolate and identify the CEA receptor gene we used two approaches: screening of a KC cDNA library with a specific antibody and the yeast two-hybrid system for protein interaction using as a bait the N-terminal part of the CEA encoding the binding site. Both techniques resulted in the identification of the rat heterogeneous RNA-binding protein (hnRNP) M4 gene. The rat ortholog cDNA sequence has not been previously described. The open reading frame for this gene contains a 2351-base pair sequence with the polyadenylation signal AATAAA and a termination poly(A) tail. The mRNA shows ubiquitous tissue expression as a 2.4-kilobase transcript. The deduced amino acid sequence comprised a 78-kDa membrane protein with 3 putative RNA-binding domains, arginine/methionine/glutamine-rich C terminus and 3 potential membrane spanning regions. When hnRNP M4 protein is expressed in pGEX4T-3 vector system in Escherichia coli it binds (125)I-labeled CEA in a Ca(2+)-dependent fashion. Transfection of rat hnRNP M4 cDNA into a non-CEA binding mouse macrophage cell line p388D1 resulted in CEA binding. These data provide evidence for a new function of hnRNP M4 protein as a CEA-binding protein in Kupffer cells.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Antonacci, R.; Colombo, I.; Volta, M.

The electron-transfer flavoprotein (ETF), located in the mitochondrial matrix, is a nuclear-encoded enzyme delivering to the respiratory chain electrons by straight-chain acyl-CoA dehydrogenases and other dehydrogenases. ETF is composed of a 35-kDa [alpha]-subunit that is cleaved to a 32-kDa protein during mitochondrial import (ETFA) and a [beta]-subunit that reaches the mitochondrion unmodified (ETFB). The cDNA encoding both these subunits has been cloned and sequenced. 14 refs., 1 fig.
Monoterpene synthases from common sage (Salvia officinalis)

DOEpatents

Croteau, Rodney Bruce; Wise, Mitchell Lynn; Katahira, Eva Joy; Savage, Thomas Jonathan

1999-01-01

cDNAs encoding (+)-bornyl diphosphate synthase, 1,8-cineole synthase and (+)-sabinene synthase from common sage (Salvia officinalis) have been isolated and sequenced, and the corresponding amino acid sequences has been determined. Accordingly, isolated DNA sequences (SEQ ID No:1; SEQ ID No:3 and SEQ ID No:5) are provided which code for the expression of (+)-bornyl diphosphate synthase (SEQ ID No:2), 1,8-cineole synthase (SEQ ID No:4) and (+)-sabinene synthase SEQ ID No:6), respectively, from sage (Salvia officinalis). In other aspects, replicable recombinant cloning vehicles are provided which code for (+)-bornyl diphosphate synthase, 1,8-cineole synthase or (+)-sabinene synthase, or for a base sequence sufficiently complementary to at least a portion of (+)-bornyl diphosphate synthase, 1,8-cineole synthase or (+)-sabinene synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding (+)-bornyl diphosphate synthase, 1,8-cineole synthase or (+)-sabinene synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant monoterpene synthases that may be used to facilitate their production, isolation and purification in significant amounts. Recombinant (+)-bornyl diphosphate synthase, 1,8-cineole synthase and (+)-sabinene synthase may be used to obtain expression or enhanced expression of (+)-bornyl diphosphate synthase, 1,8-cineole synthase and (+)-sabinene synthase in plants in order to enhance the production of monoterpenoids, or may be otherwise employed for the regulation or expression of (+)-bornyl diphosphate synthase, 1,8-cineole synthase and (+)-sabinene synthase, or the production of their products.
Primary structure of prostaglandin G/H synthase from sheep vesicular gland determined from the complementary DNA sequence.

PubMed Central

DeWitt, D L; Smith, W L

1988-01-01

Prostaglandin G/H synthase (8,11,14-icosatrienoate, hydrogen-donor:oxygen oxidoreductase, EC 1.14.99.1) catalyzes the first step in the formation of prostaglandins and thromboxanes, the conversion of arachidonic acid to prostaglandin endoperoxides G and H. This enzyme is the site of action of nonsteroidal anti-inflammatory drugs. We have isolated a 2.7-kilobase complementary DNA (cDNA) encompassing the entire coding region of prostaglandin G/H synthase from sheep vesicular glands. This cDNA, cloned from a lambda gt 10 library prepared from poly(A)+ RNA of vesicular glands, hybridizes with a single 2.75-kilobase mRNA species. The cDNA clone was selected using oligonucleotide probes modeled from amino acid sequences of tryptic peptides prepared from the purified enzyme. The full-length cDNA encodes a protein of 600 amino acids, including a signal sequence of 24 amino acids. Identification of the cDNA as coding for prostaglandin G/H synthase is based on comparison of amino acid sequences of seven peptides comprising 103 amino acids with the amino acid sequence deduced from the nucleotide sequence of the cDNA. The molecular weight of the unglycosylated enzyme lacking the signal peptide is 65,621. The synthase is a glycoprotein, and there are three potential sites for N-glycosylation, two of them in the amino-terminal half of the molecule. The serine reported to be acetylated by aspirin is at position 530, near the carboxyl terminus. There is no significant similarity between the sequence of the synthase and that of any other protein in amino acid or nucleotide sequence libraries, and a heme binding site(s) is not apparent from the amino acid sequence. The availability of a full-length cDNA clone coding for prostaglandin G/H synthase should facilitate studies of the regulation of expression of this enzyme and the structural features important for catalysis and for interaction with anti-inflammatory drugs. Images PMID:3125548
From the selfish gene to selfish metabolism: revisiting the central dogma.

PubMed

de Lorenzo, Víctor

2014-03-01

The standard representation of the Central Dogma (CD) of Molecular Biology conspicuously ignores metabolism. However, both the metabolites and the biochemical fluxes behind any biological phenomenon are encrypted in the DNA sequence. Metabolism constrains and even changes the information flow when the DNA-encoded instructions conflict with the homeostasis of the biochemical network. Inspection of adaptive virulence programs and emergence of xenobiotic-biodegradation pathways in environmental bacteria suggest that their main evolutionary drive is the expansion of their metabolic networks towards new chemical landscapes rather than perpetuation and spreading of their DNA sequences. Faulty enzymatic reactions on suboptimal substrates often produce reactive oxygen species (ROS), a process that fosters DNA diversification and ultimately couples catabolism of the new chemicals to growth. All this calls for a revision of the CD in which metabolism (rather than DNA) has the leading role. © 2014 WILEY Periodicals, Inc.
A septal chromosome segregator protein evolved into a conjugative DNA-translocator protein

PubMed Central

Sepulveda, Edgardo; Vogelmann, Jutta

2011-01-01

Streptomycetes, Gram-positive soil bacteria well known for the production of antibiotics feature a unique conjugative DNA transfer system. In contrast to classical conjugation which is characterized by the secretion of a pilot protein covalently linked to a single-stranded DNA molecule, in Streptomyces a double-stranded DNA molecule is translocated during conjugative transfer. This transfer involves a single plasmid encoded protein, TraB. A detailed biochemical and biophysical characterization of TraB, revealed a close relationship to FtsK, mediating chromosome segregation during bacterial cell division. TraB translocates plasmid DNA by recognizing 8-bp direct repeats located in a specific plasmid region clt. Similar sequences accidentally also occur on chromosomes and have been shown to be bound by TraB. We suggest that TraB mobilizes chromosomal genes by the interaction with these chromosomal clt-like sequences not relying on the integration of the conjugative plasmid into the chromosome. PMID:22479692
Interpreting the biological relevance of bioinformatic analyses with T-DNA sequence for protein allergenicity.

PubMed

Harper, B; McClain, S; Ganko, E W

2012-08-01

Global regulatory agencies require bioinformatic sequence analysis as part of their safety evaluation for transgenic crops. Analysis typically focuses on encoded proteins and adjacent endogenous flanking sequences. Recently, regulatory expectations have expanded to include all reading frames of the inserted DNA. The intent is to provide biologically relevant results that can be used in the overall assessment of safety. This paper evaluates the relevance of assessing the allergenic potential of all DNA reading frames found in common food genes using methods considered for the analysis of T-DNA sequences used in transgenic crops. FASTA and BLASTX algorithms were used to compare genes from maize, rice, soybean, cucumber, melon, watermelon, and tomato using international regulatory guidance. Results show that BLASTX for maize yielded 7254 alignments that exceeded allergen similarity thresholds and 210,772 alignments that matched eight or more consecutive amino acids with an allergen; other crops produced similar results. This analysis suggests that each nontransgenic crop has a much greater potential for allergenic risk than what has been observed clinically. We demonstrate that a meaningful safety assessment is unlikely to be provided by using methods with inherently high frequencies of false positive alignments when broadly applied to all reading frames of DNA sequence. Copyright © 2012 Elsevier Inc. All rights reserved.
Identification of the polypeptides encoded in the unassigned reading frames 2, 4, 4L, and 5 of human mitochondrial DNA.

PubMed Central

Mariottini, P; Chomyn, A; Riley, M; Cottrell, B; Doolittle, R F; Attardi, G

1986-01-01

In previous work, antibodies prepared against chemically synthesized peptides predicted from the DNA sequence were used to identify the polypeptides encoded in three of the eight unassigned reading frames (URFs) of human mitochondrial DNA (mtDNA). In the present study, this approach has been extended to other human mtDNA URFs. In particular, antibodies directed against the NH2-terminal octapeptide of the putative URF2 product specifically precipitated component 11 of the HeLa cell mitochondrial translation products, the reaction being inhibited by the specific peptide. Similarly, antibodies directed against the COOH-terminal nonapeptide of the putative URF4 product reacted specifically with components 4 and 5, and antibodies against a COOH-terminal heptapeptide of the presumptive URF4L product reacted specifically with component 26. Antibodies against the NH2-terminal heptapeptide of the putative product of URF5 reacted with component 1, but only to a marginal extent; however, the results of a trypsin fingerprinting analysis of component 1 point strongly to this component as being the authentic product of URF5. The polypeptide assignments to the mtDNA URFs analyzed here are supported by the relative electrophoretic mobilities of proteins 11, 4-5, 26, and 1, which are those expected for the molecular weights predicted from the DNA sequence for the products of URF2, URF4, URF4L, and URF5, respectively. With the present assignment, seven of the eight human mtDNA URFs have been shown to be expressed in HeLa cells. Images PMID:3456601
Expression of ZmMET1, a gene encoding a DNA methyltransferase from maize, is associated not only with DNA replication in actively proliferating cells, but also with altered DNA methylation status in cold-stressed quiescent cells.

PubMed

Steward, N; Kusano, T; Sano, H

2000-09-01

A cDNA fragment encoding part of a DNA methyltransferase was isolated from maize. The putative amino acid sequence identically matched that deduced from a genomic sequence in the database (accession no. AF063403), and the corresponding gene was designated as ZmMET1. Bacterially expressed ZmMET1 actively methylated DNA in vitro. Transcripts of ZmMET1 could be shown to exclusively accumulate in actively proliferating cells of the meristems of mesocotyls and root apices, suggesting ZmMET1 expression to be associated with DNA replication. This was confirmed by simultaneous decrease of transcripts of ZmMET1 and histone H3, a marker for DNA replication, in seedlings exposed to wounding, desiccation and salinity, all of which suppress cell division. Cold stress also depressed both transcripts in root tissues. In contrast, however, accumulation of ZmMET1 transcripts in shoot mesocotyls was not affected by cold stress, whereas those for H3 sharply decreased. Such a differential accumulation of ZmMET1 transcripts was consistent with ZmMET1 protein levels as revealed by western blotting. Expression of ZmMET1 is thus coexistent, but not completely dependent on DNA replication. Southern hybridization analysis with a methylation-sensitive restriction enzyme revealed that cold treatment induced demethylation of DNA in the Ac/Ds transposon region, but not in other genes, and that such demethylation primarily occurred in roots. These results suggested that the methylation level was decreased selectively by cold treatment, and that ZmMET1 may, at least partly, prevent such demethylation.

Comparative Genomics Reveals Chd1 as a Determinant of Nucleosome Spacing in Vivo.

PubMed

Hughes, Amanda L; Rando, Oliver J

2015-07-14

Packaging of genomic DNA into nucleosomes is nearly universally conserved in eukaryotes, and many features of the nucleosome landscape are quite conserved. Nonetheless, quantitative aspects of nucleosome packaging differ between species because, for example, the average length of linker DNA between nucleosomes can differ significantly even between closely related species. We recently showed that the difference in nucleosome spacing between two Hemiascomycete species-Saccharomyces cerevisiae and Kluyveromyces lactis-is established by trans-acting factors rather than being encoded in cis in the DNA sequence. Here, we generated several S. cerevisiae strains in which endogenous copies of candidate nucleosome spacing factors are deleted and replaced with the orthologous factors from K. lactis. We find no change in nucleosome spacing in such strains in which H1 or Isw1 complexes are swapped. In contrast, the K. lactis gene encoding the ATP-dependent remodeler Chd1 was found to direct longer internucleosomal spacing in S. cerevisiae, establishing that this remodeler is partially responsible for the relatively long internucleosomal spacing observed in K. lactis. By analyzing several chimeric proteins, we find that sequence differences that contribute to the spacing activity of this remodeler are dispersed throughout the coding sequence, but that the strongest spacing effect is linked to the understudied N-terminal end of Chd1. Taken together, our data find a role for sequence evolution of a chromatin remodeler in establishing quantitative aspects of the chromatin landscape in a species-specific manner. Copyright © 2015 Hughes and Rando.
Complex alternative splicing of acetylcholinesterase transcripts in Torpedo electric organ; primary structure of the precursor of the glycolipid-anchored dimeric form.

PubMed Central

Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J

1988-01-01

In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. Images PMID:3181125
Quick identification of acetic acid bacteria based on nucleotide sequences of the 16S-23S rDNA internal transcribed spacer region and of the PQQ-dependent alcohol dehydrogenase gene.

PubMed

Trcek, Janja

2005-10-01

Acetic acid bacteria (AAB) are well known for oxidizing different ethanol-containing substrates into various types of vinegar. They are also used for production of some biotechnologically important products, such as sorbose and gluconic acids. However, their presence is not always appreciated since certain species also spoil wine, juice, beer and fruits. To be able to follow AAB in all these processes, the species involved must be identified accurately and quickly. Because of inaccuracy and very time-consuming phenotypic analysis of AAB, the application of molecular methods is necessary. Since the pairwise comparison among the 16S rRNA gene sequences of AAB shows very high similarity (up to 99.9%) other DNA-targets should be used. Our previous studies showed that the restriction analysis of 16S-23S rDNA internal transcribed spacer region is a suitable approach for quick affiliation of an acetic acid bacterium to a distinct group of restriction types and also for quick identification of a potentially novel species of acetic acid bacterium (Trcek & Teuber 2002; Trcek 2002). However, with the exception of two conserved genes, encoding tRNAIle and tRNAAla, the sequences of 16S-23S rDNA are highly divergent among AAB species. For this reason we analyzed in this study a gene encoding PQQ-dependent ADH as a possible DNA-target. First we confirmed the expression of subunit I of PQQ-dependent ADH (AdhA) also in Asaia, the only genus of AAB which exhibits little or no ADH-activity. Further we analyzed the partial sequences of adhA among some representative species of the genera Acetobacter, Gluconobacter and Gluconacetobacter. The conserved and variable regions in these sequences made possible the construction of A. acetispecific oligonucleotide the specificity of which was confirmed in PCR-reaction using 45 well-defined strains of AAB as DNA-templates. The primer was also successfully used in direct identification of A. aceti from home made cider vinegar as well as for revealing the misclassification of strain IFO 3283 into the species A. aceti.
Identification of Arcanobacterium pyogenes isolated by post mortem examinations of a bearded dragon and a gecko by phenotypic and genotypic properties

PubMed Central

Ülbegi-Mohyla, H.; Hijazin, M.; Alber, J.; Hassan, A. A.; Abdulmawjood, A.; Prenger-Berninghoff, E.; Weiß, R.; Zschöck, M.

2010-01-01

The present study was designed to identify phenotypically and genotypically two Arcanobacterium (A.) pyogenes strains isolated by post mortem examinations of a bearded dragon and a gecko. The A. pyogenes strains showed the typical biochemical properties and displayed CAMP-like synergistic hemolytic activities with various indicator strains. The species identity could be confirmed genotypically by amplification and sequencing of the 16S rDNA gene and, as novel target gene, by sequencing of the beta subunit of RNA polymerase encoding gene rpoB, of both strains and of reference strains representing nine species of the genus Arcanobacterium. The species identity of the two A. pyogenes strains could additionally be confirmed by PCR mediated amplification of species specific parts of the 16S-23S rDNA intergenic spacer region, the pyolysin encoding gene plo and by amplification of the collagen-binding protein encoding gene cbpA. All these molecular targets might help to improve the future identification and further characterization of A. pyogenes which, as demonstrated in the present study, could also be isolated from reptile specimens. PMID:20706035
Isolation of a cDNA Encoding a Granule-Bound 152-Kilodalton Starch-Branching Enzyme in Wheat1

PubMed Central

Båga, Monica; Nair, Ramesh B.; Repellin, Anne; Scoles, Graham J.; Chibbar, Ravindra N.

2000-01-01

Screening of a wheat (Triticum aestivum) cDNA library for starch-branching enzyme I (SBEI) genes combined with 5′-rapid amplification of cDNA ends resulted in isolation of a 4,563-bp composite cDNA, Sbe1c. Based on sequence alignment to characterized SBEI cDNA clones isolated from plants, the SBEIc predicted from the cDNA sequence was produced with a transit peptide directing the polypeptide into plastids. Furthermore, the predicted mature form of SBEIc was much larger (152 kD) than previously characterized plant SBEI (80–100 kD) and contained a partial duplication of SBEI sequences. The first SBEI domain showed high amino acid similarity to a 74-kD wheat SBEI-like protein that is inactive as a branching enzyme when expressed in Escherichia coli. The second SBEI domain on SBEIc was identical in sequence to a functional 87-kD SBEI produced in the wheat endosperm. Immunoblot analysis of proteins produced in developing wheat kernels demonstrated that the 152-kD SBEIc was, in contrast to the 87- to 88-kD SBEI, preferentially associated with the starch granules. Proteins similar in size and recognized by wheat SBEI antibodies were also present in Triticum monococcum, Triticum tauschii, and Triticum turgidum subsp. durum. PMID:10982440
Chromosome-encoded narrow-spectrum Ambler class A beta-lactamase GIL-1 from Citrobacter gillenii.

PubMed

Naas, Thierry; Aubert, Daniel; Ozcan, Ayla; Nordmann, Patrice

2007-04-01

A novel beta-lactamase gene was cloned from the whole-cell DNA of an enterobacterial Citrobacter gillenii reference strain that displayed a weak narrow-spectrum beta-lactam-resistant phenotype and was expressed in Escherichia coli. It encoded a clavulanic acid-inhibited Ambler class A beta-lactamase, GIL-1, with a pI value of 7.5 and a molecular mass of ca. 29 kDa. GIL-1 had the highest percent amino acid sequence identity with TEM-1 and SHV-1, 77%, and 67%, respectively, and only 46%, 31%, and 32% amino acid sequence identity with CKO-1 (C. koseri), CdiA1 (C. diversus), and SED-1 (C. sedlaki), respectively. The substrate profile of the purified GIL-1 was similar to that of beta-lactamases TEM-1 and SHV-1. The blaGIL-1 gene was chromosomally located, as revealed by I-CeuI experiments, and was constitutively expressed at a low level in C. gillenii. No gene homologous to the regulatory ampR genes of chromosomal class C beta-lactamases was found upstream of the blaGIL-1 gene, which fits the noninducibility of beta-lactamase expression in C. gillenii. Rapid amplification of DNA 5' ends analysis of the promoter region revealed putative promoter sequences that diverge from what has been identified as the consensus sequence in E. coli. The blaGIL-1 gene was part of a 5.5-kb DNA fragment bracketed by a 9-bp duplication and inserted between the d-lactate dehydrogenase gene and the ydbH genes; this DNA fragment was absent in other Citrobacter species. This work further illustrates the heterogeneity of beta-lactamases in Citrobacter spp., which may indicate that the variability of Citrobacter species is greater than expected.
Comparative analysis and molecular characterization of a gene BANF1 encoded a DNA-binding protein during mitosis from the Giant Panda and Black Bear.

PubMed

Zeng, Yichun; Hou, Yi-Ling; Ding, Xiang; Hou, Wan-Ru; Li, Jian

2014-01-01

Barrier to autointegration factor 1 (BANF1) is a DNA-binding protein found in the nucleus and cytoplasm of eukaryotic cells that functions to establish nuclear architecture during mitosis. The cDNA and the genomic sequence of BANF1 were cloned from the Giant Panda (Ailuropoda melanoleuca) and Black Bear (Ursus thibetanus mupinensis) using RT-PCR technology and Touchdown-PCR, respectively. The cDNA of the BANF1 cloned from Giant Panda and Black Bear is 297 bp in size, containing an open reading frame of 270 bp encoding 89 amino acids. The length of the genomic sequence from Giant Panda is 521 bp, from Black Bear is 536 bp, which were found both to possess 2 exons. Alignment analysis indicated that the nucleotide sequence and the deduced amino acid sequence are highly conserved to some mammalian species studied. Topology prediction showed there is one Protein kinase C phosphorylation site, one Casein kinase II phosphorylation site, one Tyrosine kinase phosphorylation site, one N-myristoylation site, and one Amidation site in the BANF1 protein of the Giant Panda, and there is one Protein kinase C phosphorylation site, one Tyrosine kinase phosphorylation site, one N-myristoylation site, and one Amidation site in the BANF1 protein of the Black Bear. The BANF1 gene can be readily expressed in E. coli. Results showed that the protein BANF1 fusion with the N-terminally His-tagged form gave rise to the accumulation of an expected 14 kD polypeptide that formed inclusion bodies. The expression products obtained could be used to purify the proteins and study their function further.
tRNomics: analysis of tRNA genes from 50 genomes of Eukarya, Archaea, and Bacteria reveals anticodon-sparing strategies and domain-specific features.

PubMed Central

Marck, Christian; Grosjean, Henri

2002-01-01

From 50 genomes of the three domains of life (7 eukarya, 13 archaea, and 30 bacteria), we extracted, analyzed, and compared over 4,000 sequences corresponding to cytoplasmic, nonorganellar tRNAs. For each genome, the complete set of tRNAs required to read the 61 sense codons was identified, which permitted revelation of three major anticodon-sparing strategies. Other features and sequence peculiarities analyzed are the following: (1) fit to the standard cloverleaf structure, (2) characteristic consensus sequences for elongator and initiator tDNAs, (3) frequencies of bases at each sequence position, (4) type and frequencies of conserved 2D and 3D base pairs, (5) anticodon/tDNA usages and anticodon-sparing strategies, (6) identification of the tRNA-Ile with anticodon CAU reading AUA, (7) size of variable arm, (8) occurrence and location of introns, (9) occurrence of 3'-CCA and 5'-extra G encoded at the tDNA level, and (10) distribution of the tRNA genes in genomes and their mode of transcription. Among all tRNA isoacceptors, we found that initiator tDNA-iMet is the most conserved across the three domains, yet domain-specific signatures exist. Also, according to which tRNA feature is considered (5'-extra G encoded in tDNAs-His, AUA codon read by tRNA-Ile with anticodon CAU, presence of intron, absence of "two-out-of-three" reading mode and short V-arm in tDNA-Tyr) Archaea sequester either with Bacteria or Eukarya. No common features between Eukarya and Bacteria not shared with Archaea could be unveiled. Thus, from the tRNomic point of view, Archaea appears as an "intermediate domain" between Eukarya and Bacteria. PMID:12403461
Method of artificial DNA splicing by directed ligation (SDL).

PubMed Central

Lebedenko, E N; Birikh, K R; Plutalov, O V; Berlin YuA

1991-01-01

An approach to directed genetic recombination in vitro has been devised, which allows for joining together, in a predetermined way, a series of DNA segments to give a precisely spliced polynucleotide sequence (DNA splicing by directed ligation, SDL). The approach makes use of amplification, by means of several polymerase chain reactions (PCR), of a chosen set of DNA segments. Primers for the amplifications contain recognition sites of the class IIS restriction endonucleases, which transform blunt ends of the amplification products into protruding ends of unique primary structures, the ends to be used for joining segments together being mutually complementary. Ligation of the mixture of the segments so synthesized gives the desired sequence in an unambiguous way. The suggested approach has been exemplified by the synthesis of a totally processed (intronless) gene encoding human mature interleukin-1 alpha. Images PMID:1662363
The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins.

PubMed Central

Fanning, T; Singer, M

1987-01-01

Recent work suggests that one or more members of the highly repeated LINE-1 (L1) DNA family found in all mammals may encode one or more proteins. Here we report the sequence of a portion of an L1 cloned from the domestic cat (Felis catus). These data permit comparison of the L1 sequences in four mammalian orders (Carnivore, Lagomorph, Rodent and Primate) and the comparison supports the suggested coding potential. In two separate, noncontiguous regions in the carboxy terminal half of the proteins predicted from the DNA sequences, there are several strongly conserved segments. In one region, these share homology with known or suspected reverse transcriptases, as described by others in rodents and primates. In the second region, closer to the carboxy terminus, the strongly conserved segments are over 90% homologous among the four orders. One of the latter segments is cysteine rich and resembles the putative metal binding domains of nucleic acid binding proteins, including those of TFIIIA and retroviruses. PMID:3562227
More Genetic Engineering With Cloned Hemoglobin Genes

NASA Technical Reports Server (NTRS)

Bailey, James E.

1992-01-01

Cells modified to enhance growth and production of proteins. Method for enhancing both growth of micro-organisms in vitro and production of various proteins or metalbolites in these micro-organisms provides for incorporation of selected chromosomal or extrachormosomal deoxyribonucleic acid (DNA) sequences into micro-organisms from other cells or from artificial sources. Incorporated DNA includes parts encoding desired product(s) or characteristic(s) of cells and parts that control expression of productor characteristic-encoding parts in response to variations in environment. Extended method enables increased research into growth of organisms in oxygen-poor environments. Industrial applications found in enhancement of processing steps requiring oxygen in fermentation, enzymatic degradation, treatment of wastes containing toxic chemicals, brewing, and some oxidative chemical reactions.
The construction and partial characterization of plasmids containing complementary DNA sequences to human calcitonin precursor polyprotein.

PubMed Central

Allison, J; Hall, L; MacIntyre, I; Craig, R K

1981-01-01

(1) Total poly(A)-containing RNA isolated from human thyroid medullary carcinoma tissue was shown to direct the synthesis in the wheat germ cell-free system of a major (Mr 21000) and several minor forms of human calcitonin precursor polyproteins. Evidence for processing of these precursor(s) by the wheat germ cell-free system is also presented. (2) A small complementary DNA (cDNA) plasmid library has been constructed in the PstI site of the plasmid pAT153, using total human thyroid medullary carcinoma poly(A)-containing RNA as the starting material. (3) Plasmids containing abundant cDNA sequences were selected by hybridization in situ, and two of these (ph T-B3 and phT-B6) were characterized by hybridization--translation and restriction analysis. Each was shown to contain human calcitonin precursor polyprotein cDNA sequences. (4) RNA blotting techniques demonstrate that the human calcitonin precursor polyprotein is encoded within a mRNA containing 1000 bases. (5) The results demonstrate that human calcitonin is synthesized as a precursor polyprotein. Images Fig. 1. Fig. 2. Fig. 3. PMID:6896146
Cloning and expression of a cDNA coding for catalase from zebrafish (Danio rerio).

PubMed

Ken, C F; Lin, C T; Wu, J L; Shaw, J F

2000-06-01

A full-length complementary DNA (cDNA) clone encoding a catalase was amplified by the rapid amplication of cDNA ends-polymerase chain reaction (RACE-PCR) technique from zebrafish (Danio rerio) mRNA. Nucleotide sequence analysis of this cDNA clone revealed that it comprised a complete open reading frame coding for 526 amino acid residues and that it had a molecular mass of 59 654 Da. The deduced amino acid sequence showed high similarity with the sequences of catalase from swine (86.9%), mouse (85.8%), rat (85%), human (83.7%), fruit fly (75.6%), nematode (71.1%), and yeast (58.6%). The amino acid residues for secondary structures are apparently conserved as they are present in other mammal species. Furthermore, the coding region of zebrafish catalase was introduced into an expression vector, pET-20b(+), and transformed into Escherichia coli expression host BL21(DE3)pLysS. A 60-kDa active catalase protein was expressed and detected by Coomassie blue staining as well as activity staining on polyacrylamide gel followed electrophoresis.
DNA inversion within the apolipoproteins AI/CIII/AIV-encoding gene cluster of certain patients with premature atherosclerosis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Karathanasis, S.K.; Ferris, E.; Haddad, I.A.

1987-10-01

The genes coding for apolipoproteins (apo) AI, CIII, and AIV, designated APOA1, APOC3, and APOA4, respectively, are closely linked and tandemly organized in the long arm of the human chromosome 11. A DNA rearrangement involving the genes encoding apoAI and apoCIII in certain patients with premature atherosclerosis has been associated with deficiency of both apoAI and apoCIII in the plasma of these patients. Structural characterization of the genes for apoAI and apoCIII in one of these patients indicates that this rearrangement consists of a DNA inversion containing portions of the 3' ends of the apoAI and apoCIII genes, including themore » DNA region between these genes. The breakpoints of this DNA inversion are located within the fourth exon of the apoAI gene and the first intron of the apoCIII gene. Thus, this DNA inversion results in reciprocal fusion of the apoAI and apoCIII gene transcriptional units. Expression of these gene fusions in cultured mammalian cells results in stable mRNA transcripts with sequences representing fusions of the apoAI and apoCIII mRNAs. These results indicate that absence of transcripts with correct apoAI and apoCIII mRNA sequences causes apoAI and apoCIII deficiency in the plasma of these patients and suggest that these apolipoproteins are involved in cholesterol homeostasis and protection against premature atherosclerosis.« less
Molecular Cloning and Characterization of Novel Morus alba Germin-Like Protein Gene Which Encodes for a Silkworm Gut Digestion-Resistant Antimicrobial Protein

PubMed Central

Patnaik, Bharat Bhusan; Kim, Dong Hyun; Oh, Seung Han; Song, Yong-Su; Chanh, Nguyen Dang Minh; Kim, Jong Sun; Jung, Woo-jin; Saha, Atul Kumar; Bindroo, Bharat Bhushan; Han, Yeon Soo

2012-01-01

Background Silkworm fecal matter is considered one of the richest sources of antimicrobial and antiviral protein (substances) and such economically feasible and eco-friendly proteins acting as secondary metabolites from the insect system can be explored for their practical utility in conferring broad spectrum disease resistance against pathogenic microbial specimens. Methodology/Principal Findings Silkworm fecal matter extracts prepared in 0.02 M phosphate buffer saline (pH 7.4), at a temperature of 60°C was subjected to 40% saturated ammonium sulphate precipitation and purified by gel-filtration chromatography (GFC). SDS-PAGE under denaturing conditions showed a single band at about 21.5 kDa. The peak fraction, thus obtained by GFC wastested for homogeneityusing C18reverse-phase high performance liquid chromatography (HPLC). The activity of the purified protein was tested against selected Gram +/− bacteria and phytopathogenic Fusarium species with concentration-dependent inhibitionrelationship. The purified bioactive protein was subjected to matrix-assisted laser desorption and ionization-time of flight mass spectrometry (MALDI-TOF-MS) and N-terminal sequencing by Edman degradation towards its identification. The N-terminal first 18 amino acid sequence following the predicted signal peptide showed homology to plant germin-like proteins (Glp). In order to characterize the full-length gene sequence in detail, the partial cDNA was cloned and sequenced using degenerate primers, followed by 5′- and 3′-rapid amplification of cDNA ends (RACE-PCR). The full-length cDNA sequence composed of 630 bp encoding 209 amino acids and corresponded to germin-like proteins (Glps) involved in plant development and defense. Conclusions/Significance The study reports, characterization of novel Glpbelonging to subfamily 3 from M. alba by the purification of mature active protein from silkworm fecal matter. The N-terminal amino acid sequence of the purified protein was found similar to the deduced amino acid sequence (without the transit peptide sequence) of the full length cDNA from M. alba. PMID:23284650
Pyrin gene and mutants thereof, which cause familial Mediterranean fever

DOEpatents

Kastner, Daniel L [Bethesda, MD; Aksentijevichh, Ivona [Bethesda, MD; Centola, Michael [Tacoma Park, MD; Deng, Zuoming [Gaithersburg, MD; Sood, Ramen [Rockville, MD; Collins, Francis S [Rockville, MD; Blake, Trevor [Laytonsville, MD; Liu, P Paul [Ellicott City, MD; Fischel-Ghodsian, Nathan [Los Angeles, CA; Gumucio, Deborah L [Ann Arbor, MI; Richards, Robert I [North Adelaide, AU; Ricke, Darrell O [San Diego, CA; Doggett, Norman A [Santa Cruz, NM; Pras, Mordechai [Tel-Hashomer, IL

2003-09-30

The invention provides the nucleic acid sequence encoding the protein associated with familial Mediterranean fever (FMF). The cDNA sequence is designated as MEFV. The invention is also directed towards fragments of the DNA sequence, as well as the corresponding sequence for the RNA transcript and fragments thereof. Another aspect of the invention provides the amino acid sequence for a protein (pyrin) associated with FMF. The invention is directed towards both the full length amino acid sequence, fusion proteins containing the amino acid sequence and fragments thereof. The invention is also directed towards mutants of the nucleic acid and amino acid sequences associated with FMF. In particular, the invention discloses three missense mutations, clustered in within about 40 to 50 amino acids, in the highly conserved rfp (B30.2) domain at the C-terminal of the protein. These mutants include M6801, M694V, K695R, and V726A. Additionally, the invention includes methods for diagnosing a patient at risk for having FMF and kits therefor.
Conserved noncoding sequences (CNSs) in higher plants.

PubMed

Freeling, Michael; Subramaniam, Shabarinath

2009-04-01

Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and occupies a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
Core histone genes of Giardia intestinalis: genomic organization, promoter structure, and expression

PubMed Central

Yee, Janet; Tang, Anita; Lau, Wei-Ling; Ritter, Heather; Delport, Dewald; Page, Melissa; Adam, Rodney D; Müller, Miklós; Wu, Gang

2007-01-01

Background Giardia intestinalis is a protist found in freshwaters worldwide, and is the most common cause of parasitic diarrhea in humans. The phylogenetic position of this parasite is still much debated. Histones are small, highly conserved proteins that associate tightly with DNA to form chromatin within the nucleus. There are two classes of core histone genes in higher eukaryotes: DNA replication-independent histones and DNA replication-dependent ones. Results We identified two copies each of the core histone H2a, H2b and H3 genes, and three copies of the H4 gene, at separate locations on chromosomes 3, 4 and 5 within the genome of Giardia intestinalis, but no gene encoding a H1 linker histone could be recognized. The copies of each gene share extensive DNA sequence identities throughout their coding and 5' noncoding regions, which suggests these copies have arisen from relatively recent gene duplications or gene conversions. The transcription start sites are at triplet A sequences 1–27 nucleotides upstream of the translation start codon for each gene. We determined that a 50 bp region upstream from the start of the histone H4 coding region is the minimal promoter, and a highly conserved 15 bp sequence called the histone motif (him) is essential for its activity. The Giardia core histone genes are constitutively expressed at approximately equivalent levels and their mRNAs are polyadenylated. Competition gel-shift experiments suggest that a factor within the protein complex that binds him may also be a part of the protein complexes that bind other promoter elements described previously in Giardia. Conclusion In contrast to other eukaryotes, the Giardia genome has only a single class of core histone genes that encode replication-independent histones. Our inability to locate a gene encoding the linker histone H1 leads us to speculate that the H1 protein may not be required for the compaction of Giardia's small and gene-rich genome. PMID:17425802
Use of wavelet-packet transforms to develop an engineering model for multifractal characterization of mutation dynamics in pathological and nonpathological gene sequences

NASA Astrophysics Data System (ADS)

Walker, David Lee

1999-12-01

This study uses dynamical analysis to examine in a quantitative fashion the information coding mechanism in DNA sequences. This exceeds the simple dichotomy of either modeling the mechanism by comparing DNA sequence walks as Fractal Brownian Motion (fbm) processes. The 2-D mappings of the DNA sequences for this research are from Iterated Function System (IFS) (Also known as the ``Chaos Game Representation'' (CGR)) mappings of the DNA sequences. This technique converts a 1-D sequence into a 2-D representation that preserves subsequence structure and provides a visual representation. The second step of this analysis involves the application of Wavelet Packet Transforms, a recently developed technique from the field of signal processing. A multi-fractal model is built by using wavelet transforms to estimate the Hurst exponent, H. The Hurst exponent is a non-parametric measurement of the dynamism of a system. This procedure is used to evaluate gene- coding events in the DNA sequence of cystic fibrosis mutations. The H exponent is calculated for various mutation sites in this gene. The results of this study indicate the presence of anti-persistent, random walks and persistent ``sub-periods'' in the sequence. This indicates the hypothesis of a multi-fractal model of DNA information encoding warrants further consideration. This work examines the model's behavior in both pathological (mutations) and non-pathological (healthy) base pair sequences of the cystic fibrosis gene. These mutations both natural and synthetic were introduced by computer manipulation of the original base pair text files. The results show that disease severity and system ``information dynamics'' correlate. These results have implications for genetic engineering as well as in mathematical biology. They suggest that there is scope for more multi-fractal models to be developed.
Characterization of two chitin synthase genes of the red flour beetle, Tribolium castaneum, and alternate exon usage in one of the genes during development.

PubMed

Arakane, Yasuyuki; Hogenkamp, David G; Zhu, Yu Cheng; Kramer, Karl J; Specht, Charles A; Beeman, Richard W; Kanost, Michael R; Muthukrishnan, Subbaratnam

2004-03-01

Two chitin synthase (CHS) genes of the red flour beetle, Tribolium castaneum, were sequenced and their transcription patterns during development examined. By screening a BAC library of genomic DNA from T. castaneum (Tc) with a DNA probe encoding the catalytic domain of a putative Tribolium CHS, several clones that contained CHS genes were identified. Two distinct PCR products were amplified from these BAC clones and confirmed to be highly similar to CHS genes from other insects, nematodes and fungi. The DNA sequences of these genes, TcCHS1 and TcCHS2, were determined by amplification of overlapping PCR fragments from two of the BAC DNAs and mapped to different linkage groups. Each ORF was identified and full-length cDNAs were also amplified, cloned and sequenced. TcCHS1 and TcCHS2 encode transmembrane proteins of 1558 and 1464 amino acids, respectively. The TcCHS1 gene was found to use alternate exons, each encoding 59 amino acids, a feature not found in the TcCHS2 gene. During development, Tribolium expressed TcCHS1 predominantly in the embryonic and pupal stages, whereas TcCHS2 was prevalent in the late larval and adult stages. The alternate exon 8a of TcCHS1 was utilized over a much broader range of development than exon 8b. We propose that the two isoforms of the TcCHS1 enzyme are used predominantly for the formation of chitin in embryonic and pupal cuticles, whereas TcCHS2 is utilized primarily for the synthesis of peritrophic membrane-associated chitin in the midgut.

Characterization of cDNA encoding molt-inhibiting hormone of the crab, Cancer pagurus; expression of MIH in non-X-organ tissues.

PubMed

Lu, W; Wainwright, G; Olohan, L A; Webster, S G; Rees, H H; Turner, P C

2001-10-31

Synthesis of ecdysteroids (molting hormones) by crustacean Y-organs is regulated by a neuropeptide, molt-inhibiting hormone (MIH), produced in eyestalk neural ganglia. We report here the molecular cloning of a cDNA encoding MIH of the edible crab, Cancer pagurus. Full-length MIH cDNA was obtained by using reverse transcription-polymerase chain reaction (RT-PCR) with degenerate oligonucleotides based upon the amino acid sequence of MIH, in conjunction with 5'- and 3'-RACE. Full-length clones of MIH cDNA were obtained that encoded a 35 amino acid putative signal peptide and the mature 78 amino acid peptide. Of various tissues examined by Northern blot analysis, the X-organ was the sole major site of expression of the MIH gene. However, a nested-PCR approach using non-degenerate MIH-specific primers indicated the presence of MIH transcripts in other tissues. Southern blot analysis indicated a simple gene arrangement with at least two copies of the MIH gene in the genome of C. pagurus. Additional Southern blotting experiments detected MIH-hybridizing bands in another Cancer species, Cancer antennarius and another crab species, Carcinus maenas.
Cloning and characterization of the metE gene encoding S-adenosylmethionine synthetase from Bacillus subtilis.

PubMed Central

Yocum, R R; Perkins, J B; Howitt, C L; Pero, J

1996-01-01

The metE gene, encoding S-adenosylmethionine synthetase (EC 2.5.1.6) from Bacillus subtilis, was cloned in two steps by normal and inverse PCR. The DNA sequence of the metE gene contains an open reading frame which encodes a 400-amino-acid sequence that is homologous to other known S-adenosylmethionine synthetases. The cloned gene complements the metE1 mutation and integrates at or near the chromosomal site of metE1. Expression of S-adenosylmethionine synthetase is reduced by only a factor of about 2 by exogenous methioinine. Overproduction of S-adenosylmethionine synthetase from a strong constitutive promoter leads to methionine auxotrophy in B. subtilis, suggesting that S-adenosylmethionine is a corepressor of methionine biosynthesis in B. subtilis, as others have already shown for Escherichia coli. PMID:8755891
Cloning and characterization of the metE gene encoding S-adenosylmethionine synthetase from Bacillus subtilis.

PubMed

Yocum, R R; Perkins, J B; Howitt, C L; Pero, J

1996-08-01

The metE gene, encoding S-adenosylmethionine synthetase (EC 2.5.1.6) from Bacillus subtilis, was cloned in two steps by normal and inverse PCR. The DNA sequence of the metE gene contains an open reading frame which encodes a 400-amino-acid sequence that is homologous to other known S-adenosylmethionine synthetases. The cloned gene complements the metE1 mutation and integrates at or near the chromosomal site of metE1. Expression of S-adenosylmethionine synthetase is reduced by only a factor of about 2 by exogenous methioinine. Overproduction of S-adenosylmethionine synthetase from a strong constitutive promoter leads to methionine auxotrophy in B. subtilis, suggesting that S-adenosylmethionine is a corepressor of methionine biosynthesis in B. subtilis, as others have already shown for Escherichia coli.
Comparison of Human and Guinea Pig Acetylcholinesterase Sequences and Rates of Oxime-Assisted Reactivation

DTIC Science & Technology

2010-01-01

of appropriate animal model systems. For OP poisoning, the guinea pig (Cavia porcellus) is a commonly used animal model because guinea pigs more...endogenous bioscavenger in vivo. Although guinea pigs historically have been used to test OP poisoning therapies, it has been found recently that guinea pig AChE...transcribed mRNA encoding guinea pig AChE, amplified the resulting cDNA, and sequenced this product. The nucleotide and deduced amino acid sequences of
Analysis of beta-carotene hydroxylase gene cDNA isolated from the American oil-palm (Elaeis oleifera) mesocarp tissue cDNA library

PubMed Central

Bhore, Subhash J; Kassim, Amelia; Loh, Chye Ying; Shah, Farida H

2010-01-01

It is well known that the nutritional quality of the American oil-palm (Elaeis oleifera) mesocarp oil is superior to that of African oil-palm (Elaeis guineensis Jacq. Tenera) mesocarp oil. Therefore, it is of important to identify the genetic features for its superior value. This could be achieved through the genome sequencing of the oil-palm. However, the genome sequence is not available in the public domain due to commercial secrecy. Hence, we constructed a cDNA library and generated expressed sequence tags (3,205) from the mesocarp tissue of the American oil-palm. We continued to annotate each of these cDNAs after submitting to GenBank/DDBJ/EMBL. A rough analysis turned our attention to the beta-carotene hydroxylase (Chyb) enzyme encoding cDNA. Then, we completed the full sequencing of cDNA clone for its both strands using M13 forward and reverse primers. The full nucleotide and protein sequence was further analyzed and annotated using various Bioinformatics tools. The analysis results showed the presence of fatty acid hydroxylase superfamily domain in the protein sequence. The multiple sequence alignment of selected Chyb amino acid sequences from other plant species and algal members with E. oleifera Chyb using ClustalW and its phylogenetic analysis suggest that Chyb from monocotyledonous plant species, Lilium hubrid, Crocus sativus and Zea mays are the most evolutionary related with E. oleifera Chyb. This study reports the annotation of E. oleifera Chyb. Abbreviations ESTs - expressed sequence tags, EoChyb - Elaeis oleifera beta-carotene hydroxylase, MC - main cluster PMID:21364789
The promise and challenge of high-throughput sequencing of the antibody repertoire

PubMed Central

Georgiou, George; Ippolito, Gregory C; Beausang, John; Busse, Christian E; Wardemann, Hedda; Quake, Stephen R

2014-01-01

Efforts to determine the antibody repertoire encoded by B cells in the blood or lymphoid organs using high-throughput DNA sequencing technologies have been advancing at an extremely rapid pace and are transforming our understanding of humoral immune responses. Information gained from high-throughput DNA sequencing of immunoglobulin genes (Ig-seq) can be applied to detect B-cell malignancies with high sensitivity, to discover antibodies specific for antigens of interest, to guide vaccine development and to understand autoimmunity. Rapid progress in the development of experimental protocols and informatics analysis tools is helping to reduce sequencing artifacts, to achieve more precise quantification of clonal diversity and to extract the most pertinent biological information. That said, broader application of Ig-seq, especially in clinical settings, will require the development of a standardized experimental design framework that will enable the sharing and meta-analysis of sequencing data generated by different laboratories. PMID:24441474
Production of hydroxylated fatty acids in genetically modified plants

DOEpatents

Somerville, Chris [Portola Valley, CA; Broun, Pierre [Burlingame, CA; van de Loo, Frank [Weston, AU; Boddupalli, Sekhar S [Manchester, MI

2011-08-23

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.
Production of hydroxylated fatty acids in genetically modified plants

DOEpatents

Somerville, Chris; Broun, Pierre; van de Loo, Frank; Boddupalli, Sekhar S.

2005-08-30

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.
Molecular cloning of crustins from the hemocytes of Brazilian penaeid shrimps.

PubMed

Rosa, Rafael Diego; Bandeira, Paula Terra; Barracco, Margherita Anna

2007-09-01

Crustins are antimicrobial peptides initially identified in the hemocytes of the crab Carcinus maenas (11.5-kDa peptide or carcinin) and recently also recognized in penaeid shrimps and other crustacean species. The aim of this study was to identify sequences encoding for crustins from the hemocytes of four Brazilian penaeid species: Farfantepenaeus paulensis, Farfantepenaeus subtilis, Farfantepenaeus brasiliensis and Litopenaeus schmitti. Using primers based on consensus nucleotide alignment of crustins from different crustaceans, cDNA sequences coding for crustins in all indigenous penaeid species were amplified. The obtained four crustin sequences encoded for peptides containing a hydrophobic N-terminal region rich in glycine repeats and a C-terminal part with 12 cysteine residues and a conserved whey acidic protein domain. All obtained crustin sequences showed high amino acidic similarity among each other and with crustins from litopenaeid shrimps (76-98%). This is the first report of crustins in native Brazilian penaeid shrimps.
Regulation of Sulfur Assimilation Pathways in Burkholderia cenocepacia through Control of Genes by the SsuR Transcription Factor▿

PubMed Central

Łochowska, Anna; Iwanicka-Nowicka, Roksana; Zielak, Agata; Modelewska, Anna; Thomas, Mark S.; Hryniewicz, Monika M.

2011-01-01

The genome of Burkholderia cenocepacia contains two genes encoding closely related LysR-type transcriptional regulators, CysB and SsuR, involved in control of sulfur assimilation processes. In this study we show that the function of SsuR is essential for the utilization of a number of organic sulfur sources of either environmental or human origin. Among the genes upregulated by SsuR identified here are the tauABC operon encoding a predicted taurine transporter, three tauD-type genes encoding putative taurine dioxygenases, and atsA encoding a putative arylsulfatase. The role of SsuR in expression of these genes/operons was characterized through (i) construction of transcriptional reporter fusions to candidate promoter regions and analysis of their expression in the presence/absence of SsuR and (ii) testing the ability of SsuR to bind SsuR-responsive promoter regions. We also demonstrate that expression of SsuR-activated genes is not repressed in the presence of inorganic sulfate. A more detailed analysis of four SsuR-responsive promoter regions indicated that ∼44 bp of the DNA sequence preceding and/or overlapping the predicted −35 element of such promoters is sufficient for SsuR binding. The DNA sequence homology among SsuR “recognition motifs” at different responsive promoters appears to be limited. PMID:21317335
Analysis of expressed sequence tags from the four main developmental stages of Trypanosoma congolense

PubMed Central

Helm, Jared R.; Hertz-Fowler, Christiane; Aslett, Martin; Berriman, Matthew; Sanders, Mandy; Quail, Michael A.; Soares, Marcelo B.; Bonaldo, Maria F.; Sakurai, Tatsuya; Inoue, Noboru; Donelson, John E.

2009-01-01

Trypanosoma congolense is one of the most economically important pathogens of livestock in Africa. Culture-derived parasites of each of the three main insect stages of the T. congolense life cycle, i.e., the procyclic, epimastigote and metacyclic stages, and bloodstream stage parasites isolated from infected mice, were used to construct stage-specific cDNA libraries and expressed sequence tags (ESTs or cDNA clones) in each library were sequenced. Thirteen EST clusters encoding different variant surface glycoproteins (VSGs) were detected in the metacyclic library and twenty-six VSG EST clusters were found in the bloodstream library, six of which are shared by the metacyclic library. Rare VSG ESTs are present in the epimastigote library, and none were detected in the procyclic library. ESTs encoding enzymes that catalyze oxidative phosphorylation and amino acid metabolism are about twice as abundant in the procyclic and epimastigote stages as in the metacyclic and bloodstream stages. In contrast, ESTs encoding enzymes involved in glycolysis, the citric acid cycle and nucleotide metabolism are about the same in all four developmental stages. Cysteine proteases, kinases and phosphatases are the most abundant enzyme groups represented by the ESTs. All four libraries contain T. congolense-specific expressed sequences not present in the T. brucei and T. cruzi genomes. Normalized cDNA libraries were constructed from the metacyclic and bloodstream stages, and found to be further enriched for T. congolense-specific ESTs. Given that cultured T. congolense offers an experimental advantage over other African trypanosome species, these ESTs provide a basis for further investigation of the molecular properties of these four developmental stages, especially the epimastigote and metacyclic stages for which it is difficult to obtain large quantities of organisms. The T. congolense EST databases are available at: http://www.sanger.ac.uk/Projects/T_congolense/EST_index.shtml. PMID:19559733
Structure and characterization of a cDNA clone for phenylalanine ammonia-lyase from cut-injured roots of sweet potato

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tanaka, Yoshiyuki; Matsuoka, Makoto; Yamanoto, Naoki

A cDNA clone for phenylalanine ammonia-lyase (PAL) induced in wounded sweet potato (Ipomoea batatas Lam.) root was obtained by immunoscreening a cDNA library. The protein produced in Escherichia coli cells containing the plasmid pPAL02 was indistinguishable from sweet potato PAL as judged by Ouchterlony double diffusion assays. The M{sub r} of its subunit was 77,000. The cells converted ({sup 14}C)-L-phenylalanine into ({sup 14}C)-t-cinnamic acid and PAL activity was detected in the homogenate of the cells. The activity was dependent on the presence of the pPAL02 plasmid DNA. The nucleotide sequence of the cDNA contained a 2,121-base pair (bp) open-reading framemore » capable of coding for a polypeptide with 707 amino acids (M{sub r} 77,137), a 22-bp 5{prime}-noncoding region and a 207-bp 3{prime}-noncoding region. The results suggest that the insert DNA fully encoded the amino acid sequence for sweet potato PAL that is induced by wounding. Comparison of the deduced amino acid sequence with that of a PAL cDNA fragment from Phaseolus vulgaris revealed 78.9% homology. The sequence from amino acid residues 258 to 494 was highly conserved, showing 90.7% homology.« less
Characterization of rat calcitonin mRNA.

PubMed Central

Amara, S G; David, D N; Rosenfeld, M G; Roos, B A; Evans, R M

1980-01-01

A chimeric plasmic containing cDNA complementary to rat calcitonin mRNA has been constructed. Partial sequence analysis shows that the insert contains a nucleotide sequence encoding the complete amino acid sequence of calcitonin. Two basic amino acids precede and three basic amino acids follow the hormone sequence, suggesting that calcitonin is generated by the proteolytic cleavage of a larger precursor in a manner analogous to that of other small polypeptide hormones. The COOH-terminal proline, known to be amidated in the secreted hormone, is followed by a glycine in the precursor. The cloned calcitonin DNA was used to characterize the expression of calcitonin mRNA. Cytoplasmic mRNAs from calcitonin-producing rat medullary thyroid carcinoma lines and from normal rat thyroid glands contain a single species, 1050 nucleotides long, whch hybridizes to the cloned calcitonin cDNA. The concentration of calcitonin mRNA sequences is greater in those tumors that produce larger amounts of immunoreactive calcitonin. RNAs from other endocrine tissues, including anterior and neurointermediate lobes of rat pituitary, contain no detectable calcitonin mRNA. Images PMID:6933496
The Essential Component in DNA-Based Information Storage System: Robust Error-Tolerating Module

PubMed Central

Yim, Aldrin Kay-Yuen; Yu, Allen Chi-Shing; Li, Jing-Woei; Wong, Ada In-Chun; Loo, Jacky F. C.; Chan, King Ming; Kong, S. K.; Yip, Kevin Y.; Chan, Ting-Fung

2014-01-01

The size of digital data is ever increasing and is expected to grow to 40,000 EB by 2020, yet the estimated global information storage capacity in 2011 is <300 EB, indicating that most of the data are transient. DNA, as a very stable nano-molecule, is an ideal massive storage device for long-term data archive. The two most notable illustrations are from Church et al. and Goldman et al., whose approaches are well-optimized for most sequencing platforms – short synthesized DNA fragments without homopolymer. Here, we suggested improvements on error handling methodology that could enable the integration of DNA-based computational process, e.g., algorithms based on self-assembly of DNA. As a proof of concept, a picture of size 438 bytes was encoded to DNA with low-density parity-check error-correction code. We salvaged a significant portion of sequencing reads with mutations generated during DNA synthesis and sequencing and successfully reconstructed the entire picture. A modular-based programing framework – DNAcodec with an eXtensible Markup Language-based data format was also introduced. Our experiments demonstrated the practicability of long DNA message recovery with high error tolerance, which opens the field to biocomputing and synthetic biology. PMID:25414846
Genome organization of Tobacco leaf curl Zimbabwe virus, a new, distinct monopartite begomovirus associated with subgenomic defective DNA molecules.

PubMed

Paximadis, M; Rey, M E

2001-12-01

The complete DNA A of the begomovirus Tobacco leaf curl Zimbabwe virus (TbLCZWV) was sequenced: it comprises 2767 nucleotides with six major open reading frames encoding proteins with molecular masses greater than 9 kDa. Full-length TbLCZWV DNA A tandem dimers, cloned in binary vectors (pBin19 and pBI121) and transformed into Agrobacterium tumefaciens, were systemically infectious upon agroinoculation of tobacco and tomato. Efforts to identify a DNA B component were unsuccessful. These findings suggest that TbLCZWV is a new member of the monopartite group of begomoviruses. Phylogenetic analysis identified TbLCZWV as a distinct begomovirus with its closest relative being Chayote mosaic virus. Abutting primer PCR amplified ca. 1300 bp molecules, and cloning and sequencing of two of these molecules revealed them to be subgenomic defective DNA molecules originating from TbLCZWV DNA A. Variable symptom severity associated with tobacco leaf curl disease and TbLCZWV is discussed.
DNA polymerase ι: The long and the short of it!

PubMed

Frank, Ekaterina G; McLenigan, Mary P; McDonald, John P; Huston, Donald; Mead, Samantha; Woodgate, Roger

2017-10-01

The cDNA encoding human DNA polymerase ι (POLI) was cloned in 1999. At that time, it was believed that the POLI gene encoded a protein of 715 amino acids. Advances in DNA sequencing technologies led to the realization that there is an upstream, in-frame initiation codon that would encode a DNA polymerase ι (polι) protein of 740 amino acids. The extra 25 amino acid region is rich in acidic residues (11/25) and is reasonably conserved in eukaryotes ranging from fish to humans. As a consequence, the curated Reference Sequence (RefSeq) database identified polι as a 740 amino acid protein. However, the existence of the 740 amino acid polι has never been shown experimentally. Using highly specific antibodies to the 25 N-terminal amino acids of polι, we were unable to detect the longer 740 amino acid (ι-long) isoform in western blots. However, trace amounts of the ι-long isoform were detected after enrichment by immunoprecipitation. One might argue that the longer isoform may have a distinct biological function, if it exhibits significant differences in its enzymatic properties from the shorter, well-characterized 715 amino acid polι. We therefore purified and characterized recombinant full-length (740 amino acid) polι-long and compared it to full-length (715 amino acid) polι-short in vitro. The metal ion requirements for optimal catalytic activity differ slightly between ι-long and ι-short, but under optimal conditions, both isoforms exhibit indistinguishable enzymatic properties in vitro. We also report that like ι-short, the ι-long isoform can be monoubiquitinated and polyubiuquitinated in vivo, as well as form damage induced foci in vivo. We conclude that the predominant isoform of DNA polι in human cells is the shorter 715 amino acid protein and that if, or when, expressed, the longer 740 amino acid isoform has identical properties to the considerably more abundant shorter isoform. Published by Elsevier B.V.
Newly identified allatostatin Bs and their receptor in the two-spotted cricket, Gryllus bimaculatus.

PubMed

Tsukamoto, Yusuke; Nagata, Shinji

2016-06-01

A cDNA encoding allatostatin Bs (ASTBs) containing the W(X)6W motif was identified using a database generated by a next generation sequencer (NGS) in the two-spotted cricket, Gryllus bimaculatus. The contig sequence revealed the presence of five novel putative ASTBs (GbASTBs) in addition to GbASTBs previously identified in G. bimaculatus. MALDI-TOF MS analyses revealed the presence of these novel and previously identified GbASTBs with three missing GbASTBs. We also identified a cDNA encoding G. bimaculatus GbASTB receptor (GbASTBR) in the NGS data. Phylogenetic analysis demonstrated that this receptor was highly conserved with other insect ASTBRs, including the sex peptide receptor of Drosophila melanogaster. Calcium imaging analyses indicated that the GbASTBR heterologously expressed in HEK293 cells exhibited responses to all identified GbASTBs at a concentration range of 10(-10)-10(-5)M. Copyright © 2016 Elsevier Inc. All rights reserved.
Synthetic biology to access and expand nature’s chemical diversity

PubMed Central

Smanski, Michael J.; Zhou, Hui; Claesen, Jan; Shen, Ben; Fischbach, Michael; Voigt, Christopher A.

2016-01-01

Bacterial genomes encode the biosynthetic potential to produce hundreds of thousands of complex molecules with diverse applications, from medicine to agriculture and materials. Economically accessing the potential encoded within sequenced genomes promises to reinvigorate waning drug discovery pipelines and provide novel routes to intricate chemicals. This is a tremendous undertaking, as the pathways often comprise dozens of genes spanning as much as 100+ kiliobases of DNA, are controlled by complex regulatory networks, and the most interesting molecules are made by non-model organisms. Advances in synthetic biology address these issues, including DNA construction technologies, genetic parts for precision expression control, synthetic regulatory circuits, computer aided design, and multiplexed genome engineering. Collectively, these technologies are moving towards an era when chemicals can be accessed en mass based on sequence information alone. This will enable the harnessing of metagenomic data and massive strain banks for high-throughput molecular discovery and, ultimately, the ability to forward design pathways to complex chemicals not found in nature. PMID:26876034
Investigating the Genome Diversity of B. cereus and Evolutionary Aspects of B. anthracis Emergence

PubMed Central

Papazisi, Leka; Rasko, David A.; Ratnayake, Shashikala; Bock, Geoff R.; Remortel, Brian G.; Appalla, Lakshmi; Liu, Jia; Dracheva, Tatiana; Braisted, John C.; Shallom, Shamira; Jarrahi, Benham; Snesrud, Erik; Ahn, Susie; Sun, Qiang; Rilstone, Jenifer; Økstad, Ole Andreas; Kolstø, Anne-Brit; Fleischmann, Robert D.; Peterson, Scott N.

2011-01-01

Here we report the use of a multi-genome DNA microarray to investigate the genome diversity of Bacillus cereus group members and elucidate the events associated with the emergence of B. anthracis the causative agent of anthrax–a lethal zoonotic disease. We initially performed directed genome sequencing of seven diverse B. cereus strains to identify novel sequences encoded in those genomes. The novel genes identified, combined with those publicly available, allowed the design of a “species” DNA microarray. Comparative genomic hybridization analyses of 41 strains indicates that substantial heterogeneity exists with respect to the genes comprising functional role categories. While the acquisition of the plasmid-encoded pathogenicity island (pXO1) and capsule genes (pXO2) represent a crucial landmark dictating the emergence of B. anthracis, the evolution of this species and its close relatives was associated with an overall a shift in the fraction of genes devoted to energy metabolism, cellular processes, transport, as well as virulence. PMID:21447378
DOE Office of Scientific and Technical Information (OSTI.GOV)

Cheng, J.; Liu, C.; Koopman, W.J.

Ligation of the Fas cell-surface molecule induces apoptosis. Defective Fas-mediated apoptosis has been associated with spontaneous autoimmunity in mice. Using human Fas/Apo-1 cDNA as a probe, the authors have molecularly cloned and characterized the human Fas chromosomal gene. The gene consists of nine exons and spans more than 26 kilobases of DNA. The lengths of introns vary from > 14 kilobases at the 5` end of the gene to 152 base pairs upstream of the exon encoding the transmembrane domain. The domain structure of the human Fas is encoded by an exon or a set of exons. Primer extension analysismore » revealed three major transcription initiation sites. The promoter region lacked canonical {open_quotes}TATA{close_quotes} and {open_quotes}CAAT{close_quotes} boxes but was a {open_quotes}GC-rich{close_quotes} sequence, and contained consensus sequences for AP-1, GF-1, NY-Y, CP-2, EBP20, and c-myb. These data provide the first characterization of the human Fas gene and insight into its regulatory region. 54 refs., 3 figs., 1 tab.« less

Regulation and Functional Expression of Cinnamate 4-Hydroxylase from Parsley

PubMed Central

Koopmann, Edda; Logemann, Elke; Hahlbrock, Klaus

1999-01-01

A previously isolated parsley (Petroselinum crispum) cDNA with high sequence similarity to cinnamate 4-hydroxylase (C4H) cDNAs from several plant sources was expressed in yeast (Saccharomyces cerevisiae) containing a plant NADPH:cytochrome P450 oxidoreductase and verified as encoding a functional C4H (CYP73A10). Low genomic complexity and the occurrence of a single type of cDNA suggest the existence of only one C4H gene in parsley. The encoded mRNA and protein, in contrast to those of a functionally related NADPH:cytochrome P450 oxidoreductase, were strictly coregulated with phenylalanine ammonia-lyase mRNA and protein, respectively, as demonstrated by coinduction under various conditions and colocalization in situ in cross-sections from several different parsley tissues. These results support the hypothesis that the genes encoding the core reactions of phenylpropanoid metabolism form a tight regulatory unit. PMID:9880345
The complete sequence and promoter activity of the human A-raf-1 gene (ARAF1)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, J.E.; Beck, T.W.; Brennscheidt, U.

1994-03-01

The raf proto-oncogenes encode cytoplasmic protein serine/threonine kinases, which play a critical role in cell growth and development. One of these, A-raf-1 (human gene symbol, ARAF1), which is predominantly expressed in mouse urogenital tissues, has been mapped to an evolutionarily conserved linkage group composed of ARAF1, SYN1, TIMP, and properdin located at human chromosome Xp11.2. The authors have isolated human genomic DNA clones containing the expressed gene (ARAF1) on the X chromosome and a pseudogene (ARAF2) on chromosome 7p12-q11.21. Analysis of the nucleotide sequence from the ARAF1 genomic clones demonstrated that it consists of 16 exons encoded by minimally 10,776more » nucleotides. The major transcriptional start site (+1) was determined by RNase protection and primer extension assays. Promoter activity was confirmed by functional assays using DNA fragments fused to a CAT reporter gene. The ARAF1 minimal promoter, located between nucleotides -59 and +93, has a low G + C content and lacks consensus TATA and Inr sequences but shows sequence similarity at position -1 to the E box that is known to interact with USF and TFII-I transcription factors. 65 refs., 7 figs., 1 tab.« less
An Archaeal Immune System Can Detect Multiple Protospacer Adjacent Motifs (PAMs) to Target Invader DNA*

PubMed Central

Fischer, Susan; Maier, Lisa-Katharina; Stoll, Britta; Brendel, Jutta; Fischer, Eike; Pfeiffer, Friedhelm; Dyall-Smith, Mike; Marchfelder, Anita

2012-01-01

The clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated (Cas) system provides adaptive and heritable immunity against foreign genetic elements in most archaea and many bacteria. Although this system is widespread and diverse with many subtypes, only a few species have been investigated to elucidate the precise mechanisms for the defense of viruses or plasmids. Approximately 90% of all sequenced archaea encode CRISPR/Cas systems, but their molecular details have so far only been examined in three archaeal species: Sulfolobus solfataricus, Sulfolobus islandicus, and Pyrococcus furiosus. Here, we analyzed the CRISPR/Cas system of Haloferax volcanii using a plasmid-based invader assay. Haloferax encodes a type I-B CRISPR/Cas system with eight Cas proteins and three CRISPR loci for which the identity of protospacer adjacent motifs (PAMs) was unknown until now. We identified six different PAM sequences that are required upstream of the protospacer to permit target DNA recognition. This is only the second archaeon for which PAM sequences have been determined, and the first CRISPR group with such a high number of PAM sequences. Cells could survive the plasmid challenge if their CRISPR/Cas system was altered or defective, e.g. by deletion of the cas gene cassette. Experimental PAM data were supplemented with bioinformatics data on Haloferax and Haloquadratum. PMID:22767603
Cloning, expression and N-terminal myristoylation of CpCPK1, a calcium-dependent protein kinase from zucchini (Cucurbita pepo L.).

PubMed

Ellard-Ivey, M; Hopkins, R B; White, T J; Lomax, T L

1999-01-01

We have isolated a full-length cDNA clone (CpCDPK1) encoding a calcium-dependent protein kinase (CDPK) gene from zucchini (Cucurbita pepo L.). The predicted amino acid sequence of the cDNA shows a remarkably high degree of similarity to members of the CDPK gene family from Arabidopsis thaliana, especially AtCPK1 and AtCPK2. Northern analysis of steady-state mRNA levels for CpCPK1 in etiolated and light-grown zucchini seedlings shows that the transcript is most abundant in etiolated hypocotyls and overall expression is suppressed by light. As described for other members of the CDPK gene family from different species, the CpCPK1 clone has a putative N-terminal myristoylation sequence. In this study, site-directed mutagenesis and an in vitro coupled transcription/translation system were used to demonstrate that the protein encoded by this cDNA is specifically myristoylated by a plant N-myristoyl transferase. This is the first demonstration of myristoylation of a CDPK protein which may contribute to the mechanism by which this protein is localized to the plasma membrane.
Mitochondrial myopathy, lactic acidosis, and sideroblastic anemia (MLASA) plus associated with a novel de novo mutation (m.8969G>A) in the mitochondrial encoded ATP6 gene.

PubMed

Burrage, Lindsay C; Tang, Sha; Wang, Jing; Donti, Taraka R; Walkiewicz, Magdalena; Luchak, J Michael; Chen, Li-Chieh; Schmitt, Eric S; Niu, Zhiyv; Erana, Rodrigo; Hunter, Jill V; Graham, Brett H; Wong, Lee-Jun; Scaglia, Fernando

2014-11-01

Mitochondrial myopathy, lactic acidosis and sideroblastic anemia (MLASA) is a rare mitochondrial disorder that has previously been associated with mutations in PUS1 and YARS2. In the present report, we describe a 6-year old male with an MLASA plus phenotype. This patient had features of MLASA in the setting of developmental delay, sensorineural hearing loss, epilepsy, agenesis of the corpus callosum, failure to thrive, and stroke-like episodes. Sequencing of the mitochondrial genome identified a novel de novo, heteroplasmic mutation in the mitochondrial DNA (mtDNA) encoded ATP6 gene (m.8969G>A, p.S148N). Whole exome sequencing did not identify mutations or variants in PUS1 or YARS2 or any known nuclear genes that could affect mitochondrial function and explain this phenotype. Studies of fibroblasts derived from the patient revealed a decrease in oligomycin-sensitive respiration, a finding which is consistent with a complex V defect. Thus, this mutation in MT-ATP6 may represent the first mtDNA point mutation associated with the MLASA phenotype. Copyright © 2014 Elsevier Inc. All rights reserved.
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus

NASA Astrophysics Data System (ADS)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.
Diversity Analysis of Dairy and Nondairy Lactococcus lactis Isolates, Using a Novel Multilocus Sequence Analysis Scheme and (GTG)5-PCR Fingerprinting▿

PubMed Central

Rademaker, Jan L. W.; Herbet, Hélène; Starrenburg, Marjo J. C.; Naser, Sabri M.; Gevers, Dirk; Kelly, William J.; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E. T.

2007-01-01

The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)5-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene. PMID:17890345
Diversity analysis of dairy and nondairy Lactococcus lactis isolates, using a novel multilocus sequence analysis scheme and (GTG)5-PCR fingerprinting.

PubMed

Rademaker, Jan L W; Herbet, Hélène; Starrenburg, Marjo J C; Naser, Sabri M; Gevers, Dirk; Kelly, William J; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E T

2007-11-01

The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)(5)-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene.
Effects of pre- and pro-sequence of thaumatin on the secretion by Pichia pastoris.

PubMed

Ide, Nobuyuki; Masuda, Tetsuya; Kitabatake, Naofumi

2007-11-23

Thaumatin is a 22-kDa sweet-tasting protein containing eight disulfide bonds. When thaumatin is expressed in Pichia pastoris using the thaumatin cDNA fused with both the alpha-factor signal sequence and the Kex2 protease cleavage site from Saccharomyces cerevisiae, the N-terminal sequence of the secreted thaumatin molecule is not processed correctly. To examine the role of the thaumatin cDNA-encoded N-terminal pre-sequence and C-terminal pro-sequence on the processing of thaumatin and efficiency of thaumatin production in P. pastoris, four expression plasmids with different pre-sequence and pro-sequence were constructed and transformed into P. pastoris. The transformants containing pre-thaumatin gene that has the native plant signal, secreted thaumatin molecules in the medium. The N-terminal amino acid sequence of the secreted thaumatin molecule was processed correctly. The production yield of thaumatin was not affected by the C-terminal pro-sequence, and the pro-sequence was not processed in P. pastoris, indicating that pro-sequence is not necessary for thaumatin synthesis.
Molecular Characterization of a Novel Temperate Sinorhizobium Bacteriophage, ФLM21, Encoding DNA Methyltransferase with CcrM-Like Specificity

PubMed Central

Dziewit, Lukasz; Oscik, Karolina; Bartosik, Dariusz

2014-01-01

ABSTRACT ΦLM21 is a temperate phage isolated from Sinorhizobium sp. strain LM21 (Alphaproteobacteria). Genomic analysis and electron microscopy suggested that ΦLM21 is a member of the family Siphoviridae. The phage has an isometric head and a long noncontractile tail. The genome of ΦLM21 has 50,827 bp of linear double-stranded DNA encoding 72 putative proteins, including proteins responsible for the assembly of the phage particles, DNA packaging, transcription, replication, and lysis. Virion proteins were characterized using mass spectrometry, leading to the identification of the major capsid and tail components, tape measure, and a putative portal protein. We have confirmed the activity of two gene products, a lytic enzyme (a putative chitinase) and a DNA methyltransferase, sharing sequence specificity with the cell cycle-regulating methyltransferase (CcrM) of the bacterial host. Interestingly, the genome of Sinorhizobium phage ΦLM21 shows very limited similarity to other known phage genome sequences and is thus considered unique. IMPORTANCE Prophages are known to play an important role in the genomic diversification of bacteria via horizontal gene transfer. The influence of prophages on pathogenic bacteria is very well documented. However, our knowledge of the overall impact of prophages on the survival of their lysogenic, nonpathogenic bacterial hosts is still limited. In particular, information on prophages of the agronomically important Sinorhizobium species is scarce. In this study, we describe the isolation and molecular characterization of a novel temperate bacteriophage, ΦLM21, of Sinorhizobium sp. LM21. Since we have not found any similar sequences, we propose that this bacteriophage is a novel species. We conducted a functional analysis of selected proteins. We have demonstrated that the phage DNA methyltransferase has the same sequence specificity as the cell cycle-regulating methyltransferase CcrM of its host. We point out that this phenomenon of mimicking the host regulatory mechanisms by viruses is quite common in bacteriophages. PMID:25187538
A novel image encryption algorithm based on the chaotic system and DNA computing

NASA Astrophysics Data System (ADS)

Chai, Xiuli; Gan, Zhihua; Lu, Yang; Chen, Yiran; Han, Daojun

A novel image encryption algorithm using the chaotic system and deoxyribonucleic acid (DNA) computing is presented. Different from the traditional encryption methods, the permutation and diffusion of our method are manipulated on the 3D DNA matrix. Firstly, a 3D DNA matrix is obtained through bit plane splitting, bit plane recombination, DNA encoding of the plain image. Secondly, 3D DNA level permutation based on position sequence group (3DDNALPBPSG) is introduced, and chaotic sequences generated from the chaotic system are employed to permutate the positions of the elements of the 3D DNA matrix. Thirdly, 3D DNA level diffusion (3DDNALD) is given, the confused 3D DNA matrix is split into sub-blocks, and XOR operation by block is manipulated to the sub-DNA matrix and the key DNA matrix from the chaotic system. At last, by decoding the diffused DNA matrix, we get the cipher image. SHA 256 hash of the plain image is employed to calculate the initial values of the chaotic system to avoid chosen plaintext attack. Experimental results and security analyses show that our scheme is secure against several known attacks, and it can effectively protect the security of the images.
Novel Structure of Ty3 Reverse Transcriptase | Center for Cancer Research

Cancer.gov

Retrotransposons are mobile genetic elements that self amplify via a single-stranded RNA intermediate, which is converted to double-stranded DNA by an encoded reverse transcriptase (RT) with both DNA polymerase (pol) and ribonuclease H (RNase) activities. Categorized by whether they contain flanking long terminal repeat (LTR) sequences, retrotransposons play a critical role in the architecture of eukaryotic genomes and are the evolutionary origin of retroviruses, including human immunodeficiency virus (HIV).
Hierarchical assembly of viral nanotemplates with encoded microparticles via nucleic acid hybridization.

PubMed

Tan, Wui Siew; Lewis, Christina L; Horelik, Nicholas E; Pregibon, Daniel C; Doyle, Patrick S; Yi, Hyunmin

2008-11-04

We demonstrate hierarchical assembly of tobacco mosaic virus (TMV)-based nanotemplates with hydrogel-based encoded microparticles via nucleic acid hybridization. TMV nanotemplates possess a highly defined structure and a genetically engineered high density thiol functionality. The encoded microparticles are produced in a high throughput microfluidic device via stop-flow lithography (SFL) and consist of spatially discrete regions containing encoded identity information, an internal control, and capture DNAs. For the hybridization-based assembly, partially disassembled TMVs were programmed with linker DNAs that contain sequences complementary to both the virus 5' end and a selected capture DNA. Fluorescence microscopy, atomic force microscopy (AFM), and confocal microscopy results clearly indicate facile assembly of TMV nanotemplates onto microparticles with high spatial and sequence selectivity. We anticipate that our hybridization-based assembly strategy could be employed to create multifunctional viral-synthetic hybrid materials in a rapid and high-throughput manner. Additionally, we believe that these viral-synthetic hybrid microparticles may find broad applications in high capacity, multiplexed target sensing.
Nuclear magnetic resonance-based model of a TF1/HmU-DNA complex.

PubMed

Silva, M V; Pasternack, L B; Kearns, D R

1997-12-15

Transcription factor 1 (TF1), a type II DNA-binding protein encoded by the Bacillus subtilis bacteriophage SPO1, has the capacity for sequence-selective DNA binding and a preference for 5-hydroxymethyl-2'-deoxyuridine (HmU)-containing DNA. In NMR studies of the TF1/HmU-DNA complex, intermolecular NOEs indicate that the flexible beta-ribbon and C-terminal alpha-helix are involved in the DNA-binding site of TF1, placing it in the beta-sheet category of DNA-binding proteins proposed to bind by wrapping two beta-ribbon "arms" around the DNA. Intermolecular and intramolecular NOEs were used to generate an energy-minimized model of the protein-DNA complex in which both DNA bending and protein structure changes are evident.
Identification and molecular characterization of a novel circular single-stranded DNA virus associated with yerba mate in Argentina.

PubMed

Bejerman, Nicolás; de Breuil, Soledad; Nome, Claudia

2018-06-06

A single-stranded DNA (ssDNA) virus was detected in Yerba mate samples showing chlorotic linear patterns, chlorotic rings and vein yellowing. The full-genome sequences of six different isolates of this ssDNA circular virus were obtained, which share > 99% sequence identity with each other. The newly identified virus has been tentatively named as yerba mate-associated circular DNA virus (YMaCV). The 2707 nt-long viral genome has two and three open reading frame on its complementary and virion-sense strands, respectively. The coat protein is more similar to that of mastreviruses (44% identity), whereas the replication-associated protein of YMaCV is more similar (49% identity) to that encoded by a recently described, unclassified ssDNA virus isolated on trees in Brazil. This is the first report of a circular DNA virus associated with yerba mate. Its unique genome organization and phylogenetic relationships indicates that YMaCV represents a distinct evolutionary lineage within the ssDNA viruses and therefore this virus should be classified as a member of a new species within an unassigned genus or family.
DNA-encoded chemical libraries: advancing beyond conventional small-molecule libraries.

PubMed

Franzini, Raphael M; Neri, Dario; Scheuermann, Jörg

2014-04-15

DNA-encoded chemical libraries (DECLs) represent a promising tool in drug discovery. DECL technology allows the synthesis and screening of chemical libraries of unprecedented size at moderate costs. In analogy to phage-display technology, where large antibody libraries are displayed on the surface of filamentous phage and are genetically encoded in the phage genome, DECLs feature the display of individual small organic chemical moieties on DNA fragments serving as amplifiable identification barcodes. The DNA-tag facilitates the synthesis and allows the simultaneous screening of very large sets of compounds (up to billions of molecules), because the hit compounds can easily be identified and quantified by PCR-amplification of the DNA-barcode followed by high-throughput DNA sequencing. Several approaches have been used to generate DECLs, differing both in the methods used for library encoding and for the combinatorial assembly of chemical moieties. For example, DECLs can be used for fragment-based drug discovery, displaying a single molecule on DNA or two chemical moieties at the extremities of complementary DNA strands. DECLs can vary substantially in the chemical structures and the library size. While ultralarge libraries containing billions of compounds have been reported containing four or more sets of building blocks, also smaller libraries have been shown to be efficient for ligand discovery. In general, it has been found that the overall library size is a poor predictor for library performance and that the number and diversity of the building blocks are rather important indicators. Smaller libraries consisting of two to three sets of building blocks better fulfill the criteria of drug-likeness and often have higher quality. In this Account, we present advances in the DECL field from proof-of-principle studies to practical applications for drug discovery, both in industry and in academia. DECL technology can yield specific binders to a variety of target proteins and is likely to become a standard tool for pharmaceutical hit discovery, lead expansion, and Chemical Biology research. The introduction of new methodologies for library encoding and for compound synthesis in the presence of DNA is an exciting research field and will crucially contribute to the performance and the propagation of the technology.
Molecular cloning and characterization of a cDNA encoding a novel apoplastic protein preferentially expressed in a shikonin-producing callus strain of Lithospermum erythrorhizon.

PubMed

Yamamura, Yoshimi; Sahin, F Pinar; Nagatsu, Akito; Mizukami, Hajime

2003-04-01

A cDNA (LEPS-2) encoding a novel cell wall protein was cloned from shikonin-producing callus tissues of Lithospermum erythrorhizon by differential display between a shikonin-producing culture strain and a non-producing strain. The LEPS-2 cDNA encoded a polypeptide of 184 amino acids. The deduced amino acid sequence exhibited no significant homology with known proteins. Expression of LEPS-2 gene as well as accumulation of LEPS-2 protein was highly correlated with shikonin production in L. erythrorhizon cells in culture. In the intact plant, expression of LEPS-2 was detected only in the roots where shikonin pigments accumulated. Cell fractionation experiments and immunocytochemical analysis showed that the protein was localized in the apoplast fraction of the cell walls. The shikonin pigments were also stored on the cell walls as oil droplets. These results indicate that expression of the LEPS-2 is closely linked with shikonin biosynthesis and the LEPS-2 protein may be involved in the intra-cell wall trapping of shikonin pigments.
Molecular cloning of two human liver 3 alpha-hydroxysteroid/dihydrodiol dehydrogenase isoenzymes that are identical with chlordecone reductase and bile-acid binder.

PubMed Central

Deyashiki, Y; Ogasawara, A; Nakayama, T; Nakanishi, M; Miyabe, Y; Sato, K; Hara, A

1994-01-01

Human liver contains two dihydrodiol dehydrogenases, DD2 and DD4, associated with 3 alpha-hydroxysteroid dehydrogenase activity. We have raised polyclonal antibodies that cross-reacted with the two enzymes and isolated two 1.2 kb cDNA clones (C9 and C11) for the two enzymes from a human liver cDNA library using the antibodies. The clones of C9 and C11 contained coding sequences corresponding to 306 and 321 amino acid residues respectively, but lacked 5'-coding regions around the initiation codon. Sequence analyses of several peptides obtained by enzymic and chemical cleavages of the two purified enzymes verified that the C9 and C11 clones encoded DD2 and DD4 respectively, and further indicated that the sequence of DD2 had at least additional 16 residues upward from the N-terminal sequence deduced from the cDNA. There was 82% amino acid sequence identity between the two enzymes, indicating that the enzymes are genetic isoenzymes. A computer-based comparison of the cDNAs of the isoenzymes with the DNA sequence database revealed that the nucleotide and amino acid sequences of DD2 and DD4 are virtually identical with those of human bile-acid binder and human chlordecone reductase cDNAs respectively. Images Figure 1 PMID:8172617
Assessing information content and interactive relationships of subgenomic DNA sequences of the MHC using complexity theory approaches based on the non-extensive statistical mechanics

NASA Astrophysics Data System (ADS)

Karakatsanis, L. P.; Pavlos, G. P.; Iliopoulos, A. C.; Pavlos, E. G.; Clark, P. M.; Duke, J. L.; Monos, D. S.

2018-09-01

This study combines two independent domains of science, the high throughput DNA sequencing capabilities of Genomics and complexity theory from Physics, to assess the information encoded by the different genomic segments of exonic, intronic and intergenic regions of the Major Histocompatibility Complex (MHC) and identify possible interactive relationships. The dynamic and non-extensive statistical characteristics of two well characterized MHC sequences from the homozygous cell lines, PGF and COX, in addition to two other genomic regions of comparable size, used as controls, have been studied using the reconstructed phase space theorem and the non-extensive statistical theory of Tsallis. The results reveal similar non-linear dynamical behavior as far as complexity and self-organization features. In particular, the low-dimensional deterministic nonlinear chaotic and non-extensive statistical character of the DNA sequences was verified with strong multifractal characteristics and long-range correlations. The nonlinear indices repeatedly verified that MHC sequences, whether exonic, intronic or intergenic include varying levels of information and reveal an interaction of the genes with intergenic regions, whereby the lower the number of genes in a region, the less the complexity and information content of the intergenic region. Finally we showed the significance of the intergenic region in the production of the DNA dynamics. The findings reveal interesting content information in all three genomic elements and interactive relationships of the genes with the intergenic regions. The results most likely are relevant to the whole genome and not only to the MHC. These findings are consistent with the ENCODE project, which has now established that the non-coding regions of the genome remain to be of relevance, as they are functionally important and play a significant role in the regulation of expression of genes and coordination of the many biological processes of the cell.
Isolation and characterization of full-length putative alcohol dehydrogenase genes from polygonum minus

NASA Astrophysics Data System (ADS)

Hamid, Nur Athirah Abd; Ismail, Ismanizan

2013-11-01

Polygonum minus, locally named as Kesum is an aromatic herb which is high in secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde with the presence of NAD(P)(H) as co-factor. The main focus of this research is to identify the gene of ADH. The total RNA was extracted from leaves of P. minus which was treated with 150 μM Jasmonic acid. Full-length cDNA sequence of ADH was isolated via rapid amplification cDNA end (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence and PCR was done on genomic DNA to determine the exon and intron organization. Two sequences of ADH, designated as PmADH1 and PmADH2 were successfully isolated. Both sequences have ORF of 801 bp which encode 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that both sequences are highly similar at the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid is differ at the 107 residue; PmADH1 contains Gly (G) residue while PmADH2 contains Cys (C) residue. The intron-exon organization pattern of both sequences are also same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are the members of short chain alcohol dehydrogenase family.

The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

PubMed

Holland, M J; Holland, J P; Thill, G P; Jackson, K A

1981-02-10

Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5- noncoding portions of these glycolytic genes.
Intramolecular transposition by a synthetic IS50 (Tn5) derivative

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tomcsanyi, T.; Phadnis, S.H.; Berg, D.E.

1990-11-01

We report the formation of deletions and inversions by intramolecular transposition of Tn5-derived mobile elements. The synthetic transposons used contained the IS50 O and I end segments and the transposase gene, a contraselectable gene encoding sucrose sensitivity (sacB), antibiotic resistance genes, and a plasmid replication origin. Both deletions and inversions were associated with loss of a 300-bp segment that is designated the vector because it is outside of the transposon. Deletions were severalfold more frequent than inversions, perhaps reflecting constraints on DNA twisting or abortive transposition. Restriction and DNA sequence analyses showed that both types of rearrangements extended from onemore » transposon end to many different sites in target DNA. In the case of inversions, transposition generated 9-bp direct repeats of target sequences.« less
An oleate 12-hydroxylase from Ricinus communis L. is a fatty acyl desaturase homolog

DOE Office of Scientific and Technical Information (OSTI.GOV)

Van De Loo, F.J.; Broun, P.; Turner, S.

1995-07-18

Recent spectroscopic evidence implicating a binuclear iron site at the reaction center of fatty acyl desaturases suggested to us that certain fatty acyl hydroxylases may share significant amino acid sequence similarity with desaturases. To test this theory, we prepared a cDNA library from developing endosperm of the castor-oil plant (Ricinus communis L.) and obtained partial nucleotide sequences for 468 anonymous clones that were not expressed at high levels in leaves, a tissue deficient in 12-hydroxyoleic acid. This resulted in the identification of several cDNA clones encoding a polypeptide of 387 amino acids with a predicted molecular weight of 44,407 andmore » with {approx}67% sequence homology to microsomal oleate desaturase from Arabidopsis. Expression of a full-length clone under control of the cauliflower mosaic virus 35S promoter in transgenic tobacco resulted in the accumulation of low levels of 12-hydroxyoleic acid in seeds, indicating that the clone encodes the castor oleate hydroxylase. These results suggest that fatty acyl desaturases and hydroxylases share similar reaction mechanisms and provide an example of enzyme evolution. 26 refs., 6 figs., 1 tab.« less
In silico analysis of β-1,3-glucanase from a psychrophilic yeast, Glaciozyma antarctica PI12

NASA Astrophysics Data System (ADS)

Mohammadi, Salimeh; Bakar, Farah Diba Abu; Rabu, Amir; Murad, Abdul Munir Abdul

2014-09-01

1,3-beta-glucanase is an industrially important enzyme having wide range of applications especially in food industry. It is crucial to gain an understanding about the structure and functional aspects of various beta-1,3-glucanase produced from diverse sources. In this, study a cDNA encoding β-1,3-glucanase (GaExg55) was isolated from a psychrophilic yeast, Glaciozyma antarctica PI12. The cDNA sequence has been submitted to Genbank with an accession number (KJ436377). Subsequently, the perdition protein was analyzed using various bioinformatics tools to explore the properties of the protein. GaEXG55 is consisting of 1,440-bp nucleotides encoding 480 amino acid residues. Alignment of the deduced amino acid for GaExg55 with other exo-β-1,3-glucanase available at the NCBI database indicate that deduced amino acids shared a consensus motif NEP, which is signature pattern of GH5 hydrolases. Predicted molecular weight of GaExg55 is 53.66 kDa. GaExg55 sequences possesses signal peptide sequence and it is highly conserved with other fungal exo-beta-1,3 glucanase.
Herpes simplex virus DNA packaging sequences adopt novel structures that are specifically recognized by a component of the cleavage and packaging machinery.

PubMed

Adelman, K; Salmon, B; Baines, J D

2001-03-13

The product of the herpes simplex virus type 1 U(L)28 gene is essential for cleavage of concatemeric viral DNA into genome-length units and packaging of this DNA into viral procapsids. To address the role of U(L)28 in this process, purified U(L)28 protein was assayed for the ability to recognize conserved herpesvirus DNA packaging sequences. We report that DNA fragments containing the pac1 DNA packaging motif can be induced by heat treatment to adopt novel DNA conformations that migrate faster than the corresponding duplex in nondenaturing gels. Surprisingly, these novel DNA structures are high-affinity substrates for U(L)28 protein binding, whereas double-stranded DNA of identical sequence composition is not recognized by U(L)28 protein. We demonstrate that only one strand of the pac1 motif is responsible for the formation of novel DNA structures that are bound tightly and specifically by U(L)28 protein. To determine the relevance of the observed U(L)28 protein-pac1 interaction to the cleavage and packaging process, we have analyzed the binding affinity of U(L)28 protein for pac1 mutants previously shown to be deficient in cleavage and packaging in vivo. Each of the pac1 mutants exhibited a decrease in DNA binding by U(L)28 protein that correlated directly with the reported reduction in cleavage and packaging efficiency, thereby supporting a role for the U(L)28 protein-pac1 interaction in vivo. These data therefore suggest that the formation of novel DNA structures by the pac1 motif confers added specificity on recognition of DNA packaging sequences by the U(L)28-encoded component of the herpesvirus cleavage and packaging machinery.
Isolation of a cDNA for a Growth Factor of Vascular Endothelial Cells from Human Lung Cancer Cells: Its Identity with Insulin‐like Growth Factor II

PubMed Central

Hagiwara, Koichi; Kobayashi, Tatsuo; Tobita, Masato; Kikyo, Nobuaki; Yazaki, Yoshio

1995-01-01

We have found growth‐promoting activity for vascular endothelial cells in the conditioned medium of a human lung cancer cell line, T3M‐11. Purification and characterization of the growth‐promoting activity have been carried out using ammonium sulfate precipitation and gel‐exclusion chromatography. The activity migrated as a single peak just after ribonuclease. It did not bind to a heparin affinity column. These results suggest that the activity is not a heparin‐binding growth factor (including fibroblast growth factors) or a vascular endothelial growth factor. To identify the molecule exhibiting the growth‐promoting activity, a cDNA encoding the growth factor was isolated through functional expression cloning in COS‐1 cells from a cDNA library prepared from T3M‐11 cells. The nucleotide sequence encoded by the cDNA proved to be identical with that of insulin‐like growth factor II. PMID:7730145
Inferring genome-wide interplay landscape between DNA methylation and transcriptional regulation.

PubMed

Tang, Binhua; Wang, Xin

2015-01-01

DNA methylation and transcriptional regulation play important roles in cancer cell development and differentiation processes. Based on the currently available cell line profiling information from the ENCODE Consortium, we propose a Bayesian inference model to infer and construct genome-wide interaction landscape between DNA methylation and transcriptional regulation, which sheds light on the underlying complex functional mechanisms important within the human cancer and disease context. For the first time, we select all the currently available cell lines (>=20) and transcription factors (>=80) profiling information from the ENCODE Consortium portal. Through the integration of those genome-wide profiling sources, our genome-wide analysis detects multiple functional loci of interest, and indicates that DNA methylation is cell- and region-specific, due to the interplay mechanisms with transcription regulatory activities. We validate our analysis results with the corresponding RNA-sequencing technique for those detected genomic loci. Our results provide novel and meaningful insights for the interplay mechanisms of transcriptional regulation and gene expression for the human cancer and disease studies.
The thiostrepton-resistance-encoding gene in Streptomyces laurentii is located within a cluster of ribosomal protein operons.

PubMed

Smith, T M; Jiang, Y F; Shipley, P; Floss, H G

1995-10-16

A common approach to identify and clone biosynthetic gene from an antibiotic-producing streptomycete is to clone the resistance gene for the antibiotic of interest and then use that gene to clone DNA that is linked to it. As a first step toward cloning the genes responsible for the biosynthesis of thiostrepton (Th) in Streptomyces laurentii (Sl), the Th resistance-encoding gene (tsnR) was cloned as a 1.5-kb BamHI-PvuII fragment in Escherichia coli (Ec), and shown to confer Th resistance when introduced into S. lividans TK24. The tsnR-containing DNA fragment was used as a probe to isolate clones from cosmid libraries of DNA in the Ec cosmid vector SuperCos, and pOJ446 (an Ec/streptomycete) cosmid vector. Sequence and genetic analysis of the DNA flanking the tsnR indicates that the Sl tsnR is not closely linked to biosynthetic genes. Instead it is located within a cluster of ribosomal protein operons.
Molecular cloning of Kazal-type proteinase inhibitor of the shrimp Fenneropenaeus chinensis.

PubMed

Kong, Hee Jeong; Cho, Hyun Kook; Park, Eun-Mi; Hong, Gyeong-Eun; Kim, Young-Ok; Nam, Bo-Hye; Kim, Woo-Jin; Lee, Sang-Jun; Han, Hyon Sob; Jang, In-Kwon; Lee, Chang Hoon; Cheong, Jaehun; Choi, Tae-Jin

2009-01-01

Proteinase inhibitors play important roles in host defence systems involving blood coagulation and pathogen digestion. We isolated and characterized a cDNA clone for a Kazal-type proteinase inhibitor (KPI) from a hemocyte cDNA library of the oriental white shrimp Fenneropenaeus chinensis. The KPI gene consists of three exons and two introns. KPI cDNA contains an open reading frame of 396 bp, a polyadenylation signal sequence AATAAA, and a poly (A) tail. KPI cDNA encodes a polypeptide of 131 amino acids with a putative signal peptide of 21 amino acids. The deduced amino acid sequence of KPI contains two homologous Kazal domains, each with six conserved cysteine residues. The mRNA of KPI is expressed in the hemocytes of healthy shrimp, and the higher expression of KPI transcript is observed in shrimp infected with the white spot syndrome virus (WSSV), suggesting a potential role for KPI in host defence mechanisms.
Determination of ABO genotypes with DNA extracted from formalin-fixed, paraffin-embedded tissues.

PubMed

Yamada, M; Yamamoto, Y; Tanegashima, A; Kane, M; Ikehara, Y; Fukunaga, T; Nishi, K

1994-01-01

The gene encoding the specific glycosyltransferases which catalyze the conversion of the H antigen to A or B antigens shows a slight but distinct variation in its allelic nucleotide sequence and can be divided into 6 genotypes when digested with specific restriction enzymes. We extracted DNA from formalin-fixed, paraffin-embedded tissues using SDS/proteinase K treatment followed by phenol/chloroform extraction. The sequence of nucleotides for the A, B and O genes was amplified by the polymerase chain reaction (PCR). DNA fragments of 128 bp and 200 bp could be amplified in the second round of PCR, using an aliquot of the first round PCR product as template. Degraded DNA from paraffin blocks stored for up to 10.7 years could be successfully typed. The ABO genotype was deduced from the digestion patterns with an appropriate combination of restriction enzymes and was compatible with the phenotype obtained from the blood sample.
The GA5 locus of Arabidopsis thaliana encodes a multifunctional gibberellin 20-oxidase: molecular cloning and functional expression.

PubMed

Xu, Y L; Li, L; Wu, K; Peeters, A J; Gage, D A; Zeevaart, J A

1995-07-03

The biosynthesis of gibberellins (GAs) after GA12-aldehyde involves a series of oxidative steps that lead to the formation of bioactive GAs. Previously, a cDNA clone encoding a GA 20-oxidase [gibberellin, 2-oxoglutarate:oxygen oxidoreductase (20-hydroxylating, oxidizing), EC 1.14.11.-] was isolated by immunoscreening a cDNA library from liquid endosperm of pumpkin (Cucurbita maxima L.) with antibodies against partially purified GA 20-oxidase. Here, we report isolation of a genomic clone for GA 20-oxidase from a genomic library of the long-day species Arabidopsis thaliana Heynh., strain Columbia, by using the pumpkin cDNA clone as a heterologous probe. This genomic clone contains a GA 20-oxidase gene that consists of three exons and two introns. The three exons are 1131-bp long and encode 377 amino acid residues. A cDNA clone corresponding to the putative GA 20-oxidase genomic sequence was constructed with the reverse transcription-PCR method, and the identity of the cDNA clone was confirmed by analyzing the capability of the fusion protein expressed in Escherichia coli to convert GA53 to GA44 and GA19 to GA20. The Arabidopsis GA 20-oxidase shares 55% identity and > 80% similarity with the pumpkin GA 20-oxidase at the derived amino acid level. Both GA 20-oxidases share high homology with other 2-oxoglutarate-dependent dioxygenases (2-ODDs), but the highest homology was found between the two GA 20-oxidases. Mapping results indicated tight linkage between the cloned GA 20-oxidase and the GA5 locus of Arabidopsis. The ga5 semidwarf mutant contains a G-->A point mutation that inserts a translational stop codon in the protein-coding sequence, thus confirming that the GA5 locus encodes GA 20-oxidase. Expression of the GA5 gene in Ara-bidopsis leaves was enhanced after plants were transferred from short to long days; it was reduced by GA4 treatment, suggesting end-product repression in the GA biosynthetic pathway.
The GA5 locus of Arabidopsis thaliana encodes a multifunctional gibberellin 20-oxidase: Molecular cloning and functional expression

DOE Office of Scientific and Technical Information (OSTI.GOV)

Xu, Yun-Ling; Li, Li; Wu, Keqiang

1995-07-03

The biosynthesis of gibberellins (GAs) after GA{sub 12}-aldehyde involves a series of oxidative steps that lead to the formation of bioactive GAs. Previously, a cDNA clone encoding a GA 20-oxidase [gibberellin, 2-oxoglutarate:oxygen oxidoreductase (20-hydroxylating, oxidizing), EC 1.14.11-] was isolated by immunoscreening a cDNA library from liquid endosperm of pumpkin (Cucurbita maxima L.) with antibodies against partially purified GA 20-oxidase. Here, we report isolation of a genomic clone for GA 20-oxidase from a genomic library of the long-day species Arabidopsis thaliana Heynh., strain Columbia, by using the pumpkin cDNA clone as a heterologous probe. This genomic clone contains a GA 20-oxidasemore » gene that consists of three exons and two introns. The three exons are 1131-bp long and encode 377 amino acid residues. A cDNA clone corresponding to the putative GA 20-oxidase genomic sequence was constructed with the reverse transcription-PCR method, and the identity of the cDNA clone was confirmed by analyzing the capability of the fusion protein expressed in Escherichia coli to convert GA{sub 53} to GA{sub 44} and GA{sub 19} to GA{sub 20}. The Arabidopsis GA 20-oxidase shares 55% identity and >80% similarity with the pumpkin GA 20-oxidase at the derived amino acid level. Both GA 20-oxidases share high homology with other 2-oxoglutarate-dependent dioxygenases (2-ODDs), but the highest homology was found between the two GA 20-oxidases. Mapping results indicated tight linkage between the cloned GA 20-oxidase and the GA locus of Arabidopsis. The ga5 semidwarf mutant contains a G {yields} A point mutation that inserts a translational stop codon in the protein-coding sequence, thus confirming that the GA5 locus encodes GA 20-oxidase. Expression of the GA5 gene in Arabidopsis leaves was enhanced after plants were transferred from short to long days; it was reduced by GA{sub 4} treatment, suggesting end-product repression in the GA biosynthetic pathway. 28 refs., 6 figs.« less
Investigation of the mechanism of meiotic DNA cleavage by VMA1-derived endonuclease uncovers a meiotic alteration in chromatin structure around the target site.

PubMed

Fukuda, Tomoyuki; Ohta, Kunihiro; Ohya, Yoshikazu

2006-06-01

VMA1-derived endonuclease (VDE), a homing endonuclease in Saccharomyces cerevisiae, is encoded by the mobile intein-coding sequence within the nuclear VMA1 gene. VDE recognizes and cleaves DNA at the 31-bp VDE recognition sequence (VRS) in the VMA1 gene lacking the intein-coding sequence during meiosis to insert a copy of the intein-coding sequence at the cleaved site. The mechanism underlying the meiosis specificity of VMA1 intein-coding sequence homing remains unclear. We studied various factors that might influence the cleavage activity in vivo and found that VDE binding to the VRS can be detected only when DNA cleavage by VDE takes place, implying that meiosis-specific DNA cleavage is regulated by the accessibility of VDE to its target site. As a possible candidate for the determinant of this accessibility, we analyzed chromatin structure around the VRS and revealed that local chromatin structure near the VRS is altered during meiosis. Although the meiotic chromatin alteration exhibits correlations with DNA binding and cleavage by VDE at the VMA1 locus, such a chromatin alteration is not necessarily observed when the VRS is embedded in ectopic gene loci. This suggests that nucleosome positioning or occupancy around the VRS by itself is not the sole mechanism for the regulation of meiosis-specific DNA cleavage by VDE and that other mechanisms are involved in the regulation.
Investigation of the Mechanism of Meiotic DNA Cleavage by VMA1-Derived Endonuclease Uncovers a Meiotic Alteration in Chromatin Structure around the Target Site

PubMed Central

Fukuda, Tomoyuki; Ohta, Kunihiro; Ohya, Yoshikazu

2006-01-01

VMA1-derived endonuclease (VDE), a homing endonuclease in Saccharomyces cerevisiae, is encoded by the mobile intein-coding sequence within the nuclear VMA1 gene. VDE recognizes and cleaves DNA at the 31-bp VDE recognition sequence (VRS) in the VMA1 gene lacking the intein-coding sequence during meiosis to insert a copy of the intein-coding sequence at the cleaved site. The mechanism underlying the meiosis specificity of VMA1 intein-coding sequence homing remains unclear. We studied various factors that might influence the cleavage activity in vivo and found that VDE binding to the VRS can be detected only when DNA cleavage by VDE takes place, implying that meiosis-specific DNA cleavage is regulated by the accessibility of VDE to its target site. As a possible candidate for the determinant of this accessibility, we analyzed chromatin structure around the VRS and revealed that local chromatin structure near the VRS is altered during meiosis. Although the meiotic chromatin alteration exhibits correlations with DNA binding and cleavage by VDE at the VMA1 locus, such a chromatin alteration is not necessarily observed when the VRS is embedded in ectopic gene loci. This suggests that nucleosome positioning or occupancy around the VRS by itself is not the sole mechanism for the regulation of meiosis-specific DNA cleavage by VDE and that other mechanisms are involved in the regulation. PMID:16757746
Gene encoding herbicide safener binding protein

DOEpatents

Walton, Jonathan D.; Scott-Craig, John S.

1999-01-01

The cDNA encoding safener binding protein (SafBP), also referred to as SBP1, is set forth in FIG. 5 and SEQ ID No. 1. The deduced amino acid sequence is provided in FIG. 5 and SEQ ID No. 2. Methods of making and using SBP1 and SafBP to alter a plant's sensitivity to certain herbicides or a plant's responsiveness to certain safeners are also provided, as well as expression vectors, transgenic plants or other organisms transfected with said vectors and seeds from said plants.
Improvement and Optimization of Two Engineered Phage Resistance Mechanisms in Lactococcus lactis

PubMed Central

McGrath, Stephen; Fitzgerald, Gerald F.; van Sinderen, Douwe

2001-01-01

Homologous replication module genes were identified for four P335 type phages. DNA sequence analysis revealed that all four phages exhibited more than 90% DNA homology for at least two genes, designated rep2009 and orf17. One of these genes, rep2009, codes for a putative replisome organizer protein and contains an assumed origin of phage DNA replication (ori2009), which was identical for all four phages. DNA fragments representing the ori2009 sequence confer a phage-encoded resistance (Per) phenotype on lactococcal hosts when they are supplied on a high-copy-number vector. Furthermore, cloning multiple copies of the ori2009 sequence was found to increase the effectiveness of the Per phenotype conferred. A number of antisense plasmids targeting specific genes of the replication module were constructed. Two separate plasmids targeting rep2009 and orf17 were found to efficiently inhibit proliferation of all four phages by interfering with intracellular phage DNA replication. These results represent two highly effective strategies for inhibiting bacteriophage proliferation, and they also identify a novel gene, orf17, which appears to be important for phage DNA replication. Furthermore, these results indicate that although the actual mechanisms of DNA replication are very similar, if not identical, for all four phages, expression of the replication genes is significantly different in each case. PMID:11157223
Sequence analysis of three mitochondrial DNA molecules reveals interesting differences among Saccharomyces yeasts

PubMed Central

Langkjær, R. B.; Casaregola, S.; Ussery, D. W.; Gaillardin, C.; Piškur, J.

2003-01-01

The complete sequences of mitochondrial DNA (mtDNA) from the two budding yeasts Saccharomyces castellii and Saccharomyces servazzii, consisting of 25 753 and 30 782 bp, respectively, were analysed and compared to Saccharomyces cerevisiae mtDNA. While some of the traits are very similar among Saccharomyces yeasts, others have highly diverged. The two mtDNAs are much more compact than that of S.cerevisiae and contain fewer introns and intergenic sequences, although they have almost the same coding potential. A few genes contain group I introns, but group II introns, otherwise found in S.cerevisiae mtDNA, are not present. Surprisingly, four genes (ATP6, COX2, COX3 and COB) in the mtDNA of S.servazzii contain, in total, five +1 frameshifts. mtDNAs of S.castellii, S.servazzii and S.cerevisiae contain all genes on the same strand, except for one tRNA gene. On the other hand, the gene order is very different. Several gene rearrangements have taken place upon separation of the Saccharomyces lineages, and even a part of the transcription units have not been preserved. It seems that the mechanism(s) involved in the generation of the rearrangements has had to ensure that all genes stayed encoded by the same DNA strand. PMID:12799436
Cystic Fibrosis Gene Encodes a cAMP-Dependent Chloride Channel in Heart

NASA Astrophysics Data System (ADS)

Hart, Padraig; Warth, John D.; Levesque, Paul C.; Collier, Mei Lin; Geary, Yvonne; Horowitz, Burton; Hume, Joseph R.

1996-06-01

cAMP-dependent chloride channels in heart contribute to autonomic regulation of action potential duration and membrane potential and have been inferred to be due to cardiac expression of the epithelial cystic fibrosis transmembrane conductance regulator (CFTR) chloride channel. In this report, a cDNA from rabbit ventricle was isolated and sequenced, which encodes an exon 5 splice variant (exon 5-) of CFTR, with >90% identity to human CFTR cDNA present in epithelial cells. Expression of this cDNA in Xenopus oocytes gave rise to robust cAMP-activated chloride currents that were absent in control water-injected oocytes. Antisense oligodeoxynucleotides directed against CFTR significnatly reduced the density of cAMP-dependent chloride currents in acutely cultured myocytes, thereby establishing a direct functional link between cardiac expression of CFTR protein and an endogenous chloride channel in native cardiac myocytes.
Isolation and characterization of a cDNA clone coding for a glutathione S-transferase class delta enzyme from the biting midge Culicoides variipennis sonorensis Wirth and Jones.

PubMed

Abdallah, M A; Pollenz, R S; Droog, F N; Nunamaker, R A; Tabachnick, W J; Murphy, K E

2000-12-01

Culicoides variipennis sonorensis is the primary vector of bluetongue viruses in North America. Glutathione S-transferases (GSTs) are enzymes that catalyze nucleophilic substitutions, converting reactive lipophilic molecules into soluble conjugates. Increased GST activity is associated with development of insecticide resistance. Described here is the isolation of the first cDNA encoding a C. variipennis GST. The clone consists of 720 translated bases encoding a protein with a M(r) of approximately 24,800 composed of 219 amino acids. The deduced amino acid sequence is similar (64%-74%) to class Delta (previously named Theta) GSTs from the dipteran genera Musca, Drosophila, Lucilia and Anopheles. The cDNA was subcloned into pET-11b, expressed in Epicurian coli BL21 (DE3) and has a specific activity of approximately 28,000 units/mg for the substrate 1-chloro-2,4-dinitrobenzene.
A beta-galactosidase gene is expressed during mature fruit abscission of 'Valencia' orange (Citrus sinensis).

PubMed

Wu, Zhencai; Burns, Jacqueline K

2004-07-01

beta-galactosidases have been detected in a wide range of plants and are characterized by their ability to hydrolyse terminal non-reducing beta-D-galactosyl residues from beta-D-galactosides. These enzymes have been detected in a wide range of plant organs and tissues. In a search for differentially expressed genes during the abscission process in citrus, sequences encoding beta-galactosidase were identified. Three cDNA fragments of a beta-galactosidase gene were isolated from a cDNA subtraction library constructed from mature fruit abscission zones 48 h after the application of a mature fruit-specific abscission agent, 5-chloro-3-methyl-4-nitro-1H-pyrazole (CMN-pyrazole). Based on sequence information derived from these fragments, a full-length cDNA of 2847 nucleotides (GenBank accession number AY029198) encoding beta-galactosidase was isolated from mature fruit abscission zones by 5'- and 3'-RACE approaches. The beta-galactosidase cDNA encoded a protein of 737 amino acid residues with a calculated molecular weight of 82 kDa. The deduced protein was highly homologous to plant beta-galactosidases expressed in fruit ripening. Southern blot analysis demonstrated that at least two closely related beta-galactosidase genes were present in 'Valencia' orange. Temporal expression patterns in mature fruit abscission zones indicated beta-galactosidase mRNA was detected 48 h after treatment of CMN-pyrazole and ethephon in mature fruit abscission zones. beta-galactosidase transcripts were detected in leaf abscission zones only after ethephon application. The citrus beta-galactosidase was expressed in stamens and petals of fully opened flowers and young fruitlets. The results suggest that this beta-galactosidase may play a role during abscission as well as early growth and development processes in flowers and fruitlets.

Mitochondrial DNA copy number is regulated in a tissue specific manner by DNA methylation of the nuclear-encoded DNA polymerase gamma A

PubMed Central

Kelly, Richard D. W.; Mahmud, Arsalan; McKenzie, Matthew; Trounce, Ian A.; St John, Justin C.

2012-01-01

DNA methylation is an essential mechanism controlling gene expression during differentiation and development. We investigated the epigenetic regulation of the nuclear-encoded, mitochondrial DNA (mtDNA) polymerase γ catalytic subunit (PolgA) by examining the methylation status of a CpG island within exon 2 of PolgA. Bisulphite sequencing identified low methylation levels (<10%) within exon 2 of mouse oocytes, blastocysts and embryonic stem cells (ESCs), while somatic tissues contained significantly higher levels (>40%). In contrast, induced pluripotent stem (iPS) cells and somatic nuclear transfer ESCs were hypermethylated (>20%), indicating abnormal epigenetic reprogramming. Real time PCR analysis of 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC) immunoprecipitated DNA suggests active DNA methylation and demethylation within exon 2 of PolgA. Moreover, neural differentiation of ESCs promoted de novo methylation and demethylation at the exon 2 locus. Regression analysis demonstrates that cell-specific PolgA expression levels were negatively correlated with DNA methylation within exon 2 and mtDNA copy number. Finally, using chromatin immunoprecipitation (ChIP) against RNA polymerase II (RNApII) phosphorylated on serine 2, we show increased DNA methylation levels are associated with reduced RNApII transcriptional elongation. This is the first study linking nuclear DNA epigenetic regulation with mtDNA regulation during differentiation and cell specialization. PMID:22941637
Mitochondrial DNA Variant in COX1 Subunit Significantly Alters Energy Metabolism of Geographically Divergent Wild Isolates in Caenorhabditis elegans

PubMed Central

Dingley, Stephen D.; Polyak, Erzsebet; Ostrovsky, Julian; Srinivasan, Satish; Lee, Icksoo; Rosenfeld, Amy B.; Tsukikawa, Mai; Xiao, Rui; Selak, Mary A.; Coon, Joshua J.; Hebert, Alexander S.; Grimsrud, Paul A.; Kwon, Young Joon; Pagliarini, David J.; Gai, Xiaowu; Schurr, Theodore G.; Hüttemann, Maik; Nakamaru-Ogiso, Eiko; Falk, Marni J.

2014-01-01

Mitochondrial DNA (mtDNA) sequence variation can influence the penetrance of complex diseases and climatic adaptation. While studies in geographically defined human populations suggest that mtDNA mutations become fixed when they have conferred metabolic capabilities optimally suited for a specific environment, it has been challenging to definitively assign adaptive functions to specific mtDNA sequence variants in mammals. We investigated whether mtDNA genome variation functionally influences Caenorhabditis elegans wild isolates of distinct mtDNA lineages and geographic origins. We found that, relative to N2 (England) wild-type nematodes, CB4856 wild isolates from a warmer native climate (Hawaii) had a unique p.A12S amino acid substitution in the mtDNA-encoded COX1 core catalytic subunit of mitochondrial complex IV (CIV). Relative to N2, CB4856 worms grown at 20 °C had significantly increased CIV enzyme activity, mitochondrial matrix oxidant burden, and sensitivity to oxidative stress but had significantly reduced lifespan and mitochondrial membrane potential. Interestingly, mitochondrial membrane potential was significantly increased in CB4856 grown at its native temperature of 25 °C. A transmitochondrial cybrid worm strain, chpIR (M, CB4856 > N2), was bred as homoplasmic for the CB4856 mtDNA genome in the N2 nuclear background. The cybrid strain also displayed significantly increased CIV activity, demonstrating that this difference results from the mtDNA-encoded p.A12S variant. However, chpIR (M, CB4856 > N2) worms had significantly reduced median and maximal lifespan relative to CB4856, which may relate to their nuclear– mtDNA genome mismatch. Overall, these data suggest that C. elegans wild isolates of varying geographic origins may adapt to environmental challenges through mtDNA variation to modulate critical aspects of mitochondrial energy metabolism. PMID:24534730
Two potato proteins, including a novel RING finger protein (HIP1), interact with the potyviral multifunctional protein HCpro.

PubMed

Guo, Deyin; Spetz, Carl; Saarma, Mart; Valkonen, Jari P T

2003-05-01

Potyviral helper-component proteinase (HCpro) is a multifunctional protein exerting its cellular functions in interaction with putative host proteins. In this study, cellular protein partners of the HCpro encoded by Potato virus A (PVA) (genus Potyvirus) were screened in a potato leaf cDNA library using a yeast two-hybrid system. Two cellular proteins were obtained that interact specifically with PVA HCpro in yeast and in the two in vitro binding assays used. Both proteins are encoded by single-copy genes in the potato genome. Analysis of the deduced amino acid sequences revealed that one (HIP1) of the two HCpro interactors is a novel RING finger protein. The sequence of the other protein (HIP2) showed no resemblance to the protein sequences available from databanks and has known biological functions.
Label-Free Detection of Sequence-Specific DNA Based on Fluorescent Silver Nanoclusters-Assisted Surface Plasmon-Enhanced Energy Transfer.

PubMed

Ma, Jin-Liang; Yin, Bin-Cheng; Le, Huynh-Nhu; Ye, Bang-Ce

2015-06-17

We have developed a label-free method for sequence-specific DNA detection based on surface plasmon enhanced energy transfer (SPEET) process between fluorescent DNA/AgNC string and gold nanoparticles (AuNPs). DNA/AgNC string, prepared by a single-stranded DNA template encoded two emitter-nucleation sequences at its termini and an oligo spacer in the middle, was rationally designed to produce bright fluorescence emission. The proposed method takes advantage of two strategies. The first one is the difference in binding properties of single-stranded DNA (ssDNA) and double-stranded DNA (dsDNA) toward AuNPs. The second one is SPEET process between fluorescent DNA/AgNC string and AuNPs, in which fluorescent DNA/AgNC string can be spontaneously adsorbed onto the surface of AuNPs and correspondingly AuNPs serve as "nanoquencher" to quench the fluorescence of DNA/AgNC string. In the presence of target DNA, the sensing probe hybridized with target DNA to form duplex DNA, leading to a salt-induced AuNP aggregation and subsequently weakened SPEET process between fluorescent DNA/AgNC string and AuNPs. A red-to-blue color change of AuNPs and a concomitant fluorescence increase were clearly observed in the sensing system, which had a concentration dependent manner with specific DNA. The proposed method achieved a detection limit of ∼2.5 nM, offering the following merits of simple design, convenient operation, and low experimental cost because of no chemical modification, organic dye, enzymatic reaction, or separation procedure involved.
Plasmids encoding PKI(1-31), a specific inhibitor of cAMP-stimulated gene expression, inhibit the basal transcriptional activity of some but not all cAMP-regulated DNA response elements in JEG-3 cells.

PubMed

Grove, J R; Deutsch, P J; Price, D J; Habener, J F; Avruch, J

1989-11-25

Plasmids that encode a bioactive amino-terminal fragment of the heat-stable inhibitor of the cAMP-dependent protein kinase, PKI(1-31), were employed to characterize the role of this protein kinase in the control of transcriptional activity mediated by three DNA regulatory elements in the JEG-3 human placental cell line. The 5'-flanking sequence of the human collagenase gene contains the heptameric sequence, 5'-TGAGTCA-3', previously identified as a "phorbol ester" response element. Reporter genes containing either the intact 1.2-kilobase 5'-flanking sequence from the human collagenase gene or just the 7-base pair (bp) response element, when coupled to an enhancerless promoter, each exhibit both cAMP and phorbol ester-stimulated expression in JEG-3 cells. Cotransfection of either construct with plasmids encoding PKI(1-31) inhibits cAMP-stimulated but not basal- or phorbol ester-stimulated expression. Pretreatment of cells with phorbol ester for 1 or 2 days abrogates completely the response to rechallenge with phorbol ester but does not alter the basal expression of either construct; cAMP-stimulated expression, while modestly inhibited, remains vigorous. The 5'-flanking sequence of the human chorionic gonadotropin-alpha subunit (HCG alpha) gene has two copies of the sequence, 5'-TGACGTCA-3', contained in directly adjacent identical 18-bp segments, previously identified as a cAMP-response element. Reporter genes containing either the intact 1.5 kilobase of 5'-flanking sequence from the HCG alpha gene, or just the 36-bp tandem repeat cAMP response element, when coupled to an enhancerless promoter, both exhibit a vigorous cAMP stimulation of expression but no response to phorbol ester in JEG-3 cells. Cotransfection with plasmids encoding PKI(1-31) inhibits both basal and cAMP-stimulated expression in a parallel fashion. The 5'-flanking sequence of the human enkephalin gene mediates cAMP-stimulated expression of reporter genes in both JEG-3 and CV-1 cells. Plasmids encoding PKI(1-31) inhibit the expression that is stimulated by the addition of cAMP analogs in both cell lines; basal expression, however, is inhibited by PKI(1-31) only in the JEG-3 cell line and not in the CV-1 cells. These observations indicate that, in JEG-3 cells, PKI(1-31) is a specific inhibitor of kinase A-mediated gene transcription, but it does not modify kinase C-directed transcription.(ABSTRACT TRUNCATED AT 400 WORDS)
Induction of Strain-Transcending Immunity against Plasmodium chabaudi adami Malaria with a Multiepitope DNA Vaccine

PubMed Central

Scorza, T.; Grubb, K.; Smooker, P.; Rainczuk, A.; Proll, D.; Spithill, T. W.

2005-01-01

A major goal of current malaria vaccine programs is to develop multivalent vaccines that will protect humans against the many heterologous malaria strains that circulate in endemic areas. We describe a multiepitope DNA vaccine, derived from a genomic Plasmodium chabaudi adami DS DNA expression library of 30,000 plasmids, which induces strain-transcending immunity in mice against challenge with P. c. adami DK. Segregation of this library and DNA sequence analysis identified vaccine subpools encoding open reading frames (ORFs)/peptides of >9 amino acids [aa] (the V9+ pool, 303 plasmids) and >50 aa (V50+ pool, 56 plasmids), respectively. The V9+ and V50+ plasmid vaccine subpools significantly cross-protected mice against heterologous P. c. adami DK challenge, and protection correlated with the induction of both specific gamma interferon production by splenic cells and opsonizing antibodies. Bioinformatic analysis showed that 22 of the V50+ ORFs were polypeptides conserved among three or more Plasmodium spp., 13 of which are predicted hypothetical proteins. Twenty-nine of these ORFs are orthologues of predicted Plasmodium falciparum sequences known to be expressed in the blood stage, suggesting that this vaccine pool encodes multiple blood-stage antigens. The results have implications for malaria vaccine design by providing proof-of-principle that significant strain-transcending immunity can be induced using multiepitope blood-stage DNA vaccines and suggest that both cellular responses and opsonizing antibodies are necessary for optimal protection against P. c. adami. PMID:15845504
Single Cell Transcriptomics of Hypothalamic Warm Sensitive Neurons that Control Core Body Temperature and Fever Response

PubMed Central

Eberwine, James; Bartfai, Tamas

2011-01-01

We report on an ‘unbiased’ molecular characterization of individual, adult neurons, active in a central, anterior hypothalamic neuronal circuit, by establishing cDNA libraries from each individual, electrophysiologically identified warm sensitive neuron (WSN). The cDNA libraries were analyzed by Affymetrix microarray. The presence and frequency of cDNAs was confirmed and enhanced with Illumina sequencing of each single cell cDNA library. cDNAs encoding the GABA biosynthetic enzyme. GAD1 and of adrenomedullin, galanin, prodynorphin, somatostatin, and tachykinin were found in the WSNs. The functional cellular and in vivo studies on dozens of the more than 500 neurotransmitter -, hormone- receptors and ion channels, whose cDNA was identified and sequence confirmed, suggest little or no discrepancy between the transcriptional and functional data in WSNs; whenever agonists were available for a receptor whose cDNA was identified, a functional response was found.. Sequencing single neuron libraries permitted identification of rarely expressed receptors like the insulin receptor, adiponectin receptor2 and of receptor heterodimers; information that is lost when pooling cells leads to dilution of signals and mixing signals. Despite the common electrophysiological phenotype and uniform GAD1 expression, WSN- transcriptomes show heterogenity, suggesting strong epigenetic influence on the transcriptome. Our study suggests that it is well-worth interrogating the cDNA libraries of single neurons by sequencing and chipping. PMID:20970451
The Elusive Nature of Adaptive Mitochondrial DNA Evolution of an Arctic Lineage Prone to Frequent Introgression

PubMed Central

Melo-Ferreira, José; Vilela, Joana; Fonseca, Miguel M.; da Fonseca, Rute R.; Boursot, Pierre; Alves, Paulo C.

2014-01-01

Mitochondria play a fundamental role in cellular metabolism, being responsible for most of the energy production of the cell in the oxidative phosphorylation (OXPHOS) pathway. Mitochondrial DNA (mtDNA) encodes for key components of this process, but its direct role in adaptation remains far from understood. Hares (Lepus spp.) are privileged models to study the impact of natural selection on mitogenomic evolution because 1) species are adapted to contrasting environments, including arctic, with different metabolic pressures, and 2) mtDNA introgression from arctic into temperate species is widespread. Here, we analyzed the sequences of 11 complete mitogenomes (ten newly obtained) of hares of temperate and arctic origins (including two of arctic origin introgressed into temperate species). The analysis of patterns of codon substitutions along the reconstructed phylogeny showed evidence for positive selection in several codons in genes of the OXPHOS complexes, most notably affecting the arctic lineage. However, using theoretical models, no predictable effect of these differences was found on the structure and physicochemical properties of the encoded proteins, suggesting that the focus of selection may lie on complex interactions with nuclear encoded peptides. Also, a cloverleaf structure was detected in the control region only from the arctic mtDNA lineage, which may influence mtDNA replication and transcription. These results suggest that adaptation impacted the evolution of hare mtDNA and may have influenced the occurrence and consequences of the many reported cases of massive mtDNA introgression. However, the origin of adaptation remains elusive. PMID:24696399
Turning self-destructing Salmonella into a universal DNA vaccine delivery platform.

PubMed

Kong, Wei; Brovold, Matthew; Koeneman, Brian A; Clark-Curtiss, Josephine; Curtiss, Roy

2012-11-20

We previously developed a biological containment system using recombinant Salmonella Typhimurium strains that are attenuated yet capable of synthesizing protective antigens. The regulated delayed attenuation and programmed self-destructing features designed into these S. Typhimurium strains enable them to efficiently colonize host tissues and allow release of the bacterial cell contents after lysis. To turn such a recombinant attenuated Salmonella vaccine (RASV) strain into a universal DNA vaccine-delivery vehicle, our approach was to genetically modify RASV strains to display a hyperinvasive phenotype to maximize Salmonella host entry and host cell internalization, to enable Salmonella endosomal escape to release a DNA vaccine into the cytosol, and to decrease Salmonella-induced pyroptosis/apoptosis that allows the DNA vaccine time to traffic to the nucleus for efficient synthesis of encoded protective antigens. A DNA vaccine vector that encodes a domain that contributes to the arabinose-regulated lysis phenotype but has a eukaryotic promoter was constructed. The vector was then improved by insertion of multiple DNA nuclear-targeting sequences for efficient nuclear trafficking and gene expression, and by increasing nuclease resistance to protect the plasmid from host degradation. A DNA vaccine encoding influenza WSN virus HA antigen delivered by the RASV strain with the best genetic attributes induced complete protection to mice against a lethal influenza virus challenge. Adoption of these technological improvements will revolutionize means for effective delivery of DNA vaccines to stimulate mucosal, systemic, and cellular protective immunities, and lead to a paradigm shift in cost-effective control and prevention of a diversity of diseases.
Turning self-destructing Salmonella into a universal DNA vaccine delivery platform

PubMed Central

Kong, Wei; Brovold, Matthew; Koeneman, Brian A.; Clark-Curtiss, Josephine; Curtiss, Roy

2012-01-01

We previously developed a biological containment system using recombinant Salmonella Typhimurium strains that are attenuated yet capable of synthesizing protective antigens. The regulated delayed attenuation and programmed self-destructing features designed into these S. Typhimurium strains enable them to efficiently colonize host tissues and allow release of the bacterial cell contents after lysis. To turn such a recombinant attenuated Salmonella vaccine (RASV) strain into a universal DNA vaccine-delivery vehicle, our approach was to genetically modify RASV strains to display a hyperinvasive phenotype to maximize Salmonella host entry and host cell internalization, to enable Salmonella endosomal escape to release a DNA vaccine into the cytosol, and to decrease Salmonella-induced pyroptosis/apoptosis that allows the DNA vaccine time to traffic to the nucleus for efficient synthesis of encoded protective antigens. A DNA vaccine vector that encodes a domain that contributes to the arabinose-regulated lysis phenotype but has a eukaryotic promoter was constructed. The vector was then improved by insertion of multiple DNA nuclear-targeting sequences for efficient nuclear trafficking and gene expression, and by increasing nuclease resistance to protect the plasmid from host degradation. A DNA vaccine encoding influenza WSN virus HA antigen delivered by the RASV strain with the best genetic attributes induced complete protection to mice against a lethal influenza virus challenge. Adoption of these technological improvements will revolutionize means for effective delivery of DNA vaccines to stimulate mucosal, systemic, and cellular protective immunities, and lead to a paradigm shift in cost-effective control and prevention of a diversity of diseases. PMID:23129620
Cloning and identification of a cDNA that encodes a novel human protein with thrombospondin type I repeat domain, hPWTSR.

PubMed

Chen, Jin-Zhong; Wang, Shu; Tang, Rong; Yang, Quan-Sheng; Zhao, Enpeng; Chao, Yaoqiong; Ying, Kang; Xie, Yi; Mao, Yu-Min

2002-09-01

A cDNA was isolated from the fetal brain cDNA library by high throughput cDNA sequencing. The 2390 bp cDNA with an open reading fragment (ORF) of 816 bp encodes a 272 amino acids putative protein with a thrombospondin type I repeat (TSR) domain and a cysteine-rich region at the N-terminus, so it is named hPWTSR. We used Northern blot detected two bands with length of about 3 kb and 4 kb respectively, which expressed in human adult tissues with different intensities. The expression pattern was verified by RT-PCR, revealing that the transcripts were expressed ubiquitously in fetal tissues and human tumor tissues too. However, the transcript was detected neither in ovarian carcinoma GI-102 nor in lung carcinoma LX-1. Blast analysis against NCBI database revealed that the new gene contained at least 5 exons and located in human chromosome 6q22.33. Our results demonstrate that the gene is a novel member of TSR supergene family.
Mitochondrial genome sequence of the Tibetan wild ass (Equus kiang).

PubMed

Luo, Yongjun; Chen, Yu; Liu, Fuyu; Jiang, Chunhua; Gao, Yuqi

2011-02-01

The Tibetan wild ass, or kiang (Equus kiang) is endemic to the cold and hypoxic (4000-7000 m above sea level) climates of the montane and alpine grasslands of the Tibetan Plateau. We report here the complete nucleotide sequence of the E. kiang mitochondrial genome. Our results show that E. kiang mitochondrial DNA is 16,634 bp long, and predicted to encode all the 37 genes that are typical for vertebrates.
Superstatistical model of bacterial DNA architecture

NASA Astrophysics Data System (ADS)

Bogachev, Mikhail I.; Markelov, Oleg A.; Kayumov, Airat R.; Bunde, Armin

2017-02-01

Understanding the physical principles that govern the complex DNA structural organization as well as its mechanical and thermodynamical properties is essential for the advancement in both life sciences and genetic engineering. Recently we have discovered that the complex DNA organization is explicitly reflected in the arrangement of nucleotides depicted by the universal power law tailed internucleotide interval distribution that is valid for complete genomes of various prokaryotic and eukaryotic organisms. Here we suggest a superstatistical model that represents a long DNA molecule by a series of consecutive ~150 bp DNA segments with the alternation of the local nucleotide composition between segments exhibiting long-range correlations. We show that the superstatistical model and the corresponding DNA generation algorithm explicitly reproduce the laws governing the empirical nucleotide arrangement properties of the DNA sequences for various global GC contents and optimal living temperatures. Finally, we discuss the relevance of our model in terms of the DNA mechanical properties. As an outlook, we focus on finding the DNA sequences that encode a given protein while simultaneously reproducing the nucleotide arrangement laws observed from empirical genomes, that may be of interest in the optimization of genetic engineering of long DNA molecules.
Molecular characterization and phylogenetic analysis of a yak (Bos grunniens) κ-casein cDNA from lactating mammary gland.

PubMed

Bai, W L; Yin, R H; Dou, Q L; Jiang, W Q; Zhao, S J; Ma, Z J; Luo, G B; Zhao, Z H

2011-04-01

κ-Casein is one of the major proteins in the milk of mammals. It plays an important role in determining the size and specific function of milk micelles. We have previously identified and characterized a genetic variant of yak κ-casein by evaluating genomic DNA. Here, we isolate and characterize a yak κ-casein cDNA harboring the full-length open reading frame (ORF) from lactating mammary gland. Total RNA was extracted from mammary tissue of lactating female yak, and the κ-casein cDNA were synthesized by RT-PCR technique, then cloned and sequenced. The obtained cDNA of 660-bp contained an ORF sufficient to encode the entire amino acid sequence of κ-casein precursor protein consisting of 190 amino acids with a signal peptide of 21 amino acids. Yak κ-casein has a predicted molecular mass of 19,006.588 Da with a calculated isoelectric point of 7.245. Compared with the corresponding sequences in GenBank of cattle, buffalo, sheep, goat, Arabian camel, horse, and rabbit, yak κ-casein sequence had identity of 64.76-98.78% in cDNA, and identity of 44.79-98.42% and similarity of 53.65-98.42% in deduced amino acids, revealing a high homology with the other livestock species. Based on κ-casein cDNA sequences, the phylogenetic analysis indicated that yak κ-casein had a close relationship with that of cattle. This work might be useful in the genetic engineering researches for yak κ-casein.
Distinct Circular Single-Stranded DNA Viruses Exist in Different Soil Types

PubMed Central

Swanson, Maud M.; Dawson, Lorna; Freitag, Thomas E.; Singh, Brajesh K.; Torrance, Lesley; Mushegian, Arcady R.

2015-01-01

The potential dependence of virus populations on soil types was examined by electron microscopy, and the total abundance of virus particles in four soil types was similar to that previously observed in soil samples. The four soil types examined differed in the relative abundances of four morphological groups of viruses. Machair, a unique type of coastal soil in western Scotland and Ireland, differed from the others tested in having a higher proportion of tailed bacteriophages. The other soils examined contained predominantly spherical and thin filamentous virus particles, but the Machair soil had a more even distribution of the virus types. As the first step in looking at differences in populations in detail, virus sequences from Machair and brown earth (agricultural pasture) soils were examined by metagenomic sequencing after enriching for circular Rep-encoding single-stranded DNA (ssDNA) (CRESS-DNA) virus genomes. Sequences from the family Microviridae (icosahedral viruses mainly infecting bacteria) of CRESS-DNA viruses were predominant in both soils. Phylogenetic analysis of Microviridae major coat protein sequences from the Machair viruses showed that they spanned most of the diversity of the subfamily Gokushovirinae, whose members mainly infect obligate intracellular parasites. The brown earth soil had a higher proportion of sequences that matched the morphologically similar family Circoviridae in BLAST searches. However, analysis of putative replicase proteins that were similar to those of viruses in the Circoviridae showed that they are a novel clade of Circoviridae-related CRESS-DNA viruses distinct from known Circoviridae genera. Different soils have substantially different taxonomic biodiversities even within ssDNA viruses, which may be driven by physicochemical factors. PMID:25841004
Revolting Developments in Our Understanding of the Organization of the Eukaryotic Genome.

ERIC Educational Resources Information Center

Krider, Hallie M.

1984-01-01

Various typs of DNA are discussed. Areas considered include highly repetitive and satellite sequences, genes encoding, ribosomal RNA, histone protein genes, and dispersed repeated genes that jump. Regulated genetic misbehavior, structure and use of unique genes, and higher order complexities of chromosomes are also discussed. (JN)
Production of hydroxylated fatty acids in genetically modified plants

DOEpatents

Somerville, Chris; Broun, Pierre; van de Loo, Frank

2001-01-01

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants.
MADS-box genes in maize: Frequent targets of selection during domestication

USDA-ARS?s Scientific Manuscript database

MADS-box genes encode transcription factors that are key regulators of plant inflorescence and flower development. We examined DNA sequence variation in 32 maize MADS-box genes and 32 random loci from the maize genome and investigated their involvement in maize domestication and improvement. Using n...
DNA Prime/Adenovirus Boost Malaria Vaccine Encoding P. falciparum CSP and AMA1 Induces Sterile Protection Associated with Cell-Mediated Immunity

DTIC Science & Technology

2013-02-14

immunization, was severe (Grade 3), preventing daily activities . Four weeks after the Ad boost, 15 study subjects were challenged with P. falciparum...administering a drug selectively active against blood stage parasites such as chloroquine [4,5]. While the immunological mechanisms underlying the...promoter sequence activated within the host cell. Alternatively, the genes are inserted into a viral vector, which efficiently transports the DNA into
Molecular cloning and expression of rat liver bile acid CoA ligase.

PubMed

Falany, Charles N; Xie, Xiaowei; Wheeler, James B; Wang, Jin; Smith, Michelle; He, Dongning; Barnes, Stephen

2002-12-01

Bile acid CoA ligase (BAL) is responsible for catalyzing the first step in the conjugation of bile acids with amino acids. Sequencing of putative rat liver BAL cDNAs identified a cDNA (rBAL-1) possessing a 51 nucleotide 5'-untranslated region, an open reading frame of 2,070 bases encoding a 690 aa protein with a molecular mass of 75,960 Da, and a 138 nucleotide 3'-nontranslated region followed by a poly(A) tail. Identity of the cDNA was established by: 1) the rBAL-1 open reading frame encoded peptides obtained by chemical sequencing of the purified rBAL protein; 2) expressed rBAL-1 protein comigrated with purified rBAL during SDS-polyacrylamide gel electrophoresis; and 3) rBAL-1 expressed in insect Sf9 cells had enzymatic properties that were comparable to the enzyme isolated from rat liver. Evidence for a relationship between fatty acid and bile acid metabolism is suggested by specific inhibition of rBAL-1 by cis-unsaturated fatty acids and its high homology to a human very long chain fatty acid CoA ligase. In summary, these results indicate that the cDNA for rat liver BAL has been isolated and expression of the rBAL cDNA in insect Sf9 cells results in a catalytically active enzyme capable of utilizing several different bile acids as substrates.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.