Shi, Xue; Zeng, Haiyang; Xue, Yadong; Luo, Meizhong
2011-10-11
Large-insert BAC and BIBAC libraries are important tools for structural and functional genomics studies of eukaryotic genomes. To facilitate the construction of BAC and BIBAC libraries and the transfer of complete large BAC inserts into BIBAC vectors, which is desired in positional cloning, we developed a pair of new BAC and BIBAC vectors. The new BAC vector pIndigoBAC536-S and the new BIBAC vector BIBAC-S have the following features: 1) both contain two 18-bp non-palindromic I-SceI sites in an inverted orientation at positions that flank an identical DNA fragment containing the lacZ selection marker and the cloning site. Large DNA inserts can be excised from the vectors as single fragments by cutting with I-SceI, allowing the inserts to be easily sized. More importantly, because the two vectors contain different antibiotic resistance genes for transformant selection and produce the same non-complementary 3' protruding ATAA ends by I-SceI that suppress self- and inter-ligations, the exchange of intact large genomic DNA inserts between the BAC and BIBAC vectors is straightforward; 2) both were constructed as high-copy composite vectors. Reliable linearized and dephosphorylated original low-copy pIndigoBAC536-S and BIBAC-S vectors that are ready for library construction can be prepared from the high-copy composite vectors pHZAUBAC1 and pHZAUBIBAC1, respectively, without the need for additional preparation steps or special reagents, thus simplifying the construction of BAC and BIBAC libraries. BIBAC clones constructed with the new BIBAC-S vector are stable in both E. coli and Agrobacterium. The vectors can be accessed through our website http://GResource.hzau.edu.cn. The two new vectors and their respective high-copy composite vectors can largely facilitate the construction and characterization of BAC and BIBAC libraries. The transfer of complete large genomic DNA inserts from one vector to the other is made straightforward.
[Construction of large fragment metagenome library of natural mangrove soil].
Jiang, Yun-Xia; Zheng, Tian-Ling
2007-11-01
Applying our optimized direct extraction method, the percentage of large fragment DNA in the total extracted mangrove soil DNA was significant increased. The large fragment metagenome library derived from natural mangrove soil over four seasons was successfully constructed by the optimized DNA extraction and electro elution purification method. All of the clones had recombinant Cosmids and each differed in their fragment profiles when Cosmid DNA was extracted from 12 randomly picked colonies and digested with BamHI. The average insert size for this library was larger than 35 kbp. This culturing-independent library at least encompassed 335 Mbp valuable genetic information of mangrove soil microbes. It allowed mining of valuable intertidal microbial resource to become a reality. It is a recommended method for those researchers who have still not circumvented the large insert environmental libraries or for those beginning research in this field, so as to avoid them attempting repetitive, fussy work.
Comparison of large-insert, small-insert and pyrosequencing libraries for metagenomic analysis.
Danhorn, Thomas; Young, Curtis R; DeLong, Edward F
2012-11-01
The development of DNA sequencing methods for characterizing microbial communities has evolved rapidly over the past decades. To evaluate more traditional, as well as newer methodologies for DNA library preparation and sequencing, we compared fosmid, short-insert shotgun and 454 pyrosequencing libraries prepared from the same metagenomic DNA samples. GC content was elevated in all fosmid libraries, compared with shotgun and 454 libraries. Taxonomic composition of the different libraries suggested that this was caused by a relative underrepresentation of dominant taxonomic groups with low GC content, notably Prochlorales and the SAR11 cluster, in fosmid libraries. While these abundant taxa had a large impact on library representation, we also observed a positive correlation between taxon GC content and fosmid library representation in other low-GC taxa, suggesting a general trend. Analysis of gene category representation in different libraries indicated that the functional composition of a library was largely a reflection of its taxonomic composition, and no additional systematic biases against particular functional categories were detected at the level of sequencing depth in our samples. Another important but less predictable factor influencing the apparent taxonomic and functional library composition was the read length afforded by the different sequencing technologies. Our comparisons and analyses provide a detailed perspective on the influence of library type on the recovery of microbial taxa in metagenomic libraries and underscore the different uses and utilities of more traditional, as well as contemporary 'next-generation' DNA library construction and sequencing technologies for exploring the genomics of the natural microbial world.
Soares, Marcelo Bento; Bonaldo, Maria de Fatima
1998-01-01
This invention provides a method to normalize a cDNA library comprising: (a) constructing a directionally cloned library containing cDNA inserts wherein the insert is capable of being amplified by polymerase chain reaction; (b) converting a double-stranded cDNA library into single-stranded DNA circles; (c) generating single-stranded nucleic acid molecules complementary to the single-stranded DNA circles converted in step (b) by polymerase chain reaction with appropriate primers; (d) hybridizing the single-stranded DNA circles converted in step (b) with the complementary single-stranded nucleic acid molecules generated in step (c) to produce partial duplexes to an appropriate Cot; and (e) separating the unhybridized single-stranded DNA circles from the hybridized DNA circles, thereby generating a normalized cDNA library. This invention also provides a method to normalize a cDNA library wherein the generating of single-stranded nucleic acid molecules complementary to the single-stranded DNA circles converted in step (b) is by excising cDNA inserts from the double-stranded cDNA library; purifying the cDNA inserts from cloning vectors; and digesting the cDNA inserts with an exonuclease. This invention further provides a method to construct a subtractive cDNA library following the steps described above. This invention further provides normalized and/or subtractive cDNA libraries generated by the above methods.
Soares, M.B.; Fatima Bonaldo, M. de
1998-12-08
This invention provides a method to normalize a cDNA library comprising: (a) constructing a directionally cloned library containing cDNA inserts wherein the insert is capable of being amplified by polymerase chain reaction; (b) converting a double-stranded cDNA library into single-stranded DNA circles; (c) generating single-stranded nucleic acid molecules complementary to the single-stranded DNA circles converted in step (b) by polymerase chain reaction with appropriate primers; (d) hybridizing the single-stranded DNA circles converted in step (b) with the complementary single-stranded nucleic acid molecules generated in step (c) to produce partial duplexes to an appropriate Cot; and (e) separating the unhybridized single-stranded DNA circles from the hybridized DNA circles, thereby generating a normalized cDNA library. This invention also provides a method to normalize a cDNA library wherein the generating of single-stranded nucleic acid molecules complementary to the single-stranded DNA circles converted in step (b) is by excising cDNA inserts from the double-stranded cDNA library; purifying the cDNA inserts from cloning vectors; and digesting the cDNA inserts with an exonuclease. This invention further provides a method to construct a subtractive cDNA library following the steps described above. This invention further provides normalized and/or subtractive cDNA libraries generated by the above methods. 25 figs.
Construction of BAC Libraries from Flow-Sorted Chromosomes.
Šafář, Jan; Šimková, Hana; Doležel, Jaroslav
2016-01-01
Cloned DNA libraries in bacterial artificial chromosome (BAC) are the most widely used form of large-insert DNA libraries. BAC libraries are typically represented by ordered clones derived from genomic DNA of a particular organism. In the case of large eukaryotic genomes, whole-genome libraries consist of a hundred thousand to a million clones, which make their handling and screening a daunting task. The labor and cost of working with whole-genome libraries can be greatly reduced by constructing a library derived from a smaller part of the genome. Here we describe construction of BAC libraries from mitotic chromosomes purified by flow cytometric sorting. Chromosome-specific BAC libraries facilitate positional gene cloning, physical mapping, and sequencing in complex plant genomes.
Lam, Kathy N; Charles, Trevor C
2015-01-01
Clone libraries provide researchers with a powerful resource to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, and allowed the mining of novel enzymes. Libraries are often constructed by cloning large inserts into cosmid or fosmid vectors. Recently, there have been reports of GC bias in fosmid metagenomic libraries, and it was speculated to be a result of fragmentation and loss of AT-rich sequences during cloning. However, evidence in the literature suggests that transcriptional activity or gene product toxicity may play a role. To explore possible mechanisms responsible for sequence bias in clone libraries, we constructed a cosmid library from a human microbiome sample and sequenced DNA from different steps during library construction: crude extract DNA, size-selected DNA, and cosmid library DNA. We confirmed a GC bias in the final cosmid library, and we provide evidence that the bias is not due to fragmentation and loss of AT-rich sequences but is likely occurring after DNA is introduced into Escherichia coli. To investigate the influence of strong constitutive transcription, we searched the sequence data for promoters and found that rpoD/σ(70) promoter sequences were underrepresented in the cosmid library. Furthermore, when we examined the genomes of taxa that were differentially abundant in the cosmid library relative to the original sample, we found the bias to be more correlated with the number of rpoD/σ(70) consensus sequences in the genome than with simple GC content. The GC bias of metagenomic libraries does not appear to be due to DNA fragmentation. Rather, analysis of promoter sequences provides support for the hypothesis that strong constitutive transcription from sequences recognized as rpoD/σ(70) consensus-like in E. coli may lead to instability, causing loss of the plasmid or loss of the insert DNA that gives rise to the transcription. Despite widespread use of E. coli to propagate foreign DNA in metagenomic libraries, the effects of in vivo transcriptional activity on clone stability are not well understood. Further work is required to tease apart the effects of transcription from those of gene product toxicity.
[cDNA library construction from panicle meristem of finger millet].
Radchuk, V; Pirko, Ia V; Isaenkov, S V; Emets, A I; Blium, Ia B
2014-01-01
The protocol for production of full-size cDNA using SuperScript Full-Length cDNA Library Construction Kit II (Invitrogen) was tested and high quality cDNA library from meristematic tissue of finger millet panicle (Eleusine coracana (L.) Gaertn) was created. The titer of obtained cDNA library comprised 3.01 x 10(5) CFU/ml in avarage. In average the length of cDNA insertion consisted about 1070 base pairs, the effectivity of cDNA fragment insertions--99.5%. The selective sequencing of cDNA clones from created library was performed. The sequences of cDNA clones were identified with usage of BLAST-search. The results of cDNA library analysis and selective sequencing represents prove good functionality and full length character of inserted cDNA clones. Obtained cDNA library from meristematic tissue of finger millet panicle represents good and valuable source for isolation and identification of key genes regulating metabolism and meristematic development and for mining of new molecular markers to conduct out high quality genetic investigations and molecular breeding as well.
Chen, Chao; Zhao, Xinqing; Jin, Yingyu; Zhao, Zongbao Kent; Suh, Joo-Won
2014-11-01
Bacterial artificial chromosomal (BAC) vectors are increasingly being used in cloning large DNA fragments containing complex biosynthetic pathways to facilitate heterologous production of microbial metabolites for drug development. To express inserted genes using Streptomyces species as the production hosts, an integration expression cassette is required to be inserted into the BAC vector, which includes genetic elements encoding a phage-specific attachment site, an integrase, an origin of transfer, a selection marker and a promoter. Due to the large sizes of DNA inserted into the BAC vectors, it is normally inefficient and time-consuming to assemble these fragments by routine PCR amplifications and restriction-ligations. Here we present a rapid method to insert fragments to construct BAC-based expression vectors. A DNA fragment of about 130 bp was designed, which contains upstream and downstream homologous sequences of both BAC vector and pIB139 plasmid carrying the whole integration expression cassette. In-Fusion cloning was performed using the designer DNA fragment to modify pIB139, followed by λ-RED-mediated recombination to obtain the BAC-based expression vector. We demonstrated the effectiveness of this method by rapid construction of a BAC-based expression vector with an insert of about 120 kb that contains the entire gene cluster for biosynthesis of immunosuppressant FK506. The empty BAC-based expression vector constructed in this study can be conveniently used for construction of BAC libraries using either microbial pure culture or environmental DNA, and the selected BAC clones can be directly used for heterologous expression. Alternatively, if a BAC library has already been constructed using a commercial BAC vector, the selected BAC vectors can be manipulated using the method described here to get the BAC-based expression vectors with desired gene clusters for heterologous expression. The rapid construction of a BAC-based expression vector facilitates heterologous expression of large gene clusters for drug discovery. Copyright © 2014 Elsevier Inc. All rights reserved.
Wang, Chao; Shi, Xue; Liu, Lin; Li, Haiyan; Ammiraju, Jetty S S; Kudrna, David A; Xiong, Wentao; Wang, Hao; Dai, Zhaozhao; Zheng, Yonglian; Lai, Jinsheng; Jin, Weiwei; Messing, Joachim; Bennetzen, Jeffrey L; Wing, Rod A; Luo, Meizhong
2013-11-01
Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics.
Younger, Andrew K D; Su, Peter Y; Shepard, Andrea J; Udani, Shreya V; Cybulski, Thaddeus R; Tyo, Keith E J; Leonard, Joshua N
2018-02-01
Naturally evolved metabolite-responsive biosensors enable applications in metabolic engineering, ranging from screening large genetic libraries to dynamically regulating biosynthetic pathways. However, there are many metabolites for which a natural biosensor does not exist. To address this need, we developed a general method for converting metabolite-binding proteins into metabolite-responsive transcription factors-Biosensor Engineering by Random Domain Insertion (BERDI). This approach takes advantage of an in vitro transposon insertion reaction to generate all possible insertions of a DNA-binding domain into a metabolite-binding protein, followed by fluorescence activated cell sorting to isolate functional biosensors. To develop and evaluate the BERDI method, we generated a library of candidate biosensors in which a zinc finger DNA-binding domain was inserted into maltose binding protein, which served as a model well-studied metabolite-binding protein. Library diversity was characterized by several methods, a selection scheme was deployed, and ultimately several distinct and functional maltose-responsive transcriptional biosensors were identified. We hypothesize that the BERDI method comprises a generalizable strategy that may ultimately be applied to convert a wide range of metabolite-binding proteins into novel biosensors for applications in metabolic engineering and synthetic biology. © The Author(s) 2018. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Wu, Chengcang; Proestou, Dina; Carter, Dorothy; Nicholson, Erica; Santos, Filippe; Zhao, Shaying; Zhang, Hong-Bin; Goldsmith, Marian R
2009-01-01
Background Manduca sexta, Heliothis virescens, and Heliconius erato represent three widely-used insect model species for genomic and fundamental studies in Lepidoptera. Large-insert BAC libraries of these insects are critical resources for many molecular studies, including physical mapping and genome sequencing, but not available to date. Results We report the construction and characterization of six large-insert BAC libraries for the three species and sampling sequence analysis of the genomes. The six BAC libraries were constructed with two restriction enzymes, two libraries for each species, and each has an average clone insert size ranging from 152–175 kb. We estimated that the genome coverage of each library ranged from 6–9 ×, with the two combined libraries of each species being equivalent to 13.0–16.3 × haploid genomes. The genome coverage, quality and utility of the libraries were further confirmed by library screening using 6~8 putative single-copy probes. To provide a first glimpse into these genomes, we sequenced and analyzed the BAC ends of ~200 clones randomly selected from the libraries of each species. The data revealed that the genomes are AT-rich, contain relatively small fractions of repeat elements with a majority belonging to the category of low complexity repeats, and are more abundant in retro-elements than DNA transposons. Among the species, the H. erato genome is somewhat more abundant in repeat elements and simple repeats than those of M. sexta and H. virescens. The BLAST analysis of the BAC end sequences suggested that the evolution of the three genomes is widely varied, with the genome of H. virescens being the most conserved as a typical lepidopteran, whereas both genomes of H. erato and M. sexta appear to have evolved significantly, resulting in a higher level of species- or evolutionary lineage-specific sequences. Conclusion The high-quality and large-insert BAC libraries of the insects, together with the identified BACs containing genes of interest, provide valuable information, resources and tools for comprehensive understanding and studies of the insect genomes and for addressing many fundamental questions in Lepidoptera. The sample of the genomic sequences provides the first insight into the constitution and evolution of the insect genomes. PMID:19558662
Physical mapping of complex genomes
Evans, G.A.
1993-06-15
A method for the simultaneous identification of overlapping cosmid clones among multiple cosmid clones and the use of the method for mapping complex genomes are provided. A library of cosmid clones that contains the DNA to be mapped is constructed and arranged in a manner such that individual clones can be identified and replicas of the arranged clones prepared. In preferred embodiments, the clones are arranged in a two dimensional matrix. In such embodiments, the cosmid clones in a row are pooled, mixed probes complementary to the ends of the DNA inserts in the pooled clones are synthesized, hybridized to a first replica of the library. Hybridizing clones, which include the pooled row, are identified. A second portion of clones is prepared by pooling cosmid clones that correspond to a column in the matrix. The second pool thereby includes one clone from the first portion pooled clones. This common clone is located on the replica at the intersection of the column and row. Mixed probes complementary to the ends of the DNA inserts in the second pooled portion of clones are prepared and hybridized to a second replica of the library. The hybridization pattern on the first and second replicas of the library are compared and cross-hybridizing clones, other than the clones in the pooled column and row, that hybridize to identical clones in the first and second replicas are identified. These clones necessarily include DNA inserts that overlap with the DNA insert in the common clone located at the intersection of the pooled row and pooled column. The DNA in the entire library may be mapped by pooling the clones in each of the rows and columns of the matrix, preparing mixed end-specific probes and hybridizing the probes from each row or column to a replica of the library. Since all clones in the library are located at the intersection of a column and a row, the overlapping clones for all clones in the library may be identified and a physical map constructed.
Survey of microsatellite DNA in pine
C. S. Echt; P. May-Marquardt
1997-01-01
A large insert genomic library from eastern white pine (Pinus strobus) was probed for the microsatellite motifs (AC)n and (AG)n, all 10 trinucleotide motifs, and 22 of the 33 possible tetranucleotide motifs. For comparison with a species from a different subgenus, a loblolly pine (Pinus taeda...
Survey of microsatellite DNA in pine
Craig S. Echt; P. May-Marquardt
1997-01-01
A large insert genomic library from eastern white pine (Pinus strobus) was probed for the microsatellite motifs (AC)n and (AG)n, all 10 trinucleotide motifs, and 22 of the 33 possible tetranucleotide motifs. For comparison with a species from a different subgenus, a loblolly pine (Pinus taeda) genomic...
Preparation and screening of an arrayed human genomic library generated with the P1 cloning system.
Shepherd, N S; Pfrogner, B D; Coulby, J N; Ackerman, S L; Vaidyanathan, G; Sauer, R H; Balkenhol, T C; Sternberg, N
1994-01-01
We describe here the construction and initial characterization of a 3-fold coverage genomic library of the human haploid genome that was prepared using the bacteriophage P1 cloning system. The cloned DNA inserts were produced by size fractionation of a Sau3AI partial digest of high molecular weight genomic DNA isolated from primary cells of human foreskin fibroblasts. The inserts were cloned into the pAd10sacBII vector and packaged in vitro into P1 phage. These were used to generate recombinant bacterial clones, each of which was picked robotically from an agar plate into a well of a 96-well microtiter dish, grown overnight, and stored at -70 degrees C. The resulting library, designated DMPC-HFF#1 series A, consists of approximately 130,000-140,000 recombinant clones that were stored in 1500 microtiter dishes. To screen the library, clones were combined in a pooling strategy and specific loci were identified by PCR analysis. On average, the library contains two or three different clones for each locus screened. To date we have identified a total of 17 clones containing the hypoxanthine-guanine phosphoribosyltransferase, human serum albumin-human alpha-fetoprotein, p53, cyclooxygenase I, human apurinic endonuclease, beta-polymerase, and DNA ligase I genes. The cloned inserts average 80 kb in size and range from 70 to 95 kb, with one 49-kb insert and one 62-kb insert. Images PMID:8146166
A novel helper phage enabling construction of genome-scale ORF-enriched phage display libraries.
Gupta, Amita; Shrivastava, Nimisha; Grover, Payal; Singh, Ajay; Mathur, Kapil; Verma, Vaishali; Kaur, Charanpreet; Chaudhary, Vijay K
2013-01-01
Phagemid-based expression of cloned genes fused to the gIIIP coding sequence and rescue using helper phages, such as VCSM13, has been used extensively for constructing large antibody phage display libraries. However, for randomly primed cDNA and gene fragment libraries, this system encounters reading frame problems wherein only one of 18 phages display the translated foreign peptide/protein fused to phagemid-encoded gIIIP. The elimination of phages carrying out-of-frame inserts is vital in order to improve the quality of phage display libraries. In this study, we designed a novel helper phage, AGM13, which carries trypsin-sensitive sites within the linker regions of gIIIP. This renders the phage highly sensitive to trypsin digestion, which abolishes its infectivity. For open reading frame (ORF) selection, the phagemid-borne phages are rescued using AGM13, so that clones with in-frame inserts express fusion proteins with phagemid-encoded trypsin-resistant gIIIP, which becomes incorporated into the phages along with a few copies of AGM13-encoded trypsin-sensitive gIIIP. In contrast, clones with out-of-frame inserts produce phages carrying only AGM13-encoded trypsin-sensitive gIIIP. Trypsin treatment of the phage population renders the phages with out-of-frame inserts non-infectious, whereas phages carrying in-frame inserts remain fully infectious and can hence be enriched by infection. This strategy was applied efficiently at a genome scale to generate an ORF-enriched whole genome fragment library from Mycobacterium tuberculosis, in which nearly 100% of the clones carried in-frame inserts after selection. The ORF-enriched libraries were successfully used for identification of linear and conformational epitopes for monoclonal antibodies specific to mycobacterial proteins.
Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)
Andreassen, Rune; Lunner, Sigbjørn; Høyheim, Bjørn
2009-01-01
Background Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to be helpful in order to distinguish between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full-length of the cDNA insert has been determined has been small. Results High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This suggests that the remaining cDNA libraries generated by SGP represent a valuable cCDS FLIc source. The conservation of 7-mers in 3'UTRs indicates that these motifs are functionally important. Identity between some of these 7-mers and miRNA target sequences suggests that they are miRNA targets in Salmo salar transcripts as well. PMID:19878547
Physical mapping of complex genomes
Evans, Glen A.
1993-01-01
Method for simultaneous identification of overlapping cosmid clones among multiple cosmid clones and the use of the method for mapping complex genomes are provided. A library of cosmid clones that contains the DNA to be mapped is constructed and arranged in a manner such that individual clones can be identified and replicas of the arranged clones prepared. In preferred embodiments, the clones are arranged in a two dimensional matrix. In such embodiments, the cosmid clones in a row are pooled, mixed probes complementary to the ends of the DNA inserts int he pooled clones are synthesized, hybridized to a first replica of the library. Hybridizing clones, which include the pooled row, are identified. A second portion of clones is prepared by pooling cosmid clones that correspond to a column in the matrix. The second pool thereby includes one clone from the first portion pooled clones. This common clone is located on the replica at the intersection of the column and row. Mixed probes complementary to the ends of the DNA inserts in the second pooled portion of clones are prepared and hybridized to a second replica of the library. The hybridization pattern on the first and second replicas of the library are compared and cross-hybridizing clones, other than the clones in the pooled column and row, that hybridize to identical clones in the first and second replicas are identified. These clones necessarily include DNA inserts that overlap with the DNA insert int he common clone located at the intersection of the pooled row and pooled column. The DNA in the entire library may be mapped by pooling the clones in each of the rows and columns of the matrix, preparing mixed end-specific probes and hybridizing the probes from each row or column to a replica of the library. Since all clones in the library are located at the intersection of a column and a row, the overlapping clones for all clones in the library may be identified and a physical map constructed. In other preferred embodiments, the cosmid clones are arranged in a three dimensional matrix, pooled and compared in threes according to intersecting planes of the three dimensional matrix. Arrangements corresponding to geometries of higher dimensions may also be prepared and used to simultaneously identify overlapping clones in highly complex libraries with relatively few hybridization reactions.
[Primary culture of cat intestinal epithelial cell and construction of its cDNA library].
Ye, L; Gui-Hua, Z; Kun, Y; Hong-Fa, W; Ting, X; Gong-Zhen, L; Wei-Xia, Z; Yong, C
2017-04-12
Objective To establish the primary cat intestinal epithelial cells (IECs) culture methods and construct the cDNA library for the following yeast two-hybrid experiment, so as to screen the virulence interaction factors among the final host. Methods The primary cat IECs were cultured by the tissue cultivation and combined digestion with collagenase XI and dispase I separately. Then the cat IECs cultured was identified with the morphological observation and cyto-keratin detection, by using goat anti-cyto-keratin monoclonal antibodies. The mRNA of cat IECs was isolated and used as the template to synthesize the first strand cDNA by SMART™ technology, and then the double-strand cDNAs were acquired by LD-PCR, which were subsequently cloned into the plasmid PGADT7-Rec to construct yeast two-hybrid cDNA library in the yeast strain Y187 by homologous recombination. Matchmaker™ Insert Check PCR was used to detect the size distribution of cDNA fragments after the capacity calculation of the cDNA library. Results The comparison of the two cultivation methods indicated that the combined digestion of collagenase XI and dispase I was more effective than the tissue cultivation. The cat IECs system of continuous culture was established and the cat IECs with high purity were harvested for constructing the yeast two-hybrid cDNA library. The library contained 1.1×10 6 independent clones. The titer was 2.8×10 9 cfu/ml. The size of inserted fragments was among 0.5-2.0 kb. Conclusion The yeast two-hybrid cDNA library of cat IECs meets the requirements of further screen research, and this study lays the foundation of screening the Toxoplasma gondii virulence interaction factors among the cDNA libraries of its final hosts.
Large-Scale Concatenation cDNA Sequencing
Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.
1997-01-01
A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
The effects of variable sample biomass on comparative metagenomics.
Chafee, Meghan; Maignien, Loïs; Simmons, Sheri L
2015-07-01
Longitudinal studies that integrate samples with variable biomass are essential to understand microbial community dynamics across space or time. Shotgun metagenomics is widely used to investigate these communities at the functional level, but little is known about the effects of combining low and high biomass samples on downstream analysis. We investigated the interacting effects of DNA input and library amplification by polymerase chain reaction on comparative metagenomic analysis using dilutions of a single complex template from an Arabidopsis thaliana-associated microbial community. We modified the Illumina Nextera kit to generate high-quality large-insert (680 bp) paired-end libraries using a range of 50 pg to 50 ng of input DNA. Using assembly-based metagenomic analysis, we demonstrate that DNA input level has a significant impact on community structure due to overrepresentation of low-GC genomic regions following library amplification. In our system, these differences were largely superseded by variations between biological replicates, but our results advocate verifying the influence of library amplification on a case-by-case basis. Overall, this study provides recommendations for quality filtering and de-replication prior to analysis, as well as a practical framework to address the issue of low biomass or biomass heterogeneity in longitudinal metagenomic surveys. © 2014 Society for Applied Microbiology and John Wiley & Sons Ltd.
Genomic sequencing of Pleistocene cave bears
DOE Office of Scientific and Technical Information (OSTI.GOV)
Noonan, James P.; Hofreiter, Michael; Smith, Doug
2005-04-01
Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of {approx}1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome,more » the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.« less
Entcheva, P; Liebl, W; Johann, A; Hartsch, T; Streit, W R
2001-01-01
Enrichment cultures of microbial consortia enable the diverse metabolic and catabolic activities of these populations to be studied on a molecular level and to be explored as potential sources for biotechnology processes. We have used a combined approach of enrichment culture and direct cloning to construct cosmid libraries with large (>30-kb) inserts from microbial consortia. Enrichment cultures were inoculated with samples from five environments, and high amounts of avidin were added to the cultures to favor growth of biotin-producing microbes. DNA was extracted from three of these enrichment cultures and used to construct cosmid libraries; each library consisted of between 6,000 and 35,000 clones, with an average insert size of 30 to 40 kb. The inserts contained a diverse population of genomic DNA fragments isolated from the consortia organisms. These three libraries were used to complement the Escherichia coli biotin auxotrophic strain ATCC 33767 Delta(bio-uvrB). Initial screens resulted in the isolation of seven different complementing cosmid clones, carrying biotin biosynthesis operons. Biotin biosynthesis capabilities and growth under defined conditions of four of these clones were studied. Biotin measured in the different culture supernatants ranged from 42 to 3,800 pg/ml/optical density unit. Sequencing the identified biotin synthesis genes revealed high similarities to bio operons from gram-negative bacteria. In addition, random sequencing identified other interesting open reading frames, as well as two operons, the histidine utilization operon (hut), and the cluster of genes involved in biosynthesis of molybdopterin cofactors in bacteria (moaABCDE).
Isolation of CYP3A5P cDNA from human liver: a reflection of a novel cytochrome P-450 pseudogene.
Schuetz, J D; Guzelian, P S
1995-03-14
We have isolated, from a human liver cDNA library, a 1627 bp CYP3A5 cDNA variant (CYP3A5P) that contains several large insertions, deletions, and in-frame termination codons. By comparison with the genomic structure of other CYP3A genes, the major insertions in CYP3A5P cDNA demarcate the inferred sites of several CYP3A5 exons. The segments inserted in CYP3A5P have no homology with splice donor acceptor sites. It is unlikely that CYP3A5P cDNA represents an artifact of the cloning procedures since Southern blot analysis of human genomic DNA disclosed that CYP3A5P cDNA hybridized with a DNA fragment distinct from fragments that hybridized with either CYP3A5, CYP3A3 or CYP3A4. Moreover, analysis of adult human liver RNA on Northern blots hybridized with a CYP3A5P cDNA fragment revealed the presence of an mRNA with the predicted size of CYP3A5P. We conclude that CYP3A5P cDNA was derived from a separate gene, CYP3A5P, most likely a pseudogene evolved from CYP3A5.
A High-Throughput Arabidopsis Reverse Genetics System
Sessions, Allen; Burke, Ellen; Presting, Gernot; Aux, George; McElver, John; Patton, David; Dietrich, Bob; Ho, Patrick; Bacwaden, Johana; Ko, Cynthia; Clarke, Joseph D.; Cotton, David; Bullis, David; Snell, Jennifer; Miguel, Trini; Hutchison, Don; Kimmerly, Bill; Mitzel, Theresa; Katagiri, Fumiaki; Glazebrook, Jane; Law, Marc; Goff, Stephen A.
2002-01-01
A collection of Arabidopsis lines with T-DNA insertions in known sites was generated to increase the efficiency of functional genomics. A high-throughput modified thermal asymetric interlaced (TAIL)-PCR protocol was developed and used to amplify DNA fragments flanking the T-DNA left borders from ∼100,000 transformed lines. A total of 85,108 TAIL-PCR products from 52,964 T-DNA lines were sequenced and compared with the Arabidopsis genome to determine the positions of T-DNAs in each line. Predicted T-DNA insertion sites, when mapped, showed a bias against predicted coding sequences. Predicted insertion mutations in genes of interest can be identified using Arabidopsis Gene Index name searches or by BLAST (Basic Local Alignment Search Tool) search. Insertions can be confirmed by simple PCR assays on individual lines. Predicted insertions were confirmed in 257 of 340 lines tested (76%). This resource has been named SAIL (Syngenta Arabidopsis Insertion Library) and is available to the scientific community at www.tmri.org. PMID:12468722
Mesarich, Carl H.; Rees-George, Jonathan; Gardner, Paul P.; Ghomi, Fatemeh Ashari; Gerth, Monica L.; Andersen, Mark T.; Rikkerink, Erik H. A.; Fineran, Peter C.
2017-01-01
Pseudomonas syringae pv. actinidiae (Psa), the causal agent of kiwifruit canker, is one of the most devastating plant diseases of recent times. We have generated two mini-Tn5-based random insertion libraries of Psa ICMP 18884. The first, a ‘phenotype of interest’ (POI) library, consists of 10,368 independent mutants gridded into 96-well plates. By replica plating onto selective media, the POI library was successfully screened for auxotrophic and motility mutants. Lipopolysaccharide (LPS) biosynthesis mutants with ‘Fuzzy-Spreader’-like morphologies were also identified through a visual screen. The second, a ‘mutant of interest’ (MOI) library, comprises around 96,000 independent mutants, also stored in 96-well plates, with approximately 200 individuals per well. The MOI library was sequenced on the Illumina MiSeq platform using Transposon-Directed Insertion site Sequencing (TraDIS) to map insertion sites onto the Psa genome. A grid-based PCR method was developed to recover individual mutants, and using this strategy, the MOI library was successfully screened for a putative LPS mutant not identified in the visual screen. The Psa chromosome and plasmid had 24,031 and 1,236 independent insertion events respectively, giving insertion frequencies of 3.65 and 16.6 per kb respectively. These data suggest that the MOI library is near saturation, with the theoretical probability of finding an insert in any one chromosomal gene estimated to be 97.5%. However, only 47% of chromosomal genes had insertions. This surprisingly low rate cannot be solely explained by the lack of insertions in essential genes, which would be expected to be around 5%. Strikingly, many accessory genes, including most of those encoding type III effectors, lacked insertions. In contrast, 94% of genes on the Psa plasmid had insertions, including for example, the type III effector HopAU1. These results suggest that some chromosomal sites are rendered inaccessible to transposon insertion, either by DNA-binding proteins or by the architecture of the nucleoid. PMID:28249011
Wang, Chun Ming; Lo, Loong Chueng; Feng, Felicia; Gong, Ping; Li, Jian; Zhu, Ze Yuan; Lin, Grace; Yue, Gen Hua
2008-03-25
Barramundi (Lates calcarifer) is an important farmed marine food fish species. Its first generation linkage map has been applied to map QTL for growth traits. To identify genes located in QTL responsible for specific traits, genomic large insert libraries are of crucial importance. We reported herein a bacterial artificial chromosome (BAC) library and the mapping of BAC clones to the linkage map. This BAC library consisted of 49,152 clones with an average insert size of 98 kb, representing 6.9-fold haploid genome coverage. Screening the library with 24 microsatellites and 15 ESTs/genes demonstrated that the library had good genome coverage. In addition, 62 novel microsatellites each isolated from 62 BAC clones were mapped onto the first generation linkage map. A total of 86 BAC clones were anchored on the linkage map with at least one BAC clone on each linkage group. We have constructed the first BAC library for L. calcarifer and mapped 86 BAC clones to the first generation linkage map. This BAC library and the improved linkage map with 302 DNA markers not only supply an indispensable tool to the integration of physical and linkage maps, the fine mapping of QTL and map based cloning genes located in QTL of commercial importance, but also contribute to comparative genomic studies and eventually whole genome sequencing.
Xu, De-Quan; Zhang, Yi-Bing; Xiong, Yuan-Zhu; Gui, Jian-Fang; Jiang, Si-Wen; Su, Yu-Hong
2003-07-01
Using suppression subtractive hybridization (SSH) technique, forward and reverse subtracted cDNA libraries were constructed between Longissimus muscles from Meishan and Landrace pigs. A housekeeping gene, G3PDH, was used to estimate the efficiency of subtractive cDNA. In two cDNA libraries, G3PDH was subtracted very efficiently at appropriate 2(10) and 2(5) folds, respectively, indicating that some differentially expressed genes were also enriched at the same folds and the two subtractive cDNA libraries were very successful. A total of 709 and 673 positive clones were isolated from forward and reverse subtracted cDNA libraries, respectively. Analysis of PCR showed that most of all plasmids in the clones contained 150-750 bp inserts. The construction of subtractive cDNA libraries between muscle tissue from different pig breeds laid solid foundations for isolating and identifying the genes determining muscle growth and meat quality, which will be important to understand the mechanism of muscle growth, determination of meat quality and practice of molecular breeding.
NASA Astrophysics Data System (ADS)
Chen, Juan; Zhu, Tianjiao; Li, Dehai; Cui, Chengbin; Fang, Yuchun; Liu, Hongbing; Liu, Peipei; Gu, Qianqun; Zhu, Weiming
2006-04-01
To study the bioactive metabolites produced by sponge-derived uncultured symbionts, a metagenomic DNA library of the symbionts of sponge Gelliodes gracilis was constructed. The average size of DNA inserts in the library was 20 kb. This library was screened for antibiotic activity using paper dise assaying. Two clones displayed the antibacterial activity against Micrococcus tetragenus. The metabolites of these two clones were analyzed through HPLC. The result showed that their metabolites were quite different from those of the host E. coli DH5α and the host containing vector pHZ132. This study may present a new approach to exploring bioactive metabolites of sponge symbionts.
Large exon size does not limit splicing in vivo.
Chen, I T; Chasin, L A
1994-03-01
Exon sizes in vertebrate genes are, with a few exceptions, limited to less than 300 bases. It has been proposed that this limitation may derive from the exon definition model of splice site recognition. In this model, a downstream donor site enhances splicing at the upstream acceptor site of the same exon. This enhancement may require contact between factors bound to each end of the exon; an exon size limitation would promote such contact. To test the idea that proximity was required for exon definition, we inserted random DNA fragments from Escherichia coli into a central exon in a three-exon dihydrofolate reductase minigene and tested whether the expanded exons were efficiently spliced. DNA from a plasmid library of expanded minigenes was used to transfect a CHO cell deletion mutant lacking the dhfr locus. PCR analysis of DNA isolated from the pooled stable cotransfectant populations displayed a range of DNA insert sizes from 50 to 1,500 nucleotides. A parallel analysis of the RNA from this population by reverse transcription followed by PCR showed a similar size distribution. Central exons as large as 1,400 bases could be spliced into mRNA. We also tested individual plasmid clones containing exon inserts of defined sizes. The largest exon included in mRNA was 1,200 bases in length, well above the 300-base limit implied by the survey of naturally occurring exons. We conclude that a limitation in exon size is not part of the exon definition mechanism.
Lyons, Eli; Sheridan, Paul; Tremmel, Georg; Miyano, Satoru; Sugano, Sumio
2017-10-24
High-throughput screens allow for the identification of specific biomolecules with characteristics of interest. In barcoded screens, DNA barcodes are linked to target biomolecules in a manner allowing for the target molecules making up a library to be identified by sequencing the DNA barcodes using Next Generation Sequencing. To be useful in experimental settings, the DNA barcodes in a library must satisfy certain constraints related to GC content, homopolymer length, Hamming distance, and blacklisted subsequences. Here we report a novel framework to quickly generate large-scale libraries of DNA barcodes for use in high-throughput screens. We show that our framework dramatically reduces the computation time required to generate large-scale DNA barcode libraries, compared with a naїve approach to DNA barcode library generation. As a proof of concept, we demonstrate that our framework is able to generate a library consisting of one million DNA barcodes for use in a fragment antibody phage display screening experiment. We also report generating a general purpose one billion DNA barcode library, the largest such library yet reported in literature. Our results demonstrate the value of our novel large-scale DNA barcode library generation framework for use in high-throughput screening applications.
[Cosmid libraries containing DNA from human chromosome 13].
Kapanadze, B I; Brodianskiĭ, V M; Baranova, A V; Sevat'ianov, S Iu; Fedorova, N D; Kurskov, M M; Kostina, M A; Mironov, A A; Sineokiĭ, S P; Zakhar'ev, V M; Grafodatskiĭ, A S; Modianov, N N; Iankovskiĭ, N K
1996-03-01
We characterized two cosmid libraries constructed from flow-sorted chromosome 13 at the Imperial Cancer Research Fund (ICRF), UK (13,000 clones) and Los Alamos National Laboratory (LANL), USA (17,000 clones). After storage for two years, clones showed high viability (95%) and structural stability. EcoR I and Hind III restriction patterns were studied in more than 500 ICRF and 200 LANL cosmids. The average size of inserts was shown to be 35-37 kb in both the libraries. Most cosmids (83% and 93% of ICRF and LANL libraries, respectively) exceed the lower size limit of DNA fragments that can be packaged and represent a good source for physical mapping of chromosome 13. Total length of inserts is four and five genome equivalents in the ICRF and LANL libraries, respectively. ICRF cosmids showed hybridization to 22 of 24 unique probes tested, which corresponds to a 90% probability of having any DNA fragment represented in the library. More than 1 Mb of chromosome 13 is overlapped by 90 cosmids of 22 groups revealed. A chromosomal region of more than 150 kb, containing the ATP1AL1 gene for alpha-1 peptide of Na+, K(+)-ATPase, is covered by 12 cosmids forming a contig. The results of restriction and hybridization analyses are stored in a CLONE database. These data and all the cosmids described are publicly available.
In vivo insertion pool sequencing identifies virulence factors in a complex fungal–host interaction
Uhse, Simon; Pflug, Florian G.; Stirnberg, Alexandra; Ehrlinger, Klaus; von Haeseler, Arndt
2018-01-01
Large-scale insertional mutagenesis screens can be powerful genome-wide tools if they are streamlined with efficient downstream analysis, which is a serious bottleneck in complex biological systems. A major impediment to the success of next-generation sequencing (NGS)-based screens for virulence factors is that the genetic material of pathogens is often underrepresented within the eukaryotic host, making detection extremely challenging. We therefore established insertion Pool-Sequencing (iPool-Seq) on maize infected with the biotrophic fungus U. maydis. iPool-Seq features tagmentation, unique molecular barcodes, and affinity purification of pathogen insertion mutant DNA from in vivo-infected tissues. In a proof of concept using iPool-Seq, we identified 28 virulence factors, including 23 that were previously uncharacterized, from an initial pool of 195 candidate effector mutants. Because of its sensitivity and quantitative nature, iPool-Seq can be applied to any insertional mutagenesis library and is especially suitable for genetically complex setups like pooled infections of eukaryotic hosts. PMID:29684023
ERIC Educational Resources Information Center
Galewsky, Samuel
2000-01-01
Introduces a series of molecular genetics laboratories where students pick a single colony from a Drosophila melanogester embryo cDNA library and purify the plasmid, then analyze the insert through restriction digests and gel electrophoresis. (Author/YDS)
Yung, Pui Yi; Burke, Catherine; Lewis, Matt; Egan, Suhelen; Kjelleberg, Staffan; Thomas, Torsten
2009-01-01
Metagenomics provides access to the uncultured majority of the microbial world. The approaches employed in this field have, however, had limited success in linking functional genes to the taxonomic or phylogenetic origin of the organism they belong to. Here we present an efficient strategy to recover environmental DNA fragments that contain phylogenetic marker genes from metagenomic libraries. Our method involves the cleavage of 23S ribsosmal RNA (rRNA) genes within pooled library clones by the homing endonuclease I-CeuI followed by the insertion and selection of an antibiotic resistance cassette. This approach was applied to screen a library of 6500 fosmid clones derived from the microbial community associated with the sponge Cymbastela concentrica. Several fosmid clones were recovered after the screen and detailed phylogenetic and taxonomic assignment based on the rRNA gene showed that they belong to previously unknown organisms. In addition, compositional features of these fosmid clones were used to classify and taxonomically assign a dataset of environmental shotgun sequences. Our approach represents a valuable tool for the analysis of rapidly increasing, environmental DNA sequencing information. PMID:19767618
Yang, XinChao; Li, MengHui; Liu, JianHua; Ji, YiHong; Li, XiangRui; Xu, LiXin; Yan, RuoFeng; Song, XiaoKai
2017-02-16
Eimeria maxima is one of the most prevalent Eimeria species causing avian coccidiosis, and results in huge economic loss to the global poultry industry. Current control strategies, such as anti-coccidial medication and live vaccines have been limited because of their drawbacks. The third generation anticoccidial vaccines including the recombinant vaccines as well as DNA vaccines have been suggested as a promising alternative strategy. To date, only a few protective antigens of E. maxima have been reported. Hence, there is an urgent need to identify novel protective antigens of E. maxima for the development of neotype anticoccidial vaccines. With the aim of identifying novel protective genes of E. maxima, a cDNA expression library of E. maxima sporozoites was constructed using Gateway technology. Subsequently, the cDNA expression library was divided into 15 sub-libraries for cDNA expression library immunization (cDELI) using parasite challenged model in chickens. Protective sub-libraries were selected for the next round of screening until individual protective clones were obtained, which were further sequenced and analyzed. Adopting the Gateway technology, a high-quality entry library was constructed, containing 9.2 × 10 6 clones with an average inserted fragments length of 1.63 kb. The expression library capacity was 2.32 × 10 7 colony-forming units (cfu) with an average inserted fragments length of 1.64 Kb. The expression library was screened using parasite challenged model in chickens. The screening yielded 6 immune protective genes including four novel protective genes of EmJS-1, EmRP, EmHP-1 and EmHP-2, and two known protective genes of EmSAG and EmCKRS. EmJS-1 is the selR domain-containing protein of E. maxima whose function is unknown. EmHP-1 and EmHP-2 are the hypothetical proteins of E. maxima. EmRP and EmSAG are rhomboid-like protein and surface antigen glycoproteins of E. maxima respectively, and involved in invasion of the parasite. Our results provide a cDNA expression library for further screening of T cell stimulating or inhibiting antigens of E. maxima. Moreover, our results provide six candidate protective antigens for developing new vaccines against E. maxima.
2013-10-09
have desirable traits. We aim to enlarge the E. coli genome using Lactobacillusplantarum genes to build cells tolerant to EtOH and BT. L. plantarum is...chemicals III. Approach Objective 1 & la: Integrated heterologous (L. plantarum ) DNA into the E. coli chromosome and selected for insertions that...developed in combination with genes identified from screening L. plantarum libraries. Additionally, we have screened heterologous libraries for
Aschard, Hugues; Cattoir, Vincent; Yoder-Himes, Deborah; Lory, Stephen; Pier, Gerald B.
2013-01-01
High-throughput sequencing of transposon (Tn) libraries created within entire genomes identifies and quantifies the contribution of individual genes and operons to the fitness of organisms in different environments. We used insertion-sequencing (INSeq) to analyze the contribution to fitness of all non-essential genes in the chromosome of Pseudomonas aeruginosa strain PA14 based on a library of ∼300,000 individual Tn insertions. In vitro growth in LB provided a baseline for comparison with the survival of the Tn insertion strains following 6 days of colonization of the murine gastrointestinal tract as well as a comparison with Tn-inserts subsequently able to systemically disseminate to the spleen following induction of neutropenia. Sequencing was performed following DNA extraction from the recovered bacteria, digestion with the MmeI restriction enzyme that hydrolyzes DNA 16 bp away from the end of the Tn insert, and fractionation into oligonucleotides of 1,200–1,500 bp that were prepared for high-throughput sequencing. Changes in frequency of Tn inserts into the P. aeruginosa genome were used to quantify in vivo fitness resulting from loss of a gene. 636 genes had <10 sequencing reads in LB, thus defined as unable to grow in this medium. During in vivo infection there were major losses of strains with Tn inserts in almost all known virulence factors, as well as respiration, energy utilization, ion pumps, nutritional genes and prophages. Many new candidates for virulence factors were also identified. There were consistent changes in the recovery of Tn inserts in genes within most operons and Tn insertions into some genes enhanced in vivo fitness. Strikingly, 90% of the non-essential genes were required for in vivo survival following systemic dissemination during neutropenia. These experiments resulted in the identification of the P. aeruginosa strain PA14 genes necessary for optimal survival in the mucosal and systemic environments of a mammalian host. PMID:24039572
Chen, Bo-Ruei; Hale, Devin C; Ciolek, Peter J; Runge, Kurt W
2012-05-03
Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches.
[Construction and characterization of a cDNA library from human liver tissue of cirrhosis].
Chen, Xiao-hong; Chen, Zhi; Chen, Feng; Zhu, Hai-hong; Zhou, Hong-juan; Yao, Hang-ping
2005-03-01
To construct a cDNA library from human liver tissue of cirrhosis. The total RNA from human liver tissue of cirrhosis was extracted using Trizol method, and the mRNA was purified using mRNA purification kit. SMART technique and CDSIII/3' primer were used for first-strand cDNA synthesis. Long distance PCR was then used to synthesize the double-strand cDNA that was then digested by proteinase K and Sfi I, and was fractionated by CHOMA SPIN-400 column. The cDNA fragments longer than 0.4 kb were collected and ligated to lambdaTripl Ex2 vector. Then lambda-phage packaging reaction and library amplification were performed. The qualities of both unamplified and amplified cDNA libraries was strictly checked by conventional titer determination. Eleven plaques were randomly picked and tested using PCR with universal primers derived from the sequence flanking the vector. The titers of unamplifed and amplified libraries were 1.03 x 10(6) pfu/ml and 1.36 x 10(9) pfu/ml respectively. The percentages of recombinants from both libraries were 97.24 % in unamplified library and 99.02 % in amplified library. The lengths of the inserts were 1.02 kb in average (36.36 % 1 approximately equals 2 kb and 63.64 % 0.5 approximately equals 1.0 kb). A high quality cDNA library from human liver tissue of cirrhosis was constructed successfully, which can be used for screening and cloning new special genes associated with the occurrence of cirrhosis.
Gong, Qian; Li, Chang-ying; Chang, Ji-wu; Zhu, Tie-hong
2012-06-01
To screen monoclonal antibodies to amylin from a constructed human phage antibody library and identify their antigenic specificity and combining activities. The heavy chain Fd fragment and light chain of human immunoglobulin genes were amplified from peripheral blood lymphocytes of healthy donors using RT-PCR, and then inserted into phagemid pComb3XSS to generate a human phage antibody library. The insertion of light chain or heavy chain Fd genes were identified by PCR after the digestion of Sac I, Xba I, Xho Iand Spe I. One of positive clones was analyzed by DNA sequencing. The specific anti-amylin clones were screened from antibody library against human amylin antigens and then the positive clones were determined by Phage-ELISA analysis. A Fab phage antibody library with 0.8×10(8); members was constructed with the efficacy of about 70%. DNA sequence analysis indicated V(H); gene belonged to V(H);3 gene family and V(λ); gene belonged to the V(λ); gene family. Using human amylin as panning antigen, specific anti-amylin Fab antibodies were enriched by screening the library for three times. Phage-ELISA assay showed the positive clones had very good specificity to amylin antigen. The successful construction of a phage antibody library and the identification of anti-amylin Fab antibodies provide a basis for further study and preparation of human anti-amylin antibodies.
[Construction of fetal mesenchymal stem cell cDNA subtractive library].
Yang, Li; Wang, Dong-Mei; Li, Liang; Bai, Ci-Xian; Cao, Hua; Li, Ting-Yu; Pei, Xue-Tao
2002-04-01
To identify differentially expressed genes between fetal mesenchymal stem cell (MSC) and adult MSC, especially specified genes expressed in fetal MSC, a cDNA subtractive library of fetal MSC was constructed using suppression subtractive hybridization (SSH) technique. At first, total RNA was isolated from fetal and adult MSC. Using SMART PCR synthesis method, single-strand and double-strand cDNAs were synthesized. After Rsa I digestion, fetal MSC cDNAs were divided into two groups and ligated to adaptor 1 and adaptor 2 respectively. Results showed that the amplified library contains 890 clones. Analysis of 890 clones with PCR demonstrated that 768 clones were positive. The positive rate is 86.3%. The size of inserted fragments in these positive clones was between 0.2 - 1 kb, with an average of 400 - 600 bp. SSH is a convenient and effective method for screening differentially expressed genes. The constructed cDNA subtractive library of fetal MSC cDNA lays solid foundation for screening and cloning new and specific function related genes of fetal MSC.
Genomic clones for human cholinesterase
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kott, M.; Venta, P.J.; Larsen, J.
1987-05-01
A human genomic library was prepared from peripheral white blood cells from a single donor by inserting an MboI partial digest into BamHI poly-linker sites of EMBL3. This library was screened using an oligolabeled human cholinesterase cDNA probe over 700 bp long. The latter probe was obtained from a human basal ganglia cDNA library. Of approximately 2 million clones screened with high stringency conditions several positive clones were identified; two have been plaque purified. One of these clones has been partially mapped using restriction enzymes known to cut within the coded region of the cDNA for human serum cholinesterase. Hybridizationmore » of the fragments and their sizes are as expected if the genomic clone is cholinesterase. Sequencing of the DNA fragments in M13 is in progress to verify the identify of the clone and the location of introns.« less
Li, Hong-Mei; Guo, Kang; Yu, Zhuang; Feng, Rui; Xu, Ping
2015-07-01
Traditional diagnostic technology with tumor biomarkers is inefficient, expensive and requires a large number of serum samples. The purpose of this study was to construct human lung cancer protein chips with new lung cancer biomarkers screened by the T7-phage display library, and improve the early diagnosis rate of lung cancer. A T7-phage cDNA display library was constructed of fresh samples from 30 lung cancer patients. With biopanning and high-throughput screening, we gained the immunogenic phage clones from the cDNA library. The insert of selected phage was blasted at GeneBank for alignment to find the exact or the most similar known genes. Protein chips were then constructed and used to assay their expression level in lung cancer serum from 217 cases of lung cancer groups:80 cases of benign lung disease and 220 healthy controls. After four rounds of Biopanning and two rounds of enzyme-linked immunosorbent assay, 12 phage monoclonal samples were selected from 2880 phage monoclonal samples. After blasting at GeneBank, six similar genes were used to construct diagnostic protein chips. The protein chips were then used to assay expression level in lung cancer serum. The expression level of six genes in lung cancer groups was significantly higher than those in the other two groups (P < 0.05). In this study, we successfully constructed diagnostic protein chips with biomarkers selected from the lung cancer T7-phage cDNA library, which can be used for the early screening of lung cancer patients.
Targeting vector construction through recombineering.
Malureanu, Liviu A
2011-01-01
Gene targeting in mouse embryonic stem cells is an essential, yet still very expensive and highly time-consuming, tool and method to study gene function at the organismal level or to create mouse models of human diseases. Conventional cloning-based methods have been largely used for generating targeting vectors, but are hampered by a number of limiting factors, including the variety and location of restriction enzymes in the gene locus of interest, the specific PCR amplification of repetitive DNA sequences, and cloning of large DNA fragments. Recombineering is a technique that exploits the highly efficient homologous recombination function encoded by λ phage in Escherichia coli. Bacteriophage-based recombination can recombine homologous sequences as short as 30-50 bases, allowing manipulations such as insertion, deletion, or mutation of virtually any genomic region. The large availability of mouse genomic bacterial artificial chromosome (BAC) libraries covering most of the genome facilitates the retrieval of genomic DNA sequences from the bacterial chromosomes through recombineering. This chapter describes a successfully applied protocol and aims to be a detailed guide through the steps of generation of targeting vectors through recombineering.
Magic Pools: Parallel Assessment of Transposon Delivery Vectors in Bacteria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Hualan; Price, Morgan N.; Waters, Robert Jordan
Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term “magic pools.” Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. Tomore » identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylumBacteroidetes. IMPORTANCEMolecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a “magic pool.” The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA “parts,” we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.« less
Magic Pools: Parallel Assessment of Transposon Delivery Vectors in Bacteria
Liu, Hualan; Price, Morgan N.; Waters, Robert Jordan; ...
2018-01-16
Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term “magic pools.” Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. Tomore » identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylumBacteroidetes. IMPORTANCEMolecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a “magic pool.” The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA “parts,” we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.« less
Huang, D; Wu, W; Zhou, Y; Hu, Z; Lu, L
2004-05-01
Construction of single chromosomal DNA libraries by means of chromosome microdissection and microcloning will be useful for genomic research, especially for those species that have not been extensively studied genetically. Application of the technology of microdissection and microcloning to woody fruit plants has not been reported hitherto, largely due to the generally small sizes of metaphase chromosomes and the difficulty of chromosome preparation. The present study was performed to establish a method for single chromosome microdissection and microcloning in woody fruit species using pomelo as a model. The standard karyotype of a pomelo cultivar ( Citrus grandis cv. Guanxi) was established based on 20 prometaphase photomicrographs. According to the standard karyotype, chromosome 1 was identified and isolated with fine glass microneedles controlled by a micromanipulator. DNA fragments ranging from 0.3 kb to 2 kb were acquired from the isolated single chromosome 1 via two rounds of PCR mediated by Sau3A linker adaptors and then cloned into T-easy vectors to generate a DNA library of chromosome 1. Approximately 30,000 recombinant clones were obtained. Evaluation based on 108 randomly selected clones showed that the sizes of the cloned inserts varied from 0.5 kb to 1.5 kb with an average of 860 bp. Our research suggests that microdissection and microcloning of single small chromosomes in woody plants is feasible.
Construction of C35 gene bait recombinants and T47D cell cDNA library.
Yin, Kun; Xu, Chao; Zhao, Gui-Hua; Liu, Ye; Xiao, Ting; Zhu, Song; Yan, Ge
2017-11-20
C35 is a novel tumor biomarker associated with metastasis progression. To investigate the interaction factors of C35 in its high expressed breast cancer cell lines, we constructed bait recombinant plasmids of C35 gene and T47D cell cDNA library for yeast two-hybrid screening. Full length C35 sequences were subcloned using RT-PCR from cDNA template extracted from T47D cells. Based on functional domain analysis, the full-length C35 1-348bp was also truncated into two fragments C351-153bp and C35154-348bp to avoid auto-activation. The three kinds of C35 genes were successfully amplified and inserted into pGBKT7 to construct bait recombinant plasmids pGBKT7-C351-348bp, pGBKT7-C351-153bp and pGBKT7-C35154-348bp, then transformed into Y187 yeast cells by the lithium acetate method. Auto-activation and toxicity of C35 baits were detected using nutritional deficient medium and X-α-Gal assays. The T47D cell ds cDNA was generated by SMART TM technology and the library was constructed using in vivo recombination-mediated cloning in the AH109 yeast strain using a pGADT7-Rec plasmid. The transformed Y187/pGBKT7-C351-348bp line was intensively inhibited while the truncated Y187/pGBKT7-C35 lines had no auto-activation and toxicity in yeast cells. The titer of established cDNA library was 2 × 10 7 pfu/mL with high transformation efficiency of 1.4 × 10 6 , and the insert size of ds cDNA was distributed homogeneously between 0.5-2.0 kb. Our research generated a T47D cell cDNA library with high titer, and the constructed two C35 "baits" contained a respective functional immunoreceptor tyrosine based activation motif (ITAM) and the conserved last four amino acids Cys-Ile-Leu-Val (CILV) motif, and therefore laid a foundation for screening the C35 interaction factors in a BC cell line.
2012-01-01
Background Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. Results An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. Conclusions This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches. PMID:22554201
Identification of genes differentially expressed in association with acquired cisplatin resistance
Johnsson, A; Zeelenberg, I; Min, Y; Hilinski, J; Berry, C; Howell, S B; Los, G
2000-01-01
The goal of this study was to identify genes whose mRNA levels are differentially expressed in human cells with acquired cisplatin (cDDP) resistance. Using the parental UMSCC10b head and neck carcinoma cell line and the 5.9-fold cDDP-resistant subline, UMSCC10b/Pt-S15, two suppressive subtraction hybridization (SSH) cDNA libraries were prepared. One library represented mRNAs whose levels were increased in the cDDP resistant variant (the UP library), the other one represented mRNAs whose levels were decreased in the resistant cells (the DOWN library). Arrays constructed with inserts recovered from these libraries were hybridized with SSH products to identify truly differentially expressed elements. A total of 51 cDNA fragments present in the UP library and 16 in the DOWN library met the criteria established for differential expression. The sequences of 87% of these cDNA fragments were identified in Genbank. Among the mRNAs in the UP library that were frequently isolated and that showed high levels of differential expression were cytochrome oxidase I, ribosomal protein 28S, elongation factor 1α, α-enolase, stathmin, and HSP70. The approach taken in this study permitted identification of many genes never before linked to the cDDP-resistant phenotype. © 2000 Cancer Research Campaign PMID:10993653
Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger
Liu, Chang-Qing; Lu, Tao-Feng; Feng, Bao-Gang; Liu, Dan; Guan, Wei-Jun; Ma, Yue-Hui
2010-01-01
In this study we successfully constructed a full-length cDNA library from Siberian tiger, Panthera tigris altaica, the most well-known wild Animal. Total RNA was extracted from cultured Siberian tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.30×106 pfu/ml and 1.62×109 pfu/ml respectively. The proportion of recombinants from unamplified library was 90.5% and average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142bps were then analyzed the BLASTX score revealed that 53.9% of the sequences were classified as strong match, 38.6% as nominal and 7.4% as weak match. 28.0% of them were found to be related to enzyme/catalytic protein, 20.9% ESTs to metabolism, 13.1% ESTs to transport, 12.1% ESTs to signal transducer/cell communication, 9.9% ESTs to structure protein, 3.9% ESTs to immunity protein/defense metabolism, 3.2% ESTs to cell cycle, and 8.9 ESTs classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genomic research of Siberian tigers. PMID:20941376
2012-01-01
Background The ovine Major Histocompatibility Complex (MHC) harbors genes involved in overall resistance/susceptibility of the host to infectious diseases. Compared to human and mouse, the ovine MHC is interrupted by a large piece of autosome insertion via a hypothetical chromosome inversion that constitutes ~25% of ovine chromosome 20. The evolutionary consequence of such an inversion and an insertion (inversion/insertion) in relation to MHC function remains unknown. We previously constructed a BAC clone physical map for the ovine MHC exclusive of the insertion region. Here we report the construction of a high-density physical map covering the autosome insertion in order to address the question of what the inversion/insertion had to do with ruminants during the MHC evolution. Results A total of 119 pairs of comparative bovine oligo primers were utilized to screen an ovine BAC library for positive clones and the orders and overlapping relationships of the identified clones were determined by DNA fingerprinting, BAC-end sequencing, and sequence-specific PCR. A total of 368 positive BAC clones were identified and 108 of the effective clones were ordered into an overlapping BAC contig to cover the consensus region between ovine MHC class IIa and IIb. Therefore, a continuous physical map covering the entire ovine autosome inversion/insertion region was successfully constructed. The map confirmed the bovine sequence assembly for the same homologous region. The DNA sequences of 185 BAC-ends have been deposited into NCBI database with the access numbers HR309252 through HR309068, corresponding to dbGSS ID 30164010 through 30163826. Conclusions We have constructed a high-density BAC clone physical map for the ovine autosome inversion/insertion between the MHC class IIa and IIb. The entire ovine MHC region is now fully covered by a continuous BAC clone contig. The physical map we generated will facilitate MHC functional studies in the ovine, as well as the comparative MHC evolution in ruminants. PMID:22897909
Preparation of fosmid libraries and functional metagenomic analysis of microbial community DNA.
Martínez, Asunción; Osburne, Marcia S
2013-01-01
One of the most important challenges in contemporary microbial ecology is to assign a functional role to the large number of novel genes discovered through large-scale sequencing of natural microbial communities that lack similarity to genes of known function. Functional screening of metagenomic libraries, that is, screening environmental DNA clones for the ability to confer an activity of interest to a heterologous bacterial host, is a promising approach for bridging the gap between metagenomic DNA sequencing and functional characterization. Here, we describe methods for isolating environmental DNA and constructing metagenomic fosmid libraries, as well as methods for designing and implementing successful functional screens of such libraries. © 2013 Elsevier Inc. All rights reserved.
Begin at the beginning: A BAC-end view of the passion fruit (Passiflora) genome.
Santos, Anselmo Azevedo; Penha, Helen Alves; Bellec, Arnaud; Munhoz, Carla de Freitas; Pedrosa-Harand, Andrea; Bergès, Hélène; Vieira, Maria Lucia Carneiro
2014-09-26
The passion fruit (Passiflora edulis) is a tropical crop of economic importance both for juice production and consumption as fresh fruit. The juice is also used in concentrate blends that are consumed worldwide. However, very little is known about the genome of the species. Therefore, improving our understanding of passion fruit genomics is essential and to some degree a pre-requisite if its genetic resources are to be used more efficiently. In this study, we have constructed a large-insert BAC library and provided the first view on the structure and content of the passion fruit genome, using BAC-end sequence (BES) data as a major resource. The library consisted of 82,944 clones and its levels of organellar DNA were very low. The library represents six haploid genome equivalents, and the average insert size was 108 kb. To check its utility for gene isolation, successful macroarray screening experiments were carried out with probes complementary to eight Passiflora gene sequences available in public databases. BACs harbouring those genes were used in fluorescent in situ hybridizations and unique signals were detected for four BACs in three chromosomes (n=9). Then, we explored 10,000 BES and we identified reads likely to contain repetitive mobile elements (19.6% of all BES), simple sequence repeats and putative proteins, and to estimate the GC content (~42%) of the reads. Around 9.6% of all BES were found to have high levels of similarity to plant genes and ontological terms were assigned to more than half of the sequences analysed (940). The vast majority of the top-hits made by our sequences were to Populus trichocarpa (24.8% of the total occurrences), Theobroma cacao (21.6%), Ricinus communis (14.3%), Vitis vinifera (6.5%) and Prunus persica (3.8%). We generated the first large-insert library for a member of Passifloraceae. This BAC library provides a new resource for genetic and genomic studies, as well as it represents a valuable tool for future whole genome study. Remarkably, a number of BAC-end pair sequences could be mapped to intervals of the sequenced Arabidopsis thaliana, V. vinifera and P. trichocarpa chromosomes, and putative collinear microsyntenic regions were identified.
Liu, Changqing; Bai, Chunyu; Guo, Yu; Liu, Dan; Lu, Taofeng; Li, Xiangchen; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun
2014-01-01
Bacterial artificial chromosome (BAC) libraries are extremely valuable for the genome-wide genetic dissection of complex organisms. The Siberian tiger, one of the most well-known wild primitive carnivores in China, is an endangered animal. In order to promote research on its genome, a high-redundancy BAC library of the Siberian tiger was constructed and characterized. The library is divided into two sub-libraries prepared from blood cells and two sub-libraries prepared from fibroblasts. This BAC library contains 153,600 individually archived clones; for PCR-based screening of the library, BACs were placed into 40 superpools of 10 × 384-deep well microplates. The average insert size of BAC clones was estimated to be 116.5 kb, representing approximately 6.46 genome equivalents of the haploid genome and affording a 98.86% statistical probability of obtaining at least one clone containing a unique DNA sequence. Screening the library with 19 microsatellite markers and a SRY sequence revealed that each of these markers were present in the library; the average number of positive clones per marker was 6.74 (range 2 to 12), consistent with 6.46 coverage of the tiger genome. Additionally, we identified 72 microsatellite markers that could potentially be used as genetic markers. This BAC library will serve as a valuable resource for physical mapping, comparative genomic study and large-scale genome sequencing in the tiger. PMID:24608928
Schouten, Henk J; Vande Geest, Henri; Papadimitriou, Sofia; Bemer, Marian; Schaart, Jan G; Smulders, Marinus J M; Perez, Gabino Sanchez; Schijlen, Elio
2017-03-01
Transformation resulted in deletions and translocations at T-DNA inserts, but not in genome-wide small mutations. A tiny T-DNA splinter was detected that probably would remain undetected by conventional techniques. We investigated to which extent Agrobacterium tumefaciens-mediated transformation is mutagenic, on top of inserting T-DNA. To prevent mutations due to in vitro propagation, we applied floral dip transformation of Arabidopsis thaliana. We re-sequenced the genomes of five primary transformants, and compared these to genomic sequences derived from a pool of four wild-type plants. By genome-wide comparisons, we identified ten small mutations in the genomes of the five transgenic plants, not correlated to the positions or number of T-DNA inserts. This mutation frequency is within the range of spontaneous mutations occurring during seed propagation in A. thaliana, as determined earlier. In addition, we detected small as well as large deletions specifically at the T-DNA insert sites. Furthermore, we detected partial T-DNA inserts, one of these a tiny 50-bp fragment originating from a central part of the T-DNA construct used, inserted into the plant genome without flanking other T-DNA. Because of its small size, we named this fragment a T-DNA splinter. As far as we know this is the first report of such a small T-DNA fragment insert in absence of any T-DNA border sequence. Finally, we found evidence for translocations from other chromosomes, flanking T-DNA inserts. In this study, we showed that next-generation sequencing (NGS) is a highly sensitive approach to detect T-DNA inserts in transgenic plants.
Easy preparation of a large-size random gene mutagenesis library in Escherichia coli.
You, Chun; Percival Zhang, Y-H
2012-09-01
A simple and fast protocol for the preparation of a large-size mutant library for directed evolution in Escherichia coli was developed based on the DNA multimers generated by prolonged overlap extension polymerase chain reaction (POE-PCR). This protocol comprised the following: (i) a linear DNA mutant library was generated by error-prone PCR or shuffling, and a linear vector backbone was prepared by regular PCR; (ii) the DNA multimers were generated based on these two DNA templates by POE-PCR; and (iii) the one restriction enzyme-digested DNA multimers were ligated to circular plasmids, followed by transformation to E. coli. Because the ligation efficiency of one DNA fragment was several orders of magnitude higher than that of two DNA fragments for typical mutant library construction, it was very easy to generate a mutant library with a size of more than 10(7) protein mutants per 50 μl of the POE-PCR product. Via this method, four new fluorescent protein mutants were obtained based on monomeric cherry fluorescent protein. This new protocol was simple and fast because it did not require labor-intensive optimizations in restriction enzyme digestion and ligation, did not involve special plasmid design, and enabled constructing a large-size mutant library for directed enzyme evolution within 1 day. Copyright © 2012 Elsevier Inc. All rights reserved.
Kim, Sunggil; Park, Jee Young; Yang, Tae-Jin
2015-06-01
Intact retrotransposon and DNA transposons inserted in a single gene were characterized in onions (Allium cepa) and their transcription and copy numbers were estimated in this study. While analyzing diverse onion germplasm, large insertions in the DFR-A gene encoding dihydroflavonol 4-reductase (DFR) involved in the anthocyanin biosynthesis pathway were found in two accessions. A 5,070-bp long terminal repeat (LTR) retrotransposon inserted in the active DFR-A (R4) allele was identified from one of the large insertions and designated AcCOPIA1. An intact ORF encoded typical domains of copia-like LTR retrotransposons. However, AcCOPIA1 contained atypical 'TG' and 'TA' dinucleotides at the ends of the LTRs. A 4,615-bp DNA transposon was identified in the other large insertion. This DNA transposon, designated AcCACTA1, contained an ORF coding for a transposase showing homology with the CACTA superfamily transposable elements (TEs). Another 5,073-bp DNA transposon was identified from the DFR-A (TRN) allele. This DNA transposon, designated AchAT1, belonged to the hAT superfamily with short 4-bp terminal inverted repeats (TIRs). Finally, a 6,258-bp non-autonomous DNA transposon, designated AcPINK, was identified in the ANS-p allele encoding anthocyanidin synthase, the next downstream enzyme to DFR in the anthocyanin biosynthesis pathway. AcPINK also possessed very short 3-bp TIRs. Active transcription of AcCOPIA1, AcCACTA1, and AchAT1 was observed through RNA-Seq analysis and RT-PCR. The copy numbers of AcPINK estimated by mapping the genomic DNA reads produced by NextSeq 500 were predominantly high compared with the other TEs. A series of evidence indicated that these TEs might have transposed in these onion genes very recently, providing a stepping stone for elucidation of enormously large-sized onion genome structure.
High-throughput microtitre plate-based assay for DNA topoisomerases.
Taylor, James A; Burton, Nicolas P; Maxwell, Anthony
2012-01-01
We have developed a rapid, high-throughput assay for measuring the catalytic activity (DNA supercoiling or relaxation) of DNA topoisomerases. The assay utilizes intermolecular triplex formation between an immobilized triplex-forming oligo (TFO) and a triplex-forming region inserted into the plasmid substrate (pNO1), and capitalizes on the observation that supercoiled DNA forms triplexes more readily than relaxed DNA. Thus, supercoiled DNA is preferentially retained by the TFO under triplex-forming conditions while relaxed DNA can be washed away. Due to its high speed of sample analysis and reduced sample handling over conventional gel-based techniques, this assay can be used to screen chemical libraries for novel inhibitors of topoisomerases.
Yang, Bing-Yan; Huo, Xiu-Ai; Li, Peng-Fei; Wang, Cui-Xia; Duan, Hui-Jun
2014-08-01
Full-length cDNAs are very important for genome annotation and functional analysis of genes. The number of full-length cDNAs from watermelon remains limited. Here we report first the construction of a full-length enriched cDNA library from Fusarium wilt stressed watermelon (Citrullus lanatus Thunb.) cultivar PI296341 root tissues using the SMART method. The titer of primary cDNA library and amplified library was 2.21 x 10(6) and 2.0 x 10(10) pfu/ml, respectively and the rate of recombinant was above 85%. The size of insert fragment ranged from 0.3 to 2.0 kb. In this study, we first cloned a gene named ClWRKY1, which was 1981 bp long and encoded a protein consisting of 394 amino acids. It contained two characteristic WRKY domains and two zinc finger motifs. Quantitative real-time PCR showed that ClWRKY1 expression levels reached maximum level at 12 h after inoculation with Fusarium oxysporum f. sp. niveum. The full-length cDNA library of watermelon root tissues is not only essential for the cloning of genes which are known, but also an initial key for the screening and cloning of new genes that might be involved in resistance to Fusarium wilt.
NASA Astrophysics Data System (ADS)
Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.
1984-08-01
A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.
Friis, Thor Einar; Stephenson, Sally; Xiao, Yin; Whitehead, Jon
2014-01-01
The sheep (Ovis aries) is favored by many musculoskeletal tissue engineering groups as a large animal model because of its docile temperament and ease of husbandry. The size and weight of sheep are comparable to humans, which allows for the use of implants and fixation devices used in human clinical practice. The construction of a complimentary DNA (cDNA) library can capture the expression of genes in both a tissue- and time-specific manner. cDNA libraries have been a consistent source of gene discovery ever since the technology became commonplace more than three decades ago. Here, we describe the construction of a cDNA library using cells derived from sheep bones based on the pBluescript cDNA kit. Thirty clones were picked at random and sequenced. This led to the identification of a novel gene, C12orf29, which our initial experiments indicate is involved in skeletal biology. We also describe a polymerase chain reaction-based cDNA clone isolation method that allows the isolation of genes of interest from a cDNA library pool. The techniques outlined here can be applied in-house by smaller tissue engineering groups to generate tools for biomolecular research for large preclinical animal studies and highlights the power of standard cDNA library protocols to uncover novel genes. PMID:24447069
Gadkar, Vijay J; Filion, Martin
2013-06-01
In various experimental systems, limiting available amounts of RNA may prevent a researcher from performing large-scale analyses of gene transcripts. One way to circumvent this is to 'pre-amplify' the starting RNA/cDNA, so that sufficient amounts are available for any downstream analysis. In the present study, we report the development of a novel protocol for constructing amplified cDNA libraries using the Phi29 DNA polymerase based multiple displacement amplification (MDA) system. Using as little as 200 ng of total RNA, we developed a linear concatenation strategy to make the single-stranded cDNA template amenable for MDA. The concatenation, made possible by the template switching property of the reverse transcriptase enzyme, resulted in the amplified cDNA library with intact 5' ends. MDA generated micrograms of template, allowing large-scale polymerase chain reaction analyses or other large-scale downstream applications. As the amplified cDNA library contains intact 5' ends, it is also compatible with 5' RACE analyses of specific gene transcripts. Empirical validation of this protocol is demonstrated on a highly characterized (tomato) and an uncharacterized (corn gromwell) experimental system.
Subtraction of cap-trapped full-length cDNA libraries to select rare transcripts.
Hirozane-Kishikawa, Tomoko; Shiraki, Toshiyuki; Waki, Kazunori; Nakamura, Mari; Arakawa, Takahiro; Kawai, Jun; Fagiolini, Michela; Hensch, Takao K; Hayashizaki, Yoshihide; Carninci, Piero
2003-09-01
The normalization and subtraction of highly expressed cDNAs from relatively large tissues before cloning dramatically enhanced the gene discovery by sequencing for the mouse full-length cDNA encyclopedia, but these methods have not been suitable for limited RNA materials. To normalize and subtract full-length cDNA libraries derived from limited quantities of total RNA, here we report a method to subtract plasmid libraries excised from size-unbiased amplified lambda phage cDNA libraries that avoids heavily biasing steps such as PCR and plasmid library amplification. The proportion of full-length cDNAs and the gene discovery rate are high, and library diversity can be validated by in silico randomization.
Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng
2012-01-01
To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing. PMID:23202944
Hogg, Matthew; Seki, Mineaki; Wood, Richard D; Doublié, Sylvie; Wallace, Susan S
2011-01-21
DNA polymerase θ (POLQ, polθ) is a large, multidomain DNA polymerase encoded in higher eukaryotic genomes. It is important for maintaining genetic stability in cells and helping protect cells from DNA damage caused by ionizing radiation. POLQ contains an N-terminal helicase-like domain, a large central domain of indeterminate function, and a C-terminal polymerase domain with sequence similarity to the A-family of DNA polymerases. The enzyme has several unique properties, including low fidelity and the ability to insert and extend past abasic sites and thymine glycol lesions. It is not known whether the abasic site bypass activity is an intrinsic property of the polymerase domain or whether helicase activity is also required. Three "insertion" sequence elements present in POLQ are not found in any other A-family DNA polymerase, and it has been proposed that they may lend some unique properties to POLQ. Here, we analyzed the activity of the DNA polymerase in the absence of each sequence insertion. We found that the pol domain is capable of highly efficient bypass of abasic sites in the absence of the helicase-like or central domains. Insertion 1 increases the processivity of the polymerase but has little, if any, bearing on the translesion synthesis properties of the enzyme. However, removal of insertions 2 and 3 reduces activity on undamaged DNA and completely abrogates the ability of the enzyme to bypass abasic sites or thymine glycol lesions. Copyright © 2010 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Zhan, Aibin; Bao, Zhenmin; Hu, Xiaoli; Lu, Wei; Hu, Jingjie
2009-06-01
Microsatellite markers have become one kind of the most important molecular tools used in various researches. A large number of microsatellite markers are required for the whole genome survey in the fields of molecular ecology, quantitative genetics and genomics. Therefore, it is extremely necessary to select several versatile, low-cost, efficient and time- and labor-saving methods to develop a large panel of microsatellite markers. In this study, we used Zhikong scallop ( Chlamys farreri) as the target species to compare the efficiency of the five methods derived from three strategies for microsatellite marker development. The results showed that the strategy of constructing small insert genomic DNA library resulted in poor efficiency, while the microsatellite-enriched strategy highly improved the isolation efficiency. Although the mining public database strategy is time- and cost-saving, it is difficult to obtain a large number of microsatellite markers, mainly due to the limited sequence data of non-model species deposited in public databases. Based on the results in this study, we recommend two methods, microsatellite-enriched library construction method and FIASCO-colony hybridization method, for large-scale microsatellite marker development. Both methods were derived from the microsatellite-enriched strategy. The experimental results obtained from Zhikong scallop also provide the reference for microsatellite marker development in other species with large genomes.
Namouchi, Amine; Mardassi, Helmi
2006-11-01
Evidence suggests that insertion of the IS6110 element is not without consequence to the biology of Mycobacterium tuberculosis complex strains. Thus, mapping of multiple IS6110 insertion sites in the genome of biomedically relevant clinical isolates would result in a better understanding of the role of this mobile element, particularly with regard to transmission, adaptability and virulence. In the present paper, we describe a versatile strategy, referred to as GL-PCR, that amplifies IS6110-flanking sequences based on the construction of a genomic library. M. tuberculosis chromosomal DNA is fully digested with HincII and then ligated into a plasmid vector between T7 and T3 promoter sequences. The ligation reaction product is transformed into Escherichia coli and selective PCR amplification targeting both 5' and 3' IS6110-flanking sequences are performed on the plasmid library DNA. For this purpose, four separate PCR reactions are performed, each combining an outward primer specific for one IS6110 end with either T7 or T3 primer. Determination of the nucleotide sequence of the PCR products generated from a single ligation reaction allowed mapping of 21 out of the 24 IS6110 copies of two 12 banded M. tuberculosis strains, yielding an overall sensitivity of 87,5%. Furthermore, by simply comparing the migration pattern of GL-PCR-generated products, the strategy proved to be as valuable as IS6110 RFLP for molecular typing of M. tuberculosis complex strains. Importantly, GL-PCR was able to discriminate between strains differing by a single IS6110 band.
Method of generating ploynucleotides encoding enhanced folding variants
Bradbury, Andrew M.; Kiss, Csaba; Waldo, Geoffrey S.
2017-05-02
The invention provides directed evolution methods for improving the folding, solubility and stability (including thermostability) characteristics of polypeptides. In one aspect, the invention provides a method for generating folding and stability-enhanced variants of proteins, including but not limited to fluorescent proteins, chromophoric proteins and enzymes. In another aspect, the invention provides methods for generating thermostable variants of a target protein or polypeptide via an internal destabilization baiting strategy. Internally destabilization a protein of interest is achieved by inserting a heterologous, folding-destabilizing sequence (folding interference domain) within DNA encoding the protein of interest, evolving the protein sequences adjacent to the heterologous insertion to overcome the destabilization (using any number of mutagenesis methods), thereby creating a library of variants. The variants in the library are expressed, and those with enhanced folding characteristics selected.
Franzini, Raphael M; Samain, Florent; Abd Elrahman, Maaly; Mikutis, Gediminas; Nauer, Angela; Zimmermann, Mauro; Scheuermann, Jörg; Hall, Jonathan; Neri, Dario
2014-08-20
DNA-encoded chemical libraries are collections of small molecules, attached to DNA fragments serving as identification barcodes, which can be screened against multiple protein targets, thus facilitating the drug discovery process. The preparation of large DNA-encoded chemical libraries crucially depends on the availability of robust synthetic methods, which enable the efficient conjugation to oligonucleotides of structurally diverse building blocks, sharing a common reactive group. Reactions of DNA derivatives with amines and/or carboxylic acids are particularly attractive for the synthesis of encoded libraries, in view of the very large number of building blocks that are commercially available. However, systematic studies on these reactions in the presence of DNA have not been reported so far. We first investigated conditions for the coupling of primary amines to oligonucleotides, using either a nucleophilic attack on chloroacetamide derivatives or a reductive amination on aldehyde-modified DNA. While both methods could be used for the production of secondary amines, the reductive amination approach was generally associated with higher yields and better purity. In a second endeavor, we optimized conditions for the coupling of a diverse set of 501 carboxylic acids to DNA derivatives, carrying primary and secondary amine functions. The coupling efficiency was generally higher for primary amines, compared to secondary amine substituents, but varied considerably depending on the structure of the acids and on the synthetic methods used. Optimal reaction conditions could be found for certain sets of compounds (with conversions >80%), but multiple reaction schemes are needed when assembling large libraries with highly diverse building blocks. The reactions and experimental conditions presented in this article should facilitate the synthesis of future DNA-encoded chemical libraries, while outlining the synthetic challenges that remain to be overcome.
Chapter 7. Cloning and analysis of natural product pathways.
Gust, Bertolt
2009-01-01
The identification of gene clusters of natural products has lead to an enormous wealth of information about their biosynthesis and its regulation, and about self-resistance mechanisms. Well-established routine techniques are now available for the cloning and sequencing of gene clusters. The subsequent functional analysis of the complex biosynthetic machinery requires efficient genetic tools for manipulation. Until recently, techniques for the introduction of defined changes into Streptomyces chromosomes were very time-consuming. In particular, manipulation of large DNA fragments has been challenging due to the absence of suitable restriction sites for restriction- and ligation-based techniques. The homologous recombination approach called recombineering (referred to as Red/ET-mediated recombination in this chapter) has greatly facilitated targeted genetic modifications of complex biosynthetic pathways from actinomycetes by eliminating many of the time-consuming and labor-intensive steps. This chapter describes techniques for the cloning and identification of biosynthetic gene clusters, for the generation of gene replacements within such clusters, for the construction of integrative library clones and their expression in heterologous hosts, and for the assembly of entire biosynthetic gene clusters from the inserts of individual library clones. A systematic approach toward insertional mutation of a complete Streptomyces genome is shown by the use of an in vitro transposon mutagenesis procedure.
DNA-encoded chemical libraries: advancing beyond conventional small-molecule libraries.
Franzini, Raphael M; Neri, Dario; Scheuermann, Jörg
2014-04-15
DNA-encoded chemical libraries (DECLs) represent a promising tool in drug discovery. DECL technology allows the synthesis and screening of chemical libraries of unprecedented size at moderate costs. In analogy to phage-display technology, where large antibody libraries are displayed on the surface of filamentous phage and are genetically encoded in the phage genome, DECLs feature the display of individual small organic chemical moieties on DNA fragments serving as amplifiable identification barcodes. The DNA-tag facilitates the synthesis and allows the simultaneous screening of very large sets of compounds (up to billions of molecules), because the hit compounds can easily be identified and quantified by PCR-amplification of the DNA-barcode followed by high-throughput DNA sequencing. Several approaches have been used to generate DECLs, differing both in the methods used for library encoding and for the combinatorial assembly of chemical moieties. For example, DECLs can be used for fragment-based drug discovery, displaying a single molecule on DNA or two chemical moieties at the extremities of complementary DNA strands. DECLs can vary substantially in the chemical structures and the library size. While ultralarge libraries containing billions of compounds have been reported containing four or more sets of building blocks, also smaller libraries have been shown to be efficient for ligand discovery. In general, it has been found that the overall library size is a poor predictor for library performance and that the number and diversity of the building blocks are rather important indicators. Smaller libraries consisting of two to three sets of building blocks better fulfill the criteria of drug-likeness and often have higher quality. In this Account, we present advances in the DECL field from proof-of-principle studies to practical applications for drug discovery, both in industry and in academia. DECL technology can yield specific binders to a variety of target proteins and is likely to become a standard tool for pharmaceutical hit discovery, lead expansion, and Chemical Biology research. The introduction of new methodologies for library encoding and for compound synthesis in the presence of DNA is an exciting research field and will crucially contribute to the performance and the propagation of the technology.
Williamson, Lynn L; Borlee, Bradley R; Schloss, Patrick D; Guan, Changhui; Allen, Heather K; Handelsman, Jo
2005-10-01
The goal of this study was to design and evaluate a rapid screen to identify metagenomic clones that produce biologically active small molecules. We built metagenomic libraries with DNA from soil on the floodplain of the Tanana River in Alaska. We extracted DNA directly from the soil and cloned it into fosmid and bacterial artificial chromosome vectors, constructing eight metagenomic libraries that contain 53,000 clones with inserts ranging from 1 to 190 kb. To identify clones of interest, we designed a high throughput "intracellular" screen, designated METREX, in which metagenomic DNA is in a host cell containing a biosensor for compounds that induce bacterial quorum sensing. If the metagenomic clone produces a quorum-sensing inducer, the cell produces green fluorescent protein (GFP) and can be identified by fluorescence microscopy or captured by fluorescence-activated cell sorting. Our initial screen identified 11 clones that induce and two that inhibit expression of GFP. The intracellular screen detected quorum-sensing inducers among metagenomic clones that a traditional overlay screen would not. One inducing clone carries a LuxI homologue that directs the synthesis of an N-acyl homoserine lactone quorum-sensing signal molecule. The LuxI homologue has 62% amino acid sequence identity to its closest match in GenBank, AmfI from Pseudomonas fluorescens, and is on a 78-kb insert that contains 67 open reading frames. Another inducing clone carries a gene with homology to homocitrate synthase. Our results demonstrate the power of an intracellular screen to identify functionally active clones and biologically active small molecules in metagenomic libraries.
Angsuthanasombat, C; Chungjatupornchai, W; Kertbundit, S; Luxananil, P; Settasatian, C; Wilairat, P; Panyim, S
1987-07-01
Five recombinant E. coli clones exhibiting toxicity to Aedes aegypti larvae were obtained from a library of 800 clones containing XbaI DNA fragments of 110 kb plasmid from B. thuringiensis var. israelensis. All the five clones (pMU 14/258/303/388/679) had the same 3.8-kb insert and encoded a major protein of 130 kDa which was highly toxic to A. aegypti larvae. Three clones (pMU 258/303/388) transcribed the 130 kD a gene in the same direction as that of lac Z promoter of pUC12 vector whereas the transcription of the other two (pMU 14/679) was in the opposite direction. A 1.9-kb fragment of the 3.8 kb insert coded for a protein of 65 kDa. Partial DNA sequence of the 3.8 kb insert, corresponding to the 5'-terminal of the 130 kDa gene, revealed a continuous reading frame, a Shine-Dalgarno sequence and a tentative 5'-regulatory region. These results demonstrated that the 3.8 kb insert is a minimal DNA fragment containing a regulatory region plus the coding sequence of the 130 kDa protein that is highly toxic to mosquito larvae.
Cloning and Expression of cDNA for Rat Heme Oxygenase
NASA Astrophysics Data System (ADS)
Shibahara, Shigeki; Muller, Rita; Taguchi, Hayao; Yoshida, Tadashi
1985-12-01
Two cDNA clones for rat heme oxygenase have been isolated from a rat spleen cDNA library in λ gt11 by immunological screening using a specific polyclonal antibody. One of these clones has an insert of 1530 nucleotides that contains the entire protein-coding region. To confirm that the isolated cDNA encodes heme oxygenase, we transfected monkey kidney cells (COS-7) with the cDNA carried in a simian virus 40 vector. The heme oxygenase was highly expressed in endoplasmic reticulum of transfected cells. The nucleotide sequence of the cloned cDNA was determined and the primary structure of heme oxygenase was deduced. Heme oxygenase is composed of 289 amino acids and has one hydrophobic segment at its carboxyl terminus, which is probably important for the insertion of heme oxygenase into endoplasmic reticulum. The cloned cDNA was used to analyze the induction of heme oxygenase in rat liver by treatment with CoCl2 or with hemin. RNA blot analysis showed that both CoCl2 and hemin increased the amount of hybridizable mRNA, suggesting that these substances may act at the transcriptional level to increase the amount of heme oxygenase.
Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide
2011-09-01
Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.
NASA Astrophysics Data System (ADS)
Lau, Yun-Fai; Kan, Yuet Wai
1983-09-01
We have developed a series of cosmids that can be used as vectors for genomic recombinant DNA library preparations, as expression vectors in mammalian cells for both transient and stable transformations, and as shuttle vectors between bacteria and mammalian cells. These cosmids were constructed by inserting one of the SV2-derived selectable gene markers-SV2-gpt, SV2-DHFR, and SV2-neo-in cosmid pJB8. High efficiency of genomic cloning was obtained with these cosmids and the size of the inserts was 30-42 kilobases. We isolated recombinant cosmids containing the human α -globin gene cluster from these genomic libraries. The simian virus 40 DNA in these selectable gene markers provides the origin of replication and enhancer sequences necessary for replication in permissive cells such as COS 7 cells and thereby allows transient expression of α -globin genes in these cells. These cosmids and their recombinants could also be stably transformed into mammalian cells by using the respective selection systems. Both of the adult α -globin genes were more actively expressed than the embryonic zeta -globin genes in these transformed cell lines. Because of the presence of the cohesive ends of the Charon 4A phage in the cosmids, the transforming DNA sequences could readily be rescued from these stably transformed cells into bacteria by in vitro packaging of total cellular DNA. Thus, these cosmid vectors are potentially useful for direct isolation of structural genes.
Systematic cloning of human minisatellites from ordered array charomid libraries.
Armour, J A; Povey, S; Jeremiah, S; Jeffreys, A J
1990-11-01
We present a rapid and efficient method for the isolation of minisatellite loci from human DNA. The method combines cloning a size-selected fraction of human MboI DNA fragments in a charomid vector with hybridization screening of the library in ordered array. Size-selection of large MboI fragments enriches for the longer, more variable minisatellites and reduces the size of the library required. The library was screened with a series of multi-locus probes known to detect a large number of hypervariable loci in human DNA. The gridded library allowed both the rapid processing of positive clones and the comparative evaluation of the different multi-locus probes used, in terms of both the relative success in detecting hypervariable loci and the degree of overlap between the sets of loci detected. We report 23 new human minisatellite loci isolated by this method, which map to 14 autosomes and the sex chromosomes.
Phylogenetics of modern birds in the era of genomics
Edwards, Scott V; Bryan Jennings, W; Shedlock, Andrew M
2005-01-01
In the 14 years since the first higher-level bird phylogenies based on DNA sequence data, avian phylogenetics has witnessed the advent and maturation of the genomics era, the completion of the chicken genome and a suite of technologies that promise to add considerably to the agenda of avian phylogenetics. In this review, we summarize current approaches and data characteristics of recent higher-level bird studies and suggest a number of as yet untested molecular and analytical approaches for the unfolding tree of life for birds. A variety of comparative genomics strategies, including adoption of objective quality scores for sequence data, analysis of contiguous DNA sequences provided by large-insert genomic libraries, and the systematic use of retroposon insertions and other rare genomic changes all promise an integrated phylogenetics that is solidly grounded in genome evolution. The avian genome is an excellent testing ground for such approaches because of the more balanced representation of single-copy and repetitive DNA regions than in mammals. Although comparative genomics has a number of obvious uses in avian phylogenetics, its application to large numbers of taxa poses a number of methodological and infrastructural challenges, and can be greatly facilitated by a ‘community genomics’ approach in which the modest sequencing throughputs of single PI laboratories are pooled to produce larger, complementary datasets. Although the polymerase chain reaction era of avian phylogenetics is far from complete, the comparative genomics era—with its ability to vastly increase the number and type of molecular characters and to provide a genomic context for these characters—will usher in a host of new perspectives and opportunities for integrating genome evolution and avian phylogenetics. PMID:16024355
The development and characterisation of a bacterial artificial chromosome library for Fragaria vesca
Bonet, Julio; Girona, Elena Lopez; Sargent, Daniel J; Muñoz-Torres, Monica C; Monfort, Amparo; Abbott, Albert G; Arús, Pere; Simpson, David W; Davik, Jahn
2009-01-01
Background The cultivated strawberry Fragaria ×ananassa is one of the most economically-important soft-fruit species. Few structural genomic resources have been reported for Fragaria and there exists an urgent need for the development of physical mapping resources for the genus. The first stage in the development of a physical map for Fragaria is the construction and characterisation of a high molecular weight bacterial artificial chromosome (BAC) library. Methods A BAC library, consisting of 18,432 clones was constructed from Fragaria vesca f. semperflorens accession 'Ali Baba'. BAC DNA from individual library clones was pooled to create a PCR-based screening assay for the library, whereby individual clones could be identified with just 34 PCR reactions. These pools were used to screen the BAC library and anchor individual clones to the diploid Fragaria reference map (FV×FN). Findings Clones from the BAC library developed contained an average insert size of 85 kb, representing over seven genome equivalents. The pools and superpools developed were used to identify a set of BAC clones containing 70 molecular markers previously mapped to the diploid Fragaria FV×FN reference map. The number of positive colonies identified for each marker suggests the library represents between 4× and 10× coverage of the diploid Fragaria genome, which is in accordance with the estimate of library coverage based on average insert size. Conclusion This BAC library will be used for the construction of a physical map for F. vesca and the superpools will permit physical anchoring of molecular markers using PCR. PMID:19772672
The Essential Genome of Escherichia coli K-12
2018-01-01
ABSTRACT Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. PMID:29463657
Carú, M; Cifuentes, V; Pincheira, G; Jiménez, A
1989-10-01
A plasmid (named pCN2) carrying a 7.6 kb BamHI DNA insert was isolated from a Neurospora crassa genomic library raised in the yeast vector YRp7. Saccharomyces cerevisiae suco and N. crassa inv strains transformed with pNC2 were able to grow on sucrose-based media and expressed invertase activity. Saccharomyces cerevisiae suco (pNC2) expressed a product which immunoreacted with antibody raised against purified invertase from wild type N. crassa, although S. cerevisiae suc+ did not. The cloned DNA hybridized with a 7.6 kb DNA fragment from BamHI-restricted wild type N. crassa DNA. Plasmid pNC2 transformed N. crassa Inv- to Inv+ by integration either near to the endogenous inv locus (40% events) or at other genomic sites (60% events). It appears therefore that the cloned DNA piece encodes the N. crassa invertase enzyme. A 3.8 kb XhoI DNA fragment, derived from pNC2, inserted in YRp7, in both orientation, was able to express invertase activity in yeast, suggesting that it contains an intact invertase gene which is not expressed from a vector promoter.
de Bellocq, J Goüy; Leirs, H
2009-09-01
Sequences of the complete open reading frame (ORF) for rodents major histocompatibility complex (MHC) class II genes are rare. Multimammate rat (Mastomys natalensis) complementary DNA (cDNA) encoding the alpha and beta chains of MHC class II DQ gene was cloned from a rapid amplifications of cDNA Emds (RACE) cDNA library. The ORFs consist of 801 and 771 bp encoding 266 and 256 amino acid residues for DQB and DQA, respectively. The genomic structure of Mana-DQ genes is globally analogous to that described for other rodents except for the insertion of a serine residue in the signal peptide of Mana-DQB, which is unique among known rodents.
Feng, Jiuhuan; Liu, Zhao; Cai, Xiwen; Jan, Chao-Chien
2013-01-01
Conventional karyotypes and various genetic linkage maps have been established in sunflower (Helianthus annuus L., 2n = 34). However, the relationship between linkage groups and individual chromosomes of sunflower remains unknown and has considerable relevance for the sunflower research community. Recently, a set of linkage group-specific bacterial /binary bacterial artificial chromosome (BAC/BIBAC) clones was identified from two complementary BAC and BIBAC libraries constructed for cultivated sunflower cv. HA89. In the present study, we used these linkage group-specific clones (∼100 kb in size) as probes to in situ hybridize to HA89 mitotic chromosomes at metaphase using the BAC- fluorescence in situ hybridization (FISH) technique. Because a characteristic of the sunflower genome is the abundance of repetitive DNA sequences, a high ratio of blocking DNA to probe DNA was applied to hybridization reactions to minimize the background noise. As a result, all sunflower chromosomes were anchored by one or two BAC/BIBAC clones with specific FISH signals. FISH analysis based on tandem repetitive sequences, such as rRNA genes, has been previously reported; however, the BAC-FISH technique developed here using restriction fragment length polymorphism (RFLP)−derived BAC/BIBAC clones as probes to apply genome-wide analysis is new for sunflower. As chromosome-specific cytogenetic markers, the selected BAC/BIBAC clones that encompass the 17 linkage groups provide a valuable tool for identifying sunflower cytogenetic stocks (such as trisomics) and tracking alien chromosomes in interspecific crosses. This work also demonstrates the potential of using a large-insert DNA library for the development of molecular cytogenetic resources. PMID:23316437
DNA modification and functional delivery into human cells using Escherichia coli DH10B
Narayanan, Kumaran; Warburton, Peter E.
2003-01-01
The availability of almost the complete human genome as cloned BAC libraries represents a valuable resource for functional genomic analysis, which, however, has been somewhat limited by the ability to modify and transfer this DNA into mammalian cells intact. Here we report a novel comprehensive Escherichia coli-based vector system for the modification, propagation and delivery of large human genomic BAC clones into mammalian cells. The GET recombination inducible homologous recombination system was used in the BAC host strain E.coli DH10B to precisely insert an EGFPneo cassette into the vector portion of a ∼200 kb human BAC clone, providing a relatively simple method to directly convert available BAC clones into suitable vectors for mammalian cells. GET recombination was also used for the targeted deletion of the asd gene from the E.coli chromosome, resulting in defective cell wall synthesis and diaminopimelic acid auxotrophy. Transfer of the Yersinia pseudotuberculosis invasin gene into E.coli DH10B asd– rendered it competent to invade HeLa cells and deliver DNA, as judged by transient expression of green fluorescent protein and stable neomycin-resistant colonies. The efficiency of DNA transfer and survival of HeLa cells has been optimized for incubation time and multiplicity of infection of invasive E.coli with HeLa cells. This combination of E.coli-based homologous recombination and invasion technologies using BAC host strain E.coli DH10B will greatly improve the utility of the available BAC libraries from the human and other genomes for gene expression and functional genomic studies. PMID:12711696
Molecular cloning of cDNAs for the nerve-cell specific phosphoprotein, synapsin I.
Kilimann, M W; DeGennaro, L J
1985-01-01
To provide access to synapsin I-specific DNA sequences, we have constructed cDNA clones complementary to synapsin I mRNA isolated from rat brain. Synapsin I mRNA was specifically enriched by immunoadsorption of polysomes prepared from the brains of 10-14 day old rats. Employing this enriched mRNA, a cDNA library was constructed in pBR322 and screened by differential colony hybridization with single-stranded cDNA probes made from synapsin I mRNA and total polysomal poly(A)+ RNA. This screening procedure proved to be highly selective. Five independent recombinant plasmids which exhibited distinctly stronger hybridization with the synapsin I probe were characterized further by restriction mapping. All of the cDNA inserts gave restriction enzyme digestion patterns which could be aligned. In addition, some of the cDNA inserts were shown to contain poly(dA) sequences. Final identification of synapsin I cDNA clones relied on the ability of the cDNA inserts to hybridize specifically to synapsin I mRNA. Several plasmids were tested by positive hybridization selection. They specifically selected synapsin I mRNA which was identified by in vitro translation and immunoprecipitation of the translation products. The established cDNA clones were used for a blot-hybridization analysis of synapsin I mRNA. A fragment (1600 bases) from the longest cDNA clone hybridized with two discrete RNA species 5800 and 4500 bases long, in polyadenylated RNA from rat brain and PC12 cells. No hybridization was detected to RNA from rat liver, skeletal muscle or cardiac muscle. Images Fig. 1. Fig. 2. Fig. 4. Fig. 5. PMID:3933975
USDA-ARS?s Scientific Manuscript database
Verticillium dahliae is the primary causal agent for Verticillium wilt disease on a diverse array of economically important crops, including cotton. In previous research, we screened a T-DNA insertional mutant library of the highly virulent isolate Vd080 derived from cotton. In this study, the targ...
Dowen, Jill M.; Putnam, Christopher D.; Kolodner, Richard D.
2010-01-01
The Msh2-Msh3 heterodimer recognizes various DNA mispairs, including loops of DNA ranging from 1 to 14 nucleotides and some base-base mispairs. Homology modeling of the mispair-binding domain (MBD) of Msh3 using the related Msh6 MBD revealed that mismatch recognition must be different, even though the MBD folds must be similar. Model-based point mutation alleles of Saccharomyces cerevisiae msh3 designed to disrupt mispair recognition fell into two classes. One class caused defects in repair of both small and large insertion/deletion mispairs, whereas the second class caused defects only in the repair of small insertion/deletion mispairs; mutations of the first class also caused defects in the removal of nonhomologous tails present at the ends of double-strand breaks (DSBs) during DSB repair, whereas mutations of the second class did not cause defects in the removal of nonhomologous tails during DSB repair. Thus, recognition of small insertion/deletion mispairs by Msh3 appears to require a greater degree of interactions with the DNA conformations induced by small insertion/deletion mispairs than with those induced by large insertion/deletions that are intrinsically bent and strand separated. Mapping of the two classes of mutations onto the Msh3 MBD model appears to distinguish mispair recognition regions from DNA stabilization regions. PMID:20421420
Dowen, Jill M; Putnam, Christopher D; Kolodner, Richard D
2010-07-01
The Msh2-Msh3 heterodimer recognizes various DNA mispairs, including loops of DNA ranging from 1 to 14 nucleotides and some base-base mispairs. Homology modeling of the mispair-binding domain (MBD) of Msh3 using the related Msh6 MBD revealed that mismatch recognition must be different, even though the MBD folds must be similar. Model-based point mutation alleles of Saccharomyces cerevisiae msh3 designed to disrupt mispair recognition fell into two classes. One class caused defects in repair of both small and large insertion/deletion mispairs, whereas the second class caused defects only in the repair of small insertion/deletion mispairs; mutations of the first class also caused defects in the removal of nonhomologous tails present at the ends of double-strand breaks (DSBs) during DSB repair, whereas mutations of the second class did not cause defects in the removal of nonhomologous tails during DSB repair. Thus, recognition of small insertion/deletion mispairs by Msh3 appears to require a greater degree of interactions with the DNA conformations induced by small insertion/deletion mispairs than with those induced by large insertion/deletions that are intrinsically bent and strand separated. Mapping of the two classes of mutations onto the Msh3 MBD model appears to distinguish mispair recognition regions from DNA stabilization regions.
Primary structure and mapping of the hupA gene of Salmonella typhimurium.
Higgins, N P; Hillyard, D
1988-01-01
In bacteria, the complex nucleoid structure is folded and maintained by negative superhelical tension and a set of type II DNA-binding proteins, also called histonelike proteins. The most abundant type II DNA-binding protein is HU. Southern blot analysis showed that Salmonella typhimurium contained two HU genes that corresponded to Escherichia coli genes hupA (encoding HU-2 protein) and hupB (encoding HU-1). Salmonella hupA was cloned, and the nucleotide sequence of the gene was determined. Comparison of hupA of E. coli and S. typhimurium revealed that the HU-2 proteins were identical and that there was high conservation of nucleotide sequences outside the coding frames of the genes. A 300-member genomic library of S. typhimurium was constructed by using random transposition of MudP, a specialized chimeric P22-Mu phage that packages chromosomal DNA unidirectionally from its insertion point. Oligonucleotide hybridization against the library identified one MudP insertion that lies within 28 kilobases of hupA; the MudP was 12% linked to purH at 90.5 min on the standard map. Plasmids expressing HU-2 had a surprising phenotype; they caused growth arrest when they were introduced into E. coli strains bearing a himA or hip mutation. These results suggest that IHF and HU have interactive roles in bacteria. Images PMID:3056912
1989-03-10
fragment of the HIV-1 genome was isolated from XBH10 and inserted into an M13 phage vector. Mutations were introduced by use of 25-mer oligonucleotides which... M13 by Eco RI and religated into the corresponding position of pHXB2gpt. The mutant AS was prepared directly from pHXB2gpt by digestion with Nde I and...detect immunocomplexes. Molecular Cloning and Sequencing of Proviral DNA A X phage library was constructed from the genomic DNA isolated from Hut78 cells
Craig, R K; Hall, L; Parker, D; Campbell, P N
1981-01-01
A complementary DNA (cDNA) plasmid library has been constructed in the plasmid pAT153, using poly(A)-containing RNA isolated from the lactating guinea-pig mammary gland as the starting material. Double stranded cDNA was inserted into the EcoRI site of the plasmid using poly(dA . dT) tails, then transformed into Escherichia coli HB101. From the resulting colonies we have selected and partially characterized plasmids containing cDNA copies of the mRNAs for casein A, casein B, casein C and alpha-lactalbumin. However, the proportion containing casein C cDNA was exceptionally low, and these contained at best 60% of the mRNA sequence. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. PMID:7306038
Pietras, D F; Bennett, K L; Siracusa, L D; Woodworth-Gutai, M; Chapman, V M; Gross, K W; Kane-Haas, C; Hastie, N D
1983-01-01
We report the construction of a small library of recombinant plasmids containing Mus musculus repetitive DNA inserts. The repetitive cloned fraction was derived from denatured genomic DNA by reassociation to a Cot value at which repetitive, but not unique, sequences have reannealed followed by exhaustive S1 nuclease treatment to degrade single stranded DNA. Initial characterizations of this library by colony filter hybridizations have led to the identification of a previously undetected M. musculus minor satellite as well as to clones containing M. musculus major satellite sequences. This new satellite is repeated 10-20 times less than the major satellite in the M. musculus genome. It has a repeat length of 130 nucleotides compared with the M. musculus major satellite with a repeat length of 234 nucleotides. Sequence analysis of the minor satellite has shown that it has a 29 base pair region with extensive homology to one of the major satellite repeating subunits. We also show by in situ hybridization that this minor satellite sequence is located at the centromeres and possibly the arms of at least half the M musculus chromosomes. Sequences related to the minor satellite have been found in the DNA of a related Mus species, Mus spretus, and may represent the major satellite of that species. Images PMID:6314268
Partier, A; Gay, G; Tassy, C; Beckert, M; Feuillet, C; Barret, P
2017-10-01
A large, 53-kbp, intact DNA fragment was inserted into the wheat ( Triticum aestivum L.) genome. FISH analyses of individual transgenic events revealed multiple insertions of intact fragments. Transferring large intact DNA fragments containing clusters of resistance genes or complete metabolic pathways into the wheat genome remains a challenge. In a previous work, we showed that the use of dephosphorylated cassettes for wheat transformation enabled the production of simple integration patterns. Here, we used the same technology to produce a cassette containing a 44-kb Arabidopsis thaliana BAC, flanked by one selection gene and one reporter gene. This 53-kb linear cassette was integrated in the bread wheat (Triticum aestivum L.) genome by biolistic transformation. Our results showed that transgenic plants harboring the entire cassette were generated. The inheritability of the cassette was demonstrated in the T1 and T2 generation. Surprisingly, FISH analysis performed on T1 progeny of independent events identified double genomic insertions of intact fragments in non-homoeologous positions. Inheritability of these double insertions was demonstrated by FISH analysis of the T1 generation. Relative conclusions that can be drawn from molecular or FISH analysis are discussed along with future prospects of the engineering of large fragments for wheat transformation or genome editing.
Michalovova, M; Vyskot, B; Kejnovsky, E
2013-10-01
We analysed the size, relative age and chromosomal localization of nuclear sequences of plastid and mitochondrial origin (NUPTs-nuclear plastid DNA and NUMTs-nuclear mitochondrial DNA) in six completely sequenced plant species. We found that the largest insertions showed lower divergence from organelle DNA than shorter insertions in all species, indicating their recent origin. The largest NUPT and NUMT insertions were localized in the vicinity of the centromeres in the small genomes of Arabidopsis and rice. They were also present in other chromosomal regions in the large genomes of soybean and maize. Localization of NUPTs and NUMTs correlated positively with distribution of transposable elements (TEs) in Arabidopsis and sorghum, negatively in grapevine and soybean, and did not correlate in rice or maize. We propose a model where new plastid and mitochondrial DNA sequences are inserted close to centromeres and are later fragmented by TE insertions and reshuffled away from the centromere or removed by ectopic recombination. The mode and tempo of TE dynamism determines the turnover of NUPTs and NUMTs resulting in their species-specific chromosomal distributions.
Kuzmina, Maria L; Braukmann, Thomas W A; Fazekas, Aron J; Graham, Sean W; Dewaard, Stephanie L; Rodrigues, Anuar; Bennett, Bruce A; Dickinson, Timothy A; Saarela, Jeffery M; Catling, Paul M; Newmaster, Steven G; Percy, Diana M; Fenneman, Erin; Lauron-Moreau, Aurélien; Ford, Bruce; Gillespie, Lynn; Subramanyam, Ragupathy; Whitton, Jeannette; Jennings, Linda; Metsger, Deborah; Warne, Connor P; Brown, Allison; Sears, Elizabeth; Dewaard, Jeremy R; Zakharov, Evgeny V; Hebert, Paul D N
2017-12-01
Constructing complete, accurate plant DNA barcode reference libraries can be logistically challenging for large-scale floras. Here we demonstrate the promise and challenges of using herbarium collections for building a DNA barcode reference library for the vascular plant flora of Canada. Our study examined 20,816 specimens representing 5076 of 5190 vascular plant species in Canada (98%). For 98% of the specimens, at least one of the DNA barcode regions was recovered from the plastid loci rbcL and matK and from the nuclear ITS2 region. We used beta regression to quantify the effects of age, type of preservation, and taxonomic affiliation (family) on DNA sequence recovery. Specimen age and method of preservation had significant effects on sequence recovery for all markers, but influenced some families more (e.g., Boraginaceae) than others (e.g., Asteraceae). Our DNA barcode library represents an unparalleled resource for metagenomic and ecological genetic research working on temperate and arctic biomes. An observed decline in sequence recovery with specimen age may be associated with poor primer matches, intragenomic variation (for ITS2), or inhibitory secondary compounds in some taxa.
Kuzmina, Maria L.; Braukmann, Thomas W. A.; Fazekas, Aron J.; Graham, Sean W.; Dewaard, Stephanie L.; Rodrigues, Anuar; Bennett, Bruce A.; Dickinson, Timothy A.; Saarela, Jeffery M.; Catling, Paul M.; Newmaster, Steven G.; Percy, Diana M.; Fenneman, Erin; Lauron-Moreau, Aurélien; Ford, Bruce; Gillespie, Lynn; Subramanyam, Ragupathy; Whitton, Jeannette; Jennings, Linda; Metsger, Deborah; Warne, Connor P.; Brown, Allison; Sears, Elizabeth; Dewaard, Jeremy R.; Zakharov, Evgeny V.; Hebert, Paul D. N.
2017-01-01
Premise of the study: Constructing complete, accurate plant DNA barcode reference libraries can be logistically challenging for large-scale floras. Here we demonstrate the promise and challenges of using herbarium collections for building a DNA barcode reference library for the vascular plant flora of Canada. Methods: Our study examined 20,816 specimens representing 5076 of 5190 vascular plant species in Canada (98%). For 98% of the specimens, at least one of the DNA barcode regions was recovered from the plastid loci rbcL and matK and from the nuclear ITS2 region. We used beta regression to quantify the effects of age, type of preservation, and taxonomic affiliation (family) on DNA sequence recovery. Results: Specimen age and method of preservation had significant effects on sequence recovery for all markers, but influenced some families more (e.g., Boraginaceae) than others (e.g., Asteraceae). Discussion: Our DNA barcode library represents an unparalleled resource for metagenomic and ecological genetic research working on temperate and arctic biomes. An observed decline in sequence recovery with specimen age may be associated with poor primer matches, intragenomic variation (for ITS2), or inhibitory secondary compounds in some taxa. PMID:29299394
Boeneman, Kelly; Fossum, Solveig; Yang, Yanhua; Fingland, Nicholas; Skarstad, Kirsten; Crooke, Elliott
2009-05-01
DnaA initiates chromosomal replication in Escherichia coli at a well-regulated time in the cell cycle. To determine how the spatial distribution of DnaA is related to the location of chromosomal replication and other cell cycle events, the localization of DnaA in living cells was visualized by confocal fluorescence microscopy. The gfp gene was randomly inserted into a dnaA-bearing plasmid via in vitro transposition to create a library that included internally GFP-tagged DnaA proteins. The library was screened for the ability to rescue dnaA(ts) mutants, and a candidate gfp-dnaA was used to replace the dnaA gene of wild-type cells. The resulting cells produce close to physiological levels of GFP-DnaA from the endogenous promoter as their only source of DnaA and somewhat under-initiate replication with moderate asynchrony. Visualization of GFP-tagged DnaA in living cells revealed that DnaA adopts a helical pattern that spirals along the long axis of the cell, a pattern also seen in wild-type cells by immunofluorescence with affinity purified anti-DnaA antibody. Although the DnaA helices closely resemble the helices of the actin analogue MreB, co-visualization of GFP-tagged DnaA and RFP-tagged MreB demonstrates that DnaA and MreB adopt discrete helical structures along the length of the longitudinal cell axis.
Lim, Kwang-il; Klimczak, Ryan; Yu, Julie H.; Schaffer, David V.
2010-01-01
Retroviral vectors offer benefits of efficient delivery and stable gene expression; however, their clinical use raises the concerns of insertional mutagenesis and potential oncogenesis due to genomic integration preferences in transcriptional start sites (TSS). We have shifted the integration preferences of retroviral vectors by generating a library of viral variants with a DNA-binding domain inserted at random positions throughout murine leukemia virus Gag-Pol, then selecting for variants that are viable and exhibit altered integration properties. We found seven permissive zinc finger domain (ZFD) insertion sites throughout Gag-Pol, including within p12, reverse transcriptase, and integrase. Comprehensive genome integration analysis showed that several ZFD insertions yielded retroviral vector variants with shifted integration patterns that did not favor TSS. Furthermore, integration site analysis revealed selective integration for numerous mutants. For example, two retroviral variants with a given ZFD at appropriate positions in Gag-Pol strikingly integrated primarily into four common sites out of 3.1 × 109 possible human genome locations (P = 4.6 × 10-29). Our findings demonstrate that insertion of DNA-binding motifs into multiple locations in Gag-Pol can make considerable progress toward engineering safer retroviral vectors that integrate into a significantly narrowed pool of sites on human genome and overcome the preference for TSS. PMID:20616052
Kim, Eun Jin; Angell, Scott; Janes, Jeff; Watanabe, Coran M H
2008-06-01
Traditional approaches to natural product discovery involve cell-based screening of natural product extracts followed by compound isolation and characterization. Their importance notwithstanding, continued mining leads to depletion of natural resources and the reisolation of previously identified metabolites. Metagenomic strategies aimed at localizing the biosynthetic cluster genes and expressing them in surrogate hosts offers one possible alternative. A fundamental question that naturally arises when pursuing such a strategy is, how large must the genomic library be to effectively represent the genome of an organism(s) and the biosynthetic gene clusters they harbor? Such an issue is certainly augmented in the absence of expensive robotics to expedite colony picking and/or screening of clones. We have developed an algorism, named BPC (biosynthetic pathway coverage), supported by molecular simulations to deduce the number of BAC clones required to achieve proper coverage of the genome and their respective biosynthetic pathways. The strategy has been applied to the construction of a large-insert BAC library from a marine microorganism, Hon6 (isolated from Honokohau, Maui) thought to represent a new species. The genomic library is constructed with a BAC yeast shuttle vector pClasper lacZ paving the way for the culturing of libraries in both prokaryotic and eukaryotic hosts. Flow cytometric methods are utilized to estimate the genome size of the organism and BPC implemented to assess P-coverage or percent coverage. A genetic selection strategy is illustrated, applications of which could expedite screening efforts in the identification and localization of biosynthetic pathways from marine microbial consortia, offering a powerful complement to genome sequencing and degenerate probe strategies. Implementing this approach, we report on the biotin biosynthetic pathway from the marine microorganism Hon6.
Kunig, Verena; Potowski, Marco; Gohla, Anne; Brunschweiger, Andreas
2018-06-27
DNA-encoded compound libraries are a highly attractive technology for the discovery of small molecule protein ligands. These compound collections consist of small molecules covalently connected to individual DNA sequences carrying readable information about the compound structure. DNA-tagging allows for efficient synthesis, handling and interrogation of vast numbers of chemically synthesized, drug-like compounds. They are screened on proteins by an efficient, generic assay based on Darwinian principles of selection. To date, selection of DNA-encoded libraries allowed for the identification of numerous bioactive compounds. Some of these compounds uncovered hitherto unknown allosteric binding sites on target proteins; several compounds proved their value as chemical biology probes unraveling complex biology; and the first examples of clinical candidates that trace their ancestry to a DNA-encoded library were reported. Thus, DNA-encoded libraries proved their value for the biomedical sciences as a generic technology for the identification of bioactive drug-like molecules numerous times. However, large scale experiments showed that even the selection of billions of compounds failed to deliver bioactive compounds for the majority of proteins in an unbiased panel of target proteins. This raises the question of compound library design.
Tissue Gene Expression Analysis Using Arrayed Normalized cDNA Libraries
Eickhoff, Holger; Schuchhardt, Johannes; Ivanov, Igor; Meier-Ewert, Sebastian; O'Brien, John; Malik, Arif; Tandon, Neeraj; Wolski, Eryk-Witold; Rohlfs, Elke; Nyarsik, Lajos; Reinhardt, Richard; Nietfeld, Wilfried; Lehrach, Hans
2000-01-01
We have used oligonucleotide-fingerprinting data on 60,000 cDNA clones from two different mouse embryonic stages to establish a normalized cDNA clone set. The normalized set of 5,376 clones represents different clusters and therefore, in almost all cases, different genes. The inserts of the cDNA clones were amplified by PCR and spotted on glass slides. The resulting arrays were hybridized with mRNA probes prepared from six different adult mouse tissues. Expression profiles were analyzed by hierarchical clustering techniques. We have chosen radioactive detection because it combines robustness with sensitivity and allows the comparison of multiple normalized experiments. Sensitive detection combined with highly effective clustering algorithms allowed the identification of tissue-specific expression profiles and the detection of genes specifically expressed in the tissues investigated. The obtained results are publicly available (http://www.rzpd.de) and can be used by other researchers as a digital expression reference. [The sequence data described in this paper have been submitted to the EMBL data library under accession nos. AL360374–AL36537.] PMID:10958641
Bacterial Artificial Chromosome Libraries for Mouse Sequencing and Functional Analysis
Osoegawa, Kazutoyo; Tateno, Minako; Woon, Peng Yeong; Frengen, Eirik; Mammoser, Aaron G.; Catanese, Joseph J.; Hayashizaki, Yoshihide; de Jong, Pieter J.
2000-01-01
Bacterial artificial chromosome (BAC) and P1-derived artificial chromosome (PAC) libraries providing a combined 33-fold representation of the murine genome have been constructed using two different restriction enzymes for genomic digestion. A large-insert PAC library was prepared from the 129S6/SvEvTac strain in a bacterial/mammalian shuttle vector to facilitate functional gene studies. For genome mapping and sequencing, we prepared BAC libraries from the 129S6/SvEvTac and the C57BL/6J strains. The average insert sizes for the three libraries range between 130 kb and 200 kb. Based on the numbers of clones and the observed average insert sizes, we estimate each library to have slightly in excess of 10-fold genome representation. The average number of clones found after hybridization screening with 28 probes was in the range of 9–14 clones per marker. To explore the fidelity of the genomic representation in the three libraries, we analyzed three contigs, each established after screening with a single unique marker. New markers were established from the end sequences and screened against all the contig members to determine if any of the BACs and PACs are chimeric or rearranged. Only one chimeric clone and six potential deletions have been observed after extensive analysis of 113 PAC and BAC clones. Seventy-one of the 113 clones were conclusively nonchimeric because both end markers or sequences were mapped to the other confirmed contig members. We could not exclude chimerism for the remaining 41 clones because one or both of the insert termini did not contain unique sequence to design markers. The low rate of chimerism, ∼1%, and the low level of detected rearrangements support the anticipated usefulness of the BAC libraries for genome research. [The sequence data described in this paper have been submitted to the GenBank data library under accession numbers AQ797173–AQ797398.] PMID:10645956
Shimoda, Yoshikazu; Mitsui, Hisayuki; Kamimatsuse, Hiroko; Minamisawa, Kiwamu; Nishiyama, Eri; Ohtsubo, Yoshiyuki; Nagata, Yuji; Tsuda, Masataka; Shinpo, Sayaka; Watanabe, Akiko; Kohara, Mitsuyo; Yamada, Manabu; Nakamura, Yasukazu; Tabata, Satoshi; Sato, Shusei
2008-01-01
Rhizobia are nitrogen-fixing soil bacteria that establish endosymbiosis with some leguminous plants. The completion of several rhizobial genome sequences provides opportunities for genome-wide functional studies of the physiological roles of many rhizobial genes. In order to carry out genome-wide phenotypic screenings, we have constructed a large mutant library of the nitrogen-fixing symbiotic bacterium, Mesorhizobium loti, by transposon mutagenesis. Transposon insertion mutants were generated using the signature-tagged mutagenesis (STM) technique and a total of 29 330 independent mutants were obtained. Along with the collection of transposon mutants, we have determined the transposon insertion sites for 7892 clones, and confirmed insertions in 3680 non-redundant M. loti genes (50.5% of the total number of M. loti genes). Transposon insertions were randomly distributed throughout the M. loti genome without any bias toward G+C contents of insertion target sites and transposon plasmids used for the mutagenesis. We also show the utility of STM mutants by examining the specificity of signature tags and test screenings for growth- and nodulation-deficient mutants. This defined mutant library allows for genome-wide forward- and reverse-genetic functional studies of M. loti and will serve as an invaluable resource for researchers to further our understanding of rhizobial biology. PMID:18658183
Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun
2013-01-01
In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105
Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun
2013-05-24
In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.
Polymenakou, Paraskevi N; Bertilsson, Stefan; Tselepides, Anastasios; Stephanou, Euripides G
2005-10-01
The regional variability of sediment bacterial community composition and diversity was studied by comparative analysis of four large 16S ribosomal DNA (rDNA) clone libraries from sediments in different regions of the Eastern Mediterranean Sea (Thermaikos Gulf, Cretan Sea, and South lonian Sea). Amplified rDNA restriction analysis of 664 clones from the libraries indicate that the rDNA richness and evenness was high: for example, a near-1:1 relationship among screened clones and number of unique restriction patterns when up to 190 clones were screened for each library. Phylogenetic analysis of 207 bacterial 16S rDNA sequences from the sediment libraries demonstrated that Gamma-, Delta-, and Alphaproteobacteria, Holophaga/Acidobacteria, Planctomycetales, Actinobacteria, Bacteroidetes, and Verrucomicrobia were represented in all four libraries. A few clones also grouped with the Betaproteobacteria, Nitrospirae, Spirochaetales, Chlamydiae, Firmicutes, and candidate division OPl 1. The abundance of sequences affiliated with Gammaproteobacteria was higher in libraries from shallow sediments in the Thermaikos Gulf (30 m) and the Cretan Sea (100 m) compared to the deeper South Ionian station (2790 m). Most sequences in the four sediment libraries clustered with uncultured 16S rDNA phylotypes from marine habitats, and many of the closest matches were clones from hydrocarbon seeps, benzene-mineralizing consortia, sulfate reducers, sulk oxidizers, and ammonia oxidizers. LIBSHUFF statistics of 16S rDNA gene sequences from the four libraries revealed major differences, indicating either a very high richness in the sediment bacterial communities or considerable variability in bacterial community composition among regions, or both.
A multi-landing pad DNA integration platform for mammalian cell engineering
Gaidukov, Leonid; Wroblewska, Liliana; Teague, Brian; Nelson, Tom; Zhang, Xin; Liu, Yan; Jagtap, Kalpana; Mamo, Selamawit; Tseng, Wen Allen; Lowe, Alexis; Das, Jishnu; Bandara, Kalpanie; Baijuraj, Swetha; Summers, Nevin M; Zhang, Lin; Weiss, Ron
2018-01-01
Abstract Engineering mammalian cell lines that stably express many transgenes requires the precise insertion of large amounts of heterologous DNA into well-characterized genomic loci, but current methods are limited. To facilitate reliable large-scale engineering of CHO cells, we identified 21 novel genomic sites that supported stable long-term expression of transgenes, and then constructed cell lines containing one, two or three ‘landing pad’ recombination sites at selected loci. By using a highly efficient BxB1 recombinase along with different selection markers at each site, we directed recombinase-mediated insertion of heterologous DNA to selected sites, including targeting all three with a single transfection. We used this method to controllably integrate up to nine copies of a monoclonal antibody, representing about 100 kb of heterologous DNA in 21 transcriptional units. Because the integration was targeted to pre-validated loci, recombinant protein expression remained stable for weeks and additional copies of the antibody cassette in the integrated payload resulted in a linear increase in antibody expression. Overall, this multi-copy site-specific integration platform allows for controllable and reproducible insertion of large amounts of DNA into stable genomic sites, which has broad applications for mammalian synthetic biology, recombinant protein production and biomanufacturing. PMID:29617873
van der Klift, Heleen M; Tops, Carli M; Hes, Frederik J; Devilee, Peter; Wijnen, Juul T
2012-07-01
Heterozygous germline mutations in the mismatch repair gene PMS2 predispose carriers for Lynch syndrome, an autosomal dominant predisposition to cancer. Here, we present a LINE-1-mediated retrotranspositional insertion in PMS2 as a novel mutation type for Lynch syndrome. This insertion, detected with Southern blot analysis in the genomic DNA of the patient, is characterized as a 2.2 kb long 5' truncated SVA_F element. The insertion is not detectable by current diagnostic testing limited to MLPA and direct Sanger sequencing on genomic DNA. The molecular nature of this insertion could only be resolved in RNA from cultured lymphocytes in which nonsense-mediated RNA decay was inhibited. Our report illustrates the technical problems encountered in the detection of this mutation type. Especially large heterozygous insertions will remain unnoticed because of preferential amplification of the smaller wild-type allele in genomic DNA, and are probably underreported in the mutation spectra of autosomal dominant disorders. © 2012 Wiley Periodicals, Inc.
Preferential cleavage sites for Sau3A restriction endonuclease in human ribosomal DNA.
Kupriyanova, N S; Kirilenko, P M; Netchvolodov, K K; Ryskov, A P
2000-07-21
Previous studies of cloned ribosomal DNA (rDNA) variants isolated from the cosmid library of human chromosome 13 have revealed some disproportion in representativity of different rDNA regions (N. S. Kupriyanova, K. K. Netchvolodov, P. M. Kirilenko, B. I. Kapanadze, N. K. Yankovsky, and A. P. Ryskov, Mol. Biol. 30, 51-60, 1996). Here we show nonrandom cleavage of human rDNA with Sau3A or its isoshizomer MboI under mild hydrolysis conditions. The hypersensitive cleavage sites were found to be located in the ribosomal intergenic spacer (rIGS), especially in the regions of about 5-5.5 and 11 kb upstream of the rRNA transcription start point. This finding is based on sequencing mapping of the rDNA insert ends in randomly selected cosmid clones of human chromosome 13 and on the data of digestion kinetics of cloned and noncloned human genomic rDNA with Sau3A and MboI. The results show that a methylation status and superhelicity state of the rIGS have no effect on cleavage site sensitivity. It is interesting that all primary cleavage sites are adjacent to or entering into Alu or Psi cdc 27 retroposons of the rIGS suggesting a possible role of neighboring sequences in nuclease accessibility. The results explain nonequal representation of rDNA sequences in the human genomic DNA library used for this study. Copyright 2000 Academic Press.
Willett-Brozick, J E; Savul, S A; Richey, L E; Baysal, B E
2001-08-01
Constitutional chromosomal translocations are relatively common causes of human morbidity, yet the DNA double-strand break (DSB) repair mechanisms that generate them are incompletely understood. We cloned, sequenced and analyzed the breakpoint junctions of a familial constitutional reciprocal translocation t(9;11)(p24;q23). Within the 10-kb region flanking the breakpoints, chromosome 11 had 25% repeat elements, whereas chromosome 9 had 98% repeats, 95% of which were L1-type LINE elements. The breakpoints occurred within an L1-type repeat element at 9p24 and at the 3'-end of an Alu sequence at 11q23. At the breakpoint junction of derivative chromosome 9, we discovered an unusually large 41-bp insertion, which showed 100% identity to 12S mitochondrial DNA (mtDNA) between nucleotides 896 and 936 of the mtDNA sequence. Analysis of the human genome failed to show the preexistence of the inserted sequence at normal chromosomes 9 and 11 breakpoint junctions or elsewhere in the genome, strongly suggesting that the insertion was derived from human mtDNA and captured into the junction during the DSB repair process. To our knowledge, these findings represent the first observation of spontaneous germ line insertion of modern human mtDNA sequences and suggest that DSB repair may play a role in inter-organellar gene transfer in vivo. Our findings also provide evidence for a previously unrecognized insertional mechanism in human, by which non-mobile extra-chromosomal fragments can be inserted into the genome at DSB repair junctions.
An improved yeast transformation method for the generation of very large human antibody libraries.
Benatuil, Lorenzo; Perez, Jennifer M; Belk, Jonathan; Hsieh, Chung-Ming
2010-04-01
Antibody library selection by yeast display technology is an efficient and highly sensitive method to identify binders to target antigens. This powerful selection tool, however, is often hampered by the typically modest size of yeast libraries (approximately 10(7)) due to the limited yeast transformation efficiency, and the full potential of the yeast display technology for antibody discovery and engineering can only be realized if it can be coupled with a mean to generate very large yeast libraries. We describe here a yeast transformation method by electroporation that allows for the efficient generation of large antibody libraries up to 10(10) in size. Multiple components and conditions including CaCl(2), MgCl(2), sucrose, sorbitol, lithium acetate, dithiothreitol, electroporation voltage, DNA input and cell volume have been tested to identify the best combination. By applying this developed protocol, we have constructed a 1.4 x 10(10) human spleen antibody library essentially in 1 day with a transformation efficiency of 1-1.5 x 10(8) transformants/microg vector DNA. Taken together, we have developed a highly efficient yeast transformation method that enables the generation of very large and productive human antibody libraries for antibody discovery, and we are now routinely making 10(9) libraries in a day for antibody engineering purposes.
Satz, Alexander L; Hochstrasser, Remo; Petersen, Ann C
2017-04-10
To optimize future DNA-encoded library design, we have attempted to quantify the library size at which the signal becomes undetectable. To accomplish this we (i) have calculated that percent yields of individual library members following a screen range from 0.002 to 1%, (ii) extrapolated that ∼1 million copies per library member are required at the outset of a screen, and (iii) from this extrapolation predict that false negative rates will begin to outweigh the benefit of increased diversity at library sizes >10 8 . The above analysis is based upon a large internal data set comprising multiple screens, targets, and libraries; we also augmented our internal data with all currently available literature data. In theory, high false negative rates may be overcome by employing larger amounts of library; however, we argue that using more than currently reported amounts of library (≫10 nmoles) is impractical. The above conclusions may be generally applicable to other DNA encoded library platforms, particularly those platforms that do not allow for library amplification.
Design of 240,000 orthogonal 25mer DNA barcode probes.
Xu, Qikai; Schlabach, Michael R; Hannon, Gregory J; Elledge, Stephen J
2009-02-17
DNA barcodes linked to genetic features greatly facilitate screening these features in pooled formats using microarray hybridization, and new tools are needed to design large sets of barcodes to allow construction of large barcoded mammalian libraries such as shRNA libraries. Here we report a framework for designing large sets of orthogonal barcode probes. We demonstrate the utility of this framework by designing 240,000 barcode probes and testing their performance by hybridization. From the test hybridizations, we also discovered new probe design rules that significantly reduce cross-hybridization after their introduction into the framework of the algorithm. These rules should improve the performance of DNA microarray probe designs for many applications.
Design of 240,000 orthogonal 25mer DNA barcode probes
Xu, Qikai; Schlabach, Michael R.; Hannon, Gregory J.; Elledge, Stephen J.
2009-01-01
DNA barcodes linked to genetic features greatly facilitate screening these features in pooled formats using microarray hybridization, and new tools are needed to design large sets of barcodes to allow construction of large barcoded mammalian libraries such as shRNA libraries. Here we report a framework for designing large sets of orthogonal barcode probes. We demonstrate the utility of this framework by designing 240,000 barcode probes and testing their performance by hybridization. From the test hybridizations, we also discovered new probe design rules that significantly reduce cross-hybridization after their introduction into the framework of the algorithm. These rules should improve the performance of DNA microarray probe designs for many applications. PMID:19171886
Rapid and efficient cDNA library screening by self-ligation of inverse PCR products (SLIP).
Hoskins, Roger A; Stapleton, Mark; George, Reed A; Yu, Charles; Wan, Kenneth H; Carlson, Joseph W; Celniker, Susan E
2005-12-02
cDNA cloning is a central technology in molecular biology. cDNA sequences are used to determine mRNA transcript structures, including splice junctions, open reading frames (ORFs) and 5'- and 3'-untranslated regions (UTRs). cDNA clones are valuable reagents for functional studies of genes and proteins. Expressed Sequence Tag (EST) sequencing is the method of choice for recovering cDNAs representing many of the transcripts encoded in a eukaryotic genome. However, EST sequencing samples a cDNA library at random, and it recovers transcripts with low expression levels inefficiently. We describe a PCR-based method for directed screening of plasmid cDNA libraries. We demonstrate its utility in a screen of libraries used in our Drosophila EST projects for 153 transcription factor genes that were not represented by full-length cDNA clones in our Drosophila Gene Collection. We recovered high-quality, full-length cDNAs for 72 genes and variously compromised clones for an additional 32 genes. The method can be used at any scale, from the isolation of cDNA clones for a particular gene of interest, to the improvement of large gene collections in model organisms and the human. Finally, we discuss the relative merits of directed cDNA library screening and RT-PCR approaches.
Bakker, Theo C M; Giger, Thomas; Frommen, Joachim G; Largiadèr, Carlo R
2017-08-01
There is a need for rapid and reliable molecular sexing of three-spined sticklebacks, Gasterosteus aculeatus, the supermodel species for evolutionary biology. A DNA region at the 5' end of the sex-linked microsatellite Gac4202 was sequenced for the X chromosome of six females and the Y chromosome of five males from three populations. The Y chromosome contained two large insertions, which did not recombine with the phenotype of sex in a cross of 322 individuals. Genetic variation (SNPs and indels) within the insertions was smaller than on flanking DNA sequences. Three molecular PCR-based sex tests were developed, in which the first, the second or both insertions were covered. In five European populations (from DE, CH, NL, GB) of three-spined sticklebacks, tests with both insertions combined showed two clearly separated bands on agarose minigels in males and one band in females. The tests with the separate insertions gave similar results. Thus, the new molecular sexing method gave rapid and reliable results for sexing three-spined sticklebacks and is an improvement and/or alternative to existing methods.
2011-01-01
Background The advent of genomics-based technologies has revolutionized many fields of biological enquiry. However, chromosome walking or flanking sequence cloning is still a necessary and important procedure to determining gene structure. Such methods are used to identify T-DNA insertion sites and so are especially relevant for organisms where large T-DNA insertion libraries have been created, such as rice and Arabidopsis. The currently available methods for flanking sequence cloning, including the popular TAIL-PCR technique, are relatively laborious and slow. Results Here, we report a simple and effective fusion primer and nested integrated PCR method (FPNI-PCR) for the identification and cloning of unknown genomic regions flanked known sequences. In brief, a set of universal primers was designed that consisted of various 15-16 base arbitrary degenerate oligonucleotides. These arbitrary degenerate primers were fused to the 3' end of an adaptor oligonucleotide which provided a known sequence without degenerate nucleotides, thereby forming the fusion primers (FPs). These fusion primers are employed in the first step of an integrated nested PCR strategy which defines the overall FPNI-PCR protocol. In order to demonstrate the efficacy of this novel strategy, we have successfully used it to isolate multiple genomic sequences namely, 21 orthologs of genes in various species of Rosaceace, 4 MYB genes of Rosa rugosa, 3 promoters of transcription factors of Petunia hybrida, and 4 flanking sequences of T-DNA insertion sites in transgenic tobacco lines and 6 specific genes from sequenced genome of rice and Arabidopsis. Conclusions The successful amplification of target products through FPNI-PCR verified that this novel strategy is an effective, low cost and simple procedure. Furthermore, FPNI-PCR represents a more sensitive, rapid and accurate technique than the established TAIL-PCR and hiTAIL-PCR procedures. PMID:22093809
Vettore, André L.; da Silva, Felipe R.; Kemper, Edson L.; Souza, Glaucia M.; da Silva, Aline M.; Ferro, Maria Inês T.; Henrique-Silva, Flavio; Giglioti, Éder A.; Lemos, Manoel V.F.; Coutinho, Luiz L.; Nobrega, Marina P.; Carrer, Helaine; França, Suzelei C.; Bacci, Maurício; Goldman, Maria Helena S.; Gomes, Suely L.; Nunes, Luiz R.; Camargo, Luis E.A.; Siqueira, Walter J.; Van Sluys, Marie-Anne; Thiemann, Otavio H.; Kuramae, Eiko E.; Santelli, Roberto V.; Marino, Celso L.; Targon, Maria L.P.N.; Ferro, Jesus A.; Silveira, Henrique C.S.; Marini, Danyelle C.; Lemos, Eliana G.M.; Monteiro-Vitorello, Claudia B.; Tambor, José H.M.; Carraro, Dirce M.; Roberto, Patrícia G.; Martins, Vanderlei G.; Goldman, Gustavo H.; de Oliveira, Regina C.; Truffi, Daniela; Colombo, Carlos A.; Rossi, Magdalena; de Araujo, Paula G.; Sculaccio, Susana A.; Angella, Aline; Lima, Marleide M.A.; de Rosa, Vicente E.; Siviero, Fábio; Coscrato, Virginia E.; Machado, Marcos A.; Grivet, Laurent; Di Mauro, Sonia M.Z.; Nobrega, Francisco G.; Menck, Carlos F.M.; Braga, Marilia D.V.; Telles, Guilherme P.; Cara, Frank A.A.; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo
2003-01-01
To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged. PMID:14613979
Lemos, Brenda R; Kaplan, Adam C; Bae, Ji Eun; Ferrazzoli, Alexander E; Kuo, James; Anand, Ranjith P; Waterman, David P; Haber, James E
2018-02-27
Harnessing CRISPR-Cas9 technology provides an unprecedented ability to modify genomic loci via DNA double-strand break (DSB) induction and repair. We analyzed nonhomologous end-joining (NHEJ) repair induced by Cas9 in budding yeast and found that the orientation of binding of Cas9 and its guide RNA (gRNA) profoundly influences the pattern of insertion/deletions (indels) at the site of cleavage. A common indel created by Cas9 is a 1-bp (+1) insertion that appears to result from Cas9 creating a 1-nt 5' overhang that is filled in by a DNA polymerase and ligated. The origin of +1 insertions was investigated by using two gRNAs with PAM sequences located on opposite DNA strands but designed to cleave the same sequence. These templated +1 insertions are dependent on the X-family DNA polymerase, Pol4. Deleting Pol4 also eliminated +2 and +3 insertions, which are biased toward homonucleotide insertions. Using inverted PAM sequences, we also found significant differences in overall NHEJ efficiency and repair profiles, suggesting that the binding of the Cas9:gRNA complex influences subsequent NHEJ processing. As with events induced by the site-specific HO endonuclease, CRISPR-Cas9-mediated NHEJ repair depends on the Ku heterodimer and DNA ligase 4. Cas9 events are highly dependent on the Mre11-Rad50-Xrs2 complex, independent of Mre11's nuclease activity. Inspection of the outcomes of a large number of Cas9 cleavage events in mammalian cells reveals a similar templated origin of +1 insertions in human cells, but also a significant frequency of similarly templated +2 insertions.
The Nucleotide Excision Repair Pathway Limits L1 Retrotransposition
Servant, Geraldine; Streva, Vincent A.; Derbes, Rebecca S.; Wijetunge, Madushani I.; Neeland, Marc; White, Travis B.; Belancio, Victoria P.; Roy-Engel, Astrid M.; Deininger, Prescott L.
2017-01-01
Long interspersed elements 1 (L1) are active mobile elements that constitute almost 17% of the human genome. They amplify through a “copy-and-paste” mechanism termed retrotransposition, and de novo insertions related to these elements have been reported to cause 0.2% of genetic diseases. Our previous data demonstrated that the endonuclease complex ERCC1-XPF, which cleaves a 3′ DNA flap structure, limits L1 retrotransposition. Although the ERCC1-XPF endonuclease participates in several different DNA repair pathways, such as single-strand annealing, or in telomere maintenance, its recruitment to DNA lesions is best characterized in the nucleotide excision repair (NER) pathway. To determine if the NER pathway prevents the insertion of retroelements in the genome, we monitored the retrotransposition efficiencies of engineered L1 elements in NER-deficient cells and in their complemented versions. Core proteins of the NER pathway, XPD and XPA, and the lesion binding protein, XPC, are involved in limiting L1 retrotransposition. In addition, sequence analysis of recovered de novo L1 inserts and their genomic locations in NER-deficient cells demonstrated the presence of abnormally large duplications at the site of insertion, suggesting that NER proteins may also play a role in the normal L1 insertion process. Here, we propose new functions for the NER pathway in the maintenance of genome integrity: limitation of insertional mutations caused by retrotransposons and the prevention of potentially mutagenic large genomic duplications at the site of retrotransposon insertion events. PMID:28049704
Wei, Wei; Zhu, Wenjun; Cheng, Jiasen; Xie, Jiatao; Li, Bo; Jiang, Daohong; Li, Guoqing; Yi, Xianhong
2013-01-01
Coniothyrium minitans is a sclerotial parasite of the plant-pathogenic fungus Sclerotinia sclerotiorum, and conidial production and parasitism are two important aspects for commercialization of this biological control agent. To understand the mechanism of conidiation and parasitism at the molecular level, we constructed a transfer DNA (tDNA) insertional library with the wild-type strain ZS-1. A conidiation-deficient mutant, ZS-1TN22803, was uncovered through screening of this library. This mutant could produce pycnidia on potato dextrose agar (PDA), but most were immature and did not bear conidia. Moreover, this mutant lost the ability to parasitize or rot the sclerotia of S. sclerotiorum. Analysis of the tDNA flanking sequences revealed that a peroxisome biogenesis factor 6 (PEX6) homolog of Saccharomyces cerevisiae, named CmPEX6, was disrupted by the tDNA insertion in this mutant. Targeted gene replacement and gene complementation tests confirmed that a null mutation of CmPEX6 was responsible for the phenotype of ZS-1TN22803. Further analysis showed that both ZS-1TN22803 and the targeted replacement mutants could not grow on PDA medium containing oleic acid, and they produced much less nitric oxide (NO) and hydrogen peroxide (H2O2) than wild-type strain ZS-1. The conidiation of ZS-1TN22803 was partially restored by adding acetyl-CoA or glyoxylic acid to the growth media. Our results suggest that fatty acid β-oxidation, reactive oxygen and nitrogen species, and possibly other unknown pathways in peroxisomes are involved in conidiation and parasitism by C. minitans. PMID:23563946
DOE Office of Scientific and Technical Information (OSTI.GOV)
Griffith, A.J.; Burgess, D.L.; Kohrman, D.
1994-09-01
The Twirler mutation (Tw) causing cleft palate {plus_minus} cleft lip, vestibular defects and obesity is located within 0.5 cM of an ataxia locus (ax) on mouse chromosome 18. We identified a transgene-induced insertional mutation with vestibular and craniofacial defects that appears to be a new allele of Twirler. Mouse DNA flanking the transgene insertion site was isolated from a cosmid library. An evolutionarily conserved, zoo blot positive cosmid subclone was used to probe a human {lambda} genomic library. From the sequence of a highly homologous human {lambda} clone, we designed STS primers and screened a human P1 library. DNA frommore » two positive P1 clones was hybridized with simple sequence probes, and a (CTAT){sub 12} repeat was detected. Analysis of 62 CEPH parents with primers flanking the repeat identified six alleles containing 9 to 14 copies of the repeat, at frequencies of 0.17, 0.17, 0.17, 0.27, 0.15 and 0.07, respectively. The observed heterozygosity was 49/62 with a calculated PIC value of 0.76. This polymorphic microsatellite marker, designated Umi3, was mapped to the predicted conserved human linkage group by analysis of somatic cell hybrid panels. The anticipated short distance between Umi3 and the disease genes will facilitate detection of linkage in small families. We would like to type appropriate human pedigrees with Umi3 in order to identify patients with inherited disorders homologous to the mouse mutations Twirler and ataxia.« less
Goulin, Eduardo Henrique; Savi, Daiani Cristina; Petters, Desirrê Alexia Lourenço; Kava, Vanessa; Galli-Terasawa, Lygia; Silva, Geraldo José; Glienke, Chirlei
2016-11-01
Phyllosticta citricarpa is the epidemiological agent of Citrus Black Spot (CBS) disease, which is responsible for large economic losses worldwide. CBS is characterized by the presence of spores (pycnidiospores) in dark lesions of fruit, which are also responsible for short distance dispersal of the disease. The identification of genes involved in asexual reproduction of P. citricarpa can be an alternative for directional disease control. We analyzed a library of mutants obtained through Agrobacterium tumefaciens transformation system, looking for alterations in growth and reproductive structure formation. Two mutant strains were found to have lost the ability to form pycnidia. The flanking T-DNA insertion regions were identified on P. citricarpa genome by using blast analysis and further gene prediction. The predicted genes containing the T-DNA insertions were identified as Spindle Poison Sensitivity Scp3, Ion Transport protein, and Cullin Binding proteins. The Ion Transport and Cullin Binding proteins are known to be correlated with sexual and asexual reproduction in fungi; however, the exact mechanism by which these proteins act on spore formation in P. citricarpa needs to be better characterized. The Scp3 proteins are suggested here for the first time as being associated with asexual reproduction in fungus. This protein is associated with microtubule formation, and as microtubules play an essential role as spindle machinery for chromosome segregation and cytokinesis, insertions in this gene can lead to abnormal formations, such as that observed here in P. citricarpa. We suggest these genes as new targets for fungicide development and CBS disease control, by iRNA. Copyright © 2016 Elsevier GmbH. All rights reserved.
Trebitz, Anett S; Hoffman, Joel C; Grant, George W; Billehus, Tyler M; Pilgrim, Erik M
2015-07-22
DNA-based identification of mixed-organism samples offers the potential to greatly reduce the need for resource-intensive morphological identification, which would be of value both to bioassessment and non-native species monitoring. The ability to assign species identities to DNA sequences found depends on the availability of comprehensive DNA reference libraries. Here, we compile inventories for aquatic metazoans extant in or threatening to invade the Laurentian Great Lakes and examine the availability of reference mitochondrial COI DNA sequences (barcodes) in the Barcode of Life Data System for them. We found barcode libraries largely complete for extant and threatening-to-invade vertebrates (100% of reptile, 99% of fish, and 92% of amphibian species had barcodes). In contrast, barcode libraries remain poorly developed for precisely those organisms where morphological identification is most challenging; 46% of extant invertebrates lacked reference barcodes with rates especially high among rotifers, oligochaetes, and mites. Lack of species-level identification for many aquatic invertebrates also is a barrier to matching DNA sequences with physical specimens. Attaining the potential for DNA-based identification of mixed-organism samples covering the breadth of aquatic fauna requires a concerted effort to build supporting barcode libraries and voucher collections.
NASA Astrophysics Data System (ADS)
Trebitz, Anett S.; Hoffman, Joel C.; Grant, George W.; Billehus, Tyler M.; Pilgrim, Erik M.
2015-07-01
DNA-based identification of mixed-organism samples offers the potential to greatly reduce the need for resource-intensive morphological identification, which would be of value both to bioassessment and non-native species monitoring. The ability to assign species identities to DNA sequences found depends on the availability of comprehensive DNA reference libraries. Here, we compile inventories for aquatic metazoans extant in or threatening to invade the Laurentian Great Lakes and examine the availability of reference mitochondrial COI DNA sequences (barcodes) in the Barcode of Life Data System for them. We found barcode libraries largely complete for extant and threatening-to-invade vertebrates (100% of reptile, 99% of fish, and 92% of amphibian species had barcodes). In contrast, barcode libraries remain poorly developed for precisely those organisms where morphological identification is most challenging; 46% of extant invertebrates lacked reference barcodes with rates especially high among rotifers, oligochaetes, and mites. Lack of species-level identification for many aquatic invertebrates also is a barrier to matching DNA sequences with physical specimens. Attaining the potential for DNA-based identification of mixed-organism samples covering the breadth of aquatic fauna requires a concerted effort to build supporting barcode libraries and voucher collections.
Active role of a human genomic insert in replication of a yeast artificial chromosome.
van Brabant, A J; Fangman, W L; Brewer, B J
1999-06-01
Yeast artificial chromosomes (YACs) are a common tool for cloning eukaryotic DNA. The manner by which large pieces of foreign DNA are assimilated by yeast cells into a functional chromosome is poorly understood, as is the reason why some of them are stably maintained and some are not. We examined the replication of a stable YAC containing a 240-kb insert of DNA from the human T-cell receptor beta locus. The human insert contains multiple sites that serve as origins of replication. The activity of these origins appears to require the yeast ARS consensus sequence and, as with yeast origins, additional flanking sequences. In addition, the origins in the human insert exhibit a spacing, a range of activation efficiencies, and a variation in times of activation during S phase similar to those found for normal yeast chromosomes. We propose that an appropriate combination of replication origin density, activation times, and initiation efficiencies is necessary for the successful maintenance of YAC inserts.
Cloning of a promoter-like soybean DNA sequence responding to IAA induction in Escherichia coli K12.
Kline, E L; Chiang, S J; Lattora, D; Chaung, W
1992-02-01
We have constructed a soybean genomic DNA library in Escherichia coli K12 strain KC13 using plasmid pPV33, which consists of a promoter-less tetracycline resistance (Tcr) gene. A recombinant clone, KC13(pAU-SB1)+, was obtained by selecting for resistance to tetracycline in the presence of indole-3-acetic acid (IAA). Restriction enzyme cleavage and Southern hybridization analysis revealed that the pAU-SB1 plasmid has a 250 bp soybean DNA insert fused with the Tcr gene. In the presence of a selected group of auxins, induction of the Tcr phenotype and mRNA synthesis of the Tcr gene are observed only in KC13(pAU-SB1)+ cultures. On the other hand, induction of the Tcr phenotype and mRNA synthesis of the Tcr gene are absent in cells harboring the cloning vector pPV33 or a recombinant plasmid containing the 250 bp insert in the reverse orientation, pAU-SB1ro. This demonstrated a need for the insertion of the 250 bp soybean DNA and the specificity of its orientation in response to IAA induction. The start point of mRNA transcription in response to IAA, IBA, IPA, 2,4,5-T, and a-NAP is at base pair -96 or -95 upstream of the translational start site of the Tcr gene and base pair -98 with 2,4-D.
Robust DNA Isolation and High-throughput Sequencing Library Construction for Herbarium Specimens.
Saeidi, Saman; McKain, Michael R; Kellogg, Elizabeth A
2018-03-08
Herbaria are an invaluable source of plant material that can be used in a variety of biological studies. The use of herbarium specimens is associated with a number of challenges including sample preservation quality, degraded DNA, and destructive sampling of rare specimens. In order to more effectively use herbarium material in large sequencing projects, a dependable and scalable method of DNA isolation and library preparation is needed. This paper demonstrates a robust, beginning-to-end protocol for DNA isolation and high-throughput library construction from herbarium specimens that does not require modification for individual samples. This protocol is tailored for low quality dried plant material and takes advantage of existing methods by optimizing tissue grinding, modifying library size selection, and introducing an optional reamplification step for low yield libraries. Reamplification of low yield DNA libraries can rescue samples derived from irreplaceable and potentially valuable herbarium specimens, negating the need for additional destructive sampling and without introducing discernible sequencing bias for common phylogenetic applications. The protocol has been tested on hundreds of grass species, but is expected to be adaptable for use in other plant lineages after verification. This protocol can be limited by extremely degraded DNA, where fragments do not exist in the desired size range, and by secondary metabolites present in some plant material that inhibit clean DNA isolation. Overall, this protocol introduces a fast and comprehensive method that allows for DNA isolation and library preparation of 24 samples in less than 13 h, with only 8 h of active hands-on time with minimal modifications.
Roggo, Clémence; Coronado, Edith; Moreno-Forero, Silvia K; Harshman, Keith; Weber, Johann; van der Meer, Jan Roelof
2013-10-01
Sphingomonas wittichii RW1 is a dibenzofuran and dibenzodioxin-degrading bacterium with potentially interesting properties for bioaugmentation of contaminated sites. In order to understand the capacity of the microorganism to survive in the environment we used a genome-wide transposon scanning approach. RW1 transposon libraries were generated with around 22,000 independent insertions. Libraries were grown for an average of 50 generations (five successive passages in batch liquid medium) with salicylate as sole carbon and energy source in presence or absence of salt stress at -1.5 MPa. Alternatively, libraries were grown in sand with salicylate, at 50% water holding capacity, for 4 and 10 days (equivalent to 7 generations). Library DNA was recovered from the different growth conditions and scanned by ultrahigh throughput sequencing for the positions and numbers of inserted transposed kanamycin resistance gene. No transposon reads were recovered in 579 genes (10% of all annotated genes in the RW1 genome) in any of the libraries, suggesting those to be essential for survival under the used conditions. Libraries recovered from sand differed strongly from those incubated in liquid batch medium. In particular, important functions for survival of cells in sand at the short term concerned nutrient scavenging, energy metabolism and motility. In contrast to this, fatty acid metabolism and oxidative stress response were essential for longer term survival of cells in sand. Comparison to transcriptome data suggested important functions in sand for flagellar movement, pili synthesis, trehalose and polysaccharide synthesis and putative cell surface antigen proteins. Interestingly, a variety of genes were also identified, interruption of which cause significant increase in fitness during growth on salicylate. One of these was an Lrp family transcription regulator and mutants in this gene covered more than 90% of the total library after 50 generations of growth on salicylate. Our results demonstrate the power of genome-wide transposon scanning approaches for analysis of complex traits. © 2013 John Wiley & Sons Ltd and Society for Applied Microbiology.
Amin, Shivani; Rastogi, Rajesh P; Sonani, Ravi R; Ray, Arabinda; Sharma, Rakesh; Madamwar, Datta
2018-04-15
To explore the potential genes from the industrially polluted Amlakhadi canal, located in Ankleshwar, Gujarat, India, its community genome was extracted and cloned into E. coli EPI300™-T1 R using a fosmid vector (pCC2 FOS™) generating a library of 3,92,000 clones with average size of 40kb of DNA-insert. From this library, the clone DM1 producing brown colored melanin-like pigment was isolated and characterized. For over expression of the pigment, further sub-cloning of the clone DM1 was done. Sub-clone containing 10kb of the insert was sequenced for gene identification. The amino acids sequence of a protein 4-Hydroxyphenylpyruvate dioxygenase (HPPD), which is know to be involved in melanin biosynthesis was obtained from the gene sequence. The sequence-homology based 3D structure model of HPPD was constructed and analyzed. The physico-chemical nature of pigment was further analysed using 1 H and 13 C NMR, LC-MS, FTIR and UV-visible spectroscopy. The pigment was readily soluble in DMSO with an absorption maximum around 290nm. Based on the genetic and chemical characterization, the compound was confirmed as melanin-like pigment. The present results indicate that the metagenomic library from industrially polluted environment generated a microbial tool for the production of melanin-like pigment. Copyright © 2018 Elsevier B.V. All rights reserved.
Retroviral expression screening of oncogenes in natural killer cell leukemia.
Choi, Young Lim; Moriuchi, Ryozo; Osawa, Mitsujiro; Iwama, Atsushi; Makishima, Hideki; Wada, Tomoaki; Kisanuki, Hiroyuki; Kaneda, Ruri; Ota, Jun; Koinuma, Koji; Ishikawa, Madoka; Takada, Shuji; Yamashita, Yoshihiro; Oshimi, Kazuo; Mano, Hiroyuki
2005-08-01
Aggressive natural killer cell leukemia (ANKL) is an intractable malignancy that is characterized by the outgrowth of NK cells. To identify transforming genes in ANKL, we constructed a retroviral cDNA expression library from an ANKL cell line KHYG-1. Infection of 3T3 cells with recombinant retroviruses yielded 33 transformed foci. Nucleotide sequencing of the DNA inserts recovered from these foci revealed that 31 of them encoded KRAS2 with a glycine-to-alanine mutation at codon 12. Mutation-specific PCR analysis indicated that the KRAS mutation was present only in KHYG-1 cells, not in another ANKL cell line or in clinical specimens (n=8).
Bashir, Ali; Bansal, Vikas; Bafna, Vineet
2010-06-18
Massively parallel DNA sequencing technologies have enabled the sequencing of several individual human genomes. These technologies are also being used in novel ways for mRNA expression profiling, genome-wide discovery of transcription-factor binding sites, small RNA discovery, etc. The multitude of sequencing platforms, each with their unique characteristics, pose a number of design challenges, regarding the technology to be used and the depth of sequencing required for a particular sequencing application. Here we describe a number of analytical and empirical results to address design questions for two applications: detection of structural variations from paired-end sequencing and estimating mRNA transcript abundance. For structural variation, our results provide explicit trade-offs between the detection and resolution of rearrangement breakpoints, and the optimal mix of paired-read insert lengths. Specifically, we prove that optimal detection and resolution of breakpoints is achieved using a mix of exactly two insert library lengths. Furthermore, we derive explicit formulae to determine these insert length combinations, enabling a 15% improvement in breakpoint detection at the same experimental cost. On empirical short read data, these predictions show good concordance with Illumina 200 bp and 2 Kbp insert length libraries. For transcriptome sequencing, we determine the sequencing depth needed to detect rare transcripts from a small pilot study. With only 1 Million reads, we derive corrections that enable almost perfect prediction of the underlying expression probability distribution, and use this to predict the sequencing depth required to detect low expressed genes with greater than 95% probability. Together, our results form a generic framework for many design considerations related to high-throughput sequencing. We provide software tools http://bix.ucsd.edu/projects/NGS-DesignTools to derive platform independent guidelines for designing sequencing experiments (amount of sequencing, choice of insert length, mix of libraries) for novel applications of next generation sequencing.
Sabri, Suriana; Steen, Jennifer A; Bongers, Mareike; Nielsen, Lars K; Vickers, Claudia E
2013-06-24
Metabolic engineering projects often require integration of multiple genes in order to control the desired phenotype. However, this often requires iterative rounds of engineering because many current insertion approaches are limited by the size of the DNA that can be transferred onto the chromosome. Consequently, construction of highly engineered strains is very time-consuming. A lack of well-characterised insertion loci is also problematic. A series of knock-in/knock-out (KIKO) vectors was constructed for integration of large DNA sequences onto the E. coli chromosome at well-defined loci. The KIKO plasmids target three nonessential genes/operons as insertion sites: arsB (an arsenite transporter); lacZ (β-galactosidase); and rbsA-rbsR (a ribose metabolism operon). Two homologous 'arms' target each insertion locus; insertion is mediated by λ Red recombinase through these arms. Between the arms is a multiple cloning site for the introduction of exogenous sequences and an antibiotic resistance marker (either chloramphenicol or kanamycin) for selection of positive recombinants. The resistance marker can subsequently be removed by flippase-mediated recombination. The insertion cassette is flanked by hairpin loops to isolate it from the effects of external transcription at the integration locus. To characterize each target locus, a xylanase reporter gene (xynA) was integrated onto the chromosomes of E. coli strains W and K-12 using the KIKO vectors. Expression levels varied between loci, with the arsB locus consistently showing the highest level of expression. To demonstrate the simultaneous use of all three loci in one strain, xynA, green fluorescent protein (gfp) and a sucrose catabolic operon (cscAKB) were introduced into lacZ, arsB and rbsAR respectively, and shown to be functional. The KIKO plasmids are a useful tool for efficient integration of large DNA fragments (including multiple genes and pathways) into E. coli. Chromosomal insertion provides stable expression without the need for continuous antibiotic selection. Three non-essential loci have been characterised as insertion loci; combinatorial insertion at all three loci can be performed in one strain. The largest insertion at a single site described here was 5.4 kb; we have used this method in other studies to insert a total of 7.3 kb at one locus and 11.3 kb across two loci. These vectors are particularly useful for integration of multigene cassettes for metabolic engineering applications.
Bowers, Robert M.; Clum, Alicia; Tice, Hope; ...
2015-10-24
Background: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composi tion and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for largemore » initial DNA template quantities during library preparation. Results: Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions: This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bowers, Robert M.; Clum, Alicia; Tice, Hope
Background: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composi tion and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for largemore » initial DNA template quantities during library preparation. Results: Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions: This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities.« less
Bartels, Daniela; Kespohl, Sebastian; Albaum, Stefan; Drüke, Tanja; Goesmann, Alexander; Herold, Julia; Kaiser, Olaf; Pühler, Alfred; Pfeiffer, Friedhelm; Raddatz, Günter; Stoye, Jens; Meyer, Folker; Schuster, Stephan C
2005-04-01
We provide the graphical tool BACCardI for the construction of virtual clone maps from standard assembler output files or BLAST based sequence comparisons. This new tool has been applied to numerous genome projects to solve various problems including (a) validation of whole genome shotgun assemblies, (b) support for contig ordering in the finishing phase of a genome project, and (c) intergenome comparison between related strains when only one of the strains has been sequenced and a large insert library is available for the other. The BACCardI software can seamlessly interact with various sequence assembly packages. Genomic assemblies generated from sequence information need to be validated by independent methods such as physical maps. The time-consuming task of building physical maps can be circumvented by virtual clone maps derived from read pair information of large insert libraries.
Construction and Screening of Marine Metagenomic Large Insert Libraries.
Weiland-Bräuer, Nancy; Langfeldt, Daniela; Schmitz, Ruth A
2017-01-01
The marine environment covers more than 70 % of the world's surface. Marine microbial communities are highly diverse and have evolved during extended evolutionary processes of physiological adaptations under the influence of a variety of ecological conditions and selection pressures. They harbor an enormous diversity of microbes with still unknown and probably new physiological characteristics. In the past, marine microbes, mostly bacteria of microbial consortia attached to marine tissues of multicellular organisms, have proven to be a rich source of highly potent bioactive compounds, which represent a considerable number of drug candidates. However, to date, the biodiversity of marine microbes and the versatility of their bioactive compounds and metabolites have not been fully explored. This chapter describes sampling in the marine environment, construction of metagenomic large insert libraries from marine habitats, and exemplarily one function based screen of metagenomic clones for identification of quorum quenching activities.
Devirgiliis, Chiara; Barile, Simona; Perozzi, Giuditta
2014-01-01
Lactic acid bacteria (LAB) represent the predominant microbiota in fermented foods. Foodborne LAB have received increasing attention as potential reservoir of antibiotic resistance (AR) determinants, which may be horizontally transferred to opportunistic pathogens. We have previously reported isolation of AR LAB from the raw ingredients of a fermented cheese, while AR genes could be detected in the final, marketed product only by PCR amplification, thus pointing at the need for more sensitive microbial isolation techniques. We turned therefore to construction of a metagenomic library containing microbial DNA extracted directly from the food matrix. To maximize yield and purity and to ensure that genomic complexity of the library was representative of the original bacterial population, we defined a suitable protocol for total DNA extraction from cheese which can also be applied to other lipid-rich foods. Functional library screening on different antibiotics allowed recovery of ampicillin and kanamycin resistant clones originating from Streptococcus salivarius subsp. thermophilus and Lactobacillus helveticus genomes. We report molecular characterization of the cloned inserts, which were fully sequenced and shown to confer AR phenotype to recipient bacteria. We also show that metagenomics can be applied to food microbiota to identify underrepresented species carrying specific genes of interest. PMID:25243126
Large Genomic Fragment Deletions and Insertions in Mouse Using CRISPR/Cas9
Satheka, Achim Cchitvsanzwhoh; Togo, Jacques; An, Yao; Humphrey, Mabwi; Ban, Luying; Ji, Yan; Jin, Honghong; Feng, Xuechao; Zheng, Yaowu
2015-01-01
ZFN, TALENs and CRISPR/Cas9 system have been used to generate point mutations and large fragment deletions and insertions in genomic modifications. CRISPR/Cas9 system is the most flexible and fast developing technology that has been extensively used to make mutations in all kinds of organisms. However, the most mutations reported up to date are small insertions and deletions. In this report, CRISPR/Cas9 system was used to make large DNA fragment deletions and insertions, including entire Dip2a gene deletion, about 65kb in size, and β-galactosidase (lacZ) reporter gene insertion of larger than 5kb in mouse. About 11.8% (11/93) are positive for 65kb deletion from transfected and diluted ES clones. High targeting efficiencies in ES cells were also achieved with G418 selection, 46.2% (12/26) and 73.1% (19/26) for left and right arms respectively. Targeted large fragment deletion efficiency is about 21.4% of live pups or 6.0% of injected embryos. Targeted insertion of lacZ reporter with NEO cassette showed 27.1% (13/48) of targeting rate by ES cell transfection and 11.1% (2/18) by direct zygote injection. The procedures have bypassed in vitro transcription by directly co-injection of zygotes or co-transfection of embryonic stem cells with circular plasmid DNA. The methods are technically easy, time saving, and cost effective in generating mouse models and will certainly facilitate gene function studies. PMID:25803037
CORALINA: a universal method for the generation of gRNA libraries for CRISPR-based screening.
Köferle, Anna; Worf, Karolina; Breunig, Christopher; Baumann, Valentin; Herrero, Javier; Wiesbeck, Maximilian; Hutter, Lukas H; Götz, Magdalena; Fuchs, Christiane; Beck, Stephan; Stricker, Stefan H
2016-11-14
The bacterial CRISPR system is fast becoming the most popular genetic and epigenetic engineering tool due to its universal applicability and adaptability. The desire to deploy CRISPR-based methods in a large variety of species and contexts has created an urgent need for the development of easy, time- and cost-effective methods enabling large-scale screening approaches. Here we describe CORALINA (comprehensive gRNA library generation through controlled nuclease activity), a method for the generation of comprehensive gRNA libraries for CRISPR-based screens. CORALINA gRNA libraries can be derived from any source of DNA without the need of complex oligonucleotide synthesis. We show the utility of CORALINA for human and mouse genomic DNA, its reproducibility in covering the most relevant genomic features including regulatory, coding and non-coding sequences and confirm the functionality of CORALINA generated gRNAs. The simplicity and cost-effectiveness make CORALINA suitable for any experimental system. The unprecedented sequence complexities obtainable with CORALINA libraries are a necessary pre-requisite for less biased large scale genomic and epigenomic screens.
Building a DNA barcode library of Alaska's non-marine arthropods.
Sikes, Derek S; Bowser, Matthew; Morton, John M; Bickford, Casey; Meierotto, Sarah; Hildebrandt, Kyndall
2017-03-01
Climate change may result in ecological futures with novel species assemblages, trophic mismatch, and mass extinction. Alaska has a limited taxonomic workforce to address these changes. We are building a DNA barcode library to facilitate a metabarcoding approach to monitoring non-marine arthropods. Working with the Canadian Centre for DNA Barcoding, we obtained DNA barcodes from recently collected and authoritatively identified specimens in the University of Alaska Museum (UAM) Insect Collection and the Kenai National Wildlife Refuge collection. We submitted tissues from 4776 specimens, of which 81% yielded DNA barcodes representing 1662 species and 1788 Barcode Index Numbers (BINs), of primarily terrestrial, large-bodied arthropods. This represents 84% of the species available for DNA barcoding in the UAM Insect Collection. There are now 4020 Alaskan arthropod species represented by DNA barcodes, after including all records in Barcode of Life Data Systems (BOLD) of species that occur in Alaska - i.e., 48.5% of the 8277 Alaskan, non-marine-arthropod, named species have associated DNA barcodes. An assessment of the identification power of the library in its current state yielded fewer species-level identifications than expected, but the results were not discouraging. We believe we are the first to deliberately begin development of a DNA barcode library of the entire arthropod fauna for a North American state or province. Although far from complete, this library will become increasingly valuable as more species are added and costs to obtain DNA sequences fall.
2010-01-01
genes from strains that have desirable traits. Here, we aim to enlarge the E. coli genome using Lactobacillus plantarum genes to build cells tolerant to...EtOH and BT. L. plantarum is an organism with established high tolerance to alcohols and solvents more broadly. Objective 2: Build a stress...heterologous (here: L. plantarum ; abbreviated as L. pl) DNA into the E. coli chromosome while selecting for insertions that enhance ethanol tolerance (which
[Screening and identification of anoikis-resistant gene UBCH7 in esophageal cancer cells].
Yang, Yang; Wang, Bo-Shi; Wang, Xiao-Min; Zhang, Yu; Wang, Ming-Rong; Jia, Xue-Mei
2012-02-01
Anoikis is a kind of programmed cell death induced by loss of extracellular matrix (ECM) adhesion, which is one of key factors for homestasis. Resistance to anoikis is required for tumor cell metastasis. We have previously shown several anoikis-resistance genes in esophageal squamous cell carcinoma (ESCC). In order to find novel anoikis-resistant genes in ESCC, we constructed retroviral cDNA library using total RNA from ESCC cell lines. NIH 3T3 cells, which are sensitive to anoikis, were infected with the library constructed. The cells were cultured in soft agar, and the clones which can survive in detached states were selected. The cDNAs inserted into the anoikis-resistant NIH3T3 clones were amplified using retroviral specific primers. Sequencing analysis showed that a cDNA fragment inserted into the anoikis-resistant clone contains full coding sequence (ORF) of human UBCH7/UBE2L3 gene. By infection with retrovirus encoding UBCH7 ORF (pMSCV-UBCH7), forced expression of UBCH7 increased the anoikis-resistance of NIH3T3 cells. More importantly, knockdown of UBCH7 expression by siRNA transfection reduced the anoikis-resistant ability of esophageal cancer MLuC1 cells. The data suggest that UBCH7/UBE2L3 gene would be involved in anoikis-resistance in ESCC.
Shahsavarian, Melody A; Le Minoux, Damien; Matti, Kalyankumar M; Kaveri, Srini; Lacroix-Desmazes, Sébastien; Boquet, Didier; Friboulet, Alain; Avalle, Bérangère; Padiolleau-Lefèvre, Séverine
2014-05-01
Phage display antibody libraries have proven to have a significant role in the discovery of therapeutic antibodies and polypeptides with desired biological and physicochemical properties. Obtaining a large and diverse phage display antibody library, however, is always a challenging task. Various steps of this technique can still undergo optimization in order to obtain an efficient library. In the construction of a single chain fragment variable (scFv) phage display library, the cloning of the scFv fragments into a phagemid vector is of crucial importance. An efficient restriction enzyme digestion of the scFv DNA leads to its proper ligation with the phagemid followed by its successful cloning and expression. Here, we are reporting a different approach to enhance the efficiency of the restriction enzyme digestion step. We have exploited rolling circle amplification (RCA) to produce a long strand of DNA with tandem repeats of scFv sequences, which is found to be highly susceptible to restriction digestion. With this important modification, we are able to construct a large phage display antibody library of naive SJL/J mice. The size of the library is estimated as ~10(8) clones. The number of clones containing a scFv fragment is estimated at 90%. Hence, the present results could considerably aid the utilization of the phage-display technique in order to get an efficiently large antibody library. Copyright © 2014 Elsevier B.V. All rights reserved.
In silico Analysis of 2085 Clones from a Normalized Rat Vestibular Periphery 3′ cDNA Library
Roche, Joseph P.; Cioffi, Joseph A.; Kwitek, Anne E.; Erbe, Christy B.; Popper, Paul
2005-01-01
The inserts from 2400 cDNA clones isolated from a normalized Rattus norvegicus vestibular periphery cDNA library were sequenced and characterized. The Wackym-Soares vestibular 3′ cDNA library was constructed from the saccular and utricular maculae, the ampullae of all three semicircular canals and Scarpa's ganglia containing the somata of the primary afferent neurons, microdissected from 104 male and female rats. The inserts from 2400 randomly selected clones were sequenced from the 5′ end. Each sequence was analyzed using the BLAST algorithm compared to the Genbank nonredundant, rat genome, mouse genome and human genome databases to search for high homology alignments. Of the initial 2400 clones, 315 (13%) were found to be of poor quality and did not yield useful information, and therefore were eliminated from the analysis. Of the remaining 2085 sequences, 918 (44%) were found to represent 758 unique genes having useful annotations that were identified in databases within the public domain or in the published literature; these sequences were designated as known characterized sequences. 1141 sequences (55%) aligned with 1011 unique sequences had no useful annotations and were designated as known but uncharacterized sequences. Of the remaining 26 sequences (1%), 24 aligned with rat genomic sequences, but none matched previously described rat expressed sequence tags or mRNAs. No significant alignment to the rat or human genomic sequences could be found for the remaining 2 sequences. Of the 2085 sequences analyzed, 86% were singletons. The known, characterized sequences were analyzed with the FatiGO online data-mining tool (http://fatigo.bioinfo.cnio.es/) to identify level 5 biological process gene ontology (GO) terms for each alignment and to group alignments with similar or identical GO terms. Numerous genes were identified that have not been previously shown to be expressed in the vestibular system. Further characterization of the novel cDNA sequences may lead to the identification of genes with vestibular-specific functions. Continued analysis of the rat vestibular periphery transcriptome should provide new insights into vestibular function and generate new hypotheses. Physiological studies are necessary to further elucidate the roles of the identified genes and novel sequences in vestibular function. PMID:16103642
Litovchick, Alexander; Clark, Matthew A; Keefe, Anthony D
2014-01-01
The affinity-mediated selection of large libraries of DNA-encoded small molecules is increasingly being used to initiate drug discovery programs. We present universal methods for the encoding of such libraries using the chemical ligation of oligonucleotides. These methods may be used to record the chemical history of individual library members during combinatorial synthesis processes. We demonstrate three different chemical ligation methods as examples of information recording processes (writing) for such libraries and two different cDNA-generation methods as examples of information retrieval processes (reading) from such libraries. The example writing methods include uncatalyzed and Cu(I)-catalyzed alkyne-azide cycloadditions and a novel photochemical thymidine-psoralen cycloaddition. The first reading method “relay primer-dependent bypass” utilizes a relay primer that hybridizes across a chemical ligation junction embedded in a fixed-sequence and is extended at its 3′-terminus prior to ligation to adjacent oligonucleotides. The second reading method “repeat-dependent bypass” utilizes chemical ligation junctions that are flanked by repeated sequences. The upstream repeat is copied prior to a rearrangement event during which the 3′-terminus of the cDNA hybridizes to the downstream repeat and polymerization continues. In principle these reading methods may be used with any ligation chemistry and offer universal strategies for the encoding (writing) and interpretation (reading) of DNA-encoded chemical libraries. PMID:25483841
Wu, Yiming; Hu, Xiaomin; Ge, Yong; Zheng, Dasheng; Yuan, Zhiming
2012-05-01
Bacillus sphaericus has been used with great success in mosquito control programs worldwide. Under conditions of nutrient limitation, it undergoes sporulation via a series of well defined morphological stages. However, only a small number of genes involved in sporulation have been identified. To identify genes associated with sporulation, and to understand the relationship between sporulation and crystal protein synthesis, a random mariner-based transposon insertion mutant library of B. sphaericus strain 2297 was constructed and seven sporulation-defective mutants were selected. Sequencing of the DNA flanking of the transposon insertion identified several genes involved in sporulation. The morphologies of mutants were determined by electron microscopy and synthesis of crystal proteins was analyzed by SDS-PAGE and Western blot. Four mutants blocked at early stages of sporulation failed to produce crystal proteins and had lower larvicidal activity. However, the other three mutants were blocked at later stages and were able to form crystal proteins, and the larvicidal activity was similar to wild type. These results indicated that crystal protein synthesis in B. sphaericus is dependent on sporulation initiation. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
White, K Makay; Matthews, Melinda K; Hughes, Rachel C; Sommer, Andrew J; Griffitts, Joel S; Newell, Peter D; Chaston, John M
2018-03-28
A metagenome wide association (MGWA) study of bacterial host association determinants in Drosophila predicted that LPS biosynthesis genes are significantly associated with host colonization. We were unable to create site-directed mutants for each of the predicted genes in Acetobacter , so we created an arrayed transposon insertion library using Acetobacter fabarum DsW_054 isolated from Drosophila Creation of the A. fabarum DsW_054 gene knock-out library was performed by combinatorial mapping and Illumina sequencing of random transposon insertion mutants. Transposon insertion locations for 6,418 mutants were successfully mapped, including hits within 63% of annotated genes in the A. fabarum DsW_054 genome. For 45/45 members of the library, insertion sites were verified by arbitrary PCR and Sanger sequencing. Mutants with insertions in four different LPS biosynthesis genes were selected from the library to validate the MGWA predictions. Insertion mutations in two genes biosynthetically upstream of Lipid-A formation, lpxC and lpxB , show significant differences in host association, whereas mutations in two genes encoding LPS biosynthesis functions downstream of Lipid-A biosynthesis had no effect. These results suggest an impact of bacterial cell surface molecules on the bacterial capacity for host association. Also, the transposon insertion mutant library will be a useful resource for ongoing research on the genetic basis for Acetobacter traits. Copyright © 2018 White et al.
Ito, Masahiro; Kim, Yun-Gi; Tsuji, Hirokazu; Takahashi, Takuya; Kiwaki, Mayumi; Nomoto, Koji; Danbara, Hirofumi; Okada, Nobuhiko
2014-01-01
Lactobacillus casei ATCC 27139 enhances host innate immunity, and the J1 phage-resistant mutants of this strain lose the activity. A transposon insertion mutant library of L. casei ATCC 27139 was constructed, and nine J1 phage-resistant mutants out of them were obtained. Cloning and sequencing analyses identified three independent genes that were disrupted by insertion of the transposon element: asnH, encoding asparagine synthetase, and dnaJ and dnaK, encoding the molecular chaperones DnaJ and DnaK, respectively. Using an in vivo mouse model of Listeria infection, only asnH mutant showed deficiency in their ability to enhance host innate immunity, and complementation of the mutation by introduction of the wild-type asnH in the mutant strain recovered the immuno-augmenting activity. AsnH protein exhibited asparagine synthetase activity when the lysozyme-treated cell wall extracts of L. casei ATCC 27139 was added as substrate. The asnH mutants lost the thick and rigid peptidoglycan features that are characteristic to the wild-type cells, indicating that AsnH of L. casei is involved in peptidoglycan biosynthesis. These results indicate that asnH is required for the construction of the peptidoglycan composition involved in the immune-activating capacity of L. casei ATCC 27139.
Anton, Brian P; Mongodin, Emmanuel F; Agrawal, Sonia; Fomenkov, Alexey; Byrd, Devon R; Roberts, Richard J; Raleigh, Elisabeth A
2015-01-01
We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems.
Anton, Brian P.; Mongodin, Emmanuel F.; Agrawal, Sonia; Fomenkov, Alexey; Byrd, Devon R.; Roberts, Richard J.; Raleigh, Elisabeth A.
2015-01-01
We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems. PMID:26010885
Knietsch, Anja; Waschkowitz, Tanja; Bowien, Susanne; Henne, Anke; Daniel, Rolf
2003-01-01
Metagenomic DNA libraries from three different soil samples (meadow, sugar beet field, cropland) were constructed. The three unamplified libraries comprised approximately 1267000 independent clones and harbored approximately 4.05 Gbp of environmental DNA. Approximately 300000 recombinant Escherichia coli strains of each library per test substrate were screened for the production of carbonyls from short-chain (C2 to C4) polyols such as 1,2-ethanediol, 2,3-butanediol, and a mixture of glycerol and 1,2-propanediol on indicator agar. Twenty-four positive E. COLI clones were obtained during the initial screen. Fifteen of them contained recombinant plasmids, designated pAK201-215, which conferred a stable carbonyl-forming phenotype on E. coli Sequencing revealed that the inserts of pAK201-215 encoded 26 complete and 14 incomplete predicted protein-encoding genes. Most of these genes were similar to genes with unknown functions from other microorganisms or unrelated to any other known gene. The further analysis was focused on the 7 plasmids (pAK204, pAK206, pAK208, and pAK210-213) recovered from the positive clones, which exhibited an NAD(H)-dependent alcohol oxidoreductase activity with polyols or the correlating carbonyls as substrates in crude extracts. Three genes (ORF6, ORF24, and ORF25) conferring this activity were identified during subcloning of the inserts of pAK204, pAK211, and pAK212. The sequences of the three deduced gene products revealed no significant similarities to known alcohol oxidoreductases, but contained putative glycine-rich regions, which are characteristic for binding of nicotinamide cofactors. Copyright 2003 S. Karger AG, Basel
Kim, Jae-Eung; Huang, Rui; Chen, Hui; You, Chun; Zhang, Y-H Percival
2016-09-01
A foolproof protocol was developed for the construction of mutant DNA library for directed protein evolution. First, a library of linear mutant gene was generated by error-prone PCR or molecular shuffling, and a linear vector backbone was prepared by high-fidelity PCR. Second, the amplified insert and vector fragments were assembled by overlap-extension PCR with a pair of 5'-phosphorylated primers. Third, full-length linear plasmids with phosphorylated 5'-ends were self-ligated with T4 ligase, yielding circular plasmids encoding mutant variants suitable for high-efficiency transformation. Self-made competent Escherichia coli BL21(DE3) showed a transformation efficiency of 2.4 × 10(5) cfu/µg of the self-ligated circular plasmid. Using this method, three mutants of mCherry fluorescent protein were found to alter their colors and fluorescent intensities under visible and UV lights, respectively. Also, one mutant of 6-phosphorogluconate dehydrogenase from a thermophilic bacterium Moorella thermoacetica was found to show the 3.5-fold improved catalytic efficiency (kcat /Km ) on NAD(+) as compared to the wild-type. This protocol is DNA-sequence independent, and does not require restriction enzymes, special E. coli host, or labor-intensive optimization. In addition, this protocol can be used for subcloning the relatively long DNA sequences into any position of plasmids. Copyright © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Lin, Jinke; Kudrna, Dave; Wing, Rod A.
2011-01-01
We describe the construction and characterization of a publicly available BAC library for the tea plant, Camellia sinensis. Using modified methods, the library was constructed with the aim of developing public molecular resources to advance tea plant genomics research. The library consists of a total of 401,280 clones with an average insert size of 135 kb, providing an approximate coverage of 13.5 haploid genome equivalents. No empty vector clones were observed in a random sampling of 576 BAC clones. Further analysis of 182 BAC-end sequences from randomly selected clones revealed a GC content of 40.35% and low chloroplast and mitochondrial contamination. Repetitive sequence analyses indicated that LTR retrotransposons were the most predominant sequence class (86.93%–87.24%), followed by DNA retrotransposons (11.16%–11.69%). Additionally, we found 25 simple sequence repeats (SSRs) that could potentially be used as genetic markers. PMID:21234344
Quadros, Rolen M; Miura, Hiromi; Harms, Donald W; Akatsuka, Hisako; Sato, Takehito; Aida, Tomomi; Redder, Ronald; Richardson, Guy P; Inagaki, Yutaka; Sakai, Daisuke; Buckley, Shannon M; Seshacharyulu, Parthasarathy; Batra, Surinder K; Behlke, Mark A; Zeiner, Sarah A; Jacobi, Ashley M; Izu, Yayoi; Thoreson, Wallace B; Urness, Lisa D; Mansour, Suzanne L; Ohtsuka, Masato; Gurumurthy, Channabasavaiah B
2017-05-17
Conditional knockout mice and transgenic mice expressing recombinases, reporters, and inducible transcriptional activators are key for many genetic studies and comprise over 90% of mouse models created. Conditional knockout mice are generated using labor-intensive methods of homologous recombination in embryonic stem cells and are available for only ~25% of all mouse genes. Transgenic mice generated by random genomic insertion approaches pose problems of unreliable expression, and thus there is a need for targeted-insertion models. Although CRISPR-based strategies were reported to create conditional and targeted-insertion alleles via one-step delivery of targeting components directly to zygotes, these strategies are quite inefficient. Here we describe Easi-CRISPR (Efficient additions with ssDNA inserts-CRISPR), a targeting strategy in which long single-stranded DNA donors are injected with pre-assembled crRNA + tracrRNA + Cas9 ribonucleoprotein (ctRNP) complexes into mouse zygotes. We show for over a dozen loci that Easi-CRISPR generates correctly targeted conditional and insertion alleles in 8.5-100% of the resulting live offspring. Easi-CRISPR solves the major problem of animal genome engineering, namely the inefficiency of targeted DNA cassette insertion. The approach is robust, succeeding for all tested loci. It is versatile, generating both conditional and targeted insertion alleles. Finally, it is highly efficient, as treating an average of only 50 zygotes is sufficient to produce a correctly targeted allele in up to 100% of live offspring. Thus, Easi-CRISPR offers a comprehensive means of building large-scale Cre-LoxP animal resources.
LeProust, Emily M.; Peck, Bill J.; Spirin, Konstantin; McCuen, Heather Brummel; Moore, Bridget; Namsaraev, Eugeni; Caruthers, Marvin H.
2010-01-01
We have achieved the ability to synthesize thousands of unique, long oligonucleotides (150mers) in fmol amounts using parallel synthesis of DNA on microarrays. The sequence accuracy of the oligonucleotides in such large-scale syntheses has been limited by the yields and side reactions of the DNA synthesis process used. While there has been significant demand for libraries of long oligos (150mer and more), the yields in conventional DNA synthesis and the associated side reactions have previously limited the availability of oligonucleotide pools to lengths <100 nt. Using novel array based depurination assays, we show that the depurination side reaction is the limiting factor for the synthesis of libraries of long oligonucleotides on Agilent Technologies’ SurePrint® DNA microarray platform. We also demonstrate how depurination can be controlled and reduced by a novel detritylation process to enable the synthesis of high quality, long (150mer) oligonucleotide libraries and we report the characterization of synthesis efficiency for such libraries. Oligonucleotide libraries prepared with this method have changed the economics and availability of several existing applications (e.g. targeted resequencing, preparation of shRNA libraries, site-directed mutagenesis), and have the potential to enable even more novel applications (e.g. high-complexity synthetic biology). PMID:20308161
Characterization of (CA)n microsatellite repeats from large-insert clones.
Litt, M; Browne, D
2001-05-01
The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit determination of sequences flanking the microsatellites. When cosmids or large-insert phage clones are used as primary sources of (CA)n repeat markers, they have traditionally been subcloned into plasmid vectors such as pUC18 or M13 mp 18/19 cloning vectors to obtain fragments of suitable size for DNA sequencing. This unit presents an alternative approach whereby a set of degenerate sequencing primers that anneal directly to (CA)n microsatellites can be used to determine sequences that are inaccessible with vector-derived primers. Because the primers anneal to the repeat and not to the vector, they can be used with subclones containing inserts of several kilobases and should, in theory, always give sequence in the regions directly flanking the repeat. Degeneracy at the 3 end of each of these primers prevents elongation of primers that have annealed out-of-register. The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit.
2013-01-01
Background Cotton, one of the world’s leading crops, is important to the world’s textile and energy industries, and is a model species for studies of plant polyploidization, cellulose biosynthesis and cell wall biogenesis. Here, we report the construction of a plant-transformation-competent binary bacterial artificial chromosome (BIBAC) library and comparative genome sequence analysis of polyploid Upland cotton (Gossypium hirsutum L.) with one of its diploid putative progenitor species, G. raimondii Ulbr. Results We constructed the cotton BIBAC library in a vector competent for high-molecular-weight DNA transformation in different plant species through either Agrobacterium or particle bombardment. The library contains 76,800 clones with an average insert size of 135 kb, providing an approximate 99% probability of obtaining at least one positive clone from the library using a single-copy probe. The quality and utility of the library were verified by identifying BIBACs containing genes important for fiber development, fiber cellulose biosynthesis, seed fatty acid metabolism, cotton-nematode interaction, and bacterial blight resistance. In order to gain an insight into the Upland cotton genome and its relationship with G. raimondii, we sequenced nearly 10,000 BIBAC ends (BESs) randomly selected from the library, generating approximately one BES for every 250 kb along the Upland cotton genome. The retroelement Gypsy/DIRS1 family predominates in the Upland cotton genome, accounting for over 77% of all transposable elements. From the BESs, we identified 1,269 simple sequence repeats (SSRs), of which 1,006 were new, thus providing additional markers for cotton genome research. Surprisingly, comparative sequence analysis showed that Upland cotton is much more diverged from G. raimondii at the genomic sequence level than expected. There seems to be no significant difference between the relationships of the Upland cotton D- and A-subgenomes with the G. raimondii genome, even though G. raimondii contains a D genome (D5). Conclusions The library represents the first BIBAC library in cotton and related species, thus providing tools useful for integrative physical mapping, large-scale genome sequencing and large-scale functional analysis of the Upland cotton genome. Comparative sequence analysis provides insights into the Upland cotton genome, and a possible mechanism underlying the divergence and evolution of polyploid Upland cotton from its diploid putative progenitor species, G. raimondii. PMID:23537070
Ruiz, Lorena; Motherway, Mary O'Connell; Lanigan, Noreen; van Sinderen, Douwe
2013-01-01
Bifidobacteria are claimed to contribute positively to human health through a range of beneficial or probiotic activities, including amelioration of gastrointestinal and metabolic disorders, and therefore this particular group of gastrointestinal commensals has enjoyed increasing industrial and scientific attention in recent years. However, the molecular mechanisms underlying these probiotic mechanisms are still largely unknown, mainly due to the fact that molecular tools for bifidobacteria are rather poorly developed, with many strains lacking genetic accessibility. In this work, we describe the generation of transposon insertion mutants in two bifidobacterial strains, B. breve UCC2003 and B. breve NCFB2258. We also report the creation of the first transposon mutant library in a bifidobacterial strain, employing B. breve UCC2003 and a Tn5-based transposome strategy. The library was found to be composed of clones containing single transposon insertions which appear to be randomly distributed along the genome. The usefulness of the library to perform phenotypic screenings was confirmed through identification and analysis of mutants defective in D-galactose, D-lactose or pullulan utilization abilities.
Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome.
Nicholson, Matthew J; Theodorou, Michael K; Brookman, Jayne L
2005-01-01
The anaerobic gut fungi occupy a unique niche in the intestinal tract of large herbivorous animals and are thought to act as primary colonizers of plant material during digestion. They are the only known obligately anaerobic fungi but molecular analysis of this group has been hampered by difficulties in their culture and manipulation, and by their extremely high A+T nucleotide content. This study begins to answer some of the fundamental questions about the structure and organization of the anaerobic gut fungal genome. Directed plasmid libraries using genomic DNA digested with highly or moderately rich AT-specific restriction enzymes (VspI and EcoRI) were prepared from a polycentric Orpinomyces isolate. Clones were sequenced from these libraries and the breadth of genomic inserts, both genic and intergenic, was characterized. Genes encoding numerous functions not previously characterized for these fungi were identified, including cytoskeletal, secretory pathway and transporter genes. A peptidase gene with no introns and having sequence similarity to a gene encoding a bacterial peptidase was also identified, extending the range of metabolic enzymes resulting from apparent trans-kingdom transfer from bacteria to fungi, as previously characterized largely for genes encoding plant-degrading enzymes. This paper presents the first thorough analysis of the genic, intergenic and rDNA regions of a variety of genomic segments from an anaerobic gut fungus and provides observations on rules governing intron boundaries, the codon biases observed with different types of genes, and the sequence of only the second anaerobic gut fungal promoter reported. Large numbers of retrotransposon sequences of different types were found and the authors speculate on the possible consequences of any such transposon activity in the genome. The coding sequences identified included several orphan gene sequences, including one with regions strongly suggestive of structural proteins such as collagens and lampirin. This gene was present as a single copy in Orpinomyces, was expressed during vegetative growth and was also detected in genomes from another gut fungal genus, Neocallimastix.
Using Phage Display to Create Recombinant Antibodies.
Dasch, James R; Dasch, Amy L
2017-09-01
A variety of phage display technologies have been developed since the approach was first described for antibodies. The most widely used approaches incorporate antibody sequences into the minor coat protein pIII of the nonlytic filamentous phage fd or M13. Libraries of variable gene sequences, encoding either scFv or Fab fragments, are made by incorporating sequences into phagemid vectors. The phagemid is packaged into phage particles with the assistance of a helper phage to produce the antibody display phage. This protocol describes a method for creating a phagemid library. The multiple cloning site (MCS) of the pBluescript KS(-) phagemid vector is replaced by digestion with the restriction enzyme BssHII, followed by the insertion of four overlapping oligonucleotides to create a new MCS within the vector. Next, the 3' portion of gene III (from M13mp18) is amplified and combined with an antibody sequence using overlap extension PCR. This product is inserted into the phagemid vector to create pPDS. Two helper plasmids are also created from the modified pBluescript vector: pLINK provides the linker between the heavy and light chains, and pFABC provides the CH1 domain of the heavy chain. An antibody cDNA library is constructed from the RNA of interest and ligated into pPDS. The phagemid library is electroporated into Escherichia coli cells along with the VCS-M13 helper phage. © 2017 Cold Spring Harbor Laboratory Press.
Indel detection from DNA and RNA sequencing data with transIndel.
Yang, Rendong; Van Etten, Jamie L; Dehm, Scott M
2018-04-19
Insertions and deletions (indels) are a major class of genomic variation associated with human disease. Indels are primarily detected from DNA sequencing (DNA-seq) data but their transcriptional consequences remain unexplored due to challenges in discriminating medium-sized and large indels from splicing events in RNA-seq data. Here, we developed transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. TransIndel exhibits competitive or superior performance over eight state-of-the-art indel detection tools on benchmarks using both synthetic and real DNA-seq data. Additionally, we applied transIndel to DNA-seq and RNA-seq datasets from 333 primary prostate cancer patients from The Cancer Genome Atlas (TCGA) and 59 metastatic prostate cancer patients from AACR-PCF Stand-Up- To-Cancer (SU2C) studies. TransIndel enhanced the taxonomy of DNA- and RNA-level alterations in prostate cancer by identifying recurrent FOXA1 indels as well as exitron splicing in genes implicated in disease progression. Our study demonstrates that transIndel is a robust tool for elucidation of medium- and large-sized indels from DNA-seq and RNA-seq data. Including RNA-seq in indel discovery efforts leads to significant improvements in sensitivity for identification of med-sized and large indels missed by DNA-seq, and reveals non-canonical RNA-splicing events in genes associated with disease pathology.
Construction of a scFv Library with Synthetic, Non-combinatorial CDR Diversity.
Bai, Xuelian; Shim, Hyunbo
2017-01-01
Many large synthetic antibody libraries have been designed, constructed, and successfully generated high-quality antibodies suitable for various demanding applications. While synthetic antibody libraries have many advantages such as optimized framework sequences and a broader sequence landscape than natural antibodies, their sequence diversities typically are generated by random combinatorial synthetic processes which cause the incorporation of many undesired CDR sequences. Here, we describe the construction of a synthetic scFv library using oligonucleotide mixtures that contain predefined, non-combinatorially synthesized CDR sequences. Each CDR is first inserted to a master scFv framework sequence and the resulting single-CDR libraries are subjected to a round of proofread panning. The proofread CDR sequences are assembled to produce the final scFv library with six diversified CDRs.
Constructing high complexity synthetic libraries of long ORFs using in vitro selection
NASA Technical Reports Server (NTRS)
Cho, G.; Keefe, A. D.; Liu, R.; Wilson, D. S.; Szostak, J. W.
2000-01-01
We present a method that can significantly increase the complexity of protein libraries used for in vitro or in vivo protein selection experiments. Protein libraries are often encoded by chemically synthesized DNA, in which part of the open reading frame is randomized. There are, however, major obstacles associated with the chemical synthesis of long open reading frames, especially those containing random segments. Insertions and deletions that occur during chemical synthesis cause frameshifts, and stop codons in the random region will cause premature termination. These problems can together greatly reduce the number of full-length synthetic genes in the library. We describe a strategy in which smaller segments of the synthetic open reading frame are selected in vitro using mRNA display for the absence of frameshifts and stop codons. These smaller segments are then ligated together to form combinatorial libraries of long uninterrupted open reading frames. This process can increase the number of full-length open reading frames in libraries by up to two orders of magnitude, resulting in protein libraries with complexities of greater than 10(13). We have used this methodology to generate three types of displayed protein library: a completely random sequence library, a library of concatemerized oligopeptide cassettes with a propensity for forming amphipathic alpha-helical or beta-strand structures, and a library based on one of the most common enzymatic scaffolds, the alpha/beta (TIM) barrel. Copyright 2000 Academic Press.
A specific DNA probe which identifies Babesia bovis in whole blood.
Petchpoo, W; Tan-ariya, P; Boonsaeng, V; Brockelman, C R; Wilairat, P; Panyim, S
1992-05-01
A genomic library of Babesia bovis DNA from the Mexican strain M was constructed in plasmid pUN121 and cloned in Escherichia coli. Several recombinants which hybridized strongly to radioactively labeled B. bovis genomic DNA in an in situ screening were selected and further analyzed for those which specifically hybridized to B. bovis DNA. It was found that pMU-B1 had the highest sensitivity, detecting 25 pg of purified B. bovis DNA, and 300 parasites in 10 microliters of whole infected blood, or 0.00025% parasitemia. pMU-B1 contained a 6.0 kb B. bovis DNA insert which did not cross-hybridize to Babesia bigemina, Trypanosoma evansi, Plasmodium falciparum, Anaplasma marginale, Boophilus microplus and cow DNA. In the Southern blot analysis of genomic DNA, pMU-B1 could differentiate between two B. bovis geographic isolates, Mexican strain M and Thai isolate TS4. Thus, the pMU-B1 probe will be useful in the diagnosis of Babesia infection in cattle and ticks, and in the differentiation of B. bovis strains.
Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.
Ananiev, E V; Phillips, R L; Rines, H W
1998-01-01
The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055
Templated sequence insertion polymorphisms in the human genome
NASA Astrophysics Data System (ADS)
Onozawa, Masahiro; Aplan, Peter
2016-11-01
Templated Sequence Insertion Polymorphism (TSIP) is a recently described form of polymorphism recognized in the human genome, in which a sequence that is templated from a distant genomic region is inserted into the genome, seemingly at random. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; Class 1 TSIPs show features of insertions that are mediated via the LINE-1 ORF2 protein, including 1) target-site duplication (TSD), 2) polyadenylation 10-30 nucleotides downstream of a “cryptic” polyadenylation signal, and 3) preference for insertion at a 5’-TTTT/A-3’ sequence. In contrast, class 2 TSIPs show features consistent with repair of a DNA double-strand break via insertion of a DNA “patch” that is derived from a distant genomic region. Survey of a large number of normal human volunteers demonstrates that most individuals have 25-30 TSIPs, and that these TSIPs track with specific geographic regions. Similar to other forms of human polymorphism, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases.
Complementary DNA libraries: an overview.
Ying, Shao-Yao
2004-07-01
The generation of complete and full-length cDNA libraries for potential functional assays of specific gene sequences is essential for most molecules in biotechnology and biomedical research. The field of cDNA library generation has changed rapidly in the past 10 yr. This review presents an overview of the method available for the basic information of generating cDNA libraries, including the definition of the cDNA library, different kinds of cDNA libraries, difference between methods for cDNA library generation using conventional approaches and a novel strategy, and the quality of cDNA libraries. It is anticipated that the high-quality cDNA libraries so generated would facilitate studies involving genechips and the microarray, differential display, subtractive hybridization, gene cloning, and peptide library generation.
A novel sodium bicarbonate cotransporter-like gene in an ancient duplicated region: SLC4A9 at 5q31
Lipovich, Leonard; Lynch, Eric D; Lee, Ming K; King, Mary-Claire
2001-01-01
Background: Sodium bicarbonate cotransporter (NBC) genes encode proteins that execute coupled Na+ and HCO3- transport across epithelial cell membranes. We report the discovery, characterization, and genomic context of a novel human NBC-like gene, SLC4A9, on chromosome 5q31. Results: SLC4A9 was initially discovered by genomic sequence annotation and further characterized by sequencing of long-insert cDNA library clones. The predicted protein of 990 amino acids has 12 transmembrane domains and high sequence similarity to other NBCs. The 23-exon gene has 14 known mRNA isoforms. In three regions, mRNA sequence variation is generated by the inclusion or exclusion of portions of an exon. Noncoding SLC4A9 cDNAs were recovered multiple times from different libraries. The 3' untranslated region is fragmented into six alternatively spliced exons and contains expressed Alu, LINE and MER repeats. SLC4A9 has two alternative stop codons and six polyadenylation sites. Its expression is largely restricted to the kidney. In silico approaches were used to characterize two additional novel SLC4A genes and to place SLC4A9 within the context of multiple paralogous gene clusters containing members of the epidermal growth factor (EGF), ankyrin (ANK) and fibroblast growth factor (FGF) families. Seven human EGF-SLC4A-ANK-FGF clusters were found. Conclusion: The novel sodium bicarbonate cotransporter-like gene SLC4A9 demonstrates abundant alternative mRNA processing. It belongs to a growing class of functionally diverse genes characterized by inefficient highly variable splicing. The evolutionary history of the EGF-SLC4A-ANK-FGF gene clusters involves multiple rounds of duplication, apparently followed by large insertions and deletions at paralogous loci and genome-wide gene shuffling. PMID:11305939
GABI-Kat SimpleSearch: new features of the Arabidopsis thaliana T-DNA mutant database.
Kleinboelting, Nils; Huep, Gunnar; Kloetgen, Andreas; Viehoever, Prisca; Weisshaar, Bernd
2012-01-01
T-DNA insertion mutants are very valuable for reverse genetics in Arabidopsis thaliana. Several projects have generated large sequence-indexed collections of T-DNA insertion lines, of which GABI-Kat is the second largest resource worldwide. User access to the collection and its Flanking Sequence Tags (FSTs) is provided by the front end SimpleSearch (http://www.GABI-Kat.de). Several significant improvements have been implemented recently. The database now relies on the TAIRv10 genome sequence and annotation dataset. All FSTs have been newly mapped using an optimized procedure that leads to improved accuracy of insertion site predictions. A fraction of the collection with weak FST yield was re-analysed by generating new FSTs. Along with newly found predictions for older sequences about 20,000 new FSTs were included in the database. Information about groups of FSTs pointing to the same insertion site that is found in several lines but is real only in a single line are included, and many problematic FST-to-line links have been corrected using new wet-lab data. SimpleSearch currently contains data from ~71,000 lines with predicted insertions covering 62.5% of the 27,206 nuclear protein coding genes, and offers insertion allele-specific data from 9545 confirmed lines that are available from the Nottingham Arabidopsis Stock Centre.
Kröber, Magdalena; Bekel, Thomas; Diaz, Naryttza N; Goesmann, Alexander; Jaenicke, Sebastian; Krause, Lutz; Miller, Dimitri; Runte, Kai J; Viehöver, Prisca; Pühler, Alfred; Schlüter, Andreas
2009-06-01
The phylogenetic structure of the microbial community residing in a fermentation sample from a production-scale biogas plant fed with maize silage, green rye and liquid manure was analysed by an integrated approach using clone library sequences and metagenome sequence data obtained by 454-pyrosequencing. Sequencing of 109 clones from a bacterial and an archaeal 16S-rDNA amplicon library revealed that the obtained nucleotide sequences are similar but not identical to 16S-rDNA database sequences derived from different anaerobic environments including digestors and bioreactors. Most of the bacterial 16S-rDNA sequences could be assigned to the phylum Firmicutes with the most abundant class Clostridia and to the class Bacteroidetes, whereas most archaeal 16S-rDNA sequences cluster close to the methanogen Methanoculleus bourgensis. Further sequences of the archaeal library most probably represent so far non-characterised species within the genus Methanoculleus. A similar result derived from phylogenetic analysis of mcrA clone sequences. The mcrA gene product encodes the alpha-subunit of methyl-coenzyme-M reductase involved in the final step of methanogenesis. BLASTn analysis applying stringent settings resulted in assignment of 16S-rDNA metagenome sequence reads to 62 16S-rDNA amplicon sequences thus enabling frequency of abundance estimations for 16S-rDNA clone library sequences. Ribosomal Database Project (RDP) Classifier processing of metagenome 16S-rDNA reads revealed abundance of the phyla Firmicutes, Bacteroidetes and Euryarchaeota and the orders Clostridiales, Bacteroidales and Methanomicrobiales. Moreover, a large fraction of 16S-rDNA metagenome reads could not be assigned to lower taxonomic ranks, demonstrating that numerous microorganisms in the analysed fermentation sample of the biogas plant are still unclassified or unknown.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cremer, T.; Popp, S.; Emmerich, P.
1990-01-01
Chromosomal in situ suppression (CISS)-hybridization of biotinylated phage DNA-library inserts from sorted human chromosomes was used to decorate chromosomes 1 and 7 specifically from pter to qter and to detect structural aberrations of these chromosomes in irradiated human peripheral lymphocytes. In addition, probe pUC1.77 was used to mark the 1q12 subregion in normal and aberrant chromosomes 1. Low LET radiation (60Co-gamma-rays; 1.17 and 1.33 MeV) of lymphocyte cultures was performed with various doses (D = 0, 2, 4, 8 Gy) 5 h after stimulation with phytohaemagglutinin. Irradiated cells were cultivated for an additional 67 h before Colcemid arrested metaphase spreadsmore » were obtained. Aberrations of the specifically stained chromosomes, such as deletions, dicentrics, and rings, were readily scored after in situ hybridization with either the 1q12 specific probe or DNA-library inserts. By the latter approach, translocations of the specifically stained chromosomes could also be reliably assessed. A linear increase of the percentage of specifically stained aberrant chromosomes was observed when plotted as a function of the square of the dose D. A particular advantage of this new approach is provided by the possibility to delineate numerical and structural chromosome aberrations directly in interphase nuclei. These results indicate that cytogenetic monitoring of ionizing radiation may be considerably facilitated by CISS-hybridization.« less
Katzif, Samuel; Danavall, Damien; Bowers, Samera; Balthazar, Jacqueline T.; Shafer, William M.
2003-01-01
A Tn551 insertional library of Staphylococcus aureus strain ISP479 was challenged with an antimicrobial peptide (CG 117-136) derived from human neutrophil cathepsin G (CG). After repeated selection and screening of surviving colonies, a mutant was identified that expressed increased resistance to CG 117-136. Southern hybridization analysis revealed that the Tn551 insert in this mutant (SK1) was carried on a 10.6-kb EcoRI chromosomal DNA fragment. Subsequent physical mapping of this Tn551 insert revealed that it was positioned between a putative promoter sequence and the translational start codon of the cspA gene, which encodes a protein (CspA) highly similar to the major cold shock proteins CspA and CspB of Escherichia coli and Bacillus subtilis, respectively. This Tn551 insertion as well as a separate deletion-insertion mutation in cspA decreased the capacity of S. aureus to respond to the stress of cold shock and increased resistance to CG 117-136. The results indicate for the first time that a physiologic link exists between bacterial susceptibility to an antimicrobial peptide and a stress response system. PMID:12874306
Katzif, Samuel; Danavall, Damien; Bowers, Samera; Balthazar, Jacqueline T; Shafer, William M
2003-08-01
A Tn551 insertional library of Staphylococcus aureus strain ISP479 was challenged with an antimicrobial peptide (CG 117-136) derived from human neutrophil cathepsin G (CG). After repeated selection and screening of surviving colonies, a mutant was identified that expressed increased resistance to CG 117-136. Southern hybridization analysis revealed that the Tn551 insert in this mutant (SK1) was carried on a 10.6-kb EcoRI chromosomal DNA fragment. Subsequent physical mapping of this Tn551 insert revealed that it was positioned between a putative promoter sequence and the translational start codon of the cspA gene, which encodes a protein (CspA) highly similar to the major cold shock proteins CspA and CspB of Escherichia coli and Bacillus subtilis, respectively. This Tn551 insertion as well as a separate deletion-insertion mutation in cspA decreased the capacity of S. aureus to respond to the stress of cold shock and increased resistance to CG 117-136. The results indicate for the first time that a physiologic link exists between bacterial susceptibility to an antimicrobial peptide and a stress response system.
DNA-Encoded Dynamic Combinatorial Chemical Libraries.
Reddavide, Francesco V; Lin, Weilin; Lehnert, Sarah; Zhang, Yixin
2015-06-26
Dynamic combinatorial chemistry (DCC) explores the thermodynamic equilibrium of reversible reactions. Its application in the discovery of protein binders is largely limited by difficulties in the analysis of complex reaction mixtures. DNA-encoded chemical library (DECL) technology allows the selection of binders from a mixture of up to billions of different compounds; however, experimental results often show low a signal-to-noise ratio and poor correlation between enrichment factor and binding affinity. Herein we describe the design and application of DNA-encoded dynamic combinatorial chemical libraries (EDCCLs). Our experiments have shown that the EDCCL approach can be used not only to convert monovalent binders into high-affinity bivalent binders, but also to cause remarkably enhanced enrichment of potent bivalent binders by driving their in situ synthesis. We also demonstrate the application of EDCCLs in DNA-templated chemical reactions. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Retroviral DNA Integration Directed by HIV Integration Protein in Vitro
NASA Astrophysics Data System (ADS)
Bushman, Frederic D.; Fujiwara, Tamio; Craigie, Robert
1990-09-01
Efficient retroviral growth requires integration of a DNA copy of the viral RNA genome into a chromosome of the host. As a first step in analyzing the mechanism of integration of human immunodeficiency virus (HIV) DNA, a cell-free system was established that models the integration reaction. The in vitro system depends on the HIV integration (IN) protein, which was partially purified from insect cells engineered to express IN protein in large quantities. Integration was detected in a biological assay that scores the insertion of a linear DNA containing HIV terminal sequences into a λ DNA target. Some integration products generated in this assay contained five-base pair duplications of the target DNA at the recombination junctions, a characteristic of HIV integration in vivo; the remaining products contained aberrant junctional sequences that may have been produced in a variation of the normal reaction. These results indicate that HIV IN protein is the only viral protein required to insert model HIV DNA sequences into a target DNA in vitro.
Molecular cloning and physical mapping of the genome of fish lymphocystis disease virus.
Darai, G; Delius, H; Clarke, J; Apfel, H; Schnitzler, P; Flügel, R M
1985-10-30
A defined and complete gene library of the fish lymphocystis disease virus (FLDV) genome was established. FLDV DNA was cleaved with EcoRI, BamHI, EcoRI/BamHI and EcoRI/HindIII and the resulting fragments were inserted into the corresponding sites of the pACYC184 or pAT153 plasmid vectors using T4 DNA ligase. Since FLDV DNA is highly methylated at CpG sequences (Darai et al., 1983; Wagner et al., 1985), an Escherichia coli GC-3 strain was required to amplify the recombinant plasmids harboring the FLDV DNA fragments. Bacterial colonies harboring recombinant plasmids were selected. All cloned fragments were individually identified by digestion of the recombinant plasmid DNA with different restriction enzymes and screened by hybridization of recombinant plasmid DNA to viral DNA. This analysis revealed that sequences representing 100% of the viral genome were cloned. Using these recombinant plasmids, the physical maps of the genome were constructed for BamHI, EcoRI, BestEII, and PstI restriction endonucleases. Although the FLDV genome is linear, due to circular permutation the restriction maps are circular.
Highly efficient CRISPR/HDR-mediated knock-in for mouse embryonic stem cells and zygotes.
Wang, Bangmei; Li, Kunyu; Wang, Amy; Reiser, Michelle; Saunders, Thom; Lockey, Richard F; Wang, Jia-Wang
2015-10-01
The clustered regularly interspaced short palindromic repeat (CRISPR) gene editing technique, based on the non-homologous end-joining (NHEJ) repair pathway, has been used to generate gene knock-outs with variable sizes of small insertion/deletions with high efficiency. More precise genome editing, either the insertion or deletion of a desired fragment, can be done by combining the homology-directed-repair (HDR) pathway with CRISPR cleavage. However, HDR-mediated gene knock-in experiments are typically inefficient, and there have been no reports of successful gene knock-in with DNA fragments larger than 4 kb. Here, we describe the targeted insertion of large DNA fragments (7.4 and 5.8 kb) into the genomes of mouse embryonic stem (ES) cells and zygotes, respectively, using the CRISPR/HDR technique without NHEJ inhibitors. Our data show that CRISPR/HDR without NHEJ inhibitors can result in highly efficient gene knock-in, equivalent to CRISPR/HDR with NHEJ inhibitors. Although NHEJ is the dominant repair pathway associated with CRISPR-mediated double-strand breaks (DSBs), and biallelic gene knock-ins are common, NHEJ and biallelic gene knock-ins were not detected. Our results demonstrate that efficient targeted insertion of large DNA fragments without NHEJ inhibitors is possible, a result that should stimulate interest in understanding the mechanisms of high efficiency CRISPR targeting in general.
Construction of a large-scale Burkholderia cenocepacia J2315 transposon mutant library
NASA Astrophysics Data System (ADS)
Wong, Yee-Chin; Pain, Arnab; Nathan, Sheila
2014-09-01
Burkholderia cenocepacia, a pathogenic member of the Burkholderia cepacia complex (Bcc), has emerged as a significant threat towards cystic fibrosis patients, where infection often leads to the fatal clinical manifestation known as cepacia syndrome. Many studies have investigated the pathogenicity of B. cenocepacia as well as its ability to become highly resistant towards many of the antibiotics currently in use. In addition, studies have also been undertaken to understand the pathogen's capacity to adapt and survive in a broad range of environments. Transposon based mutagenesis has been widely used in creating insertional knock-out mutants and coupled with recent advances in sequencing technology, robust tools to study gene function in a genome-wide manner have been developed based on the assembly of saturated transposon mutant libraries. In this study, we describe the construction of a large-scale library of B. cenocepacia transposon mutants. To create transposon mutants of B. cenocepacia strain J2315, electrocompetent bacteria were electrotransformed with the EZ-Tn5
NASA Astrophysics Data System (ADS)
Liu, Hongzhan; Zheng, Fengrong; Sun, Xiuqin; Cai, Yimei
2012-06-01
The aquaculture of sea cucumber Apostichopus japonicus (Echinodermata, Holothuroidea) has grown rapidly during recent years and has become an important sector of the marine industry in Northern China. However, with the rapid growth of the industry and the use of non-standard culture techniques, epidemic diseases of A. japonicus now pose increasing problems to the industry. To screen the genes with stress response to bacterial infection in sea cucumber at a genome wide level, we constructed a cDNA library from A. japonicus Selenka (Aspidochirotida: Stichopodidae) after infecting them with Vibrio sp. for 48 h. Total RNA was extracted from the intestine, mesentery and coelomocyte of infected sea cucumber using Trizol and mRNA was isolated by Oligotex mRNA Kits. The ligated cDNAs were transformed into DH5α, and a library of 3.24×105 clones (3.24×105 cfu mL-1) was obtained with the sizes of inserted fragments ranging from 0.8 to 2.5 kb. Sequencing the cDNA clones resulted in a total of 1106 ESTs that passed the quality control. BlastX and BlastN searches have identified 168 (31.5%) ESTs sharing significant homology with known sequences in NCBI protein or nucleotide databases. Among a panel of 25 putative immunity-related genes, serum lectin isoform, complement component 3, complement component 3-like genes were further studied by real-time PCR and they all increased more than 5 fold in response to Vibrio sp. challenge. Our library provides a valuable molecular tool for future study of invertebrate immunity against bacterial infection and our gene expression data indicates the importance of the immune system in the evolution and development of sea cucumber.
Yokozaki, H; Tahara, H; Oue, N; Tahara, E
2000-01-01
A new transcription variant of hepatocyte growth factor/scatter factor (HGF/SF) was cloned from human gastric cancer cell line HSC-39. Northern blot analysis of eight human gastric cancer cell lines (TMK-1, MKN-1, MKN-7, MKN-28, MKN-45, MKN-74, KATO-III and HSC-39) demonstrated that HSC-39 cells expressed a 1.3 kb abnormal HGF/SF transcript. Screening of 1 x 10(6) colonies of cDNA library from HSC-39 constructed in pAP3neo mammalian expression vector selected four positive clones containing HGF/SF transcript. Among them, two contained a 1.3 kbp insert detecting the identical transcript to that obtained with HGF/SF probe by Northern blotting. Deoxynucleotide sequencing of the 1.3 kbp insert revealed that it was composed of a part of HGF/SF cDNA from exon 14 to exon 18, corresponding to the whole sequence of HGF/SF light chain, with 5' 75 nucleotides unrelated to any sequence involved in HGF/SF.
Continuous Influx of Genetic Material from Host to Virus Populations
Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane
2016-01-01
Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors. PMID:26829124
Continuous Influx of Genetic Material from Host to Virus Populations.
Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane; Cordaux, Richard; Herniou, Elisabeth A
2016-02-01
Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors.
Genomic gigantism: DNA loss is slow in mountain grasshoppers.
Bensasson, D; Petrov, D A; Zhang, D X; Hartl, D L; Hewitt, G M
2001-02-01
Several studies have shown DNA loss to be inversely correlated with genome size in animals. These studies include a comparison between Drosophila and the cricket, Laupala, but there has been no assessment of DNA loss in insects with very large genomes. Podisma pedestris, the brown mountain grasshopper, has a genome over 100 times as large as that of Drosophila and 10 times as large as that of Laupala. We used 58 paralogous nuclear pseudogenes of mitochondrial origin to study the characteristics of insertion, deletion, and point substitution in P. pedestris and Italopodisma. In animals, these pseudogenes are "dead on arrival"; they are abundant in many different eukaryotes, and their mitochondrial origin simplifies the identification of point substitutions accumulated in nuclear pseudogene lineages. There appears to be a mononucleotide repeat within the 643-bp pseudogene sequence studied that acts as a strong hot spot for insertions or deletions (indels). Because the data for other insect species did not contain such an unusual region, hot spots were excluded from species comparisons. The rate of DNA loss relative to point substitution appears to be considerably and significantly lower in the grasshoppers studied than in Drosophila or Laupala. This suggests that the inverse correlation between genome size and the rate of DNA loss can be extended to comparisons between insects with large or gigantic genomes (i.e., Laupala and Podisma). The low rate of DNA loss implies that in grasshoppers, the accumulation of point mutations is a more potent force for obscuring ancient pseudogenes than their loss through indel accumulation, whereas the reverse is true for Drosophila. The main factor contributing to the difference in the rates of DNA loss estimated for grasshoppers, crickets, and Drosophila appears to be deletion size. Large deletions are relatively rare in Podisma and Italopodisma.
Novel encoding methods for DNA-templated chemical libraries.
Li, Gang; Zheng, Wenlu; Liu, Ying; Li, Xiaoyu
2015-06-01
Among various types of DNA-encoded chemical libraries, DNA-templated library takes advantage of the sequence-specificity of DNA hybridization, enabling not only highly effective DNA-templated chemical reactions, but also high fidelity in library encoding. This brief review summarizes recent advances that have been made on the encoding strategies for DNA-templated libraries, and it also highlights their respective advantages and limitations for the preparation of DNA-encoded libraries. Copyright © 2015 Elsevier Ltd. All rights reserved.
A non-canonical transferred DNA insertion at the BRI1 locus in Arabidopsis thaliana.
Zhao, Zhong; Zhu, Yan; Erhardt, Mathieu; Ruan, Ying; Shen, Wen-Hui
2009-04-01
Agrobacterium-mediated transformation is widely used in transgenic plant engineering and has been proven to be a powerful tool for insertional mutagenesis of the plant genome. The transferred DNA (T-DNA) from Agrobacterium is integrated into the plant genome through illegitimate recombination between the T-DNA and the plant DNA. Contrasting to the canonical insertion, here we report on a locus showing a complex mutation associated with T-DNA insertion at the BRI1 gene in Arabidopsis thaliana. We obtained a mutant line, named salade for its phenotype of dwarf stature and proliferating rosette. Molecular characterization of this mutant revealed that in addition to T-DNA a non-T-DNA-localized transposon from bacteria was inserted in the Arabidopsis genome and that a region of more than 11.5 kb of the Arabidopsis genome was deleted at the insertion site. The deleted region contains the brassinosteroid receptor gene BRI1 and the transcription factor gene WRKY13. Our finding reveals non-canonical T-DNA insertion, implicating horizontal gene transfer and cautioning the use of T-DNA as mutagen in transgenic research.
Using PATIMDB to Create Bacterial Transposon Insertion Mutant Libraries
Urbach, Jonathan M.; Wei, Tao; Liberati, Nicole; Grenfell-Lee, Daniel; Villanueva, Jacinto; Wu, Gang; Ausubel, Frederick M.
2015-01-01
PATIMDB is a software package for facilitating the generation of transposon mutant insertion libraries. The software has two main functions: process tracking and automated sequence analysis. The process tracking function specifically includes recording the status and fates of multiwell plates and samples in various stages of library construction. Automated sequence analysis refers specifically to the pipeline of sequence analysis starting with ABI files from a sequencing facility and ending with insertion location identifications. The protocols in this unit describe installation and use of PATIMDB software. PMID:19343706
Kim, Sungmin; Song, Kyo-Hong; Ree, Han-Il; Kim, Won
2012-01-01
Non-biting midges (Diptera: Chironomidae) are a diverse population that commonly causes respiratory allergies in humans. Chironomid larvae can be used to indicate freshwater pollution, but accurate identification on the basis of morphological characteristics is difficult. In this study, we constructed a mitochondrial cytochrome c oxidase subunit I (COI)-based DNA barcode library for Korean chironomids. This library consists of 211 specimens from 49 species, including adults and unidentified larvae. The interspecies and intraspecies COI sequence variations were analyzed. Sophisticated indexes were developed in order to properly evaluate indistinct barcode gaps that are created by insufficient sampling on both the interspecies and intraspecies levels and by variable mutation rates across taxa. In a variety of insect datasets, these indexes were useful for re-evaluating large barcode datasets and for defining COI barcode gaps. The COI-based DNA barcode library will provide a rapid and reliable tool for the molecular identification of Korean chironomid species. Furthermore, this reverse-taxonomic approach will be improved by the continuous addition of other speceis’ sequences to the library. PMID:22138764
Public antibodies to malaria antigens generated by two LAIR1 insertion modalities.
Pieper, Kathrin; Tan, Joshua; Piccoli, Luca; Foglierini, Mathilde; Barbieri, Sonia; Chen, Yiwei; Silacci-Fregni, Chiara; Wolf, Tobias; Jarrossay, David; Anderle, Marica; Abdi, Abdirahman; Ndungu, Francis M; Doumbo, Ogobara K; Traore, Boubacar; Tran, Tuan M; Jongo, Said; Zenklusen, Isabelle; Crompton, Peter D; Daubenberger, Claudia; Bull, Peter C; Sallusto, Federica; Lanzavecchia, Antonio
2017-08-31
In two previously described donors, the extracellular domain of LAIR1, a collagen-binding inhibitory receptor encoded on chromosome 19 (ref. 1), was inserted between the V and DJ segments of an antibody. This insertion generated, through somatic mutations, broadly reactive antibodies against RIFINs, a type of variant antigen expressed on the surface of Plasmodium falciparum-infected erythrocytes. To investigate how frequently such antibodies are produced in response to malaria infection, we screened plasma from two large cohorts of individuals living in malaria-endemic regions. Here we report that 5-10% of malaria-exposed individuals, but none of the European blood donors tested, have high levels of LAIR1-containing antibodies that dominate the response to infected erythrocytes without conferring enhanced protection against febrile malaria. By analysing the antibody-producing B cell clones at the protein, cDNA and gDNA levels, we characterized additional LAIR1 insertions between the V and DJ segments and discovered a second insertion modality whereby the LAIR1 exon encoding the extracellular domain and flanking intronic sequences are inserted into the switch region. By exon shuffling, this mechanism leads to the production of bispecific antibodies in which the LAIR1 domain is precisely positioned at the elbow between the VH and CH1 domains. Additionally, in one donor the genomic DNA encoding the VH and CH1 domains was deleted, leading to the production of a camel-like LAIR1-containing antibody. Sequencing of the switch regions of memory B cells from European blood donors revealed frequent templated inserts originating from transcribed genes that, in rare cases, comprised exons with orientations and frames compatible with expression. These results reveal different modalities of LAIR1 insertion that lead to public and dominant antibodies against infected erythrocytes and suggest that insertion of templated DNA represents an additional mechanism of antibody diversification that can be selected in the immune response against pathogens and exploited for B cell engineering.
Nieminen, Mikko; Tuuri, Timo; Savilahti, Harri
2010-10-01
Human embryonic stem cells are pluripotent cells derived from early human embryo and retain a potential to differentiate into all adult cell types. They provide vast opportunities in cell replacement therapies and are expected to become significant tools in drug discovery as well as in the studies of cellular and developmental functions of human genes. The progress in applying different types of DNA recombination reactions for genome modification in a variety of eukaryotic cell types has provided means to utilize recombination-based strategies also in human embryonic stem cells. Homologous recombination-based methods, particularly those utilizing extended homologous regions and those employing zinc finger nucleases to boost genomic integration, have shown their usefulness in efficient genome modification. Site-specific recombination systems are potent genome modifiers, and they can be used to integrate DNA into loci that contain an appropriate recombination signal sequence, either naturally occurring or suitably pre-engineered. Non-homologous recombination can be used to generate random integrations in genomes relatively effortlessly, albeit with a moderate efficiency and precision. DNA transposition-based strategies offer substantially more efficient random strategies and provide means to generate single-copy insertions, thus potentiating the generation of genome-wide insertion libraries applicable in genetic screens. 2010 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mullinax, R.L.; Gross, E.A.; Amberg, J.R.
1990-10-01
The authors have applied a molecular biology approach to the identification of human monoclonal antibodies. Human peripheral blood lymphocyte mRNA was converted to cDNA and a select subset was amplified by the polymerase chain reaction. These products, containing coding sequences for numerous immunoglobulin heavy- and {kappa} light-chain variable and constant region domains, were inserted into modified bacteriophase {lambda} expression vectors and introduced into Escherichia coli by infection to yield a combinatorial immunoexpression library. Clones with binding activity to tetanus toxoid were identified by filter hybridization with radiolabeled antigen and appeared at a frequency of 0.2{percent} in the library. These humanmore » antigen binding fragments, consisting of a heavy-chain fragment covalently linked to a light chain, displayed high affinity of binding to tetanus toxoid with equilibrium constants in the nanomolar range but did not cross-react with other proteins tested. They estimate that this human immunoexpression library contains 20,000 clones with high affinity and specificity to our chosen antigen.« less
Chimeric TALE recombinases with programmable DNA sequence specificity.
Mercer, Andrew C; Gaj, Thomas; Fuller, Roberta P; Barbas, Carlos F
2012-11-01
Site-specific recombinases are powerful tools for genome engineering. Hyperactivated variants of the resolvase/invertase family of serine recombinases function without accessory factors, and thus can be re-targeted to sequences of interest by replacing native DNA-binding domains (DBDs) with engineered zinc-finger proteins (ZFPs). However, imperfect modularity with particular domains, lack of high-affinity binding to all DNA triplets, and difficulty in construction has hindered the widespread adoption of ZFPs in unspecialized laboratories. The discovery of a novel type of DBD in transcription activator-like effector (TALE) proteins from Xanthomonas provides an alternative to ZFPs. Here we describe chimeric TALE recombinases (TALERs): engineered fusions between a hyperactivated catalytic domain from the DNA invertase Gin and an optimized TALE architecture. We use a library of incrementally truncated TALE variants to identify TALER fusions that modify DNA with efficiency and specificity comparable to zinc-finger recombinases in bacterial cells. We also show that TALERs recombine DNA in mammalian cells. The TALER architecture described herein provides a platform for insertion of customized TALE domains, thus significantly expanding the targeting capacity of engineered recombinases and their potential applications in biotechnology and medicine.
Novel transcripts of the estrogen receptor α gene in channel catfish
Patino, Reynaldo; Xia, Zhenfang; Gale, William L.; Wu, Chunfa; Maule, Alec G.; Chang, Xiaotian
2000-01-01
Complementary DNA libraries from liver and ovary of an immature female channel catfish were screened with a homologous ERα cDNA probe. The hepatic library yielded two new channel catfish ER cDNAs that encode N-terminal ERα variants of different sizes. Relative to the catfish ERα (medium size; 581 residues) previously reported, these new cDNAs encode Long-ERα (36 residues longer) and Short-ERα (389 residues shorter). The 5′-end of Long-ERα cDNA is identical to that of Medium-ERα but has an additional 503-bp segment with an upstream, in-frame translation-start codon. Recombinant Long-ERα binds estrogen with high affinity (Kd = 3.4 nM), similar to that previously reported for Medium-ERα but lower than reported for catfish ERβ. Short-ERα cDNA encodes a protein that lacks most of the receptor protein and does not bind estrogen. Northern hybridization confirmed the existence of multiple hepatic ERα RNAs that include the size range of the ERα cDNAs obtained from the libraries as well as additional sizes. Using primers for RT-PCR that target locations internal to the protein-coding sequence, we also established the presence of several ERα cDNA variants with in-frame insertions in the ligand-binding and DNA-binding domains and in-frame or out-of-frame deletions in the ligand-binding domain. These internal variants showed patterns of expression that differed between the ovary and liver. Further, the ovarian library yielded a full-length, ERα antisense cDNA containing a poly(A) signal and tail. A limited survey of histological preparations from juvenile catfish by in situ hybridization using directionally synthesized cRNA probes also suggested the expression of ERα antisense RNA in a tissue-specific manner. In conclusion, channel catfish seemingly have three broad classes of ERα mRNA variants: those encoding N-terminal truncated variants, those encoding internal variants (including C-terminal truncated variants), and antisense mRNA. The sense variants may encode functional ERα or related proteins that modulate ERα or ERβ activity. The existence of ER antisense mRNA is reported in this study for the first time. Its role may be to participate in the regulation of ER gene expression.
Budiman, Muhammad A.; Mao, Long; Wood, Todd C.; Wing, Rod A.
2000-01-01
Recently a new strategy using BAC end sequences as sequence-tagged connectors (STCs) was proposed for whole-genome sequencing projects. In this study, we present the construction and detailed characterization of a 15.0 haploid genome equivalent BAC library for the cultivated tomato, Lycopersicon esculentum cv. Heinz 1706. The library contains 129,024 clones with an average insert size of 117.5 kb and a chloroplast content of 1.11%. BAC end sequences from 1490 ends were generated and analyzed as a preliminary evaluation for using this library to develop an STC framework to sequence the tomato genome. A total of 1205 BAC end sequences (80.9%) were obtained, with an average length of 360 high-quality bases, and were searched against the GenBank database. Using a cutoff expectation value of <10−6, and combining the results from BLASTN, BLASTX, and TBLASTX searches, 24.3% of the BAC end sequences were similar to known sequences, of which almost half (48.7%) share sequence similarities to retrotransposons and 7% to known genes. Some of the transposable element sequences were the first reported in tomato, such as sequences similar to maize transposon Activator (Ac) ORF and tobacco pararetrovirus-like sequences. Interestingly, there were no BAC end sequences similar to the highly repeated TGRI and TGRII elements. However, the majority (70.3%) of STCs did not share significant sequence similarities to any sequences in GenBank at either the DNA or predicted protein levels, indicating that a large portion of the tomato genome is still unknown. Our data demonstrate that this BAC library is suitable for developing an STC database to sequence the tomato genome. The advantages of developing an STC framework for whole-genome sequencing of tomato are discussed. [The BAC end sequences described in this paper have been deposited in the GenBank data library under accession nos. AQ367111–AQ368361.] PMID:10645957
Brewer, Megan H.; Chaudhry, Rabia; Qi, Jessica; Kidambi, Aditi; Drew, Alexander P.; Ryan, Monique M.; Subramanian, Gopinath M.; Young, Helen K.; Zuchner, Stephan; Reddel, Stephen W.; Nicholson, Garth A.; Kennerson, Marina L.
2016-01-01
With the advent of whole exome sequencing, cases where no pathogenic coding mutations can be found are increasingly being observed in many diseases. In two large, distantly-related families that mapped to the Charcot-Marie-Tooth neuropathy CMTX3 locus at chromosome Xq26.3-q27.3, all coding mutations were excluded. Using whole genome sequencing we found a large DNA interchromosomal insertion within the CMTX3 locus. The 78 kb insertion originates from chromosome 8q24.3, segregates fully with the disease in the two families, and is absent from the general population as well as 627 neurologically normal chromosomes from in-house controls. Large insertions into chromosome Xq27.1 are known to cause a range of diseases and this is the first neuropathy phenotype caused by an interchromosomal insertion at this locus. The CMTX3 insertion represents an understudied pathogenic structural variation mechanism for inherited peripheral neuropathies. Our finding highlights the importance of considering all structural variation types when studying unsolved inherited peripheral neuropathy cases with no pathogenic coding mutations. PMID:27438001
Cao, Shuanghe; Siriwardana, Chamindika L; Kumimoto, Roderick W; Holt, Ben F
2011-05-19
Monocots, especially the temperate grasses, represent some of the most agriculturally important crops for both current food needs and future biofuel development. Because most of the agriculturally important grass species are difficult to study (e.g., they often have large, repetitive genomes and can be difficult to grow in laboratory settings), developing genetically tractable model systems is essential. Brachypodium distachyon (hereafter Brachypodium) is an emerging model system for the temperate grasses. To fully realize the potential of this model system, publicly accessible discovery tools are essential. High quality cDNA libraries that can be readily adapted for multiple downstream purposes are a needed resource. Additionally, yeast two-hybrid (Y2H) libraries are an important discovery tool for protein-protein interactions and are not currently available for Brachypodium. We describe the creation of two high quality, publicly available Gateway™ cDNA entry libraries and their derived Y2H libraries for Brachypodium. The first entry library represents cloned cDNA populations from both short day (SD, 8/16-h light/dark) and long day (LD, 20/4-h light/dark) grown plants, while the second library was generated from hormone treated tissues. Both libraries have extensive genome coverage (~5 × 107 primary clones each) and average clone lengths of ~1.5 Kb. These entry libraries were then used to create two recombination-derived Y2H libraries. Initial proof-of-concept screens demonstrated that a protein with known interaction partners could readily re-isolate those partners, as well as novel interactors. Accessible community resources are a hallmark of successful biological model systems. Brachypodium has the potential to be a broadly useful model system for the grasses, but still requires many of these resources. The Gateway™ compatible entry libraries created here will facilitate studies for multiple user-defined purposes and the derived Y2H libraries can be immediately applied to large scale screening and discovery of novel protein-protein interactions. All libraries are freely available for distribution to the research community.
Zhang, Wei Yun; Zhang, Wenhua; Liu, Zhiyuan; Li, Cong; Zhu, Zhi; Yang, Chaoyong James
2012-01-03
We have developed a novel method for efficiently screening affinity ligands (aptamers) from a complex single-stranded DNA (ssDNA) library by employing single-molecule emulsion polymerase chain reaction (PCR) based on the agarose droplet microfluidic technology. In a typical systematic evolution of ligands by exponential enrichment (SELEX) process, the enriched library is sequenced first, and tens to hundreds of aptamer candidates are analyzed via a bioinformatic approach. Possible candidates are then chemically synthesized, and their binding affinities are measured individually. Such a process is time-consuming, labor-intensive, inefficient, and expensive. To address these problems, we have developed a highly efficient single-molecule approach for aptamer screening using our agarose droplet microfluidic technology. Statistically diluted ssDNA of the pre-enriched library evolved through conventional SELEX against cancer biomarker Shp2 protein was encapsulated into individual uniform agarose droplets for droplet PCR to generate clonal agarose beads. The binding capacity of amplified ssDNA from each clonal bead was then screened via high-throughput fluorescence cytometry. DNA clones with high binding capacity and low K(d) were chosen as the aptamer and can be directly used for downstream biomedical applications. We have identified an ssDNA aptamer that selectively recognizes Shp2 with a K(d) of 24.9 nM. Compared to a conventional sequencing-chemical synthesis-screening work flow, our approach avoids large-scale DNA sequencing and expensive, time-consuming DNA synthesis of large populations of DNA candidates. The agarose droplet microfluidic approach is thus highly efficient and cost-effective for molecular evolution approaches and will find wide application in molecular evolution technologies, including mRNA display, phage display, and so on. © 2011 American Chemical Society
Schlötelburg, C; von Wintzingerode, F; Hauck, R; Hegemann, W; Göbel, U B
2000-07-01
A 16S-rDNA-based molecular study was performed to determine the bacterial diversity of an anaerobic, 1,2-dichloropropane-dechlorinating bioreactor consortium derived from sediment of the River Saale, Germany. Total community DNA was extracted and bacterial 16S rRNA genes were subsequently amplified using conserved primers. A clone library was constructed and analysed by sequencing the 16S rDNA inserts of randomly chosen clones followed by dot blot hybridization with labelled polynucleotide probes. The phylogenetic analysis revealed significant sequence similarities of several as yet uncultured bacterial species in the bioreactor to those found in other reductively dechlorinating freshwater consortia. In contrast, no close relationship was obtained with as yet uncultured bacteria found in reductively dechlorinating consortia derived from marine habitats. One rDNA clone showed >97% sequence similarity to Dehalobacter species, known for reductive dechlorination of tri- and tetrachloroethene. These results suggest that reductive dechlorination in microbial freshwater habitats depends upon a specific bacterial community structure.
Rondon, Michelle R.; Raffel, Sandra J.; Goodman, Robert M.; Handelsman, Jo
1999-01-01
As the study of microbes moves into the era of functional genomics, there is an increasing need for molecular tools for analysis of a wide diversity of microorganisms. Currently, biological study of many prokaryotes of agricultural, medical, and fundamental scientific interest is limited by the lack of adequate genetic tools. We report the application of the bacterial artificial chromosome (BAC) vector to prokaryotic biology as a powerful approach to address this need. We constructed a BAC library in Escherichia coli from genomic DNA of the Gram-positive bacterium Bacillus cereus. This library provides 5.75-fold coverage of the B. cereus genome, with an average insert size of 98 kb. To determine the extent of heterologous expression of B. cereus genes in the library, we screened it for expression of several B. cereus activities in the E. coli host. Clones expressing 6 of 10 activities tested were identified in the library, namely, ampicillin resistance, zwittermicin A resistance, esculin hydrolysis, hemolysis, orange pigment production, and lecithinase activity. We analyzed selected BAC clones genetically to identify rapidly specific B. cereus loci. These results suggest that BAC libraries will provide a powerful approach for studying gene expression from diverse prokaryotes. PMID:10339608
Si, Zengzhi; Du, Bing; Huo, Jinxi; He, Shaozhen; Liu, Qingchang; Zhai, Hong
2016-11-21
Sweetpotato, Ipomoea batatas (L.) Lam., is an important food crop widely grown in the world. However, little is known about the genome of this species because it is a highly heterozygous hexaploid. Gaining a more in-depth knowledge of sweetpotato genome is therefore necessary and imperative. In this study, the first bacterial artificial chromosome (BAC) library of sweetpotato was constructed. Clones from the BAC library were end-sequenced and analyzed to provide genome-wide information about this species. The BAC library contained 240,384 clones with an average insert size of 101 kb and had a 7.93-10.82 × coverage of the genome, and the probability of isolating any single-copy DNA sequence from the library was more than 99%. Both ends of 8310 BAC clones randomly selected from the library were sequenced to generate 11,542 high-quality BAC-end sequences (BESs), with an accumulative length of 7,595,261 bp and an average length of 658 bp. Analysis of the BESs revealed that 12.17% of the sweetpotato genome were known repetitive DNA, including 7.37% long terminal repeat (LTR) retrotransposons, 1.15% Non-LTR retrotransposons and 1.42% Class II DNA transposons etc., 18.31% of the genome were identified as sweetpotato-unique repetitive DNA and 10.00% of the genome were predicted to be coding regions. In total, 3,846 simple sequences repeats (SSRs) were identified, with a density of one SSR per 1.93 kb, from which 288 SSRs primers were designed and tested for length polymorphism using 20 sweetpotato accessions, 173 (60.07%) of them produced polymorphic bands. Sweetpotato BESs had significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum than those of Vitis vinifera, Theobroma cacao and Arabidopsis thaliana. The first BAC library for sweetpotato has been successfully constructed. The high quality BESs provide first insights into sweetpotato genome composition, and have significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum. These resources as a robust platform will be used in high-resolution mapping, gene cloning, assembly of genome sequences, comparative genomics and evolution for sweetpotato.
Enzymatically Generated CRISPR Libraries for Genome Labeling and Screening
Lane, Andrew B.; Strzelecka, Magdalena; Ettinger, Andreas; Grenfell, Andrew W.; Wittmann, Torsten; Heald, Rebecca
2015-01-01
Summary CRISPR-based technologies have emerged as powerful tools to alter genomes and mark chromosomal loci, but an inexpensive method for generating large numbers of RNA guides for whole genome screening and labeling is lacking. Using a method that permits library construction from any source of DNA, we generated guide libraries that label repetitive loci or a single chromosomal locus in Xenopus egg extracts and show that a complex library can target the E. coli genome at high frequency. PMID:26212133
Library Construction from Subnanogram DNA for Pelagic Sea Water and Deep-Sea Sediments
Hirai, Miho; Nishi, Shinro; Tsuda, Miwako; Sunamura, Michinari; Takaki, Yoshihiro; Nunoura, Takuro
2017-01-01
Shotgun metagenomics is a low biased technology for assessing environmental microbial diversity and function. However, the requirement for a sufficient amount of DNA and the contamination of inhibitors in environmental DNA leads to difficulties in constructing a shotgun metagenomic library. We herein examined metagenomic library construction from subnanogram amounts of input environmental DNA from subarctic surface water and deep-sea sediments using two library construction kits: the KAPA Hyper Prep Kit and Nextera XT DNA Library Preparation Kit, with several modifications. The influence of chemical contaminants associated with these environmental DNA samples on library construction was also investigated. Overall, shotgun metagenomic libraries were constructed from 1 pg to 1 ng of input DNA using both kits without harsh library microbial contamination. However, the libraries constructed from 1 pg of input DNA exhibited larger biases in GC contents, k-mers, or small subunit (SSU) rRNA gene compositions than those constructed from 10 pg to 1 ng DNA. The lower limit of input DNA for low biased library construction in this study was 10 pg. Moreover, we revealed that technology-dependent biases (physical fragmentation and linker ligation vs. tagmentation) were larger than those due to the amount of input DNA. PMID:29187708
Construction of a general human chromosome jumping library, with application to cystic fibrosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Collins, F.S.; Drumm, M.L.; Cole, J.L.
1987-02-27
In many genetic disorders, the responsible gene and its protein product are unknown. The technique known as reverse genetics, in which chromosomal map positions and genetically linked DNA markers are used to identify and clone such genes, is complicated by the fact that the molecular distances from the closest DNA markers to the gene itself are often too large to traverse by standard cloning techniques. To address this situation, a general human chromosome jumping library was constructed that allows the cloning of DNA sequences approximately 100 kilobases away from any starting point in genomic DNA. As an illustration of itsmore » usefulness, this library was searched for a jumping clone, starting at the met oncogene, which is a marker tightly linked to the cystic fibrosis gene that is located on human chromosome 7. Mapping of the new genomic fragment by pulsed field gel electrophoresis confirmed that it resides on chromosome 7 within 240 kilobases downstream of the met gene. The use of chromosome jumping should be applicable to any genetic locus for which a closely linked DNA marker is available.« less
Shin, Sung Jae; Wu, Chia-wei; Steinberg, Howard; Talaat, Adel M.
2006-01-01
Johne's disease, caused by Mycobacterium paratuberculosis infection, is a worldwide problem for the dairy industry and has a possible involvement in Crohn's disease in humans. To identify virulence determinants of this economically important pathogen, a library of 5,060 transposon mutants was constructed using Tn5367 insertion mutagenesis, followed by large-scale sequencing to identify disrupted genes. In this report, 1,150 mutants were analyzed and 970 unique insertion sites were identified. Sequence analysis of the disrupted genes indicated that the insertion of Tn5367 was more prevalent in genomic regions with G+C content (50.5 to 60.5%) lower than the average G+C content (69.3%) of the rest of the genome. Phenotypic screening of the library identified disruptions of genes involved in iron, tryptophan, or mycolic acid metabolic pathways that displayed unique growth characteristics. Bioinformatic analysis of disrupted genes identified a list of potential virulence determinants for further testing with animals. Mouse infection studies showed a significant decrease in tissue colonization by mutants with a disruption in the gcpE, pstA, kdpC, papA2, impA, umaA1, or fabG2_2 gene. Attenuation phenotypes were tissue specific (e.g., for the umaA1 mutant) as well as time specific (e.g., for the impA mutant), suggesting that those genes may be involved in different virulence mechanisms. The identified potential virulence determinants represent novel functional classes that could be necessary for mycobacterial survival during infection and could provide suitable targets for vaccine and drug development against Johne's and Crohn's diseases. PMID:16790754
Virgilio, Massimiliano; Jordaens, Kurt; Breman, Floris C; Backeljau, Thierry; De Meyer, Marc
2012-01-01
We propose a general working strategy to deal with incomplete reference libraries in the DNA barcoding identification of species. Considering that (1) queries with a large genetic distance with their best DNA barcode match are more likely to be misidentified and (2) imposing a distance threshold profitably reduces identification errors, we modelled relationships between identification performances and distance thresholds in four DNA barcode libraries of Diptera (n = 4270), Lepidoptera (n = 7577), Hymenoptera (n = 2067) and Tephritidae (n = 602 DNA barcodes). In all cases, more restrictive distance thresholds produced a gradual increase in the proportion of true negatives, a gradual decrease of false positives and more abrupt variations in the proportions of true positives and false negatives. More restrictive distance thresholds improved precision, yet negatively affected accuracy due to the higher proportions of queries discarded (viz. having a distance query-best match above the threshold). Using a simple linear regression we calculated an ad hoc distance threshold for the tephritid library producing an estimated relative identification error <0.05. According to the expectations, when we used this threshold for the identification of 188 independently collected tephritids, less than 5% of queries with a distance query-best match below the threshold were misidentified. Ad hoc thresholds can be calculated for each particular reference library of DNA barcodes and should be used as cut-off mark defining whether we can proceed identifying the query with a known estimated error probability (e.g. 5%) or whether we should discard the query and consider alternative/complementary identification methods.
Virgilio, Massimiliano; Jordaens, Kurt; Breman, Floris C.; Backeljau, Thierry; De Meyer, Marc
2012-01-01
We propose a general working strategy to deal with incomplete reference libraries in the DNA barcoding identification of species. Considering that (1) queries with a large genetic distance with their best DNA barcode match are more likely to be misidentified and (2) imposing a distance threshold profitably reduces identification errors, we modelled relationships between identification performances and distance thresholds in four DNA barcode libraries of Diptera (n = 4270), Lepidoptera (n = 7577), Hymenoptera (n = 2067) and Tephritidae (n = 602 DNA barcodes). In all cases, more restrictive distance thresholds produced a gradual increase in the proportion of true negatives, a gradual decrease of false positives and more abrupt variations in the proportions of true positives and false negatives. More restrictive distance thresholds improved precision, yet negatively affected accuracy due to the higher proportions of queries discarded (viz. having a distance query-best match above the threshold). Using a simple linear regression we calculated an ad hoc distance threshold for the tephritid library producing an estimated relative identification error <0.05. According to the expectations, when we used this threshold for the identification of 188 independently collected tephritids, less than 5% of queries with a distance query-best match below the threshold were misidentified. Ad hoc thresholds can be calculated for each particular reference library of DNA barcodes and should be used as cut-off mark defining whether we can proceed identifying the query with a known estimated error probability (e.g. 5%) or whether we should discard the query and consider alternative/complementary identification methods. PMID:22359600
2011-01-01
Background One of the key goals of oak genomics research is to identify genes of adaptive significance. This information may help to improve the conservation of adaptive genetic variation and the management of forests to increase their health and productivity. Deep-coverage large-insert genomic libraries are a crucial tool for attaining this objective. We report herein the construction of a BAC library for Quercus robur, its characterization and an analysis of BAC end sequences. Results The EcoRI library generated consisted of 92,160 clones, 7% of which had no insert. Levels of chloroplast and mitochondrial contamination were below 3% and 1%, respectively. Mean clone insert size was estimated at 135 kb. The library represents 12 haploid genome equivalents and, the likelihood of finding a particular oak sequence of interest is greater than 99%. Genome coverage was confirmed by PCR screening of the library with 60 unique genetic loci sampled from the genetic linkage map. In total, about 20,000 high-quality BAC end sequences (BESs) were generated by sequencing 15,000 clones. Roughly 5.88% of the combined BAC end sequence length corresponded to known retroelements while ab initio repeat detection methods identified 41 additional repeats. Collectively, characterized and novel repeats account for roughly 8.94% of the genome. Further analysis of the BESs revealed 1,823 putative genes suggesting at least 29,340 genes in the oak genome. BESs were aligned with the genome sequences of Arabidopsis thaliana, Vitis vinifera and Populus trichocarpa. One putative collinear microsyntenic region encoding an alcohol acyl transferase protein was observed between oak and chromosome 2 of V. vinifera. Conclusions This BAC library provides a new resource for genomic studies, including SSR marker development, physical mapping, comparative genomics and genome sequencing. BES analysis provided insight into the structure of the oak genome. These sequences will be used in the assembly of a future genome sequence for oak. PMID:21645357
Algorithms for optimizing cross-overs in DNA shuffling.
He, Lu; Friedman, Alan M; Bailey-Kellogg, Chris
2012-03-21
DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
Wolbachia and DNA barcoding insects: patterns, potential, and problems.
Smith, M Alex; Bertrand, Claudia; Crosby, Kate; Eveleigh, Eldon S; Fernandez-Triana, Jose; Fisher, Brian L; Gibbs, Jason; Hajibabaei, Mehrdad; Hallwachs, Winnie; Hind, Katharine; Hrcek, Jan; Huang, Da-Wei; Janda, Milan; Janzen, Daniel H; Li, Yanwei; Miller, Scott E; Packer, Laurence; Quicke, Donald; Ratnasingham, Sujeevan; Rodriguez, Josephine; Rougerie, Rodolphe; Shaw, Mark R; Sheffield, Cory; Stahlhut, Julie K; Steinke, Dirk; Whitfield, James; Wood, Monty; Zhou, Xin
2012-01-01
Wolbachia is a genus of bacterial endosymbionts that impacts the breeding systems of their hosts. Wolbachia can confuse the patterns of mitochondrial variation, including DNA barcodes, because it influences the pathways through which mitochondria are inherited. We examined the extent to which these endosymbionts are detected in routine DNA barcoding, assessed their impact upon the insect sequence divergence and identification accuracy, and considered the variation present in Wolbachia COI. Using both standard PCR assays (Wolbachia surface coding protein--wsp), and bacterial COI fragments we found evidence of Wolbachia in insect total genomic extracts created for DNA barcoding library construction. When >2 million insect COI trace files were examined on the Barcode of Life Datasystem (BOLD) Wolbachia COI was present in 0.16% of the cases. It is possible to generate Wolbachia COI using standard insect primers; however, that amplicon was never confused with the COI of the host. Wolbachia alleles recovered were predominantly Supergroup A and were broadly distributed geographically and phylogenetically. We conclude that the presence of the Wolbachia DNA in total genomic extracts made from insects is unlikely to compromise the accuracy of the DNA barcode library; in fact, the ability to query this DNA library (the database and the extracts) for endosymbionts is one of the ancillary benefits of such a large scale endeavor--which we provide several examples. It is our conclusion that regular assays for Wolbachia presence and type can, and should, be adopted by large scale insect barcoding initiatives. While COI is one of the five multi-locus sequence typing (MLST) genes used for categorizing Wolbachia, there is limited overlap with the eukaryotic DNA barcode region.
Contamination of sequence databases with adaptor sequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yoshikawa, Takeo; Sanders, A.R.; Detera-Wadleigh, S.D.
Because of the exponential increase in the amount of DNA sequences being added to the public databases on a daily basis, it has become imperative to identify sources of contamination rapidly. Previously, contaminations of sequence databases have been reported to alert the scientific community to the problem. These contaminations can be divided into two categories. The first category comprises host sequences that have been difficult for submitters to manage or control. Examples include anomalous sequences derived from Escherichia coli, which are inserted into the chromosomes (and plasmids) of the bacterial hosts. Insertion sequences are highly mobile and are capable ofmore » transposing themselves into plasmids during cloning manipulation. Another example of the first category is the infection with yeast genomic DNA or with bacterial DNA of some commercially available cDNA libraries from Clontech. The second category of database contamination is due to the inadvertent inclusion of nonhost sequences. This category includes incorporation of cloning-vector sequences and multicloning sites in the database submission. M13-derived artifacts have been common, since M13-based vectors have been widely used for subcloning DNA fragments. Recognizing this problem, the National Center for Biotechnology Information (NCBI) started to screen, in April 1994, all sequences directly submitted to GenBank, against a set of vector data retrieved from GenBank by use of key-word searches, such as {open_quotes}vector.{close_quotes} In this report, we present evidence for another sequence artifact that is widespread but that, to our knowledge, has not yet been reported. 11 refs., 1 tab.« less
Matthews, R J; Cahir, E D; Thomas, M L
1990-01-01
Protein-tyrosine-phosphatases (protein-tyrosine-phosphate phosphohydrolase, EC 3.13.48) have been implicated in the regulation of cell growth; however, to date few tyrosine phosphatases have been characterized. To identify additional family members, the cDNA for the human tyrosine phosphatase leukocyte common antigen (LCA; CD45) was used to screen, under low stringency, a mouse pre-B-cell cDNA library. Two cDNA clones were isolated and sequence analysis predicts a protein sequence of 793 amino acids. We have named the molecule LRP (LCA-related phosphatase). RNA transfer analysis indicates that the cDNAs were derived from a 3.2-kilobase mRNA. The LRP mRNA is transcribed in a wide variety of tissues. The predicted protein structure can be divided into the following structural features: a short 19-amino acid leader sequence, an exterior domain of 123 amino acids that is predicted to be highly glycosylated, a 24-amino acid membrane-spanning region, and a 627-amino acid cytoplasmic region. The cytoplasmic region contains two approximately 260-amino acid domains, each with homology to the tyrosine phosphatase family. One of the cDNA clones differed in that it had a 108-base-pair insertion that, while preserving the reading frame, would disrupt the first protein-tyrosine-phosphatase domain. Analysis of genomic DNA indicates that the insertion is due to an alternatively spliced exon. LRP appears to be evolutionarily conserved as a putative homologue has been identified in the invertebrate Styela plicata. Images PMID:2162042
Taxonomic and functional assignment of cloned sequences from high Andean forest soil metagenome.
Montaña, José Salvador; Jiménez, Diego Javier; Hernández, Mónica; Angel, Tatiana; Baena, Sandra
2012-02-01
Total metagenomic DNA was isolated from high Andean forest soil and subjected to taxonomical and functional composition analyses by means of clone library generation and sequencing. The obtained yield of 1.7 μg of DNA/g of soil was used to construct a metagenomic library of approximately 20,000 clones (in the plasmid p-Bluescript II SK+) with an average insert size of 4 Kb, covering 80 Mb of the total metagenomic DNA. Metagenomic sequences near the plasmid cloning site were sequenced and them trimmed and assembled, obtaining 299 reads and 31 contigs (0.3 Mb). Taxonomic assignment of total sequences was performed by BLASTX, resulting in 68.8, 44.8 and 24.5% classification into taxonomic groups using the metagenomic RAST server v2.0, WebCARMA v1.0 online system and MetaGenome Analyzer v3.8 software, respectively. Most clone sequences were classified as Bacteria belonging to phlya Actinobacteria, Proteobacteria and Acidobacteria. Among the most represented orders were Actinomycetales (34% average), Rhizobiales, Burkholderiales and Myxococcales and with a greater number of sequences in the genus Mycobacterium (7% average), Frankia, Streptomyces and Bradyrhizobium. The vast majority of sequences were associated with the metabolism of carbohydrates, proteins, lipids and catalytic functions, such as phosphatases, glycosyltransferases, dehydrogenases, methyltransferases, dehydratases and epoxide hydrolases. In this study we compared different methods of taxonomic and functional assignment of metagenomic clone sequences to evaluate microbial diversity in an unexplored soil ecosystem, searching for putative enzymes of biotechnological interest and generating important information for further functional screening of clone libraries.
High-Throughput Analysis of T-DNA Location and Structure Using Sequence Capture.
Inagaki, Soichi; Henry, Isabelle M; Lieberman, Meric C; Comai, Luca
2015-01-01
Agrobacterium-mediated transformation of plants with T-DNA is used both to introduce transgenes and for mutagenesis. Conventional approaches used to identify the genomic location and the structure of the inserted T-DNA are laborious and high-throughput methods using next-generation sequencing are being developed to address these problems. Here, we present a cost-effective approach that uses sequence capture targeted to the T-DNA borders to select genomic DNA fragments containing T-DNA-genome junctions, followed by Illumina sequencing to determine the location and junction structure of T-DNA insertions. Multiple probes can be mixed so that transgenic lines transformed with different T-DNA types can be processed simultaneously, using a simple, index-based pooling approach. We also developed a simple bioinformatic tool to find sequence read pairs that span the junction between the genome and T-DNA or any foreign DNA. We analyzed 29 transgenic lines of Arabidopsis thaliana, each containing inserts from 4 different T-DNA vectors. We determined the location of T-DNA insertions in 22 lines, 4 of which carried multiple insertion sites. Additionally, our analysis uncovered a high frequency of unconventional and complex T-DNA insertions, highlighting the needs for high-throughput methods for T-DNA localization and structural characterization. Transgene insertion events have to be fully characterized prior to use as commercial products. Our method greatly facilitates the first step of this characterization of transgenic plants by providing an efficient screen for the selection of promising lines.
The Hemiptera (Insecta) of Canada: Constructing a Reference Library of DNA Barcodes
Gwiazdowski, Rodger A.; Foottit, Robert G.; Maw, H. Eric L.; Hebert, Paul D. N.
2015-01-01
DNA barcode reference libraries linked to voucher specimens create new opportunities for high-throughput identification and taxonomic re-evaluations. This study provides a DNA barcode library for about 45% of the recognized species of Canadian Hemiptera, and the publically available R workflow used for its generation. The current library is based on the analysis of 20,851 specimens including 1849 species belonging to 628 genera and 64 families. These individuals were assigned to 1867 Barcode Index Numbers (BINs), sequence clusters that often coincide with species recognized through prior taxonomy. Museum collections were a key source for identified specimens, but we also employed high-throughput collection methods that generated large numbers of unidentified specimens. Many of these specimens represented novel BINs that were subsequently identified by taxonomists, adding barcode coverage for additional species. Our analyses based on both approaches includes 94 species not listed in the most recent Canadian checklist, representing a potential 3% increase in the fauna. We discuss the development of our workflow in the context of prior DNA barcode library construction projects, emphasizing the importance of delineating a set of reference specimens to aid investigations in cases of nomenclatural and DNA barcode discordance. The identification for each specimen in the reference set can be annotated on the Barcode of Life Data System (BOLD), allowing experts to highlight questionable identifications; annotations can be added by any registered user of BOLD, and instructions for this are provided. PMID:25923328
Ramos, Enrique; Levinson, Benjamin T; Chasnoff, Sara; Hughes, Andrew; Young, Andrew L; Thornton, Katherine; Li, Allie; Vallania, Francesco L M; Province, Michael; Druley, Todd E
2012-12-06
Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regions and comparing against unaffected cohorts. However, despite persistent declines in sequencing costs, population-based rare variant detection across large genomic target regions remains cost prohibitive for most investigators. In addition, DNA samples are often precious and hybridization methods typically require large amounts of input DNA. Pooled sample DNA sequencing is a cost and time-efficient strategy for surveying populations of individuals for rare variants. We set out to 1) create a scalable, multiplexing method for custom capture with or without individual DNA indexing that was amenable to low amounts of input DNA and 2) expand the functionality of the SPLINTER algorithm for calling substitutions, insertions and deletions across either candidate genes or the entire exome by integrating the variant calling algorithm with the dynamic programming aligner, Novoalign. We report methodology for pooled hybridization capture with pre-enrichment, indexed multiplexing of up to 48 individuals or non-indexed pooled sequencing of up to 92 individuals with as little as 70 ng of DNA per person. Modified solid phase reversible immobilization bead purification strategies enable no sample transfers from sonication in 96-well plates through adapter ligation, resulting in 50% less library preparation reagent consumption. Custom Y-shaped adapters containing novel 7 base pair index sequences with a Hamming distance of ≥2 were directly ligated onto fragmented source DNA eliminating the need for PCR to incorporate indexes, and was followed by a custom blocking strategy using a single oligonucleotide regardless of index sequence. These results were obtained aligning raw reads against the entire genome using Novoalign followed by variant calling of non-indexed pools using SPLINTER or SAMtools for indexed samples. With these pipelines, we find sensitivity and specificity of 99.4% and 99.7% for pooled exome sequencing. Sensitivity, and to a lesser degree specificity, proved to be a function of coverage. For rare variants (≤2% minor allele frequency), we achieved sensitivity and specificity of ≥94.9% and ≥99.99% for custom capture of 2.5 Mb in multiplexed libraries of 22-48 individuals with only ≥5-fold coverage/chromosome, but these parameters improved to ≥98.7 and 100% with 20-fold coverage/chromosome. This highly scalable methodology enables accurate rare variant detection, with or without individual DNA sample indexing, while reducing the amount of required source DNA and total costs through less hybridization reagent consumption, multi-sample sonication in a standard PCR plate, multiplexed pre-enrichment pooling with a single hybridization and lesser sequencing coverage required to obtain high sensitivity.
Han, Guomin; Shao, Qian; Li, Cuiping; Zhao, Kai; Jiang, Li; Fan, Jun; Jiang, Haiyang; Tao, Fang
2018-05-01
Aspergillus flavus often invade many important corps and produce harmful aflatoxins both in preharvest and during storage stages. The regulation mechanism of aflatoxin biosynthesis in this fungus has not been well explored mainly due to the lack of an efficient transformation method for constructing a genome-wide gene mutant library. This challenge was resolved in this study, where a reliable and efficient Agrobacterium tumefaciens-mediated transformation (ATMT) protocol for A. flavus NRRL 3357 was established. The results showed that removal of multinucleate conidia, to collect a homogenous sample of uninucleate conidia for use as the transformation material, is the key step in this procedure. A. tumefaciens strain AGL-1 harboring the ble gene for zeocin resistance under the control of the gpdA promoter from A. nidulans is suitable for genetic transformation of this fungus. We successfully generated A. flavus transformants with an efficiency of ∼ 60 positive transformants per 10 6 conidia using our protocol. A small-scale insertional mutant library (∼ 1,000 mutants) was constructed using this method and the resulting several mutants lacked both production of conidia and aflatoxin biosynthesis capacity. Southern blotting analysis demonstrated that the majority of the transformants contained a single T-DNA insert on the genome. To the best of our knowledge, this is the first report of genetic transformation of A. flavus via ATMT and our protocol provides an effective tool for construction of genome-wide gene mutant libraries for functional analysis of important genes in A. flavus.
Molecular transport through large-diameter DNA nanopores
NASA Astrophysics Data System (ADS)
Krishnan, Swati; Ziegler, Daniela; Arnaut, Vera; Martin, Thomas G.; Kapsner, Korbinian; Henneberg, Katharina; Bausch, Andreas R.; Dietz, Hendrik; Simmel, Friedrich C.
2016-09-01
DNA-based nanopores are synthetic biomolecular membrane pores, whose geometry and chemical functionality can be tuned using the tools of DNA nanotechnology, making them promising molecular devices for applications in single-molecule biosensing and synthetic biology. Here we introduce a large DNA membrane channel with an ~4 nm diameter pore, which has stable electrical properties and spontaneously inserts into flat lipid bilayer membranes. Membrane incorporation is facilitated by a large number of hydrophobic functionalizations or, alternatively, streptavidin linkages between biotinylated channels and lipids. The channel displays an Ohmic conductance of ~3 nS, consistent with its size, and allows electrically driven translocation of single-stranded and double-stranded DNA analytes. Using confocal microscopy and a dye influx assay, we demonstrate the spontaneous formation of membrane pores in giant unilamellar vesicles. Pores can be created both in an outside-in and an inside-out configuration.
(New hosts and vectors for genome cloning)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
The main goal of our project remains the development of new bacterial hosts and vectors for the stable propagation of human DNA clones in E. coli. During the past six months of our current budget period, we have (1) continued to develop new hosts that permit the stable maintenance of unstable features of human DNA, and (2) developed a series of vectors for (a) cloning large DNA inserts, (b) assessing the frequency of human sequences that are lethal to the growth of E. coli, and (c) assessing the stability of human sequences cloned in M13 for large-scale sequencing projects.
[New hosts and vectors for genome cloning]. Progress report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
The main goal of our project remains the development of new bacterial hosts and vectors for the stable propagation of human DNA clones in E. coli. During the past six months of our current budget period, we have (1) continued to develop new hosts that permit the stable maintenance of unstable features of human DNA, and (2) developed a series of vectors for (a) cloning large DNA inserts, (b) assessing the frequency of human sequences that are lethal to the growth of E. coli, and (c) assessing the stability of human sequences cloned in M13 for large-scale sequencing projects.
Random access in large-scale DNA data storage.
Organick, Lee; Ang, Siena Dumas; Chen, Yuan-Jyue; Lopez, Randolph; Yekhanin, Sergey; Makarychev, Konstantin; Racz, Miklos Z; Kamath, Govinda; Gopalan, Parikshit; Nguyen, Bichlien; Takahashi, Christopher N; Newman, Sharon; Parker, Hsing-Yeh; Rashtchian, Cyrus; Stewart, Kendall; Gupta, Gagan; Carlson, Robert; Mulligan, John; Carmean, Douglas; Seelig, Georg; Ceze, Luis; Strauss, Karin
2018-03-01
Synthetic DNA is durable and can encode digital data with high density, making it an attractive medium for data storage. However, recovering stored data on a large-scale currently requires all the DNA in a pool to be sequenced, even if only a subset of the information needs to be extracted. Here, we encode and store 35 distinct files (over 200 MB of data), in more than 13 million DNA oligonucleotides, and show that we can recover each file individually and with no errors, using a random access approach. We design and validate a large library of primers that enable individual recovery of all files stored within the DNA. We also develop an algorithm that greatly reduces the sequencing read coverage required for error-free decoding by maximizing information from all sequence reads. These advances demonstrate a viable, large-scale system for DNA data storage and retrieval.
Nicosia, Aldo; Maggio, Teresa; Mazzola, Salvatore; Cuttitta, Angela
2013-10-30
Anemonia viridis is a widespread and extensively studied Mediterranean species of sea anemone from which a large number of polypeptide toxins, such as blood depressing substances (BDS) peptides, have been isolated. The first members of this class, BDS-1 and BDS-2, are polypeptides belonging to the β-defensin fold family and were initially described for their antihypertensive and antiviral activities. BDS-1 and BDS-2 are 43 amino acid peptides characterised by three disulfide bonds that act as neurotoxins affecting Kv3.1, Kv3.2 and Kv3.4 channel gating kinetics. In addition, BDS-1 inactivates the Nav1.7 and Nav1.3 channels. The development of a large dataset of A. viridis expressed sequence tags (ESTs) and the identification of 13 putative BDS-like cDNA sequences has attracted interest, especially as scientific and diagnostic tools. A comparison of BDS cDNA sequences showed that the untranslated regions are more conserved than the protein-coding regions. Moreover, the KA/KS ratios calculated for all pairwise comparisons showed values greater than 1, suggesting mechanisms of accelerated evolution. The structures of the BDS homologs were predicted by molecular modelling. All toxins possess similar 3D structures that consist of a triple-stranded antiparallel β-sheet and an additional small antiparallel β-sheet located downstream of the cleavage/maturation site; however, the orientation of the triple-stranded β-sheet appears to differ among the toxins. To characterise the spatial expression profile of the putative BDS cDNA sequences, tissue-specific cDNA libraries, enriched for BDS transcripts, were constructed. In addition, the proper amplification of ectodermal or endodermal markers ensured the tissue specificity of each library. Sequencing randomly selected clones from each library revealed ectodermal-specific expression of ten BDS transcripts, while transcripts of BDS-8, BDS-13, BDS-14 and BDS-15 failed to be retrieved, likely due to under-representation in our cDNA libraries. The calculation of the relative abundance of BDS transcripts in the cDNA libraries revealed that BDS-1, BDS-3, BDS-4, BDS-5 and BDS-6 are the most represented transcripts.
Undermethylated DNA as a source of microsatellites from a conifer genome.
Zhou, Y; Bui, T; Auckland, L D; Williams, C G
2002-02-01
Developing microsatellites from the large, highly duplicated conifer genome requires special tools. To improve the efficiency of developing Pinus taeda L. microsatellites, undermethylated (UM) DNA fragments were used to construct a microsatellite-enriched copy library. A methylation-sensitive restriction enzyme, McrBC, was used to enrich for UM DNA before library construction. Digested DNA fragments larger than 9 kb were then excised and digested with RsaI and used to construct nine dinucleotide and trinucleotide libraries. A total of 1016 microsatellite-positive clones were detected among 11 904 clones and 620 of these were unique. Of 245 primer sets that produced a PCR product, 113 could be developed as UM microsatellite markers and 70 were polymorphic. Inheritance and marker informativeness were tested for a random sample of 36 polymorphic markers using a three-generation outbred pedigree. Thirty-one microsatellites (86%) had single-locus inheritance despite the highly duplicated nature of the P. taeda genome. Nineteen UM microsatellites had highly informative intercross mating type configurations. Allele number and frequency were estimated for eleven UM microsatellites using a population survey. Allele numbers for these UM microsatellites ranged from 3 to 12 with an average of 5.7 alleles/locus. Frequencies for the 63 alleles were mostly in the low-common range; only 14 of the 63 were in the rare allele (q < 0.05) class. Enriching for UM DNA was an efficient method for developing polymorphic microsatellites from a large plant genome.
Enzymatically Generated CRISPR Libraries for Genome Labeling and Screening.
Lane, Andrew B; Strzelecka, Magdalena; Ettinger, Andreas; Grenfell, Andrew W; Wittmann, Torsten; Heald, Rebecca
2015-08-10
CRISPR-based technologies have emerged as powerful tools to alter genomes and mark chromosomal loci, but an inexpensive method for generating large numbers of RNA guides for whole genome screening and labeling is lacking. Using a method that permits library construction from any source of DNA, we generated guide libraries that label repetitive loci or a single chromosomal locus in Xenopus egg extracts and show that a complex library can target the E. coli genome at high frequency. Copyright © 2015 Elsevier Inc. All rights reserved.
Arabidopsis research requires a critical re-evaluation of genetic tools.
Nikonorova, Natalia; Yue, Kun; Beeckman, Tom; De Smet, Ive
2018-06-27
An increasing number of reports question conclusions based on loss-of-function lines that have unexpected genetic backgrounds. In this opinion paper, we urge researchers to meticulously (re)investigate phenotypes retrieved from various genetic backgrounds and be critical regarding some previously drawn conclusions. As an example, we provide new evidence that acr4-2 mutant phenotypes with respect to columella stem cells are due to the lack of ACR4 and not - at least not as a major contributor - to a mutation in QRT1. In addition, we take the opportunity to alert the scientific community about the qrt1-2 background of a large number of Syngenta Arabidopsis Insertion Library (SAIL) T-DNA lines, a feature that is not commonly recognized by Arabidopsis researchers. This qrt1-2 background might have an important impact on the interpretation of the results obtained using these research tools, now and in the past. In conclusion, as a community, we should continuously assess and - if necessary - correct our conclusions based on the large number of (genetic) tools our work is built on. In addition, the positive or negative results of this self-criticism should be made available to the scientific community.
High-throughput analysis of T-DNA location and structure using sequence capture
DOE Office of Scientific and Technical Information (OSTI.GOV)
Inagaki, Soichi; Henry, Isabelle M.; Lieberman, Meric C.
Agrobacterium-mediated transformation of plants with T-DNA is used both to introduce transgenes and for mutagenesis. Conventional approaches used to identify the genomic location and the structure of the inserted T-DNA are laborious and high-throughput methods using next-generation sequencing are being developed to address these problems. Here, we present a cost-effective approach that uses sequence capture targeted to the T-DNA borders to select genomic DNA fragments containing T-DNA—genome junctions, followed by Illumina sequencing to determine the location and junction structure of T-DNA insertions. Multiple probes can be mixed so that transgenic lines transformed with different T-DNA types can be processed simultaneously,more » using a simple, index-based pooling approach. We also developed a simple bioinformatic tool to find sequence read pairs that span the junction between the genome and T-DNA or any foreign DNA. We analyzed 29 transgenic lines of Arabidopsis thaliana, each containing inserts from 4 different T-DNA vectors. We determined the location of T-DNA insertions in 22 lines, 4 of which carried multiple insertion sites. Additionally, our analysis uncovered a high frequency of unconventional and complex T-DNA insertions, highlighting the needs for high-throughput methods for T-DNA localization and structural characterization. Transgene insertion events have to be fully characterized prior to use as commercial products. As a result, our method greatly facilitates the first step of this characterization of transgenic plants by providing an efficient screen for the selection of promising lines.« less
High-throughput analysis of T-DNA location and structure using sequence capture
Inagaki, Soichi; Henry, Isabelle M.; Lieberman, Meric C.; ...
2015-10-07
Agrobacterium-mediated transformation of plants with T-DNA is used both to introduce transgenes and for mutagenesis. Conventional approaches used to identify the genomic location and the structure of the inserted T-DNA are laborious and high-throughput methods using next-generation sequencing are being developed to address these problems. Here, we present a cost-effective approach that uses sequence capture targeted to the T-DNA borders to select genomic DNA fragments containing T-DNA—genome junctions, followed by Illumina sequencing to determine the location and junction structure of T-DNA insertions. Multiple probes can be mixed so that transgenic lines transformed with different T-DNA types can be processed simultaneously,more » using a simple, index-based pooling approach. We also developed a simple bioinformatic tool to find sequence read pairs that span the junction between the genome and T-DNA or any foreign DNA. We analyzed 29 transgenic lines of Arabidopsis thaliana, each containing inserts from 4 different T-DNA vectors. We determined the location of T-DNA insertions in 22 lines, 4 of which carried multiple insertion sites. Additionally, our analysis uncovered a high frequency of unconventional and complex T-DNA insertions, highlighting the needs for high-throughput methods for T-DNA localization and structural characterization. Transgene insertion events have to be fully characterized prior to use as commercial products. As a result, our method greatly facilitates the first step of this characterization of transgenic plants by providing an efficient screen for the selection of promising lines.« less
Darwin Assembly: fast, efficient, multi-site bespoke mutagenesis
Cozens, Christopher
2018-01-01
Abstract Engineering proteins for designer functions and biotechnological applications almost invariably requires (or at least benefits from) multiple mutations to non-contiguous residues. Several methods for multiple site-directed mutagenesis exist, but there remains a need for fast and simple methods to efficiently introduce such mutations – particularly for generating large, high quality libraries for directed evolution. Here, we present Darwin Assembly, which can deliver high quality libraries of >108 transformants, targeting multiple (>10) distal sites with minimal wild-type contamination (<0.25% of total population) and which takes a single working day from purified plasmid to library transformation. We demonstrate its efficacy with whole gene codon reassignment of chloramphenicol acetyl transferase, mutating 19 codons in a single reaction in KOD DNA polymerase and generating high quality, multiple-site libraries in T7 RNA polymerase and Tgo DNA polymerase. Darwin Assembly uses commercially available enzymes, can be readily automated, and offers a cost-effective route to highly complex and customizable library generation. PMID:29409059
Method for construction of normalized cDNA libraries
Soares, Marcelo B.; Efstratiadis, Argiris
1998-01-01
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to appropriate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. This invention also provides normalized cDNA libraries generated by the above-described method and uses of the generated libraries.
Method for construction of normalized cDNA libraries
Soares, M.B.; Efstratiadis, A.
1998-11-03
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3` noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to appropriate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. This invention also provides normalized cDNA libraries generated by the above-described method and uses of the generated libraries. 19 figs.
Travis, G H; Sutcliffe, J G
1988-01-01
To isolate cDNA clones of low-abundance mRNAs expressed in monkey cerebral cortex but absent from cerebellum, we developed an improved subtractive cDNA cloning procedure that requires only modest quantities of mRNA. Plasmid DNA from a monkey cerebellum cDNA library was hybridized in large excess to radiolabeled monkey cortex cDNA in a phenol emulsion-enhanced reaction. The unhybridized cortex cDNA was isolated by chromatography on hydroxyapatite and used to probe colonies from a monkey cortex cDNA library. Of 60,000 colonies screened, 163 clones were isolated and confirmed by colony hybridization or RNA blotting to represent mRNAs, ranging from 0.001% to 0.1% abundance, specific to or highly enriched in cerebral cortex relative to cerebellum. Clones of one medium-abundance mRNA were recovered almost quantitatively. Two of the lower-abundance mRNAs were expressed at levels reduced by a factor of 10 in Alzheimer disease relative to normal human cortex. One of these was identified as the monkey preprosomatostatin I mRNA. Images PMID:2894033
Clark, S. H.; Hilliker, A. J.; Chovnick, A.
1988-01-01
This report presents the results of a recombination experiment designed to question the existence of special sites for the initiation or termination of a recombination heteroduplex within the region of the rosy locus. Intragenic recombination events were monitored between two physically separated rosy mutant alleles ry(301) and ry(2) utilizing DNA restriction site polymorphisms as genetic markers. Both ry(301) and ry(2) are known from previous studies to be associated with gene conversion frequencies an order of magnitude lower than single site mutations. The mutations are associated with large, well defined insertions located as internal sites within the locus in prior intragenic mapping studies. On the molecular map, they represent large insertions approximately 2.7 kb apart in the second and third exons, respectively, of the XDH coding region. The present study monitors intragenic recombination in a mutant heterozygous genotype in which DNA homology is disrupted by these large discontinuities, greater than the region of DNA homology and flanking both sides of the locus. If initiation/or termination requires separate sites at either end of the locus, then intragenic recombination within the rosy locus of the heterozygote should be eliminated. Contrary to expectation, significant recombination between these sites is seen. PMID:2834266
DNA-encoded chemistry: enabling the deeper sampling of chemical space.
Goodnow, Robert A; Dumelin, Christoph E; Keefe, Anthony D
2017-02-01
DNA-encoded chemical library technologies are increasingly being adopted in drug discovery for hit and lead generation. DNA-encoded chemistry enables the exploration of chemical spaces four to five orders of magnitude more deeply than is achievable by traditional high-throughput screening methods. Operation of this technology requires developing a range of capabilities including aqueous synthetic chemistry, building block acquisition, oligonucleotide conjugation, large-scale molecular biological transformations, selection methodologies, PCR, sequencing, sequence data analysis and the analysis of large chemistry spaces. This Review provides an overview of the development and applications of DNA-encoded chemistry, highlighting the challenges and future directions for the use of this technology.
The Evolution of DNA-Templated Synthesis as a Tool for Materials Discovery.
O'Reilly, Rachel K; Turberfield, Andrew J; Wilks, Thomas R
2017-10-17
Precise control over reactivity and molecular structure is a fundamental goal of the chemical sciences. Billions of years of evolution by natural selection have resulted in chemical systems capable of information storage, self-replication, catalysis, capture and production of light, and even cognition. In all these cases, control over molecular structure is required to achieve a particular function: without structural control, function may be impaired, unpredictable, or impossible. The search for molecules with a desired function is often achieved by synthesizing a combinatorial library, which contains many or all possible combinations of a set of chemical building blocks (BBs), and then screening this library to identify "successful" structures. The largest libraries made by conventional synthesis are currently of the order of 10 8 distinct molecules. To put this in context, there are 10 13 ways of arranging the 21 proteinogenic amino acids in chains up to 10 units long. Given that we know that a number of these compounds have potent biological activity, it would be highly desirable to be able to search them all to identify leads for new drug molecules. Large libraries of oligonucleotides can be synthesized combinatorially and translated into peptides using systems based on biological replication such as mRNA display, with selected molecules identified by DNA sequencing; but these methods are limited to BBs that are compatible with cellular machinery. In order to search the vast tracts of chemical space beyond nucleic acids and natural peptides, an alternative approach is required. DNA-templated synthesis (DTS) could enable us to meet this challenge. DTS controls chemical product formation by using the specificity of DNA hybridization to bring selected reactants into close proximity, and is capable of the programmed synthesis of many distinct products in the same reaction vessel. By making use of dynamic, programmable DNA processes, it is possible to engineer a system that can translate instructions coded as a sequence of DNA bases into a chemical structure-a process analogous to the action of the ribosome in living organisms but with the potential to create a much more chemically diverse set of products. It is also possible to ensure that each product molecule is tagged with its identifying DNA sequence. Compound libraries synthesized in this way can be exposed to selection against suitable targets, enriching successful molecules. The encoding DNA can then be amplified using the polymerase chain reaction and decoded by DNA sequencing. More importantly, the DNA instruction sequences can be mutated and reused during multiple rounds of amplification, translation, and selection. In other words, DTS could be used as the foundation for a system of synthetic molecular evolution, which could allow us to efficiently search a vast chemical space. This has huge potential to revolutionize materials discovery-imagine being able to evolve molecules for light harvesting, or catalysts for CO 2 fixation. The field of DTS has developed to the point where a wide variety of reactions can be performed on a DNA template. Complex architectures and autonomous "DNA robots" have been implemented for the controlled assembly of BBs, and these mechanisms have in turn enabled the one-pot synthesis of large combinatorial libraries. Indeed, DTS libraries are being exploited by pharmaceutical companies and have already found their way into drug lead discovery programs. This Account explores the processes involved in DTS and highlights the challenges that remain in creating a general system for molecular discovery by evolution.
The Evolution of DNA-Templated Synthesis as a Tool for Materials Discovery
2017-01-01
Conspectus Precise control over reactivity and molecular structure is a fundamental goal of the chemical sciences. Billions of years of evolution by natural selection have resulted in chemical systems capable of information storage, self-replication, catalysis, capture and production of light, and even cognition. In all these cases, control over molecular structure is required to achieve a particular function: without structural control, function may be impaired, unpredictable, or impossible. The search for molecules with a desired function is often achieved by synthesizing a combinatorial library, which contains many or all possible combinations of a set of chemical building blocks (BBs), and then screening this library to identify “successful” structures. The largest libraries made by conventional synthesis are currently of the order of 108 distinct molecules. To put this in context, there are 1013 ways of arranging the 21 proteinogenic amino acids in chains up to 10 units long. Given that we know that a number of these compounds have potent biological activity, it would be highly desirable to be able to search them all to identify leads for new drug molecules. Large libraries of oligonucleotides can be synthesized combinatorially and translated into peptides using systems based on biological replication such as mRNA display, with selected molecules identified by DNA sequencing; but these methods are limited to BBs that are compatible with cellular machinery. In order to search the vast tracts of chemical space beyond nucleic acids and natural peptides, an alternative approach is required. DNA-templated synthesis (DTS) could enable us to meet this challenge. DTS controls chemical product formation by using the specificity of DNA hybridization to bring selected reactants into close proximity, and is capable of the programmed synthesis of many distinct products in the same reaction vessel. By making use of dynamic, programmable DNA processes, it is possible to engineer a system that can translate instructions coded as a sequence of DNA bases into a chemical structure—a process analogous to the action of the ribosome in living organisms but with the potential to create a much more chemically diverse set of products. It is also possible to ensure that each product molecule is tagged with its identifying DNA sequence. Compound libraries synthesized in this way can be exposed to selection against suitable targets, enriching successful molecules. The encoding DNA can then be amplified using the polymerase chain reaction and decoded by DNA sequencing. More importantly, the DNA instruction sequences can be mutated and reused during multiple rounds of amplification, translation, and selection. In other words, DTS could be used as the foundation for a system of synthetic molecular evolution, which could allow us to efficiently search a vast chemical space. This has huge potential to revolutionize materials discovery—imagine being able to evolve molecules for light harvesting, or catalysts for CO2 fixation. The field of DTS has developed to the point where a wide variety of reactions can be performed on a DNA template. Complex architectures and autonomous “DNA robots” have been implemented for the controlled assembly of BBs, and these mechanisms have in turn enabled the one-pot synthesis of large combinatorial libraries. Indeed, DTS libraries are being exploited by pharmaceutical companies and have already found their way into drug lead discovery programs. This Account explores the processes involved in DTS and highlights the challenges that remain in creating a general system for molecular discovery by evolution. PMID:28915003
Walker, M D; Park, C W; Rosen, A; Aronheim, A
1990-01-01
Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401
Genotype Specification Language.
Wilson, Erin H; Sagawa, Shiori; Weis, James W; Schubert, Max G; Bissell, Michael; Hawthorne, Brian; Reeves, Christopher D; Dean, Jed; Platt, Darren
2016-06-17
We describe here the Genotype Specification Language (GSL), a language that facilitates the rapid design of large and complex DNA constructs used to engineer genomes. The GSL compiler implements a high-level language based on traditional genetic notation, as well as a set of low-level DNA manipulation primitives. The language allows facile incorporation of parts from a library of cloned DNA constructs and from the "natural" library of parts in fully sequenced and annotated genomes. GSL was designed to engage genetic engineers in their native language while providing a framework for higher level abstract tooling. To this end we define four language levels, Level 0 (literal DNA sequence) through Level 3, with increasing abstraction of part selection and construction paths. GSL targets an intermediate language based on DNA slices that translates efficiently into a wide range of final output formats, such as FASTA and GenBank, and includes formats that specify instructions and materials such as oligonucleotide primers to allow the physical construction of the GSL designs by individual strain engineers or an automated DNA assembly core facility.
pYEMF, a pUC18-derived XcmI T-vector for efficient cloning of PCR products.
Gu, Jingsong; Ye, Chunjiang
2011-03-01
A 1330-bp DNA sequence with two XcmI cassettes was inserted into pUC18 to construct an efficient XcmI T-vector parent plasmid, pYEMF. The large size of the inserted DNA fragment improved T-vector cleavage efficiency, and guaranteed good separation of the molecular components after restriction digestion. The pYEMF-T-vector generated from parent plasmid pYEMF permits blue/white colony screening; cloning efficiency analysis showed that most white colonies (>75%) were putative transformants which carried the cloning product. The sequence analysis and design approach presented here will facilitate applications in the fields of molecular biology and genetic engineering.
Jiang, Likun; You, Weiwei; Zhang, Xiaojun; Xu, Jian; Jiang, Yanliang; Wang, Kai; Zhao, Zixia; Chen, Baohua; Zhao, Yunfeng; Mahboob, Shahid; Al-Ghanim, Khalid A; Ke, Caihuan; Xu, Peng
2016-02-01
The small abalone (Haliotis diversicolor) is one of the most important aquaculture species in East Asia. To facilitate gene cloning and characterization, genome analysis, and genetic breeding of it, we constructed a large-insert bacterial artificial chromosome (BAC) library, which is an important genetic tool for advanced genetics and genomics research. The small abalone BAC library includes 92,610 clones with an average insert size of 120 Kb, equivalent to approximately 7.6× of the small abalone genome. We set up three-dimensional pools and super pools of 18,432 BAC clones for target gene screening using PCR method. To assess the approach, we screened 12 target genes in these 18,432 BAC clones and identified 16 positive BAC clones. Eight positive BAC clones were then sequenced and assembled with the next generation sequencing platform. The assembled contigs representing these 8 BAC clones spanned 928 Kb of the small abalone genome, providing the first batch of genome sequences for genome evaluation and characterization. The average GC content of small abalone genome was estimated as 40.33%. A total of 21 protein-coding genes, including 7 target genes, were annotated into the 8 BACs, which proved the feasibility of PCR screening approach with three-dimensional pools in small abalone BAC library. One hundred fifty microsatellite loci were also identified from the sequences for marker development in the future. The BAC library and clone pools provided valuable resources and tools for genetic breeding and conservation of H. diversicolor.
[New hosts and vectors for genome cloning]. Progress report, 1990--1991
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
The main goal of our project remains the development of new bacterial hosts and vectors for the stable propagation of human DNA clones in E. coli. During the past six months of our current budget period, we have (1) continued to develop new hosts that permit the stable maintenance of unstable features of human DNA, and (2) developed a series of vectors for (a) cloning large DNA inserts, (b) assessing the frequency of human sequences that are lethal to the growth of E. coli, and (c) assessing the stability of human sequences cloned in M13 for large-scale sequencing projects.
Constructing and detecting a cDNA library for mites.
Hu, Li; Zhao, YaE; Cheng, Juan; Yang, YuanJun; Li, Chen; Lu, ZhaoHui
2015-10-01
RNA extraction and construction of complementary DNA (cDNA) library for mites have been quite challenging due to difficulties in acquiring tiny living mites and breaking their hard chitin. The present study is to explore a better method to construct cDNA library for mites that will lay the foundation on transcriptome and molecular pathogenesis research. We selected Psoroptes cuniculi as an experimental subject and took the following steps to construct and verify cDNA library. First, we combined liquid nitrogen grinding with TRIzol for total RNA extraction. Then, switching mechanism at 5' end of the RNA transcript (SMART) technique was used to construct full-length cDNA library. To evaluate the quality of cDNA library, the library titer and recombination rate were calculated. The reliability of cDNA library was detected by sequencing and analyzing positive clones and genes amplified by specific primers. The results showed that the RNA concentration was 836 ng/μl and the absorbance ratio at 260/280 nm was 1.82. The library titer was 5.31 × 10(5) plaque-forming unit (PFU)/ml and the recombination rate was 98.21%, indicating that the library was of good quality. In the 33 expressed sequence tags (ESTs) of P. cuniculi, two clones of 1656 and 1658 bp were almost identical with only three variable sites detected, which had an identity of 99.63% with that of Psoroptes ovis, indicating that the cDNA library was reliable. Further detection by specific primers demonstrated that the 553-bp Pso c II gene sequences of P. cuniculi had an identity of 98.56% with those of P. ovis, confirming that the cDNA library was not only reliable but also feasible.
Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL).
Koestler, Devin C; Jones, Meaghan J; Usset, Joseph; Christensen, Brock C; Butler, Rondi A; Kobor, Michael S; Wiencke, John K; Kelsey, Karl T
2016-03-08
Confounding due to cellular heterogeneity represents one of the foremost challenges currently facing Epigenome-Wide Association Studies (EWAS). Statistical methods leveraging the tissue-specificity of DNA methylation for deconvoluting the cellular mixture of heterogenous biospecimens offer a promising solution, however the performance of such methods depends entirely on the library of methylation markers being used for deconvolution. Here, we introduce a novel algorithm for Identifying Optimal Libraries (IDOL) that dynamically scans a candidate set of cell-specific methylation markers to find libraries that optimize the accuracy of cell fraction estimates obtained from cell mixture deconvolution. Application of IDOL to training set consisting of samples with both whole-blood DNA methylation data (Illumina HumanMethylation450 BeadArray (HM450)) and flow cytometry measurements of cell composition revealed an optimized library comprised of 300 CpG sites. When compared existing libraries, the library identified by IDOL demonstrated significantly better overall discrimination of the entire immune cell landscape (p = 0.038), and resulted in improved discrimination of 14 out of the 15 pairs of leukocyte subtypes. Estimates of cell composition across the samples in the training set using the IDOL library were highly correlated with their respective flow cytometry measurements, with all cell-specific R (2)>0.99 and root mean square errors (RMSEs) ranging from [0.97 % to 1.33 %] across leukocyte subtypes. Independent validation of the optimized IDOL library using two additional HM450 data sets showed similarly strong prediction performance, with all cell-specific R (2)>0.90 and R M S E<4.00 %. In simulation studies, adjustments for cell composition using the IDOL library resulted in uniformly lower false positive rates compared to competing libraries, while also demonstrating an improved capacity to explain epigenome-wide variation in DNA methylation within two large publicly available HM450 data sets. Despite consisting of half as many CpGs compared to existing libraries for whole blood mixture deconvolution, the optimized IDOL library identified herein resulted in outstanding prediction performance across all considered data sets and demonstrated potential to improve the operating characteristics of EWAS involving adjustments for cell distribution. In addition to providing the EWAS community with an optimized library for whole blood mixture deconvolution, our work establishes a systematic and generalizable framework for the assembly of libraries that improve the accuracy of cell mixture deconvolution.
Theoretical modeling of masking DNA application in aptamer-facilitated biomarker discovery.
Cherney, Leonid T; Obrecht, Natalia M; Krylov, Sergey N
2013-04-16
In aptamer-facilitated biomarker discovery (AptaBiD), aptamers are selected from a library of random DNA (or RNA) sequences for their ability to specifically bind cell-surface biomarkers. The library is incubated with intact cells, and cell-bound DNA molecules are separated from those unbound and amplified by the polymerase chain reaction (PCR). The partitioning/amplification cycle is repeated multiple times while alternating target cells and control cells. Efficient aptamer selection in AptaBiD relies on the inclusion of masking DNA within the cell and library mixture. Masking DNA lacks primer regions for PCR amplification and is typically taken in excess to the library. The role of masking DNA within the selection mixture is to outcompete any nonspecific binding sequences within the initial library, thus allowing specific DNA sequences (i.e., aptamers) to be selected more efficiently. Efficient AptaBiD requires an optimum ratio of masking DNA to library DNA, at which aptamers still bind specific binding sites but nonaptamers within the library do not bind nonspecific binding sites. Here, we have developed a mathematical model that describes the binding processes taking place within the equilibrium mixture of masking DNA, library DNA, and target cells. An obtained mathematical solution allows one to estimate the concentration of masking DNA that is required to outcompete the library DNA at a desirable ratio of bound masking DNA to bound library DNA. The required concentration depends on concentrations of the library and cells as well as on unknown cell characteristics. These characteristics include the concentration of total binding sites on the cell surface, N, and equilibrium dissociation constants, K(nsL) and K(nsM), for nonspecific binding of the library DNA and masking DNA, respectively. We developed a theory that allows the determination of N, K(nsL), and K(nsM) based on measurements of EC50 values for cells mixed separately with the library and masking DNA (EC50 is the concentration of fluorescently labeled DNA at which half of the maximum fluorescence signal from DNA-bound cells is reached). We also obtained expressions for signals from bound DNA (measured by flow cytometry) in terms of N, K(nsL), and K(nsM). These expressions can be used for the verification of N, K(nsL), and K(nsM) values found from EC50 measurements. The developed procedure was applied to MCF-7 breast cancer cells, and corresponding values of N, K(nsL), and K(nsM) were established for the first time. The concentration of masking DNA required for AptaBiD with MCF-7 breast cancer cells was also estimated.
Krebs, Arnaud R; Dessus-Babus, Sophie; Burger, Lukas; Schübeler, Dirk
2014-09-26
The majority of mammalian promoters are CpG islands; regions of high CG density that require protection from DNA methylation to be functional. Importantly, how sequence architecture mediates this unmethylated state remains unclear. To address this question in a comprehensive manner, we developed a method to interrogate methylation states of hundreds of sequence variants inserted at the same genomic site in mouse embryonic stem cells. Using this assay, we were able to quantify the contribution of various sequence motifs towards the resulting DNA methylation state. Modeling of this comprehensive dataset revealed that CG density alone is a minor determinant of their unmethylated state. Instead, these data argue for a principal role for transcription factor binding sites, a prediction confirmed by testing synthetic mutant libraries. Taken together, these findings establish the hierarchy between the two cis-encoded mechanisms that define the DNA methylation state and thus the transcriptional competence of CpG islands.
Cone, M C; Petrich, A K; Gould, S J; Zabriskie, T M
1998-06-01
Two small chromosomal DNA fragments (2.6 and 4.8 kb) from the blasticidin S producer Streptomyces griseochromogenes were cloned in the high copy number vector pIJ702 and shown to confer increased resistance to blasticidin S upon S. lividans TK24. These fragments were used to screen a library of S. griseochromogenes DNA prepared in the cosmid shuttle vector pOJ446. Cosmids containing DNA inserts of at least 23 kb were identified which hybridized to one or the other resistance fragment, but not to both. Transformation of S. lividans TK24 with several cosmids hybridizing with the 4.8 kb resistance fragment resulted in clones that produced cytosylglucuronic acid, the first intermediate of the blasticidin S biosynthetic pathway, and other blasticidin-related metabolites. A strain of S. lividans TK24 harboring both the 4.8 kb-hybridizing cosmid and the 2.6 kb resistance fragment cloned in pIJ702 produced 12.5 times as much demethylblasticidin S as the transformant harboring the cosmid alone.
Houtz, Robert L.
1998-01-01
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) .epsilon.N-methyltransferase (protein methylase III or Rubisco LSMT) is disclosed. This enzyme catalyzes methylation of the .epsilon.-amine of lysine-14 in the large subunit of Rubisco. In addition, a full-length cDNA clone for Rubisco LSMT is disclosed. Transgenic plants and methods of producing same which (1) have the Rubisco LSMT gene inserted into the DNA, and (2) have the Rubisco LSMT gene product or the action of the gene product deleted from the DNA are also provided. Further, methods of using the gene to selectively deliver desired agents to a plant are also disclosed.
Houtz, Robert L.
1999-01-01
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) .sup..epsilon. N-methyltransferase (protein methylase III or Rubisco LSMT) is disclosed. This enzyme catalyzes methylation of the .epsilon.-amine of lysine-14 in the large subunit of Rubisco. In addition, a full-length cDNA clone for Rubisco LSMT is disclosed. Transgenic plants and methods of producing same which (1) have the Rubisco LSMT gene inserted into the DNA, and (2) have the Rubisco LSMT gene product or the action of the gene product deleted from the DNA are also provided. Further, methods of using the gene to selectively deliver desired agents to a plant are also disclosed.
Houtz, R.L.
1998-03-03
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) {epsilon}N-methyltransferase (protein methylase III or Rubisco LSMT) is disclosed. This enzyme catalyzes methylation of the {epsilon}-amine of lysine-14 in the large subunit of Rubisco. In addition, a full-length cDNA clone for Rubisco LSMT is disclosed. Transgenic plants and methods of producing same which (1) have the Rubisco LSMT gene inserted into the DNA, and (2) have the Rubisco LSMT gene product or the action of the gene product deleted from the DNA are also provided. Further, methods of using the gene to selectively deliver desired agents to a plant are also disclosed. 5 figs.
Houtz, R.L.
1999-02-02
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS){sup {epsilon}}N-methyltransferase (protein methylase III or Rubisco LSMT) is disclosed. This enzyme catalyzes methylation of the {epsilon}-amine of lysine-14 in the large subunit of Rubisco. In addition, a full-length cDNA clone for Rubisco LSMT is disclosed. Transgenic plants and methods of producing same which (1) have the Rubisco LSMT gene inserted into the DNA, and (2) have the Rubisco LSMT gene product or the action of the gene product deleted from the DNA are also provided. Further, methods of using the gene to selectively deliver desired agents to a plant are also disclosed. 8 figs.
Castañón, Jesús; Román, José Pablo; Jessop, Theodore C; de Blas, Jesús; Haro, Rubén
2018-06-01
DNA-encoded libraries (DELs) have emerged as an efficient and cost-effective drug discovery tool for the exploration and screening of very large chemical space using small-molecule collections of unprecedented size. Herein, we report an integrated automation and informatics system designed to enhance the quality, efficiency, and throughput of the production and affinity selection of these libraries. The platform is governed by software developed according to a database-centric architecture to ensure data consistency, integrity, and availability. Through its versatile protocol management functionalities, this application captures the wide diversity of experimental processes involved with DEL technology, keeps track of working protocols in the database, and uses them to command robotic liquid handlers for the synthesis of libraries. This approach provides full traceability of building-blocks and DNA tags in each split-and-pool cycle. Affinity selection experiments and high-throughput sequencing reads are also captured in the database, and the results are automatically deconvoluted and visualized in customizable representations. Researchers can compare results of different experiments and use machine learning methods to discover patterns in data. As of this writing, the platform has been validated through the generation and affinity selection of various libraries, and it has become the cornerstone of the DEL production effort at Lilly.
Videvall, Elin; Strandh, Maria; Engelbrecht, Anel; Cloete, Schalk; Cornwallis, Charlie K
2017-01-01
The gut microbiome of animals is emerging as an important factor influencing ecological and evolutionary processes. A major bottleneck in obtaining microbiome data from large numbers of samples is the time-consuming laboratory procedures required, specifically the isolation of DNA and generation of amplicon libraries. Recently, direct PCR kits have been developed that circumvent conventional DNA extraction steps, thereby streamlining the laboratory process by reducing preparation time and costs. However, the reliability and efficacy of direct PCR for measuring host microbiomes have not yet been investigated other than in humans with 454 sequencing. Here, we conduct a comprehensive evaluation of the microbial communities obtained with direct PCR and the widely used Mo Bio PowerSoil DNA extraction kit in five distinct gut sample types (ileum, cecum, colon, feces, and cloaca) from 20 juvenile ostriches, using 16S rRNA Illumina MiSeq sequencing. We found that direct PCR was highly comparable over a range of measures to the DNA extraction method in cecal, colon, and fecal samples. However, the two methods significantly differed in samples with comparably low bacterial biomass: cloacal and especially ileal samples. We also sequenced 100 replicate sample pairs to evaluate repeatability during both extraction and PCR stages and found that both methods were highly consistent for cecal, colon, and fecal samples ( r s > 0.7) but had low repeatability for cloacal ( r s = 0.39) and ileal ( r s = -0.24) samples. This study indicates that direct PCR provides a fast, cheap, and reliable alternative to conventional DNA extraction methods for retrieving 16S rRNA data, which can aid future gut microbiome studies. IMPORTANCE The microbial communities of animals can have large impacts on their hosts, and the number of studies using high-throughput sequencing to measure gut microbiomes is rapidly increasing. However, the library preparation procedure in microbiome research is both costly and time-consuming, especially for large numbers of samples. We investigated a cheaper and faster direct PCR method designed to bypass the DNA isolation steps during 16S rRNA library preparation and compared it with a standard DNA extraction method. We used both techniques on five different gut sample types collected from 20 juvenile ostriches and sequenced samples with Illumina MiSeq. The methods were highly comparable and highly repeatable in three sample types with high microbial biomass (cecum, colon, and feces), but larger differences and low repeatability were found in the microbiomes obtained from the ileum and cloaca. These results will help microbiome researchers assess library preparation procedures and plan their studies accordingly.
Technique of laser chromosome welding for chromosome repair and artificial chromosome creation.
Huang, Yao-Xiong; Li, Lin; Yang, Liu; Zhang, Yi
2018-04-01
Here we report a technique of laser chromosome welding that uses a violet pulse laser micro-beam for welding. The technique can integrate any size of a desired chromosome fragment into recipient chromosomes by combining with other techniques of laser chromosome manipulation such as chromosome cutting, moving, and stretching. We demonstrated that our method could perform chromosomal modifications with high precision, speed and ease of use in the absence of restriction enzymes, DNA ligases and DNA polymerases. Unlike the conventional methods such as de novo artificial chromosome synthesis, our method has no limitation on the size of the inserted chromosome fragment. The inserted DNA size can be precisely defined and the processed chromosome can retain its intrinsic structure and integrity. Therefore, our technique provides a high quality alternative approach to directed genetic recombination, and can be used for chromosomal repair, removal of defects and artificial chromosome creation. The technique may also have applicability on the manipulation and extension of large pieces of synthetic DNA.
2011-01-01
Background Common bean is an important legume crop with only a moderate number of short expressed sequence tags (ESTs) made with traditional methods. The goal of this research was to use full-length cDNA technology to develop ESTs that would overlap with the beginning of open reading frames and therefore be useful for gene annotation of genomic sequences. The library was also constructed to represent genes expressed under drought, low soil phosphorus and high soil aluminum toxicity. We also undertook comparisons of the full-length cDNA library to two previous non-full clone EST sets for common bean. Results Two full-length cDNA libraries were constructed: one for the drought tolerant Mesoamerican genotype BAT477 and the other one for the acid-soil tolerant Andean genotype G19833 which has been selected for genome sequencing. Plants were grown in three soil types using deep rooting cylinders subjected to drought and non-drought stress and tissues were collected from both roots and above ground parts. A total of 20,000 clones were selected robotically, half from each library. Then, nearly 10,000 clones from the G19833 library were sequenced with an average read length of 850 nucleotides. A total of 4,219 unigenes were identified consisting of 2,981 contigs and 1,238 singletons. These were functionally annotated with gene ontology terms and placed into KEGG pathways. Compared to other EST sequencing efforts in common bean, about half of the sequences were novel or represented the 5' ends of known genes. Conclusions The present full-length cDNA libraries add to the technological toolbox available for common bean and our sequencing of these clones substantially increases the number of unique EST sequences available for the common bean genome. All of this should be useful for both functional gene annotation, analysis of splice site variants and intron/exon boundary determination by comparison to soybean genes or with common bean whole-genome sequences. In addition the library has a large number of transcription factors and will be interesting for discovery and validation of drought or abiotic stress related genes in common bean. PMID:22118559
Hirotani, M; Kuroda, R; Suzuki, H; Yoshikawa, T
2000-05-01
A cDNA encoding UDP-glucose: baicalein 7-O-glucosyltransferase (UBGT) was isolated from a cDNA library from hairy root cultures of Scutellaria baicalensis Georgi probed with a partial-length cDNA clone of a UDP-glucose: flavonoid 3-O-glucosyltransferase (UFGT) from grape (Vitis vinifera L.). The heterologous probe contained a glucosyltransferase consensus amino acid sequence which was also present in the Scutellaria cDNA clones. The complete nucleotide sequence of the 1688-bp cDNA insert was determined and the deduced amino acid sequences are presented. The nucleotide sequence analysis of UBGT revealed an open reading frame encoding a polypeptide of 476 amino acids with a calculated molecular mass of 53,094 Da. The reaction product for baicalein and UDP-glucose catalyzed by recombinant UBGT in Escherichia coli was identified as authentic baicalein 7-O-glucoside using high-performance liquid chromatography and proton nuclear magnetic resonance spectroscopy. The enzyme activities of recombinant UBGT expressed in E. coli were also detected towards flavonoids such as baicalein, wogonin, apigenin, scutellarein, 7,4'-dihydroxyflavone and kaempferol, and phenolic compounds. The accumulation of UBGT mRNA in hairy roots was in response to wounding or salicylic acid treatments.
Genome-Wide Mutagenesis in Borrelia burgdorferi.
Lin, Tao; Gao, Lihui
2018-01-01
Signature-tagged mutagenesis (STM) is a functional genomics approach to identify bacterial virulence determinants and virulence factors by simultaneously screening multiple mutants in a single host animal, and has been utilized extensively for the study of bacterial pathogenesis, host-pathogen interactions, and spirochete and tick biology. The signature-tagged transposon mutagenesis has been developed to investigate virulence determinants and pathogenesis of Borrelia burgdorferi. Mutants in genes important in virulence are identified by negative selection in which the mutants fail to colonize or disseminate in the animal host and tick vector. STM procedure combined with Luminex Flex ® Map™ technology and next-generation sequencing (e.g., Tn-seq) are the powerful high-throughput tools for the determination of Borrelia burgdorferi virulence determinants. The assessment of multiple tissue sites and two DNA resources at two different time points using Luminex Flex ® Map™ technology provides a robust data set. B. burgdorferi transposon mutant screening indicates that a high proportion of genes are the novel virulence determinants that are required for mouse and tick infection. In this protocol, an effective signature-tagged Himar1-based transposon suicide vector was developed and used to generate a sequence-defined library of nearly 4800 mutants in the infectious B. burgdorferi B31 clone. In STM, signature-tagged suicide vectors are constructed by inserting unique DNA sequences (tags) into the transposable elements. The signature-tagged transposon mutants are generated when transposon suicide vectors are transformed into an infectious B. burgdorferi clone, and the transposable element is transposed into the 5'-TA-3' sequence in the B. burgdorferi genome with the signature tag. The transposon library is created and consists of many sub-libraries, each sub-library has several hundreds of mutants with same tags. A group of mice or ticks are infected with a mixed population of mutants with different tags, after recovered from different tissues of infected mice and ticks, mutants from output pool and input pool are detected using high-throughput, semi-quantitative Luminex ® FLEXMAP™ or next-generation sequencing (Tn-seq) technologies. Thus far, we have created a high-density, sequence-defined transposon library of over 6600 STM mutants for the efficient genome-wide investigation of genes and gene products required for wild-type pathogenesis, host-pathogen interactions, in vitro growth, in vivo survival, physiology, morphology, chemotaxis, motility, structure, metabolism, gene regulation, plasmid maintenance and replication, etc. The insertion sites of 4480 transposon mutants have been determined. About 800 predicted protein-encoding genes in the genome were disrupted in the STM transposon library. The infectivity and some functions of 800 mutants in 500 genes have been determined. Analysis of these transposon mutants has yielded valuable information regarding the genes and gene products important in the pathogenesis and biology of B. burgdorferi and its tick vectors.
Hit-Validation Methodologies for Ligands Isolated from DNA-Encoded Chemical Libraries.
Zimmermann, Gunther; Li, Yizhou; Rieder, Ulrike; Mattarella, Martin; Neri, Dario; Scheuermann, Jörg
2017-05-04
DNA-encoded chemical libraries (DECLs) are large collections of compounds linked to DNA fragments, serving as amplifiable barcodes, which can be screened on target proteins of interest. In typical DECL selections, preferential binders are identified by high-throughput DNA sequencing, by comparing their frequency before and after the affinity capture step. Hits identified in this procedure need to be confirmed, by resynthesis and by performing affinity measurements. In this article we present new methods based on hybridization of oligonucleotide conjugates with fluorescently labeled complementary oligonucleotides; these facilitate the determination of affinity constants and kinetic dissociation constants. The experimental procedures were demonstrated with acetazolamide, a binder to carbonic anhydrase IX with a dissociation constant in the nanomolar range. The detection of binding events was compatible not only with fluorescence polarization methodologies, but also with Alphascreen technology and with microscale thermophoresis. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Single Day Construction of Multigene Circuits with 3G Assembly.
Halleran, Andrew D; Swaminathan, Anandh; Murray, Richard M
2018-05-18
The ability to rapidly design, build, and test prototypes is of key importance to every engineering discipline. DNA assembly often serves as a rate limiting step of the prototyping cycle for synthetic biology. Recently developed DNA assembly methods such as isothermal assembly and type IIS restriction enzyme systems take different approaches to accelerate DNA construction. We introduce a hybrid method, Golden Gate-Gibson (3G), that takes advantage of modular part libraries introduced by type IIS restriction enzyme systems and isothermal assembly's ability to build large DNA constructs in single pot reactions. Our method is highly efficient and rapid, facilitating construction of entire multigene circuits in a single day. Additionally, 3G allows generation of variant libraries enabling efficient screening of different possible circuit constructions. We characterize the efficiency and accuracy of 3G assembly for various construct sizes, and demonstrate 3G by characterizing variants of an inducible cell-lysis circuit.
Digitally encoded DNA nanostructures for multiplexed, single-molecule protein sensing with nanopores
NASA Astrophysics Data System (ADS)
Bell, Nicholas A. W.; Keyser, Ulrich F.
2016-07-01
The simultaneous detection of a large number of different analytes is important in bionanotechnology research and in diagnostic applications. Nanopore sensing is an attractive method in this regard as the approach can be integrated into small, portable device architectures, and there is significant potential for detecting multiple sub-populations in a sample. Here, we show that highly multiplexed sensing of single molecules can be achieved with solid-state nanopores by using digitally encoded DNA nanostructures. Based on the principles of DNA origami, we designed a library of DNA nanostructures in which each member contains a unique barcode; each bit in the barcode is signalled by the presence or absence of multiple DNA dumbbell hairpins. We show that a 3-bit barcode can be assigned with 94% accuracy by electrophoretically driving the DNA structures through a solid-state nanopore. Select members of the library were then functionalized to detect a single, specific antibody through antigen presentation at designed positions on the DNA. This allows us to simultaneously detect four different antibodies of the same isotype at nanomolar concentration levels.
Bell, Nicholas A W; Keyser, Ulrich F
2016-07-01
The simultaneous detection of a large number of different analytes is important in bionanotechnology research and in diagnostic applications. Nanopore sensing is an attractive method in this regard as the approach can be integrated into small, portable device architectures, and there is significant potential for detecting multiple sub-populations in a sample. Here, we show that highly multiplexed sensing of single molecules can be achieved with solid-state nanopores by using digitally encoded DNA nanostructures. Based on the principles of DNA origami, we designed a library of DNA nanostructures in which each member contains a unique barcode; each bit in the barcode is signalled by the presence or absence of multiple DNA dumbbell hairpins. We show that a 3-bit barcode can be assigned with 94% accuracy by electrophoretically driving the DNA structures through a solid-state nanopore. Select members of the library were then functionalized to detect a single, specific antibody through antigen presentation at designed positions on the DNA. This allows us to simultaneously detect four different antibodies of the same isotype at nanomolar concentration levels.
Carpenter, Meredith L.; Buenrostro, Jason D.; Valdiosera, Cristina; Schroeder, Hannes; Allentoft, Morten E.; Sikora, Martin; Rasmussen, Morten; Gravel, Simon; Guillén, Sonia; Nekhrizov, Georgi; Leshtakov, Krasimir; Dimitrova, Diana; Theodossiev, Nikola; Pettener, Davide; Luiselli, Donata; Sandoval, Karla; Moreno-Estrada, Andrés; Li, Yingrui; Wang, Jun; Gilbert, M. Thomas P.; Willerslev, Eske; Greenleaf, William J.; Bustamante, Carlos D.
2013-01-01
Most ancient specimens contain very low levels of endogenous DNA, precluding the shotgun sequencing of many interesting samples because of cost. Ancient DNA (aDNA) libraries often contain <1% endogenous DNA, with the majority of sequencing capacity taken up by environmental DNA. Here we present a capture-based method for enriching the endogenous component of aDNA sequencing libraries. By using biotinylated RNA baits transcribed from genomic DNA libraries, we are able to capture DNA fragments from across the human genome. We demonstrate this method on libraries created from four Iron Age and Bronze Age human teeth from Bulgaria, as well as bone samples from seven Peruvian mummies and a Bronze Age hair sample from Denmark. Prior to capture, shotgun sequencing of these libraries yielded an average of 1.2% of reads mapping to the human genome (including duplicates). After capture, this fraction increased substantially, with up to 59% of reads mapped to human and enrichment ranging from 6- to 159-fold. Furthermore, we maintained coverage of the majority of regions sequenced in the precapture library. Intersection with the 1000 Genomes Project reference panel yielded an average of 50,723 SNPs (range 3,062–147,243) for the postcapture libraries sequenced with 1 million reads, compared with 13,280 SNPs (range 217–73,266) for the precapture libraries, increasing resolution in population genetic analyses. Our whole-genome capture approach makes it less costly to sequence aDNA from specimens containing very low levels of endogenous DNA, enabling the analysis of larger numbers of samples. PMID:24568772
Vrljicak, Pavle; Tao, Shijie; Varshney, Gaurav K; Quach, Helen Ngoc Bao; Joshi, Adita; LaFave, Matthew C; Burgess, Shawn M; Sampath, Karuna
2016-04-07
DNA transposons and retroviruses are important transgenic tools for genome engineering. An important consideration affecting the choice of transgenic vector is their insertion site preferences. Previous large-scale analyses of Ds transposon integration sites in plants were done on the basis of reporter gene expression or germ-line transmission, making it difficult to discern vertebrate integration preferences. Here, we compare over 1300 Ds transposon integration sites in zebrafish with Tol2 transposon and retroviral integration sites. Genome-wide analysis shows that Ds integration sites in the presence or absence of marker selection are remarkably similar and distributed throughout the genome. No strict motif was found, but a preference for structural features in the target DNA associated with DNA flexibility (Twist, Tilt, Rise, Roll, Shift, and Slide) was observed. Remarkably, this feature is also found in transposon and retroviral integrations in maize and mouse cells. Our findings show that structural features influence the integration of heterologous DNA in genomes, and have implications for targeted genome engineering. Copyright © 2016 Vrljicak et al.
Design and Synthesis of Biaryl DNA-Encoded Libraries.
Ding, Yun; Franklin, G Joseph; DeLorey, Jennifer L; Centrella, Paolo A; Mataruse, Sibongile; Clark, Matthew A; Skinner, Steven R; Belyanskaya, Svetlana
2016-10-10
DNA-encoded library technology (ELT) is a powerful tool for the discovery of new small-molecule ligands to various protein targets. Here we report the design and synthesis of biaryl DNA-encoded libraries based on the scaffold of 5-formyl 3-iodobenzoic acid. Three reactions on DNA template, acylation, Suzuki-Miyaura coupling and reductive amination, were applied in the library synthesis. The three cycle library of 3.5 million diversity has delivered potent hits for phosphoinositide 3-kinase α (PI3Kα).
Soares, Marcelo B.; Efstratiadis, Argiris
1997-01-01
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library.
Soares, M.B.; Efstratiadis, A.
1997-06-10
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3{prime} noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. 4 figs.
Microsatellite DNA library for Caiman latirostris.
Zucoloto, Rodrigo Barban; Verdade, Luciano Martins; Coutinho, Luiz Lehmann
2002-12-15
New genetic markers were characterized for the broad-snouted caiman (Caiman latirostris) by constructing libraries enriched for microsatellite DNA. Construction and characterization of these libraries are described in the present study. One microsatellite marker was developed from a (ACC-TGG)(n)enriched microsatellite DNA library, and 12 microsatellite markers were developed from a (AC-TG)(n)enriched microsatellite DNA library. These markers were tested in wild-caught animals, and these tests resulted in ten new polymorphic microsatellites for C. latirostris. Copyright 2002 Wiley-Liss, Inc.
Mitochondrial DNA transfer to the nucleus generates extensive insertion site variation in maize.
Lough, Ashley N; Roark, Leah M; Kato, Akio; Ream, Thomas S; Lamb, Jonathan C; Birchler, James A; Newton, Kathleen J
2008-01-01
Mitochondrial DNA (mtDNA) insertions into nuclear chromosomes have been documented in a number of eukaryotes. We used fluorescence in situ hybridization (FISH) to examine the variation of mtDNA insertions in maize. Twenty overlapping cosmids, representing the 570-kb maize mitochondrial genome, were individually labeled and hybridized to root tip metaphase chromosomes from the B73 inbred line. A minimum of 15 mtDNA insertion sites on nine chromosomes were detectable using this method. One site near the centromere on chromosome arm 9L was identified by a majority of the cosmids. To examine variation in nuclear mitochondrial DNA sequences (NUMTs), a mixture of labeled cosmids was applied to chromosome spreads of ten diverse inbred lines: A188, A632, B37, B73, BMS, KYS, Mo17, Oh43, W22, and W23. The number of detectable NUMTs varied dramatically among the lines. None of the tested inbred lines other than B73 showed the strong hybridization signal on 9L, suggesting that there is a recent mtDNA insertion at this site in B73. Different sources of B73 and W23 were examined for NUMT variation within inbred lines. Differences were detectable, suggesting either that mtDNA is being incorporated or lost from the maize nuclear genome continuously. The results indicate that mtDNA insertions represent a major source of nuclear chromosomal variation.
Second-generation DNA-templated macrocycle libraries for the discovery of bioactive small molecules.
Usanov, Dmitry L; Chan, Alix I; Maianti, Juan Pablo; Liu, David R
2018-07-01
DNA-encoded libraries have emerged as a widely used resource for the discovery of bioactive small molecules, and offer substantial advantages compared with conventional small-molecule libraries. Here, we have developed and streamlined multiple fundamental aspects of DNA-encoded and DNA-templated library synthesis methodology, including computational identification and experimental validation of a 20 × 20 × 20 × 80 set of orthogonal codons, chemical and computational tools for enhancing the structural diversity and drug-likeness of library members, a highly efficient polymerase-mediated template library assembly strategy, and library isolation and purification methods. We have integrated these improved methods to produce a second-generation DNA-templated library of 256,000 small-molecule macrocycles with improved drug-like physical properties. In vitro selection of this library for insulin-degrading enzyme affinity resulted in novel insulin-degrading enzyme inhibitors, including one of unusual potency and novel macrocycle stereochemistry (IC 50 = 40 nM). Collectively, these developments enable DNA-templated small-molecule libraries to serve as more powerful, accessible, streamlined and cost-effective tools for bioactive small-molecule discovery.
Cytogenetic and Sequence Analyses of Mitochondrial DNA Insertions in Nuclear Chromosomes of Maize
Lough, Ashley N.; Faries, Kaitlyn M.; Koo, Dal-Hoe; Hussain, Abid; Roark, Leah M.; Langewisch, Tiffany L.; Backes, Teresa; Kremling, Karl A. G.; Jiang, Jiming; Birchler, James A.; Newton, Kathleen J.
2015-01-01
The transfer of mitochondrial DNA (mtDNA) into nuclear genomes is a regularly occurring process that has been observed in many species. Few studies, however, have focused on the variation of nuclear-mtDNA sequences (NUMTs) within a species. This study examined mtDNA insertions within chromosomes of a diverse set of Zea mays ssp. mays (maize) inbred lines by the use of fluorescence in situ hybridization. A relatively large NUMT on the long arm of chromosome 9 (9L) was identified at approximately the same position in four inbred lines (B73, M825, HP301, and Oh7B). Further examination of the similarly positioned 9L NUMT in two lines, B73 and M825, indicated that the large size of these sites is due to the presence of a majority of the mitochondrial genome; however, only portions of this NUMT (∼252 kb total) were found in the publically available B73 nuclear sequence for chromosome 9. Fiber-fluorescence in situ hybridization analysis estimated the size of the B73 9L NUMT to be ∼1.8 Mb and revealed that the NUMT is methylated. Two regions of mtDNA (2.4 kb and 3.3 kb) within the 9L NUMT are not present in the B73 mitochondrial NB genome; however, these 2.4-kb and 3.3-kb segments are present in other Zea mitochondrial genomes, including that of Zea mays ssp. parviglumis, a progenitor of domesticated maize. PMID:26333837
Wang, Yongjie; Kleespies, Regina G; Ramle, Moslim B; Jehle, Johannes A
2008-09-01
The genomic sequence analysis of many large dsDNA viruses is hampered by the lack of enough sample materials. Here, we report a whole genome amplification of the Oryctes rhinoceros nudivirus (OrNV) isolate Ma07 starting from as few as about 10 ng of purified viral DNA by application of phi29 DNA polymerase- and exonuclease-resistant random hexamer-based multiple displacement amplification (MDA) method. About 60 microg of high molecular weight DNA with fragment sizes of up to 25 kbp was amplified. A genomic DNA clone library was generated using the product DNA. After 8-fold sequencing coverage, the 127,615 bp of OrNV whole genome was sequenced successfully. The results demonstrate that the MDA-based whole genome amplification enables rapid access to genomic information from exiguous virus samples.
Dorraj, Ghamar Soltan; Rassaee, Mohammad Javad; Latifi, Ali Mohammad; Pishgoo, Bahram; Tavallaei, Mahmood
2015-08-20
Troponin T and I are ideal markers which are highly sensitive and specific for myocardial injury and have shown better efficacy than earlier markers. Since aptamers are ssDNA or RNA that bind to a wide variety of target molecules, the purpose of this research was to select an aptamer from a 79bp single-stranded DNA (ssDNA) random library that was used to bind the Human Cardiac Troponin I from a synthetic nucleic acids library by systematic evolution of ligands exponential enrichment (Selex) based on several selection and amplification steps. Human Cardiac Troponin I protein was coated onto the surface of streptavidin magnetic beads to extract specific aptamer from a large and diverse random ssDNA initial oligonucleotide library. As a result, several aptamers were selected and further examined for binding affinity and specificity. Finally TnIApt 23 showed beast affinity in nanomolar range (2.69nM) toward the target protein. A simple and rapid colorimetric detection assay for Human Cardiac Troponin I using the novel and specific aptamer-AuNPs conjugates based on dot blot assay was developed. The detection limit for this protein using aptamer-AuNPs-based assay was found to be 5ng/ml. Copyright © 2015 Elsevier B.V. All rights reserved.
Onozawa, Masahiro; Zhang, Zhenhua; Kim, Yoo Jung; Goldberg, Liat; Varga, Tamas; Bergsagel, P Leif; Kuehl, W Michael; Aplan, Peter D
2014-05-27
We used the I-SceI endonuclease to produce DNA double-strand breaks (DSBs) and observed that a fraction of these DSBs were repaired by insertion of sequences, which we termed "templated sequence insertions" (TSIs), derived from distant regions of the genome. These TSIs were derived from genic, retrotransposon, or telomere sequences and were not deleted from the donor site in the genome, leading to the hypothesis that they were derived from reverse-transcribed RNA. Cotransfection of RNA and an I-SceI expression vector demonstrated insertion of RNA-derived sequences at the DNA-DSB site, and TSIs were suppressed by reverse-transcriptase inhibitors. Both observations support the hypothesis that TSIs were derived from RNA templates. In addition, similar insertions were detected at sites of DNA DSBs induced by transcription activator-like effector nuclease proteins. Whole-genome sequencing of myeloma cell lines revealed additional TSIs, demonstrating that repair of DNA DSBs via insertion was not restricted to experimentally produced DNA DSBs. Analysis of publicly available databases revealed that many of these TSIs are polymorphic in the human genome. Taken together, these results indicate that insertional events should be considered as alternatives to gross chromosomal rearrangements in the interpretation of whole-genome sequence data and that this mutagenic form of DNA repair may play a role in genetic disease, exon shuffling, and mammalian evolution.
Sir- and silencer-independent disruption of silencing in Saccharomyces by Sas10p.
Kamakaka, R T; Rine, J
1998-06-01
A promoter fusion library of Saccharomyces cerevisiae genes was used to exploit phenotypes associated with altered protein dosage. We identified a novel gene, SAS10, by the ability of Sas10p, when overproduced, to disrupt silencing. The predicted Sas10p was 70,200 kD and strikingly rich in charged amino acids. Sas10p was exclusively nuclear in all stages of the cell cycle. Overproduction of Sas10p caused derepression of mating type genes at both HML and HMR, as well as of URA3, TRP1, and ADE2 when inserted near a telomere or at HMR or the rDNA locus. Repressed genes not associated with silenced chromatin were unaffected. Sas10p was essential for viability, and the termination point following Sas10p depletion was as large budded cells. Remarkably, Sas10p overproduction disrupted silencing even under conditions that bypassed the requirement for Sir proteins, ORC, and Rap1p in silencing. These data implied that Sas10p function was intimately connected with the structure of silenced chromatin.
Suboptimal Doses of Raltegravir Cause Aberrant HIV Integrations | Center for Cancer Research
When a cell is infected with HIV, a DNA copy of the HIV genome is inserted into that cell’s chromosomal DNA. This insertion reaction is carried out by the viral enzyme integrase (IN) and involves two distinct steps: removal of two nucleotides from each 3’ end of the viral DNA, followed by the strand transfer reaction, in which the viral DNA ends are inserted into the host
Luo, Meizhong; Kim, Hyeran; Kudrna, Dave; Sisneros, Nicholas B; Lee, So-Jeong; Mueller, Christopher; Collura, Kristi; Zuccolo, Andrea; Buckingham, E Bryan; Grim, Suzanne M; Yanagiya, Kazuyo; Inoko, Hidetoshi; Shiina, Takashi; Flajnik, Martin F; Wing, Rod A; Ohta, Yuko
2006-05-03
Sharks are members of the taxonomic class Chondrichthyes, the oldest living jawed vertebrates. Genomic studies of this group, in comparison to representative species in other vertebrate taxa, will allow us to theorize about the fundamental genetic, developmental, and functional characteristics in the common ancestor of all jawed vertebrates. In order to obtain mapping and sequencing data for comparative genomics, we constructed a bacterial artificial chromosome (BAC) library for the nurse shark, Ginglymostoma cirratum. The BAC library consists of 313,344 clones with an average insert size of 144 kb, covering ~4.5 x 1010 bp and thus providing an 11-fold coverage of the haploid genome. BAC end sequence analyses revealed, in addition to LINEs and SINEs commonly found in other animal and plant genomes, two new groups of nurse shark-specific repetitive elements, NSRE1 and NSRE2 that seem to be major components of the nurse shark genome. Screening the library with single-copy or multi-copy gene probes showed 6-28 primary positive clones per probe of which 50-90% were true positives, demonstrating that the BAC library is representative of the different regions of the nurse shark genome. Furthermore, some BAC clones contained multiple genes, making physical mapping feasible. We have constructed a deep-coverage, high-quality, large insert, and publicly available BAC library for a cartilaginous fish. It will be very useful to the scientific community interested in shark genomic structure, comparative genomics, and functional studies. We found two new groups of repetitive elements specific to the nurse shark genome, which may contribute to the architecture and evolution of the nurse shark genome.
Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.
Chin, Chen-Shan; Alexander, David H; Marks, Patrick; Klammer, Aaron A; Drake, James; Heiner, Cheryl; Clum, Alicia; Copeland, Alex; Huddleston, John; Eichler, Evan E; Turner, Stephen W; Korlach, Jonas
2013-06-01
We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.
Houtz, Robert L.
1999-01-01
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) .sup..epsilon. N-methyltransferase (protein methylase III or Rubisco LSMT) from a plant which has a des(methyl) lysyl residue in the LS is disclosed. In addition, the full-length cDNA clones for Rubisco LSMT are disclosed. Transgenic plants and methods of producing same which have the Rubisco LSMT gene inserted into the DNA are also provided. Further, methods of inactivating the enzymatic activity of Rubisco LSMT are also disclosed.
Efficient preparation of shuffled DNA libraries through recombination (Gateway) cloning.
Lehtonen, Soili I; Taskinen, Barbara; Ojala, Elina; Kukkurainen, Sampo; Rahikainen, Rolle; Riihimäki, Tiina A; Laitinen, Olli H; Kulomaa, Markku S; Hytönen, Vesa P
2015-01-01
Efficient and robust subcloning is essential for the construction of high-diversity DNA libraries in the field of directed evolution. We have developed a more efficient method for the subcloning of DNA-shuffled libraries by employing recombination cloning (Gateway). The Gateway cloning procedure was performed directly after the gene reassembly reaction, without additional purification and amplification steps, thus simplifying the conventional DNA shuffling protocols. Recombination-based cloning, directly from the heterologous reassembly reaction, conserved the high quality of the library and reduced the time required for the library construction. The described method is generally compatible for the construction of DNA-shuffled gene libraries. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Assembling and auditing a comprehensive DNA barcode reference library for European marine fishes.
Oliveira, L M; Knebelsberger, T; Landi, M; Soares, P; Raupach, M J; Costa, F O
2016-12-01
A large-scale comprehensive reference library of DNA barcodes for European marine fishes was assembled, allowing the evaluation of taxonomic uncertainties and species genetic diversity that were otherwise hidden in geographically restricted studies. A total of 4118 DNA barcodes were assigned to 358 species generating 366 Barcode Index Numbers (BIN). Initial examination revealed as much as 141 BIN discordances (more than one species in each BIN). After implementing an auditing and five-grade (A-E) annotation protocol, the number of discordant species BINs was reduced to 44 (13% grade E), while concordant species BINs amounted to 271 (78% grades A and B) and 14 other had insufficient data (grade D). Fifteen species displayed comparatively high intraspecific divergences ranging from 2·6 to 18·5% (grade C), which is biologically paramount information to be considered in fish species monitoring and stock assessment. On balance, this compilation contributed to the detection of 59 European fish species probably in need of taxonomic clarification or re-evaluation. The generalized implementation of an auditing and annotation protocol for reference libraries of DNA barcodes is recommended. © 2016 The Fisheries Society of the British Isles.
Hurrelbrink, R J; Nestorowicz, A; McMinn, P C
1999-12-01
An infectious cDNA clone of Murray Valley encephalitis virus prototype strain 1-51 (MVE-1-51) was constructed by stably inserting genome-length cDNA into the low-copy-number plasmid vector pMC18. Designated pMVE-1-51, the clone consisted of genome-length cDNA of MVE-1-51 under the control of a T7 RNA polymerase promoter. The clone was constructed by using existing components of a cDNA library, in addition to cDNA of the 3' terminus derived by RT-PCR of poly(A)-tailed viral RNA. Upon comparison with other flavivirus sequences, the previously undetermined sequence of the 3' UTR was found to contain elements conserved throughout the genus FLAVIVIRUS: RNA transcribed from pMVE-1-51 and subsequently transfected into BHK-21 cells generated infectious virus. The plaque morphology, replication kinetics and antigenic profile of clone-derived virus (CDV-1-51) was similar to the parental virus in vitro. Furthermore, the virulence properties of CDV-1-51 and MVE-1-51 (LD(50) values and mortality profiles) were found to be identical in vivo in the mouse model. Through site-directed mutagenesis, the infectious clone should serve as a valuable tool for investigating the molecular determinants of virulence in MVE virus.
Rapid discrimination of sequences flanking and within T-DNA insertions in the Arabidopsis genome.
Ponce, M R; Quesada, V; Micol, J L
1998-05-01
An improvement to previous methods for recovering Arabidopsis thaliana genomic DNA flanking T-DNA insertions is presented that allows for the avoidance of some of the cloning difficulties caused by the concatameric nature of T-DNA inserts. The principle of the procedure is to categorize by size restriction fragments of mutant DNA, produced in separate digestions with NdeI and Bst1107I. Given that the sites for these two enzymes are contiguous within the pGV3850:1003 T-DNA construct, the restriction fragments obtained fall into two categories: those showing identical size in both digestions, which correspond to sequences internal to T-DNA concatamers; and those of different sizes, that contain the junctions between plant DNA and the T-DNA insert. Such a criterion makes it possible to easily distinguish the digestion products corresponding to internal T-DNA parts, which do not deserve further attention, and those which presumably include a segment of the locus of interest. Discrimination between restriction fragments of genomic mutant DNA can be made on rescued plasmids, inverse PCR amplification products or bands in a genomic blot.
Genome-wide analysis of Tol2 transposon reintegration in zebrafish.
Kondrychyn, Igor; Garcia-Lecea, Marta; Emelyanov, Alexander; Parinov, Sergey; Korzh, Vladimir
2009-09-08
Tol2, a member of the hAT family of transposons, has become a useful tool for genetic manipulation of model animals, but information about its interactions with vertebrate genomes is still limited. Furthermore, published reports on Tol2 have mainly been based on random integration of the transposon system after co-injection of a plasmid DNA harboring the transposon and a transposase mRNA. It is important to understand how Tol2 would behave upon activation after integration into the genome. We performed a large-scale enhancer trap (ET) screen and generated 338 insertions of the Tol2 transposon-based ET cassette into the zebrafish genome. These insertions were generated by remobilizing the transposon from two different donor sites in two transgenic lines. We found that 39% of Tol2 insertions occurred in transcription units, mostly into introns. Analysis of the transposon target sites revealed no strict specificity at the DNA sequence level. However, Tol2 was prone to target AT-rich regions with weak palindromic consensus sequences centered at the insertion site. Our systematic analysis of sequential remobilizations of the Tol2 transposon from two independent sites within a vertebrate genome has revealed properties such as a tendency to integrate into transcription units and into AT-rich palindrome-like sequences. This information will influence the development of various applications involving DNA transposons and Tol2 in particular.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weiss, S.B.
Our laboratory has explored the use of short DNA oligomers as targets for activated polycyclic aromatic hydrocarbons, such as benzo(a)pyrene diol epoxide (BPDE), in order to detect alterations in DNA sequence arrangement. In this model system, oligomers alkylated with (+)-BPDE are ligated into M13 viral DNA and used to transfect Escherichia coli. These cells are plated on agar, incubated at 37/sup 0/C, progeny viral clones are selected, amplified, and the viral DNAs isolated are sequenced at the site of oligomer insertion. We have devised a procedure for the preparation of unique duplex DNA oligomers such that the site of oligomermore » alkylation is specific for a single deoxynucleotide species in the two DNA strands. The procedure for oligomer assembly also allows us to vary the position of the alkylated residue in each of the two strands. Using our model system, the results obtained over the past year can be summarized as follows. When nonalkylated oligomer constructs are ligated into M13 viral DNA and used to transfect E. coli, no modifications in DNA sequence arrangement are detected in progeny viral DNAs. On the other hand, with oligomer constructs containing BP-adducts two major types of modifications in DNA sequence arrangement were observed: (1) large deletions, and (2) nonhomologous (illegitimate) recombinants. Both of these DNA modifications result in the complete removal of the oligomer insert. Transfection of E. coli that are recA/sup -/ does not alter these DNA modifications, therefore, it appears that the deletions and recombinants induced by the alkylated inserts are not under control of the RecA gene. As the distance between the alkylated residues in the duplex strands is increased, the number of recombinant events detected is reduced. In addition to the above types of DNA modifications, restoration of the original nucleotide sequence in the alkylated construct was also observed in progeny viral DNAs. 7 refs., 6 figs., 2 tabs.« less
Method for construction of normalized cDNA libraries
Soares, Marcelo B.; Efstratiadis, Argiris
1996-01-01
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library.
Method for construction of normalized cDNA libraries
Soares, M.B.; Efstratiadis, A.
1996-01-09
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form. The method comprises: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3` noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. 4 figs.
Wilson, John-James; Sing, Kong-Wah; Sofian-Azirun, Mohd
2013-01-01
The objective of this study was to build a DNA barcode reference library for the true butterflies of Peninsula Malaysia and assess the value of attaching subspecies names to DNA barcode records. A new DNA barcode library was constructed with butterflies from the Museum of Zoology, University of Malaya collection. The library was analysed in conjunction with publicly available DNA barcodes from other Asia-Pacific localities to test the ability of the DNA barcodes to discriminate species and subspecies. Analyses confirmed the capacity of the new DNA barcode reference library to distinguish the vast majority of species (92%) and revealed that most subspecies possessed unique DNA barcodes (84%). In some cases conspecific subspecies exhibited genetic distances between their DNA barcodes that are typically seen between species, and these were often taxa that have previously been regarded as full species. Subspecies designations as shorthand for geographically and morphologically differentiated groups provide a useful heuristic for assessing how such groups correlate with clustering patterns of DNA barcodes, especially as the number of DNA barcodes per species in reference libraries increases. Our study demonstrates the value in attaching subspecies names to DNA barcode records as they can reveal a history of taxonomic concepts and expose important units of biodiversity.
Wilson, John-James; Sing, Kong-Wah; Sofian-Azirun, Mohd
2013-01-01
The objective of this study was to build a DNA barcode reference library for the true butterflies of Peninsula Malaysia and assess the value of attaching subspecies names to DNA barcode records. A new DNA barcode library was constructed with butterflies from the Museum of Zoology, University of Malaya collection. The library was analysed in conjunction with publicly available DNA barcodes from other Asia-Pacific localities to test the ability of the DNA barcodes to discriminate species and subspecies. Analyses confirmed the capacity of the new DNA barcode reference library to distinguish the vast majority of species (92%) and revealed that most subspecies possessed unique DNA barcodes (84%). In some cases conspecific subspecies exhibited genetic distances between their DNA barcodes that are typically seen between species, and these were often taxa that have previously been regarded as full species. Subspecies designations as shorthand for geographically and morphologically differentiated groups provide a useful heuristic for assessing how such groups correlate with clustering patterns of DNA barcodes, especially as the number of DNA barcodes per species in reference libraries increases. Our study demonstrates the value in attaching subspecies names to DNA barcode records as they can reveal a history of taxonomic concepts and expose important units of biodiversity. PMID:24282514
2010-01-01
Background The Asteraceae represents an important plant family with respect to the numbers of species present in the wild and used by man. Nonetheless, genomic resources for Asteraceae species are relatively underdeveloped, hampering within species genetic studies as well as comparative genomics studies at the family level. So far, six BAC libraries have been described for the main crops of the family, i.e. lettuce and sunflower. Here we present the characterization of BAC libraries of chicory (Cichorium intybus L.) constructed from two genotypes differing in traits related to sexual and vegetative reproduction. Resolving the molecular mechanisms underlying traits controlling the reproductive system of chicory is a key determinant for hybrid development, and more generally will provide new insights into these traits, which are poorly investigated so far at the molecular level in Asteraceae. Findings Two bacterial artificial chromosome (BAC) libraries, CinS2S2 and CinS1S4, were constructed from HindIII-digested high molecular weight DNA of the contrasting genotypes C15 and C30.01, respectively. C15 was hermaphrodite, non-embryogenic, and S2S2 for the S-locus implicated in self-incompatibility, whereas C30.01 was male sterile, embryogenic, and S1S4. The CinS2S2 and CinS1S4 libraries contain 89,088 and 81,408 clones. Mean insert sizes of the CinS2S2 and CinS1S4 clones are 90 and 120 kb, respectively, and provide together a coverage of 12.3 haploid genome equivalents. Contamination with mitochondrial and chloroplast DNA sequences was evaluated with four mitochondrial and four chloroplast specific probes, and was estimated to be 0.024% and 1.00% for the CinS2S2 library, and 0.028% and 2.35% for the CinS1S4 library. Using two single copy genes putatively implicated in somatic embryogenesis, screening of both libraries resulted in detection of 12 and 13 positive clones for each gene, in accordance with expected numbers. Conclusions This indicated that both BAC libraries are valuable tools for molecular studies in chicory, one goal being the positional cloning of the S-locus in this Asteraceae species. PMID:20701751
Gonthier, Lucy; Bellec, Arnaud; Blassiau, Christelle; Prat, Elisa; Helmstetter, Nicolas; Rambaud, Caroline; Huss, Brigitte; Hendriks, Theo; Bergès, Hélène; Quillet, Marie-Christine
2010-08-11
The Asteraceae represents an important plant family with respect to the numbers of species present in the wild and used by man. Nonetheless, genomic resources for Asteraceae species are relatively underdeveloped, hampering within species genetic studies as well as comparative genomics studies at the family level. So far, six BAC libraries have been described for the main crops of the family, i.e. lettuce and sunflower. Here we present the characterization of BAC libraries of chicory (Cichorium intybus L.) constructed from two genotypes differing in traits related to sexual and vegetative reproduction. Resolving the molecular mechanisms underlying traits controlling the reproductive system of chicory is a key determinant for hybrid development, and more generally will provide new insights into these traits, which are poorly investigated so far at the molecular level in Asteraceae. Two bacterial artificial chromosome (BAC) libraries, CinS2S2 and CinS1S4, were constructed from HindIII-digested high molecular weight DNA of the contrasting genotypes C15 and C30.01, respectively. C15 was hermaphrodite, non-embryogenic, and S2S2 for the S-locus implicated in self-incompatibility, whereas C30.01 was male sterile, embryogenic, and S1S4. The CinS2S2 and CinS1S4 libraries contain 89,088 and 81,408 clones. Mean insert sizes of the CinS2S2 and CinS1S4 clones are 90 and 120 kb, respectively, and provide together a coverage of 12.3 haploid genome equivalents. Contamination with mitochondrial and chloroplast DNA sequences was evaluated with four mitochondrial and four chloroplast specific probes, and was estimated to be 0.024% and 1.00% for the CinS2S2 library, and 0.028% and 2.35% for the CinS1S4 library. Using two single copy genes putatively implicated in somatic embryogenesis, screening of both libraries resulted in detection of 12 and 13 positive clones for each gene, in accordance with expected numbers. This indicated that both BAC libraries are valuable tools for molecular studies in chicory, one goal being the positional cloning of the S-locus in this Asteraceae species.
Rabah, Samar O; Lee, Chaehee; Hajrah, Nahid H; Makki, Rania M; Alharby, Hesham F; Alhebshi, Alawiah M; Sabir, Jamal S M; Jansen, Robert K; Ruhlman, Tracey A
2017-11-01
In plant evolution, intracellular gene transfer (IGT) is a prevalent, ongoing process. While nuclear and mitochondrial genomes are known to integrate foreign DNA via IGT and horizontal gene transfer (HGT), plastid genomes (plastomes) have resisted foreign DNA incorporation and only recently has IGT been uncovered in the plastomes of a few land plants. In this study, we completed plastome sequences for l0 crop species and describe a number of structural features including variation in gene and intron content, inversions, and expansion and contraction of the inverted repeat (IR). We identified a putative in cinnamon ( J. Presl) and other sequenced Lauraceae and an apparent functional transfer of to the nucleus of quinoa ( Willd.). In the orchard tree cashew ( L.), we report the insertion of an ∼6.7-kb fragment of mitochondrial DNA into the plastome IR. BLASTn analyses returned high identity hits to mitogenome sequences including an intact open reading frame. Using three plastome markers for five species of , we generated a phylogeny to investigate the distribution and timing of the insertion. Four species share the insertion, suggesting that this event occurred <20 million yr ago in a single clade in the genus. Our study extends the observation of mitochondrial to plastome IGT to include long-lived tree species. While previous studies have suggested possible mechanisms facilitating IGT to the plastome, more examples of this phenomenon, along with more complete mitogenome sequences, will be required before a common, or variable, mechanism can be elucidated. Copyright © 2017 Crop Science Society of America.
The detection of large deletions or duplications in genomic DNA.
Armour, J A L; Barton, D E; Cockburn, D J; Taylor, G R
2002-11-01
While methods for the detection of point mutations and small insertions or deletions in genomic DNA are well established, the detection of larger (>100 bp) genomic duplications or deletions can be more difficult. Most mutation scanning methods use PCR as a first step, but the subsequent analyses are usually qualitative rather than quantitative. Gene dosage methods based on PCR need to be quantitative (i.e., they should report molar quantities of starting material) or semi-quantitative (i.e., they should report gene dosage relative to an internal standard). Without some sort of quantitation, heterozygous deletions and duplications may be overlooked and therefore be under-ascertained. Gene dosage methods provide the additional benefit of reporting allele drop-out in the PCR. This could impact on SNP surveys, where large-scale genotyping may miss null alleles. Here we review recent developments in techniques for the detection of this type of mutation and compare their relative strengths and weaknesses. We emphasize that comprehensive mutation analysis should include scanning for large insertions and deletions and duplications. Copyright 2002 Wiley-Liss, Inc.
Xu, Chao; Dong, Wenpan; Shi, Shuo; Cheng, Tao; Li, Changhao; Liu, Yanlei; Wu, Ping; Wu, Hongkun; Gao, Peng; Zhou, Shiliang
2015-11-01
A well-covered reference library is crucial for successful identification of species by DNA barcoding. The biggest difficulty in building such a reference library is the lack of materials of organisms. Herbarium collections are potentially an enormous resource of materials. In this study, we demonstrate that it is likely to build such reference libraries using the reconstructed (self-primed PCR amplified) DNA from the herbarium specimens. We used 179 rosaceous specimens to test the effects of DNA reconstruction, 420 randomly sampled specimens to estimate the usable percentage and another 223 specimens of true cherries (Cerasus, Rosaceae) to test the coverage of usable specimens to the species. The barcode rbcLb (the central four-sevenths of rbcL gene) and matK was each amplified in two halves and sequenced on Roche GS 454 FLX+. DNA from the herbarium specimens was typically shorter than 300 bp. DNA reconstruction enabled amplification fragments of 400-500 bp without bringing or inducing any sequence errors. About one-third of specimens in the national herbarium of China (PE) were proven usable after DNA reconstruction. The specimens in PE cover all Chinese true cherry species and 91.5% of vascular species listed in Flora of China. It is very possible to build well-covered reference libraries for DNA barcoding of vascular species in China. As exemplified in this study, DNA reconstruction and DNA-labelled next-generation sequencing can accelerate the construction of local reference libraries. By putting the local reference libraries together, a global library for DNA barcoding becomes closer to reality. © 2015 John Wiley & Sons Ltd.
Optical mapping and its potential for large-scale sequencing projects.
Aston, C; Mishra, B; Schwartz, D C
1999-07-01
Physical mapping has been rediscovered as an important component of large-scale sequencing projects. Restriction maps provide landmark sequences at defined intervals, and high-resolution restriction maps can be assembled from ensembles of single molecules by optical means. Such optical maps can be constructed from both large-insert clones and genomic DNA, and are used as a scaffold for accurately aligning sequence contigs generated by shotgun sequencing.
Wang, H; Miao, S; Chen, D; Wang, L; Koide, S S
1999-10-06
The gene (HSD-1) coding a human sperm membrane protein (hSMP-1) was isolated from a human testis cDNA expression library using antibodies found in the serum of an infertile woman. HSD-1 was localized to a single locus on chromosome 9 and assigned to band 9p12-p13 by fluorescent in situ hybridization (FISH) mapping and DAPI (4,6-diamidino-2-phenylindole) banding, using rat/human somatic cell hybrids and metaphase chromosomes of human lymphocytes. In rescreening a testis lambdagt10 cDNA expression library, the full-length cDNA (HSD-1) and several truncated cDNAs with heterologous regions were isolated from positive clones. The heterology consisted of deletion, insertion and alteration of the 5'-end. These heterologous truncated fragments may be produced by alternative splicing of mRNAs. Two recombinant prokaryotic expression vectors were constructed with one of the heterologous fragment (clone #26) with and without the alternative 5'-end. Escherichia coli transfected with the construct containing the alternative 5'-end failed to produce the recombinant product, whereas those transfected with the vector lacking the 5'-end produced hSMP-1. DNASIS analysis of the structure of #26 mRNA suggests that the 5'-end has a stable secondary configuration that may maintain the mRNA in an inactivated state, whereby hindering its translation and preventing the expression of the gene.
Global mapping of transposon location.
Gabriel, Abram; Dapprich, Johannes; Kunkel, Mark; Gresham, David; Pratt, Stephen C; Dunham, Maitreya J
2006-12-15
Transposable genetic elements are ubiquitous, yet their presence or absence at any given position within a genome can vary between individual cells, tissues, or strains. Transposable elements have profound impacts on host genomes by altering gene expression, assisting in genomic rearrangements, causing insertional mutations, and serving as sources of phenotypic variation. Characterizing a genome's full complement of transposons requires whole genome sequencing, precluding simple studies of the impact of transposition on interindividual variation. Here, we describe a global mapping approach for identifying transposon locations in any genome, using a combination of transposon-specific DNA extraction and microarray-based comparative hybridization analysis. We use this approach to map the repertoire of endogenous transposons in different laboratory strains of Saccharomyces cerevisiae and demonstrate that transposons are a source of extensive genomic variation. We also apply this method to mapping bacterial transposon insertion sites in a yeast genomic library. This unique whole genome view of transposon location will facilitate our exploration of transposon dynamics, as well as defining bases for individual differences and adaptive potential.
NASA Astrophysics Data System (ADS)
Panicali, Dennis; Paoletti, Enzo
1982-08-01
We have constructed recombinant vaccinia viruses containing the thymidine kinase gene from herpes simplex virus. The gene was inserted into the genome of a variant of vaccinia virus that had undergone spontaneous deletion as well as into the 120-megadalton genome of the large prototypic vaccinia variant. This was accomplished via in vivo recombination by contransfection of eukaryotic tissue culture cells with cloned BamHI-digested thymidine kinase gene from herpes simplex virus containing flanking vaccinia virus DNA sequences and infectious rescuing vaccinia virus. Pure populations of the recombinant viruses were obtained by replica filter techniques or by growth of the recombinant virus in biochemically selective medium. The herpes simplex virus thymidine kinase gene, as an insert in vaccinia virus, is transcribed in vivo and in vitro, and the fidelity of in vivo transcription into a functional gene product was detected by the phosphorylation of 5-[125I]iodo-2'-deoxycytidine.
Application of Biocatalysis to on-DNA Carbohydrate Library Synthesis.
Thomas, Baptiste; Lu, Xiaojie; Birmingham, William R; Huang, Kun; Both, Peter; Reyes Martinez, Juana Elizabeth; Young, Robert J; Davie, Christopher P; Flitsch, Sabine L
2017-05-04
DNA-encoded libraries are increasingly used for the discovery of bioactive lead compounds in high-throughput screening programs against specific biological targets. Although a number of libraries are now available, they cover limited chemical space due to bias in ease of synthesis and the lack of chemical reactions that are compatible with DNA tagging. For example, compound libraries rarely contain complex biomolecules such as carbohydrates with high levels of functionality, stereochemistry, and hydrophilicity. By using biocatalysis in combination with chemical methods, we aimed to significantly expand chemical space and generate generic libraries with potentially better biocompatibility. For DNA-encoded libraries, biocatalysis is particularly advantageous, as it is highly selective and can be performed in aqueous environments, which is an essential feature for this split-and-mix library technology. In this work, we demonstrated the application of biocatalysis for the on-DNA synthesis of carbohydrate-based libraries by using enzymatic oxidation and glycosylation in combination with traditional organic chemistry. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
When the Human Immunodeficiency Virus (HIV) infects a cell, the virus inserts a copy of its genetic material into the host cell’s DNA. The inserted genetic material, which is also called a provirus, is used to produce new viruses. Because the viral DNA can be inserted at many sites in the host cell DNA, the site of integration marks each infected cell. Patients infected with
Houtz, Robert L.
2001-01-01
The gene sequence for ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) large subunit (LS) .sup..epsilon. N-methyltansferase (protein methylase III or Rubisco LSMT) from a plant which has a des(methyl) lysyl residue in the LS is disclosed. In addition, the full-length cDNA clones for Rubisco LSMT are disclosed. Transgenic plants and methods of producing same which have the Rubisco LSMT gene inserted into the DNA are also provided. Further, methods of inactivating the enzymatic activity of Rubisco LSMT are also disclosed.
Zhao, Wei; Li, Xin; Liu, Wen-Hui; Zhao, Jian; Jin, Yi-Ming; Sui, Ting-Ting
2014-09-01
Human epithelial colorectal adenocarcinoma (Caco-2) cells are widely used as an in vitro model of the human small intestinal mucosa. Caco-2 cells are host cells of the human astrovirus (HAstV) and other enteroviruses. High quality cDNA libraries are pertinent resources and critical tools for protein-protein interaction research, but are currently unavailable for Caco-2 cells. To construct a three-open reading frame, full length-expression cDNA library from the Caco-2 cell line for application to HAstV protein-protein interaction screening, total RNA was extracted from Caco-2 cells. The switching mechanism at the 5' end of the RNA transcript technique was used for cDNA synthesis. Double-stranded cDNA was digested by Sfi I and ligated to reconstruct a pGADT7-Sfi I three-frame vector. The ligation mixture was transformed into Escherichia coli HST08 premium electro cells by electroporation to construct the primary cDNA library. The library capacity was 1.0×10(6)clones. Gel electrophoresis results indicated that the fragments ranged from 0.5kb to 4.2kb. Randomly picked clones show that the recombination rate was 100%. The three-frame primary cDNA library plasmid mixture (5×10(5)cfu) was also transformed into E. coli HST08 premium electro cells, and all clones were harvested to amplify the cDNA library. To detect the sufficiency of the cDNA library, HAstV capsid protein as bait was screened and tested against the Caco-2 cDNA library by a yeast two-hybrid (Y2H) system. A total of 20 proteins were found to interact with the capsid protein. These results showed that a high-quality three-frame cDNA library from Caco-2 cells was successfully constructed. This library was efficient for the application to the Y2H system, and could be used for future research. Copyright © 2014 Elsevier B.V. All rights reserved.
Rise, Matthew L.; von Schalburg, Kristian R.; Brown, Gordon D.; Mawer, Melanie A.; Devlin, Robert H.; Kuipers, Nathanael; Busby, Maura; Beetz-Sargent, Marianne; Alberto, Roberto; Gibbs, A. Ross; Hunt, Peter; Shukin, Robert; Zeznik, Jeffrey A.; Nelson, Colleen; Jones, Simon R.M.; Smailus, Duane E.; Jones, Steven J.M.; Schein, Jacqueline E.; Marra, Marco A.; Butterfield, Yaron S.N.; Stott, Jeff M.; Ng, Siemon H.S.; Davidson, William S.; Koop, Ben F.
2004-01-01
We report 80,388 ESTs from 23 Atlantic salmon (Salmo salar) cDNA libraries (61,819 ESTs), 6 rainbow trout (Oncorhynchus mykiss) cDNA libraries (14,544 ESTs), 2 chinook salmon (Oncorhynchus tshawytscha) cDNA libraries (1317 ESTs), 2 sockeye salmon (Oncorhynchus nerka) cDNA libraries (1243 ESTs), and 2 lake whitefish (Coregonus clupeaformis) cDNA libraries (1465 ESTs). The majority of these are 3′ sequences, allowing discrimination between paralogs arising from a recent genome duplication in the salmonid lineage. Sequence assembly reveals 28,710 different S. salar, 8981 O. mykiss, 1085 O. tshawytscha, 520 O. nerka, and 1176 C. clupeaformis putative transcripts. We annotate the submitted portion of our EST database by molecular function. Higher- and lower-molecular-weight fractions of libraries are shown to contain distinct gene sets, and higher rates of gene discovery are associated with higher-molecular weight libraries. Pyloric caecum library group annotations indicate this organ may function in redox control and as a barrier against systemic uptake of xenobiotics. A microarray is described, containing 7356 salmonid elements representing 3557 different cDNAs. Analyses of cross-species hybridizations to this cDNA microarray indicate that this resource may be used for studies involving all salmonids. PMID:14962987
cDNA library construction of two human Demodexspecies.
Niu, DongLing; Wang, RuiLing; Zhao, YaE; Yang, Rui; Hu, Li; Lei, YuYang; Dan, WeiChao
2017-06-01
The research of Demodex, a type of pathogen causing various dermatoses in animals and human beings, is lacking at RNA level. This study aims at extracting RNA and constructing cDNA library for Demodex. First, P. cuniculiand D. farinaewere mixed to establish homogenization method for RNA extraction. Second, D. folliculorumand D. breviswere collected and preserved in Trizol, which were mixed with D. farinaerespectively to extract RNA. Finally, cDNA library was constructed and its quality was assessed. The results indicated that for D. folliculorum& D. farinae, the recombination rate of cDNA library was 90.67% and the library titer was 7.50 × 104 pfu/ml. 17 of the 59 positive clones were predicted to be of D. folliculorum; For D. brevis& D. farinae, the recombination rate was 90.96% and the library titer was 7.85 x104 pfu/ml. 40 of the 59 positive clones were predicted to be of D. brevis. Further detection by specific primers demonstrated that mtDNA cox1, cox3and ATP6 detected from cDNA libraries had 96.52%-99.73% identities with the corresponding sequences in GenBank. In conclusion, the cDNA libraries constructed for Demodexmixed with D. farinaewere successful and could satisfy the requirements for functional genes detection.
Recent phylogenetic studies have used DNA as the target molecule for the development of environmental 16S rDNA clone libraries. As DNA may persist in the environment, DNA-based libraries cannot be used to identify metabolically active bacteria in water systems. In this study, a...
Molecular architecture of classical cytological landmarks: Centromeres and telomeres
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meyne, J.
1994-11-01
Both the human telomere repeat and the pericentromeric repeat sequence (GGAAT)n were isolated based on evolutionary conservation. Their isolation was based on the premise that chromosomal features as structurally and functionally important as telomeres and centromeres should be highly conserved. Both sequences were isolated by high stringency screening of a human repetitive DNA library with rodent repetitive DNA. The pHuR library (plasmid Human Repeat) used for this project was enriched for repetitive DNA by using a modification of the standard DNA library preparation method. Usually DNA for a library is cut with restriction enzymes, packaged, infected, and the library ismore » screened. A problem with this approach is that many tandem repeats don`t have any (or many) common restriction sites. Therefore, many of the repeat sequences will not be represented in the library because they are not restricted to a viable length for the vector used. To prepare the pHuR library, human DNA was mechanically sheared to a small size. These relatively short DNA fragments were denatured and then renatured to C{sub o}t 50. Theoretically only repetitive DNA sequences should renature under C{sub o}t 50 conditions. The single-stranded regions were digested using S1 nuclease, leaving the double-stranded, renatured repeat sequences.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tanaka, Yoshiyuki; Matsuoka, Makoto; Yamanoto, Naoki
A cDNA clone for phenylalanine ammonia-lyase (PAL) induced in wounded sweet potato (Ipomoea batatas Lam.) root was obtained by immunoscreening a cDNA library. The protein produced in Escherichia coli cells containing the plasmid pPAL02 was indistinguishable from sweet potato PAL as judged by Ouchterlony double diffusion assays. The M{sub r} of its subunit was 77,000. The cells converted ({sup 14}C)-L-phenylalanine into ({sup 14}C)-t-cinnamic acid and PAL activity was detected in the homogenate of the cells. The activity was dependent on the presence of the pPAL02 plasmid DNA. The nucleotide sequence of the cDNA contained a 2,121-base pair (bp) open-reading framemore » capable of coding for a polypeptide with 707 amino acids (M{sub r} 77,137), a 22-bp 5{prime}-noncoding region and a 207-bp 3{prime}-noncoding region. The results suggest that the insert DNA fully encoded the amino acid sequence for sweet potato PAL that is induced by wounding. Comparison of the deduced amino acid sequence with that of a PAL cDNA fragment from Phaseolus vulgaris revealed 78.9% homology. The sequence from amino acid residues 258 to 494 was highly conserved, showing 90.7% homology.« less
Randrianjatovo-Gbalou, Irina; Rosario, Sandrine; Sismeiro, Odile; Varet, Hugo; Legendre, Rachel; Coppée, Jean-Yves; Huteau, Valérie; Pochet, Sylvie; Delarue, Marc
2018-05-21
Nucleic acid aptamers, especially RNA, exhibit valuable advantages compared to protein therapeutics in terms of size, affinity and specificity. However, the synthesis of libraries of large random RNAs is still difficult and expensive. The engineering of polymerases able to directly generate these libraries has the potential to replace the chemical synthesis approach. Here, we start with a DNA polymerase that already displays a significant template-free nucleotidyltransferase activity, human DNA polymerase theta, and we mutate it based on the knowledge of its three-dimensional structure as well as previous mutational studies on members of the same polA family. One mutant exhibited a high tolerance towards ribonucleotides (NTPs) and displayed an efficient ribonucleotidyltransferase activity that resulted in the assembly of long RNA polymers. HPLC analysis and RNA sequencing of the products were used to quantify the incorporation of the four NTPs as a function of initial NTP concentrations and established the randomness of each generated nucleic acid sequence. The same mutant revealed a propensity to accept other modified nucleotides and to extend them in long fragments. Hence, this mutant can deliver random natural and modified RNA polymers libraries ready to use for SELEX, with custom lengths and balanced or unbalanced ratios.
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries
Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P
2008-01-01
Background Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. Results We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. Conclusion EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects. PMID:18402700
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries.
Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P
2008-04-10
Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.
Shinozuka, Hiroshi; Cogan, Noel O I; Shinozuka, Maiko; Marshall, Alexis; Kay, Pippa; Lin, Yi-Han; Spangenberg, German C; Forster, John W
2015-04-11
Fragmentation at random nucleotide locations is an essential process for preparation of DNA libraries to be used on massively parallel short-read DNA sequencing platforms. Although instruments for physical shearing, such as the Covaris S2 focused-ultrasonicator system, and products for enzymatic shearing, such as the Nextera technology and NEBNext dsDNA Fragmentase kit, are commercially available, a simple and inexpensive method is desirable for high-throughput sequencing library preparation. MspJI is a recently characterised restriction enzyme which recognises the sequence motif CNNR (where R = G or A) when the first base is modified to 5-methylcytosine or 5-hydroxymethylcytosine. A semi-random enzymatic DNA amplicon fragmentation method was developed based on the unique cleavage properties of MspJI. In this method, random incorporation of 5-methyl-2'-deoxycytidine-5'-triphosphate is achieved through DNA amplification with DNA polymerase, followed by DNA digestion with MspJI. Due to the recognition sequence of the enzyme, DNA amplicons are fragmented in a relatively sequence-independent manner. The size range of the resulting fragments was capable of control through optimisation of 5-methyl-2'-deoxycytidine-5'-triphosphate concentration in the reaction mixture. A library suitable for sequencing using the Illumina MiSeq platform was prepared and processed using the proposed method. Alignment of generated short reads to a reference sequence demonstrated a relatively high level of random fragmentation. The proposed method may be performed with standard laboratory equipment. Although the uniformity of coverage was slightly inferior to the Covaris physical shearing procedure, due to efficiencies of cost and labour, the method may be more suitable than existing approaches for implementation in large-scale sequencing activities, such as bacterial artificial chromosome (BAC)-based genome sequence assembly, pan-genomic studies and locus-targeted genotyping-by-sequencing.
Clemans, Daniel L.; Kolenbrander, Paul E.; Debabov, Dmitri V.; Zhang, Qunying; Lunsford, R. Dwayne; Sakone, Holly; Whittaker, Catherine J.; Heaton, Michael P.; Neuhaus, Francis C.
1999-01-01
Most human oral viridans streptococci participate in intrageneric coaggregations, the cell-to-cell adherence among genetically distinct streptococci. Two genes relevant to these intrageneric coaggregations were identified by transposon Tn916 mutagenesis of Streptococcus gordonii DL1 (Challis). A 626-bp sequence flanking the left end of the transposon was homologous to dltA and dltB of Lactobacillus rhamnosus ATCC 7469 (formerly called Lactobacillus casei). A 60-kb probe based on this flanking sequence was used to identify the homologous DNA in a fosmid library of S. gordonii DL1. This DNA encoded d-alanine-d-alanyl carrier protein ligase that was expressed in Escherichia coli from the fosmid clone. The cloned streptococcal dltA was disrupted by inserting an ermAM cassette, and then it was linearized and transformed into S. gordonii DL1 for allelic replacement. Erythromycin-resistant transformants containing a single insertion in dltA exhibited a loss of d-alanyl esters in lipoteichoic acid (LTA) and a loss of intrageneric coaggregation. This phenotype was correlated with the loss of a 100-kDa surface protein reported previously to be involved in mediating intrageneric coaggregation (C. J. Whittaker, D. L. Clemans, and P. E. Kolenbrander, Infect. Immun. 64:4137–4142, 1996). The mutants retained the parental ability to participate in intergeneric coaggregation with human oral actinomyces, indicating the specificity of the mutation in altering intrageneric coaggregations. The mutants were altered morphologically and exhibited aberrant cell septa in a variety of pleomorphs. The natural DNA transformation frequency was reduced 10-fold in these mutants. Southern analysis of chromosomal DNAs from various streptococcal species with the dltA probe revealed the presence of this gene in most viridans streptococci. Thus, it is hypothesized that d-alanyl LTA may provide binding sites for the putative 100-kDa adhesin and scaffolding for the proper presentation of this adhesin to mediate intrageneric coaggregation. PMID:10225909
Clemans, D L; Kolenbrander, P E; Debabov, D V; Zhang, Q; Lunsford, R D; Sakone, H; Whittaker, C J; Heaton, M P; Neuhaus, F C
1999-05-01
Most human oral viridans streptococci participate in intrageneric coaggregations, the cell-to-cell adherence among genetically distinct streptococci. Two genes relevant to these intrageneric coaggregations were identified by transposon Tn916 mutagenesis of Streptococcus gordonii DL1 (Challis). A 626-bp sequence flanking the left end of the transposon was homologous to dltA and dltB of Lactobacillus rhamnosus ATCC 7469 (formerly called Lactobacillus casei). A 60-kb probe based on this flanking sequence was used to identify the homologous DNA in a fosmid library of S. gordonii DL1. This DNA encoded D-alanine-D-alanyl carrier protein ligase that was expressed in Escherichia coli from the fosmid clone. The cloned streptococcal dltA was disrupted by inserting an ermAM cassette, and then it was linearized and transformed into S. gordonii DL1 for allelic replacement. Erythromycin-resistant transformants containing a single insertion in dltA exhibited a loss of D-alanyl esters in lipoteichoic acid (LTA) and a loss of intrageneric coaggregation. This phenotype was correlated with the loss of a 100-kDa surface protein reported previously to be involved in mediating intrageneric coaggregation (C. J. Whittaker, D. L. Clemans, and P. E. Kolenbrander, Infect. Immun. 64:4137-4142, 1996). The mutants retained the parental ability to participate in intergeneric coaggregation with human oral actinomyces, indicating the specificity of the mutation in altering intrageneric coaggregations. The mutants were altered morphologically and exhibited aberrant cell septa in a variety of pleomorphs. The natural DNA transformation frequency was reduced 10-fold in these mutants. Southern analysis of chromosomal DNAs from various streptococcal species with the dltA probe revealed the presence of this gene in most viridans streptococci. Thus, it is hypothesized that D-alanyl LTA may provide binding sites for the putative 100-kDa adhesin and scaffolding for the proper presentation of this adhesin to mediate intrageneric coaggregation.
Seashols-Williams, Sarah; Green, Raquel; Wohlfahrt, Denise; Brand, Angela; Tan-Torres, Antonio Limjuco; Nogales, Francy; Brooks, J Paul; Singh, Baneshwar
2018-05-17
Sequencing and classification of microbial taxa within forensically relevant biological fluids has the potential for applications in the forensic science and biomedical fields. The quantity of bacterial DNA from human samples is currently estimated based on quantity of total DNA isolated. This method can miscalculate bacterial DNA quantity due to the mixed nature of the sample, and consequently library preparation is often unreliable. We developed an assay that can accurately and specifically quantify bacterial DNA within a mixed sample for reliable 16S ribosomal DNA (16S rDNA) library preparation and high throughput sequencing (HTS). A qPCR method was optimized using universal 16S rDNA primers, and a commercially available bacterial community DNA standard was used to develop a precise standard curve. Following qPCR optimization, 16S rDNA libraries from saliva, vaginal and menstrual secretions, urine, and fecal matter were amplified and evaluated at various DNA concentrations; successful HTS data were generated with as low as 20 pg of bacterial DNA. Changes in bacterial DNA quantity did not impact observed relative abundances of major bacterial taxa, but relative abundance changes of minor taxa were observed. Accurate quantification of microbial DNA resulted in consistent, successful library preparations for HTS analysis. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The Essential Genome of Escherichia coli K-12.
Goodall, Emily C A; Robinson, Ashley; Johnston, Iain G; Jabbari, Sara; Turner, Keith A; Cunningham, Adam F; Lund, Peter A; Cole, Jeffrey A; Henderson, Ian R
2018-02-20
Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. IMPORTANCE Incentives to define lists of genes that are essential for bacterial survival include the identification of potential targets for antibacterial drug development, genes required for rapid growth for exploitation in biotechnology, and discovery of new biochemical pathways. To identify essential genes in Escherichia coli , we constructed a transposon mutant library of unprecedented density. Initial automated analysis of the resulting data revealed many discrepancies compared to the literature. We now report more extensive statistical analysis supported by both literature searches and detailed inspection of high-density TraDIS sequencing data for each putative essential gene for the E. coli model laboratory organism. This paper is important because it provides a better understanding of the essential genes of E. coli , reveals the limitations of relying on automated analysis alone, and provides a new standard for the analysis of TraDIS data. Copyright © 2018 Goodall et al.
Evaluation of vector-primed cDNA library production from microgram quantities of total RNA.
Kuo, Jonathan; Inman, Jason; Brownstein, Michael; Usdin, Ted B
2004-12-15
cDNA sequences are important for defining the coding region of genes, and full-length cDNA clones have proven to be useful for investigation of the function of gene products. We produced cDNA libraries containing 3.5-5 x 10(5) primary transformants, starting with 5 mug of total RNA prepared from mouse pituitary, adrenal, thymus, and pineal tissue, using a vector-primed cDNA synthesis method. Of approximately 1000 clones sequenced, approximately 20% contained the full open reading frames (ORFs) of known transcripts, based on the presence of the initiating methionine residue codon. The libraries were complex, with 94, 91, 83 and 55% of the clones from the thymus, adrenal, pineal and pituitary libraries, respectively, represented only once. Twenty-five full-length clones, not yet represented in the Mammalian Gene Collection, were identified. Thus, we have produced useful cDNA libraries for the isolation of full-length cDNA clones that are not yet available in the public domain, and demonstrated the utility of a simple method for making high-quality libraries from small amounts of starting material.
Fine organization of genomic regions tagged to the 5S rDNA locus of the bread wheat 5B chromosome.
Sergeeva, Ekaterina M; Shcherban, Andrey B; Adonina, Irina G; Nesterov, Michail A; Beletsky, Alexey V; Rakitin, Andrey L; Mardanov, Andrey V; Ravin, Nikolai V; Salina, Elena A
2017-11-14
The multigene family encoding the 5S rRNA, one of the most important structurally-functional part of the large ribosomal subunit, is an obligate component of all eukaryotic genomes. 5S rDNA has long been a favored target for cytological and phylogenetic studies due to the inherent peculiarities of its structural organization, such as the tandem arrays of repetitive units and their high interspecific divergence. The complex polyploid nature of the genome of bread wheat, Triticum aestivum, and the technically difficult task of sequencing clusters of tandem repeats mean that the detailed organization of extended genomic regions containing 5S rRNA genes remains unclear. This is despite the recent progress made in wheat genomic sequencing. Using pyrosequencing of BAC clones, in this work we studied the organization of two distinct 5S rDNA-tagged regions of the 5BS chromosome of bread wheat. Three BAC-clones containing 5S rDNA were identified in the 5BS chromosome-specific BAC-library of Triticum aestivum. Using the results of pyrosequencing and assembling, we obtained six 5S rDNA- containing contigs with a total length of 140,417 bp, and two sets (pools) of individual 5S rDNA sequences belonging to separate, but closely located genomic regions on the 5BS chromosome. Both regions are characterized by the presence of approximately 70-80 copies of 5S rDNA, however, they are completely different in their structural organization. The first region contained highly diverged short-type 5S rDNA units that were disrupted by multiple insertions of transposable elements. The second region contained the more conserved long-type 5S rDNA, organized as a single tandem array. FISH using probes specific to both 5S rDNA unit types showed differences in the distribution and intensity of signals on the chromosomes of polyploid wheat species and their diploid progenitors. A detailed structural organization of two closely located 5S rDNA-tagged genomic regions on the 5BS chromosome of bread wheat has been established. These two regions differ in the organization of both 5S rDNA and the neighboring sequences comprised of transposable elements, implying different modes of evolution for these regions.
Method for introducing unidirectional nested deletions
Dunn, John J.; Quesada, Mark A.; Randesi, Matthew
2001-01-01
Disclosed is a method for the introduction of unidirectional deletions in a cloned DNA segment in the context of a cloning vector which contains an f1 endonuclease recognition sequence adjacent to the insertion site of the DNA segment. Also disclosed is a method for producing single-stranded DNA probes utilizing the same cloning vector. An optimal vector, PZIP is described. Methods for introducing unidirectional deletions into a terminal location of a cloned DNA sequence which is inserted into the vector of the present invention are also disclosed. These methods are useful for introducing deletions into either or both ends of a cloned DNA insert, for high throughput sequencing of any DNA of interest.
Three Group-I introns in 18S rDNA of Endosymbiotic Algae of Paramecium bursaria from Japan
NASA Astrophysics Data System (ADS)
Hoshina, Ryo; Kamako, Shin-ichiro; Imamura, Nobutaka
2004-08-01
In the nuclear encoded small subunit ribosomal DNA (18S rDNA) of symbiotic alga of Paramecium bursaria (F36 collected in Japan) possesses three intron-like insertions (Hoshina et al., unpubl. data, 2003). The present study confirmed these exact lengths and insertion sites by reverse transcription-PCR. Two of them were inserted at Escherichia coli 16S rRNA genic position 943 and 1512 that are frequent intron insertion positions, but another insertion position (nearly 1370) was the first finding. Their secondary structures suggested they belong to Group-I intron; one belongs to subgroup IE, others belong to subgroup IC1. Similarity search indicated these introns are ancestral ones.
Helm, Jared R.; Hertz-Fowler, Christiane; Aslett, Martin; Berriman, Matthew; Sanders, Mandy; Quail, Michael A.; Soares, Marcelo B.; Bonaldo, Maria F.; Sakurai, Tatsuya; Inoue, Noboru; Donelson, John E.
2009-01-01
Trypanosoma congolense is one of the most economically important pathogens of livestock in Africa. Culture-derived parasites of each of the three main insect stages of the T. congolense life cycle, i.e., the procyclic, epimastigote and metacyclic stages, and bloodstream stage parasites isolated from infected mice, were used to construct stage-specific cDNA libraries and expressed sequence tags (ESTs or cDNA clones) in each library were sequenced. Thirteen EST clusters encoding different variant surface glycoproteins (VSGs) were detected in the metacyclic library and twenty-six VSG EST clusters were found in the bloodstream library, six of which are shared by the metacyclic library. Rare VSG ESTs are present in the epimastigote library, and none were detected in the procyclic library. ESTs encoding enzymes that catalyze oxidative phosphorylation and amino acid metabolism are about twice as abundant in the procyclic and epimastigote stages as in the metacyclic and bloodstream stages. In contrast, ESTs encoding enzymes involved in glycolysis, the citric acid cycle and nucleotide metabolism are about the same in all four developmental stages. Cysteine proteases, kinases and phosphatases are the most abundant enzyme groups represented by the ESTs. All four libraries contain T. congolense-specific expressed sequences not present in the T. brucei and T. cruzi genomes. Normalized cDNA libraries were constructed from the metacyclic and bloodstream stages, and found to be further enriched for T. congolense-specific ESTs. Given that cultured T. congolense offers an experimental advantage over other African trypanosome species, these ESTs provide a basis for further investigation of the molecular properties of these four developmental stages, especially the epimastigote and metacyclic stages for which it is difficult to obtain large quantities of organisms. The T. congolense EST databases are available at: http://www.sanger.ac.uk/Projects/T_congolense/EST_index.shtml. PMID:19559733
Quantifying and resolving multiple vector transformants in S. cerevisiae plasmid libraries.
Scanlon, Thomas C; Gray, Elizabeth C; Griswold, Karl E
2009-11-20
In addition to providing the molecular machinery for transcription and translation, recombinant microbial expression hosts maintain the critical genotype-phenotype link that is essential for high throughput screening and recovery of proteins encoded by plasmid libraries. It is known that Escherichia coli cells can be simultaneously transformed with multiple unique plasmids and thusly complicate recombinant library screening experiments. As a result of their potential to yield misleading results, bacterial multiple vector transformants have been thoroughly characterized in previous model studies. In contrast to bacterial systems, there is little quantitative information available regarding multiple vector transformants in yeast. Saccharomyces cerevisiae is the most widely used eukaryotic platform for cell surface display, combinatorial protein engineering, and other recombinant library screens. In order to characterize the extent and nature of multiple vector transformants in this important host, plasmid-born gene libraries constructed by yeast homologous recombination were analyzed by DNA sequencing. It was found that up to 90% of clones in yeast homologous recombination libraries may be multiple vector transformants, that on average these clones bear four or more unique mutant genes, and that these multiple vector cells persist as a significant proportion of library populations for greater than 24 hours during liquid outgrowth. Both vector concentration and vector to insert ratio influenced the library proportion of multiple vector transformants, but their population frequency was independent of transformation efficiency. Interestingly, the average number of plasmids born by multiple vector transformants did not vary with their library population proportion. These results highlight the potential for multiple vector transformants to dominate yeast libraries constructed by homologous recombination. The previously unrecognized prevalence and persistence of multiply transformed yeast cells have important implications for yeast library screens. The quantitative information described herein should increase awareness of this issue, and the rapid sequencing approach developed for these studies should be widely useful for identifying multiple vector transformants and avoiding complications associated with cells that have acquired more than one unique plasmid.
Time- and Cost-Efficient Identification of T-DNA Insertion Sites through Targeted Genomic Sequencing
Lepage, Étienne; Zampini, Éric; Boyle, Brian; Brisson, Normand
2013-01-01
Forward genetic screens enable the unbiased identification of genes involved in biological processes. In Arabidopsis, several mutant collections are publicly available, which greatly facilitates such practice. Most of these collections were generated by agrotransformation of a T-DNA at random sites in the plant genome. However, precise mapping of T-DNA insertion sites in mutants isolated from such screens is a laborious and time-consuming task. Here we report a simple, low-cost and time efficient approach to precisely map T-DNA insertions simultaneously in many different mutants. By combining sequence capture, next-generation sequencing and 2D-PCR pooling, we developed a new method that allowed the rapid localization of T-DNA insertion sites in 55 out of 64 mutant plants isolated in a screen for gyrase inhibition hypersensitivity. PMID:23951038
Aigrain, Louise; Gu, Yong; Quail, Michael A
2016-06-13
The emergence of next-generation sequencing (NGS) technologies in the past decade has allowed the democratization of DNA sequencing both in terms of price per sequenced bases and ease to produce DNA libraries. When it comes to preparing DNA sequencing libraries for Illumina, the current market leader, a plethora of kits are available and it can be difficult for the users to determine which kit is the most appropriate and efficient for their applications; the main concerns being not only cost but also minimal bias, yield and time efficiency. We compared 9 commercially available library preparation kits in a systematic manner using the same DNA sample by probing the amount of DNA remaining after each protocol steps using a new droplet digital PCR (ddPCR) assay. This method allows the precise quantification of fragments bearing either adaptors or P5/P7 sequences on both ends just after ligation or PCR enrichment. We also investigated the potential influence of DNA input and DNA fragment size on the final library preparation efficiency. The overall library preparations efficiencies of the libraries show important variations between the different kits with the ones combining several steps into a single one exhibiting some final yields 4 to 7 times higher than the other kits. Detailed ddPCR data also reveal that the adaptor ligation yield itself varies by more than a factor of 10 between kits, certain ligation efficiencies being so low that it could impair the original library complexity and impoverish the sequencing results. When a PCR enrichment step is necessary, lower adaptor-ligated DNA inputs leads to greater amplification yields, hiding the latent disparity between kits. We describe a ddPCR assay that allows us to probe the efficiency of the most critical step in the library preparation, ligation, and to draw conclusion on which kits is more likely to preserve the sample heterogeneity and reduce the need of amplification.
Assembling short reads from jumping libraries with large insert sizes.
Vasilinetc, Irina; Prjibelski, Andrey D; Gurevich, Alexey; Korobeynikov, Anton; Pevzner, Pavel A
2015-10-15
Advances in Next-Generation Sequencing technologies and sample preparation recently enabled generation of high-quality jumping libraries that have a potential to significantly improve short read assemblies. However, assembly algorithms have to catch up with experimental innovations to benefit from them and to produce high-quality assemblies. We present a new algorithm that extends recently described exSPAnder universal repeat resolution approach to enable its applications to several challenging data types, including jumping libraries generated by the recently developed Illumina Nextera Mate Pair protocol. We demonstrate that, with these improvements, bacterial genomes often can be assembled in a few contigs using only a single Nextera Mate Pair library of short reads. Described algorithms are implemented in C++ as a part of SPAdes genome assembler, which is freely available at bioinf.spbau.ru/en/spades. ap@bioinf.spbau.ru Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Nanoneedle insertion into the cell nucleus does not induce double-strand breaks in chromosomal DNA.
Ryu, Seunghwan; Kawamura, Ryuzo; Naka, Ryohei; Silberberg, Yaron R; Nakamura, Noriyuki; Nakamura, Chikashi
2013-09-01
An atomic force microscope probe can be formed into an ultra-sharp cylindrical shape (a nanoneedle) using micro-fabrication techniques such as focused ion beam etching. This nanoneedle can be effectively inserted through the plasma membrane of a living cell to not only access the cytosol, but also to penetrate through the nuclear membrane. This technique shows great potential as a tool for performing intranuclear measurements and manipulations. Repeated insertions of a nanoneedle into a live cell were previously shown not to affect cell viability. However, the effect of nanoneedle insertion on the nucleus and nuclear components is still unknown. DNA is the most crucial component of the nucleus for proper cell function and may be physically damaged by a nanoneedle. To investigate the integrity of DNA following nanoneedle insertion, the occurrence of DNA double-strand breaks (DSBs) was assessed. The results showed that there was no chromosomal DNA damage due to nanoneedle insertion into the nucleus, as indicated by the expression level of γ-H2AX, a molecular marker of DSBs. Copyright © 2013 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Martin, Marjolaine; Vandermies, Marie; Joyeux, Coline; Martin, Renée; Barbeyron, Tristan; Michel, Gurvan; Vandenbol, Micheline
2016-01-01
Alga-associated microorganisms, in the context of their numerous interactions with the host and the complexity of the marine environment, are known to produce diverse hydrolytic enzymes with original biochemistry. We recently isolated several macroalgal-polysaccharide-degrading bacteria from the surface of the brown alga Ascophyllum nodosum. These active isolates belong to two classes: the Flavobacteriia and the Gammaproteobacteria. In the present study, we constructed two "plurigenomic" (with multiple bacterial genomes) libraries with the 5 most interesting isolates (regarding their phylogeny and their enzymatic activities) of each class (Fv and Gm libraries). Both libraries were screened for diverse hydrolytic activities. Five activities, out of the 48 previously identified in the natural polysaccharolytic isolates, were recovered by functional screening: a xylanase (GmXyl7), a beta-glucosidase (GmBg1), an esterase (GmEst7) and two iota-carrageenases (Fvi2.5 and Gmi1.3). We discuss here the potential role of the used host-cell, the average DNA insert-sizes and the used restriction enzymes on the divergent screening yields obtained for both libraries and get deeper inside the "great screen anomaly". Interestingly, the discovered esterase probably stands for a novel family of homoserine o-acetyltransferase-like-esterases, while the two iota-carrageenases represent new members of the poorly known GH82 family (containing only 19 proteins since its description in 2000). These original results demonstrate the efficiency of our uncommon "plurigenomic" library approach and the underexplored potential of alga-associated cultivable microbiota for the identification of novel and algal-specific enzymes. Copyright © 2016 Elsevier GmbH. All rights reserved.
Burke, W D; Calalang, C C; Eickbush, T H
1987-01-01
Two classes of DNA elements interrupt a fraction of the rRNA repeats of Bombyx mori. We have analyzed by genomic blotting and sequence analysis one class of these elements which we have named R2. These elements occupy approximately 9% of the rDNA units of B. mori and appear to be homologous to the type II rDNA insertions detected in Drosophila melanogaster. Approximately 25 copies of R2 exist within the B. mori genome, of which at least 20 are located at a precise location within otherwise typical rDNA units. Nucleotide sequence analysis has revealed that the 4.2-kilobase-pair R2 element has a single large open reading frame, occupying over 82% of the total length of the element. The central region of this 1,151-amino-acid open reading frame shows homology to the reverse transcriptase enzymes found in retroviruses and certain transposable elements. Amino acid homology of this region is highest to the mobile line 1 elements of mammals, followed by the mitochondrial type II introns of fungi, and the pol gene of retroviruses. Less homology exists with transposable elements of D. melanogaster and Saccharomyces cerevisiae. Two additional regions of sequence homology between L1 and R2 elements were also found outside the reverse transcriptase region. We suggest that the R2 elements are retrotransposons that are site specific in their insertion into the genome. Such mobility would enable these elements to occupy a small fraction of the rDNA units of B. mori despite their continual elimination from the rDNA locus by sequence turnover. Images PMID:2439905
Manlig, Erika; Wahlberg, Per
2017-01-01
Abstract Sodium bisulphite treatment of DNA combined with next generation sequencing (NGS) is a powerful combination for the interrogation of genome-wide DNA methylation profiles. Library preparation for whole genome bisulphite sequencing (WGBS) is challenging due to side effects of the bisulphite treatment, which leads to extensive DNA damage. Recently, a new generation of methods for bisulphite sequencing library preparation have been devised. They are based on initial bisulphite treatment of the DNA, followed by adaptor tagging of single stranded DNA fragments, and enable WGBS using low quantities of input DNA. In this study, we present a novel approach for quick and cost effective WGBS library preparation that is based on splinted adaptor tagging (SPLAT) of bisulphite-converted single-stranded DNA. Moreover, we validate SPLAT against three commercially available WGBS library preparation techniques, two of which are based on bisulphite treatment prior to adaptor tagging and one is a conventional WGBS method. PMID:27899585
A Fast Solution to NGS Library Prep with Low Nanogram DNA Input
Liu, Pingfang; Lohman, Gregory J.S.; Cantor, Eric; Langhorst, Bradley W.; Yigit, Erbay; Apone, Lynne M.; Munafo, Daniela B.; Stewart, Fiona J.; Evans, Thomas C.; Nichols, Nicole; Dimalanta, Eileen T.; Davis, Theodore B.; Sumner, Christine
2013-01-01
Next Generation Sequencing (NGS) has significantly impacted human genetics, enabling a comprehensive characterization of the human genome as well as a better understanding of many genomic abnormalities. By delivering massive DNA sequences at unprecedented speed and cost, NGS promises to make personalized medicine a reality in the foreseeable future. To date, library construction with clinical samples has been a challenge, primarily due to the limited quantities of sample DNA available. Our objective here was to overcome this challenge by developing NEBNext® Ultra DNA Library Prep Kit, a fast library preparation method. Specifically, we streamlined the workflow utilizing novel NEBNext reagents and adaptors, including a new DNA polymerase that has been optimized to minimize GC bias. As a result of this work, we have developed a simple method for library construction from an amount of DNA as low as 5 ng, which can be used for both intact and fragmented DNA. Moreover, the workflow is compatible with multiple NGS platforms.
Yu, Bing; Ni, Ming; Li, Wen-Han; Lei, Ping; Xing, Wei; Xiao, Dai-Wen; Huang, Yu; Tang, Zhen-Jie; Zhu, Hui-Fen; Shen, Guan-Xin
2005-07-14
To identify the scFv antibody fragments specific for hepatocellular carcinoma by biopanning from a large human naive scFv phage display library. A large human naive scFv phage library was used to search for the specific targets by biopanning with the hepatocellular carcinoma cell line HepG2 for the positive-selecting and the normal liver cell line L02 for the counter-selecting. After three rounds of biopanning, individual scFv phages binding selectively to HepG2 cells were picked out. PCR was carried out for identification of the clones containing scFv gene sequence. The specific scFv phages were selected by ELISA and flow cytometry. DNA sequences of positive clones were analyzed by using Applied Biosystem Automated DNA sequencers 3 730. The expression proteins of the specific scFv antibody fragments in E.coli HB2151 were purified by the affinity chromatography and detected by SDS-PAGE, Western blot and ELISA. The biological effect of the soluble antibody fragments on the HepG2 cells was investigated by observing the cell proliferation. Two different positive clones were obtained and the functional variable sequences were identified. Their DNA sequences of the scFv antibody fragments were submitted to GenBank (accession nos: AY686498 and AY686499). The soluble scFv antibody fragments were successfully expressed in E.coli HB2151. The relative molecular mass of the expression products was about 36 ku, according to its predicted M(r) value. The two soluble scFv antibody fragments also had specific binding activity and obvious growth inhibition properties to HepG2 cells. The phage library biopanning permits identification of specific antibody fragments for hepatocellular carcinoma and affords experiment evidence for its immunotherapy study.
Hara, Yuichiro; Tatsumi, Kaori; Yoshida, Michio; Kajikawa, Eriko; Kiyonari, Hiroshi; Kuraku, Shigehiro
2015-11-18
RNA-seq enables gene expression profiling in selected spatiotemporal windows and yields massive sequence information with relatively low cost and time investment, even for non-model species. However, there remains a large room for optimizing its workflow, in order to take full advantage of continuously developing sequencing capacity. Transcriptome sequencing for three embryonic stages of Madagascar ground gecko (Paroedura picta) was performed with the Illumina platform. The output reads were assembled de novo for reconstructing transcript sequences. In order to evaluate the completeness of transcriptome assemblies, we prepared a reference gene set consisting of vertebrate one-to-one orthologs. To take advantage of increased read length of >150 nt, we demonstrated shortened RNA fragmentation time, which resulted in a dramatic shift of insert size distribution. To evaluate products of multiple de novo assembly runs incorporating reads with different RNA sources, read lengths, and insert sizes, we introduce a new reference gene set, core vertebrate genes (CVG), consisting of 233 genes that are shared as one-to-one orthologs by all vertebrate genomes examined (29 species)., The completeness assessment performed by the computational pipelines CEGMA and BUSCO referring to CVG, demonstrated higher accuracy and resolution than with the gene set previously established for this purpose. As a result of the assessment with CVG, we have derived the most comprehensive transcript sequence set of the Madagascar ground gecko by means of assembling individual libraries followed by clustering the assembled sequences based on their overall similarities. Our results provide several insights into optimizing de novo RNA-seq workflow, including the coordination between library insert size and read length, which manifested in improved connectivity of assemblies. The approach and assembly assessment with CVG demonstrated here would be applicable to transcriptome analysis of other species as well as whole genome analyses.
Hennig, Bianca P.; Velten, Lars; Racke, Ines; Tu, Chelsea Szu; Thoms, Matthias; Rybin, Vladimir; Besir, Hüseyin; Remans, Kim; Steinmetz, Lars M.
2017-01-01
Efficient preparation of high-quality sequencing libraries that well represent the biological sample is a key step for using next-generation sequencing in research. Tn5 enables fast, robust, and highly efficient processing of limited input material while scaling to the parallel processing of hundreds of samples. Here, we present a robust Tn5 transposase purification strategy based on an N-terminal His6-Sumo3 tag. We demonstrate that libraries prepared with our in-house Tn5 are of the same quality as those processed with a commercially available kit (Nextera XT), while they dramatically reduce the cost of large-scale experiments. We introduce improved purification strategies for two versions of the Tn5 enzyme. The first version carries the previously reported point mutations E54K and L372P, and stably produces libraries of constant fragment size distribution, even if the Tn5-to-input molecule ratio varies. The second Tn5 construct carries an additional point mutation (R27S) in the DNA-binding domain. This construct allows for adjustment of the fragment size distribution based on enzyme concentration during tagmentation, a feature that opens new opportunities for use of Tn5 in customized experimental designs. We demonstrate the versatility of our Tn5 enzymes in different experimental settings, including a novel single-cell polyadenylation site mapping protocol as well as ultralow input DNA sequencing. PMID:29118030
3G vector-primer plasmid for constructing full-length-enriched cDNA libraries.
Zheng, Dong; Zhou, Yanna; Zhang, Zidong; Li, Zaiyu; Liu, Xuedong
2008-09-01
We designed a 3G vector-primer plasmid for the generation of full-length-enriched complementary DNA (cDNA) libraries. By employing the terminal transferase activity of reverse transcriptase and the modified strand replacement method, this plasmid (assembled with a polydT end and a deoxyguanosine [dG] end) combines priming full-length cDNA strand synthesis and directional cDNA cloning. As a result, the number of steps involved in cDNA library preparation is decreased while simplifying downstream gene manipulation, sequencing, and subcloning. The 3G vector-primer plasmid method yields fully represented plasmid primed libraries that are equivalent to those made by the SMART (switching mechanism at 5' end of RNA transcript) approach.
Complementation of a Fanconi anemia group A cell line by UbA{sup 52}
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moses, R.E.; Heina, J.A.; Jakobs, P.M.
1994-09-01
Cells from patients with Fanconi anemia (FA) display chromosomal instability and increased sensitivity to mitomycin C (MMC) and diepoxybutane (DEB) relative to normal cells. Several genes act in this pathway of DNA damage processing based upon four known complementation groups in FA. We have made a cDNA expression library in a vector with a G418 selectable marker to identify FA genes other than the FA-C group. Approximately 1 x 10{sup 6} independent cDNA clones were isolated with an average cDNA size of 1.5 kb. Five cell lines resistant to MMC and DEB were isolated from 6 x 10{sup 6} G418-resistantmore » transfectants from 65 individual transfections of the FA-A fibroblast line GM6914. The isolated cell lines also showed normal chromosome stability. The same cDNA (600 bp) was recovered from three independent cell lines by PCR using flanking sequence primers. The gene has sequence identity with a known gene, the ubiquitin fusion gene, UbA{sub 52}. Interestingly, each of the cDNAs were inserted in antisense orientation relative to the cytomegalovirus (CMV) promoter as determined by sequencing and PCR using UbA{sub 52}-specific internal primers. Southern blot analysis indicated the cell lines had distinct chromosomal insertion sites. Mutation analysis by chemical cleavage showed no reading frame mutations, indicating that UbA{sub 52} is not the FA-A gene. Re-transfection with the UbA{sub 52} gene in antisense gave complementation for MMC, DEB and chromosome stability to varying degrees. Re-transfection of the antisense construct with the CMV promotor removed or with a sense construct did not alter the MMC sensitivity. We conclude that the antisense UbA{sub 52} gene has a non-specific effect, perhaps acting by altering the cell cycle or susceptibility to apoptosis.« less
Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain
2011-01-01
cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.
Editor’s note: This article was originally published on the Center for Cancer Research website. When the Human Immunodeficiency Virus (HIV) infects a cell, the virus inserts a copy of its genetic material into the host cell’s DNA. The inserted genetic material, which is also called a provirus, is used to produce new viruses. Because the viral DNA can be inserted at many sites
DOE Office of Scientific and Technical Information (OSTI.GOV)
Harris, J.M.; Venditti, C.P.; Chorney, M.J.
1994-09-01
An association between idiopathic hemochromatosis (HFE) and the HLA-A3 locus has been previously well-established. In an attempt to identify potential HFE candidate genes, a genomic DNA fragment distal to the HLA-A9 breakpoint was used to screen a B cell cDNA library; a member (3.8-1) of a new multigene family, composed of five distinct genomic cross-reactive fragments, was identified. Clone 3.8-1 represents the 3{prime} end of 9.6 kb transcript which is expressed in multiple tissues including the spleen, thymus, lung and kidney. Sequencing and genome database analysis indicate that 3.8-1 is unique, with no homology to any known entries. The genomicmore » residence of 3-8.1, defined by polymorphism analysis and physical mapping using YAC clones, appears to be absent from the genomes of higher primates, although four other cross-reactivities are maintained. The absence of this gene as well as other probes which map in the TNF to HLA-B interval, suggest that this portion of the human HMC, located between the Class I and Class III regions, arose in humans as the result of a post-speciation insertional event. The large size of the 3.8-1 gene and the possible categorization of 3.8-1 as a human-specific gene are significant given the genetic data that place an autoimmune susceptibility element for IDDM and myasthenia gravis in the precise region where this gene resides. In an attempt to isolate the 5{prime} end of this large transcript, we have constructed a cosmid contig which encompasses the genomic locus of this gene and are progressively isolating coding sequences by exon trapping.« less
A Glimpse into the Satellite DNA Library in Characidae Fish (Teleostei, Characiformes)
Utsunomia, Ricardo; Ruiz-Ruano, Francisco J.; Silva, Duílio M. Z. A.; Serrano, Érica A.; Rosa, Ivana F.; Scudeler, Patrícia E. S.; Hashimoto, Diogo T.; Oliveira, Claudio; Camacho, Juan Pedro M.; Foresti, Fausto
2017-01-01
Satellite DNA (satDNA) is an abundant fraction of repetitive DNA in eukaryotic genomes and plays an important role in genome organization and evolution. In general, satDNA sequences follow a concerted evolutionary pattern through the intragenomic homogenization of different repeat units. In addition, the satDNA library hypothesis predicts that related species share a series of satDNA variants descended from a common ancestor species, with differential amplification of different satDNA variants. The finding of a same satDNA family in species belonging to different genera within Characidae fish provided the opportunity to test both concerted evolution and library hypotheses. For this purpose, we analyzed here sequence variation and abundance of this satDNA family in ten species, by a combination of next generation sequencing (NGS), PCR and Sanger sequencing, and fluorescence in situ hybridization (FISH). We found extensive between-species variation for the number and size of pericentromeric FISH signals. At genomic level, the analysis of 1000s of DNA sequences obtained by Illumina sequencing and PCR amplification allowed defining 150 haplotypes which were linked in a common minimum spanning tree, where different patterns of concerted evolution were apparent. This also provided a glimpse into the satDNA library of this group of species. In consistency with the library hypothesis, different variants for this satDNA showed high differences in abundance between species, from highly abundant to simply relictual variants. PMID:28855916
Novel selection methods for DNA-encoded chemical libraries
Chan, Alix I.; McGregor, Lynn M.; Liu, David R.
2015-01-01
Driven by the need for new compounds to serve as biological probes and leads for therapeutic development and the growing accessibility of DNA technologies including high-throughput sequencing, many academic and industrial groups have begun to use DNA-encoded chemical libraries as a source of bioactive small molecules. In this review, we describe the technologies that have enabled the selection of compounds with desired activities from these libraries. These methods exploit the sensitivity of in vitro selection coupled with DNA amplification to overcome some of the limitations and costs associated with conventional screening methods. In addition, we highlight newer techniques with the potential to be applied to the high-throughput evaluation of DNA-encoded chemical libraries. PMID:25723146
Procedure for normalization of cDNA libraries
Bonaldo, Maria DeFatima; Soares, Marcelo Bento
1997-01-01
This invention provides a method to normalize a cDNA library constructed in a vector capable of being converted to single-stranded circles and capable of producing complementary nucleic acid molecules to the single-stranded circles comprising: (a) converting the cDNA library in single-stranded circles; (b) generating complementary nucleic acid molecules to the single-stranded circles; (c) hybridizing the single-stranded circles converted in step (a) with complementary nucleic acid molecules of step (b) to produce partial duplexes to an appropriate Cot; (e) separating the unhybridized single-stranded circles from the hybridized single-stranded circles, thereby generating a normalized cDNA library.
Burke, Sean V; Wysocki, William P; Zuloaga, Fernando O; Craine, Joseph M; Pires, J Chris; Edger, Patrick P; Mayfield-Jones, Dustin; Clark, Lynn G; Kelchner, Scot A; Duvall, Melvin R
2016-06-18
Panicoideae are the second largest subfamily in Poaceae (grass family), with 212 genera and approximately 3316 species. Previous studies have begun to reveal relationships within the subfamily, but largely lack resolution and/or robust support for certain tribal and subtribal groups. This study aims to resolve these relationships, as well as characterize a putative mitochondrial insert in one linage. 35 newly sequenced Panicoideae plastomes were combined in a phylogenomic study with 37 other species: 15 Panicoideae and 22 from outgroups. A robust Panicoideae topology largely congruent with previous studies was obtained, but with some incongruences with previously reported subtribal relationships. A mitochondrial DNA (mtDNA) to plastid DNA (ptDNA) transfer was discovered in the Paspalum lineage. The phylogenomic analysis returned a topology that largely supports previous studies. Five previously recognized subtribes appear on the topology to be non-monophyletic. Additionally, evidence for mtDNA to ptDNA transfer was identified in both Paspalum fimbriatum and P. dilatatum, and suggests a single rare event that took place in a common progenitor. Finally, the framework from this study can guide larger whole plastome sampling to discern the relationships in Cyperochloeae, Steyermarkochloeae, Gynerieae, and other incertae sedis taxa that are weakly supported or unresolved.
Huemer, Peter; Mutanen, Marko; Sefc, Kristina M; Hebert, Paul D N
2014-01-01
This study examines the performance of DNA barcodes (mt cytochrome c oxidase 1 gene) in the identification of 1004 species of Lepidoptera shared by two localities (Finland, Austria) that are 1600 km apart. Maximum intraspecific distances for the pooled data were less than 2% for 880 species (87.6%), while deeper divergence was detected in 124 species. Despite such variation, the overall DNA barcode library possessed diagnostic COI sequences for 98.8% of the taxa. Because a reference library based on Finnish specimens was highly effective in identifying specimens from Austria, we conclude that barcode libraries based on regional sampling can often be effective for a much larger area. Moreover, dispersal ability (poor, good) and distribution patterns (disjunct, fragmented, continuous, migratory) had little impact on levels of intraspecific geographic divergence. Furthermore, the present study revealed that, despite the intensity of past taxonomic work on European Lepidoptera, nearly 20% of the species shared by Austria and Finland require further work to clarify their status. Particularly discordant BIN (Barcode Index Number) cases should be checked to ascertain possible explanatory factors such as incorrect taxonomy, hybridization, introgression, and Wolbachia infections.
Sharma, Nandita; Tanksale, Himgouri; Kapley, Atya; Purohit, Hemant J
2012-12-01
Metagenomic libraries herald the era of magnifying the microbial world, tapping into the vast metabolic potential of uncultivated microbes, and enhancing the rate of discovery of novel genes and pathways. In this paper, we describe a method that facilitates the extraction of metagenomic DNA from activated sludge of an industrial wastewater treatment plant and its use in mining the metagenome via library construction. The efficiency of this method was demonstrated by the large representation of the bacterial genome in the constructed metagenomic libraries and by the functional clones obtained. The BAC library represented 95.6 times the bacterial genome, while, the pUC library represented 41.7 times the bacterial genome. Twelve clones in the BAC library demonstrated lipolytic activity, while four clones demonstrated dioxygenase activity. Four clones in pUC library tested positive for cellulase activity. This method, using FTA cards, not only can be used for library construction, but can also store the metagenome at room temperature.
Knott, V; Rees, D J; Cheng, Z; Brownlee, G G
1988-01-01
Sets of overlapping cosmid clones generated by random sampling and fingerprinting methods complement data at pyrB (96.5') and oriC (84') in the published physical map of E. coli. A new cloning strategy using sheared DNA, and a low copy, inducible cosmid vector were used in order to reduce bias in libraries, in conjunction with micro-methods for preparing cosmid DNA from a large number of clones. Our results are relevant to the design of the best approach to the physical mapping of large genomes. PMID:2834694
Langevin, Stanley A.; Bent, Zachary W.; Solberg, Owen D.; Curtis, Deanna J.; Lane, Pamela D.; Williams, Kelly P.; Schoeniger, Joseph S.; Sinha, Anupama; Lane, Todd W.; Branda, Steven S.
2013-01-01
Use of second generation sequencing (SGS) technologies for transcriptional profiling (RNA-Seq) has revolutionized transcriptomics, enabling measurement of RNA abundances with unprecedented specificity and sensitivity and the discovery of novel RNA species. Preparation of RNA-Seq libraries requires conversion of the RNA starting material into cDNA flanked by platform-specific adaptor sequences. Each of the published methods and commercial kits currently available for RNA-Seq library preparation suffers from at least one major drawback, including long processing times, large starting material requirements, uneven coverage, loss of strand information and high cost. We report the development of a new RNA-Seq library preparation technique that produces representative, strand-specific RNA-Seq libraries from small amounts of starting material in a fast, simple and cost-effective manner. Additionally, we have developed a new quantitative PCR-based assay for precisely determining the number of PCR cycles to perform for optimal enrichment of the final library, a key step in all SGS library preparation workflows. PMID:23558773
Lambda Red Mediated Gap Repair Utilizes a Novel Replicative Intermediate in Escherichia coli
Reddy, Thimma R.; Fevat, Léna M. S.; Munson, Sarah E.; Stewart, A. Francis; Cowley, Shaun M.
2015-01-01
The lambda phage Red recombination system can mediate efficient homologous recombination in Escherichia coli, which is the basis of the DNA engineering technique termed recombineering. Red mediated insertion of DNA requires DNA replication, involves a single-stranded DNA intermediate and is more efficient on the lagging strand of the replication fork. Lagging strand recombination has also been postulated to explain the Red mediated repair of gapped plasmids by an Okazaki fragment gap filling model. Here, we demonstrate that gap repair involves a different strand independent mechanism. Gap repair assays examining the strand asymmetry of recombination did not show a lagging strand bias. Directly testing an ssDNA plasmid showed lagging strand recombination is possible but dsDNA plasmids did not employ this mechanism. Insertional recombination combined with gap repair also did not demonstrate preferential lagging strand bias, supporting a different gap repair mechanism. The predominant recombination route involved concerted insertion and subcloning though other routes also operated at lower frequencies. Simultaneous insertion of DNA resulted in modification of both strands and was unaffected by mutations to DNA polymerase I, responsible for Okazaki fragment maturation. The lower efficiency of an alternate Red mediated ends-in recombination pathway and the apparent lack of a Holliday junction intermediate suggested that gap repair does not involve a different Red recombination pathway. Our results may be explained by a novel replicative intermediate in gap repair that does not involve a replication fork. We exploited these observations by developing a new recombineering application based on concerted insertion and gap repair, termed SPI (subcloning plus insertion). SPI selected against empty vector background and selected for correct gap repair recombinants. We used SPI to simultaneously insert up to four different gene cassettes in a single recombineering reaction. Consequently, our findings have important implications for the understanding of E. coli replication and Red recombination. PMID:25803509
Garcia, S; Kovařík, A
2013-01-01
In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S–5.8S–26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S–18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S–5.8S–26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants. PMID:23512008
Garcia, S; Kovařík, A
2013-07-01
In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S-5.8S-26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S-18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S-5.8S-26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants.
Preparation of metagenomic libraries from naturally occurring marine viruses.
Solonenko, Sergei A; Sullivan, Matthew B
2013-01-01
Microbes are now well recognized as major drivers of the biogeochemical cycling that fuels the Earth, and their viruses (phages) are known to be abundant and important in microbial mortality, horizontal gene transfer, and modulating microbial metabolic output. Investigation of environmental phages has been frustrated by an inability to culture the vast majority of naturally occurring diversity coupled with the lack of robust, quantitative, culture-independent methods for studying this uncultured majority. However, for double-stranded DNA phages, a quantitative viral metagenomic sample-to-sequence workflow now exists. Here, we review these advances with special emphasis on the technical details of preparing DNA sequencing libraries for metagenomic sequencing from environmentally relevant low-input DNA samples. Library preparation steps broadly involve manipulating the sample DNA by fragmentation, end repair and adaptor ligation, size fractionation, and amplification. One critical area of future research and development is parallel advances for alternate nucleic acid types such as single-stranded DNA and RNA viruses that are also abundant in nature. Combinations of recent advances in fragmentation (e.g., acoustic shearing and tagmentation), ligation reactions (adaptor-to-template ratio reference table availability), size fractionation (non-gel-sizing), and amplification (linear amplification for deep sequencing and linker amplification protocols) enhance our ability to generate quantitatively representative metagenomic datasets from low-input DNA samples. Such datasets are already providing new insights into the role of viruses in marine systems and will continue to do so as new environments are explored and synergies and paradigms emerge from large-scale comparative analyses. © 2013 Elsevier Inc. All rights reserved.
The bacterial composition of chlorinated drinking water was analyzed using 16S rRNA gene clone libraries derived from DNA extracts of 12 samples and compared to clone libraries previously generated using RNA extracts from the same samples. Phylogenetic analysis of 761 DNA-based ...
Yang, Hongmei; Yao, Wenbin; Wang, Yihan; Shi, Lei; Su, Rui; Wan, Debin; Xu, Niusheng; Lian, Wenhui; Chen, Changbao; Liu, Shuying
2017-02-14
Conventional strategies for the screening of DNA triplex binders cannot be used for complicated samples, such as ligand libraries created by combinatorial chemistry or from natural product extracts. In the current study, an ultra-high-performance liquid chromatography coupled with an Orbitrap mass spectrometry (UHPLC-Orbitrap-MS)-based approach, which we call peak area-fading (PAF) UHPLC-Orbitrap-MS and was designed for just such a purpose, is reported. The triplex DNA modified 96-well plate and the single stranded oligonucleotide modified 96-well plate (as control) were incubated with ligand libraries, and the unbound ligands were directly determined via UHPLC-ESI-MS. The binders were detected through the decrease (fading) in the peak areas compared to those of the control group. Several factors, such as incubation time, incubation temperature, and buffer, which might affect the binding affinity and reproducibility, were optimized. The potential of the approach was examined using the extracts of Rhizoma Coptidis and Phellodendron chinense Schneid cortexe. The triplex DNA-binding capabilities of the five components (epiberberine, coptisine, jatrorrhizine, berberrubine, and columbamine) were found for the first time, indicating their efficiency for the analysis of complicated samples. In contrast to our previous study, which suffered from a serious drawback of poor reproducibility, this method is more robust and more suitable for high-throughput measurements, opening a new experimental strategy in assessing large libraries of potential drug candidates that work by forming a drug/DNA complex.
[Isolation and function of genes regulating aphB expression in Vibrio cholerae].
Chen, Haili; Zhu, Zhaoqin; Zhong, Zengtao; Zhu, Jun; Kan, Biao
2012-02-04
We identified genes that regulate the expression of aphB, the gene encoding a key virulence regulator in Vibrio cholerae O1 E1 Tor C6706(-). We constructed a transposon library in V. cholerae C6706 strain containing a P(aphB)-luxCDABE and P(aphB)-lacZ transcriptional reporter plasmids. Using a chemiluminescence imager system, we rapidly detected aphB promoter expression level at a large scale. We then sequenced the transposon insertion sites by arbitrary PCR and sequencing analysis. We obtained two candidate mutants T1 and T2 which displayed reduced aphB expression from approximately 40,000 transposon insertion mutants. Sequencing analysis shows that Tn inserted in vc1585 reading frame in the T1 mutant and Tn inserted in the end of coding sequence of vc1602 in the T2 mutant. By using a genetic screen, we identified two potential genes that may involve in regulation of the expression of the key virulence regulator AphB. This study sheds light on our further investigation to fully understand V. cholerae virulence gene regulatory cascades.
Pauthenier, Cyrille; Faulon, Jean-Loup
2014-07-01
PrecisePrimer is a web-based primer design software made to assist experimentalists in any repetitive primer design task such as preparing, cloning and shuffling DNA libraries. Unlike other popular primer design tools, it is conceived to generate primer libraries with popular PCR polymerase buffers proposed as pre-set options. PrecisePrimer is also meant to design primers in batches, such as for DNA libraries creation of DNA shuffling experiments and to have the simplest interface possible. It integrates the most up-to-date melting temperature algorithms validated with experimental data, and cross validated with other computational tools. We generated a library of primers for the extraction and cloning of 61 genes from yeast DNA genomic extract using default parameters. All primer pairs efficiently amplified their target without any optimization of the PCR conditions. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
CLA1, a novel gene required for chloroplast development, is highly conserved in evolution.
Mandel, M A; Feldmann, K A; Herrera-Estrella, L; Rocha-Sosa, M; León, P
1996-05-01
An albino mutant designated cla1-1 (for "cloroplastos alterados', or "altered chloroplasts') has been isolated from a T-DNA-generated library of Arabidopsis thaliana. In cla1-1 plants, chloroplast development is arrested at an early stage. cla1-1 plants behave like wild-type in their capacity to etiolate and produce anthocyanins indicating that the light signal transduction pathway seems to be unaffected. Genetic and molecular analyses show that the disruption of a single gene, CLA1, by the T-DNA insertion is responsible for the mutant phenotype. RNA expression patterns indicate that CLA1 is positively regulated by light and that it has different effects on the steady-state RNA levels of some nuclear- and chloroplast-encoded photosynthetic genes. Although the specific function of the CLA1 gene is still unknown, it encodes a novel protein conserved in evolution between photosynthetic bacteria and plants which is essential for chloroplast development in Arabidopsis.
Chang, Y. Paul; Xu, Meng; Machado, Ana Carolina Dantas; Yu, Xian Jessica; Rohs, Remo; Chen, Xiaojiang S.
2013-01-01
SUMMARY The DNA tumor virus Simian virus 40 (SV40) is a model system for studying eukaryotic replication. SV40 large tumor antigen (LTag) is the initiator/helicase that is essential for genome replication. LTag recognizes and assembles at the viral replication origin. We determined the structure of two multidomain LTag subunits bound to origin DNA. The structure reveals that the origin binding domains (OBDs) and Zn and AAA+ domains are involved in origin recognition and assembly. Notably, the OBDs recognize the origin in an unexpected manner. The histidine residues of the AAA+ domains insert into a narrow minor groove region with enhanced negative electrostatic potential. Computational analysis indicates that this region is intrinsically narrow, demonstrating the role of DNA shape readout in origin recognition. Our results provide important insights into the assembly of the LTag initiator/ helicase at the replication origin and suggest that histidine contacts with the minor groove serve as a mechanism of DNA shape readout. PMID:23545501
Bourras, Salim; Meyer, Michel; Grandaubert, Jonathan; Lapalu, Nicolas; Fudal, Isabelle; Linglin, Juliette; Ollivier, Benedicte; Blaise, Françoise; Balesdent, Marie-Hélène; Rouxel, Thierry
2012-08-01
The ever-increasing generation of sequence data is accompanied by unsatisfactory functional annotation, and complex genomes, such as those of plants and filamentous fungi, show a large number of genes with no predicted or known function. For functional annotation of unknown or hypothetical genes, the production of collections of mutants using Agrobacterium tumefaciens-mediated transformation (ATMT) associated with genotyping and phenotyping has gained wide acceptance. ATMT is also widely used to identify pathogenicity determinants in pathogenic fungi. A systematic analysis of T-DNA borders was performed in an ATMT-mutagenized collection of the phytopathogenic fungus Leptosphaeria maculans to evaluate the features of T-DNA integration in its particular transposable element-rich compartmentalized genome. A total of 318 T-DNA tags were recovered and analyzed for biases in chromosome and genic compartments, existence of CG/AT skews at the insertion site, and occurrence of microhomologies between the T-DNA left border (LB) and the target sequence. Functional annotation of targeted genes was done using the Gene Ontology annotation. The T-DNA integration mainly targeted gene-rich, transcriptionally active regions, and it favored biological processes consistent with the physiological status of a germinating spore. T-DNA integration was strongly biased toward regulatory regions, and mainly promoters. Consistent with the T-DNA intranuclear-targeting model, the density of T-DNA insertion correlated with CG skew near the transcription initiation site. The existence of microhomologies between promoter sequences and the T-DNA LB flanking sequence was also consistent with T-DNA integration to host DNA mediated by homologous recombination based on the microhomology-mediated end-joining pathway.
Development and Synthesis of DNA-Encoded Benzimidazole Library.
Ding, Yun; Chai, Jing; Centrella, Paolo A; Gondo, Chenaimwoyo; DeLorey, Jennifer L; Clark, Matthew A
2018-04-25
Encoded library technology (ELT) is an effective approach to the discovery of novel small-molecule ligands for biological targets. A key factor for the success of the technology is the chemical diversity of the libraries. Here we report the development of DNA-conjugated benzimidazoles. Using 4-fluoro-3-nitrobenzoic acid as a key synthon, we synthesized a 320 million-member DNA-encoded benzimidazole library using Fmoc-protected amino acids, amines and aldehydes as diversity elements. Affinity selection of the library led to the discovery of a novel, potent and specific antagonist of the NK3 receptor.
Construction and screening of marine metagenomic libraries.
Weiland, Nancy; Löscher, Carolin; Metzger, Rebekka; Schmitz, Ruth
2010-01-01
Marine microbial communities are highly diverse and have evolved during extended evolutionary processes of physiological adaptations under the influence of a variety of ecological conditions and selection pressures. They harbor an enormous diversity of microbes with still unknown and probably new physiological characteristics. Besides, the surfaces of marine multicellular organisms are typically covered by a consortium of epibiotic bacteria and act as barriers, where diverse interactions between microorganisms and hosts take place. Thus, microbial diversity in the water column of the oceans and the microbial consortia on marine tissues of multicellular organisms are rich sources for isolating novel bioactive compounds and genes. Here we describe the sampling, construction of large-insert metagenomic libraries from marine habitats and exemplarily one function based screen of metagenomic clones.
Caruccio, Nicholas
2011-01-01
DNA library preparation is a common entry point and bottleneck for next-generation sequencing. Current methods generally consist of distinct steps that often involve significant sample loss and hands-on time: DNA fragmentation, end-polishing, and adaptor-ligation. In vitro transposition with Nextera™ Transposomes simultaneously fragments and covalently tags the target DNA, thereby combining these three distinct steps into a single reaction. Platform-specific sequencing adaptors can be added, and the sample can be enriched and bar-coded using limited-cycle PCR to prepare di-tagged DNA fragment libraries. Nextera technology offers a streamlined, efficient, and high-throughput method for generating bar-coded libraries compatible with multiple next-generation sequencing platforms.
Novel selection methods for DNA-encoded chemical libraries.
Chan, Alix I; McGregor, Lynn M; Liu, David R
2015-06-01
Driven by the need for new compounds to serve as biological probes and leads for therapeutic development and the growing accessibility of DNA technologies including high-throughput sequencing, many academic and industrial groups have begun to use DNA-encoded chemical libraries as a source of bioactive small molecules. In this review, we describe the technologies that have enabled the selection of compounds with desired activities from these libraries. These methods exploit the sensitivity of in vitro selection coupled with DNA amplification to overcome some of the limitations and costs associated with conventional screening methods. In addition, we highlight newer techniques with the potential to be applied to the high-throughput evaluation of DNA-encoded chemical libraries. Copyright © 2015 Elsevier Ltd. All rights reserved.
Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard
2013-01-01
Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce. PMID:23409088
Interaction Analysis through Proteomic Phage Display
2014-01-01
Phage display is a powerful technique for profiling specificities of peptide binding domains. The method is suited for the identification of high-affinity ligands with inhibitor potential when using highly diverse combinatorial peptide phage libraries. Such experiments further provide consensus motifs for genome-wide scanning of ligands of potential biological relevance. A complementary but considerably less explored approach is to display expression products of genomic DNA, cDNA, open reading frames (ORFs), or oligonucleotide libraries designed to encode defined regions of a target proteome on phage particles. One of the main applications of such proteomic libraries has been the elucidation of antibody epitopes. This review is focused on the use of proteomic phage display to uncover protein-protein interactions of potential relevance for cellular function. The method is particularly suited for the discovery of interactions between peptide binding domains and their targets. We discuss the largely unexplored potential of this method in the discovery of domain-motif interactions of potential biological relevance. PMID:25295249
Matvienko, Marta; Kozik, Alexander; Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard
2013-01-01
Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce.
Ulrich, Alexander; Andersen, Kasper R.; Schwartz, Thomas U.
2012-01-01
We present a fast, reliable and inexpensive restriction-free cloning method for seamless DNA insertion into any plasmid without sequence limitation. Exponential megapriming PCR (EMP) cloning requires two consecutive PCR steps and can be carried out in one day. We show that EMP cloning has a higher efficiency than restriction-free (RF) cloning, especially for long inserts above 2.5 kb. EMP further enables simultaneous cloning of multiple inserts. PMID:23300917
Ulrich, Alexander; Andersen, Kasper R; Schwartz, Thomas U
2012-01-01
We present a fast, reliable and inexpensive restriction-free cloning method for seamless DNA insertion into any plasmid without sequence limitation. Exponential megapriming PCR (EMP) cloning requires two consecutive PCR steps and can be carried out in one day. We show that EMP cloning has a higher efficiency than restriction-free (RF) cloning, especially for long inserts above 2.5 kb. EMP further enables simultaneous cloning of multiple inserts.
Naumer, Matthias; Ying, Ying; Michelfelder, Stefan; Reuter, Antje; Trepel, Martin; Müller, Oliver J; Kleinschmidt, Jürgen A
2012-05-01
Libraries based on the insertion of random peptide ligands into the capsid of adeno-associated virus type 2 (AAV2) have been widely used to improve the efficiency and selectivity of the AAV vector system. However, so far only libraries of 7-mer peptide ligands have been inserted at one well-characterized capsid position. Here, we expanded the combinatorial AAV2 display system to a panel of novel AAV libraries, displaying peptides of 5, 7, 12, 19, or 26 amino acids in length at capsid position 588 or displaying 7-mer peptides at position 453, the most prominently exposed region of the viral capsid. Library selections on two unrelated cell types-human coronary artery endothelial cells and rat cardiomyoblasts-revealed the isolation of cell type-characteristic peptides of different lengths mediating strongly improved target-cell transduction, except for the 26-mer peptide ligands. Characterization of vector selectivity by transduction of nontarget cells and comparative gene-transduction analysis using a panel of 44 human tumor cell lines revealed that insertion of different-length peptides allows targeting of distinct cellular receptors for cell entry with similar efficiency, but with different selectivity. The application of such novel AAV2 libraries broadens the spectrum of targetable receptors by capsid-modified AAV vectors and provides the opportunity to choose the best suited targeting ligand for a certain application from a number of different candidates.
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health
Martin, William F.
2017-01-01
Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. PMID:28444372
Suboptimal Doses of Raltegravir Cause Aberrant HIV Integrations | Center for Cancer Research
When a cell is infected with HIV, a DNA copy of the HIV genome is inserted into that cell’s chromosomal DNA. This insertion reaction is carried out by the viral enzyme integrase (IN) and involves two distinct steps: removal of two nucleotides from each 3’ end of the viral DNA, followed by the strand transfer reaction, in which the viral DNA ends are inserted into the host chromosomal DNA. Integration is essential for viral replication, making it an important target for antiviral therapy. Raltegravir, and the other approved integrase inhibitor, Elvitegravir, are called integrase strand transfer inhibitors (INSTIs), because they bind to the active site of IN and block the strand transfer reaction.
Procedure for normalization of cDNA libraries
Bonaldo, M.D.; Soares, M.B.
1997-12-30
This invention provides a method to normalize a cDNA library constructed in a vector capable of being converted to single-stranded circles and capable of producing complementary nucleic acid molecules to the single-stranded circles comprising: (a) converting the cDNA library in single-stranded circles; (b) generating complementary nucleic acid molecules to the single-stranded circles; (c) hybridizing the single-stranded circles converted in step (a) with complementary nucleic acid molecules of step (b) to produce partial duplexes to an appropriate Cot; (e) separating the unhybridized single-stranded circles from the hybridized single-stranded circles, thereby generating a normalized cDNA library. 1 fig.
Stress-Driven Selection of Novel Phenotypes
NASA Technical Reports Server (NTRS)
Fox, George E.; Stepaov, Victor G.; Liu, Yamei
2011-01-01
A process has been developed that can confer novel properties, such as metal resistance, to a host bacterium. This same process can also be used to produce RNAs and peptides that have novel properties, such as the ability to bind particular compounds. It is inherent in the method that the peptide or RNA will behave as expected in the target organism. Plasmid-born mini-gene libraries coding for either a population of combinatorial peptides or stable, artificial RNAs carrying random inserts are produced. These libraries, which have no bias towards any biological function, are used to transform the organism of interest and to serve as an initial source of genetic variation for stress-driven evolution. The transformed bacteria are propagated under selective pressure in order to obtain variants with the desired properties. The process is highly distinct from in vitro methods because the variants are selected in the context of the cell while it is experiencing stress. Hence, the selected peptide or RNA will, by definition, work as expected in the target cell as the cell adapts to its presence during the selection process. Once the novel gene, which produces the sought phenotype, is obtained, it can be transferred to the main genome to increase the genetic stability in the organism. Alternatively, the cell line can be used to produce novel RNAs or peptides with selectable properties in large quantity for separate purposes. The system allows for easy, large-scale purification of the RNAs or peptide products. The process has been reduced to practice by imposing sub-inhibitory concentrations of NiCl2 on cells of the bacterium Escherichia coli that were transformed separately with the peptide library and RNA library. The evolved resistant clones were isolated, and sequences of the selected mini-gene variants were established. Clones resistant to NiCl2 were found to carry identical plasmid variants with a functional mini-gene that specifically conferred significant nickel tolerance on the host cells. Sequencing of the selected mini-gene revealed a propensity of the encoded peptide to bind transient metal ions. Expression of the mini-gene markedly improved growth parameters of the evolved clones at sub-inhibitory concentrations of NiCl2 while being slightly detrimental in the absence of stress. Similar results have been obtained with the RNA libraries. Overall, the results demonstrate a very natural outcome of the selection experiments in which the mini-genes were expected to be either successfully integrated into bacterial genetic networks, or rejected depending upon their effect on host fitness. This described approach can be useful as a laboratory model to study the dynamics of bacterial adaptive evolution on the molecular level. It can also provide a strategy for screening expressed DNA libraries in search of novel genes with desirable properties.
Lab-on-a-chip platform for high throughput drug discovery with DNA-encoded chemical libraries
NASA Astrophysics Data System (ADS)
Grünzner, S.; Reddavide, F. V.; Steinfelder, C.; Cui, M.; Busek, M.; Klotzbach, U.; Zhang, Y.; Sonntag, F.
2017-02-01
The fast development of DNA-encoded chemical libraries (DECL) in the past 10 years has received great attention from pharmaceutical industries. It applies the selection approach for small molecular drug discovery. Because of the limited choices of DNA-compatible chemical reactions, most DNA-encoded chemical libraries have a narrow structural diversity and low synthetic yield. There is also a poor correlation between the ranking of compounds resulted from analyzing the sequencing data and the affinity measured through biochemical assays. By combining DECL with dynamical chemical library, the resulting DNA-encoded dynamic library (EDCCL) explores the thermodynamic equilibrium of reversible reactions as well as the advantages of DNA encoded compounds for manipulation/detection, thus leads to enhanced signal-to-noise ratio of the selection process and higher library quality. However, the library dynamics are caused by the weak interactions between the DNA strands, which also result in relatively low affinity of the bidentate interaction, as compared to a stable DNA duplex. To take advantage of both stably assembled dual-pharmacophore libraries and EDCCLs, we extended the concept of EDCCLs to heat-induced EDCCLs (hi-EDCCLs), in which the heat-induced recombination process of stable DNA duplexes and affinity capture are carried out separately. To replace the extremely laborious and repetitive manual process, a fully automated device will facilitate the use of DECL in drug discovery. Herein we describe a novel lab-on-a-chip platform for high throughput drug discovery with hi-EDCCL. A microfluidic system with integrated actuation was designed which is able to provide a continuous sample circulation by reducing the volume to a minimum. It consists of a cooled and a heated chamber for constant circulation. The system is capable to generate stable temperatures above 75 °C in the heated chamber to melt the double strands of the DNA and less than 15 °C in the cooled chamber, to reanneal the reshuffled library. In the binding chamber (the cooled chamber) specific retaining structures are integrated. These hold back beads functionalized with the target protein, while the chamber is continuously flushed with library molecules. Afterwards the whole system can be flushed with buffer to wash out unspecific bound molecules. Finally the protein-loaded beads with attached molecules can be eluted for further investigation.
A method for high-throughput production of sequence-verified DNA libraries and strain collections.
Smith, Justin D; Schlecht, Ulrich; Xu, Weihong; Suresh, Sundari; Horecka, Joe; Proctor, Michael J; Aiyar, Raeka S; Bennett, Richard A O; Chu, Angela; Li, Yong Fuga; Roy, Kevin; Davis, Ronald W; Steinmetz, Lars M; Hyman, Richard W; Levy, Sasha F; St Onge, Robert P
2017-02-13
The low costs of array-synthesized oligonucleotide libraries are empowering rapid advances in quantitative and synthetic biology. However, high synthesis error rates, uneven representation, and lack of access to individual oligonucleotides limit the true potential of these libraries. We have developed a cost-effective method called Recombinase Directed Indexing (REDI), which involves integration of a complex library into yeast, site-specific recombination to index library DNA, and next-generation sequencing to identify desired clones. We used REDI to generate a library of ~3,300 DNA probes that exhibited > 96% purity and remarkable uniformity (> 95% of probes within twofold of the median abundance). Additionally, we created a collection of ~9,000 individually accessible CRISPR interference yeast strains for > 99% of genes required for either fermentative or respiratory growth, demonstrating the utility of REDI for rapid and cost-effective creation of strain collections from oligonucleotide pools. Our approach is adaptable to any complex DNA library, and fundamentally changes how these libraries can be parsed, maintained, propagated, and characterized. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.
Trujillano, D; Ramos, M D; González, J; Tornador, C; Sotillo, F; Escaramis, G; Ossowski, S; Armengol, L; Casals, T; Estivill, X
2013-07-01
Here we have developed a novel and much more efficient strategy for the complete molecular characterisation of the cystic fibrosis (CF) transmembrane regulator (CFTR) gene, based on multiplexed targeted resequencing. We have tested this approach in a cohort of 92 samples with previously characterised CFTR mutations and polymorphisms. After enrichment of the pooled barcoded DNA libraries with a custom NimbleGen SeqCap EZ Choice array (Roche) and sequencing with a HiSeq2000 (Illumina) sequencer, we applied several bioinformatics tools to call mutations and polymorphisms in CFTR. The combination of several bioinformatics tools allowed us to detect all known pathogenic variants (point mutations, short insertions/deletions, and large genomic rearrangements) and polymorphisms (including the poly-T and poly-thymidine-guanine polymorphic tracts) in the 92 samples. In addition, we report the precise characterisation of the breakpoints of seven genomic rearrangements in CFTR, including those of a novel deletion of exon 22 and a complex 85 kb inversion which includes two large deletions affecting exons 4-8 and 12-21, respectively. This work is a proof-of-principle that targeted resequencing is an accurate and cost-effective approach for the genetic testing of CF and CFTR-related disorders (ie, male infertility) amenable to the routine clinical practice, and ready to substitute classical molecular methods in medical genetics.
Complementation of a red-light-indifferent cyanobacterial mutant.
Chiang, G G; Schaefer, M R; Grossman, A R
1992-01-01
Many cyanobacteria alter their phycobilisome composition in response to changes in light wavelength in a process termed complementary chromatic adaptation. Mutant strains FdR1 and FdR2 of the filamentous cyanobacterium Fremyella diplosiphon are characterized by aberrant chromatic adaptation. Instead of adjusting to different wavelengths of light, FdR1 and FdR2 behave as if they are always in green light; they do not respond to red light. We have previously reported complementation of FdR1 by conjugal transfer of a wild-type genomic library. The complementing DNA has now been localized by genetic analysis to a region on the rescued genomic subclone that contains a gene designated rcaC. This region of DNA is also able to complement FdR2. Southern blot analysis of genomic DNA from FdR1 and FdR2 indicates that these strains harbor DNA insertions within the rcaC sequence that may have resulted from the activity of transposable genetic elements. The predicted amino acid sequence of RcaC shares strong identity to response regulators of bacterial two-component regulatory systems. This relationship is discussed in the context of the signal-transduction pathway mediating regulation of genes encoding phycobilisome polypeptides during chromatic adaptation. Images PMID:1409650
Knutzon, D S; Lardizabal, K D; Nelsen, J S; Bleibaum, J L; Davies, H M; Metz, J G
1995-01-01
Immature coconut (Cocos nucifera) endosperm contains a 1-acyl-sn-glycerol-3-phosphate acyltransferase (LPAAT) activity that shows a preference for medium-chain-length fatty acyl-coenzyme A substrates (H.M. Davies, D.J. Hawkins, J.S. Nelsen [1995] Phytochemistry 39:989-996). Beginning with solubilized membrane preparations, we have used chromatographic separations to identify a polypeptide with an apparent molecular mass of 29 kD, whose presence in various column fractions correlates with the acyltransferase activity detected in those same fractions. Amino acid sequence data obtained from several peptides generated from this protein were used to isolate a full-length clone from a coconut endosperm cDNA library. Clone pCGN5503 contains a 1325-bp cDNA insert with an open reading frame encoding a 308-amino acid protein with a calculated molecular mass of 34.8 kD. Comparison of the deduced amino acid sequence of pCGN5503 to sequences in the data banks revealed significant homology to other putative LPAAT sequences. Expression of the coconut cDNA in Escherichia coli conferred upon those cells a novel LPAAT activity whose substrate activity profile matched that of the coconut enzyme. PMID:8552723
Discovery of DNA repair inhibitors by combinatorial library profiling
Moeller, Benjamin J.; Sidman, Richard L.; Pasqualini, Renata; Arap, Wadih
2011-01-01
Small molecule inhibitors of DNA repair are emerging as potent and selective anti-cancer therapies, but the sheer magnitude of the protein networks involved in DNA repair processes poses obstacles to discovery of effective candidate drugs. To address this challenge, we used a subtractive combinatorial selection approach to identify a panel of peptide ligands that bind DNA repair complexes. Supporting the concept that these ligands have therapeutic potential, we show that one selected peptide specifically binds and non-competitively inactivates DNA-PKcs, a protein kinase critical in double-strand DNA break repair. In doing so, this ligand sensitizes BRCA-deficient tumor cells to genotoxic therapy. Our findings establish a platform for large-scale parallel screening for ligand-directed DNA repair inhibitors, with immediate applicability to cancer therapy. PMID:21343400
Murgha, Yusuf; Beliveau, Brian; Semrau, Kassandra; Schwartz, Donald; Wu, Chao-Ting; Gulari, Erdogan; Rouillard, Jean-Marie
2015-06-01
Oligonucleotide microarrays allow the production of complex custom oligonucleotide libraries for nucleic acid detection-based applications such as fluorescence in situ hybridization (FISH). We have developed a PCR-free method to make single-stranded DNA (ssDNA) fluorescent probes through an intermediate RNA library. A double-stranded oligonucleotide library is amplified by transcription to create an RNA library. Next, dye- or hapten-conjugate primers are used to reverse transcribe the RNA to produce a dye-labeled cDNA library. Finally the RNA is hydrolyzed under alkaline conditions to obtain the single-stranded fluorescent probes library. Starting from unique oligonucleotide library constructs, we present two methods to produce single-stranded probe libraries. The two methods differ in the type of reverse transcription (RT) primer, the incorporation of fluorescent dye, and the purification of fluorescent probes. The first method employs dye-labeled reverse transcription primers to produce multiple differentially single-labeled probe subsets from one microarray library. The fluorescent probes are purified from excess primers by oligonucleotide-bead capture. The second method uses an RNA:DNA chimeric primer and amino-modified nucleotides to produce amino-allyl probes. The excess primers and RNA are hydrolyzed under alkaline conditions, followed by probe purification and labeling with amino-reactive dyes. The fluorescent probes created by the combination of transcription and reverse transcription can be used for FISH and to detect any RNA and DNA targets via hybridization.
Critical factors for assembling a high volume of DNA barcodes
Hajibabaei, Mehrdad; deWaard, Jeremy R; Ivanova, Natalia V; Ratnasingham, Sujeevan; Dooh, Robert T; Kirk, Stephanie L; Mackie, Paula M; Hebert, Paul D.N
2005-01-01
Large-scale DNA barcoding projects are now moving toward activation while the creation of a comprehensive barcode library for eukaryotes will ultimately require the acquisition of some 100 million barcodes. To satisfy this need, analytical facilities must adopt protocols that can support the rapid, cost-effective assembly of barcodes. In this paper we discuss the prospects for establishing high volume DNA barcoding facilities by evaluating key steps in the analytical chain from specimens to barcodes. Alliances with members of the taxonomic community represent the most effective strategy for provisioning the analytical chain with specimens. The optimal protocols for DNA extraction and subsequent PCR amplification of the barcode region depend strongly on their condition, but production targets of 100K barcode records per year are now feasible for facilities working with compliant specimens. The analysis of museum collections is currently challenging, but PCR cocktails that combine polymerases with repair enzyme(s) promise future success. Barcode analysis is already a cost-effective option for species identification in some situations and this will increasingly be the case as reference libraries are assembled and analytical protocols are simplified. PMID:16214753
Selection and Screening of DNA Aptamers for Inorganic Nanomaterials.
Zhou, Yibo; Huang, Zhicheng; Yang, Ronghua; Liu, Juewen
2018-02-21
Searching for DNA sequences that can strongly and selectively bind to inorganic surfaces is a long-standing topic in bionanotechnology, analytical chemistry and biointerface research. This can be achieved either by aptamer selection starting with a very large library of ≈10 14 random DNA sequences, or by careful screening of a much smaller library (usually from a few to a few hundred) with rationally designed sequences. Unlike typical molecular targets, inorganic surfaces often have quite strong DNA adsorption affinities due to polyvalent binding and even chemical interactions. This leads to a very high background binding making aptamer selection difficult. Screening, on the other hand, can be designed to compare relative binding affinities of different DNA sequences and could be more appropriate for inorganic surfaces. The resulting sequences have been used for DNA-directed assembly, sorting of carbon nanotubes, and DNA-controlled growth of inorganic nanomaterials. It was recently discovered that poly-cytosine (C) DNA can strongly bind to a diverse range of nanomaterials including nanocarbons (graphene oxide and carbon nanotubes), various metal oxides and transition-metal dichalcogenides. In this Concept article, we articulate the need for screening and potential artifacts associated with traditional aptamer selection methods for inorganic surfaces. Representative examples of application are discussed, and a few future research opportunities are proposed towards the end of this article. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
SV40 host-substituted variants: a new look at the monkey DNA inserts and recombinant junctions.
Singer, Maxine; Winocour, Ernest
2011-04-10
The available monkey genomic data banks were examined in order to determine the chromosomal locations of the host DNA inserts in 8 host-substituted SV40 variant DNAs. Five of the 8 variants contained more than one linked monkey DNA insert per tandem repeat unit and in all cases but one, the 19 monkey DNA inserts in the 8 variants mapped to different locations in the monkey genome. The 50 parental DNAs (32 monkey and 18 SV40 DNA segments) which spanned the crossover and flanking regions that participated in monkey/monkey and monkey/SV40 recombinations were characterized by substantial levels of microhomology of up to 8 nucleotides in length; the parental DNAs also exhibited direct and inverted repeats at or adjacent to the crossover sequences. We discuss how the host-substituted SV40 variants arose and the nature of the recombination mechanisms involved. Copyright © 2011 Elsevier Inc. All rights reserved.
Maumus, Florian; Blanc, Guillaume
2016-12-14
The nucleocytoplasmic large DNA viruses (NCLDV) are a group of extremely complex double-stranded DNA viruses, which are major parasites of a variety of eukaryotes. Recent studies showed that certain unicellular eukaryotes contain fragments of NCLDV DNA integrated in their genome, when surprisingly many of these organisms were not previously shown to be infected by NCLDVs. These findings prompted us to search the genome of Acanthamoeba castellanii strain Neff (Neff), one of the most prolific hosts in the discovery of giant NCLDVs, for possible DNA inserts of viral origin. We report the identification of 267 markers of lateral gene transfer with viruses, approximately half of which are clustered in Neff genome regions of viral origins, transcriptionally inactive or exhibit nucleotide-composition signatures suggestive of a foreign origin. The integrated viral genes had diverse origin among relatives of viruses that infect Neff, including Mollivirus, Pandoravirus, Marseillevirus, Pithovirus, and Mimivirus However, phylogenetic analysis suggests the existence of a yet-undiscovered family of amoeba-infecting NCLDV in addition to the five already characterized. The active transcription of some apparently anciently integrated virus-like genes suggests that some viral genes might have been domesticated during the amoeba evolution. These insights confirm that genomic insertion of NCLDV DNA is a common theme in eukaryotes. This gene flow contributed fertilizing the eukaryotic gene repertoire and participated in the occurrence of orphan genes, a long standing issue in genomics. Search for viral inserts in eukaryotic genomes followed by environmental screening of the original viruses should be used to isolate radically new NCLDVs. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
NASA Astrophysics Data System (ADS)
Yoshikazu, Kawata; Shin-Ichi, Yano; Hiroyuki, Kojima
1998-03-01
An efficient and simple method for constructing a genomic DNA library using a TA cloning vector is presented. It is based on the sonicative cleavage of genomic DNA and modification of fragment ends with Taq DNA polymerase, followed by ligation using a TA vector. This method was applied for cloning of the phytoene synthase gene crt B from Spirulina platensis. This method is useful when genomic DNA cannot be efficiently digested with restriction enzymes, a problem often encountered during the construction of a genomic DNA library of cyanobacteria.
Isolation and characterization of novel lipases/esterases from a bovine rumen metagenome.
Privé, Florence; Newbold, C Jamie; Kaderbhai, Naheed N; Girdwood, Susan G; Golyshina, Olga V; Golyshin, Peter N; Scollan, Nigel D; Huws, Sharon A
2015-07-01
Improving the health beneficial fatty acid content of meat and milk is a major challenge requiring an increased understanding of rumen lipid metabolism. In this study, we isolated and characterized rumen bacterial lipases/esterases using functional metagenomics. Metagenomic libraries were constructed from DNA extracted from strained rumen fluid (SRF), solid-attached bacteria (SAB) and liquid-associated rumen bacteria (LAB), ligated into a fosmid vector and subsequently transformed into an Escherichia coli host. Fosmid libraries consisted of 7,744; 8,448; and 7,680 clones with an average insert size of 30 to 35 kbp for SRF, SAB and LAB, respectively. Transformants were screened on spirit blue agar plates containing tributyrin for lipase/esterase activity. Five SAB and four LAB clones exhibited lipolytic activity, and no positive clones were found in the SRF library. Fosmids from positive clones were pyrosequenced and twelve putative lipase/esterase genes and two phospholipase genes retrieved. Although the derived proteins clustered into diverse esterase and lipase families, a degree of novelty was seen, with homology ranging from 40 to 78% following BlastP searches. Isolated lipases/esterases exhibited activity against mostly short- to medium-chain substrates across a range of temperatures and pH. The function of these novel enzymes recovered in ruminal metabolism needs further investigation, alongside their potential industrial uses.
Ligation Bias in Illumina Next-Generation DNA Libraries: Implications for Sequencing Ancient Genomes
Seguin-Orlando, Andaine; Schubert, Mikkel; Clary, Joel; Stagegaard, Julia; Alberdi, Maria T.; Prado, José Luis; Prieto, Alfredo; Willerslev, Eske; Orlando, Ludovic
2013-01-01
Ancient DNA extracts consist of a mixture of endogenous molecules and contaminant DNA templates, often originating from environmental microbes. These two populations of templates exhibit different chemical characteristics, with the former showing depurination and cytosine deamination by-products, resulting from post-mortem DNA damage. Such chemical modifications can interfere with the molecular tools used for building second-generation DNA libraries, and limit our ability to fully characterize the true complexity of ancient DNA extracts. In this study, we first use fresh DNA extracts to demonstrate that library preparation based on adapter ligation at AT-overhangs are biased against DNA templates starting with thymine residues, contrarily to blunt-end adapter ligation. We observe the same bias on fresh DNA extracts sheared on Bioruptor, Covaris and nebulizers. This contradicts previous reports suggesting that this bias could originate from the methods used for shearing DNA. This also suggests that AT-overhang adapter ligation efficiency is affected in a sequence-dependent manner and results in an uneven representation of different genomic contexts. We then show how this bias could affect the base composition of ancient DNA libraries prepared following AT-overhang ligation, mainly by limiting the ability to ligate DNA templates starting with thymines and therefore deaminated cytosines. This results in particular nucleotide misincorporation damage patterns, deviating from the signature generally expected for authenticating ancient sequence data. Consequently, we show that models adequate for estimating post-mortem DNA damage levels must be robust to the molecular tools used for building ancient DNA libraries. PMID:24205269
Immune-Related Transcriptome of Coptotermes formosanus Shiraki Workers: The Defense Mechanism
Hussain, Abid; Li, Yi-Feng; Cheng, Yu; Liu, Yang; Chen, Chuan-Cheng; Wen, Shuo-Yang
2013-01-01
Formosan subterranean termites, Coptotermes formosanus Shiraki, live socially in microbial-rich habitats. To understand the molecular mechanism by which termites combat pathogenic microbes, a full-length normalized cDNA library and four Suppression Subtractive Hybridization (SSH) libraries were constructed from termite workers infected with entomopathogenic fungi (Metarhizium anisopliae and Beauveria bassiana), Gram-positive Bacillus thuringiensis and Gram-negative Escherichia coli, and the libraries were analyzed. From the high quality normalized cDNA library, 439 immune-related sequences were identified. These sequences were categorized as pattern recognition receptors (47 sequences), signal modulators (52 sequences), signal transducers (137 sequences), effectors (39 sequences) and others (164 sequences). From the SSH libraries, 27, 17, 22 and 15 immune-related genes were identified from each SSH library treated with M. anisopliae, B. bassiana, B. thuringiensis and E. coli, respectively. When the normalized cDNA library was compared with the SSH libraries, 37 immune-related clusters were found in common; 56 clusters were identified in the SSH libraries, and 259 were identified in the normalized cDNA library. The immune-related gene expression pattern was further investigated using quantitative real time PCR (qPCR). Important immune-related genes were characterized, and their potential functions were discussed based on the integrated analysis of the results. We suggest that normalized cDNA and SSH libraries enable us to discover functional genes transcriptome. The results remarkably expand our knowledge about immune-inducible genes in C. formosanus Shiraki and enable the future development of novel control strategies for the management of Formosan subterranean termites. PMID:23874972
Han, Dong-gang; Duan, Xiao-yi; Guo, You-min; Zhou, Qi; Wang, Quan-ying; Yang, Guang-xiao
2010-01-01
To obtain specific anti-epidermal growth factor receptor variant III (EGFRvIII) single chain antibody (ScFv) by phage antibody library display system. The total RNA was extracted from the spleen B cells of BALB/c mice immunized with pep-3-OVA protein, and the first-strand cDNA was synthesized by reverse transcription. Antibody VH and VL gene fragments were amplified and joined to a ScFv gene with the linker. The ScFv gene was ligated into the phagemid vector pCANTAB5E, which was transformed into competent E. coli TG1. The transformed cells were then infected with M13KO7 helper phage to yield the recombinant phage to construct the phage ScFv library. Pep-3-BSA protein was used to screen the phage antibody library and ELISA carried out to characterize the activity of the antibody. The VH and VL gene fragments of the antibody were about 350 bp and 320 bp in length as analyzed by agarose gel electrophoresis. The ScFv gene was 780 bp, consistent with the expected length. The recombinant phagemid with ScFv gene insert was rescued, and an immune phage ScFv library with the content of 5.0x10(6) was constructed. The recombinant ScFv phage had a titer of 3.0x10(4) cfu/ml, and the fourth phage harvest yielded 56 times as much as that of the first one. SDS-PAGE demonstrated a molecular mass of the soluble ScFv of about 28 kD. ELISA results indicated good specificity of the ScFv to bind EGFRvIII. An immune phage ScFv library is successfully constructed, and the ScFv antibody fragment is capable of specific binding to EGFRvIII.
Langevin, Stanley A; Bent, Zachary W; Solberg, Owen D; Curtis, Deanna J; Lane, Pamela D; Williams, Kelly P; Schoeniger, Joseph S; Sinha, Anupama; Lane, Todd W; Branda, Steven S
2013-04-01
Use of second generation sequencing (SGS) technologies for transcriptional profiling (RNA-Seq) has revolutionized transcriptomics, enabling measurement of RNA abundances with unprecedented specificity and sensitivity and the discovery of novel RNA species. Preparation of RNA-Seq libraries requires conversion of the RNA starting material into cDNA flanked by platform-specific adaptor sequences. Each of the published methods and commercial kits currently available for RNA-Seq library preparation suffers from at least one major drawback, including long processing times, large starting material requirements, uneven coverage, loss of strand information and high cost. We report the development of a new RNA-Seq library preparation technique that produces representative, strand-specific RNA-Seq libraries from small amounts of starting material in a fast, simple and cost-effective manner. Additionally, we have developed a new quantitative PCR-based assay for precisely determining the number of PCR cycles to perform for optimal enrichment of the final library, a key step in all SGS library preparation workflows.
Seto, P; Hirayu, H; Magnusson, R P; Gestautas, J; Portmann, L; DeGroot, L J; Rapoport, B
1987-01-01
The thyroid microsomal antigen (MSA) in autoimmune thyroid disease is a protein of approximately 107 kD. We screened a human thyroid cDNA library constructed in the expression vector lambda gt11 with anti-107-kD monoclonal antibodies. Of five clones obtained, the recombinant beta-galactosidase fusion protein from one clone (PM-5) was confirmed to react with the monoclonal antiserum. The complementary DNA (cDNA) insert from PM-5 (0.8 kb) was used as a probe on Northern blot analysis to estimate the size of the mRNA coding for the MSA. The 2.9-kb messenger RNA (mRNA) species observed was the same size as that coding for human thyroid peroxidase (TPO). The probe did not bind to human liver mRNA, indicating the thyroid-specific nature of the PM-5-related mRNA. The nucleotide sequence of PM-5 (842 bp) was determined and consisted of a single open reading frame. Comparison of the nucleotide sequence of PM-5 with that presently available for pig TPO indicates 84% homology. In conclusion, a cDNA clone representing part of the microsomal antigen has been isolated. Sequence homology with porcine TPO, as well as identity in the size of the mRNA species for both the microsomal antigen and TPO, indicate that the microsomal antigen is, at least in part, TPO. Images PMID:3654979
USDA-ARS?s Scientific Manuscript database
We developed two leafy spurge BAC libraries that together represent approximately 5X coverage of the leafy spurge genome. The BAC libraries have an average insert size of approximately 143 kb, and copies of the library and filters for hybridization-based screening are publicly available through the ...
Blueprints for green biotech: development and application of standards for plant synthetic biology.
Patron, Nicola J
2016-06-15
Synthetic biology aims to apply engineering principles to the design and modification of biological systems and to the construction of biological parts and devices. The ability to programme cells by providing new instructions written in DNA is a foundational technology of the field. Large-scale de novo DNA synthesis has accelerated synthetic biology by offering custom-made molecules at ever decreasing costs. However, for large fragments and for experiments in which libraries of DNA sequences are assembled in different combinations, assembly in the laboratory is still desirable. Biological assembly standards allow DNA parts, even those from multiple laboratories and experiments, to be assembled together using the same reagents and protocols. The adoption of such standards for plant synthetic biology has been cohesive for the plant science community, facilitating the application of genome editing technologies to plant systems and streamlining progress in large-scale, multi-laboratory bioengineering projects. © 2016 The Author(s). published by Portland Press Limited on behalf of the Biochemical Society.
Johnson, LeeAnn K; Brown, Mary B; Carruthers, Ethan A; Ferguson, John A; Dombek, Priscilla E; Sadowsky, Michael J
2004-08-01
A horizontal, fluorophore-enhanced, repetitive extragenic palindromic-PCR (rep-PCR) DNA fingerprinting technique (HFERP) was developed and evaluated as a means to differentiate human from animal sources of Escherichia coli. Box A1R primers and PCR were used to generate 2,466 rep-PCR and 1,531 HFERP DNA fingerprints from E. coli strains isolated from fecal material from known human and 12 animal sources: dogs, cats, horses, deer, geese, ducks, chickens, turkeys, cows, pigs, goats, and sheep. HFERP DNA fingerprinting reduced within-gel grouping of DNA fingerprints and improved alignment of DNA fingerprints between gels, relative to that achieved using rep-PCR DNA fingerprinting. Jackknife analysis of the complete rep-PCR DNA fingerprint library, done using Pearson's product-moment correlation coefficient, indicated that animal and human isolates were assigned to the correct source groups with an 82.2% average rate of correct classification. However, when only unique isolates were examined, isolates from a single animal having a unique DNA fingerprint, Jackknife analysis showed that isolates were assigned to the correct source groups with a 60.5% average rate of correct classification. The percentages of correctly classified isolates were about 15 and 17% greater for rep-PCR and HFERP, respectively, when analyses were done using the curve-based Pearson's product-moment correlation coefficient, rather than the band-based Jaccard algorithm. Rarefaction analysis indicated that, despite the relatively large size of the known-source database, genetic diversity in E. coli was very great and is most likely accounting for our inability to correctly classify many environmental E. coli isolates. Our data indicate that removal of duplicate genotypes within DNA fingerprint libraries, increased database size, proper methods of statistical analysis, and correct alignment of band data within and between gels improve the accuracy of microbial source tracking methods.
Defining the ABC of gene essentiality in streptococci.
Charbonneau, Amelia R L; Forman, Oliver P; Cain, Amy K; Newland, Graham; Robinson, Carl; Boursnell, Mike; Parkhill, Julian; Leigh, James A; Maskell, Duncan J; Waller, Andrew S
2017-05-31
Utilising next generation sequencing to interrogate saturated bacterial mutant libraries provides unprecedented information for the assignment of genome-wide gene essentiality. Exposure of saturated mutant libraries to specific conditions and subsequent sequencing can be exploited to uncover gene essentiality relevant to the condition. Here we present a barcoded transposon directed insertion-site sequencing (TraDIS) system to define an essential gene list for Streptococcus equi subsp. equi, the causative agent of strangles in horses, for the first time. The gene essentiality data for this group C Streptococcus was compared to that of group A and B streptococci. Six barcoded variants of pGh9:ISS1 were designed and used to generate mutant libraries containing between 33,000-66,000 unique mutants. TraDIS was performed on DNA extracted from each library and data were analysed separately and as a combined master pool. Gene essentiality determined that 19.5% of the S. equi genome was essential. Gene essentialities were compared to those of group A and group B streptococci, identifying concordances of 90.2% and 89.4%, respectively and an overall concordance of 83.7% between the three species. The use of barcoded pGh9:ISS1 to generate mutant libraries provides a highly useful tool for the assignment of gene function in S. equi and other streptococci. The shared essential gene set of group A, B and C streptococci provides further evidence of the close genetic relationships between these important pathogenic bacteria. Therefore, the ABC of gene essentiality reported here provides a solid foundation towards reporting the functional genome of streptococci.
Nogales, Balbina; Moore, Edward R. B.; Llobet-Brossa, Enrique; Rossello-Mora, Ramon; Amann, Rudolf; Timmis, Kenneth N.
2001-01-01
The bacterial diversity assessed from clone libraries prepared from rRNA (two libraries) and ribosomal DNA (rDNA) (one library) from polychlorinated biphenyl (PCB)-polluted soil has been analyzed. A good correspondence of the community composition found in the two types of library was observed. Nearly 29% of the cloned sequences in the rDNA library were identical to sequences in the rRNA libraries. More than 60% of the total cloned sequence types analyzed were grouped in phylogenetic groups (a clone group with sequence similarity higher than 97% [98% for Burkholderia and Pseudomonas-type clones]) represented in both types of libraries. Some of those phylogenetic groups, mostly represented by a single (or pair) of cloned sequence type(s), were observed in only one of the types of library. An important difference between the libraries was the lack of clones representative of the Actinobacteria in the rDNA library. The PCB-polluted soil exhibited a high bacterial diversity which included representatives of two novel lineages. The apparent abundance of bacteria affiliated to the beta-subclass of the Proteobacteria, and to the genus Burkholderia in particular, was confirmed by fluorescence in situ hybridization analysis. The possible influence on apparent diversity of low template concentrations was assessed by dilution of the RNA template prior to amplification by reverse transcription-PCR. Although differences in the composition of the two rRNA libraries obtained from high and low RNA concentrations were observed, the main components of the bacterial community were represented in both libraries, and therefore their detection was not compromised by the lower concentrations of template used in this study. PMID:11282645
Twenty-five Years of DNA-Encoded Chemical Libraries.
Neri, Dario
2017-05-04
Reference library: The availability of DNA-encoded chemical libraries containing billions of compounds facilitates the discovery of binding molecules for pharmaceutical applications and for investigating biological processes. This Special Issue highlights the use of this library technology and some of the latest developments in the field. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Googling DNA sequences on the World Wide Web.
Hajibabaei, Mehrdad; Singer, Gregory A C
2009-11-10
New web-based technologies provide an excellent opportunity for sharing and accessing information and using web as a platform for interaction and collaboration. Although several specialized tools are available for analyzing DNA sequence information, conventional web-based tools have not been utilized for bioinformatics applications. We have developed a novel algorithm and implemented it for searching species-specific genomic sequences, DNA barcodes, by using popular web-based methods such as Google. We developed an alignment independent character based algorithm based on dividing a sequence library (DNA barcodes) and query sequence to words. The actual search is conducted by conventional search tools such as freely available Google Desktop Search. We implemented our algorithm in two exemplar packages. We developed pre and post-processing software to provide customized input and output services, respectively. Our analysis of all publicly available DNA barcode sequences shows a high accuracy as well as rapid results. Our method makes use of conventional web-based technologies for specialized genetic data. It provides a robust and efficient solution for sequence search on the web. The integration of our search method for large-scale sequence libraries such as DNA barcodes provides an excellent web-based tool for accessing this information and linking it to other available categories of information on the web.
Optimized Reaction Conditions for Amide Bond Formation in DNA-Encoded Combinatorial Libraries.
Li, Yizhou; Gabriele, Elena; Samain, Florent; Favalli, Nicholas; Sladojevich, Filippo; Scheuermann, Jörg; Neri, Dario
2016-08-08
DNA-encoded combinatorial libraries are increasingly being used as tools for the discovery of small organic binding molecules to proteins of biological or pharmaceutical interest. In the majority of cases, synthetic procedures for the formation of DNA-encoded combinatorial libraries incorporate at least one step of amide bond formation between amino-modified DNA and a carboxylic acid. We investigated reaction conditions and established a methodology by using 1-ethyl-3-(3-(dimethylamino)propyl)carbodiimide, 1-hydroxy-7-azabenzotriazole and N,N'-diisopropylethylamine (EDC/HOAt/DIPEA) in combination, which provided conversions greater than 75% for 423/543 (78%) of the carboxylic acids tested. These reaction conditions were efficient with a variety of primary and secondary amines, as well as with various types of amino-modified oligonucleotides. The reaction conditions, which also worked efficiently over a broad range of DNA concentrations and reaction scales, should facilitate the synthesis of novel DNA-encoded combinatorial libraries.
From the ORFeome concept to highly comprehensive, full-genome screening libraries.
Rid, Raphaela; Abdel-Hadi, Omar; Maier, Richard; Wagner, Martin; Hundsberger, Harald; Hintner, Helmut; Bauer, Johann; Onder, Kamil
2013-02-01
Recombination-based cloning techniques have in recent times facilitated the establishment of genome-scale single-gene ORFeome repositories. Their further handling and downstream application in systematic fashion is, however, practically impeded because of logistical plus economic challenges. At this juncture, simultaneously transferring entire gene collections in compiled pool format could represent an advanced compromise between systematic ORFeome (an organism's entire set of protein-encoding open reading frames) projects and traditional random library approaches, but has not yet been considered in great detail. In our endeavor to merge the comprehensiveness of ORFeomes with a basically simple, streamlined, and easily executable single-tube design, we have here produced five different pooled screening-ready libraries for both Staphylococcus aureus and Homo sapiens. By evaluating the parallel transfer efficiencies of differentially sized genes from initial polymerase chain reaction (PCR) product amplification to entry and final destination library construction via quantitative real-time PCR, we found that the complexity of the gene population is fairly stably maintained once an entry resource has been successfully established, and that no apparent size-selection bias loss of large inserts takes place. Recombinational transfer processes are hence robust enough for straightforwardly achieving such pooled screening libraries.
Young, Robert S
2016-07-01
Frequent evolutionary birth and death events have created a large quantity of biologically important, lineage-specific DNA within mammalian genomes. The birth and death of DNA sequences is so frequent that the total number of these insertions and deletions in the human population remains unknown, although there are differences between these groups, e.g. transposable elements contribute predominantly to sequence insertion. Functional turnover - where the activity of a locus is specific to one lineage, but the underlying DNA remains conserved - can also drive birth and death. However, this does not appear to be a major driver of divergent transcriptional regulation. Both sequence and functional turnover have contributed to the birth and death of thousands of functional promoters in the human and mouse genomes. These findings reveal the pervasive nature of evolutionary birth and death and suggest that lineage-specific regions may play an important but previously underappreciated role in human biology and disease. © 2016 The Authors BioEssays Published by WILEY Periodicals, Inc.
Display of a maize cDNA library on baculovirus infected insect cells.
Meller Harel, Helene Y; Fontaine, Veronique; Chen, Hongying; Jones, Ian M; Millner, Paul A
2008-08-12
Maize is a good model system for cereal crop genetics and development because of its rich genetic heritage and well-characterized morphology. The sequencing of its genome is well advanced, and new technologies for efficient proteomic analysis are needed. Baculovirus expression systems have been used for the last twenty years to express in insect cells a wide variety of eukaryotic proteins that require complex folding or extensive posttranslational modification. More recently, baculovirus display technologies based on the expression of foreign sequences on the surface of Autographa californica (AcMNPV) have been developed. We investigated the potential of a display methodology for a cDNA library of maize young seedlings. We constructed a full-length cDNA library of young maize etiolated seedlings in the transfer vector pAcTMVSVG. The library contained a total of 2.5 x 10(5) independent clones. Expression of two known maize proteins, calreticulin and auxin binding protein (ABP1), was shown by western blot analysis of protein extracts from insect cells infected with the cDNA library. Display of the two proteins in infected insect cells was shown by selective biopanning using magnetic cell sorting and demonstrated proof of concept that the baculovirus maize cDNA display library could be used to identify and isolate proteins. The maize cDNA library constructed in this study relies on the novel technology of baculovirus display and is unique in currently published cDNA libraries. Produced to demonstrate proof of principle, it opens the way for the development of a eukaryotic in vivo display tool which would be ideally suited for rapid screening of the maize proteome for binding partners, such as proteins involved in hormone regulation or defence.
The ATRX cDNA is prone to bacterial IS10 element insertions that alter its structure.
Valle-García, David; Griffiths, Lyra M; Dyer, Michael A; Bernstein, Emily; Recillas-Targa, Félix
2014-01-01
The SWI/SNF-like chromatin-remodeling protein ATRX has emerged as a key factor in the regulation of α-globin gene expression, incorporation of histone variants into the chromatin template and, more recently, as a frequently mutated gene across a wide spectrum of cancers. Therefore, the availability of a functional ATRX cDNA for expression studies is a valuable tool for the scientific community. We have identified two independent transposon insertions of a bacterial IS10 element into exon 8 of ATRX isoform 2 coding sequence in two different plasmids derived from a single source. We demonstrate that these insertion events are common and there is an insertion hotspot within the ATRX cDNA. Such IS10 insertions produce a truncated form of ATRX, which significantly compromises its nuclear localization. In turn, we describe ways to prevent IS10 insertion during propagation and cloning of ATRX-containing vectors, including optimal growth conditions, bacterial strains, and suggested sequencing strategies. Finally, we have generated an insertion-free plasmid that is available to the community for expression studies of ATRX.
Petersen, David W; Kawasaki, Ernest S
2007-01-01
DNA microarray technology has become a powerful tool in the arsenal of the molecular biologist. Capitalizing on high precision robotics and the wealth of DNA sequences annotated from the genomes of a large number of organisms, the manufacture of microarrays is now possible for the average academic laboratory with the funds and motivation. Microarray production requires attention to both biological and physical resources, including DNA libraries, robotics, and qualified personnel. While the fabrication of microarrays is a very labor-intensive process, production of quality microarrays individually tailored on a project-by-project basis will help researchers shed light on future scientific questions.
Eukaryotic ribosome display with in situ DNA recovery.
He, Mingyue; Edwards, Bryan M; Kastelic, Damjana; Taussig, Michael J
2012-01-01
Ribosome display is a cell-free display technology for in vitro selection and optimisation of proteins from large diversified libraries. It operates through the formation of stable protein-ribosome-mRNA (PRM) complexes and selection of ligand-binding proteins, followed by DNA recovery from the selected genetic information. Both prokaryotic and eukaryotic ribosome display systems have been developed. In this chapter, we describe the eukaryotic rabbit reticulocyte method in which a distinct in situ single-primer RT-PCR procedure is used to recover DNA from the selected PRM complexes without the need for prior disruption of the ribosome.
Triple helix purification and sequencing
Wang, Renfeng; Smith, Lloyd M.; Tong, Xinchun E.
1995-01-01
Disclosed herein are methods, kits, and equipment for purifying single stranded circular DNA and then using the DNA for DNA sequencing purposes. Templates are provided with an insert having a hybridization region. An elongated oligonucleotide has two regions that are complementary to the insert and the oligo is bound to a magnetic anchor. The oligo hybridizes to the insert on two sides to form a stable triple helix complex. The anchor can then be used to drag the template out of solution using a magnet. The system can purify sequencing templates, and if desired the triple helix complex can be opened up to a double helix so that the oligonucleotide will act as a primer for further DNA synthesis.
Triple helix purification and sequencing
Wang, R.; Smith, L.M.; Tong, X.E.
1995-03-28
Disclosed herein are methods, kits, and equipment for purifying single stranded circular DNA and then using the DNA for DNA sequencing purposes. Templates are provided with an insert having a hybridization region. An elongated oligonucleotide has two regions that are complementary to the insert and the oligo is bound to a magnetic anchor. The oligo hybridizes to the insert on two sides to form a stable triple helix complex. The anchor can then be used to drag the template out of solution using a magnet. The system can purify sequencing templates, and if desired the triple helix complex can be opened up to a double helix so that the oligonucleotide will act as a primer for further DNA synthesis. 4 figures.
Reducing DNA context dependence in bacterial promoters
Carr, Swati B.; Densmore, Douglas M.
2017-01-01
Variation in the DNA sequence upstream of bacterial promoters is known to affect the expression levels of the products they regulate, sometimes dramatically. While neutral synthetic insulator sequences have been found to buffer promoters from upstream DNA context, there are no established methods for designing effective insulator sequences with predictable effects on expression levels. We address this problem with Degenerate Insulation Screening (DIS), a novel method based on a randomized 36-nucleotide insulator library and a simple, high-throughput, flow-cytometry-based screen that randomly samples from a library of 436 potential insulated promoters. The results of this screen can then be compared against a reference uninsulated device to select a set of insulated promoters providing a precise level of expression. We verify this method by insulating the constitutive, inducible, and repressible promotors of a four transcriptional-unit inverter (NOT-gate) circuit, finding both that order dependence is largely eliminated by insulation and that circuit performance is also significantly improved, with a 5.8-fold mean improvement in on/off ratio. PMID:28422998
DNA-Compatible Nitro Reduction and Synthesis of Benzimidazoles.
Du, Huang-Chi; Huang, Hongbing
2017-10-18
DNA-encoded chemical libraries have emerged as a cost-effective alternative to high-throughput screening (HTS) for hit identification in drug discovery. A key factor for productive DNA-encoded libraries is the chemical diversity of the small molecule moiety attached to an encoding DNA oligomer. The library structure diversity is often limited to DNA-compatible chemical reactions in aqueous media. Herein, we describe a facile process for reducing aryl nitro groups to aryl amines. The new protocol offers simple operation and circumvents the pyrophoric potential of the conventional method (Raney nickel). The reaction is performed in aqueous solution and does not compromise DNA structural integrity. The utility of this method is demonstrated by the versatile synthesis of benzimidazoles on DNA.
Trinh, T. Q.; Sinden, R. R.
1993-01-01
We describe a system to measure the frequency of both deletions and duplications between direct repeats. Short 17- and 18-bp palindromic and nonpalindromic DNA sequences were cloned into the EcoRI site within the chloramphenicol acetyltransferase gene of plasmids pBR325 and pJT7. This creates an insert between direct repeated EcoRI sites and results in a chloramphenicol-sensitive phenotype. Selection for chloramphenicol resistance was utilized to select chloramphenicol resistant revertants that included those with precise deletion of the insert from plasmid pBR325 and duplication of the insert in plasmid pJT7. The frequency of deletion or duplication varied more than 500-fold depending on the sequence of the short sequence inserted into the EcoRI site. For the nonpalindromic inserts, multiple internal direct repeats and the length of the direct repeats appear to influence the frequency of deletion. Certain palindromic DNA sequences with the potential to form DNA hairpin structures that might stabilize the misalignment of direct repeats had a high frequency of deletion. Other DNA sequences with the potential to form structures that might destabilize misalignment of direct repeats had a very low frequency of deletion. Duplication mutations occurred at the highest frequency when the DNA between the direct repeats contained no direct or inverted repeats. The presence of inverted repeats dramatically reduced the frequency of duplications. The results support the slippage-misalignment model, suggesting that misalignment occurring during DNA replication leads to deletion and duplication mutations. The results also support the idea that the formation of DNA secondary structures during DNA replication can facilitate and direct specific mutagenic events. PMID:8325478
Wu, Zining; Graybill, Todd L; Zeng, Xin; Platchek, Michael; Zhang, Jean; Bodmer, Vera Q; Wisnoski, David D; Deng, Jianghe; Coppo, Frank T; Yao, Gang; Tamburino, Alex; Scavello, Genaro; Franklin, G Joseph; Mataruse, Sibongile; Bedard, Katie L; Ding, Yun; Chai, Jing; Summerfield, Jennifer; Centrella, Paolo A; Messer, Jeffrey A; Pope, Andrew J; Israel, David I
2015-12-14
DNA-encoded small-molecule library technology has recently emerged as a new paradigm for identifying ligands against drug targets. To date, this technology has been used with soluble protein targets that are produced and used in a purified state. Here, we describe a cell-based method for identifying small-molecule ligands from DNA-encoded libraries against integral membrane protein targets. We use this method to identify novel, potent, and specific inhibitors of NK3, a member of the tachykinin family of G-protein coupled receptors (GPCRs). The method is simple and broadly applicable to other GPCRs and integral membrane proteins. We have extended the application of DNA-encoded library technology to membrane-associated targets and demonstrate the feasibility of selecting DNA-tagged, small-molecule ligands from complex combinatorial libraries against targets in a heterogeneous milieu, such as the surface of a cell.
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.
Hazkani-Covo, Einat; Martin, William F
2017-05-01
Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Colby, Sheila M.; Crock, John; Dowdle-Rizzo, Barbara; Lemaux, Peggy G.; Croteau, Rodney
1998-01-01
Germacrene C was found by GC-MS and NMR analysis to be the most abundant sesquiterpene in the leaf oil of Lycopersicon esculentum cv. VFNT Cherry, with lesser amounts of germacrene A, guaia-6,9-diene, germacrene B, β-caryophyllene, α-humulene, and germacrene D. Soluble enzyme preparations from leaves catalyzed the divalent metal ion-dependent cyclization of [1-3H]farnesyl diphosphate to these same sesquiterpene olefins, as determined by radio-GC. To obtain a germacrene synthase cDNA, a set of degenerate primers was constructed based on conserved amino acid sequences of related terpenoid cyclases. With cDNA prepared from leaf epidermis-enriched mRNA, these primers amplified a 767-bp fragment that was used as a hybridization probe to screen the cDNA library. Thirty-one clones were evaluated for functional expression of terpenoid cyclase activity in Escherichia coli by using labeled geranyl, farnesyl, and geranylgeranyl diphosphates as substrates. Nine cDNA isolates expressed sesquiterpene synthase activity, and GC-MS analysis of the products identified germacrene C with smaller amounts of germacrene A, B, and D. None of the expressed proteins was active with geranylgeranyl diphosphate; however, one truncated protein converted geranyl diphosphate to the monoterpene limonene. The cDNA inserts specify a deduced polypeptide of 548 amino acids (Mr = 64,114), and sequence comparison with other plant sesquiterpene cyclases indicates that germacrene C synthase most closely resembles cotton δ-cadinene synthase (50% identity). PMID:9482865
Boltaña, Sebastian; Castellana, Barbara; Goetz, Giles; Tort, Lluis; Teles, Mariana; Mulero, Victor; Novoa, Beatriz; Figueras, Antonio; Goetz, Frederick W; Gallardo-Escarate, Cristian; Planas, Josep V; Mackenzie, Simon
2017-02-03
This study describes the development and validation of an enriched oligonucleotide-microarray platform for Sparus aurata (SAQ) to provide a platform for transcriptomic studies in this species. A transcriptome database was constructed by assembly of gilthead sea bream sequences derived from public repositories of mRNA together with reads from a large collection of expressed sequence tags (EST) from two extensive targeted cDNA libraries characterizing mRNA transcripts regulated by both bacterial and viral challenge. The developed microarray was further validated by analysing monocyte/macrophage activation profiles after challenge with two Gram-negative bacterial pathogen-associated molecular patterns (PAMPs; lipopolysaccharide (LPS) and peptidoglycan (PGN)). Of the approximately 10,000 EST sequenced, we obtained a total of 6837 EST longer than 100 nt, with 3778 and 3059 EST obtained from the bacterial-primed and from the viral-primed cDNA libraries, respectively. Functional classification of contigs from the bacterial- and viral-primed cDNA libraries by Gene Ontology (GO) showed that the top five represented categories were equally represented in the two libraries: metabolism (approximately 24% of the total number of contigs), carrier proteins/membrane transport (approximately 15%), effectors/modulators and cell communication (approximately 11%), nucleoside, nucleotide and nucleic acid metabolism (approximately 7.5%) and intracellular transducers/signal transduction (approximately 5%). Transcriptome analyses using this enriched oligonucleotide platform identified differential shifts in the response to PGN and LPS in macrophage-like cells, highlighting responsive gene-cassettes tightly related to PAMP host recognition. As observed in other fish species, PGN is a powerful activator of the inflammatory response in S. aurata macrophage-like cells. We have developed and validated an oligonucleotide microarray (SAQ) that provides a platform enriched for the study of gene expression in S. aurata with an emphasis upon immunity and the immune response.
Schiavo, Giuseppina; Hoffmann, Orsolya Ivett; Ribani, Anisa; Utzeri, Valerio Joe; Ghionda, Marco Ciro; Bertolini, Francesca; Geraci, Claudia; Bovo, Samuele; Fontanesi, Luca
2017-10-01
Nuclear DNA sequences of mitochondrial origin (numts) are derived by insertion of mitochondrial DNA (mtDNA), into the nuclear genome. In this study, we provide, for the first time, a genome picture of numts inserted in the pig nuclear genome. The Sus scrofa reference nuclear genome (Sscrofa10.2) was aligned with circularized and consensus mtDNA sequences using LAST software. A total of 430 numt sequences that may represent 246 different numt integration events (57 numt regions determined by at least two numt sequences and 189 singletons) were identified, covering about 0.0078% of the nuclear genome. Numt integration events were correlated (0.99) to the chromosome length. The longest numt sequence (about 11 kbp) was located on SSC2. Six numts were sequenced and PCR amplified in pigs of European commercial and local pig breeds, of the Chinese Meishan breed and in European wild boars. Three of them were polymorphic for the presence or absence of the insertion. Surprisingly, the estimated age of insertion of two of the three polymorphic numts was more ancient than that of the speciation time of the Sus scrofa, supporting that these polymorphic sites were originated from interspecies admixture that contributed to shape the pig genome. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Nithaniyal, Stalin; Newmaster, Steven G; Ragupathy, Subramanyam; Krishnamoorthy, Devanathan; Vassou, Sophie Lorraine; Parani, Madasamy
2014-01-01
India is rich with biodiversity, which includes a large number of endemic, rare and threatened plant species. Previous studies have used DNA barcoding to inventory species for applications in biodiversity monitoring, conservation impact assessment, monitoring of illegal trading, authentication of traded medicinal plants etc. This is the first tropical dry evergreen forest (TDEF) barcode study in the World and the first attempt to assemble a reference barcode library for the trees of India as part of a larger project initiated by this research group. We sampled 429 trees representing 143 tropical dry evergreen forest (TDEF) species, which included 16 threatened species. DNA barcoding was completed using rbcL and matK markers. The tiered approach (1st tier rbcL; 2nd tier matK) correctly identified 136 out of 143 species (95%). This high level of species resolution was largely due to the fact that the tree species were taxonomically diverse in the TDEF. Ability to resolve taxonomically diverse tree species of TDEF was comparable among the best match method, the phylogenetic method, and the characteristic attribute organization system method. We demonstrated the utility of the TDEF reference barcode library to authenticate wood samples from timber operations in the TDEF. This pilot research study will enable more comprehensive surveys of the illegal timber trade of threatened species in the TDEF. This TDEF reference barcode library also contains trees that have medicinal properties, which could be used to monitor unsustainable and indiscriminate collection of plants from the wild for their medicinal value.
Nithaniyal, Stalin; Newmaster, Steven G.; Ragupathy, Subramanyam; Krishnamoorthy, Devanathan; Vassou, Sophie Lorraine; Parani, Madasamy
2014-01-01
Background India is rich with biodiversity, which includes a large number of endemic, rare and threatened plant species. Previous studies have used DNA barcoding to inventory species for applications in biodiversity monitoring, conservation impact assessment, monitoring of illegal trading, authentication of traded medicinal plants etc. This is the first tropical dry evergreen forest (TDEF) barcode study in the World and the first attempt to assemble a reference barcode library for the trees of India as part of a larger project initiated by this research group. Methodology/Principal Findings We sampled 429 trees representing 143 tropical dry evergreen forest (TDEF) species, which included 16 threatened species. DNA barcoding was completed using rbcL and matK markers. The tiered approach (1st tier rbcL; 2nd tier matK) correctly identified 136 out of 143 species (95%). This high level of species resolution was largely due to the fact that the tree species were taxonomically diverse in the TDEF. Ability to resolve taxonomically diverse tree species of TDEF was comparable among the best match method, the phylogenetic method, and the characteristic attribute organization system method. Conclusions We demonstrated the utility of the TDEF reference barcode library to authenticate wood samples from timber operations in the TDEF. This pilot research study will enable more comprehensive surveys of the illegal timber trade of threatened species in the TDEF. This TDEF reference barcode library also contains trees that have medicinal properties, which could be used to monitor unsustainable and indiscriminate collection of plants from the wild for their medicinal value. PMID:25259794
Piggott, Andrew M; Kriegel, Alison M; Willows, Robert D; Karuso, Peter
2009-10-01
Reverse chemical proteomics using T7 phage display is a powerful technique for identifying cellular receptors of biologically active small molecules. However, to date this method has generally been limited to cDNA libraries constructed from mRNA isolated from eukaryotes. In this paper, we describe the construction of the first prokaryotic T7 phage display libraries from randomly digested Pseudomonas stutzeri and Vibrio fischeri gDNA, as well as a plant cDNA library from Arabidopsis thaliana. We also describe the use of T7 phage display to identify novel proteins from environmental DNA samples using biotinylated FK506 as a model affinity probe.
Key Aspects of Nucleic Acid Library Design for in Vitro Selection
Vorobyeva, Maria A.; Davydova, Anna S.; Vorobjev, Pavel E.; Pyshnyi, Dmitrii V.; Venyaminova, Alya G.
2018-01-01
Nucleic acid aptamers capable of selectively recognizing their target molecules have nowadays been established as powerful and tunable tools for biospecific applications, be it therapeutics, drug delivery systems or biosensors. It is now generally acknowledged that in vitro selection enables one to generate aptamers to almost any target of interest. However, the success of selection and the affinity of the resulting aptamers depend to a large extent on the nature and design of an initial random nucleic acid library. In this review, we summarize and discuss the most important features of the design of nucleic acid libraries for in vitro selection such as the nature of the library (DNA, RNA or modified nucleotides), the length of a randomized region and the presence of fixed sequences. We also compare and contrast different randomization strategies and consider computer methods of library design and some other aspects. PMID:29401748
Studier, F. William
1995-04-18
Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.
Studier, F.W.
1995-04-18
Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.
Robust Sub-nanomolar Library Preparation for High Throughput Next Generation Sequencing.
Wu, Wells W; Phue, Je-Nie; Lee, Chun-Ting; Lin, Changyi; Xu, Lai; Wang, Rong; Zhang, Yaqin; Shen, Rong-Fong
2018-05-04
Current library preparation protocols for Illumina HiSeq and MiSeq DNA sequencers require ≥2 nM initial library for subsequent loading of denatured cDNA onto flow cells. Such amounts are not always attainable from samples having a relatively low DNA or RNA input; or those for which a limited number of PCR amplification cycles is preferred (less PCR bias and/or more even coverage). A well-tested sub-nanomolar library preparation protocol for Illumina sequencers has however not been reported. The aim of this study is to provide a much needed working protocol for sub-nanomolar libraries to achieve outcomes as informative as those obtained with the higher library input (≥ 2 nM) recommended by Illumina's protocols. Extensive studies were conducted to validate a robust sub-nanomolar (initial library of 100 pM) protocol using PhiX DNA (as a control), genomic DNA (Bordetella bronchiseptica and microbial mock community B for 16S rRNA gene sequencing), messenger RNA, microRNA, and other small noncoding RNA samples. The utility of our protocol was further explored for PhiX library concentrations as low as 25 pM, which generated only slightly fewer than 50% of the reads achieved under the standard Illumina protocol starting with > 2 nM. A sub-nanomolar library preparation protocol (100 pM) could generate next generation sequencing (NGS) results as robust as the standard Illumina protocol. Following the sub-nanomolar protocol, libraries with initial concentrations as low as 25 pM could also be sequenced to yield satisfactory and reproducible sequencing results.
Oxidized nucleotide insertion by pol β confounds ligation during base excision repair
Çağlayan, Melike; Horton, Julie K.; Dai, Da-Peng; Stefanick, Donna F.; Wilson, Samuel H.
2017-01-01
Oxidative stress in cells can lead to accumulation of reactive oxygen species and oxidation of DNA precursors. Oxidized purine nucleotides can be inserted into DNA during replication and repair. The main pathway for correcting oxidized bases in DNA is base excision repair (BER), and in vertebrates DNA polymerase β (pol β) provides gap filling and tailoring functions. Here we report that the DNA ligation step of BER is compromised after pol β insertion of oxidized purine nucleotides into the BER intermediate in vitro. These results suggest the possibility that BER mediated toxic strand breaks are produced in cells under oxidative stress conditions. We observe enhanced cytotoxicity in oxidizing-agent treated pol β expressing mouse fibroblasts, suggesting formation of DNA strand breaks under these treatment conditions. Increased cytotoxicity following MTH1 knockout or treatment with MTH1 inhibitor suggests the oxidation of precursor nucleotides. PMID:28067232
Stability and dynamics of membrane-spanning DNA nanopores
NASA Astrophysics Data System (ADS)
Maingi, Vishal; Burns, Jonathan R.; Uusitalo, Jaakko J.; Howorka, Stefan; Marrink, Siewert J.; Sansom, Mark S. P.
2017-03-01
Recently developed DNA-based analogues of membrane proteins have advanced synthetic biology. A fundamental question is how hydrophilic nanostructures reside in the hydrophobic environment of the membrane. Here, we use multiscale molecular dynamics (MD) simulations to explore the structure, stability and dynamics of an archetypical DNA nanotube inserted via a ring of membrane anchors into a phospholipid bilayer. Coarse-grained MD reveals that the lipids reorganize locally to interact closely with the membrane-spanning section of the DNA tube. Steered simulations along the bilayer normal establish the metastable nature of the inserted pore, yielding a force profile with barriers for membrane exit due to the membrane anchors. Atomistic, equilibrium simulations at two salt concentrations confirm the close packing of lipid around of the stably inserted DNA pore and its cation selectivity, while revealing localized structural fluctuations. The wide-ranging and detailed insight informs the design of next-generation DNA pores for synthetic biology or biomedicine.
Sonet, Gontran; Jordaens, Kurt; Braet, Yves; Bourguignon, Luc; Dupont, Eréna; Backeljau, Thierry; De Meyer, Marc; Desmyter, Stijn
2013-01-01
Abstract Fly larvae living on dead corpses can be used to estimate post-mortem intervals. The identification of these flies is decisive in forensic casework and can be facilitated by using DNA barcodes provided that a representative and comprehensive reference library of DNA barcodes is available. We constructed a local (Belgium and France) reference library of 85 sequences of the COI DNA barcode fragment (mitochondrial cytochrome c oxidase subunit I gene), from 16 fly species of forensic interest (Calliphoridae, Muscidae, Fanniidae). This library was then used to evaluate the ability of two public libraries (GenBank and the Barcode of Life Data Systems – BOLD) to identify specimens from Belgian and French forensic cases. The public libraries indeed allow a correct identification of most specimens. Yet, some of the identifications remain ambiguous and some forensically important fly species are not, or insufficiently, represented in the reference libraries. Several search options offered by GenBank and BOLD can be used to further improve the identifications obtained from both libraries using DNA barcodes. PMID:24453564
Reiterative Recombination for the in vivo assembly of libraries of multigene pathways.
Wingler, Laura M; Cornish, Virginia W
2011-09-13
The increasing sophistication of synthetic biology is creating a demand for robust, broadly accessible methodology for constructing multigene pathways inside of the cell. Due to the difficulty of rationally designing pathways that function as desired in vivo, there is a further need to assemble libraries of pathways in parallel, in order to facilitate the combinatorial optimization of performance. While some in vitro DNA assembly methods can theoretically make libraries of pathways, these techniques are resource intensive and inherently require additional techniques to move the DNA back into cells. All previously reported in vivo assembly techniques have been low yielding, generating only tens to hundreds of constructs at a time. Here, we develop "Reiterative Recombination," a robust method for building multigene pathways directly in the yeast chromosome. Due to its use of endonuclease-induced homologous recombination in conjunction with recyclable markers, Reiterative Recombination provides a highly efficient, technically simple strategy for sequentially assembling an indefinite number of DNA constructs at a defined locus. In this work, we describe the design and construction of the first Reiterative Recombination system in Saccharomyces cerevisiae, and we show that it can be used to assemble multigene constructs. We further demonstrate that Reiterative Recombination can construct large mock libraries of at least 10(4) biosynthetic pathways. We anticipate that our system's simplicity and high efficiency will make it a broadly accessible technology for pathway construction and render it a valuable tool for optimizing pathways in vivo.
Reiterative Recombination for the in vivo assembly of libraries of multigene pathways
Wingler, Laura M.; Cornish, Virginia W.
2011-01-01
The increasing sophistication of synthetic biology is creating a demand for robust, broadly accessible methodology for constructing multigene pathways inside of the cell. Due to the difficulty of rationally designing pathways that function as desired in vivo, there is a further need to assemble libraries of pathways in parallel, in order to facilitate the combinatorial optimization of performance. While some in vitro DNA assembly methods can theoretically make libraries of pathways, these techniques are resource intensive and inherently require additional techniques to move the DNA back into cells. All previously reported in vivo assembly techniques have been low yielding, generating only tens to hundreds of constructs at a time. Here, we develop “Reiterative Recombination,” a robust method for building multigene pathways directly in the yeast chromosome. Due to its use of endonuclease-induced homologous recombination in conjunction with recyclable markers, Reiterative Recombination provides a highly efficient, technically simple strategy for sequentially assembling an indefinite number of DNA constructs at a defined locus. In this work, we describe the design and construction of the first Reiterative Recombination system in Saccharomyces cerevisiae, and we show that it can be used to assemble multigene constructs. We further demonstrate that Reiterative Recombination can construct large mock libraries of at least 104 biosynthetic pathways. We anticipate that our system’s simplicity and high efficiency will make it a broadly accessible technology for pathway construction and render it a valuable tool for optimizing pathways in vivo. PMID:21876185
Orphan, V J; Taylor, L T; Hafenbradl, D; Delong, E F
2000-02-01
Recent investigations of oil reservoirs in a variety of locales have indicated that these habitats may harbor active thermophilic prokaryotic assemblages. In this study, we used both molecular and culture-based methods to characterize prokaryotic consortia associated with high-temperature, sulfur-rich oil reservoirs in California. Enrichment cultures designed for anaerobic thermophiles, both autotrophic and heterotrophic, were successful at temperatures ranging from 60 to 90 degrees C. Heterotrophic enrichments from all sites yielded sheathed rods (Thermotogales), pleomorphic rods resembling Thermoanaerobacter, and Thermococcus-like isolates. The predominant autotrophic microorganisms recovered from inorganic enrichments using H(2), acetate, and CO(2) as energy and carbon sources were methanogens, including isolates closely related to Methanobacterium, Methanococcus, and Methanoculleus species. Two 16S rRNA gene (rDNA) libraries were generated from total community DNA collected from production wellheads, using either archaeal or universal oligonucleotide primer sets. Sequence analysis of the universal library indicated that a large percentage of clones were highly similar to known bacterial and archaeal isolates recovered from similar habitats. Represented genera in rDNA clone libraries included Thermoanaerobacter, Thermococcus, Desulfothiovibrio, Aminobacterium, Acidaminococcus, Pseudomonas, Halomonas, Acinetobacter, Sphingomonas, Methylobacterium, and Desulfomicrobium. The archaeal library was dominated by methanogen-like rDNAs, with a lower percentage of clones belonging to the Thermococcales. Our results strongly support the hypothesis that sulfur-utilizing and methane-producing thermophilic microorganisms have a widespread distribution in oil reservoirs and the potential to actively participate in the biogeochemical transformation of carbon, hydrogen, and sulfur in situ.
2011-01-01
Background Lupinus angustifolius L, also known as narrow-leafed lupin (NLL), is becoming an important grain legume crop that is valuable for sustainable farming and is becoming recognised as a potential human health food. Recent interest is being directed at NLL to improve grain production, disease and pest management and health benefits of the grain. However, studies have been hindered by a lack of extensive genomic resources for the species. Results A NLL BAC library was constructed consisting of 111,360 clones with an average insert size of 99.7 Kbp from cv Tanjil. The library has approximately 12 × genome coverage. Both ends of 9600 randomly selected BAC clones were sequenced to generate 13985 BAC end-sequences (BESs), covering approximately 1% of the NLL genome. These BESs permitted a preliminary characterisation of the NLL genome such as organisation and composition, with the BESs having approximately 39% G:C content, 16.6% repetitive DNA and 5.4% putative gene-encoding regions. From the BESs 9966 simple sequence repeat (SSR) motifs were identified and some of these are shown to be potential markers. Conclusions The NLL BAC library and BAC-end sequences are powerful resources for genetic and genomic research on lupin. These resources will provide a robust platform for future high-resolution mapping, map-based cloning, comparative genomics and assembly of whole-genome sequencing data for the species. PMID:22014081
Gao, Jin-Xin; Jing, Jing; Yu, Chuan-Jin; Chen, Jie
2015-06-01
Curvularia lunata is an important maize foliar fungal pathogen that distributes widely in maize growing area in China, and several key pathogenic factors have been isolated. An yeast two-hybrid (Y2H) library is a very useful platform to further unravel novel pathogenic factors in C. lunata. To construct a high-quality full length-expression cDNA library from the C. lunata for application to pathogenesis-related protein-protein interaction screening, total RNA was extracted. The SMART (Switching Mechanism At 5' end of the RNA Transcript) technique was used for cDNA synthesis. Double-stranded cDNA was ligated into the pGADT7-Rec vector with Herring Testes Carrier DNA using homologous recombination method. The ligation mixture was transformed into competent yeast AH109 cells to construct the primary cDNA library. Eventually, a high qualitative library was successfully established according to an evaluation on quality. The transformation efficiency was about 6.39 ×10(5) transformants/3 μg pGADT7-Rec. The titer of the primary cDNA library was 2.5×10(8) cfu/mL. The numbers for the cDNA library was 2.46×10(5). Randomly picked clones show that the recombination rate was 88.24%. Gel electrophoresis results indicated that the fragments ranged from 0.4 kb to 3.0 kb. Melanin synthesis protein Brn1 (1,3,8-hydroxynaphthalene reductase) was used as a "bait" to test the sufficiency of the Y2H library. As a result, a cDNA clone encoding VelB protein that was known to be involved in the regulation of diverse cellular processes, including control of secondary metabolism containing melanin and toxin production in many filamentous fungi was identified. Further study on the exact role of the VelB gene is underway.
Non-biased and efficient global amplification of a single-cell cDNA library
Huang, Huan; Goto, Mari; Tsunoda, Hiroyuki; Sun, Lizhou; Taniguchi, Kiyomi; Matsunaga, Hiroko; Kambara, Hideki
2014-01-01
Analysis of single-cell gene expression promises a more precise understanding of molecular mechanisms of a living system. Most techniques only allow studies of the expressions for limited numbers of gene species. When amplification of cDNA was carried out for analysing more genes, amplification biases were frequently reported. A non-biased and efficient global-amplification method, which uses a single-cell cDNA library immobilized on beads, was developed for analysing entire gene expressions for single cells. Every step in this analysis from reverse transcription to cDNA amplification was optimized. By removing degrading excess primers, the bias due to the digestion of cDNA was prevented. Since the residual reagents, which affect the efficiency of each subsequent reaction, could be removed by washing beads, the conditions for uniform and maximized amplification of cDNAs were achieved. The differences in the amplification rates for randomly selected eight genes were within 1.5-folds, which could be negligible for most of the applications of single-cell analysis. The global amplification gives a large amount of amplified cDNA (>100 μg) from a single cell (2-pg mRNA), and that amount is enough for downstream analysis. The proposed global-amplification method was used to analyse transcript ratios of multiple cDNA targets (from several copies to several thousand copies) quantitatively. PMID:24141095
Kirouac, Kevin N.; Basu, Ashis K.; Ling, Hong
2013-01-01
Polycyclic aromatic hydrocarbons and their nitro derivatives are culprits of the detrimental health effects of environmental pollution. These hydrophobic compounds metabolize to reactive species and attach to DNA producing bulky lesions, such as N-[deoxyguanosine-8-yl]-1-aminopyrene (APG), in genomic DNA. The bulky adducts block DNA replication by high-fidelity polymerases and compromise replication fidelities and efficiencies by specialized lesion bypass polymerases. Here we present three crystal structures of the DNA polymerase Dpo4, a model translesion DNA polymerase of the Y family, in complex with APG-lesion-containing DNA in pre-insertion and extension stages. APG is captured in two conformations in the pre-insertion complex; one is highly exposed to the solvent, whereas the other is harbored in a shallow cleft between the finger and unique Y family little finger domain. In contrast, APG is in a single conformation at the extension stage, in which the pyrene ring is sandwiched between the little finger domain and a base from the turning back single-stranded template strand. Strikingly, a nucleotide intercalates the DNA helix to form a quaternary complex with Dpo4, DNA, and an incoming nucleotide, which stabilizes the distorted DNA structure at the extension stage. The unique APG DNA conformations in Dpo4 inhibit DNA translocation through the polymerase active site for APG bypass. We also modeled an insertion complex that illustrates a solvent-exposed pyrene ring contributing to an unstable insertion state. The structural work combined with our lesion replication assays provides a novel structural mechanism on bypass of DNA adducts containing polycyclic aromatic hydrocarbon moieties. PMID:23876706
Kirouac, Kevin N; Basu, Ashis K; Ling, Hong
2013-11-15
Polycyclic aromatic hydrocarbons and their nitro derivatives are culprits of the detrimental health effects of environmental pollution. These hydrophobic compounds metabolize to reactive species and attach to DNA producing bulky lesions, such as N-[deoxyguanosine-8-yl]-1-aminopyrene (APG), in genomic DNA. The bulky adducts block DNA replication by high-fidelity polymerases and compromise replication fidelities and efficiencies by specialized lesion bypass polymerases. Here we present three crystal structures of the DNA polymerase Dpo4, a model translesion DNA polymerase of the Y family, in complex with APG-lesion-containing DNA in pre-insertion and extension stages. APG is captured in two conformations in the pre-insertion complex; one is highly exposed to the solvent, whereas the other is harbored in a shallow cleft between the finger and unique Y family little finger domain. In contrast, APG is in a single conformation at the extension stage, in which the pyrene ring is sandwiched between the little finger domain and a base from the turning back single-stranded template strand. Strikingly, a nucleotide intercalates the DNA helix to form a quaternary complex with Dpo4, DNA, and an incoming nucleotide, which stabilizes the distorted DNA structure at the extension stage. The unique APG DNA conformations in Dpo4 inhibit DNA translocation through the polymerase active site for APG bypass. We also modeled an insertion complex that illustrates a solvent-exposed pyrene ring contributing to an unstable insertion state. The structural work combined with our lesion replication assays provides a novel structural mechanism on bypass of DNA adducts containing polycyclic aromatic hydrocarbon moieties. © 2013.
Mortensen, Christian; Karlsen, Stine; Grønbæk, Henning; Nielsen, Dennis T; Frevert, Susanne; Clemmesen, Jens O; Møller, Søren; Jensen, Jørgen S; Bendtsen, Flemming
2013-10-01
Bacterial translocation (BT) with immune activation may lead to hemodynamical alterations and poor outcomes in patients with cirrhosis. We investigated bacterial DNA (bDNA), a marker of BT, and its relation to portal pressure and markers of inflammation in the portal and hepatic veins in patients with cirrhosis undergoing TIPS insertion. We analysed plasma for bDNA and markers of inflammation in 28 patients [median portal pressure gradient 15 (11-19) mmHg] during TIPS treatment for refractory ascites (n = 19) or acute variceal bleeding (n = 9). Advanced cirrhosis was present in the majority [Child-Pugh class (A/B/C): 1/14/13], and most often caused by alcohol (n = 21). bDNA was detectable in one or both samples in 16 of 28 patients (57%). bDNA was present in 39% of the samples from the portal vein vs 43% of the samples in the hepatic vein (P = 0.126). Antibiotics had no effect on bDNA or markers of inflammation. Markers of inflammation did not differ between the hepatic and portal veins with the exceptions of soluble urokinase plasminogen activating receptor (suPAR) and vascular endothelial growth factor (VEGF), both higher in the hepatic vein (P = 0.031 and 0.003 respectively). No transhepatic gradient of bDNA was evident, suggesting that no major hepatic elimination of bDNA occurs in advanced liver disease. bDNA, in contrast to previous reports was largely unrelated to a panel of markers of inflammation and without relation to portal pressure. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Yoon, Jung-Hoon; Roy Choudhury, Jayati; Park, Jeseong; Prakash, Satya; Prakash, Louise
2017-11-10
N3-Methyladenine (3-MeA) is formed in DNA by reaction with S -adenosylmethionine, the reactive methyl donor, and by reaction with alkylating agents. 3-MeA protrudes into the DNA minor groove and strongly blocks synthesis by replicative DNA polymerases (Pols). However, the mechanisms for replicating through this lesion in human cells remain unidentified. Here we analyzed the roles of translesion synthesis (TLS) Pols in the replication of 3-MeA-damaged DNA in human cells. Because 3-MeA has a short half-life in vitro , we used the stable 3-deaza analog, 3-deaza-3-methyladenine (3-dMeA), which blocks the DNA minor groove similarly to 3-MeA. We found that replication through the 3-dMeA adduct is mediated via three different pathways, dependent upon Polι/Polκ, Polθ, and Polζ. As inferred from biochemical studies, in the Polι/Polκ pathway, Polι inserts a nucleotide (nt) opposite 3-dMeA and Polκ extends synthesis from the inserted nt. In the Polθ pathway, Polθ carries out both the insertion and extension steps of TLS opposite 3-dMeA, and in the Polζ pathway, Polζ extends synthesis following nt insertion by an as yet unidentified Pol. Steady-state kinetic analyses indicated that Polι and Polθ insert the correct nt T opposite 3-dMeA with a much reduced catalytic efficiency and that both Pols exhibit a high propensity for inserting a wrong nt opposite this adduct. However, despite their low fidelity of synthesis opposite 3-dMeA, TLS opposite this lesion replicates DNA in a highly error-free manner in human cells. We discuss the implications of these observations for TLS mechanisms in human cells. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Hoople, Gordon D; Richards, Andrew; Wu, Yan; Pisano, Albert P; Zhang, Kun
2018-03-26
The ability to amplify and sequence either DNA or RNA from small starting samples has only been achieved in the last five years. Unfortunately, the standard protocols for generating genomic or transcriptomic libraries are incompatible and researchers must choose whether to sequence DNA or RNA for a particular sample. Gel-seq solves this problem by enabling researchers to simultaneously prepare libraries for both DNA and RNA starting with 100 - 1000 cells using a simple hydrogel device. This paper presents a detailed approach for the fabrication of the device as well as the biological protocol to generate paired libraries. We designed Gel-seq so that it could be easily implemented by other researchers; many genetics labs already have the necessary equipment to reproduce the Gel-seq device fabrication. Our protocol employs commonly-used kits for both whole-transcript amplification (WTA) and library preparation, which are also likely to be familiar to researchers already versed in generating genomic and transcriptomic libraries. Our approach allows researchers to bring to bear the power of both DNA and RNA sequencing on a single sample without splitting and with negligible added cost.
Neri, Dario; Lerner, Richard A
2018-06-20
The discovery of organic ligands that bind specifically to proteins is a central problem in chemistry, biology, and the biomedical sciences. The encoding of individual organic molecules with distinctive DNA tags, serving as amplifiable identification bar codes, allows the construction and screening of combinatorial libraries of unprecedented size, thus facilitating the discovery of ligands to many different protein targets. Fundamentally, one links powers of genetics and chemical synthesis. After the initial description of DNA-encoded chemical libraries in 1992, several experimental embodiments of the technology have been reduced to practice. This review provides a historical account of important milestones in the development of DNA-encoded chemical libraries, a survey of relevant ongoing research activities, and a glimpse into the future.
Guo, Bingfu; Guo, Yong; Hong, Huilong; Qiu, Li-Juan
2016-01-01
Molecular characterization of sequence flanking exogenous fragment insertion is essential for safety assessment and labeling of genetically modified organism (GMO). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans GE-J16 and ZH10-6 based on whole genome sequencing (WGS) method. More than 22.4 Gb sequence data (∼21 × coverage) for each line was generated on Illumina HiSeq 2500 platform. The junction reads mapped to boundaries of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with soybean reference genome and sequence of transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated in positions of Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of genomic insertion sites of G2-EPSPS and GAT transgenes will facilitate the utilization of their glyphosate-tolerant traits in soybean breeding program. These results also demonstrated that WGS was a cost-effective and rapid method for identifying sites of T-DNA insertions and flanking sequences in soybean.
2011-01-01
Background Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their relative EST abundance in roots relative to other tissues. Conclusions The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods highlighting the need for deep-sequencing expression profiling methods. PMID:21592389
Kerschner, Joseph E; Erdos, Geza; Hu, Fen Ze; Burrows, Amy; Cioffi, Joseph; Khampang, Pawjai; Dahlgren, Margaret; Hayes, Jay; Keefe, Randy; Janto, Benjamin; Post, J Christopher; Ehrlich, Garth D
2010-04-01
We sought to construct and partially characterize complementary DNA (cDNA) libraries prepared from the middle ear mucosa (MEM) of chinchillas to better understand pathogenic aspects of infection and inflammation, particularly with respect to leukotriene biogenesis and response. Chinchilla MEM was harvested from controls and after middle ear inoculation with nontypeable Haemophilus influenzae. RNA was extracted to generate cDNA libraries. Randomly selected clones were subjected to sequence analysis to characterize the libraries and to provide DNA sequence for phylogenetic analyses. Reverse transcription-polymerase chain reaction of the RNA pools was used to generate cDNA sequences corresponding to genes associated with leukotriene biosynthesis and metabolism. Sequence analysis of 921 randomly selected clones from the uninfected MEM cDNA library produced approximately 250,000 nucleotides of almost entirely novel sequence data. Searches of the GenBank database with the Basic Local Alignment Search Tool provided for identification of 515 unique genes expressed in the MEM and not previously described in chinchillas. In almost all cases, the chinchilla cDNA sequences displayed much greater homology to human or other primate genes than with rodent species. Genes associated with leukotriene metabolism were present in both normal and infected MEM. Based on both phylogenetic comparisons and gene expression similarities with humans, chinchilla MEM appears to be an excellent model for the study of middle ear inflammation and infection. The higher degree of sequence similarity between chinchillas and humans compared to chinchillas and rodents was unexpected. The cDNA libraries from normal and infected chinchilla MEM will serve as useful molecular tools in the study of otitis media and should yield important information with respect to middle ear pathogenesis.
Kerschner, Joseph E.; Erdos, Geza; Hu, Fen Ze; Burrows, Amy; Cioffi, Joseph; Khampang, Pawjai; Dahlgren, Margaret; Hayes, Jay; Keefe, Randy; Janto, Benjamin; Post, J. Christopher; Ehrlich, Garth D.
2010-01-01
Objectives We sought to construct and partially characterize complementary DNA (cDNA) libraries prepared from the middle ear mucosa (MEM) of chinchillas to better understand pathogenic aspects of infection and inflammation, particularly with respect to leukotriene biogenesis and response. Methods Chinchilla MEM was harvested from controls and after middle ear inoculation with nontypeable Haemophilus influenzae. RNA was extracted to generate cDNA libraries. Randomly selected clones were subjected to sequence analysis to characterize the libraries and to provide DNA sequence for phylogenetic analyses. Reverse transcription–polymerase chain reaction of the RNA pools was used to generate cDNA sequences corresponding to genes associated with leukotriene biosynthesis and metabolism. Results Sequence analysis of 921 randomly selected clones from the uninfected MEM cDNA library produced approximately 250,000 nucleotides of almost entirely novel sequence data. Searches of the GenBank database with the Basic Local Alignment Search Tool provided for identification of 515 unique genes expressed in the MEM and not previously described in chinchillas. In almost all cases, the chinchilla cDNA sequences displayed much greater homology to human or other primate genes than with rodent species. Genes associated with leukotriene metabolism were present in both normal and infected MEM. Conclusions Based on both phylogenetic comparisons and gene expression similarities with humans, chinchilla MEM appears to be an excellent model for the study of middle ear inflammation and infection. The higher degree of sequence similarity between chinchillas and humans compared to chinchillas and rodents was unexpected. The cDNA libraries from normal and infected chinchilla MEM will serve as useful molecular tools in the study of otitis media and should yield important information with respect to middle ear pathogenesis. PMID:20433028
Enyeart, Peter J; Mohr, Georg; Ellington, Andrew D; Lambowitz, Alan M
2014-01-13
Mobile group II introns are bacterial retrotransposons that combine the activities of an autocatalytic intron RNA (a ribozyme) and an intron-encoded reverse transcriptase to insert site-specifically into DNA. They recognize DNA target sites largely by base pairing of sequences within the intron RNA and achieve high DNA target specificity by using the ribozyme active site to couple correct base pairing to RNA-catalyzed intron integration. Algorithms have been developed to program the DNA target site specificity of several mobile group II introns, allowing them to be made into 'targetrons.' Targetrons function for gene targeting in a wide variety of bacteria and typically integrate at efficiencies high enough to be screened easily by colony PCR, without the need for selectable markers. Targetrons have found wide application in microbiological research, enabling gene targeting and genetic engineering of bacteria that had been intractable to other methods. Recently, a thermostable targetron has been developed for use in bacterial thermophiles, and new methods have been developed for using targetrons to position recombinase recognition sites, enabling large-scale genome-editing operations, such as deletions, inversions, insertions, and 'cut-and-pastes' (that is, translocation of large DNA segments), in a wide range of bacteria at high efficiency. Using targetrons in eukaryotes presents challenges due to the difficulties of nuclear localization and sub-optimal magnesium concentrations, although supplementation with magnesium can increase integration efficiency, and directed evolution is being employed to overcome these barriers. Finally, spurred by new methods for expressing group II intron reverse transcriptases that yield large amounts of highly active protein, thermostable group II intron reverse transcriptases from bacterial thermophiles are being used as research tools for a variety of applications, including qRT-PCR and next-generation RNA sequencing (RNA-seq). The high processivity and fidelity of group II intron reverse transcriptases along with their novel template-switching activity, which can directly link RNA-seq adaptor sequences to cDNAs during reverse transcription, open new approaches for RNA-seq and the identification and profiling of non-coding RNAs, with potentially wide applications in research and biotechnology.
Characterization of a highly polymorphic region 5′ to JH in the human immunoglobulin heavy chain
Silva, Alcino J.; Johnson, John P.; White, Raymond L.
1987-01-01
A cloned DNA segment 1.25 kilobases (kb) upstream from the joining segments of the human heavy chain immunoglobulin gene revealed extensive polymorphic variation at this locus, and the polymorphic pattern was stably transmitted to the next generation. Genomic restriction analysis showed that the polymorphism was caused by insertions/deletions within an MspI/BamHI fragment. Sequencing of one allele, 848 base pairs (bp) long, revealed eleven 50-base-pair tandem repeats. A second allele, 648 bp long, was cloned from a human genomic cosmid library, sequenced, and found to contain four fewer repeats than the first allele. A survey of 186 chromosomes from unrelated individuals of primarily northern European descent revealed at least six alleles. Images PMID:2884636
FragIdent--automatic identification and characterisation of cDNA-fragments.
Seelow, Dominik; Goehler, Heike; Hoffmann, Katrin
2009-03-02
Many genetic studies and functional assays are based on cDNA fragments. After the generation of cDNA fragments from an mRNA sample, their content is at first unknown and must be assigned by sequencing reactions or hybridisation experiments. Even in characterised libraries, a considerable number of clones are wrongly annotated. Furthermore, mix-ups can happen in the laboratory. It is therefore essential to the relevance of experimental results to confirm or determine the identity of the employed cDNA fragments. However, the manual approach for the characterisation of these fragments using BLAST web interfaces is not suited for larger number of sequences and so far, no user-friendly software is publicly available. Here we present the development of FragIdent, an application for the automatic identification of open reading frames (ORFs) within cDNA-fragments. The software performs BLAST analyses to identify the genes represented by the sequences and suggests primers to complete the sequencing of the whole insert. Gene-specific information as well as the protein domains encoded by the cDNA fragment are retrieved from Internet-based databases and included in the output. The application features an intuitive graphical interface and is designed for researchers without any bioinformatics skills. It is suited for projects comprising up to several hundred different clones. We used FragIdent to identify 84 cDNA clones from a yeast two-hybrid experiment. Furthermore, we identified 131 protein domains within our analysed clones. The source code is freely available from our homepage at http://compbio.charite.de/genetik/FragIdent/.
Zirconium(IV)-Catalyzed Ring Opening of on-DNA Epoxides in Water.
Fan, Lijun; Davie, Christopher P
2017-05-04
DNA-encoded library technology (ELT) has spurred wide interest in the pharmaceutical industry as a powerful tool for hit and lead generation. In recent years a number of "DNA-compatible" chemical modifications have been published and used to synthesize vastly diverse screening libraries. Herein we report a newly developed, zirconium tetrakis(dodecyl sulfate) [Zr(DS) 4 ] catalyzed ring-opening of on-DNA epoxides in water with amines, including anilines. Subsequent cyclization of the resulting on-DNA β-amino alcohols leads to a variety of biologically interesting, nonaromatic heterocycles. Under these conditions, a library of 137 million on-DNA β-amino alcohols and their cyclization products was assembled. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Lange, T; Hedden, P; Graebe, J E
1994-01-01
In the biosynthetic pathway to the gibberellins (GAs), carbon-20 is removed by oxidation to give the C19-GAs, which include the biologically active plant hormones. We report the isolation of a cDNA clone encoding a GA 20-oxidase [gibberellin, 2-oxoglutarate:oxygen oxidoreductase (20-hydroxylating, oxidizing) EC 1.14.11.-] by screening a cDNA library from developing cotyledons of pumpkin (Cucurbita maxima L.) for expression of this enzyme. When mRNA from either the cotyledons or the endosperm was translated in vitro using rabbit reticulocyte lysates, the products contained GA12 20-oxidase activity. A polyclonal antiserum was raised against the amino acid sequence of a peptide released by tryptic digestion of purified GA 20-oxidase from the endosperm. A cDNA expression library in lambda gt11 was prepared from cotyledon mRNA and screened with the antiserum. The identity of positive clones was confirmed by the demonstration of GA12 20-oxidase activity in single bacteriophage plaques. Recombinant protein from a selected clone catalyzed the three-step conversions of GA12 to GA25 and of GA53 to GA17, as well as the formation of the C19-GAs, GA1, GA9, and GA20, from their respective aldehyde precursors, GA23, GA24, and GA19. The nucleotide sequence of the cDNA insert contains an open reading frame of 1158 nt encoding a protein of 386 amino acid residues. The predicted M(r) (43,321) and pI (5.3) are similar to those determined experimentally for the native GA 20-oxidase. Furthermore, the derived amino acid sequence includes sequences obtained from the N terminus and two tryptic peptides from the native enzyme. It also contains regions that are highly conserved in a group of non-heme Fe-containing dioxygenases. Images PMID:8078921
Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics
Rinke, Christian; Low, Serene; Woodcroft, Ben J.; ...
2016-09-22
High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. For this study, we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diversemore » Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (~100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics.« less
Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rinke, Christian; Low, Serene; Woodcroft, Ben J.
High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. For this study, we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diversemore » Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (~100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics.« less
Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics
Low, Serene; Raina, Jean-Baptiste; Skarshewski, Adam; Le, Xuyen H.; Butler, Margaret K.; Stocker, Roman; Seymour, Justin; Tyson, Gene W.
2016-01-01
High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. Here we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diverse Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (∼100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics. PMID:27688978
2014-01-01
Background It is well known that different Eimeria maxima strains exhibit significant antigenic variation. However, the genetic basis of these phenotypes remains unclear. Methods Total RNA and mRNA were isolated from unsporulated oocysts of E. maxima strains SH and NT, which were found to have significant differences in immunogenicity in our previous research. Two subtractive cDNA libraries were constructed using suppression subtractive hybridization (SSH) and specific genes were further analyzed by dot-blot hybridization and qRT-PCR analysis. Results A total of 561 clones were selected from both cDNA libraries and the length of the inserted fragments was 0.25–1.0 kb. Dot-blot hybridization revealed a total of 86 differentially expressed clones (63 from strain SH and 23 from strain NT). Nucleotide sequencing analysis of these clones revealed ten specific contigs (six from strain SH and four from strain NT). Further analysis found that six contigs from strain SH and three from strain NT shared significant identities with previously reported proteins, and one contig was presumed to be novel. The specific differentially expressed genes were finally verified by RT-PCR and qRT-PCR analyses. Conclusions The data presented here suggest that specific genes identified between the two strains may be important molecules in the immunogenicity of E. maxima that may present potential new drug targets or vaccine candidates for coccidiosis. PMID:24894832
Library Resources for Bac End Sequencing. Final Technical Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pieter J. de Jong
2000-10-01
Studies directed towards the specific aims outlined for this research award are summarized. The RPCI II Human Bac Library has been expanded by the addition of 6.9-fold genomic coverage. This segment has been generated from a MBOI partial digest of the same anonymous donor DNA used for the rest of the library. A new cloning vector, pTARBAC1, has been constructed and used in the construction of RPCI-II segment 5. This new cloning vector provides a new strategy in identifying targeted genomic regions and will greatly facilitate a large-scale analysis for positional cloning. A new maleCS7BC/6J mouse BAC library has beenmore » constructed. RPCI-23 contain 576 plates (approx 210,000 clones) and represents approximately 11-fold coverage of the mouse genome.« less
Megabase sequencing of human genome by ordered-shotgun-sequencing (OSS) strategy
NASA Astrophysics Data System (ADS)
Chen, Ellson Y.
1997-05-01
So far we have used OSS strategy to sequence over 2 megabases DNA in large-insert clones from regions of human X chromosomes with different characteristic levels of GC content. The method starts by randomly fragmenting a BAC, YAC or PAC to 8-12 kb pieces and subcloning those into lambda phage. Insert-ends of these clones are sequenced and overlapped to create a partial map. Complete sequencing is then done on a minimal tiling path of selected subclones, recursively focusing on those at the edges of contigs to facilitate mergers of clones across the entire target. To reduce manual labor, PCR processes have been adapted to prepare sequencing templates throughout the entire operation. The streamlined process can thus lend itself to further automation. The OSS approach is suitable for large- scale genomic sequencing, providing considerable flexibility in the choice of subclones or regions for more or less intensive sequencing. For example, subclones containing contaminating host cell DNA or cloning vector can be recognized and ignored with minimal sequencing effort; regions overlapping a neighboring clone already sequenced need not be redone; and segments containing tandem repeats or long repetitive sequences can be spotted early on and targeted for additional attention.
Multi-Threaded DNA Tag/Anti-Tag Library Generator for Multi-Core Platforms
2009-05-01
base pair) Watson ‐ Crick strand pairs that bind perfectly within pairs, but poorly across pairs. A variety of DNA strand hybridization metrics...AFRL-RI-RS-TR-2009-131 Final Technical Report May 2009 MULTI-THREADED DNA TAG/ANTI-TAG LIBRARY GENERATOR FOR MULTI-CORE PLATFORMS...TYPE Final 3. DATES COVERED (From - To) Jun 08 – Feb 09 4. TITLE AND SUBTITLE MULTI-THREADED DNA TAG/ANTI-TAG LIBRARY GENERATOR FOR MULTI-CORE
A database of annotated tentative orthologs from crop abiotic stress transcripts.
Balaji, Jayashree; Crouch, Jonathan H; Petite, Prasad V N S; Hoisington, David A
2006-10-07
A minimal requirement to initiate a comparative genomics study on plant responses to abiotic stresses is a dataset of orthologous sequences. The availability of a large amount of sequence information, including those derived from stress cDNA libraries allow for the identification of stress related genes and orthologs associated with the stress response. Orthologous sequences serve as tools to explore genes and their relationships across species. For this purpose, ESTs from stress cDNA libraries across 16 crop species including 6 important cereal crops and 10 dicots were systematically collated and subjected to bioinformatics analysis such as clustering, grouping of tentative orthologous sets, identification of protein motifs/patterns in the predicted protein sequence, and annotation with stress conditions, tissue/library source and putative function. All data are available to the scientific community at http://intranet.icrisat.org/gt1/tog/homepage.htm. We believe that the availability of annotated plant abiotic stress ortholog sets will be a valuable resource for researchers studying the biology of environmental stresses in plant systems, molecular evolution and genomics.
Zhang, Wenli; Fu, Jun; Liu, Jing; Wang, Hailong; Schiwon, Maren; Janz, Sebastian; Schaffarczyk, Lukas; von der Goltz, Lukas; Ehrke-Schulz, Eric; Dörner, Johannes; Solanki, Manish; Boehme, Philip; Bergmann, Thorsten; Lieber, Andre; Lauber, Chris; Dahl, Andreas; Petzold, Andreas; Zhang, Youming; Stewart, A Francis; Ehrhardt, Anja
2017-05-23
Adenoviruses (Ads) are large human-pathogenic double-stranded DNA (dsDNA) viruses presenting an enormous natural diversity associated with a broad variety of diseases. However, only a small fraction of adenoviruses has been explored in basic virology and biomedical research, highlighting the need to develop robust and adaptable methodologies and resources. We developed a method for high-throughput direct cloning and engineering of adenoviral genomes from different sources utilizing advanced linear-linear homologous recombination (LLHR) and linear-circular homologous recombination (LCHR). We describe 34 cloned adenoviral genomes originating from clinical samples, which were characterized by next-generation sequencing (NGS). We anticipate that this recombineering strategy and the engineered adenovirus library will provide an approach to study basic and clinical virology. High-throughput screening (HTS) of the reporter-tagged Ad library in a panel of cell lines including osteosarcoma disease-specific cell lines revealed alternative virus types with enhanced transduction and oncolysis efficiencies. This highlights the usefulness of this resource. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
A Versatile Microfluidic Device for Automating Synthetic Biology.
Shih, Steve C C; Goyal, Garima; Kim, Peter W; Koutsoubelis, Nicolas; Keasling, Jay D; Adams, Paul D; Hillson, Nathan J; Singh, Anup K
2015-10-16
New microbes are being engineered that contain the genetic circuitry, metabolic pathways, and other cellular functions required for a wide range of applications such as producing biofuels, biobased chemicals, and pharmaceuticals. Although currently available tools are useful in improving the synthetic biology process, further improvements in physical automation would help to lower the barrier of entry into this field. We present an innovative microfluidic platform for assembling DNA fragments with 10× lower volumes (compared to that of current microfluidic platforms) and with integrated region-specific temperature control and on-chip transformation. Integration of these steps minimizes the loss of reagents and products compared to that with conventional methods, which require multiple pipetting steps. For assembling DNA fragments, we implemented three commonly used DNA assembly protocols on our microfluidic device: Golden Gate assembly, Gibson assembly, and yeast assembly (i.e., TAR cloning, DNA Assembler). We demonstrate the utility of these methods by assembling two combinatorial libraries of 16 plasmids each. Each DNA plasmid is transformed into Escherichia coli or Saccharomyces cerevisiae using on-chip electroporation and further sequenced to verify the assembly. We anticipate that this platform will enable new research that can integrate this automated microfluidic platform to generate large combinatorial libraries of plasmids and will help to expedite the overall synthetic biology process.
DNA Extraction Protocols for Whole-Genome Sequencing in Marine Organisms.
Panova, Marina; Aronsson, Henrik; Cameron, R Andrew; Dahl, Peter; Godhe, Anna; Lind, Ulrika; Ortega-Martinez, Olga; Pereyra, Ricardo; Tesson, Sylvie V M; Wrange, Anna-Lisa; Blomberg, Anders; Johannesson, Kerstin
2016-01-01
The marine environment harbors a large proportion of the total biodiversity on this planet, including the majority of the earths' different phyla and classes. Studying the genomes of marine organisms can bring interesting insights into genome evolution. Today, almost all marine organismal groups are understudied with respect to their genomes. One potential reason is that extraction of high-quality DNA in sufficient amounts is challenging for many marine species. This is due to high polysaccharide content, polyphenols and other secondary metabolites that will inhibit downstream DNA library preparations. Consequently, protocols developed for vertebrates and plants do not always perform well for invertebrates and algae. In addition, many marine species have large population sizes and, as a consequence, highly variable genomes. Thus, to facilitate the sequence read assembly process during genome sequencing, it is desirable to obtain enough DNA from a single individual, which is a challenge in many species of invertebrates and algae. Here, we present DNA extraction protocols for seven marine species (four invertebrates, two algae, and a marine yeast), optimized to provide sufficient DNA quality and yield for de novo genome sequencing projects.
Genetic Dissection of Tropodithietic Acid Biosynthesis by Marine Roseobacters▿ ‡
Geng, Haifeng; Bruhn, Jesper Bartholin; Nielsen, Kristian F.; Gram, Lone; Belas, Robert
2008-01-01
The symbiotic association between the roseobacter Silicibacter sp. strain TM1040 and the dinoflagellate Pfiesteria piscicida involves bacterial chemotaxis to dinoflagellate-produced dimethylsulfoniopropionate (DMSP), DMSP demethylation, and ultimately a biofilm on the surface of the host. Biofilm formation is coincident with the production of an antibiotic and a yellow-brown pigment. In this report, we demonstrate that the antibiotic is a sulfur-containing compound, tropodithietic acid (TDA). Using random transposon insertion mutagenesis, 12 genes were identified as critical for TDA biosynthesis by the bacteria, and mutation in any one of these results in a loss of antibiotic activity (Tda−) and pigment production. Unexpectedly, six of the genes, referred to as tdaA-F, could not be found on the annotated TM1040 genome and were instead located on a previously unidentified plasmid (ca. 130 kb; pSTM3) that exhibited a low frequency of spontaneous loss. Homologs of tdaA and tdaB from Silicibacter sp. strain TM1040 were identified by mutagenesis in another TDA-producing roseobacter, Phaeobacter sp. strain 27-4, which also possesses two large plasmids (ca. 60 and ca. 70 kb, respectively), and tda genes were found by DNA-DNA hybridization in 88% of a diverse collection of nine roseobacters with known antibiotic activity. These data suggest that roseobacters may use a common pathway for TDA biosynthesis that involves plasmid-encoded proteins. Using metagenomic library databases and a bioinformatics approach, differences in the biogeographical distribution between the critical TDA synthesis genes were observed. The implications of these results to roseobacter survival and the interaction between TM1040 and its dinoflagellate host are discussed. PMID:18192410
A large-scale full-length cDNA analysis to explore the budding yeast transcriptome
Miura, Fumihito; Kawaguchi, Noriko; Sese, Jun; Toyoda, Atsushi; Hattori, Masahira; Morishita, Shinichi; Ito, Takashi
2006-01-01
We performed a large-scale cDNA analysis to explore the transcriptome of the budding yeast Saccharomyces cerevisiae. We sequenced two cDNA libraries, one from the cells exponentially growing in a minimal medium and the other from meiotic cells. Both libraries were generated by using a vector-capping method that allows the accurate mapping of transcription start sites (TSSs). Consequently, we identified 11,575 TSSs associated with 3,638 annotated genomic features, including 3,599 ORFs, to suggest that most yeast genes have two or more TSSs. In addition, we identified 45 previously undescribed introns, including those affecting current ORF annotations and those spliced alternatively. Furthermore, the analysis revealed 667 transcription units in the intergenic regions and transcripts derived from antisense strands of 367 known features. We also found that 348 ORFs carry TSSs in their 3′-halves to generate sense transcripts starting from inside the ORFs. These results indicate that the budding yeast transcriptome is considerably more complex than previously thought, and it shares many recently revealed characteristics with the transcriptomes of mammals and other higher eukaryotes. Thus, the genome-wide active transcription that generates novel classes of transcripts appears to be an intrinsic feature of the eukaryotic cells. The budding yeast will serve as a versatile model for the studies on these aspects of transcriptome, and the full-length cDNA clones can function as an invaluable resource in such studies. PMID:17101987
Christensen, Shawn M; Ye, Junqiang; Eickbush, Thomas H
2006-11-21
Non-LTR retrotransposons insert into eukaryotic genomes by target-primed reverse transcription (TPRT), a process in which cleaved DNA targets are used to prime reverse transcription of the element's RNA transcript. Many of the steps in the integration pathway of these elements can be characterized in vitro for the R2 element because of the rigid sequence specificity of R2 for both its DNA target and its RNA template. R2 retrotransposition involves identical subunits of the R2 protein bound to different DNA sequences upstream and downstream of the insertion site. The key determinant regulating which DNA-binding conformation the protein adopts was found to be a 320-nt RNA sequence from near the 5' end of the R2 element. In the absence of this 5' RNA the R2 protein binds DNA sequences upstream of the insertion site, cleaves the first DNA strand, and conducts TPRT when RNA containing the 3' untranslated region of the R2 transcript is present. In the presence of the 320-nt 5' RNA, the R2 protein binds DNA sequences downstream of the insertion site. Cleavage of the second DNA strand by the downstream subunit does not appear to occur until after the 5' RNA is removed from this subunit. We postulate that the removal of the 5' RNA normally occurs during reverse transcription, and thus provides a critical temporal link to first- and second-strand DNA cleavage in the R2 retrotransposition reaction.
Purification of nanogram-range immunoprecipitated DNA in ChIP-seq application.
Zhong, Jian; Ye, Zhenqing; Lenz, Samuel W; Clark, Chad R; Bharucha, Adil; Farrugia, Gianrico; Robertson, Keith D; Zhang, Zhiguo; Ordog, Tamas; Lee, Jeong-Heon
2017-12-21
Chromatin immunoprecipitation-sequencing (ChIP-seq) is a widely used epigenetic approach for investigating genome-wide protein-DNA interactions in cells and tissues. The approach has been relatively well established but several key steps still require further improvement. As a part of the procedure, immnoprecipitated DNA must undergo purification and library preparation for subsequent high-throughput sequencing. Current ChIP protocols typically yield nanogram quantities of immunoprecipitated DNA mainly depending on the target of interest and starting chromatin input amount. However, little information exists on the performance of reagents used for the purification of such minute amounts of immunoprecipitated DNA in ChIP elution buffer and their effects on ChIP-seq data. Here, we compared DNA recovery, library preparation efficiency, and ChIP-seq results obtained with several commercial DNA purification reagents applied to 1 ng ChIP DNA and also investigated the impact of conditions under which ChIP DNA is stored. We compared DNA recovery of ten commercial DNA purification reagents and phenol/chloroform extraction from 1 to 50 ng of immunopreciptated DNA in ChIP elution buffer. The recovery yield was significantly different with 1 ng of DNA while similar in higher DNA amounts. We also observed that the low nanogram range of purified DNA is prone to loss during storage depending on the type of polypropylene tube used. The immunoprecipitated DNA equivalent to 1 ng of purified DNA was subject to DNA purification and library preparation to evaluate the performance of four better performing purification reagents in ChIP-seq applications. Quantification of library DNAs indicated the selected purification kits have a negligible impact on the efficiency of library preparation. The resulting ChIP-seq data were comparable with the dataset generated by ENCODE consortium and were highly correlated between the data from different purification reagents. This study provides comparative data on commercial DNA purification reagents applied to nanogram-range immunopreciptated ChIP DNA and evidence for the importance of storage conditions of low nanogram-range purified DNA. We verified consistent high performance of a subset of the tested reagents. These results will facilitate the improvement of ChIP-seq methodology for low-input applications.
Sequence Polishing Library (SPL) v10.0
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oberortner, Ernst
The Sequence Polishing Library (SPL) is a suite of software tools in order to automate "Design for Synthesis and Assembly" workflows. Specifically: The SPL "Converter" tool converts files among the following sequence data exchange formats: CSV, FASTA, GenBank, and Synthetic Biology Open Language (SBOL); The SPL "Juggler" tool optimizes the codon usages of DNA coding sequences according to an optimization strategy, a user-specific codon usage table and genetic code. In addition, the SPL "Juggler" can translate amino acid sequences into DNA sequences.:The SPL "Polisher" verifies NA sequences against DNA synthesis constraints, such as GC content, repeating k-mers, and restriction sites.more » In case of violations, the "Polisher" reports the violations in a comprehensive manner. The "Polisher" tool can also modify the violating regions according to an optimization strategy, a user-specific codon usage table and genetic code;The SPL "Partitioner" decomposes large DNA sequences into smaller building blocks with partial overlaps that enable an efficient assembly. The "Partitioner" enables the user to configure the characteristics of the overlaps, which are mostly determined by the utilized assembly protocol, such as length, GC content, or melting temperature.« less
Design and screening of M13 phage display cDNA libraries.
Georgieva, Yuliya; Konthur, Zoltán
2011-02-17
The last decade has seen a steady increase in screening of cDNA expression product libraries displayed on the surface of filamentous bacteriophage. At the same time, the range of applications extended from the identification of novel allergens over disease markers to protein-protein interaction studies. However, the generation and selection of cDNA phage display libraries is subjected to intrinsic biological limitations due to their complex nature and heterogeneity, as well as technical difficulties regarding protein presentation on the phage surface. Here, we review the latest developments in this field, discuss a number of strategies and improvements anticipated to overcome these challenges making cDNA and open reading frame (ORF) libraries more readily accessible for phage display. Furthermore, future trends combining phage display with next generation sequencing (NGS) will be presented.
Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda
2012-01-01
Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of -45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a -10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated.
Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E.; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda
2012-01-01
Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated. PMID:23071448
Barnard, G F; Staniunas, R J; Mori, M; Puder, M; Jessup, M J; Steele, G D; Chen, L B
1993-09-01
The levels of a number of ribosomal protein mRNAs are reported to be increased in human colon cancer. We have assessed whether selected ribosomal protein mRNAs are overexpressed in other gastrointestinal malignancies, namely gastric and hepatocellular carcinomas. Subtracted complementary DNA libraries were generated from paired samples of human (a) colorectal carcinoma minus adjacent normal colonic mucosa and (b) hepatocellular carcinoma minus adjacent normal liver. Screening of approximately 3% of these library clones determined that ribosomal protein mRNAs encoding L18 and L37 (not previously reported) and P0 and S6 were overexpressed in one or the other library. Their complementary DNA inserts were then used as probes to evaluate their expression in a larger number of paired tumor/normal surgical samples of human colonic, gastric, and hepatocellular carcinomas, by Northern hybridization. The mRNA signal was greater in the colonic carcinoma than in paired adjacent normal colonic mucosa in 38 of 42 cases for P0 [tumor/normal (T/N) ratio = 3.0 +/- 0.3, mean +/- SE, P < 0.001] (G. F. Barnard, R. J. Staniunas, S. Bao, K. Mafune, J. L. Gollan, G. D. Steele, Jr., and L. B. Chen, Cancer Res., 52: 3067-3072, 1992), in 25 of 28 cases for L18 (T/N ratio = 3.7 +/- 0.5, P < 0.001), in 27 of 28 cases for L37 (T/N ratio = 5.3 +/- 0.4, P < 0.001), and in 24 of 28 cases for S6 (T/N ratio = 3.1 +/- 0.5, P < 0.01). The level of mRNA overexpression of L18 and S6 did not correlate with the Dukes' stage of disease. In hepatocellular carcinoma samples, using the same four ribosomal protein complementary DNA probes, only P0 mRNA was significantly increased (T/N ratio = 2.8 +/- 0.4, n = 6, P = 0.047). In gastric carcinoma samples, none of these mRNAs was increased (mean T/N ratios = 0.9-1.2, n = 6). Therefore, gastric and hepatocellular carcinomas do not overexpress the same ribosomal protein mRNAs as do colonic carcinoma.
McVey, Mitch
2010-01-01
DNA double-strand breaks are repaired by multiple mechanisms that are roughly grouped into the categories of homology-directed repair and non-homologous end joining. End-joining repair can be further classified as either classical non-homologous end joining, which requires DNA ligase 4, or “alternative” end joining, which does not. Alternative end joining has been associated with genomic deletions and translocations, but its molecular mechanism(s) are largely uncharacterized. Here, we report that Drosophila melanogaster DNA polymerase theta (pol theta), encoded by the mus308 gene and previously implicated in DNA interstrand crosslink repair, plays a crucial role in DNA ligase 4-independent alternative end joining. In the absence of pol theta, end joining is impaired and residual repair often creates large deletions flanking the break site. Analysis of break repair junctions from flies with mus308 separation-of-function alleles suggests that pol theta promotes the use of long microhomologies during alternative end joining and increases the likelihood of complex insertion events. Our results establish pol theta as a key protein in alternative end joining in Drosophila and suggest a potential mechanistic link between alternative end joining and interstrand crosslink repair. PMID:20617203
Brightwell, Gale; Boerema, Jackie; Mills, John; Mowat, Eilidh; Pulford, David
2006-05-25
We examined the bacterial community present on an Intralox conveyor belt system in an operating lamb boning room by sequencing the 16S ribosomal DNA (rDNA) of bacteria extracted in the presence or absence of cultivation. RFLP patterns for 16S rDNA clone library and cultures were generated using HaeIII and MspI restriction endonucleases. 16S rDNA amplicons produced 8 distinct RFLP pattern groups. RFLP groups I-IV were represented in the clone library and RFLP groups I and V-VIII were represented amongst the cultured isolates. Partial DNA sequences from each RFLP group revealed that all group I, II and VIII representatives were Pseudomonas spp., group III were Sphingomonas spp., group IV clones were most similar to an uncultured alpha proteobacterium, group V was similar to a Serratia spp., group VI with an Alcaligenes spp., and group VII with Microbacterium spp. Sphingomonads were numerically dominant in the culture-independent clone library and along with the group IV alpha proteobacterium were not represented amongst the cultured isolates. Serratia, Alcaligenes and Microbacterium spp. were only represented with cultured isolates. Pseudomonads were detected by both culture-dependent (84% of isolates) and culture-independent (12.5% of clones) methods and their presence at high frequency does pose the risk of product spoilage if transferred onto meat stored under aerobic conditions. The detection of sphingomonads in large numbers by the culture-independent method demands further analysis because sphingomonads may represent a new source of meat spoilage that has not been previously recognised in the meat processing environment. The 16S rDNA collections generated by both methods were important at representing the diversity of the bacterial population associated with an Intralox conveyor belt system.
Kitchen, J L; Li, Z; Crooke, E
1999-05-11
The initiation of Escherichia coli chromosomal replication by DnaA protein is strongly influenced by the tight binding of the nucleotides ATP and ADP. Anionic phospholipids in a fluid bilayer promote the conversion of inactive ADP-DnaA protein to replicatively active ATP-DnaA protein in vitro, and thus likely play a key role in regulating DnaA activity. Previous studies have revealed that, during this reactivation, a specific region of DnaA protein inserts into the hydrophobic portion of the lipid bilayer in an acidic phospholipid-dependent manner. To elucidate the requirement for acidic phospholipids in the reactivation process, the contribution of electrostatic forces in the interaction of DnaA and lipid was examined. DnaA-lipid binding required anionic phospholipids, and DnaA-lipid binding as well as lipid-mediated release of DnaA-bound nucleotide were inhibited by increased ionic strength, suggesting the involvement of electrostatic interactions in these processes. As the vesicular content of acidic phospholipids was increased, both nucleotide release and DnaA-lipid binding increased in a linear, parallel manner. Given that DnaA-membrane binding, the insertion of DnaA into the membrane, and the consequent nucleotide release all require anionic phospholipids, the acidic headgroup may be necessary to recruit DnaA protein to the membrane for insertion and subsequent reactivation for replication.
Using Cellular Proteins to Reveal Mechanisms of HIV Infection | Center for Cancer Research
A vital step in HIV infection is the insertion of viral DNA into the genome of the host cell. In order for the insertion to occur, viral nucleic acid must be transported through the membrane that separates the main cellular compartment (the cytoplasm) from the nucleus, where the host DNA is located. Scientists are actively studying the mechanism used to transport viral DNA
Popp, Nicole; Schlömann, Michael; Mau, Margit
2006-11-01
Soils contaminated with mineral oil hydrocarbons are often cleaned in off-site bioremediation systems. In order to find out which bacteria are active during the degradation phase in such systems, the diversity of the active microflora in a degrading soil remediation system was investigated by small-subunit (SSU) rRNA analysis. Two sequential RNA extracts from one soil sample were generated by a procedure incorporating bead beating. Both extracts were analysed separately by generating individual SSU rDNA clone libraries from cDNA of the two extracts. The sequencing results showed moderate diversity. The two clone libraries were dominated by Gammaproteobacteria, especially Pseudomonas spp. Alphaproteobacteria and Betaproteobacteria were two other large groups in the clone libraries. Actinobacteria, Firmicutes, Bacteroidetes and Epsilonproteobacteria were detected in lower numbers. The obtained sequences were predominantly related to genera for which cultivated representatives have been described, but were often clustered together in the phylogenetic tree, and the sequences that were most similar were originally obtained from soils and not from pure cultures. Most of the dominant genera in the clone libraries, e.g. Pseudomonas, Acinetobacter, Sphingomonas, Acidovorax and Thiobacillus, had already been detected in (mineral oil hydrocarbon) contaminated environmental samples. The occurrence of the genera Zymomonas and Rhodoferax was novel in mineral oil hydrocarbon-contaminated soil.
Kleinboelting, Nils; Huep, Gunnar; Weisshaar, Bernd
2017-01-01
SimpleSearch provides access to a database containing information about T-DNA insertion lines of the GABI-Kat collection of Arabidopsis thaliana mutants. These mutants are an important tool for reverse genetics, and GABI-Kat is the second largest collection of such T-DNA insertion mutants. Insertion sites were deduced from flanking sequence tags (FSTs), and the database contains information about mutant plant lines as well as insertion alleles. Here, we describe improvements within the interface (available at http://www.gabi-kat.de/db/genehits.php) and with regard to the database content that have been realized in the last five years. These improvements include the integration of the Araport11 genome sequence annotation data containing the recently updated A. thaliana structural gene descriptions, an updated visualization component that displays groups of insertions with very similar insertion positions, mapped confirmation sequences, and primers. The visualization component provides a quick way to identify insertions of interest, and access to improved data about the exact structure of confirmed insertion alleles. In addition, the database content has been extended by incorporating additional insertion alleles that were detected during the confirmation process, as well as by adding new FSTs that have been produced during continued efforts to complement gaps in FST availability. Finally, the current database content regarding predicted and confirmed insertion alleles as well as primer sequences has been made available as downloadable flat files. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.
DNA polymerase preference determines PCR priming efficiency.
Pan, Wenjing; Byrne-Steele, Miranda; Wang, Chunlin; Lu, Stanley; Clemmons, Scott; Zahorchak, Robert J; Han, Jian
2014-01-30
Polymerase chain reaction (PCR) is one of the most important developments in modern biotechnology. However, PCR is known to introduce biases, especially during multiplex reactions. Recent studies have implicated the DNA polymerase as the primary source of bias, particularly initiation of polymerization on the template strand. In our study, amplification from a synthetic library containing a 12 nucleotide random portion was used to provide an in-depth characterization of DNA polymerase priming bias. The synthetic library was amplified with three commercially available DNA polymerases using an anchored primer with a random 3' hexamer end. After normalization, the next generation sequencing (NGS) results of the amplified libraries were directly compared to the unamplified synthetic library. Here, high throughput sequencing was used to systematically demonstrate and characterize DNA polymerase priming bias. We demonstrate that certain sequence motifs are preferred over others as primers where the six nucleotide sequences at the 3' end of the primer, as well as the sequences four base pairs downstream of the priming site, may influence priming efficiencies. DNA polymerases in the same family from two different commercial vendors prefer similar motifs, while another commercially available enzyme from a different DNA polymerase family prefers different motifs. Furthermore, the preferred priming motifs are GC-rich. The DNA polymerase preference for certain sequence motifs was verified by amplification from single-primer templates. We incorporated the observed DNA polymerase preference into a primer-design program that guides the placement of the primer to an optimal location on the template. DNA polymerase priming bias was characterized using a synthetic library amplification system and NGS. The characterization of DNA polymerase priming bias was then utilized to guide the primer-design process and demonstrate varying amplification efficiencies among three commercially available DNA polymerases. The results suggest that the interaction of the DNA polymerase with the primer:template junction during the initiation of DNA polymerization is very important in terms of overall amplification bias and has broader implications for both the primer design process and multiplex PCR.
Open resource metagenomics: a model for sharing metagenomic libraries.
Neufeld, J D; Engel, K; Cheng, J; Moreno-Hagelsieb, G; Rose, D R; Charles, T C
2011-11-30
Both sequence-based and activity-based exploitation of environmental DNA have provided unprecedented access to the genomic content of cultivated and uncultivated microorganisms. Although researchers deposit microbial strains in culture collections and DNA sequences in databases, activity-based metagenomic studies typically only publish sequences from the hits retrieved from specific screens. Physical metagenomic libraries, conceptually similar to entire sequence datasets, are usually not straightforward to obtain by interested parties subsequent to publication. In order to facilitate unrestricted distribution of metagenomic libraries, we propose the adoption of open resource metagenomics, in line with the trend towards open access publishing, and similar to culture- and mutant-strain collections that have been the backbone of traditional microbiology and microbial genetics. The concept of open resource metagenomics includes preparation of physical DNA libraries, preferably in versatile vectors that facilitate screening in a diversity of host organisms, and pooling of clones so that single aliquots containing complete libraries can be easily distributed upon request. Database deposition of associated metadata and sequence data for each library provides researchers with information to select the most appropriate libraries for further research projects. As a starting point, we have established the Canadian MetaMicroBiome Library (CM(2)BL [1]). The CM(2)BL is a publicly accessible collection of cosmid libraries containing environmental DNA from soils collected from across Canada, spanning multiple biomes. The libraries were constructed such that the cloned DNA can be easily transferred to Gateway® compliant vectors, facilitating functional screening in virtually any surrogate microbial host for which there are available plasmid vectors. The libraries, which we are placing in the public domain, will be distributed upon request without restriction to members of both the academic research community and industry. This article invites the scientific community to adopt this philosophy of open resource metagenomics to extend the utility of functional metagenomics beyond initial publication, circumventing the need to start from scratch with each new research project.
Open resource metagenomics: a model for sharing metagenomic libraries
Neufeld, J.D.; Engel, K.; Cheng, J.; Moreno-Hagelsieb, G.; Rose, D.R.; Charles, T.C.
2011-01-01
Both sequence-based and activity-based exploitation of environmental DNA have provided unprecedented access to the genomic content of cultivated and uncultivated microorganisms. Although researchers deposit microbial strains in culture collections and DNA sequences in databases, activity-based metagenomic studies typically only publish sequences from the hits retrieved from specific screens. Physical metagenomic libraries, conceptually similar to entire sequence datasets, are usually not straightforward to obtain by interested parties subsequent to publication. In order to facilitate unrestricted distribution of metagenomic libraries, we propose the adoption of open resource metagenomics, in line with the trend towards open access publishing, and similar to culture- and mutant-strain collections that have been the backbone of traditional microbiology and microbial genetics. The concept of open resource metagenomics includes preparation of physical DNA libraries, preferably in versatile vectors that facilitate screening in a diversity of host organisms, and pooling of clones so that single aliquots containing complete libraries can be easily distributed upon request. Database deposition of associated metadata and sequence data for each library provides researchers with information to select the most appropriate libraries for further research projects. As a starting point, we have established the Canadian MetaMicroBiome Library (CM2BL [1]). The CM2BL is a publicly accessible collection of cosmid libraries containing environmental DNA from soils collected from across Canada, spanning multiple biomes. The libraries were constructed such that the cloned DNA can be easily transferred to Gateway® compliant vectors, facilitating functional screening in virtually any surrogate microbial host for which there are available plasmid vectors. The libraries, which we are placing in the public domain, will be distributed upon request without restriction to members of both the academic research community and industry. This article invites the scientific community to adopt this philosophy of open resource metagenomics to extend the utility of functional metagenomics beyond initial publication, circumventing the need to start from scratch with each new research project. PMID:22180823
SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read
2010-01-01
Background High-throughput automated sequencing has enabled an exponential growth rate of sequencing data. This requires increasing sequence quality and reliability in order to avoid database contamination with artefactual sequences. The arrival of pyrosequencing enhances this problem and necessitates customisable pre-processing algorithms. Results SeqTrim has been implemented both as a Web and as a standalone command line application. Already-published and newly-designed algorithms have been included to identify sequence inserts, to remove low quality, vector, adaptor, low complexity and contaminant sequences, and to detect chimeric reads. The availability of several input and output formats allows its inclusion in sequence processing workflows. Due to its specific algorithms, SeqTrim outperforms other pre-processors implemented as Web services or standalone applications. It performs equally well with sequences from EST libraries, SSH libraries, genomic DNA libraries and pyrosequencing reads and does not lead to over-trimming. Conclusions SeqTrim is an efficient pipeline designed for pre-processing of any type of sequence read, including next-generation sequencing. It is easily configurable and provides a friendly interface that allows users to know what happened with sequences at every pre-processing stage, and to verify pre-processing of an individual sequence if desired. The recommended pipeline reveals more information about each sequence than previously described pre-processors and can discard more sequencing or experimental artefacts. PMID:20089148
NASA Astrophysics Data System (ADS)
Feng, Yanwei; Liu, Wenfen; Xu, Xin; Yang, Jianmin; Wang, Weijun; Wei, Xiumei; Liu, Xiangquan; Sun, Guohua
2017-10-01
Amphioctopus fangsiao is one of the most economically important species and has been considered to be a candidate for aquaculture. In order to facilitate its fine-scale genetic analyses, we constructed a normalized full-length library successfully and developed a set of microsatellite markers in this study. The normalized full-length library had a storage capacity of 6.9×105 independent clones. The recombination efficiency was 95% and the average size of inserted fragments was longer than 1000 bp. A total of 3440 high quality ESTs were obtained, which were assembled into 1803 unigenes. Of these unigenes, 450 (25%) were assigned into 33 Gene Ontology terms, 576 (31.9%) into 153 Kyoto Encyclopedia of Genes and Genomes pathways, and 275 (15.3%) into 22 Clusters of Orthologous Groups. Seventy-six polymorphic microsatellite markers were identified. The number of alleles per locus ranged from 4 to 17, and the observed and expected heterozygosities varied between 0.167 and 0.967 and between 0.326 and 0.944, respectively. Twelve loci were significantly deviated from Hardy-Weinberg equilibrium after Bonferroni correction and no linkage disequilibrium was found between different loci. This study provided not only a useful resource for the isolation of the functional genes, but also a set of informative microsatellites for the assessment of population structure and conservation genetics of A. fangsiao.
Xu, Y L; Li, L; Wu, K; Peeters, A J; Gage, D A; Zeevaart, J A
1995-07-03
The biosynthesis of gibberellins (GAs) after GA12-aldehyde involves a series of oxidative steps that lead to the formation of bioactive GAs. Previously, a cDNA clone encoding a GA 20-oxidase [gibberellin, 2-oxoglutarate:oxygen oxidoreductase (20-hydroxylating, oxidizing), EC 1.14.11.-] was isolated by immunoscreening a cDNA library from liquid endosperm of pumpkin (Cucurbita maxima L.) with antibodies against partially purified GA 20-oxidase. Here, we report isolation of a genomic clone for GA 20-oxidase from a genomic library of the long-day species Arabidopsis thaliana Heynh., strain Columbia, by using the pumpkin cDNA clone as a heterologous probe. This genomic clone contains a GA 20-oxidase gene that consists of three exons and two introns. The three exons are 1131-bp long and encode 377 amino acid residues. A cDNA clone corresponding to the putative GA 20-oxidase genomic sequence was constructed with the reverse transcription-PCR method, and the identity of the cDNA clone was confirmed by analyzing the capability of the fusion protein expressed in Escherichia coli to convert GA53 to GA44 and GA19 to GA20. The Arabidopsis GA 20-oxidase shares 55% identity and > 80% similarity with the pumpkin GA 20-oxidase at the derived amino acid level. Both GA 20-oxidases share high homology with other 2-oxoglutarate-dependent dioxygenases (2-ODDs), but the highest homology was found between the two GA 20-oxidases. Mapping results indicated tight linkage between the cloned GA 20-oxidase and the GA5 locus of Arabidopsis. The ga5 semidwarf mutant contains a G-->A point mutation that inserts a translational stop codon in the protein-coding sequence, thus confirming that the GA5 locus encodes GA 20-oxidase. Expression of the GA5 gene in Ara-bidopsis leaves was enhanced after plants were transferred from short to long days; it was reduced by GA4 treatment, suggesting end-product repression in the GA biosynthetic pathway.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xu, Yun-Ling; Li, Li; Wu, Keqiang
1995-07-03
The biosynthesis of gibberellins (GAs) after GA{sub 12}-aldehyde involves a series of oxidative steps that lead to the formation of bioactive GAs. Previously, a cDNA clone encoding a GA 20-oxidase [gibberellin, 2-oxoglutarate:oxygen oxidoreductase (20-hydroxylating, oxidizing), EC 1.14.11-] was isolated by immunoscreening a cDNA library from liquid endosperm of pumpkin (Cucurbita maxima L.) with antibodies against partially purified GA 20-oxidase. Here, we report isolation of a genomic clone for GA 20-oxidase from a genomic library of the long-day species Arabidopsis thaliana Heynh., strain Columbia, by using the pumpkin cDNA clone as a heterologous probe. This genomic clone contains a GA 20-oxidasemore » gene that consists of three exons and two introns. The three exons are 1131-bp long and encode 377 amino acid residues. A cDNA clone corresponding to the putative GA 20-oxidase genomic sequence was constructed with the reverse transcription-PCR method, and the identity of the cDNA clone was confirmed by analyzing the capability of the fusion protein expressed in Escherichia coli to convert GA{sub 53} to GA{sub 44} and GA{sub 19} to GA{sub 20}. The Arabidopsis GA 20-oxidase shares 55% identity and >80% similarity with the pumpkin GA 20-oxidase at the derived amino acid level. Both GA 20-oxidases share high homology with other 2-oxoglutarate-dependent dioxygenases (2-ODDs), but the highest homology was found between the two GA 20-oxidases. Mapping results indicated tight linkage between the cloned GA 20-oxidase and the GA locus of Arabidopsis. The ga5 semidwarf mutant contains a G {yields} A point mutation that inserts a translational stop codon in the protein-coding sequence, thus confirming that the GA5 locus encodes GA 20-oxidase. Expression of the GA5 gene in Arabidopsis leaves was enhanced after plants were transferred from short to long days; it was reduced by GA{sub 4} treatment, suggesting end-product repression in the GA biosynthetic pathway. 28 refs., 6 figs.« less
Prioritizing multiple therapeutic targets in parallel using automated DNA-encoded library screening
NASA Astrophysics Data System (ADS)
Machutta, Carl A.; Kollmann, Christopher S.; Lind, Kenneth E.; Bai, Xiaopeng; Chan, Pan F.; Huang, Jianzhong; Ballell, Lluis; Belyanskaya, Svetlana; Besra, Gurdyal S.; Barros-Aguirre, David; Bates, Robert H.; Centrella, Paolo A.; Chang, Sandy S.; Chai, Jing; Choudhry, Anthony E.; Coffin, Aaron; Davie, Christopher P.; Deng, Hongfeng; Deng, Jianghe; Ding, Yun; Dodson, Jason W.; Fosbenner, David T.; Gao, Enoch N.; Graham, Taylor L.; Graybill, Todd L.; Ingraham, Karen; Johnson, Walter P.; King, Bryan W.; Kwiatkowski, Christopher R.; Lelièvre, Joël; Li, Yue; Liu, Xiaorong; Lu, Quinn; Lehr, Ruth; Mendoza-Losana, Alfonso; Martin, John; McCloskey, Lynn; McCormick, Patti; O'Keefe, Heather P.; O'Keeffe, Thomas; Pao, Christina; Phelps, Christopher B.; Qi, Hongwei; Rafferty, Keith; Scavello, Genaro S.; Steiginga, Matt S.; Sundersingh, Flora S.; Sweitzer, Sharon M.; Szewczuk, Lawrence M.; Taylor, Amy; Toh, May Fern; Wang, Juan; Wang, Minghui; Wilkins, Devan J.; Xia, Bing; Yao, Gang; Zhang, Jean; Zhou, Jingye; Donahue, Christine P.; Messer, Jeffrey A.; Holmes, David; Arico-Muendel, Christopher C.; Pope, Andrew J.; Gross, Jeffrey W.; Evindar, Ghotas
2017-07-01
The identification and prioritization of chemically tractable therapeutic targets is a significant challenge in the discovery of new medicines. We have developed a novel method that rapidly screens multiple proteins in parallel using DNA-encoded library technology (ELT). Initial efforts were focused on the efficient discovery of antibacterial leads against 119 targets from Acinetobacter baumannii and Staphylococcus aureus. The success of this effort led to the hypothesis that the relative number of ELT binders alone could be used to assess the ligandability of large sets of proteins. This concept was further explored by screening 42 targets from Mycobacterium tuberculosis. Active chemical series for six targets from our initial effort as well as three chemotypes for DHFR from M. tuberculosis are reported. The findings demonstrate that parallel ELT selections can be used to assess ligandability and highlight opportunities for successful lead and tool discovery.
PuLSE: Quality control and quantification of peptide sequences explored by phage display libraries.
Shave, Steven; Mann, Stefan; Koszela, Joanna; Kerr, Alastair; Auer, Manfred
2018-01-01
The design of highly diverse phage display libraries is based on assumption that DNA bases are incorporated at similar rates within the randomized sequence. As library complexity increases and expected copy numbers of unique sequences decrease, the exploration of library space becomes sparser and the presence of truly random sequences becomes critical. We present the program PuLSE (Phage Library Sequence Evaluation) as a tool for assessing randomness and therefore diversity of phage display libraries. PuLSE runs on a collection of sequence reads in the fastq file format and generates tables profiling the library in terms of unique DNA sequence counts and positions, translated peptide sequences, and normalized 'expected' occurrences from base to residue codon frequencies. The output allows at-a-glance quantitative quality control of a phage library in terms of sequence coverage both at the DNA base and translated protein residue level, which has been missing from toolsets and literature. The open source program PuLSE is available in two formats, a C++ source code package for compilation and integration into existing bioinformatics pipelines and precompiled binaries for ease of use.
Comprehensive identification of Vibrio vulnificus genes required for growth in human serum.
Carda-Diéguez, M; Silva-Hernández, F X; Hubbard, T P; Chao, M C; Waldor, M K; Amaro, C
2018-12-31
Vibrio vulnificus can be a highly invasive pathogen capable of spreading from an infection site to the bloodstream, causing sepsis and death. To survive and proliferate in blood, the pathogen requires mechanisms to overcome the innate immune defenses and metabolic limitations of this host niche. We created a high-density transposon mutant library in YJ016, a strain representative of the most virulent V. vulnificus lineage (or phylogroup) and used transposon insertion sequencing (TIS) screens to identify loci that enable the pathogen to survive and proliferate in human serum. Initially, genes underrepresented for insertions were used to estimate the V. vulnificus essential gene set; comparisons of these genes with similar TIS-based classification of underrepresented genes in other vibrios enabled the compilation of a common Vibrio essential gene set. Analysis of the relative abundance of insertion mutants in the library after exposure to serum suggested that genes involved in capsule biogenesis are critical for YJ016 complement resistance. Notably, homologues of two genes required for YJ016 serum-resistance and capsule biogenesis were not previously linked to capsule biogenesis and are largely absent from other V. vulnificus strains. The relative abundance of mutants after exposure to heat inactivated serum was compared with the findings from the serum screen. These comparisons suggest that in both conditions the pathogen relies on its Na + transporting NADH-ubiquinone reductase (NQR) complex and type II secretion system to survive/proliferate within the metabolic constraints of serum. Collectively, our findings reveal the potency of comparative TIS screens to provide knowledge of how a pathogen overcomes the diverse limitations to growth imposed by serum.
Oumeraci, Tonio; Jensen, Vanessa; Talbot, Steven R; Hofmann, Winfried; Kostrzewa, Markus; Schlegelberger, Brigitte; von Neuhoff, Nils; Häussler, Susanne
2015-01-01
Pseudomonas aeruginosa is a gram-negative bacterium that is ubiquitously present in the aerobic biosphere. As an antibiotic-resistant facultative pathogen, it is a major cause of hospital-acquired infections. Its rapid and accurate identification is crucial in clinical and therapeutic environments. In a large-scale MALDI-TOF mass spectrometry-based screen of the Harvard transposon insertion mutant library of P. aeruginosa strain PA14, intact-cell proteome profile spectra of 5547 PA14 transposon mutants exhibiting a plethora of different phenotypes were acquired and analyzed. Of all P. aeruginosa PA14 mutant profiles 99.7% were correctly identified as P. aeruginosa with the Biotyper software on the species level. On the strain level, 99.99% of the profiles were mapped to five different individual P. aeruginosa Biotyper database entries. A principal component analysis-based approach was used to determine the most important discriminatory mass features between these Biotyper groups. Although technical replicas were consistently categorized to specific Biotyper groups in 94.2% of the mutant profiles, biological replicas were not, indicating that the distinct proteotypes are affected by growth conditions. The PA14 mutant profile collection presented here constitutes the largest coherent P. aeruginosa MALDI-TOF spectral dataset publicly available today. Transposon insertions in thousands of different P. aeruginosa genes did not affect species identification from MALDI-TOF mass spectra, clearly demonstrating the robustness of the approach. However, the assignment of the individual spectra to sub-groups proved to be non-consistent in biological replicas, indicating that the differentiation between biotyper groups in this nosocomial pathogen is unassured.
Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes.
Oyola, Samuel O; Otto, Thomas D; Gu, Yong; Maslen, Gareth; Manske, Magnus; Campino, Susana; Turner, Daniel J; Macinnis, Bronwyn; Kwiatkowski, Dominic P; Swerdlow, Harold P; Quail, Michael A
2012-01-03
Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences. We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates. We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material.
Dunn, R. C.; Laurie, C. C.
1995-01-01
Variation in the DNA sequence and level of alcohol dehydrogenase (Adh) gene expression in Drosophila melanogaster have been studied to determine what types of DNA polymorphisms contribute to phenotypic variation in natural populations. The Adh gene, like many others, shows a high level of variability in both DNA sequence and quantitative level of expression. A number of transposable element insertions occur in the Adh region and one of these, a copia insertion in the 5' flanking region, is associated with unusually low Adh expression. To determine whether this insertion (called RI42) causes the low expression level, the insertion was excised from the cloned RI42 Adh gene and the effect was assessed by P-element transformation. Removal of this insertion causes a threefold increase in the level of ADH, clearly showing that it contributes to the naturally occurring variation in expression at this locus. Removal of all but one LTR also causes a threefold increase, indicating that the mechanism is not a simple sequence disruption. Furthermore, this copia insertion, which is located between the two Adh promoters and their upstream enhancer sequences, has differential effects on the levels of proximal and distal transcripts. Finally, a test for the possible modifying effects of two suppressor loci, su(w(a)) and su(f), on this insertional mutation was negative, in contrast to a previous report in the literature. PMID:7498745
Microsatellite DNA capture from enriched libraries.
Gonzalez, Elena G; Zardoya, Rafael
2013-01-01
Microsatellites are DNA sequences of tandem repeats of one to six nucleotides, which are highly polymorphic, and thus the molecular markers of choice in many kinship, population genetic, and conservation studies. There have been significant technical improvements since the early methods for microsatellite isolation were developed, and today the most common procedures take advantage of the hybrid capture methods of enriched-targeted microsatellite DNA. Furthermore, recent advents in sequencing technologies (i.e., next-generation sequencing, NGS) have fostered the mining of microsatellite markers in non-model organisms, affording a cost-effective way of obtaining a large amount of sequence data potentially useful for loci characterization. The rapid improvements of NGS platforms together with the increase in available microsatellite information open new avenues to the understanding of the evolutionary forces that shape genetic structuring in wild populations. Here, we provide detailed methodological procedures for microsatellite isolation based on the screening of GT microsatellite-enriched libraries, either by cloning and Sanger sequencing of positive clones or by direct NGS. Guides for designing new species-specific primers and basic genotyping are also given.
[Current applications of high-throughput DNA sequencing technology in antibody drug research].
Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong
2012-03-01
Since the publication of a high-throughput DNA sequencing technology based on PCR reaction was carried out in oil emulsions in 2005, high-throughput DNA sequencing platforms have been evolved to a robust technology in sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation of discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we present a review of current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach of discovery and development of antibody drugs.
Chicken microsatellite markers isolated from libraries enriched for simple tandem repeats.
Gibbs, M; Dawson, D A; McCamley, C; Wardle, A F; Armour, J A; Burke, T
1997-12-01
The total number of microsatellite loci is considered to be at least 10-fold lower in avian species than in mammalian species. Therefore, efficient large-scale cloning of chicken microsatellites, as required for the construction of a high-resolution linkage map, is facilitated by the construction of libraries using an enrichment strategy. In this study, a plasmid library enriched for tandem repeats was constructed from chicken genomic DNA by hybridization selection. Using this technique the proportion of recombinant clones that cross-hybridized to probes containing simple tandem repeats was raised to 16%, compared with < 0.1% in a non-enriched library. Primers were designed from 121 different sequences. Polymerase chain reaction (PCR) analysis of two chicken reference pedigrees enabled 72 loci to be localized within the collaborative chicken genetic map, and at least 30 of the remaining loci have been shown to be informative in these or other crosses.
Exploring Nitrilase Sequence Space for Enantioselective Catalysis†
Robertson, Dan E.; Chaplin, Jennifer A.; DeSantis, Grace; Podar, Mircea; Madden, Mark; Chi, Ellen; Richardson, Toby; Milan, Aileen; Miller, Mark; Weiner, David P.; Wong, Kelvin; McQuaid, Jeff; Farwell, Bob; Preston, Lori A.; Tan, Xuqiu; Snead, Marjory A.; Keller, Martin; Mathur, Eric; Kretz, Patricia L.; Burk, Mark J.; Short, Jay M.
2004-01-01
Nitrilases are important in the biosphere as participants in synthesis and degradation pathways for naturally occurring, as well as xenobiotically derived, nitriles. Because of their inherent enantioselectivity, nitrilases are also attractive as mild, selective catalysts for setting chiral centers in fine chemical synthesis. Unfortunately, <20 nitrilases have been reported in the scientific and patent literature, and because of stability or specificity shortcomings, their utility has been largely unrealized. In this study, 137 unique nitrilases, discovered from screening of >600 biotope-specific environmental DNA (eDNA) libraries, were characterized. Using culture-independent means, phylogenetically diverse genomes were captured from entire biotopes, and their genes were expressed heterologously in a common cloning host. Nitrilase genes were targeted in a selection-based expression assay of clonal populations numbering 106 to 1010 members per eDNA library. A phylogenetic analysis of the novel sequences discovered revealed the presence of at least five major sequence clades within the nitrilase subfamily. Using three nitrile substrates targeted for their potential in chiral pharmaceutical synthesis, the enzymes were characterized for substrate specificity and stereospecificity. A number of important correlations were found between sequence clades and the selective properties of these nitrilases. These enzymes, discovered using a high-throughput, culture-independent method, provide a catalytic toolbox for enantiospecific synthesis of a variety of carboxylic acid derivatives, as well as an intriguing library for evolutionary and structural analyses. PMID:15066841
Xue, Jian; Wu, Riga; Pan, Yajiao; Wang, Shunxia; Qu, Baowang; Qin, Ying; Shi, Yuequn; Zhang, Chuchu; Li, Ran; Zhang, Liyan; Zhou, Cheng; Sun, Hongyu
2018-04-02
Massively parallel sequencing (MPS) technologies, also termed as next-generation sequencing (NGS), are becoming increasingly popular in study of short tandem repeats (STR). However, current library preparation methods are usually based on ligation or two-round PCR that requires more steps, making it time-consuming (about 2 days), laborious and expensive. In this study, a 16-plex STR typing system was designed with fusion primer strategy based on the Ion Torrent S5 XL platform which could effectively resolve the above challenges for forensic DNA database-type samples (bloodstains, saliva stains, etc.). The efficiency of this system was tested in 253 Han Chinese participants. The libraries were prepared without DNA isolation and adapter ligation, and the whole process only required approximately 5 h. The proportion of thoroughly genotyped samples in which all the 16 loci were successfully genotyped was 86% (220/256). Of the samples, 99.7% showed 100% concordance between NGS-based STR typing and capillary electrophoresis (CE)-based STR typing. The inconsistency might have been caused by off-ladder alleles and mutations in primer binding sites. Overall, this panel enabled the large-scale genotyping of the DNA samples with controlled quality and quantity because it is a simple, operation-friendly process flow that saves labor, time and costs. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Hughes, Stephen R; Butt, Tauseef R; Bartolett, Scott; Riedmuller, Steven B; Farrelly, Philip
2011-08-01
The molecular biological techniques for plasmid-based assembly and cloning of gene open reading frames are essential for elucidating the function of the proteins encoded by the genes. High-throughput integrated robotic molecular biology platforms that have the capacity to rapidly clone and express heterologous gene open reading frames in bacteria and yeast and to screen large numbers of expressed proteins for optimized function are an important technology for improving microbial strains for biofuel production. The process involves the production of full-length complementary DNA libraries as a source of plasmid-based clones to express the desired proteins in active form for determination of their functions. Proteins that were identified by high-throughput screening as having desired characteristics are overexpressed in microbes to enable them to perform functions that will allow more cost-effective and sustainable production of biofuels. Because the plasmid libraries are composed of several thousand unique genes, automation of the process is essential. This review describes the design and implementation of an automated integrated programmable robotic workcell capable of producing complementary DNA libraries, colony picking, isolating plasmid DNA, transforming yeast and bacteria, expressing protein, and performing appropriate functional assays. These operations will allow tailoring microbial strains to use renewable feedstocks for production of biofuels, bioderived chemicals, fertilizers, and other coproducts for profitable and sustainable biorefineries. Published by Elsevier Inc.
Knechtel, Johann
2017-01-01
Abstract We have developed a novel approach for creating membrane-spanning protein-based pores. The construction principle is based on using well-defined, circular DNA nanostructures to arrange a precise number of pore-forming protein toxin monomers. We can thereby obtain, for the first time, protein pores with specifically set diameters. We demonstrate this principle by constructing artificial alpha-hemolysin (αHL) pores. The DNA/αHL hybrid nanopores composed of twelve, twenty or twenty-six monomers show stable insertions into lipid bilayers during electrical recordings, along with steady, pore size-dependent current levels. Our approach successfully advances the applicability of nanopores, in particular towards label-free studies of single molecules in large nanoscaled biological structures. PMID:29088457
ERIC Educational Resources Information Center
Soja, Constance M.; Huerta, Deborah
2001-01-01
Describes an interactive internet exercise that enables students to engage in cooperative library and web research on a controversial topic in science, specifically the cloning of extinct lifeforms. Creates a dynamic learning environment in a large introductory geology course and demonstrates the importance of scientific literacy. (Author/SAH)
NASA Astrophysics Data System (ADS)
Essler, Markus; Ruoslahti, Erkki
2002-02-01
In vivo phage display identifies peptides that selectively home to the vasculature of individual organs, tissues, and tumors. Here we report the identification of a cyclic nonapeptide, CPGPEGAGC, which homes to normal breast tissue with a 100-fold selectivity over nontargeted phage. The homing of the phage is inhibited by its cognate synthetic peptide. Phage localization in tissue sections showed that the breast-homing phage binds to the blood vessels in the breast, but not in other tissues. The phage also bound to the vasculature of hyperplastic and malignant lesions in transgenic breast cancer mice. Expression cloning with a phage-displayed cDNA library yielded a phage that specifically bound to the breast-homing peptide. The cDNA insert was homologous to a fragment of aminopeptidase P. The homing peptide bound aminopeptidase P from malignant breast tissue in affinity chromatography. Antibodies against aminopeptidase P inhibited the in vitro binding of the phage-displayed cDNA to the peptide and the in vivo homing of phage carrying the peptide. These results indicate that aminopeptidase P is the receptor for the breast-homing peptide. This peptide may be useful in designing drugs for the prevention and treatment of breast cancer.
Pei, Zhihua; Sun, Xiaoning; Tang, Yan; Wang, Kai; Gao, Yunhang; Ma, Hongxia
2014-10-01
Musca domestica (Diptera: Muscidae), the housefly, exhibits unique immune defences and can produce antimicrobial peptides upon stimulation with bacteria. Based on the cDNA library constructed using the suppression subtractive hybridization (SSH) method, a 198-bp antimicrobial peptide gene, which we named MDAP-2, was amplified by rapid amplification of cDNA ends (RACE) from M. domestica larvae stimulated with Salmonella pullorum (Enterobacteriaceae: Salmonella). In the present study, the full-length MDAP-2 gene was cloned and inserted into a His-tagged Escherichia coli prokaryotic expression system to enable production of the recombinant peptide. The recombinant MDAP-2 peptide was purified using Ni-NTA HisTrap FF crude column chromatography. The bacteriostatic activity of the recombinant purified MDAP-2 protein was assessed. The results indicated that MDAP-2 had in vitro antibacterial activity against all of the tested Gram- bacteria from clinical isolates, including E. coli (Enterobacteriaceae: Escherichia), one strain of S. pullorum (Enterobacteriaceae: Salmonella), and one strain of Pasteurella multocida. DNA sequencing and BLAST analysis showed that the MDAP-2 antimicrobial peptide gene was not homologous to any other antimicrobial peptide genes in GenBank. The antibacterial mechanisms of the newly discovered MDAP-2 peptide warrant further study. Copyright © 2014 Elsevier B.V. All rights reserved.
Itoh, S; Abe, Y; Kubo, A; Okuda, M; Shimoji, M; Nakayama, K; Kamataki, T
1997-02-07
An 11.5 kb fragment of the mouse Cyp3a16 gene containing the 5' flanking region was isolated from the lambda DASHII mouse genomic library. A part of the 5' flanking region and the first exon of Cyp3a16 gene were sequenced. S1 mapping analysis showed the presence of two transcriptional initiation sites. The first exon was completely identical to Cyp3a16 cDNA. The identity of 5' flanking sequences between Cyp3a16 and Cyp3a11 genes was about 69%. A typical TATA box and a basic transcription element (BTE) were found as seen with other CYP3A genes from various animal species Moreover, some putative transcriptional regulatory elements were also found in addition to the sequence motif seen for the formation of Z-type DNA. To examine the transcriptional activity of Cyp3a11 gene, DNA fragments in the 5'-flanking region of the gene were inserted front of the luciferase structural gene, and the constructs were transfected in primary hepatocytes. The analysis of the luciferase activity indicated that the region between -146 and -56 was necessary for the transcription of CYP3a16 gene.
Characterization of Bleomycin-Mediated Cleavage of a Hairpin DNA Library
Segerman, Zachary J.; Roy, Basab; Hecht, Sidney M.
2013-01-01
A study of BLM A5 was conducted using a previously isolated library of hairpin DNAs found to bind strongly to metal free BLM. The ability of Fe(II)•BLM to effect cleavage on both the 3' and 5'-arms of the hairpin DNAs was characterized. The strongly bound DNAs were found to be efficient substrates for Fe•BLM A5-mediated hairpin DNA cleavage. Surprisingly, the most prevalent site of BLM-mediated cleavage was found to be the 5′-AT-3′ dinucleotide sequence. This dinucleotide sequence, and other sequences generally not cleaved well by BLM when examined using arbitrarily chosen DNA substrates, were apparent when examining the library of ten hairpin DNAs. In total, 132 sites of DNA cleavage were produced by exposure of the hairpin DNA library to Fe•BLM A5. The existence of multiple sites of cleavage on both the 3′- and 5′-arms of the hairpin DNAs suggested that some of these might be double-strand cleavage events. Accordingly, an assay was developed with which to test the propensity of the hairpin DNAs to undergo double-strand DNA damage. One hairpin DNA was characterized using this method, and gave results consistent with earlier reports of double-strand DNA cleavage, but with a sequence selectivity different from those reported previously. PMID:23834496
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hadano, S.; Ishida, Y.; Tomiyasu, H.
1994-09-01
To complete a transcription map of the 1 Mb region in human chromosome 4p16.3 containing the Huntington disease (HD) gene, the isolation of cDNA clones are being performed throughout. Our method relies on a direct screening of the cDNA libraries probed with single copy microclones from 3 YAC clones spanning 1 Mbp of the HD gene region. AC-DNAs were isolated by a preparative pulsed-field gel electrophoresis, amplified by both a single unique primer (SUP)-PCR and a linker ligation PCR, and 6 microclone-DNA libraries were generated. Then, 8,640 microclones from these libraries were independently amplified by PCR, and arrayed onto themore » membranes. 800-900 microclones that were not cross-hybridized with total human and yeast genomic DNA, TAC vector DNA, and ribosomal cDNA on a dot hybridization (putatively carrying single copy sequences) were pooled to make 9 probe pools. A total of {approximately}1.8x10{sup 7} plaques from the human brain cDNA libraries was screened with 9 pool-probes, and then 672 positive cDNA clones were obtained. So far, 597 cDNA clones were defined and arrayed onto a map of the 1 Mbp of the HD gene region by hybridization with HD region-specific cosmid contigs and YAC clones. Further characterization including a DNA sequencing and Northern blot analysis is currently underway.« less
Large inserts for big data: artificial chromosomes in the genomic era.
Tocchetti, Arianna; Donadio, Stefano; Sosio, Margherita
2018-05-01
The exponential increase in available microbial genome sequences coupled with predictive bioinformatic tools is underscoring the genetic capacity of bacteria to produce an unexpected large number of specialized bioactive compounds. Since most of the biosynthetic gene clusters (BGCs) present in microbial genomes are cryptic, i.e. not expressed under laboratory conditions, a variety of cloning systems and vectors have been devised to harbor DNA fragments large enough to carry entire BGCs and to allow their transfer in suitable heterologous hosts. This minireview provides an overview of the vectors and approaches that have been developed for cloning large BGCs, and successful examples of heterologous expression.
Recent advances on the encoding and selection methods of DNA-encoded chemical library.
Shi, Bingbing; Zhou, Yu; Huang, Yiran; Zhang, Jianfu; Li, Xiaoyu
2017-02-01
DNA-encoded chemical library (DEL) has emerged as a powerful and versatile tool for ligand discovery in chemical biology research and in drug discovery. Encoding and selection methods are two of the most important technological aspects of DEL that can dictate the performance and utilities of DELs. In this digest, we have summarized recent advances on the encoding and selection strategies of DEL and also discussed the latest developments on DNA-encoded dynamic library, a new frontier in DEL research. Copyright © 2016 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Huang, Shuohao; Kawabe, Yoshinori; Ito, Akira
2012-01-06
Highlights: Black-Right-Pointing-Pointer Adeno-associated virus (AAV) is capable of targeted integration in human cells. Black-Right-Pointing-Pointer Integrase-defective retroviral vector (IDRV) enables a circular DNA delivery. Black-Right-Pointing-Pointer A targeted integration system of IDRV DNA using the AAV integration mechanism. Black-Right-Pointing-Pointer Targeted IDRV integration ameliorates the safety concerns for retroviral vectors. -- Abstract: Retroviral vectors have been employed in clinical trials for gene therapy owing to their relative large packaging capacity, alterable cell tropism, and chromosomal integration for stable transgene expression. However, uncontrollable integrations of transgenes are likely to cause safety issues, such as insertional mutagenesis. A targeted transgene integration system for retroviral vectors,more » therefore, is a straightforward way to address the insertional mutagenesis issue. Adeno-associated virus (AAV) is the only known virus capable of targeted integration in human cells. In the presence of AAV Rep proteins, plasmids possessing the p5 integration efficiency element (p5IEE) can be integrated into the AAV integration site (AAVS1) in the human genome. In this report, we describe a system that can target the circular DNA derived from non-integrating retroviral vectors to the AAVS1 site by utilizing the Rep/p5IEE integration mechanism. Our results showed that after G418 selection 30% of collected clones had retroviral DNA targeted at the AAVS1 site.« less
Diverse Antibiotic Resistance Genes in Dairy Cow Manure
Wichmann, Fabienne; Udikovic-Kolic, Nikolina; Andrew, Sheila; Handelsman, Jo
2014-01-01
ABSTRACT Application of manure from antibiotic-treated animals to crops facilitates the dissemination of antibiotic resistance determinants into the environment. However, our knowledge of the identity, diversity, and patterns of distribution of these antibiotic resistance determinants remains limited. We used a new combination of methods to examine the resistome of dairy cow manure, a common soil amendment. Metagenomic libraries constructed with DNA extracted from manure were screened for resistance to beta-lactams, phenicols, aminoglycosides, and tetracyclines. Functional screening of fosmid and small-insert libraries identified 80 different antibiotic resistance genes whose deduced protein sequences were on average 50 to 60% identical to sequences deposited in GenBank. The resistance genes were frequently found in clusters and originated from a taxonomically diverse set of species, suggesting that some microorganisms in manure harbor multiple resistance genes. Furthermore, amid the great genetic diversity in manure, we discovered a novel clade of chloramphenicol acetyltransferases. Our study combined functional metagenomics with third-generation PacBio sequencing to significantly extend the roster of functional antibiotic resistance genes found in animal gut bacteria, providing a particularly broad resource for understanding the origins and dispersal of antibiotic resistance genes in agriculture and clinical settings. PMID:24757214
Increasing leaf vein density by mutagenesis: laying the foundations for C4 rice.
Feldman, Aryo B; Murchie, Erik H; Leung, Hei; Baraoidan, Marietta; Coe, Robert; Yu, Su-May; Lo, Shuen-Fang; Quick, William P
2014-01-01
A high leaf vein density is both an essential feature of C4 photosynthesis and a foundation trait to C4 evolution, ensuring the optimal proportion and proximity of mesophyll and bundle sheath cells for permitting the rapid exchange of photosynthates. Two rice mutant populations, a deletion mutant library with a cv. IR64 background (12,470 lines) and a T-DNA insertion mutant library with a cv. Tainung 67 background (10,830 lines), were screened for increases in vein density. A high throughput method with handheld microscopes was developed and its accuracy was supported by more rigorous microscopy analysis. Eight lines with significantly increased leaf vein densities were identified to be used as genetic stock for the global C4 Rice Consortium. The candidate population was shown to include both shared and independent mutations and so more than one gene controlled the high vein density phenotype. The high vein density trait was found to be linked to a narrow leaf width trait but the linkage was incomplete. The more genetically robust narrow leaf width trait was proposed to be used as a reliable phenotypic marker for finding high vein density variants in rice in future screens.
Sahu, Binod B; Shaw, Birendra P
2009-01-01
Background Despite wealth of information generated on salt tolerance mechanism, its basics still remain elusive. Thus, there is a need of continued effort to understand the salt tolerance mechanism using suitable biotechnological techniques and test plants (species) to enable development of salt tolerant cultivars of interest. Therefore, the present study was undertaken to generate information on salt stress responsive genes in a natural halophyte, Suaeda maritima, using PCR-based suppression subtractive hybridization (PCR-SSH) technique. Results Forward and reverse SSH cDNA libraries were constructed after exposing the young plants to 425 mM NaCl for 24 h. From the forward SSH cDNA library, 429 high quality ESTs were obtained. BLASTX search and TIGR assembler programme revealed overexpression of 167 unigenes comprising 89 singletons and 78 contigs with ESTs redundancy of 81.8%. Among the unigenes, 32.5% were found to be of special interest, indicating novel function of these genes with regard to salt tolerance. Literature search for the known unigenes revealed that only 17 of them were salt-inducible. A comparative analysis of the existing SSH cDNA libraries for NaCl stress in plants showed that only a few overexpressing unigenes were common in them. Moreover, the present study also showed increased expression of phosphoethanolamine N-methyltransferase gene, indicating the possible accumulation of a much studied osmoticum, glycinebetaine, in halophyte under salt stress. Functional categorization of the proteins as per the Munich database in general revealed that salt tolerance could be largely determined by the proteins involved in transcription, signal transduction, protein activity regulation and cell differentiation and organogenesis. Conclusion The study provided a clear indication of possible vital role of glycinebetaine in the salt tolerance process in S. maritima. However, the salt-induced expression of a large number of genes involved in a wide range of cellular functions was indicative of highly complex nature of the process as such. Most of the salt inducible genes, nonetheless, appeared to be species-specific. In light of the observations made, it is reasonable to emphasize that a comparative analysis of ESTs from SSH cDNA libraries generated systematically for a few halophytes with varying salt exposure time may clearly identify the key salt tolerance determinant genes to a minimum number, highly desirable for any genetic manipulation adventure. PMID:19497134