Complementary DNA libraries: an overview.
Ying, Shao-Yao
2004-07-01
The generation of complete and full-length cDNA libraries for potential functional assays of specific gene sequences is essential for most molecules in biotechnology and biomedical research. The field of cDNA library generation has changed rapidly in the past 10 yr. This review presents an overview of the method available for the basic information of generating cDNA libraries, including the definition of the cDNA library, different kinds of cDNA libraries, difference between methods for cDNA library generation using conventional approaches and a novel strategy, and the quality of cDNA libraries. It is anticipated that the high-quality cDNA libraries so generated would facilitate studies involving genechips and the microarray, differential display, subtractive hybridization, gene cloning, and peptide library generation.
Novel encoding methods for DNA-templated chemical libraries.
Li, Gang; Zheng, Wenlu; Liu, Ying; Li, Xiaoyu
2015-06-01
Among various types of DNA-encoded chemical libraries, DNA-templated library takes advantage of the sequence-specificity of DNA hybridization, enabling not only highly effective DNA-templated chemical reactions, but also high fidelity in library encoding. This brief review summarizes recent advances that have been made on the encoding strategies for DNA-templated libraries, and it also highlights their respective advantages and limitations for the preparation of DNA-encoded libraries. Copyright © 2015 Elsevier Ltd. All rights reserved.
Soares, Marcelo Bento; Bonaldo, Maria de Fatima
1998-01-01
This invention provides a method to normalize a cDNA library comprising: (a) constructing a directionally cloned library containing cDNA inserts wherein the insert is capable of being amplified by polymerase chain reaction; (b) converting a double-stranded cDNA library into single-stranded DNA circles; (c) generating single-stranded nucleic acid molecules complementary to the single-stranded DNA circles converted in step (b) by polymerase chain reaction with appropriate primers; (d) hybridizing the single-stranded DNA circles converted in step (b) with the complementary single-stranded nucleic acid molecules generated in step (c) to produce partial duplexes to an appropriate Cot; and (e) separating the unhybridized single-stranded DNA circles from the hybridized DNA circles, thereby generating a normalized cDNA library. This invention also provides a method to normalize a cDNA library wherein the generating of single-stranded nucleic acid molecules complementary to the single-stranded DNA circles converted in step (b) is by excising cDNA inserts from the double-stranded cDNA library; purifying the cDNA inserts from cloning vectors; and digesting the cDNA inserts with an exonuclease. This invention further provides a method to construct a subtractive cDNA library following the steps described above. This invention further provides normalized and/or subtractive cDNA libraries generated by the above methods.
Soares, M.B.; Fatima Bonaldo, M. de
1998-12-08
This invention provides a method to normalize a cDNA library comprising: (a) constructing a directionally cloned library containing cDNA inserts wherein the insert is capable of being amplified by polymerase chain reaction; (b) converting a double-stranded cDNA library into single-stranded DNA circles; (c) generating single-stranded nucleic acid molecules complementary to the single-stranded DNA circles converted in step (b) by polymerase chain reaction with appropriate primers; (d) hybridizing the single-stranded DNA circles converted in step (b) with the complementary single-stranded nucleic acid molecules generated in step (c) to produce partial duplexes to an appropriate Cot; and (e) separating the unhybridized single-stranded DNA circles from the hybridized DNA circles, thereby generating a normalized cDNA library. This invention also provides a method to normalize a cDNA library wherein the generating of single-stranded nucleic acid molecules complementary to the single-stranded DNA circles converted in step (b) is by excising cDNA inserts from the double-stranded cDNA library; purifying the cDNA inserts from cloning vectors; and digesting the cDNA inserts with an exonuclease. This invention further provides a method to construct a subtractive cDNA library following the steps described above. This invention further provides normalized and/or subtractive cDNA libraries generated by the above methods. 25 figs.
Library Construction from Subnanogram DNA for Pelagic Sea Water and Deep-Sea Sediments
Hirai, Miho; Nishi, Shinro; Tsuda, Miwako; Sunamura, Michinari; Takaki, Yoshihiro; Nunoura, Takuro
2017-01-01
Shotgun metagenomics is a low biased technology for assessing environmental microbial diversity and function. However, the requirement for a sufficient amount of DNA and the contamination of inhibitors in environmental DNA leads to difficulties in constructing a shotgun metagenomic library. We herein examined metagenomic library construction from subnanogram amounts of input environmental DNA from subarctic surface water and deep-sea sediments using two library construction kits: the KAPA Hyper Prep Kit and Nextera XT DNA Library Preparation Kit, with several modifications. The influence of chemical contaminants associated with these environmental DNA samples on library construction was also investigated. Overall, shotgun metagenomic libraries were constructed from 1 pg to 1 ng of input DNA using both kits without harsh library microbial contamination. However, the libraries constructed from 1 pg of input DNA exhibited larger biases in GC contents, k-mers, or small subunit (SSU) rRNA gene compositions than those constructed from 10 pg to 1 ng DNA. The lower limit of input DNA for low biased library construction in this study was 10 pg. Moreover, we revealed that technology-dependent biases (physical fragmentation and linker ligation vs. tagmentation) were larger than those due to the amount of input DNA. PMID:29187708
Lyons, Eli; Sheridan, Paul; Tremmel, Georg; Miyano, Satoru; Sugano, Sumio
2017-10-24
High-throughput screens allow for the identification of specific biomolecules with characteristics of interest. In barcoded screens, DNA barcodes are linked to target biomolecules in a manner allowing for the target molecules making up a library to be identified by sequencing the DNA barcodes using Next Generation Sequencing. To be useful in experimental settings, the DNA barcodes in a library must satisfy certain constraints related to GC content, homopolymer length, Hamming distance, and blacklisted subsequences. Here we report a novel framework to quickly generate large-scale libraries of DNA barcodes for use in high-throughput screens. We show that our framework dramatically reduces the computation time required to generate large-scale DNA barcode libraries, compared with a naїve approach to DNA barcode library generation. As a proof of concept, we demonstrate that our framework is able to generate a library consisting of one million DNA barcodes for use in a fragment antibody phage display screening experiment. We also report generating a general purpose one billion DNA barcode library, the largest such library yet reported in literature. Our results demonstrate the value of our novel large-scale DNA barcode library generation framework for use in high-throughput screening applications.
[cDNA library construction from panicle meristem of finger millet].
Radchuk, V; Pirko, Ia V; Isaenkov, S V; Emets, A I; Blium, Ia B
2014-01-01
The protocol for production of full-size cDNA using SuperScript Full-Length cDNA Library Construction Kit II (Invitrogen) was tested and high quality cDNA library from meristematic tissue of finger millet panicle (Eleusine coracana (L.) Gaertn) was created. The titer of obtained cDNA library comprised 3.01 x 10(5) CFU/ml in avarage. In average the length of cDNA insertion consisted about 1070 base pairs, the effectivity of cDNA fragment insertions--99.5%. The selective sequencing of cDNA clones from created library was performed. The sequences of cDNA clones were identified with usage of BLAST-search. The results of cDNA library analysis and selective sequencing represents prove good functionality and full length character of inserted cDNA clones. Obtained cDNA library from meristematic tissue of finger millet panicle represents good and valuable source for isolation and identification of key genes regulating metabolism and meristematic development and for mining of new molecular markers to conduct out high quality genetic investigations and molecular breeding as well.
Method for construction of normalized cDNA libraries
Soares, Marcelo B.; Efstratiadis, Argiris
1998-01-01
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to appropriate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. This invention also provides normalized cDNA libraries generated by the above-described method and uses of the generated libraries.
Method for construction of normalized cDNA libraries
Soares, M.B.; Efstratiadis, A.
1998-11-03
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3` noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to appropriate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. This invention also provides normalized cDNA libraries generated by the above-described method and uses of the generated libraries. 19 figs.
Constructing and detecting a cDNA library for mites.
Hu, Li; Zhao, YaE; Cheng, Juan; Yang, YuanJun; Li, Chen; Lu, ZhaoHui
2015-10-01
RNA extraction and construction of complementary DNA (cDNA) library for mites have been quite challenging due to difficulties in acquiring tiny living mites and breaking their hard chitin. The present study is to explore a better method to construct cDNA library for mites that will lay the foundation on transcriptome and molecular pathogenesis research. We selected Psoroptes cuniculi as an experimental subject and took the following steps to construct and verify cDNA library. First, we combined liquid nitrogen grinding with TRIzol for total RNA extraction. Then, switching mechanism at 5' end of the RNA transcript (SMART) technique was used to construct full-length cDNA library. To evaluate the quality of cDNA library, the library titer and recombination rate were calculated. The reliability of cDNA library was detected by sequencing and analyzing positive clones and genes amplified by specific primers. The results showed that the RNA concentration was 836 ng/μl and the absorbance ratio at 260/280 nm was 1.82. The library titer was 5.31 × 10(5) plaque-forming unit (PFU)/ml and the recombination rate was 98.21%, indicating that the library was of good quality. In the 33 expressed sequence tags (ESTs) of P. cuniculi, two clones of 1656 and 1658 bp were almost identical with only three variable sites detected, which had an identity of 99.63% with that of Psoroptes ovis, indicating that the cDNA library was reliable. Further detection by specific primers demonstrated that the 553-bp Pso c II gene sequences of P. cuniculi had an identity of 98.56% with those of P. ovis, confirming that the cDNA library was not only reliable but also feasible.
Theoretical modeling of masking DNA application in aptamer-facilitated biomarker discovery.
Cherney, Leonid T; Obrecht, Natalia M; Krylov, Sergey N
2013-04-16
In aptamer-facilitated biomarker discovery (AptaBiD), aptamers are selected from a library of random DNA (or RNA) sequences for their ability to specifically bind cell-surface biomarkers. The library is incubated with intact cells, and cell-bound DNA molecules are separated from those unbound and amplified by the polymerase chain reaction (PCR). The partitioning/amplification cycle is repeated multiple times while alternating target cells and control cells. Efficient aptamer selection in AptaBiD relies on the inclusion of masking DNA within the cell and library mixture. Masking DNA lacks primer regions for PCR amplification and is typically taken in excess to the library. The role of masking DNA within the selection mixture is to outcompete any nonspecific binding sequences within the initial library, thus allowing specific DNA sequences (i.e., aptamers) to be selected more efficiently. Efficient AptaBiD requires an optimum ratio of masking DNA to library DNA, at which aptamers still bind specific binding sites but nonaptamers within the library do not bind nonspecific binding sites. Here, we have developed a mathematical model that describes the binding processes taking place within the equilibrium mixture of masking DNA, library DNA, and target cells. An obtained mathematical solution allows one to estimate the concentration of masking DNA that is required to outcompete the library DNA at a desirable ratio of bound masking DNA to bound library DNA. The required concentration depends on concentrations of the library and cells as well as on unknown cell characteristics. These characteristics include the concentration of total binding sites on the cell surface, N, and equilibrium dissociation constants, K(nsL) and K(nsM), for nonspecific binding of the library DNA and masking DNA, respectively. We developed a theory that allows the determination of N, K(nsL), and K(nsM) based on measurements of EC50 values for cells mixed separately with the library and masking DNA (EC50 is the concentration of fluorescently labeled DNA at which half of the maximum fluorescence signal from DNA-bound cells is reached). We also obtained expressions for signals from bound DNA (measured by flow cytometry) in terms of N, K(nsL), and K(nsM). These expressions can be used for the verification of N, K(nsL), and K(nsM) values found from EC50 measurements. The developed procedure was applied to MCF-7 breast cancer cells, and corresponding values of N, K(nsL), and K(nsM) were established for the first time. The concentration of masking DNA required for AptaBiD with MCF-7 breast cancer cells was also estimated.
Carpenter, Meredith L.; Buenrostro, Jason D.; Valdiosera, Cristina; Schroeder, Hannes; Allentoft, Morten E.; Sikora, Martin; Rasmussen, Morten; Gravel, Simon; Guillén, Sonia; Nekhrizov, Georgi; Leshtakov, Krasimir; Dimitrova, Diana; Theodossiev, Nikola; Pettener, Davide; Luiselli, Donata; Sandoval, Karla; Moreno-Estrada, Andrés; Li, Yingrui; Wang, Jun; Gilbert, M. Thomas P.; Willerslev, Eske; Greenleaf, William J.; Bustamante, Carlos D.
2013-01-01
Most ancient specimens contain very low levels of endogenous DNA, precluding the shotgun sequencing of many interesting samples because of cost. Ancient DNA (aDNA) libraries often contain <1% endogenous DNA, with the majority of sequencing capacity taken up by environmental DNA. Here we present a capture-based method for enriching the endogenous component of aDNA sequencing libraries. By using biotinylated RNA baits transcribed from genomic DNA libraries, we are able to capture DNA fragments from across the human genome. We demonstrate this method on libraries created from four Iron Age and Bronze Age human teeth from Bulgaria, as well as bone samples from seven Peruvian mummies and a Bronze Age hair sample from Denmark. Prior to capture, shotgun sequencing of these libraries yielded an average of 1.2% of reads mapping to the human genome (including duplicates). After capture, this fraction increased substantially, with up to 59% of reads mapped to human and enrichment ranging from 6- to 159-fold. Furthermore, we maintained coverage of the majority of regions sequenced in the precapture library. Intersection with the 1000 Genomes Project reference panel yielded an average of 50,723 SNPs (range 3,062–147,243) for the postcapture libraries sequenced with 1 million reads, compared with 13,280 SNPs (range 217–73,266) for the precapture libraries, increasing resolution in population genetic analyses. Our whole-genome capture approach makes it less costly to sequence aDNA from specimens containing very low levels of endogenous DNA, enabling the analysis of larger numbers of samples. PMID:24568772
Design and Synthesis of Biaryl DNA-Encoded Libraries.
Ding, Yun; Franklin, G Joseph; DeLorey, Jennifer L; Centrella, Paolo A; Mataruse, Sibongile; Clark, Matthew A; Skinner, Steven R; Belyanskaya, Svetlana
2016-10-10
DNA-encoded library technology (ELT) is a powerful tool for the discovery of new small-molecule ligands to various protein targets. Here we report the design and synthesis of biaryl DNA-encoded libraries based on the scaffold of 5-formyl 3-iodobenzoic acid. Three reactions on DNA template, acylation, Suzuki-Miyaura coupling and reductive amination, were applied in the library synthesis. The three cycle library of 3.5 million diversity has delivered potent hits for phosphoinositide 3-kinase α (PI3Kα).
Soares, Marcelo B.; Efstratiadis, Argiris
1997-01-01
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library.
Soares, M.B.; Efstratiadis, A.
1997-06-10
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3{prime} noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. 4 figs.
Microsatellite DNA library for Caiman latirostris.
Zucoloto, Rodrigo Barban; Verdade, Luciano Martins; Coutinho, Luiz Lehmann
2002-12-15
New genetic markers were characterized for the broad-snouted caiman (Caiman latirostris) by constructing libraries enriched for microsatellite DNA. Construction and characterization of these libraries are described in the present study. One microsatellite marker was developed from a (ACC-TGG)(n)enriched microsatellite DNA library, and 12 microsatellite markers were developed from a (AC-TG)(n)enriched microsatellite DNA library. These markers were tested in wild-caught animals, and these tests resulted in ten new polymorphic microsatellites for C. latirostris. Copyright 2002 Wiley-Liss, Inc.
Second-generation DNA-templated macrocycle libraries for the discovery of bioactive small molecules.
Usanov, Dmitry L; Chan, Alix I; Maianti, Juan Pablo; Liu, David R
2018-07-01
DNA-encoded libraries have emerged as a widely used resource for the discovery of bioactive small molecules, and offer substantial advantages compared with conventional small-molecule libraries. Here, we have developed and streamlined multiple fundamental aspects of DNA-encoded and DNA-templated library synthesis methodology, including computational identification and experimental validation of a 20 × 20 × 20 × 80 set of orthogonal codons, chemical and computational tools for enhancing the structural diversity and drug-likeness of library members, a highly efficient polymerase-mediated template library assembly strategy, and library isolation and purification methods. We have integrated these improved methods to produce a second-generation DNA-templated library of 256,000 small-molecule macrocycles with improved drug-like physical properties. In vitro selection of this library for insulin-degrading enzyme affinity resulted in novel insulin-degrading enzyme inhibitors, including one of unusual potency and novel macrocycle stereochemistry (IC 50 = 40 nM). Collectively, these developments enable DNA-templated small-molecule libraries to serve as more powerful, accessible, streamlined and cost-effective tools for bioactive small-molecule discovery.
Efficient preparation of shuffled DNA libraries through recombination (Gateway) cloning.
Lehtonen, Soili I; Taskinen, Barbara; Ojala, Elina; Kukkurainen, Sampo; Rahikainen, Rolle; Riihimäki, Tiina A; Laitinen, Olli H; Kulomaa, Markku S; Hytönen, Vesa P
2015-01-01
Efficient and robust subcloning is essential for the construction of high-diversity DNA libraries in the field of directed evolution. We have developed a more efficient method for the subcloning of DNA-shuffled libraries by employing recombination cloning (Gateway). The Gateway cloning procedure was performed directly after the gene reassembly reaction, without additional purification and amplification steps, thus simplifying the conventional DNA shuffling protocols. Recombination-based cloning, directly from the heterologous reassembly reaction, conserved the high quality of the library and reduced the time required for the library construction. The described method is generally compatible for the construction of DNA-shuffled gene libraries. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Method for construction of normalized cDNA libraries
Soares, Marcelo B.; Efstratiadis, Argiris
1996-01-01
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library.
Method for construction of normalized cDNA libraries
Soares, M.B.; Efstratiadis, A.
1996-01-09
This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form. The method comprises: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3` noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. 4 figs.
Wilson, John-James; Sing, Kong-Wah; Sofian-Azirun, Mohd
2013-01-01
The objective of this study was to build a DNA barcode reference library for the true butterflies of Peninsula Malaysia and assess the value of attaching subspecies names to DNA barcode records. A new DNA barcode library was constructed with butterflies from the Museum of Zoology, University of Malaya collection. The library was analysed in conjunction with publicly available DNA barcodes from other Asia-Pacific localities to test the ability of the DNA barcodes to discriminate species and subspecies. Analyses confirmed the capacity of the new DNA barcode reference library to distinguish the vast majority of species (92%) and revealed that most subspecies possessed unique DNA barcodes (84%). In some cases conspecific subspecies exhibited genetic distances between their DNA barcodes that are typically seen between species, and these were often taxa that have previously been regarded as full species. Subspecies designations as shorthand for geographically and morphologically differentiated groups provide a useful heuristic for assessing how such groups correlate with clustering patterns of DNA barcodes, especially as the number of DNA barcodes per species in reference libraries increases. Our study demonstrates the value in attaching subspecies names to DNA barcode records as they can reveal a history of taxonomic concepts and expose important units of biodiversity.
Wilson, John-James; Sing, Kong-Wah; Sofian-Azirun, Mohd
2013-01-01
The objective of this study was to build a DNA barcode reference library for the true butterflies of Peninsula Malaysia and assess the value of attaching subspecies names to DNA barcode records. A new DNA barcode library was constructed with butterflies from the Museum of Zoology, University of Malaya collection. The library was analysed in conjunction with publicly available DNA barcodes from other Asia-Pacific localities to test the ability of the DNA barcodes to discriminate species and subspecies. Analyses confirmed the capacity of the new DNA barcode reference library to distinguish the vast majority of species (92%) and revealed that most subspecies possessed unique DNA barcodes (84%). In some cases conspecific subspecies exhibited genetic distances between their DNA barcodes that are typically seen between species, and these were often taxa that have previously been regarded as full species. Subspecies designations as shorthand for geographically and morphologically differentiated groups provide a useful heuristic for assessing how such groups correlate with clustering patterns of DNA barcodes, especially as the number of DNA barcodes per species in reference libraries increases. Our study demonstrates the value in attaching subspecies names to DNA barcode records as they can reveal a history of taxonomic concepts and expose important units of biodiversity. PMID:24282514
Xu, Chao; Dong, Wenpan; Shi, Shuo; Cheng, Tao; Li, Changhao; Liu, Yanlei; Wu, Ping; Wu, Hongkun; Gao, Peng; Zhou, Shiliang
2015-11-01
A well-covered reference library is crucial for successful identification of species by DNA barcoding. The biggest difficulty in building such a reference library is the lack of materials of organisms. Herbarium collections are potentially an enormous resource of materials. In this study, we demonstrate that it is likely to build such reference libraries using the reconstructed (self-primed PCR amplified) DNA from the herbarium specimens. We used 179 rosaceous specimens to test the effects of DNA reconstruction, 420 randomly sampled specimens to estimate the usable percentage and another 223 specimens of true cherries (Cerasus, Rosaceae) to test the coverage of usable specimens to the species. The barcode rbcLb (the central four-sevenths of rbcL gene) and matK was each amplified in two halves and sequenced on Roche GS 454 FLX+. DNA from the herbarium specimens was typically shorter than 300 bp. DNA reconstruction enabled amplification fragments of 400-500 bp without bringing or inducing any sequence errors. About one-third of specimens in the national herbarium of China (PE) were proven usable after DNA reconstruction. The specimens in PE cover all Chinese true cherry species and 91.5% of vascular species listed in Flora of China. It is very possible to build well-covered reference libraries for DNA barcoding of vascular species in China. As exemplified in this study, DNA reconstruction and DNA-labelled next-generation sequencing can accelerate the construction of local reference libraries. By putting the local reference libraries together, a global library for DNA barcoding becomes closer to reality. © 2015 John Wiley & Sons Ltd.
Lam, Kathy N; Charles, Trevor C
2015-01-01
Clone libraries provide researchers with a powerful resource to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, and allowed the mining of novel enzymes. Libraries are often constructed by cloning large inserts into cosmid or fosmid vectors. Recently, there have been reports of GC bias in fosmid metagenomic libraries, and it was speculated to be a result of fragmentation and loss of AT-rich sequences during cloning. However, evidence in the literature suggests that transcriptional activity or gene product toxicity may play a role. To explore possible mechanisms responsible for sequence bias in clone libraries, we constructed a cosmid library from a human microbiome sample and sequenced DNA from different steps during library construction: crude extract DNA, size-selected DNA, and cosmid library DNA. We confirmed a GC bias in the final cosmid library, and we provide evidence that the bias is not due to fragmentation and loss of AT-rich sequences but is likely occurring after DNA is introduced into Escherichia coli. To investigate the influence of strong constitutive transcription, we searched the sequence data for promoters and found that rpoD/σ(70) promoter sequences were underrepresented in the cosmid library. Furthermore, when we examined the genomes of taxa that were differentially abundant in the cosmid library relative to the original sample, we found the bias to be more correlated with the number of rpoD/σ(70) consensus sequences in the genome than with simple GC content. The GC bias of metagenomic libraries does not appear to be due to DNA fragmentation. Rather, analysis of promoter sequences provides support for the hypothesis that strong constitutive transcription from sequences recognized as rpoD/σ(70) consensus-like in E. coli may lead to instability, causing loss of the plasmid or loss of the insert DNA that gives rise to the transcription. Despite widespread use of E. coli to propagate foreign DNA in metagenomic libraries, the effects of in vivo transcriptional activity on clone stability are not well understood. Further work is required to tease apart the effects of transcription from those of gene product toxicity.
Application of Biocatalysis to on-DNA Carbohydrate Library Synthesis.
Thomas, Baptiste; Lu, Xiaojie; Birmingham, William R; Huang, Kun; Both, Peter; Reyes Martinez, Juana Elizabeth; Young, Robert J; Davie, Christopher P; Flitsch, Sabine L
2017-05-04
DNA-encoded libraries are increasingly used for the discovery of bioactive lead compounds in high-throughput screening programs against specific biological targets. Although a number of libraries are now available, they cover limited chemical space due to bias in ease of synthesis and the lack of chemical reactions that are compatible with DNA tagging. For example, compound libraries rarely contain complex biomolecules such as carbohydrates with high levels of functionality, stereochemistry, and hydrophilicity. By using biocatalysis in combination with chemical methods, we aimed to significantly expand chemical space and generate generic libraries with potentially better biocompatibility. For DNA-encoded libraries, biocatalysis is particularly advantageous, as it is highly selective and can be performed in aqueous environments, which is an essential feature for this split-and-mix library technology. In this work, we demonstrated the application of biocatalysis for the on-DNA synthesis of carbohydrate-based libraries by using enzymatic oxidation and glycosylation in combination with traditional organic chemistry. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Zhao, Wei; Li, Xin; Liu, Wen-Hui; Zhao, Jian; Jin, Yi-Ming; Sui, Ting-Ting
2014-09-01
Human epithelial colorectal adenocarcinoma (Caco-2) cells are widely used as an in vitro model of the human small intestinal mucosa. Caco-2 cells are host cells of the human astrovirus (HAstV) and other enteroviruses. High quality cDNA libraries are pertinent resources and critical tools for protein-protein interaction research, but are currently unavailable for Caco-2 cells. To construct a three-open reading frame, full length-expression cDNA library from the Caco-2 cell line for application to HAstV protein-protein interaction screening, total RNA was extracted from Caco-2 cells. The switching mechanism at the 5' end of the RNA transcript technique was used for cDNA synthesis. Double-stranded cDNA was digested by Sfi I and ligated to reconstruct a pGADT7-Sfi I three-frame vector. The ligation mixture was transformed into Escherichia coli HST08 premium electro cells by electroporation to construct the primary cDNA library. The library capacity was 1.0×10(6)clones. Gel electrophoresis results indicated that the fragments ranged from 0.5kb to 4.2kb. Randomly picked clones show that the recombination rate was 100%. The three-frame primary cDNA library plasmid mixture (5×10(5)cfu) was also transformed into E. coli HST08 premium electro cells, and all clones were harvested to amplify the cDNA library. To detect the sufficiency of the cDNA library, HAstV capsid protein as bait was screened and tested against the Caco-2 cDNA library by a yeast two-hybrid (Y2H) system. A total of 20 proteins were found to interact with the capsid protein. These results showed that a high-quality three-frame cDNA library from Caco-2 cells was successfully constructed. This library was efficient for the application to the Y2H system, and could be used for future research. Copyright © 2014 Elsevier B.V. All rights reserved.
Rise, Matthew L.; von Schalburg, Kristian R.; Brown, Gordon D.; Mawer, Melanie A.; Devlin, Robert H.; Kuipers, Nathanael; Busby, Maura; Beetz-Sargent, Marianne; Alberto, Roberto; Gibbs, A. Ross; Hunt, Peter; Shukin, Robert; Zeznik, Jeffrey A.; Nelson, Colleen; Jones, Simon R.M.; Smailus, Duane E.; Jones, Steven J.M.; Schein, Jacqueline E.; Marra, Marco A.; Butterfield, Yaron S.N.; Stott, Jeff M.; Ng, Siemon H.S.; Davidson, William S.; Koop, Ben F.
2004-01-01
We report 80,388 ESTs from 23 Atlantic salmon (Salmo salar) cDNA libraries (61,819 ESTs), 6 rainbow trout (Oncorhynchus mykiss) cDNA libraries (14,544 ESTs), 2 chinook salmon (Oncorhynchus tshawytscha) cDNA libraries (1317 ESTs), 2 sockeye salmon (Oncorhynchus nerka) cDNA libraries (1243 ESTs), and 2 lake whitefish (Coregonus clupeaformis) cDNA libraries (1465 ESTs). The majority of these are 3′ sequences, allowing discrimination between paralogs arising from a recent genome duplication in the salmonid lineage. Sequence assembly reveals 28,710 different S. salar, 8981 O. mykiss, 1085 O. tshawytscha, 520 O. nerka, and 1176 C. clupeaformis putative transcripts. We annotate the submitted portion of our EST database by molecular function. Higher- and lower-molecular-weight fractions of libraries are shown to contain distinct gene sets, and higher rates of gene discovery are associated with higher-molecular weight libraries. Pyloric caecum library group annotations indicate this organ may function in redox control and as a barrier against systemic uptake of xenobiotics. A microarray is described, containing 7356 salmonid elements representing 3557 different cDNAs. Analyses of cross-species hybridizations to this cDNA microarray indicate that this resource may be used for studies involving all salmonids. PMID:14962987
cDNA library construction of two human Demodexspecies.
Niu, DongLing; Wang, RuiLing; Zhao, YaE; Yang, Rui; Hu, Li; Lei, YuYang; Dan, WeiChao
2017-06-01
The research of Demodex, a type of pathogen causing various dermatoses in animals and human beings, is lacking at RNA level. This study aims at extracting RNA and constructing cDNA library for Demodex. First, P. cuniculiand D. farinaewere mixed to establish homogenization method for RNA extraction. Second, D. folliculorumand D. breviswere collected and preserved in Trizol, which were mixed with D. farinaerespectively to extract RNA. Finally, cDNA library was constructed and its quality was assessed. The results indicated that for D. folliculorum& D. farinae, the recombination rate of cDNA library was 90.67% and the library titer was 7.50 × 104 pfu/ml. 17 of the 59 positive clones were predicted to be of D. folliculorum; For D. brevis& D. farinae, the recombination rate was 90.96% and the library titer was 7.85 x104 pfu/ml. 40 of the 59 positive clones were predicted to be of D. brevis. Further detection by specific primers demonstrated that mtDNA cox1, cox3and ATP6 detected from cDNA libraries had 96.52%-99.73% identities with the corresponding sequences in GenBank. In conclusion, the cDNA libraries constructed for Demodexmixed with D. farinaewere successful and could satisfy the requirements for functional genes detection.
Subtraction of cap-trapped full-length cDNA libraries to select rare transcripts.
Hirozane-Kishikawa, Tomoko; Shiraki, Toshiyuki; Waki, Kazunori; Nakamura, Mari; Arakawa, Takahiro; Kawai, Jun; Fagiolini, Michela; Hensch, Takao K; Hayashizaki, Yoshihide; Carninci, Piero
2003-09-01
The normalization and subtraction of highly expressed cDNAs from relatively large tissues before cloning dramatically enhanced the gene discovery by sequencing for the mouse full-length cDNA encyclopedia, but these methods have not been suitable for limited RNA materials. To normalize and subtract full-length cDNA libraries derived from limited quantities of total RNA, here we report a method to subtract plasmid libraries excised from size-unbiased amplified lambda phage cDNA libraries that avoids heavily biasing steps such as PCR and plasmid library amplification. The proportion of full-length cDNAs and the gene discovery rate are high, and library diversity can be validated by in silico randomization.
DNA-encoded chemical libraries: advancing beyond conventional small-molecule libraries.
Franzini, Raphael M; Neri, Dario; Scheuermann, Jörg
2014-04-15
DNA-encoded chemical libraries (DECLs) represent a promising tool in drug discovery. DECL technology allows the synthesis and screening of chemical libraries of unprecedented size at moderate costs. In analogy to phage-display technology, where large antibody libraries are displayed on the surface of filamentous phage and are genetically encoded in the phage genome, DECLs feature the display of individual small organic chemical moieties on DNA fragments serving as amplifiable identification barcodes. The DNA-tag facilitates the synthesis and allows the simultaneous screening of very large sets of compounds (up to billions of molecules), because the hit compounds can easily be identified and quantified by PCR-amplification of the DNA-barcode followed by high-throughput DNA sequencing. Several approaches have been used to generate DECLs, differing both in the methods used for library encoding and for the combinatorial assembly of chemical moieties. For example, DECLs can be used for fragment-based drug discovery, displaying a single molecule on DNA or two chemical moieties at the extremities of complementary DNA strands. DECLs can vary substantially in the chemical structures and the library size. While ultralarge libraries containing billions of compounds have been reported containing four or more sets of building blocks, also smaller libraries have been shown to be efficient for ligand discovery. In general, it has been found that the overall library size is a poor predictor for library performance and that the number and diversity of the building blocks are rather important indicators. Smaller libraries consisting of two to three sets of building blocks better fulfill the criteria of drug-likeness and often have higher quality. In this Account, we present advances in the DECL field from proof-of-principle studies to practical applications for drug discovery, both in industry and in academia. DECL technology can yield specific binders to a variety of target proteins and is likely to become a standard tool for pharmaceutical hit discovery, lead expansion, and Chemical Biology research. The introduction of new methodologies for library encoding and for compound synthesis in the presence of DNA is an exciting research field and will crucially contribute to the performance and the propagation of the technology.
Recent phylogenetic studies have used DNA as the target molecule for the development of environmental 16S rDNA clone libraries. As DNA may persist in the environment, DNA-based libraries cannot be used to identify metabolically active bacteria in water systems. In this study, a...
Molecular architecture of classical cytological landmarks: Centromeres and telomeres
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meyne, J.
1994-11-01
Both the human telomere repeat and the pericentromeric repeat sequence (GGAAT)n were isolated based on evolutionary conservation. Their isolation was based on the premise that chromosomal features as structurally and functionally important as telomeres and centromeres should be highly conserved. Both sequences were isolated by high stringency screening of a human repetitive DNA library with rodent repetitive DNA. The pHuR library (plasmid Human Repeat) used for this project was enriched for repetitive DNA by using a modification of the standard DNA library preparation method. Usually DNA for a library is cut with restriction enzymes, packaged, infected, and the library ismore » screened. A problem with this approach is that many tandem repeats don`t have any (or many) common restriction sites. Therefore, many of the repeat sequences will not be represented in the library because they are not restricted to a viable length for the vector used. To prepare the pHuR library, human DNA was mechanically sheared to a small size. These relatively short DNA fragments were denatured and then renatured to C{sub o}t 50. Theoretically only repetitive DNA sequences should renature under C{sub o}t 50 conditions. The single-stranded regions were digested using S1 nuclease, leaving the double-stranded, renatured repeat sequences.« less
Seashols-Williams, Sarah; Green, Raquel; Wohlfahrt, Denise; Brand, Angela; Tan-Torres, Antonio Limjuco; Nogales, Francy; Brooks, J Paul; Singh, Baneshwar
2018-05-17
Sequencing and classification of microbial taxa within forensically relevant biological fluids has the potential for applications in the forensic science and biomedical fields. The quantity of bacterial DNA from human samples is currently estimated based on quantity of total DNA isolated. This method can miscalculate bacterial DNA quantity due to the mixed nature of the sample, and consequently library preparation is often unreliable. We developed an assay that can accurately and specifically quantify bacterial DNA within a mixed sample for reliable 16S ribosomal DNA (16S rDNA) library preparation and high throughput sequencing (HTS). A qPCR method was optimized using universal 16S rDNA primers, and a commercially available bacterial community DNA standard was used to develop a precise standard curve. Following qPCR optimization, 16S rDNA libraries from saliva, vaginal and menstrual secretions, urine, and fecal matter were amplified and evaluated at various DNA concentrations; successful HTS data were generated with as low as 20 pg of bacterial DNA. Changes in bacterial DNA quantity did not impact observed relative abundances of major bacterial taxa, but relative abundance changes of minor taxa were observed. Accurate quantification of microbial DNA resulted in consistent, successful library preparations for HTS analysis. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Polymenakou, Paraskevi N; Bertilsson, Stefan; Tselepides, Anastasios; Stephanou, Euripides G
2005-10-01
The regional variability of sediment bacterial community composition and diversity was studied by comparative analysis of four large 16S ribosomal DNA (rDNA) clone libraries from sediments in different regions of the Eastern Mediterranean Sea (Thermaikos Gulf, Cretan Sea, and South lonian Sea). Amplified rDNA restriction analysis of 664 clones from the libraries indicate that the rDNA richness and evenness was high: for example, a near-1:1 relationship among screened clones and number of unique restriction patterns when up to 190 clones were screened for each library. Phylogenetic analysis of 207 bacterial 16S rDNA sequences from the sediment libraries demonstrated that Gamma-, Delta-, and Alphaproteobacteria, Holophaga/Acidobacteria, Planctomycetales, Actinobacteria, Bacteroidetes, and Verrucomicrobia were represented in all four libraries. A few clones also grouped with the Betaproteobacteria, Nitrospirae, Spirochaetales, Chlamydiae, Firmicutes, and candidate division OPl 1. The abundance of sequences affiliated with Gammaproteobacteria was higher in libraries from shallow sediments in the Thermaikos Gulf (30 m) and the Cretan Sea (100 m) compared to the deeper South Ionian station (2790 m). Most sequences in the four sediment libraries clustered with uncultured 16S rDNA phylotypes from marine habitats, and many of the closest matches were clones from hydrocarbon seeps, benzene-mineralizing consortia, sulfate reducers, sulk oxidizers, and ammonia oxidizers. LIBSHUFF statistics of 16S rDNA gene sequences from the four libraries revealed major differences, indicating either a very high richness in the sediment bacterial communities or considerable variability in bacterial community composition among regions, or both.
Xu, De-Quan; Zhang, Yi-Bing; Xiong, Yuan-Zhu; Gui, Jian-Fang; Jiang, Si-Wen; Su, Yu-Hong
2003-07-01
Using suppression subtractive hybridization (SSH) technique, forward and reverse subtracted cDNA libraries were constructed between Longissimus muscles from Meishan and Landrace pigs. A housekeeping gene, G3PDH, was used to estimate the efficiency of subtractive cDNA. In two cDNA libraries, G3PDH was subtracted very efficiently at appropriate 2(10) and 2(5) folds, respectively, indicating that some differentially expressed genes were also enriched at the same folds and the two subtractive cDNA libraries were very successful. A total of 709 and 673 positive clones were isolated from forward and reverse subtracted cDNA libraries, respectively. Analysis of PCR showed that most of all plasmids in the clones contained 150-750 bp inserts. The construction of subtractive cDNA libraries between muscle tissue from different pig breeds laid solid foundations for isolating and identifying the genes determining muscle growth and meat quality, which will be important to understand the mechanism of muscle growth, determination of meat quality and practice of molecular breeding.
Evaluation of vector-primed cDNA library production from microgram quantities of total RNA.
Kuo, Jonathan; Inman, Jason; Brownstein, Michael; Usdin, Ted B
2004-12-15
cDNA sequences are important for defining the coding region of genes, and full-length cDNA clones have proven to be useful for investigation of the function of gene products. We produced cDNA libraries containing 3.5-5 x 10(5) primary transformants, starting with 5 mug of total RNA prepared from mouse pituitary, adrenal, thymus, and pineal tissue, using a vector-primed cDNA synthesis method. Of approximately 1000 clones sequenced, approximately 20% contained the full open reading frames (ORFs) of known transcripts, based on the presence of the initiating methionine residue codon. The libraries were complex, with 94, 91, 83 and 55% of the clones from the thymus, adrenal, pineal and pituitary libraries, respectively, represented only once. Twenty-five full-length clones, not yet represented in the Mammalian Gene Collection, were identified. Thus, we have produced useful cDNA libraries for the isolation of full-length cDNA clones that are not yet available in the public domain, and demonstrated the utility of a simple method for making high-quality libraries from small amounts of starting material.
Construction of BAC Libraries from Flow-Sorted Chromosomes.
Šafář, Jan; Šimková, Hana; Doležel, Jaroslav
2016-01-01
Cloned DNA libraries in bacterial artificial chromosome (BAC) are the most widely used form of large-insert DNA libraries. BAC libraries are typically represented by ordered clones derived from genomic DNA of a particular organism. In the case of large eukaryotic genomes, whole-genome libraries consist of a hundred thousand to a million clones, which make their handling and screening a daunting task. The labor and cost of working with whole-genome libraries can be greatly reduced by constructing a library derived from a smaller part of the genome. Here we describe construction of BAC libraries from mitotic chromosomes purified by flow cytometric sorting. Chromosome-specific BAC libraries facilitate positional gene cloning, physical mapping, and sequencing in complex plant genomes.
Aigrain, Louise; Gu, Yong; Quail, Michael A
2016-06-13
The emergence of next-generation sequencing (NGS) technologies in the past decade has allowed the democratization of DNA sequencing both in terms of price per sequenced bases and ease to produce DNA libraries. When it comes to preparing DNA sequencing libraries for Illumina, the current market leader, a plethora of kits are available and it can be difficult for the users to determine which kit is the most appropriate and efficient for their applications; the main concerns being not only cost but also minimal bias, yield and time efficiency. We compared 9 commercially available library preparation kits in a systematic manner using the same DNA sample by probing the amount of DNA remaining after each protocol steps using a new droplet digital PCR (ddPCR) assay. This method allows the precise quantification of fragments bearing either adaptors or P5/P7 sequences on both ends just after ligation or PCR enrichment. We also investigated the potential influence of DNA input and DNA fragment size on the final library preparation efficiency. The overall library preparations efficiencies of the libraries show important variations between the different kits with the ones combining several steps into a single one exhibiting some final yields 4 to 7 times higher than the other kits. Detailed ddPCR data also reveal that the adaptor ligation yield itself varies by more than a factor of 10 between kits, certain ligation efficiencies being so low that it could impair the original library complexity and impoverish the sequencing results. When a PCR enrichment step is necessary, lower adaptor-ligated DNA inputs leads to greater amplification yields, hiding the latent disparity between kits. We describe a ddPCR assay that allows us to probe the efficiency of the most critical step in the library preparation, ligation, and to draw conclusion on which kits is more likely to preserve the sample heterogeneity and reduce the need of amplification.
Manlig, Erika; Wahlberg, Per
2017-01-01
Abstract Sodium bisulphite treatment of DNA combined with next generation sequencing (NGS) is a powerful combination for the interrogation of genome-wide DNA methylation profiles. Library preparation for whole genome bisulphite sequencing (WGBS) is challenging due to side effects of the bisulphite treatment, which leads to extensive DNA damage. Recently, a new generation of methods for bisulphite sequencing library preparation have been devised. They are based on initial bisulphite treatment of the DNA, followed by adaptor tagging of single stranded DNA fragments, and enable WGBS using low quantities of input DNA. In this study, we present a novel approach for quick and cost effective WGBS library preparation that is based on splinted adaptor tagging (SPLAT) of bisulphite-converted single-stranded DNA. Moreover, we validate SPLAT against three commercially available WGBS library preparation techniques, two of which are based on bisulphite treatment prior to adaptor tagging and one is a conventional WGBS method. PMID:27899585
A Fast Solution to NGS Library Prep with Low Nanogram DNA Input
Liu, Pingfang; Lohman, Gregory J.S.; Cantor, Eric; Langhorst, Bradley W.; Yigit, Erbay; Apone, Lynne M.; Munafo, Daniela B.; Stewart, Fiona J.; Evans, Thomas C.; Nichols, Nicole; Dimalanta, Eileen T.; Davis, Theodore B.; Sumner, Christine
2013-01-01
Next Generation Sequencing (NGS) has significantly impacted human genetics, enabling a comprehensive characterization of the human genome as well as a better understanding of many genomic abnormalities. By delivering massive DNA sequences at unprecedented speed and cost, NGS promises to make personalized medicine a reality in the foreseeable future. To date, library construction with clinical samples has been a challenge, primarily due to the limited quantities of sample DNA available. Our objective here was to overcome this challenge by developing NEBNext® Ultra DNA Library Prep Kit, a fast library preparation method. Specifically, we streamlined the workflow utilizing novel NEBNext reagents and adaptors, including a new DNA polymerase that has been optimized to minimize GC bias. As a result of this work, we have developed a simple method for library construction from an amount of DNA as low as 5 ng, which can be used for both intact and fragmented DNA. Moreover, the workflow is compatible with multiple NGS platforms.
3G vector-primer plasmid for constructing full-length-enriched cDNA libraries.
Zheng, Dong; Zhou, Yanna; Zhang, Zidong; Li, Zaiyu; Liu, Xuedong
2008-09-01
We designed a 3G vector-primer plasmid for the generation of full-length-enriched complementary DNA (cDNA) libraries. By employing the terminal transferase activity of reverse transcriptase and the modified strand replacement method, this plasmid (assembled with a polydT end and a deoxyguanosine [dG] end) combines priming full-length cDNA strand synthesis and directional cDNA cloning. As a result, the number of steps involved in cDNA library preparation is decreased while simplifying downstream gene manipulation, sequencing, and subcloning. The 3G vector-primer plasmid method yields fully represented plasmid primed libraries that are equivalent to those made by the SMART (switching mechanism at 5' end of RNA transcript) approach.
[Primary culture of cat intestinal epithelial cell and construction of its cDNA library].
Ye, L; Gui-Hua, Z; Kun, Y; Hong-Fa, W; Ting, X; Gong-Zhen, L; Wei-Xia, Z; Yong, C
2017-04-12
Objective To establish the primary cat intestinal epithelial cells (IECs) culture methods and construct the cDNA library for the following yeast two-hybrid experiment, so as to screen the virulence interaction factors among the final host. Methods The primary cat IECs were cultured by the tissue cultivation and combined digestion with collagenase XI and dispase I separately. Then the cat IECs cultured was identified with the morphological observation and cyto-keratin detection, by using goat anti-cyto-keratin monoclonal antibodies. The mRNA of cat IECs was isolated and used as the template to synthesize the first strand cDNA by SMART™ technology, and then the double-strand cDNAs were acquired by LD-PCR, which were subsequently cloned into the plasmid PGADT7-Rec to construct yeast two-hybrid cDNA library in the yeast strain Y187 by homologous recombination. Matchmaker™ Insert Check PCR was used to detect the size distribution of cDNA fragments after the capacity calculation of the cDNA library. Results The comparison of the two cultivation methods indicated that the combined digestion of collagenase XI and dispase I was more effective than the tissue cultivation. The cat IECs system of continuous culture was established and the cat IECs with high purity were harvested for constructing the yeast two-hybrid cDNA library. The library contained 1.1×10 6 independent clones. The titer was 2.8×10 9 cfu/ml. The size of inserted fragments was among 0.5-2.0 kb. Conclusion The yeast two-hybrid cDNA library of cat IECs meets the requirements of further screen research, and this study lays the foundation of screening the Toxoplasma gondii virulence interaction factors among the cDNA libraries of its final hosts.
[Construction and characterization of a cDNA library from human liver tissue of cirrhosis].
Chen, Xiao-hong; Chen, Zhi; Chen, Feng; Zhu, Hai-hong; Zhou, Hong-juan; Yao, Hang-ping
2005-03-01
To construct a cDNA library from human liver tissue of cirrhosis. The total RNA from human liver tissue of cirrhosis was extracted using Trizol method, and the mRNA was purified using mRNA purification kit. SMART technique and CDSIII/3' primer were used for first-strand cDNA synthesis. Long distance PCR was then used to synthesize the double-strand cDNA that was then digested by proteinase K and Sfi I, and was fractionated by CHOMA SPIN-400 column. The cDNA fragments longer than 0.4 kb were collected and ligated to lambdaTripl Ex2 vector. Then lambda-phage packaging reaction and library amplification were performed. The qualities of both unamplified and amplified cDNA libraries was strictly checked by conventional titer determination. Eleven plaques were randomly picked and tested using PCR with universal primers derived from the sequence flanking the vector. The titers of unamplifed and amplified libraries were 1.03 x 10(6) pfu/ml and 1.36 x 10(9) pfu/ml respectively. The percentages of recombinants from both libraries were 97.24 % in unamplified library and 99.02 % in amplified library. The lengths of the inserts were 1.02 kb in average (36.36 % 1 approximately equals 2 kb and 63.64 % 0.5 approximately equals 1.0 kb). A high quality cDNA library from human liver tissue of cirrhosis was constructed successfully, which can be used for screening and cloning new special genes associated with the occurrence of cirrhosis.
Kunig, Verena; Potowski, Marco; Gohla, Anne; Brunschweiger, Andreas
2018-06-27
DNA-encoded compound libraries are a highly attractive technology for the discovery of small molecule protein ligands. These compound collections consist of small molecules covalently connected to individual DNA sequences carrying readable information about the compound structure. DNA-tagging allows for efficient synthesis, handling and interrogation of vast numbers of chemically synthesized, drug-like compounds. They are screened on proteins by an efficient, generic assay based on Darwinian principles of selection. To date, selection of DNA-encoded libraries allowed for the identification of numerous bioactive compounds. Some of these compounds uncovered hitherto unknown allosteric binding sites on target proteins; several compounds proved their value as chemical biology probes unraveling complex biology; and the first examples of clinical candidates that trace their ancestry to a DNA-encoded library were reported. Thus, DNA-encoded libraries proved their value for the biomedical sciences as a generic technology for the identification of bioactive drug-like molecules numerous times. However, large scale experiments showed that even the selection of billions of compounds failed to deliver bioactive compounds for the majority of proteins in an unbiased panel of target proteins. This raises the question of compound library design.
A Glimpse into the Satellite DNA Library in Characidae Fish (Teleostei, Characiformes)
Utsunomia, Ricardo; Ruiz-Ruano, Francisco J.; Silva, Duílio M. Z. A.; Serrano, Érica A.; Rosa, Ivana F.; Scudeler, Patrícia E. S.; Hashimoto, Diogo T.; Oliveira, Claudio; Camacho, Juan Pedro M.; Foresti, Fausto
2017-01-01
Satellite DNA (satDNA) is an abundant fraction of repetitive DNA in eukaryotic genomes and plays an important role in genome organization and evolution. In general, satDNA sequences follow a concerted evolutionary pattern through the intragenomic homogenization of different repeat units. In addition, the satDNA library hypothesis predicts that related species share a series of satDNA variants descended from a common ancestor species, with differential amplification of different satDNA variants. The finding of a same satDNA family in species belonging to different genera within Characidae fish provided the opportunity to test both concerted evolution and library hypotheses. For this purpose, we analyzed here sequence variation and abundance of this satDNA family in ten species, by a combination of next generation sequencing (NGS), PCR and Sanger sequencing, and fluorescence in situ hybridization (FISH). We found extensive between-species variation for the number and size of pericentromeric FISH signals. At genomic level, the analysis of 1000s of DNA sequences obtained by Illumina sequencing and PCR amplification allowed defining 150 haplotypes which were linked in a common minimum spanning tree, where different patterns of concerted evolution were apparent. This also provided a glimpse into the satDNA library of this group of species. In consistency with the library hypothesis, different variants for this satDNA showed high differences in abundance between species, from highly abundant to simply relictual variants. PMID:28855916
Friis, Thor Einar; Stephenson, Sally; Xiao, Yin; Whitehead, Jon
2014-01-01
The sheep (Ovis aries) is favored by many musculoskeletal tissue engineering groups as a large animal model because of its docile temperament and ease of husbandry. The size and weight of sheep are comparable to humans, which allows for the use of implants and fixation devices used in human clinical practice. The construction of a complimentary DNA (cDNA) library can capture the expression of genes in both a tissue- and time-specific manner. cDNA libraries have been a consistent source of gene discovery ever since the technology became commonplace more than three decades ago. Here, we describe the construction of a cDNA library using cells derived from sheep bones based on the pBluescript cDNA kit. Thirty clones were picked at random and sequenced. This led to the identification of a novel gene, C12orf29, which our initial experiments indicate is involved in skeletal biology. We also describe a polymerase chain reaction-based cDNA clone isolation method that allows the isolation of genes of interest from a cDNA library pool. The techniques outlined here can be applied in-house by smaller tissue engineering groups to generate tools for biomolecular research for large preclinical animal studies and highlights the power of standard cDNA library protocols to uncover novel genes. PMID:24447069
Novel selection methods for DNA-encoded chemical libraries
Chan, Alix I.; McGregor, Lynn M.; Liu, David R.
2015-01-01
Driven by the need for new compounds to serve as biological probes and leads for therapeutic development and the growing accessibility of DNA technologies including high-throughput sequencing, many academic and industrial groups have begun to use DNA-encoded chemical libraries as a source of bioactive small molecules. In this review, we describe the technologies that have enabled the selection of compounds with desired activities from these libraries. These methods exploit the sensitivity of in vitro selection coupled with DNA amplification to overcome some of the limitations and costs associated with conventional screening methods. In addition, we highlight newer techniques with the potential to be applied to the high-throughput evaluation of DNA-encoded chemical libraries. PMID:25723146
Procedure for normalization of cDNA libraries
Bonaldo, Maria DeFatima; Soares, Marcelo Bento
1997-01-01
This invention provides a method to normalize a cDNA library constructed in a vector capable of being converted to single-stranded circles and capable of producing complementary nucleic acid molecules to the single-stranded circles comprising: (a) converting the cDNA library in single-stranded circles; (b) generating complementary nucleic acid molecules to the single-stranded circles; (c) hybridizing the single-stranded circles converted in step (a) with complementary nucleic acid molecules of step (b) to produce partial duplexes to an appropriate Cot; (e) separating the unhybridized single-stranded circles from the hybridized single-stranded circles, thereby generating a normalized cDNA library.
The bacterial composition of chlorinated drinking water was analyzed using 16S rRNA gene clone libraries derived from DNA extracts of 12 samples and compared to clone libraries previously generated using RNA extracts from the same samples. Phylogenetic analysis of 761 DNA-based ...
Pauthenier, Cyrille; Faulon, Jean-Loup
2014-07-01
PrecisePrimer is a web-based primer design software made to assist experimentalists in any repetitive primer design task such as preparing, cloning and shuffling DNA libraries. Unlike other popular primer design tools, it is conceived to generate primer libraries with popular PCR polymerase buffers proposed as pre-set options. PrecisePrimer is also meant to design primers in batches, such as for DNA libraries creation of DNA shuffling experiments and to have the simplest interface possible. It integrates the most up-to-date melting temperature algorithms validated with experimental data, and cross validated with other computational tools. We generated a library of primers for the extraction and cloning of 61 genes from yeast DNA genomic extract using default parameters. All primer pairs efficiently amplified their target without any optimization of the PCR conditions. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Development and Synthesis of DNA-Encoded Benzimidazole Library.
Ding, Yun; Chai, Jing; Centrella, Paolo A; Gondo, Chenaimwoyo; DeLorey, Jennifer L; Clark, Matthew A
2018-04-25
Encoded library technology (ELT) is an effective approach to the discovery of novel small-molecule ligands for biological targets. A key factor for the success of the technology is the chemical diversity of the libraries. Here we report the development of DNA-conjugated benzimidazoles. Using 4-fluoro-3-nitrobenzoic acid as a key synthon, we synthesized a 320 million-member DNA-encoded benzimidazole library using Fmoc-protected amino acids, amines and aldehydes as diversity elements. Affinity selection of the library led to the discovery of a novel, potent and specific antagonist of the NK3 receptor.
Caruccio, Nicholas
2011-01-01
DNA library preparation is a common entry point and bottleneck for next-generation sequencing. Current methods generally consist of distinct steps that often involve significant sample loss and hands-on time: DNA fragmentation, end-polishing, and adaptor-ligation. In vitro transposition with Nextera™ Transposomes simultaneously fragments and covalently tags the target DNA, thereby combining these three distinct steps into a single reaction. Platform-specific sequencing adaptors can be added, and the sample can be enriched and bar-coded using limited-cycle PCR to prepare di-tagged DNA fragment libraries. Nextera technology offers a streamlined, efficient, and high-throughput method for generating bar-coded libraries compatible with multiple next-generation sequencing platforms.
Novel selection methods for DNA-encoded chemical libraries.
Chan, Alix I; McGregor, Lynn M; Liu, David R
2015-06-01
Driven by the need for new compounds to serve as biological probes and leads for therapeutic development and the growing accessibility of DNA technologies including high-throughput sequencing, many academic and industrial groups have begun to use DNA-encoded chemical libraries as a source of bioactive small molecules. In this review, we describe the technologies that have enabled the selection of compounds with desired activities from these libraries. These methods exploit the sensitivity of in vitro selection coupled with DNA amplification to overcome some of the limitations and costs associated with conventional screening methods. In addition, we highlight newer techniques with the potential to be applied to the high-throughput evaluation of DNA-encoded chemical libraries. Copyright © 2015 Elsevier Ltd. All rights reserved.
Procedure for normalization of cDNA libraries
Bonaldo, M.D.; Soares, M.B.
1997-12-30
This invention provides a method to normalize a cDNA library constructed in a vector capable of being converted to single-stranded circles and capable of producing complementary nucleic acid molecules to the single-stranded circles comprising: (a) converting the cDNA library in single-stranded circles; (b) generating complementary nucleic acid molecules to the single-stranded circles; (c) hybridizing the single-stranded circles converted in step (a) with complementary nucleic acid molecules of step (b) to produce partial duplexes to an appropriate Cot; (e) separating the unhybridized single-stranded circles from the hybridized single-stranded circles, thereby generating a normalized cDNA library. 1 fig.
Lab-on-a-chip platform for high throughput drug discovery with DNA-encoded chemical libraries
NASA Astrophysics Data System (ADS)
Grünzner, S.; Reddavide, F. V.; Steinfelder, C.; Cui, M.; Busek, M.; Klotzbach, U.; Zhang, Y.; Sonntag, F.
2017-02-01
The fast development of DNA-encoded chemical libraries (DECL) in the past 10 years has received great attention from pharmaceutical industries. It applies the selection approach for small molecular drug discovery. Because of the limited choices of DNA-compatible chemical reactions, most DNA-encoded chemical libraries have a narrow structural diversity and low synthetic yield. There is also a poor correlation between the ranking of compounds resulted from analyzing the sequencing data and the affinity measured through biochemical assays. By combining DECL with dynamical chemical library, the resulting DNA-encoded dynamic library (EDCCL) explores the thermodynamic equilibrium of reversible reactions as well as the advantages of DNA encoded compounds for manipulation/detection, thus leads to enhanced signal-to-noise ratio of the selection process and higher library quality. However, the library dynamics are caused by the weak interactions between the DNA strands, which also result in relatively low affinity of the bidentate interaction, as compared to a stable DNA duplex. To take advantage of both stably assembled dual-pharmacophore libraries and EDCCLs, we extended the concept of EDCCLs to heat-induced EDCCLs (hi-EDCCLs), in which the heat-induced recombination process of stable DNA duplexes and affinity capture are carried out separately. To replace the extremely laborious and repetitive manual process, a fully automated device will facilitate the use of DECL in drug discovery. Herein we describe a novel lab-on-a-chip platform for high throughput drug discovery with hi-EDCCL. A microfluidic system with integrated actuation was designed which is able to provide a continuous sample circulation by reducing the volume to a minimum. It consists of a cooled and a heated chamber for constant circulation. The system is capable to generate stable temperatures above 75 °C in the heated chamber to melt the double strands of the DNA and less than 15 °C in the cooled chamber, to reanneal the reshuffled library. In the binding chamber (the cooled chamber) specific retaining structures are integrated. These hold back beads functionalized with the target protein, while the chamber is continuously flushed with library molecules. Afterwards the whole system can be flushed with buffer to wash out unspecific bound molecules. Finally the protein-loaded beads with attached molecules can be eluted for further investigation.
A method for high-throughput production of sequence-verified DNA libraries and strain collections.
Smith, Justin D; Schlecht, Ulrich; Xu, Weihong; Suresh, Sundari; Horecka, Joe; Proctor, Michael J; Aiyar, Raeka S; Bennett, Richard A O; Chu, Angela; Li, Yong Fuga; Roy, Kevin; Davis, Ronald W; Steinmetz, Lars M; Hyman, Richard W; Levy, Sasha F; St Onge, Robert P
2017-02-13
The low costs of array-synthesized oligonucleotide libraries are empowering rapid advances in quantitative and synthetic biology. However, high synthesis error rates, uneven representation, and lack of access to individual oligonucleotides limit the true potential of these libraries. We have developed a cost-effective method called Recombinase Directed Indexing (REDI), which involves integration of a complex library into yeast, site-specific recombination to index library DNA, and next-generation sequencing to identify desired clones. We used REDI to generate a library of ~3,300 DNA probes that exhibited > 96% purity and remarkable uniformity (> 95% of probes within twofold of the median abundance). Additionally, we created a collection of ~9,000 individually accessible CRISPR interference yeast strains for > 99% of genes required for either fermentative or respiratory growth, demonstrating the utility of REDI for rapid and cost-effective creation of strain collections from oligonucleotide pools. Our approach is adaptable to any complex DNA library, and fundamentally changes how these libraries can be parsed, maintained, propagated, and characterized. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.
Rapid and efficient cDNA library screening by self-ligation of inverse PCR products (SLIP).
Hoskins, Roger A; Stapleton, Mark; George, Reed A; Yu, Charles; Wan, Kenneth H; Carlson, Joseph W; Celniker, Susan E
2005-12-02
cDNA cloning is a central technology in molecular biology. cDNA sequences are used to determine mRNA transcript structures, including splice junctions, open reading frames (ORFs) and 5'- and 3'-untranslated regions (UTRs). cDNA clones are valuable reagents for functional studies of genes and proteins. Expressed Sequence Tag (EST) sequencing is the method of choice for recovering cDNAs representing many of the transcripts encoded in a eukaryotic genome. However, EST sequencing samples a cDNA library at random, and it recovers transcripts with low expression levels inefficiently. We describe a PCR-based method for directed screening of plasmid cDNA libraries. We demonstrate its utility in a screen of libraries used in our Drosophila EST projects for 153 transcription factor genes that were not represented by full-length cDNA clones in our Drosophila Gene Collection. We recovered high-quality, full-length cDNAs for 72 genes and variously compromised clones for an additional 32 genes. The method can be used at any scale, from the isolation of cDNA clones for a particular gene of interest, to the improvement of large gene collections in model organisms and the human. Finally, we discuss the relative merits of directed cDNA library screening and RT-PCR approaches.
Robust DNA Isolation and High-throughput Sequencing Library Construction for Herbarium Specimens.
Saeidi, Saman; McKain, Michael R; Kellogg, Elizabeth A
2018-03-08
Herbaria are an invaluable source of plant material that can be used in a variety of biological studies. The use of herbarium specimens is associated with a number of challenges including sample preservation quality, degraded DNA, and destructive sampling of rare specimens. In order to more effectively use herbarium material in large sequencing projects, a dependable and scalable method of DNA isolation and library preparation is needed. This paper demonstrates a robust, beginning-to-end protocol for DNA isolation and high-throughput library construction from herbarium specimens that does not require modification for individual samples. This protocol is tailored for low quality dried plant material and takes advantage of existing methods by optimizing tissue grinding, modifying library size selection, and introducing an optional reamplification step for low yield libraries. Reamplification of low yield DNA libraries can rescue samples derived from irreplaceable and potentially valuable herbarium specimens, negating the need for additional destructive sampling and without introducing discernible sequencing bias for common phylogenetic applications. The protocol has been tested on hundreds of grass species, but is expected to be adaptable for use in other plant lineages after verification. This protocol can be limited by extremely degraded DNA, where fragments do not exist in the desired size range, and by secondary metabolites present in some plant material that inhibit clean DNA isolation. Overall, this protocol introduces a fast and comprehensive method that allows for DNA isolation and library preparation of 24 samples in less than 13 h, with only 8 h of active hands-on time with minimal modifications.
Murgha, Yusuf; Beliveau, Brian; Semrau, Kassandra; Schwartz, Donald; Wu, Chao-Ting; Gulari, Erdogan; Rouillard, Jean-Marie
2015-06-01
Oligonucleotide microarrays allow the production of complex custom oligonucleotide libraries for nucleic acid detection-based applications such as fluorescence in situ hybridization (FISH). We have developed a PCR-free method to make single-stranded DNA (ssDNA) fluorescent probes through an intermediate RNA library. A double-stranded oligonucleotide library is amplified by transcription to create an RNA library. Next, dye- or hapten-conjugate primers are used to reverse transcribe the RNA to produce a dye-labeled cDNA library. Finally the RNA is hydrolyzed under alkaline conditions to obtain the single-stranded fluorescent probes library. Starting from unique oligonucleotide library constructs, we present two methods to produce single-stranded probe libraries. The two methods differ in the type of reverse transcription (RT) primer, the incorporation of fluorescent dye, and the purification of fluorescent probes. The first method employs dye-labeled reverse transcription primers to produce multiple differentially single-labeled probe subsets from one microarray library. The fluorescent probes are purified from excess primers by oligonucleotide-bead capture. The second method uses an RNA:DNA chimeric primer and amino-modified nucleotides to produce amino-allyl probes. The excess primers and RNA are hydrolyzed under alkaline conditions, followed by probe purification and labeling with amino-reactive dyes. The fluorescent probes created by the combination of transcription and reverse transcription can be used for FISH and to detect any RNA and DNA targets via hybridization.
NASA Astrophysics Data System (ADS)
Yoshikazu, Kawata; Shin-Ichi, Yano; Hiroyuki, Kojima
1998-03-01
An efficient and simple method for constructing a genomic DNA library using a TA cloning vector is presented. It is based on the sonicative cleavage of genomic DNA and modification of fragment ends with Taq DNA polymerase, followed by ligation using a TA vector. This method was applied for cloning of the phytoene synthase gene crt B from Spirulina platensis. This method is useful when genomic DNA cannot be efficiently digested with restriction enzymes, a problem often encountered during the construction of a genomic DNA library of cyanobacteria.
Ligation Bias in Illumina Next-Generation DNA Libraries: Implications for Sequencing Ancient Genomes
Seguin-Orlando, Andaine; Schubert, Mikkel; Clary, Joel; Stagegaard, Julia; Alberdi, Maria T.; Prado, José Luis; Prieto, Alfredo; Willerslev, Eske; Orlando, Ludovic
2013-01-01
Ancient DNA extracts consist of a mixture of endogenous molecules and contaminant DNA templates, often originating from environmental microbes. These two populations of templates exhibit different chemical characteristics, with the former showing depurination and cytosine deamination by-products, resulting from post-mortem DNA damage. Such chemical modifications can interfere with the molecular tools used for building second-generation DNA libraries, and limit our ability to fully characterize the true complexity of ancient DNA extracts. In this study, we first use fresh DNA extracts to demonstrate that library preparation based on adapter ligation at AT-overhangs are biased against DNA templates starting with thymine residues, contrarily to blunt-end adapter ligation. We observe the same bias on fresh DNA extracts sheared on Bioruptor, Covaris and nebulizers. This contradicts previous reports suggesting that this bias could originate from the methods used for shearing DNA. This also suggests that AT-overhang adapter ligation efficiency is affected in a sequence-dependent manner and results in an uneven representation of different genomic contexts. We then show how this bias could affect the base composition of ancient DNA libraries prepared following AT-overhang ligation, mainly by limiting the ability to ligate DNA templates starting with thymines and therefore deaminated cytosines. This results in particular nucleotide misincorporation damage patterns, deviating from the signature generally expected for authenticating ancient sequence data. Consequently, we show that models adequate for estimating post-mortem DNA damage levels must be robust to the molecular tools used for building ancient DNA libraries. PMID:24205269
Immune-Related Transcriptome of Coptotermes formosanus Shiraki Workers: The Defense Mechanism
Hussain, Abid; Li, Yi-Feng; Cheng, Yu; Liu, Yang; Chen, Chuan-Cheng; Wen, Shuo-Yang
2013-01-01
Formosan subterranean termites, Coptotermes formosanus Shiraki, live socially in microbial-rich habitats. To understand the molecular mechanism by which termites combat pathogenic microbes, a full-length normalized cDNA library and four Suppression Subtractive Hybridization (SSH) libraries were constructed from termite workers infected with entomopathogenic fungi (Metarhizium anisopliae and Beauveria bassiana), Gram-positive Bacillus thuringiensis and Gram-negative Escherichia coli, and the libraries were analyzed. From the high quality normalized cDNA library, 439 immune-related sequences were identified. These sequences were categorized as pattern recognition receptors (47 sequences), signal modulators (52 sequences), signal transducers (137 sequences), effectors (39 sequences) and others (164 sequences). From the SSH libraries, 27, 17, 22 and 15 immune-related genes were identified from each SSH library treated with M. anisopliae, B. bassiana, B. thuringiensis and E. coli, respectively. When the normalized cDNA library was compared with the SSH libraries, 37 immune-related clusters were found in common; 56 clusters were identified in the SSH libraries, and 259 were identified in the normalized cDNA library. The immune-related gene expression pattern was further investigated using quantitative real time PCR (qPCR). Important immune-related genes were characterized, and their potential functions were discussed based on the integrated analysis of the results. We suggest that normalized cDNA and SSH libraries enable us to discover functional genes transcriptome. The results remarkably expand our knowledge about immune-inducible genes in C. formosanus Shiraki and enable the future development of novel control strategies for the management of Formosan subterranean termites. PMID:23874972
Satz, Alexander L; Hochstrasser, Remo; Petersen, Ann C
2017-04-10
To optimize future DNA-encoded library design, we have attempted to quantify the library size at which the signal becomes undetectable. To accomplish this we (i) have calculated that percent yields of individual library members following a screen range from 0.002 to 1%, (ii) extrapolated that ∼1 million copies per library member are required at the outset of a screen, and (iii) from this extrapolation predict that false negative rates will begin to outweigh the benefit of increased diversity at library sizes >10 8 . The above analysis is based upon a large internal data set comprising multiple screens, targets, and libraries; we also augmented our internal data with all currently available literature data. In theory, high false negative rates may be overcome by employing larger amounts of library; however, we argue that using more than currently reported amounts of library (≫10 nmoles) is impractical. The above conclusions may be generally applicable to other DNA encoded library platforms, particularly those platforms that do not allow for library amplification.
Nogales, Balbina; Moore, Edward R. B.; Llobet-Brossa, Enrique; Rossello-Mora, Ramon; Amann, Rudolf; Timmis, Kenneth N.
2001-01-01
The bacterial diversity assessed from clone libraries prepared from rRNA (two libraries) and ribosomal DNA (rDNA) (one library) from polychlorinated biphenyl (PCB)-polluted soil has been analyzed. A good correspondence of the community composition found in the two types of library was observed. Nearly 29% of the cloned sequences in the rDNA library were identical to sequences in the rRNA libraries. More than 60% of the total cloned sequence types analyzed were grouped in phylogenetic groups (a clone group with sequence similarity higher than 97% [98% for Burkholderia and Pseudomonas-type clones]) represented in both types of libraries. Some of those phylogenetic groups, mostly represented by a single (or pair) of cloned sequence type(s), were observed in only one of the types of library. An important difference between the libraries was the lack of clones representative of the Actinobacteria in the rDNA library. The PCB-polluted soil exhibited a high bacterial diversity which included representatives of two novel lineages. The apparent abundance of bacteria affiliated to the beta-subclass of the Proteobacteria, and to the genus Burkholderia in particular, was confirmed by fluorescence in situ hybridization analysis. The possible influence on apparent diversity of low template concentrations was assessed by dilution of the RNA template prior to amplification by reverse transcription-PCR. Although differences in the composition of the two rRNA libraries obtained from high and low RNA concentrations were observed, the main components of the bacterial community were represented in both libraries, and therefore their detection was not compromised by the lower concentrations of template used in this study. PMID:11282645
Twenty-five Years of DNA-Encoded Chemical Libraries.
Neri, Dario
2017-05-04
Reference library: The availability of DNA-encoded chemical libraries containing billions of compounds facilitates the discovery of binding molecules for pharmaceutical applications and for investigating biological processes. This Special Issue highlights the use of this library technology and some of the latest developments in the field. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Optimized Reaction Conditions for Amide Bond Formation in DNA-Encoded Combinatorial Libraries.
Li, Yizhou; Gabriele, Elena; Samain, Florent; Favalli, Nicholas; Sladojevich, Filippo; Scheuermann, Jörg; Neri, Dario
2016-08-08
DNA-encoded combinatorial libraries are increasingly being used as tools for the discovery of small organic binding molecules to proteins of biological or pharmaceutical interest. In the majority of cases, synthetic procedures for the formation of DNA-encoded combinatorial libraries incorporate at least one step of amide bond formation between amino-modified DNA and a carboxylic acid. We investigated reaction conditions and established a methodology by using 1-ethyl-3-(3-(dimethylamino)propyl)carbodiimide, 1-hydroxy-7-azabenzotriazole and N,N'-diisopropylethylamine (EDC/HOAt/DIPEA) in combination, which provided conversions greater than 75% for 423/543 (78%) of the carboxylic acids tested. These reaction conditions were efficient with a variety of primary and secondary amines, as well as with various types of amino-modified oligonucleotides. The reaction conditions, which also worked efficiently over a broad range of DNA concentrations and reaction scales, should facilitate the synthesis of novel DNA-encoded combinatorial libraries.
Comparison of large-insert, small-insert and pyrosequencing libraries for metagenomic analysis.
Danhorn, Thomas; Young, Curtis R; DeLong, Edward F
2012-11-01
The development of DNA sequencing methods for characterizing microbial communities has evolved rapidly over the past decades. To evaluate more traditional, as well as newer methodologies for DNA library preparation and sequencing, we compared fosmid, short-insert shotgun and 454 pyrosequencing libraries prepared from the same metagenomic DNA samples. GC content was elevated in all fosmid libraries, compared with shotgun and 454 libraries. Taxonomic composition of the different libraries suggested that this was caused by a relative underrepresentation of dominant taxonomic groups with low GC content, notably Prochlorales and the SAR11 cluster, in fosmid libraries. While these abundant taxa had a large impact on library representation, we also observed a positive correlation between taxon GC content and fosmid library representation in other low-GC taxa, suggesting a general trend. Analysis of gene category representation in different libraries indicated that the functional composition of a library was largely a reflection of its taxonomic composition, and no additional systematic biases against particular functional categories were detected at the level of sequencing depth in our samples. Another important but less predictable factor influencing the apparent taxonomic and functional library composition was the read length afforded by the different sequencing technologies. Our comparisons and analyses provide a detailed perspective on the influence of library type on the recovery of microbial taxa in metagenomic libraries and underscore the different uses and utilities of more traditional, as well as contemporary 'next-generation' DNA library construction and sequencing technologies for exploring the genomics of the natural microbial world.
Display of a maize cDNA library on baculovirus infected insect cells.
Meller Harel, Helene Y; Fontaine, Veronique; Chen, Hongying; Jones, Ian M; Millner, Paul A
2008-08-12
Maize is a good model system for cereal crop genetics and development because of its rich genetic heritage and well-characterized morphology. The sequencing of its genome is well advanced, and new technologies for efficient proteomic analysis are needed. Baculovirus expression systems have been used for the last twenty years to express in insect cells a wide variety of eukaryotic proteins that require complex folding or extensive posttranslational modification. More recently, baculovirus display technologies based on the expression of foreign sequences on the surface of Autographa californica (AcMNPV) have been developed. We investigated the potential of a display methodology for a cDNA library of maize young seedlings. We constructed a full-length cDNA library of young maize etiolated seedlings in the transfer vector pAcTMVSVG. The library contained a total of 2.5 x 10(5) independent clones. Expression of two known maize proteins, calreticulin and auxin binding protein (ABP1), was shown by western blot analysis of protein extracts from insect cells infected with the cDNA library. Display of the two proteins in infected insect cells was shown by selective biopanning using magnetic cell sorting and demonstrated proof of concept that the baculovirus maize cDNA display library could be used to identify and isolate proteins. The maize cDNA library constructed in this study relies on the novel technology of baculovirus display and is unique in currently published cDNA libraries. Produced to demonstrate proof of principle, it opens the way for the development of a eukaryotic in vivo display tool which would be ideally suited for rapid screening of the maize proteome for binding partners, such as proteins involved in hormone regulation or defence.
Preparation of fosmid libraries and functional metagenomic analysis of microbial community DNA.
Martínez, Asunción; Osburne, Marcia S
2013-01-01
One of the most important challenges in contemporary microbial ecology is to assign a functional role to the large number of novel genes discovered through large-scale sequencing of natural microbial communities that lack similarity to genes of known function. Functional screening of metagenomic libraries, that is, screening environmental DNA clones for the ability to confer an activity of interest to a heterologous bacterial host, is a promising approach for bridging the gap between metagenomic DNA sequencing and functional characterization. Here, we describe methods for isolating environmental DNA and constructing metagenomic fosmid libraries, as well as methods for designing and implementing successful functional screens of such libraries. © 2013 Elsevier Inc. All rights reserved.
Trebitz, Anett S; Hoffman, Joel C; Grant, George W; Billehus, Tyler M; Pilgrim, Erik M
2015-07-22
DNA-based identification of mixed-organism samples offers the potential to greatly reduce the need for resource-intensive morphological identification, which would be of value both to bioassessment and non-native species monitoring. The ability to assign species identities to DNA sequences found depends on the availability of comprehensive DNA reference libraries. Here, we compile inventories for aquatic metazoans extant in or threatening to invade the Laurentian Great Lakes and examine the availability of reference mitochondrial COI DNA sequences (barcodes) in the Barcode of Life Data System for them. We found barcode libraries largely complete for extant and threatening-to-invade vertebrates (100% of reptile, 99% of fish, and 92% of amphibian species had barcodes). In contrast, barcode libraries remain poorly developed for precisely those organisms where morphological identification is most challenging; 46% of extant invertebrates lacked reference barcodes with rates especially high among rotifers, oligochaetes, and mites. Lack of species-level identification for many aquatic invertebrates also is a barrier to matching DNA sequences with physical specimens. Attaining the potential for DNA-based identification of mixed-organism samples covering the breadth of aquatic fauna requires a concerted effort to build supporting barcode libraries and voucher collections.
NASA Astrophysics Data System (ADS)
Trebitz, Anett S.; Hoffman, Joel C.; Grant, George W.; Billehus, Tyler M.; Pilgrim, Erik M.
2015-07-01
DNA-based identification of mixed-organism samples offers the potential to greatly reduce the need for resource-intensive morphological identification, which would be of value both to bioassessment and non-native species monitoring. The ability to assign species identities to DNA sequences found depends on the availability of comprehensive DNA reference libraries. Here, we compile inventories for aquatic metazoans extant in or threatening to invade the Laurentian Great Lakes and examine the availability of reference mitochondrial COI DNA sequences (barcodes) in the Barcode of Life Data System for them. We found barcode libraries largely complete for extant and threatening-to-invade vertebrates (100% of reptile, 99% of fish, and 92% of amphibian species had barcodes). In contrast, barcode libraries remain poorly developed for precisely those organisms where morphological identification is most challenging; 46% of extant invertebrates lacked reference barcodes with rates especially high among rotifers, oligochaetes, and mites. Lack of species-level identification for many aquatic invertebrates also is a barrier to matching DNA sequences with physical specimens. Attaining the potential for DNA-based identification of mixed-organism samples covering the breadth of aquatic fauna requires a concerted effort to build supporting barcode libraries and voucher collections.
Easy preparation of a large-size random gene mutagenesis library in Escherichia coli.
You, Chun; Percival Zhang, Y-H
2012-09-01
A simple and fast protocol for the preparation of a large-size mutant library for directed evolution in Escherichia coli was developed based on the DNA multimers generated by prolonged overlap extension polymerase chain reaction (POE-PCR). This protocol comprised the following: (i) a linear DNA mutant library was generated by error-prone PCR or shuffling, and a linear vector backbone was prepared by regular PCR; (ii) the DNA multimers were generated based on these two DNA templates by POE-PCR; and (iii) the one restriction enzyme-digested DNA multimers were ligated to circular plasmids, followed by transformation to E. coli. Because the ligation efficiency of one DNA fragment was several orders of magnitude higher than that of two DNA fragments for typical mutant library construction, it was very easy to generate a mutant library with a size of more than 10(7) protein mutants per 50 μl of the POE-PCR product. Via this method, four new fluorescent protein mutants were obtained based on monomeric cherry fluorescent protein. This new protocol was simple and fast because it did not require labor-intensive optimizations in restriction enzyme digestion and ligation, did not involve special plasmid design, and enabled constructing a large-size mutant library for directed enzyme evolution within 1 day. Copyright © 2012 Elsevier Inc. All rights reserved.
DNA-Compatible Nitro Reduction and Synthesis of Benzimidazoles.
Du, Huang-Chi; Huang, Hongbing
2017-10-18
DNA-encoded chemical libraries have emerged as a cost-effective alternative to high-throughput screening (HTS) for hit identification in drug discovery. A key factor for productive DNA-encoded libraries is the chemical diversity of the small molecule moiety attached to an encoding DNA oligomer. The library structure diversity is often limited to DNA-compatible chemical reactions in aqueous media. Herein, we describe a facile process for reducing aryl nitro groups to aryl amines. The new protocol offers simple operation and circumvents the pyrophoric potential of the conventional method (Raney nickel). The reaction is performed in aqueous solution and does not compromise DNA structural integrity. The utility of this method is demonstrated by the versatile synthesis of benzimidazoles on DNA.
Wu, Zining; Graybill, Todd L; Zeng, Xin; Platchek, Michael; Zhang, Jean; Bodmer, Vera Q; Wisnoski, David D; Deng, Jianghe; Coppo, Frank T; Yao, Gang; Tamburino, Alex; Scavello, Genaro; Franklin, G Joseph; Mataruse, Sibongile; Bedard, Katie L; Ding, Yun; Chai, Jing; Summerfield, Jennifer; Centrella, Paolo A; Messer, Jeffrey A; Pope, Andrew J; Israel, David I
2015-12-14
DNA-encoded small-molecule library technology has recently emerged as a new paradigm for identifying ligands against drug targets. To date, this technology has been used with soluble protein targets that are produced and used in a purified state. Here, we describe a cell-based method for identifying small-molecule ligands from DNA-encoded libraries against integral membrane protein targets. We use this method to identify novel, potent, and specific inhibitors of NK3, a member of the tachykinin family of G-protein coupled receptors (GPCRs). The method is simple and broadly applicable to other GPCRs and integral membrane proteins. We have extended the application of DNA-encoded library technology to membrane-associated targets and demonstrate the feasibility of selecting DNA-tagged, small-molecule ligands from complex combinatorial libraries against targets in a heterogeneous milieu, such as the surface of a cell.
Piggott, Andrew M; Kriegel, Alison M; Willows, Robert D; Karuso, Peter
2009-10-01
Reverse chemical proteomics using T7 phage display is a powerful technique for identifying cellular receptors of biologically active small molecules. However, to date this method has generally been limited to cDNA libraries constructed from mRNA isolated from eukaryotes. In this paper, we describe the construction of the first prokaryotic T7 phage display libraries from randomly digested Pseudomonas stutzeri and Vibrio fischeri gDNA, as well as a plant cDNA library from Arabidopsis thaliana. We also describe the use of T7 phage display to identify novel proteins from environmental DNA samples using biotinylated FK506 as a model affinity probe.
Robust Sub-nanomolar Library Preparation for High Throughput Next Generation Sequencing.
Wu, Wells W; Phue, Je-Nie; Lee, Chun-Ting; Lin, Changyi; Xu, Lai; Wang, Rong; Zhang, Yaqin; Shen, Rong-Fong
2018-05-04
Current library preparation protocols for Illumina HiSeq and MiSeq DNA sequencers require ≥2 nM initial library for subsequent loading of denatured cDNA onto flow cells. Such amounts are not always attainable from samples having a relatively low DNA or RNA input; or those for which a limited number of PCR amplification cycles is preferred (less PCR bias and/or more even coverage). A well-tested sub-nanomolar library preparation protocol for Illumina sequencers has however not been reported. The aim of this study is to provide a much needed working protocol for sub-nanomolar libraries to achieve outcomes as informative as those obtained with the higher library input (≥ 2 nM) recommended by Illumina's protocols. Extensive studies were conducted to validate a robust sub-nanomolar (initial library of 100 pM) protocol using PhiX DNA (as a control), genomic DNA (Bordetella bronchiseptica and microbial mock community B for 16S rRNA gene sequencing), messenger RNA, microRNA, and other small noncoding RNA samples. The utility of our protocol was further explored for PhiX library concentrations as low as 25 pM, which generated only slightly fewer than 50% of the reads achieved under the standard Illumina protocol starting with > 2 nM. A sub-nanomolar library preparation protocol (100 pM) could generate next generation sequencing (NGS) results as robust as the standard Illumina protocol. Following the sub-nanomolar protocol, libraries with initial concentrations as low as 25 pM could also be sequenced to yield satisfactory and reproducible sequencing results.
Sonet, Gontran; Jordaens, Kurt; Braet, Yves; Bourguignon, Luc; Dupont, Eréna; Backeljau, Thierry; De Meyer, Marc; Desmyter, Stijn
2013-01-01
Abstract Fly larvae living on dead corpses can be used to estimate post-mortem intervals. The identification of these flies is decisive in forensic casework and can be facilitated by using DNA barcodes provided that a representative and comprehensive reference library of DNA barcodes is available. We constructed a local (Belgium and France) reference library of 85 sequences of the COI DNA barcode fragment (mitochondrial cytochrome c oxidase subunit I gene), from 16 fly species of forensic interest (Calliphoridae, Muscidae, Fanniidae). This library was then used to evaluate the ability of two public libraries (GenBank and the Barcode of Life Data Systems – BOLD) to identify specimens from Belgian and French forensic cases. The public libraries indeed allow a correct identification of most specimens. Yet, some of the identifications remain ambiguous and some forensically important fly species are not, or insufficiently, represented in the reference libraries. Several search options offered by GenBank and BOLD can be used to further improve the identifications obtained from both libraries using DNA barcodes. PMID:24453564
Gao, Jin-Xin; Jing, Jing; Yu, Chuan-Jin; Chen, Jie
2015-06-01
Curvularia lunata is an important maize foliar fungal pathogen that distributes widely in maize growing area in China, and several key pathogenic factors have been isolated. An yeast two-hybrid (Y2H) library is a very useful platform to further unravel novel pathogenic factors in C. lunata. To construct a high-quality full length-expression cDNA library from the C. lunata for application to pathogenesis-related protein-protein interaction screening, total RNA was extracted. The SMART (Switching Mechanism At 5' end of the RNA Transcript) technique was used for cDNA synthesis. Double-stranded cDNA was ligated into the pGADT7-Rec vector with Herring Testes Carrier DNA using homologous recombination method. The ligation mixture was transformed into competent yeast AH109 cells to construct the primary cDNA library. Eventually, a high qualitative library was successfully established according to an evaluation on quality. The transformation efficiency was about 6.39 ×10(5) transformants/3 μg pGADT7-Rec. The titer of the primary cDNA library was 2.5×10(8) cfu/mL. The numbers for the cDNA library was 2.46×10(5). Randomly picked clones show that the recombination rate was 88.24%. Gel electrophoresis results indicated that the fragments ranged from 0.4 kb to 3.0 kb. Melanin synthesis protein Brn1 (1,3,8-hydroxynaphthalene reductase) was used as a "bait" to test the sufficiency of the Y2H library. As a result, a cDNA clone encoding VelB protein that was known to be involved in the regulation of diverse cellular processes, including control of secondary metabolism containing melanin and toxin production in many filamentous fungi was identified. Further study on the exact role of the VelB gene is underway.
Building a DNA barcode library of Alaska's non-marine arthropods.
Sikes, Derek S; Bowser, Matthew; Morton, John M; Bickford, Casey; Meierotto, Sarah; Hildebrandt, Kyndall
2017-03-01
Climate change may result in ecological futures with novel species assemblages, trophic mismatch, and mass extinction. Alaska has a limited taxonomic workforce to address these changes. We are building a DNA barcode library to facilitate a metabarcoding approach to monitoring non-marine arthropods. Working with the Canadian Centre for DNA Barcoding, we obtained DNA barcodes from recently collected and authoritatively identified specimens in the University of Alaska Museum (UAM) Insect Collection and the Kenai National Wildlife Refuge collection. We submitted tissues from 4776 specimens, of which 81% yielded DNA barcodes representing 1662 species and 1788 Barcode Index Numbers (BINs), of primarily terrestrial, large-bodied arthropods. This represents 84% of the species available for DNA barcoding in the UAM Insect Collection. There are now 4020 Alaskan arthropod species represented by DNA barcodes, after including all records in Barcode of Life Data Systems (BOLD) of species that occur in Alaska - i.e., 48.5% of the 8277 Alaskan, non-marine-arthropod, named species have associated DNA barcodes. An assessment of the identification power of the library in its current state yielded fewer species-level identifications than expected, but the results were not discouraging. We believe we are the first to deliberately begin development of a DNA barcode library of the entire arthropod fauna for a North American state or province. Although far from complete, this library will become increasingly valuable as more species are added and costs to obtain DNA sequences fall.
Hoople, Gordon D; Richards, Andrew; Wu, Yan; Pisano, Albert P; Zhang, Kun
2018-03-26
The ability to amplify and sequence either DNA or RNA from small starting samples has only been achieved in the last five years. Unfortunately, the standard protocols for generating genomic or transcriptomic libraries are incompatible and researchers must choose whether to sequence DNA or RNA for a particular sample. Gel-seq solves this problem by enabling researchers to simultaneously prepare libraries for both DNA and RNA starting with 100 - 1000 cells using a simple hydrogel device. This paper presents a detailed approach for the fabrication of the device as well as the biological protocol to generate paired libraries. We designed Gel-seq so that it could be easily implemented by other researchers; many genetics labs already have the necessary equipment to reproduce the Gel-seq device fabrication. Our protocol employs commonly-used kits for both whole-transcript amplification (WTA) and library preparation, which are also likely to be familiar to researchers already versed in generating genomic and transcriptomic libraries. Our approach allows researchers to bring to bear the power of both DNA and RNA sequencing on a single sample without splitting and with negligible added cost.
[Construction of large fragment metagenome library of natural mangrove soil].
Jiang, Yun-Xia; Zheng, Tian-Ling
2007-11-01
Applying our optimized direct extraction method, the percentage of large fragment DNA in the total extracted mangrove soil DNA was significant increased. The large fragment metagenome library derived from natural mangrove soil over four seasons was successfully constructed by the optimized DNA extraction and electro elution purification method. All of the clones had recombinant Cosmids and each differed in their fragment profiles when Cosmid DNA was extracted from 12 randomly picked colonies and digested with BamHI. The average insert size for this library was larger than 35 kbp. This culturing-independent library at least encompassed 335 Mbp valuable genetic information of mangrove soil microbes. It allowed mining of valuable intertidal microbial resource to become a reality. It is a recommended method for those researchers who have still not circumvented the large insert environmental libraries or for those beginning research in this field, so as to avoid them attempting repetitive, fussy work.
Neri, Dario; Lerner, Richard A
2018-06-20
The discovery of organic ligands that bind specifically to proteins is a central problem in chemistry, biology, and the biomedical sciences. The encoding of individual organic molecules with distinctive DNA tags, serving as amplifiable identification bar codes, allows the construction and screening of combinatorial libraries of unprecedented size, thus facilitating the discovery of ligands to many different protein targets. Fundamentally, one links powers of genetics and chemical synthesis. After the initial description of DNA-encoded chemical libraries in 1992, several experimental embodiments of the technology have been reduced to practice. This review provides a historical account of important milestones in the development of DNA-encoded chemical libraries, a survey of relevant ongoing research activities, and a glimpse into the future.
Kerschner, Joseph E; Erdos, Geza; Hu, Fen Ze; Burrows, Amy; Cioffi, Joseph; Khampang, Pawjai; Dahlgren, Margaret; Hayes, Jay; Keefe, Randy; Janto, Benjamin; Post, J Christopher; Ehrlich, Garth D
2010-04-01
We sought to construct and partially characterize complementary DNA (cDNA) libraries prepared from the middle ear mucosa (MEM) of chinchillas to better understand pathogenic aspects of infection and inflammation, particularly with respect to leukotriene biogenesis and response. Chinchilla MEM was harvested from controls and after middle ear inoculation with nontypeable Haemophilus influenzae. RNA was extracted to generate cDNA libraries. Randomly selected clones were subjected to sequence analysis to characterize the libraries and to provide DNA sequence for phylogenetic analyses. Reverse transcription-polymerase chain reaction of the RNA pools was used to generate cDNA sequences corresponding to genes associated with leukotriene biosynthesis and metabolism. Sequence analysis of 921 randomly selected clones from the uninfected MEM cDNA library produced approximately 250,000 nucleotides of almost entirely novel sequence data. Searches of the GenBank database with the Basic Local Alignment Search Tool provided for identification of 515 unique genes expressed in the MEM and not previously described in chinchillas. In almost all cases, the chinchilla cDNA sequences displayed much greater homology to human or other primate genes than with rodent species. Genes associated with leukotriene metabolism were present in both normal and infected MEM. Based on both phylogenetic comparisons and gene expression similarities with humans, chinchilla MEM appears to be an excellent model for the study of middle ear inflammation and infection. The higher degree of sequence similarity between chinchillas and humans compared to chinchillas and rodents was unexpected. The cDNA libraries from normal and infected chinchilla MEM will serve as useful molecular tools in the study of otitis media and should yield important information with respect to middle ear pathogenesis.
Kerschner, Joseph E.; Erdos, Geza; Hu, Fen Ze; Burrows, Amy; Cioffi, Joseph; Khampang, Pawjai; Dahlgren, Margaret; Hayes, Jay; Keefe, Randy; Janto, Benjamin; Post, J. Christopher; Ehrlich, Garth D.
2010-01-01
Objectives We sought to construct and partially characterize complementary DNA (cDNA) libraries prepared from the middle ear mucosa (MEM) of chinchillas to better understand pathogenic aspects of infection and inflammation, particularly with respect to leukotriene biogenesis and response. Methods Chinchilla MEM was harvested from controls and after middle ear inoculation with nontypeable Haemophilus influenzae. RNA was extracted to generate cDNA libraries. Randomly selected clones were subjected to sequence analysis to characterize the libraries and to provide DNA sequence for phylogenetic analyses. Reverse transcription–polymerase chain reaction of the RNA pools was used to generate cDNA sequences corresponding to genes associated with leukotriene biosynthesis and metabolism. Results Sequence analysis of 921 randomly selected clones from the uninfected MEM cDNA library produced approximately 250,000 nucleotides of almost entirely novel sequence data. Searches of the GenBank database with the Basic Local Alignment Search Tool provided for identification of 515 unique genes expressed in the MEM and not previously described in chinchillas. In almost all cases, the chinchilla cDNA sequences displayed much greater homology to human or other primate genes than with rodent species. Genes associated with leukotriene metabolism were present in both normal and infected MEM. Conclusions Based on both phylogenetic comparisons and gene expression similarities with humans, chinchilla MEM appears to be an excellent model for the study of middle ear inflammation and infection. The higher degree of sequence similarity between chinchillas and humans compared to chinchillas and rodents was unexpected. The cDNA libraries from normal and infected chinchilla MEM will serve as useful molecular tools in the study of otitis media and should yield important information with respect to middle ear pathogenesis. PMID:20433028
Franzini, Raphael M; Samain, Florent; Abd Elrahman, Maaly; Mikutis, Gediminas; Nauer, Angela; Zimmermann, Mauro; Scheuermann, Jörg; Hall, Jonathan; Neri, Dario
2014-08-20
DNA-encoded chemical libraries are collections of small molecules, attached to DNA fragments serving as identification barcodes, which can be screened against multiple protein targets, thus facilitating the drug discovery process. The preparation of large DNA-encoded chemical libraries crucially depends on the availability of robust synthetic methods, which enable the efficient conjugation to oligonucleotides of structurally diverse building blocks, sharing a common reactive group. Reactions of DNA derivatives with amines and/or carboxylic acids are particularly attractive for the synthesis of encoded libraries, in view of the very large number of building blocks that are commercially available. However, systematic studies on these reactions in the presence of DNA have not been reported so far. We first investigated conditions for the coupling of primary amines to oligonucleotides, using either a nucleophilic attack on chloroacetamide derivatives or a reductive amination on aldehyde-modified DNA. While both methods could be used for the production of secondary amines, the reductive amination approach was generally associated with higher yields and better purity. In a second endeavor, we optimized conditions for the coupling of a diverse set of 501 carboxylic acids to DNA derivatives, carrying primary and secondary amine functions. The coupling efficiency was generally higher for primary amines, compared to secondary amine substituents, but varied considerably depending on the structure of the acids and on the synthetic methods used. Optimal reaction conditions could be found for certain sets of compounds (with conversions >80%), but multiple reaction schemes are needed when assembling large libraries with highly diverse building blocks. The reactions and experimental conditions presented in this article should facilitate the synthesis of future DNA-encoded chemical libraries, while outlining the synthetic challenges that remain to be overcome.
Zirconium(IV)-Catalyzed Ring Opening of on-DNA Epoxides in Water.
Fan, Lijun; Davie, Christopher P
2017-05-04
DNA-encoded library technology (ELT) has spurred wide interest in the pharmaceutical industry as a powerful tool for hit and lead generation. In recent years a number of "DNA-compatible" chemical modifications have been published and used to synthesize vastly diverse screening libraries. Herein we report a newly developed, zirconium tetrakis(dodecyl sulfate) [Zr(DS) 4 ] catalyzed ring-opening of on-DNA epoxides in water with amines, including anilines. Subsequent cyclization of the resulting on-DNA β-amino alcohols leads to a variety of biologically interesting, nonaromatic heterocycles. Under these conditions, a library of 137 million on-DNA β-amino alcohols and their cyclization products was assembled. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics
Rinke, Christian; Low, Serene; Woodcroft, Ben J.; ...
2016-09-22
High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. For this study, we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diversemore » Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (~100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics.« less
Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rinke, Christian; Low, Serene; Woodcroft, Ben J.
High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. For this study, we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diversemore » Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (~100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics.« less
Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics
Low, Serene; Raina, Jean-Baptiste; Skarshewski, Adam; Le, Xuyen H.; Butler, Margaret K.; Stocker, Roman; Seymour, Justin; Tyson, Gene W.
2016-01-01
High-throughput sequencing libraries are typically limited by the requirement for nanograms to micrograms of input DNA. This bottleneck impedes the microscale analysis of ecosystems and the exploration of low biomass samples. Current methods for amplifying environmental DNA to bypass this bottleneck introduce considerable bias into metagenomic profiles. Here we describe and validate a simple modification of the Illumina Nextera XT DNA library preparation kit which allows creation of shotgun libraries from sub-nanogram amounts of input DNA. Community composition was reproducible down to 100 fg of input DNA based on analysis of a mock community comprising 54 phylogenetically diverse Bacteria and Archaea. The main technical issues with the low input libraries were a greater potential for contamination, limited DNA complexity which has a direct effect on assembly and binning, and an associated higher percentage of read duplicates. We recommend a lower limit of 1 pg (∼100–1,000 microbial cells) to ensure community composition fidelity, and the inclusion of negative controls to identify reagent-specific contaminants. Applying the approach to marine surface water, pronounced differences were observed between bacterial community profiles of microliter volume samples, which we attribute to biological variation. This result is consistent with expected microscale patchiness in marine communities. We thus envision that our benchmarked, slightly modified low input DNA protocol will be beneficial for microscale and low biomass metagenomics. PMID:27688978
Multi-Threaded DNA Tag/Anti-Tag Library Generator for Multi-Core Platforms
2009-05-01
base pair) Watson ‐ Crick strand pairs that bind perfectly within pairs, but poorly across pairs. A variety of DNA strand hybridization metrics...AFRL-RI-RS-TR-2009-131 Final Technical Report May 2009 MULTI-THREADED DNA TAG/ANTI-TAG LIBRARY GENERATOR FOR MULTI-CORE PLATFORMS...TYPE Final 3. DATES COVERED (From - To) Jun 08 – Feb 09 4. TITLE AND SUBTITLE MULTI-THREADED DNA TAG/ANTI-TAG LIBRARY GENERATOR FOR MULTI-CORE
Litovchick, Alexander; Clark, Matthew A; Keefe, Anthony D
2014-01-01
The affinity-mediated selection of large libraries of DNA-encoded small molecules is increasingly being used to initiate drug discovery programs. We present universal methods for the encoding of such libraries using the chemical ligation of oligonucleotides. These methods may be used to record the chemical history of individual library members during combinatorial synthesis processes. We demonstrate three different chemical ligation methods as examples of information recording processes (writing) for such libraries and two different cDNA-generation methods as examples of information retrieval processes (reading) from such libraries. The example writing methods include uncatalyzed and Cu(I)-catalyzed alkyne-azide cycloadditions and a novel photochemical thymidine-psoralen cycloaddition. The first reading method “relay primer-dependent bypass” utilizes a relay primer that hybridizes across a chemical ligation junction embedded in a fixed-sequence and is extended at its 3′-terminus prior to ligation to adjacent oligonucleotides. The second reading method “repeat-dependent bypass” utilizes chemical ligation junctions that are flanked by repeated sequences. The upstream repeat is copied prior to a rearrangement event during which the 3′-terminus of the cDNA hybridizes to the downstream repeat and polymerization continues. In principle these reading methods may be used with any ligation chemistry and offer universal strategies for the encoding (writing) and interpretation (reading) of DNA-encoded chemical libraries. PMID:25483841
Purification of nanogram-range immunoprecipitated DNA in ChIP-seq application.
Zhong, Jian; Ye, Zhenqing; Lenz, Samuel W; Clark, Chad R; Bharucha, Adil; Farrugia, Gianrico; Robertson, Keith D; Zhang, Zhiguo; Ordog, Tamas; Lee, Jeong-Heon
2017-12-21
Chromatin immunoprecipitation-sequencing (ChIP-seq) is a widely used epigenetic approach for investigating genome-wide protein-DNA interactions in cells and tissues. The approach has been relatively well established but several key steps still require further improvement. As a part of the procedure, immnoprecipitated DNA must undergo purification and library preparation for subsequent high-throughput sequencing. Current ChIP protocols typically yield nanogram quantities of immunoprecipitated DNA mainly depending on the target of interest and starting chromatin input amount. However, little information exists on the performance of reagents used for the purification of such minute amounts of immunoprecipitated DNA in ChIP elution buffer and their effects on ChIP-seq data. Here, we compared DNA recovery, library preparation efficiency, and ChIP-seq results obtained with several commercial DNA purification reagents applied to 1 ng ChIP DNA and also investigated the impact of conditions under which ChIP DNA is stored. We compared DNA recovery of ten commercial DNA purification reagents and phenol/chloroform extraction from 1 to 50 ng of immunopreciptated DNA in ChIP elution buffer. The recovery yield was significantly different with 1 ng of DNA while similar in higher DNA amounts. We also observed that the low nanogram range of purified DNA is prone to loss during storage depending on the type of polypropylene tube used. The immunoprecipitated DNA equivalent to 1 ng of purified DNA was subject to DNA purification and library preparation to evaluate the performance of four better performing purification reagents in ChIP-seq applications. Quantification of library DNAs indicated the selected purification kits have a negligible impact on the efficiency of library preparation. The resulting ChIP-seq data were comparable with the dataset generated by ENCODE consortium and were highly correlated between the data from different purification reagents. This study provides comparative data on commercial DNA purification reagents applied to nanogram-range immunopreciptated ChIP DNA and evidence for the importance of storage conditions of low nanogram-range purified DNA. We verified consistent high performance of a subset of the tested reagents. These results will facilitate the improvement of ChIP-seq methodology for low-input applications.
Design and screening of M13 phage display cDNA libraries.
Georgieva, Yuliya; Konthur, Zoltán
2011-02-17
The last decade has seen a steady increase in screening of cDNA expression product libraries displayed on the surface of filamentous bacteriophage. At the same time, the range of applications extended from the identification of novel allergens over disease markers to protein-protein interaction studies. However, the generation and selection of cDNA phage display libraries is subjected to intrinsic biological limitations due to their complex nature and heterogeneity, as well as technical difficulties regarding protein presentation on the phage surface. Here, we review the latest developments in this field, discuss a number of strategies and improvements anticipated to overcome these challenges making cDNA and open reading frame (ORF) libraries more readily accessible for phage display. Furthermore, future trends combining phage display with next generation sequencing (NGS) will be presented.
DNA polymerase preference determines PCR priming efficiency.
Pan, Wenjing; Byrne-Steele, Miranda; Wang, Chunlin; Lu, Stanley; Clemmons, Scott; Zahorchak, Robert J; Han, Jian
2014-01-30
Polymerase chain reaction (PCR) is one of the most important developments in modern biotechnology. However, PCR is known to introduce biases, especially during multiplex reactions. Recent studies have implicated the DNA polymerase as the primary source of bias, particularly initiation of polymerization on the template strand. In our study, amplification from a synthetic library containing a 12 nucleotide random portion was used to provide an in-depth characterization of DNA polymerase priming bias. The synthetic library was amplified with three commercially available DNA polymerases using an anchored primer with a random 3' hexamer end. After normalization, the next generation sequencing (NGS) results of the amplified libraries were directly compared to the unamplified synthetic library. Here, high throughput sequencing was used to systematically demonstrate and characterize DNA polymerase priming bias. We demonstrate that certain sequence motifs are preferred over others as primers where the six nucleotide sequences at the 3' end of the primer, as well as the sequences four base pairs downstream of the priming site, may influence priming efficiencies. DNA polymerases in the same family from two different commercial vendors prefer similar motifs, while another commercially available enzyme from a different DNA polymerase family prefers different motifs. Furthermore, the preferred priming motifs are GC-rich. The DNA polymerase preference for certain sequence motifs was verified by amplification from single-primer templates. We incorporated the observed DNA polymerase preference into a primer-design program that guides the placement of the primer to an optimal location on the template. DNA polymerase priming bias was characterized using a synthetic library amplification system and NGS. The characterization of DNA polymerase priming bias was then utilized to guide the primer-design process and demonstrate varying amplification efficiencies among three commercially available DNA polymerases. The results suggest that the interaction of the DNA polymerase with the primer:template junction during the initiation of DNA polymerization is very important in terms of overall amplification bias and has broader implications for both the primer design process and multiplex PCR.
Open resource metagenomics: a model for sharing metagenomic libraries.
Neufeld, J D; Engel, K; Cheng, J; Moreno-Hagelsieb, G; Rose, D R; Charles, T C
2011-11-30
Both sequence-based and activity-based exploitation of environmental DNA have provided unprecedented access to the genomic content of cultivated and uncultivated microorganisms. Although researchers deposit microbial strains in culture collections and DNA sequences in databases, activity-based metagenomic studies typically only publish sequences from the hits retrieved from specific screens. Physical metagenomic libraries, conceptually similar to entire sequence datasets, are usually not straightforward to obtain by interested parties subsequent to publication. In order to facilitate unrestricted distribution of metagenomic libraries, we propose the adoption of open resource metagenomics, in line with the trend towards open access publishing, and similar to culture- and mutant-strain collections that have been the backbone of traditional microbiology and microbial genetics. The concept of open resource metagenomics includes preparation of physical DNA libraries, preferably in versatile vectors that facilitate screening in a diversity of host organisms, and pooling of clones so that single aliquots containing complete libraries can be easily distributed upon request. Database deposition of associated metadata and sequence data for each library provides researchers with information to select the most appropriate libraries for further research projects. As a starting point, we have established the Canadian MetaMicroBiome Library (CM(2)BL [1]). The CM(2)BL is a publicly accessible collection of cosmid libraries containing environmental DNA from soils collected from across Canada, spanning multiple biomes. The libraries were constructed such that the cloned DNA can be easily transferred to Gateway® compliant vectors, facilitating functional screening in virtually any surrogate microbial host for which there are available plasmid vectors. The libraries, which we are placing in the public domain, will be distributed upon request without restriction to members of both the academic research community and industry. This article invites the scientific community to adopt this philosophy of open resource metagenomics to extend the utility of functional metagenomics beyond initial publication, circumventing the need to start from scratch with each new research project.
Open resource metagenomics: a model for sharing metagenomic libraries
Neufeld, J.D.; Engel, K.; Cheng, J.; Moreno-Hagelsieb, G.; Rose, D.R.; Charles, T.C.
2011-01-01
Both sequence-based and activity-based exploitation of environmental DNA have provided unprecedented access to the genomic content of cultivated and uncultivated microorganisms. Although researchers deposit microbial strains in culture collections and DNA sequences in databases, activity-based metagenomic studies typically only publish sequences from the hits retrieved from specific screens. Physical metagenomic libraries, conceptually similar to entire sequence datasets, are usually not straightforward to obtain by interested parties subsequent to publication. In order to facilitate unrestricted distribution of metagenomic libraries, we propose the adoption of open resource metagenomics, in line with the trend towards open access publishing, and similar to culture- and mutant-strain collections that have been the backbone of traditional microbiology and microbial genetics. The concept of open resource metagenomics includes preparation of physical DNA libraries, preferably in versatile vectors that facilitate screening in a diversity of host organisms, and pooling of clones so that single aliquots containing complete libraries can be easily distributed upon request. Database deposition of associated metadata and sequence data for each library provides researchers with information to select the most appropriate libraries for further research projects. As a starting point, we have established the Canadian MetaMicroBiome Library (CM2BL [1]). The CM2BL is a publicly accessible collection of cosmid libraries containing environmental DNA from soils collected from across Canada, spanning multiple biomes. The libraries were constructed such that the cloned DNA can be easily transferred to Gateway® compliant vectors, facilitating functional screening in virtually any surrogate microbial host for which there are available plasmid vectors. The libraries, which we are placing in the public domain, will be distributed upon request without restriction to members of both the academic research community and industry. This article invites the scientific community to adopt this philosophy of open resource metagenomics to extend the utility of functional metagenomics beyond initial publication, circumventing the need to start from scratch with each new research project. PMID:22180823
PuLSE: Quality control and quantification of peptide sequences explored by phage display libraries.
Shave, Steven; Mann, Stefan; Koszela, Joanna; Kerr, Alastair; Auer, Manfred
2018-01-01
The design of highly diverse phage display libraries is based on assumption that DNA bases are incorporated at similar rates within the randomized sequence. As library complexity increases and expected copy numbers of unique sequences decrease, the exploration of library space becomes sparser and the presence of truly random sequences becomes critical. We present the program PuLSE (Phage Library Sequence Evaluation) as a tool for assessing randomness and therefore diversity of phage display libraries. PuLSE runs on a collection of sequence reads in the fastq file format and generates tables profiling the library in terms of unique DNA sequence counts and positions, translated peptide sequences, and normalized 'expected' occurrences from base to residue codon frequencies. The output allows at-a-glance quantitative quality control of a phage library in terms of sequence coverage both at the DNA base and translated protein residue level, which has been missing from toolsets and literature. The open source program PuLSE is available in two formats, a C++ source code package for compilation and integration into existing bioinformatics pipelines and precompiled binaries for ease of use.
Kröber, Magdalena; Bekel, Thomas; Diaz, Naryttza N; Goesmann, Alexander; Jaenicke, Sebastian; Krause, Lutz; Miller, Dimitri; Runte, Kai J; Viehöver, Prisca; Pühler, Alfred; Schlüter, Andreas
2009-06-01
The phylogenetic structure of the microbial community residing in a fermentation sample from a production-scale biogas plant fed with maize silage, green rye and liquid manure was analysed by an integrated approach using clone library sequences and metagenome sequence data obtained by 454-pyrosequencing. Sequencing of 109 clones from a bacterial and an archaeal 16S-rDNA amplicon library revealed that the obtained nucleotide sequences are similar but not identical to 16S-rDNA database sequences derived from different anaerobic environments including digestors and bioreactors. Most of the bacterial 16S-rDNA sequences could be assigned to the phylum Firmicutes with the most abundant class Clostridia and to the class Bacteroidetes, whereas most archaeal 16S-rDNA sequences cluster close to the methanogen Methanoculleus bourgensis. Further sequences of the archaeal library most probably represent so far non-characterised species within the genus Methanoculleus. A similar result derived from phylogenetic analysis of mcrA clone sequences. The mcrA gene product encodes the alpha-subunit of methyl-coenzyme-M reductase involved in the final step of methanogenesis. BLASTn analysis applying stringent settings resulted in assignment of 16S-rDNA metagenome sequence reads to 62 16S-rDNA amplicon sequences thus enabling frequency of abundance estimations for 16S-rDNA clone library sequences. Ribosomal Database Project (RDP) Classifier processing of metagenome 16S-rDNA reads revealed abundance of the phyla Firmicutes, Bacteroidetes and Euryarchaeota and the orders Clostridiales, Bacteroidales and Methanomicrobiales. Moreover, a large fraction of 16S-rDNA metagenome reads could not be assigned to lower taxonomic ranks, demonstrating that numerous microorganisms in the analysed fermentation sample of the biogas plant are still unclassified or unknown.
Gadkar, Vijay J; Filion, Martin
2013-06-01
In various experimental systems, limiting available amounts of RNA may prevent a researcher from performing large-scale analyses of gene transcripts. One way to circumvent this is to 'pre-amplify' the starting RNA/cDNA, so that sufficient amounts are available for any downstream analysis. In the present study, we report the development of a novel protocol for constructing amplified cDNA libraries using the Phi29 DNA polymerase based multiple displacement amplification (MDA) system. Using as little as 200 ng of total RNA, we developed a linear concatenation strategy to make the single-stranded cDNA template amenable for MDA. The concatenation, made possible by the template switching property of the reverse transcriptase enzyme, resulted in the amplified cDNA library with intact 5' ends. MDA generated micrograms of template, allowing large-scale polymerase chain reaction analyses or other large-scale downstream applications. As the amplified cDNA library contains intact 5' ends, it is also compatible with 5' RACE analyses of specific gene transcripts. Empirical validation of this protocol is demonstrated on a highly characterized (tomato) and an uncharacterized (corn gromwell) experimental system.
[Current applications of high-throughput DNA sequencing technology in antibody drug research].
Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong
2012-03-01
Since the publication of a high-throughput DNA sequencing technology based on PCR reaction was carried out in oil emulsions in 2005, high-throughput DNA sequencing platforms have been evolved to a robust technology in sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation of discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we present a review of current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach of discovery and development of antibody drugs.
NASA Astrophysics Data System (ADS)
Chen, Juan; Zhu, Tianjiao; Li, Dehai; Cui, Chengbin; Fang, Yuchun; Liu, Hongbing; Liu, Peipei; Gu, Qianqun; Zhu, Weiming
2006-04-01
To study the bioactive metabolites produced by sponge-derived uncultured symbionts, a metagenomic DNA library of the symbionts of sponge Gelliodes gracilis was constructed. The average size of DNA inserts in the library was 20 kb. This library was screened for antibiotic activity using paper dise assaying. Two clones displayed the antibacterial activity against Micrococcus tetragenus. The metabolites of these two clones were analyzed through HPLC. The result showed that their metabolites were quite different from those of the host E. coli DH5α and the host containing vector pHZ132. This study may present a new approach to exploring bioactive metabolites of sponge symbionts.
LeProust, Emily M.; Peck, Bill J.; Spirin, Konstantin; McCuen, Heather Brummel; Moore, Bridget; Namsaraev, Eugeni; Caruthers, Marvin H.
2010-01-01
We have achieved the ability to synthesize thousands of unique, long oligonucleotides (150mers) in fmol amounts using parallel synthesis of DNA on microarrays. The sequence accuracy of the oligonucleotides in such large-scale syntheses has been limited by the yields and side reactions of the DNA synthesis process used. While there has been significant demand for libraries of long oligos (150mer and more), the yields in conventional DNA synthesis and the associated side reactions have previously limited the availability of oligonucleotide pools to lengths <100 nt. Using novel array based depurination assays, we show that the depurination side reaction is the limiting factor for the synthesis of libraries of long oligonucleotides on Agilent Technologies’ SurePrint® DNA microarray platform. We also demonstrate how depurination can be controlled and reduced by a novel detritylation process to enable the synthesis of high quality, long (150mer) oligonucleotide libraries and we report the characterization of synthesis efficiency for such libraries. Oligonucleotide libraries prepared with this method have changed the economics and availability of several existing applications (e.g. targeted resequencing, preparation of shRNA libraries, site-directed mutagenesis), and have the potential to enable even more novel applications (e.g. high-complexity synthetic biology). PMID:20308161
Characterization of Bleomycin-Mediated Cleavage of a Hairpin DNA Library
Segerman, Zachary J.; Roy, Basab; Hecht, Sidney M.
2013-01-01
A study of BLM A5 was conducted using a previously isolated library of hairpin DNAs found to bind strongly to metal free BLM. The ability of Fe(II)•BLM to effect cleavage on both the 3' and 5'-arms of the hairpin DNAs was characterized. The strongly bound DNAs were found to be efficient substrates for Fe•BLM A5-mediated hairpin DNA cleavage. Surprisingly, the most prevalent site of BLM-mediated cleavage was found to be the 5′-AT-3′ dinucleotide sequence. This dinucleotide sequence, and other sequences generally not cleaved well by BLM when examined using arbitrarily chosen DNA substrates, were apparent when examining the library of ten hairpin DNAs. In total, 132 sites of DNA cleavage were produced by exposure of the hairpin DNA library to Fe•BLM A5. The existence of multiple sites of cleavage on both the 3′- and 5′-arms of the hairpin DNAs suggested that some of these might be double-strand cleavage events. Accordingly, an assay was developed with which to test the propensity of the hairpin DNAs to undergo double-strand DNA damage. One hairpin DNA was characterized using this method, and gave results consistent with earlier reports of double-strand DNA cleavage, but with a sequence selectivity different from those reported previously. PMID:23834496
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hadano, S.; Ishida, Y.; Tomiyasu, H.
1994-09-01
To complete a transcription map of the 1 Mb region in human chromosome 4p16.3 containing the Huntington disease (HD) gene, the isolation of cDNA clones are being performed throughout. Our method relies on a direct screening of the cDNA libraries probed with single copy microclones from 3 YAC clones spanning 1 Mbp of the HD gene region. AC-DNAs were isolated by a preparative pulsed-field gel electrophoresis, amplified by both a single unique primer (SUP)-PCR and a linker ligation PCR, and 6 microclone-DNA libraries were generated. Then, 8,640 microclones from these libraries were independently amplified by PCR, and arrayed onto themore » membranes. 800-900 microclones that were not cross-hybridized with total human and yeast genomic DNA, TAC vector DNA, and ribosomal cDNA on a dot hybridization (putatively carrying single copy sequences) were pooled to make 9 probe pools. A total of {approximately}1.8x10{sup 7} plaques from the human brain cDNA libraries was screened with 9 pool-probes, and then 672 positive cDNA clones were obtained. So far, 597 cDNA clones were defined and arrayed onto a map of the 1 Mbp of the HD gene region by hybridization with HD region-specific cosmid contigs and YAC clones. Further characterization including a DNA sequencing and Northern blot analysis is currently underway.« less
Recent advances on the encoding and selection methods of DNA-encoded chemical library.
Shi, Bingbing; Zhou, Yu; Huang, Yiran; Zhang, Jianfu; Li, Xiaoyu
2017-02-01
DNA-encoded chemical library (DEL) has emerged as a powerful and versatile tool for ligand discovery in chemical biology research and in drug discovery. Encoding and selection methods are two of the most important technological aspects of DEL that can dictate the performance and utilities of DELs. In this digest, we have summarized recent advances on the encoding and selection strategies of DEL and also discussed the latest developments on DNA-encoded dynamic library, a new frontier in DEL research. Copyright © 2016 Elsevier Ltd. All rights reserved.
Jiao, Lichao; Yu, Min; Wiedenhoeft, Alex C; He, Tuo; Li, Jianing; Liu, Bo; Jiang, Xiaomei; Yin, Yafang
2018-01-31
DNA barcoding has been proposed as a useful tool for forensic wood identification and development of a reliable DNA reference library is an essential first step. Xylaria (wood collections) are potentially enormous data repositories if DNA information could be extracted from wood specimens. In this study, 31 xylarium wood specimens and 8 leaf specimens of six important commercial species of Pterocarpus were selected to investigate the reliability of DNA barcodes for authentication at the species level and to determine the feasibility of building wood DNA barcode reference libraries from xylarium specimens. Four DNA barcodes (ITS2, matK, ndhF-rpl32 and rbcL) and their combination were tested to evaluate their discrimination ability for Pterocarpus species with both TaxonDNA and tree-based analytical methods. The results indicated that the combination barcode of matK + ndhF-rpl32 + ITS2 yielded the best discrimination for the Pterocarpus species studied. The mini-barcode ndhF-rpl32 (167-173 bps) performed well distinguishing P. santalinus from its wood anatomically inseparable species P. tinctorius. Results from this study verified not only the feasibility of building DNA barcode libraries using xylarium wood specimens, but the importance of using wood rather than leaves as the source tissue, when wood is the botanical material to be identified.
NASA Astrophysics Data System (ADS)
Litovchick, Alexander; Dumelin, Christoph E.; Habeshian, Sevan; Gikunju, Diana; Guié, Marie-Aude; Centrella, Paolo; Zhang, Ying; Sigel, Eric A.; Cuozzo, John W.; Keefe, Anthony D.; Clark, Matthew A.
2015-06-01
A chemical ligation method for construction of DNA-encoded small-molecule libraries has been developed. Taking advantage of the ability of the Klenow fragment of DNA polymerase to accept templates with triazole linkages in place of phosphodiesters, we have designed a strategy for chemically ligating oligonucleotide tags using cycloaddition chemistry. We have utilized this strategy in the construction and selection of a small molecule library, and successfully identified inhibitors of the enzyme soluble epoxide hydrolase.
Physical mapping of complex genomes
Evans, G.A.
1993-06-15
A method for the simultaneous identification of overlapping cosmid clones among multiple cosmid clones and the use of the method for mapping complex genomes are provided. A library of cosmid clones that contains the DNA to be mapped is constructed and arranged in a manner such that individual clones can be identified and replicas of the arranged clones prepared. In preferred embodiments, the clones are arranged in a two dimensional matrix. In such embodiments, the cosmid clones in a row are pooled, mixed probes complementary to the ends of the DNA inserts in the pooled clones are synthesized, hybridized to a first replica of the library. Hybridizing clones, which include the pooled row, are identified. A second portion of clones is prepared by pooling cosmid clones that correspond to a column in the matrix. The second pool thereby includes one clone from the first portion pooled clones. This common clone is located on the replica at the intersection of the column and row. Mixed probes complementary to the ends of the DNA inserts in the second pooled portion of clones are prepared and hybridized to a second replica of the library. The hybridization pattern on the first and second replicas of the library are compared and cross-hybridizing clones, other than the clones in the pooled column and row, that hybridize to identical clones in the first and second replicas are identified. These clones necessarily include DNA inserts that overlap with the DNA insert in the common clone located at the intersection of the pooled row and pooled column. The DNA in the entire library may be mapped by pooling the clones in each of the rows and columns of the matrix, preparing mixed end-specific probes and hybridizing the probes from each row or column to a replica of the library. Since all clones in the library are located at the intersection of a column and a row, the overlapping clones for all clones in the library may be identified and a physical map constructed.
A Mini-Library of Sequenced Human DNA Fragments: Linking Bench Experiments with Informatics
ERIC Educational Resources Information Center
Dalgleish, Raymond; Shanks, Morag E.; Monger, Karen; Butler, Nicola J.
2012-01-01
We describe the development of a mini-library of human DNA fragments for use in an enquiry-based learning (EBL) undergraduate practical incorporating "wet-lab" and bioinformatics tasks. In spite of the widespread emergence of the polymerase chain reaction (PCR), the cloning and analysis of DNA fragments in "Escherichia coli"…
Shuffle Optimizer: A Program to Optimize DNA Shuffling for Protein Engineering.
Milligan, John N; Garry, Daniel J
2017-01-01
DNA shuffling is a powerful tool to develop libraries of variants for protein engineering. Here, we present a protocol to use our freely available and easy-to-use computer program, Shuffle Optimizer. Shuffle Optimizer is written in the Python computer language and increases the nucleotide homology between two pieces of DNA desired to be shuffled together without changing the amino acid sequence. In addition we also include sections on optimal primer design for DNA shuffling and library construction, a small-volume ultrasonicator method to create sheared DNA, and finally a method to reassemble the sheared fragments and recover and clone the library. The Shuffle Optimizer program and these protocols will be useful to anyone desiring to perform any of the nucleotide homology-dependent shuffling methods.
Satz, Alexander L
2016-07-11
Simulated screening of DNA encoded libraries indicates that the presence of truncated byproducts complicates the relationship between library member enrichment and equilibrium association constant (these truncates result from incomplete chemical reactions during library synthesis). Further, simulations indicate that some patterns observed in reported experimental data may result from the presence of truncated byproducts in the library mixture and not structure-activity relationships. Potential experimental methods of minimizing the presence of truncates are assessed via simulation; the relationship between enrichment and equilibrium association constant for libraries of differing purities is investigated. Data aggregation techniques are demonstrated that allow for more accurate analysis of screening results, in particular when the screened library contains significant quantities of truncates.
Chemical Space of DNA-Encoded Libraries.
Franzini, Raphael M; Randolph, Cassie
2016-07-28
In recent years, DNA-encoded chemical libraries (DECLs) have attracted considerable attention as a potential discovery tool in drug development. Screening encoded libraries may offer advantages over conventional hit discovery approaches and has the potential to complement such methods in pharmaceutical research. As a result of the increased application of encoded libraries in drug discovery, a growing number of hit compounds are emerging in scientific literature. In this review we evaluate reported encoded library-derived structures and identify general trends of these compounds in relation to library design parameters. We in particular emphasize the combinatorial nature of these libraries. Generally, the reported molecules demonstrate the ability of this technology to afford hits suitable for further lead development, and on the basis of them, we derive guidelines for DECL design.
Eberwine, James; Bartfai, Tamas
2011-01-01
We report on an ‘unbiased’ molecular characterization of individual, adult neurons, active in a central, anterior hypothalamic neuronal circuit, by establishing cDNA libraries from each individual, electrophysiologically identified warm sensitive neuron (WSN). The cDNA libraries were analyzed by Affymetrix microarray. The presence and frequency of cDNAs was confirmed and enhanced with Illumina sequencing of each single cell cDNA library. cDNAs encoding the GABA biosynthetic enzyme. GAD1 and of adrenomedullin, galanin, prodynorphin, somatostatin, and tachykinin were found in the WSNs. The functional cellular and in vivo studies on dozens of the more than 500 neurotransmitter -, hormone- receptors and ion channels, whose cDNA was identified and sequence confirmed, suggest little or no discrepancy between the transcriptional and functional data in WSNs; whenever agonists were available for a receptor whose cDNA was identified, a functional response was found.. Sequencing single neuron libraries permitted identification of rarely expressed receptors like the insulin receptor, adiponectin receptor2 and of receptor heterodimers; information that is lost when pooling cells leads to dilution of signals and mixing signals. Despite the common electrophysiological phenotype and uniform GAD1 expression, WSN- transcriptomes show heterogenity, suggesting strong epigenetic influence on the transcriptome. Our study suggests that it is well-worth interrogating the cDNA libraries of single neurons by sequencing and chipping. PMID:20970451
Seong, Ki Moon; Park, Hweon; Kim, Seong Jung; Ha, Hyo Nam; Lee, Jae Yung; Kim, Joon
2007-06-01
A yeast transcriptional activator, Gcn4p, induces the expression of genes that are involved in amino acid and purine biosynthetic pathways under amino acid starvation. Gcn4p has an acidic activation domain in the central region and a bZIP domain in the C-terminus that is divided into the DNA-binding motif and dimerization leucine zipper motif. In order to identify amino acids in the DNA-binding motif of Gcn4p which are involved in transcriptional activation, we constructed mutant libraries in the DNA-binding motif through an innovative application of random mutagenesis. Mutant library made by oligonucleotides which were mutated randomly using the Poisson distribution showed that the actual mutation frequency was in good agreement with expected values. This method could save the time and effort to create a mutant library with a predictable mutation frequency. Based on the studies using the mutant libraries constructed by the new method, the specific residues of the DNA-binding domain in Gcn4p appear to be involved in the transcriptional activities on a conserved binding site.
Libraries of Synthetic TALE-Activated Promoters: Methods and Applications.
Schreiber, T; Tissier, A
2016-01-01
The discovery of proteins with programmable DNA-binding specificities triggered a whole array of applications in synthetic biology, including genome editing, regulation of transcription, and epigenetic modifications. Among those, transcription activator-like effectors (TALEs) due to their natural function as transcription regulators, are especially well-suited for the development of orthogonal systems for the control of gene expression. We describe here the construction and testing of libraries of synthetic TALE-activated promoters which are under the control of a single TALE with a given DNA-binding specificity. These libraries consist of a fixed DNA-binding element for the TALE, a TATA box, and variable sequences of 19 bases upstream and 43 bases downstream of the DNA-binding element. These libraries were cloned using a Golden Gate cloning strategy making them usable as standard parts in a modular cloning system. The broad range of promoter activities detected and the versatility of these promoter libraries make them valuable tools for applications in the fine-tuning of expression in metabolic engineering projects or in the design and implementation of regulatory circuits. © 2016 Elsevier Inc. All rights reserved.
Systematic cloning of human minisatellites from ordered array charomid libraries.
Armour, J A; Povey, S; Jeremiah, S; Jeffreys, A J
1990-11-01
We present a rapid and efficient method for the isolation of minisatellite loci from human DNA. The method combines cloning a size-selected fraction of human MboI DNA fragments in a charomid vector with hybridization screening of the library in ordered array. Size-selection of large MboI fragments enriches for the longer, more variable minisatellites and reduces the size of the library required. The library was screened with a series of multi-locus probes known to detect a large number of hypervariable loci in human DNA. The gridded library allowed both the rapid processing of positive clones and the comparative evaluation of the different multi-locus probes used, in terms of both the relative success in detecting hypervariable loci and the degree of overlap between the sets of loci detected. We report 23 new human minisatellite loci isolated by this method, which map to 14 autosomes and the sex chromosomes.
BOKP: A DNA Barcode Reference Library for Monitoring Herbal Drugs in the Korean Pharmacopeia
Liu, Jinxin; Shi, Linchun; Song, Jingyuan; Sun, Wei; Han, Jianping; Liu, Xia; Hou, Dianyun; Yao, Hui; Li, Mingyue; Chen, Shilin
2017-01-01
Herbal drug authentication is an important task in traditional medicine; however, it is challenged by the limitations of traditional authentication methods and the lack of trained experts. DNA barcoding is conspicuous in almost all areas of the biological sciences and has already been added to the British pharmacopeia and Chinese pharmacopeia for routine herbal drug authentication. However, DNA barcoding for the Korean pharmacopeia still requires significant improvements. Here, we present a DNA barcode reference library for herbal drugs in the Korean pharmacopeia and developed a species identification engine named KP-IDE to facilitate the adoption of this DNA reference library for the herbal drug authentication. Using taxonomy records, specimen records, sequence records, and reference records, KP-IDE can identify an unknown specimen. Currently, there are 6,777 taxonomy records, 1,054 specimen records, 30,744 sequence records (ITS2 and psbA-trnH) and 285 reference records. Moreover, 27 herbal drug materials were collected from the Seoul Yangnyeongsi herbal medicine market to give an example for real herbal drugs authentications. Our study demonstrates the prospects of the DNA barcode reference library for the Korean pharmacopeia and provides future directions for the use of DNA barcoding for authenticating herbal drugs listed in other modern pharmacopeias. PMID:29326593
Ludgate, Jackie L; Wright, James; Stockwell, Peter A; Morison, Ian M; Eccles, Michael R; Chatterjee, Aniruddha
2017-08-31
Formalin fixed paraffin embedded (FFPE) tumor samples are a major source of DNA from patients in cancer research. However, FFPE is a challenging material to work with due to macromolecular fragmentation and nucleic acid crosslinking. FFPE tissue particularly possesses challenges for methylation analysis and for preparing sequencing-based libraries relying on bisulfite conversion. Successful bisulfite conversion is a key requirement for sequencing-based methylation analysis. Here we describe a complete and streamlined workflow for preparing next generation sequencing libraries for methylation analysis from FFPE tissues. This includes, counting cells from FFPE blocks and extracting DNA from FFPE slides, testing bisulfite conversion efficiency with a polymerase chain reaction (PCR) based test, preparing reduced representation bisulfite sequencing libraries and massively parallel sequencing. The main features and advantages of this protocol are: An optimized method for extracting good quality DNA from FFPE tissues. An efficient bisulfite conversion and next generation sequencing library preparation protocol that uses 50 ng DNA from FFPE tissue. Incorporation of a PCR-based test to assess bisulfite conversion efficiency prior to sequencing. We provide a complete workflow and an integrated protocol for performing DNA methylation analysis at the genome-scale and we believe this will facilitate clinical epigenetic research that involves the use of FFPE tissue.
Kuzmina, Maria L; Braukmann, Thomas W A; Fazekas, Aron J; Graham, Sean W; Dewaard, Stephanie L; Rodrigues, Anuar; Bennett, Bruce A; Dickinson, Timothy A; Saarela, Jeffery M; Catling, Paul M; Newmaster, Steven G; Percy, Diana M; Fenneman, Erin; Lauron-Moreau, Aurélien; Ford, Bruce; Gillespie, Lynn; Subramanyam, Ragupathy; Whitton, Jeannette; Jennings, Linda; Metsger, Deborah; Warne, Connor P; Brown, Allison; Sears, Elizabeth; Dewaard, Jeremy R; Zakharov, Evgeny V; Hebert, Paul D N
2017-12-01
Constructing complete, accurate plant DNA barcode reference libraries can be logistically challenging for large-scale floras. Here we demonstrate the promise and challenges of using herbarium collections for building a DNA barcode reference library for the vascular plant flora of Canada. Our study examined 20,816 specimens representing 5076 of 5190 vascular plant species in Canada (98%). For 98% of the specimens, at least one of the DNA barcode regions was recovered from the plastid loci rbcL and matK and from the nuclear ITS2 region. We used beta regression to quantify the effects of age, type of preservation, and taxonomic affiliation (family) on DNA sequence recovery. Specimen age and method of preservation had significant effects on sequence recovery for all markers, but influenced some families more (e.g., Boraginaceae) than others (e.g., Asteraceae). Our DNA barcode library represents an unparalleled resource for metagenomic and ecological genetic research working on temperate and arctic biomes. An observed decline in sequence recovery with specimen age may be associated with poor primer matches, intragenomic variation (for ITS2), or inhibitory secondary compounds in some taxa.
Kuzmina, Maria L.; Braukmann, Thomas W. A.; Fazekas, Aron J.; Graham, Sean W.; Dewaard, Stephanie L.; Rodrigues, Anuar; Bennett, Bruce A.; Dickinson, Timothy A.; Saarela, Jeffery M.; Catling, Paul M.; Newmaster, Steven G.; Percy, Diana M.; Fenneman, Erin; Lauron-Moreau, Aurélien; Ford, Bruce; Gillespie, Lynn; Subramanyam, Ragupathy; Whitton, Jeannette; Jennings, Linda; Metsger, Deborah; Warne, Connor P.; Brown, Allison; Sears, Elizabeth; Dewaard, Jeremy R.; Zakharov, Evgeny V.; Hebert, Paul D. N.
2017-01-01
Premise of the study: Constructing complete, accurate plant DNA barcode reference libraries can be logistically challenging for large-scale floras. Here we demonstrate the promise and challenges of using herbarium collections for building a DNA barcode reference library for the vascular plant flora of Canada. Methods: Our study examined 20,816 specimens representing 5076 of 5190 vascular plant species in Canada (98%). For 98% of the specimens, at least one of the DNA barcode regions was recovered from the plastid loci rbcL and matK and from the nuclear ITS2 region. We used beta regression to quantify the effects of age, type of preservation, and taxonomic affiliation (family) on DNA sequence recovery. Results: Specimen age and method of preservation had significant effects on sequence recovery for all markers, but influenced some families more (e.g., Boraginaceae) than others (e.g., Asteraceae). Discussion: Our DNA barcode library represents an unparalleled resource for metagenomic and ecological genetic research working on temperate and arctic biomes. An observed decline in sequence recovery with specimen age may be associated with poor primer matches, intragenomic variation (for ITS2), or inhibitory secondary compounds in some taxa. PMID:29299394
Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger
Liu, Chang-Qing; Lu, Tao-Feng; Feng, Bao-Gang; Liu, Dan; Guan, Wei-Jun; Ma, Yue-Hui
2010-01-01
In this study we successfully constructed a full-length cDNA library from Siberian tiger, Panthera tigris altaica, the most well-known wild Animal. Total RNA was extracted from cultured Siberian tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.30×106 pfu/ml and 1.62×109 pfu/ml respectively. The proportion of recombinants from unamplified library was 90.5% and average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142bps were then analyzed the BLASTX score revealed that 53.9% of the sequences were classified as strong match, 38.6% as nominal and 7.4% as weak match. 28.0% of them were found to be related to enzyme/catalytic protein, 20.9% ESTs to metabolism, 13.1% ESTs to transport, 12.1% ESTs to signal transducer/cell communication, 9.9% ESTs to structure protein, 3.9% ESTs to immunity protein/defense metabolism, 3.2% ESTs to cell cycle, and 8.9 ESTs classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genomic research of Siberian tigers. PMID:20941376
Ståhlberg, Anders; Krzyzanowski, Paul M; Jackson, Jennifer B; Egyud, Matthew; Stein, Lincoln; Godfrey, Tony E
2016-06-20
Detection of cell-free DNA in liquid biopsies offers great potential for use in non-invasive prenatal testing and as a cancer biomarker. Fetal and tumor DNA fractions however can be extremely low in these samples and ultra-sensitive methods are required for their detection. Here, we report an extremely simple and fast method for introduction of barcodes into DNA libraries made from 5 ng of DNA. Barcoded adapter primers are designed with an oligonucleotide hairpin structure to protect the molecular barcodes during the first rounds of polymerase chain reaction (PCR) and prevent them from participating in mis-priming events. Our approach enables high-level multiplexing and next-generation sequencing library construction with flexible library content. We show that uniform libraries of 1-, 5-, 13- and 31-plex can be generated. Utilizing the barcodes to generate consensus reads for each original DNA molecule reduces background sequencing noise and allows detection of variant alleles below 0.1% frequency in clonal cell line DNA and in cell-free plasma DNA. Thus, our approach bridges the gap between the highly sensitive but specific capabilities of digital PCR, which only allows a limited number of variants to be analyzed, with the broad target capability of next-generation sequencing which traditionally lacks the sensitivity to detect rare variants. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Yang, XinChao; Li, MengHui; Liu, JianHua; Ji, YiHong; Li, XiangRui; Xu, LiXin; Yan, RuoFeng; Song, XiaoKai
2017-02-16
Eimeria maxima is one of the most prevalent Eimeria species causing avian coccidiosis, and results in huge economic loss to the global poultry industry. Current control strategies, such as anti-coccidial medication and live vaccines have been limited because of their drawbacks. The third generation anticoccidial vaccines including the recombinant vaccines as well as DNA vaccines have been suggested as a promising alternative strategy. To date, only a few protective antigens of E. maxima have been reported. Hence, there is an urgent need to identify novel protective antigens of E. maxima for the development of neotype anticoccidial vaccines. With the aim of identifying novel protective genes of E. maxima, a cDNA expression library of E. maxima sporozoites was constructed using Gateway technology. Subsequently, the cDNA expression library was divided into 15 sub-libraries for cDNA expression library immunization (cDELI) using parasite challenged model in chickens. Protective sub-libraries were selected for the next round of screening until individual protective clones were obtained, which were further sequenced and analyzed. Adopting the Gateway technology, a high-quality entry library was constructed, containing 9.2 × 10 6 clones with an average inserted fragments length of 1.63 kb. The expression library capacity was 2.32 × 10 7 colony-forming units (cfu) with an average inserted fragments length of 1.64 Kb. The expression library was screened using parasite challenged model in chickens. The screening yielded 6 immune protective genes including four novel protective genes of EmJS-1, EmRP, EmHP-1 and EmHP-2, and two known protective genes of EmSAG and EmCKRS. EmJS-1 is the selR domain-containing protein of E. maxima whose function is unknown. EmHP-1 and EmHP-2 are the hypothetical proteins of E. maxima. EmRP and EmSAG are rhomboid-like protein and surface antigen glycoproteins of E. maxima respectively, and involved in invasion of the parasite. Our results provide a cDNA expression library for further screening of T cell stimulating or inhibiting antigens of E. maxima. Moreover, our results provide six candidate protective antigens for developing new vaccines against E. maxima.
DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis.
MacConnell, Andrew B; McEnaney, Patrick J; Cavett, Valerie J; Paegel, Brian M
2015-09-14
The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the "structure elucidation problem": the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very molecular features that confer binding potency and specificity, such as stereochemistry, regiochemistry, and scaffold rigidity, are conspicuously absent from most libraries because isomerism introduces mass redundancy and diverse scaffolds yield uninterpretable MS fragmentation. Here we present DNA-encoded solid-phase synthesis (DESPS), comprising parallel compound synthesis in organic solvent and aqueous enzymatic ligation of unprotected encoding dsDNA oligonucleotides. Computational encoding language design yielded 148 thermodynamically optimized sequences with Hamming string distance ≥ 3 and total read length <100 bases for facile sequencing. Ligation is efficient (70% yield), specific, and directional over 6 encoding positions. A series of isomers served as a testbed for DESPS's utility in split-and-pool diversification. Single-bead quantitative PCR detected 9 × 10(4) molecules/bead and sequencing allowed for elucidation of each compound's synthetic history. We applied DESPS to the combinatorial synthesis of a 75,645-member OBOC library containing scaffold, stereochemical and regiochemical diversity using mixed-scale resin (160-μm quality control beads and 10-μm screening beads). Tandem DNA sequencing/MALDI-TOF MS analysis of 19 quality control beads showed excellent agreement (<1 ppt) between DNA sequence-predicted mass and the observed mass. DESPS synergistically unites the advantages of solid-phase synthesis and DNA encoding, enabling single-bead structural elucidation of complex compounds and synthesis using reactions normally considered incompatible with unprotected DNA. The widespread availability of inexpensive oligonucleotide synthesis, enzymes, DNA sequencing, and PCR make implementation of DESPS straightforward, and may prompt the chemistry community to revisit the synthesis of more complex and diverse libraries.
Zhou, X.; Robinson, J.L.; Geraci, C.J.; Parker, C.R.; Flint, O.S.; Etnier, D.A.; Ruiter, D.; DeWalt, R.E.; Jacobus, L.M.; Hebert, P.D.N.
2011-01-01
Deoxyribonucleic acid (DNA) barcoding is an effective tool for species identification and lifestage association in a wide range of animal taxa. We developed a strategy for rapid construction of a regional DNA-barcode reference library and used the caddisflies (Trichoptera) of the Great Smoky Mountains National Park (GSMNP) as a model. Nearly 1000 cytochrome c oxidase subunit I (COI) sequences, representing 209 caddisfly species previously recorded from GSMNP, were obtained from the global Trichoptera Barcode of Life campaign. Most of these sequences were collected from outside the GSMNP area. Another 645 COI sequences, representing 80 species, were obtained from specimens collected in a 3-d bioblitz (short-term, intense sampling program) in GSMNP. The joint collections provided barcode coverage for 212 species, 91% of the GSMNP fauna. Inclusion of samples from other localities greatly expedited construction of the regional DNA-barcode reference library. This strategy increased intraspecific divergence and decreased average distances to nearest neighboring species, but the DNA-barcode library was able to differentiate 93% of the GSMNP Trichoptera species examined. Global barcoding projects will aid construction of regional DNA-barcode libraries, but local surveys make crucial contributions to progress by contributing rare or endemic species and full-length barcodes generated from high-quality DNA. DNA taxonomy is not a goal of our present work, but the investigation of COI divergence patterns in caddisflies is providing new insights into broader biodiversity patterns in this group and has directed attention to various issues, ranging from the need to re-evaluate species taxonomy with integrated morphological and molecular evidence to the necessity of an appropriate interpretation of barcode analyses and its implications in understanding species diversity (in contrast to a simple claim for barcoding failure).
Rhodes, Johanna; Beale, Mathew A; Fisher, Matthew C
2014-01-01
The industry of next-generation sequencing is constantly evolving, with novel library preparation methods and new sequencing machines being released by the major sequencing technology companies annually. The Illumina TruSeq v2 library preparation method was the most widely used kit and the market leader; however, it has now been discontinued, and in 2013 was replaced by the TruSeq Nano and TruSeq PCR-free methods, leaving a gap in knowledge regarding which is the most appropriate library preparation method to use. Here, we used isolates from the pathogenic fungi Cryptococcus neoformans var. grubii and sequenced them using the existing TruSeq DNA v2 kit (Illumina), along with two new kits: the TruSeq Nano DNA kit (Illumina) and the NEBNext Ultra DNA kit (New England Biolabs) to provide a comparison. Compared to the original TruSeq DNA v2 kit, both newer kits gave equivalent or better sequencing data, with increased coverage. When comparing the two newer kits, we found little difference in cost and workflow, with the NEBNext Ultra both slightly cheaper and faster than the TruSeq Nano. However, the quality of data generated using the TruSeq Nano DNA kit was superior due to higher coverage at regions of low GC content, and more SNPs identified. Researchers should therefore evaluate their resources and the type of application (and hence data quality) being considered when ultimately deciding on which library prep method to use.
Litovchick, Alexander; Dumelin, Christoph E.; Habeshian, Sevan; Gikunju, Diana; Guié, Marie-Aude; Centrella, Paolo; Zhang, Ying; Sigel, Eric A.; Cuozzo, John W.; Keefe, Anthony D.; Clark, Matthew A.
2015-01-01
A chemical ligation method for construction of DNA-encoded small-molecule libraries has been developed. Taking advantage of the ability of the Klenow fragment of DNA polymerase to accept templates with triazole linkages in place of phosphodiesters, we have designed a strategy for chemically ligating oligonucleotide tags using cycloaddition chemistry. We have utilized this strategy in the construction and selection of a small molecule library, and successfully identified inhibitors of the enzyme soluble epoxide hydrolase. PMID:26061191
Plans and progress for building a Great Lakes fauna DNA barcode reference library
DNA reference libraries provide researchers with an important tool for assessing regional biodiversity by allowing unknown genetic sequences to be assigned identities, while also providing a means for taxonomists to validate identifications. Expanding the representation of Great...
Isolation of a DNA Probe for Lactobacillus curvatus
Petrick, Hendrik A. R.; Ambrosio, Riccardo E.; Holzapfel, Wilhelm H.
1988-01-01
A genomic library of Lactobacillus curvatus DSM 20019 was constructed in bacteriophage λ gt11. A 1.2-kilobase DNA probe specific for L. curvatus was isolated from this library. When this probe was hybridized to DNA from Lactobacillus isolates from different sources classified by conventional techniques, differing degrees of hybridization were obtained. This could imply that these isolates may have been incorrectly classified. Images PMID:16347554
Eberwine, James; Bartfai, Tamas
2011-03-01
We report on an 'unbiased' molecular characterization of individual, adult neurons, active in a central, anterior hypothalamic neuronal circuit, by establishing cDNA libraries from each individual, electrophysiologically identified warm sensitive neuron (WSN). The cDNA libraries were analyzed by Affymetrix microarray. The presence and frequency of cDNAs were confirmed and enhanced with Illumina sequencing of each single cell cDNA library. cDNAs encoding the GABA biosynthetic enzyme Gad1 and of adrenomedullin, galanin, prodynorphin, somatostatin, and tachykinin were found in the WSNs. The functional cellular and in vivo studies on dozens of the more than 500 neurotransmitters, hormone receptors and ion channels, whose cDNA was identified and sequence confirmed, suggest little or no discrepancy between the transcriptional and functional data in WSNs; whenever agonists were available for a receptor whose cDNA was identified, a functional response was found. Sequencing single neuron libraries permitted identification of rarely expressed receptors like the insulin receptor, adiponectin receptor 2 and of receptor heterodimers; information that is lost when pooling cells leads to dilution of signals and mixing signals. Despite the common electrophysiological phenotype and uniform Gad1 expression, WSN transcriptomes show heterogeneity, suggesting strong epigenetic influence on the transcriptome. Our study suggests that it is well-worth interrogating the cDNA libraries of single neurons by sequencing and chipping. Copyright © 2010 Elsevier Inc. All rights reserved.
McCormick, M K; Campbell, E; Deaven, L; Moyzis, R
1993-01-01
Construction of chromosome-specific yeast artificial chromosome (YAC) libraries from sorted chromosomes was undertaken (i) to eliminate drawbacks associated with first-generation total genomic YAC libraries, such as the high frequency of chimeric YACs, and (ii) to provide an alternative method for generating chromosome-specific YAC libraries in addition to isolating such collections from a total genomic library. Chromosome-specific YAC libraries highly enriched for human chromosomes 16 and 21 were constructed. By maximizing the percentage of fragments with two ligatable ends and performing yeast transformations with less than saturating amounts of DNA in the presence of carrier DNA, YAC libraries with a low percentage of chimeric clones were obtained. The smaller number of YAC clones in these chromosome-specific libraries reduces the effort involved in PCR-based screening and allows hybridization methods to be a manageable screening approach. Images PMID:8430075
An efficient and sensitive method for preparing cDNA libraries from scarce biological samples
Sterling, Catherine H.; Veksler-Lublinsky, Isana; Ambros, Victor
2015-01-01
The preparation and high-throughput sequencing of cDNA libraries from samples of small RNA is a powerful tool to quantify known small RNAs (such as microRNAs) and to discover novel RNA species. Interest in identifying the small RNA repertoire present in tissues and in biofluids has grown substantially with the findings that small RNAs can serve as indicators of biological conditions and disease states. Here we describe a novel and straightforward method to clone cDNA libraries from small quantities of input RNA. This method permits the generation of cDNA libraries from sub-picogram quantities of RNA robustly, efficiently and reproducibly. We demonstrate that the method provides a significant improvement in sensitivity compared to previous cloning methods while maintaining reproducible identification of diverse small RNA species. This method should have widespread applications in a variety of contexts, including biomarker discovery from scarce samples of human tissue or body fluids. PMID:25056322
[Construction of fetal mesenchymal stem cell cDNA subtractive library].
Yang, Li; Wang, Dong-Mei; Li, Liang; Bai, Ci-Xian; Cao, Hua; Li, Ting-Yu; Pei, Xue-Tao
2002-04-01
To identify differentially expressed genes between fetal mesenchymal stem cell (MSC) and adult MSC, especially specified genes expressed in fetal MSC, a cDNA subtractive library of fetal MSC was constructed using suppression subtractive hybridization (SSH) technique. At first, total RNA was isolated from fetal and adult MSC. Using SMART PCR synthesis method, single-strand and double-strand cDNAs were synthesized. After Rsa I digestion, fetal MSC cDNAs were divided into two groups and ligated to adaptor 1 and adaptor 2 respectively. Results showed that the amplified library contains 890 clones. Analysis of 890 clones with PCR demonstrated that 768 clones were positive. The positive rate is 86.3%. The size of inserted fragments in these positive clones was between 0.2 - 1 kb, with an average of 400 - 600 bp. SSH is a convenient and effective method for screening differentially expressed genes. The constructed cDNA subtractive library of fetal MSC cDNA lays solid foundation for screening and cloning new and specific function related genes of fetal MSC.
Synthesis and cell-free cloning of DNA libraries using programmable microfluidics
Yehezkel, Tuval Ben; Rival, Arnaud; Raz, Ofir; Cohen, Rafael; Marx, Zipora; Camara, Miguel; Dubern, Jean-Frédéric; Koch, Birgit; Heeb, Stephan; Krasnogor, Natalio; Delattre, Cyril; Shapiro, Ehud
2016-01-01
Microfluidics may revolutionize our ability to write synthetic DNA by addressing several fundamental limitations associated with generating novel genetic constructs. Here we report the first de novo synthesis and cell-free cloning of custom DNA libraries in sub-microliter reaction droplets using programmable digital microfluidics. Specifically, we developed Programmable Order Polymerization (POP), Microfluidic Combinatorial Assembly of DNA (M-CAD) and Microfluidic In-vitro Cloning (MIC) and applied them to de novo synthesis, combinatorial assembly and cell-free cloning of genes, respectively. Proof-of-concept for these methods was demonstrated by programming an autonomous microfluidic system to construct and clone libraries of yeast ribosome binding sites and bacterial Azurine, which were then retrieved in individual droplets and validated. The ability to rapidly and robustly generate designer DNA molecules in an autonomous manner should have wide application in biological research and development. PMID:26481354
Identification of genes differentially expressed in association with acquired cisplatin resistance
Johnsson, A; Zeelenberg, I; Min, Y; Hilinski, J; Berry, C; Howell, S B; Los, G
2000-01-01
The goal of this study was to identify genes whose mRNA levels are differentially expressed in human cells with acquired cisplatin (cDDP) resistance. Using the parental UMSCC10b head and neck carcinoma cell line and the 5.9-fold cDDP-resistant subline, UMSCC10b/Pt-S15, two suppressive subtraction hybridization (SSH) cDNA libraries were prepared. One library represented mRNAs whose levels were increased in the cDDP resistant variant (the UP library), the other one represented mRNAs whose levels were decreased in the resistant cells (the DOWN library). Arrays constructed with inserts recovered from these libraries were hybridized with SSH products to identify truly differentially expressed elements. A total of 51 cDNA fragments present in the UP library and 16 in the DOWN library met the criteria established for differential expression. The sequences of 87% of these cDNA fragments were identified in Genbank. Among the mRNAs in the UP library that were frequently isolated and that showed high levels of differential expression were cytochrome oxidase I, ribosomal protein 28S, elongation factor 1α, α-enolase, stathmin, and HSP70. The approach taken in this study permitted identification of many genes never before linked to the cDDP-resistant phenotype. © 2000 Cancer Research Campaign PMID:10993653
Highly multiplexed targeted DNA sequencing from single nuclei.
Leung, Marco L; Wang, Yong; Kim, Charissa; Gao, Ruli; Jiang, Jerry; Sei, Emi; Navin, Nicholas E
2016-02-01
Single-cell DNA sequencing methods are challenged by poor physical coverage, high technical error rates and low throughput. To address these issues, we developed a single-cell DNA sequencing protocol that combines flow-sorting of single nuclei, time-limited multiple-displacement amplification (MDA), low-input library preparation, DNA barcoding, targeted capture and next-generation sequencing (NGS). This approach represents a major improvement over our previous single nucleus sequencing (SNS) Nature Protocols paper in terms of generating higher-coverage data (>90%), thereby enabling the detection of genome-wide variants in single mammalian cells at base-pair resolution. Furthermore, by pooling 48-96 single-cell libraries together for targeted capture, this approach can be used to sequence many single-cell libraries in parallel in a single reaction. This protocol greatly reduces the cost of single-cell DNA sequencing, and it can be completed in 5-6 d by advanced users. This single-cell DNA sequencing protocol has broad applications for studying rare cells and complex populations in diverse fields of biological research and medicine.
Hayashi, Yoshinobu; Shigenobu, Shuji; Watanabe, Dai; Toga, Kouhei; Saiki, Ryota; Shimada, Keisuke; Bourguignon, Thomas; Lo, Nathan; Hojo, Masaru; Maekawa, Kiyoto; Miura, Toru
2013-01-01
In termites, division of labor among castes, categories of individuals that perform specialized tasks, increases colony-level productivity and is the key to their ecological success. Although molecular studies on caste polymorphism have been performed in termites, we are far from a comprehensive understanding of the molecular basis of this phenomenon. To facilitate future molecular studies, we aimed to construct expressed sequence tag (EST) libraries covering wide ranges of gene repertoires in three representative termite species, Hodotermopsis sjostedti, Reticulitermes speratus and Nasutitermes takasagoensis. We generated normalized cDNA libraries from whole bodies, except for guts containing microbes, of almost all castes, sexes and developmental stages and sequenced them with the 454 GS FLX titanium system. We obtained >1.2 million quality-filtered reads yielding >400 million bases for each of the three species. Isotigs, which are analogous to individual transcripts, and singletons were produced by assembling the reads and annotated using public databases. Genes related to juvenile hormone, which plays crucial roles in caste differentiation of termites, were identified from the EST libraries by BLAST search. To explore the potential for DNA methylation, which plays an important role in caste differentiation of honeybees, tBLASTn searches for DNA methyltransferases (dnmt1, dnmt2 and dnmt3) and methyl-CpG binding domain (mbd) were performed against the EST libraries. All four of these genes were found in the H. sjostedti library, while all except dnmt3 were found in R. speratus and N. takasagoensis. The ratio of the observed to the expected CpG content (CpG O/E), which is a proxy for DNA methylation level, was calculated for the coding sequences predicted from the isotigs and singletons. In all of the three species, the majority of coding sequences showed depletion of CpG O/E (less than 1), and the distributions of CpG O/E were bimodal, suggesting the presence of DNA methylation.
Hayashi, Yoshinobu; Shigenobu, Shuji; Watanabe, Dai; Toga, Kouhei; Saiki, Ryota; Shimada, Keisuke; Bourguignon, Thomas; Lo, Nathan; Hojo, Masaru; Maekawa, Kiyoto; Miura, Toru
2013-01-01
In termites, division of labor among castes, categories of individuals that perform specialized tasks, increases colony-level productivity and is the key to their ecological success. Although molecular studies on caste polymorphism have been performed in termites, we are far from a comprehensive understanding of the molecular basis of this phenomenon. To facilitate future molecular studies, we aimed to construct expressed sequence tag (EST) libraries covering wide ranges of gene repertoires in three representative termite species, Hodotermopsis sjostedti , Reticulitermessperatus and Nasutitermestakasagoensis . We generated normalized cDNA libraries from whole bodies, except for guts containing microbes, of almost all castes, sexes and developmental stages and sequenced them with the 454 GS FLX titanium system. We obtained >1.2 million quality-filtered reads yielding >400 million bases for each of the three species. Isotigs, which are analogous to individual transcripts, and singletons were produced by assembling the reads and annotated using public databases. Genes related to juvenile hormone, which plays crucial roles in caste differentiation of termites, were identified from the EST libraries by BLAST search. To explore the potential for DNA methylation, which plays an important role in caste differentiation of honeybees, tBLASTn searches for DNA methyltransferases (dnmt1, dnmt2 and dnmt3) and methyl-CpG binding domain (mbd) were performed against the EST libraries. All four of these genes were found in the H . sjostedti library, while all except dnmt3 were found in R . speratus and N . takasagoensis . The ratio of the observed to the expected CpG content (CpG O/E), which is a proxy for DNA methylation level, was calculated for the coding sequences predicted from the isotigs and singletons. In all of the three species, the majority of coding sequences showed depletion of CpG O/E (less than 1), and the distributions of CpG O/E were bimodal, suggesting the presence of DNA methylation. PMID:24098800
Chemical Biology Probes from Advanced DNA-encoded Libraries.
Salamon, Hazem; Klika Škopić, Mateja; Jung, Kathrin; Bugain, Olivia; Brunschweiger, Andreas
2016-02-19
The identification of bioactive compounds is a crucial step toward development of probes for chemical biology studies. Screening of DNA-encoded small molecule libraries (DELs) has emerged as a validated technology to interrogate vast chemical space. DELs consist of chimeric molecules composed of a low-molecular weight compound that is conjugated to a DNA identifier tag. They are screened as pooled libraries using selection to identify "hits." Screening of DELs has identified numerous bioactive compounds. Some of these molecules were instrumental in gaining a deeper understanding of biological systems. One of the main challenges in the field is the development of synthesis methodology for DELs.
Physical mapping of complex genomes
Evans, Glen A.
1993-01-01
Method for simultaneous identification of overlapping cosmid clones among multiple cosmid clones and the use of the method for mapping complex genomes are provided. A library of cosmid clones that contains the DNA to be mapped is constructed and arranged in a manner such that individual clones can be identified and replicas of the arranged clones prepared. In preferred embodiments, the clones are arranged in a two dimensional matrix. In such embodiments, the cosmid clones in a row are pooled, mixed probes complementary to the ends of the DNA inserts int he pooled clones are synthesized, hybridized to a first replica of the library. Hybridizing clones, which include the pooled row, are identified. A second portion of clones is prepared by pooling cosmid clones that correspond to a column in the matrix. The second pool thereby includes one clone from the first portion pooled clones. This common clone is located on the replica at the intersection of the column and row. Mixed probes complementary to the ends of the DNA inserts in the second pooled portion of clones are prepared and hybridized to a second replica of the library. The hybridization pattern on the first and second replicas of the library are compared and cross-hybridizing clones, other than the clones in the pooled column and row, that hybridize to identical clones in the first and second replicas are identified. These clones necessarily include DNA inserts that overlap with the DNA insert int he common clone located at the intersection of the pooled row and pooled column. The DNA in the entire library may be mapped by pooling the clones in each of the rows and columns of the matrix, preparing mixed end-specific probes and hybridizing the probes from each row or column to a replica of the library. Since all clones in the library are located at the intersection of a column and a row, the overlapping clones for all clones in the library may be identified and a physical map constructed. In other preferred embodiments, the cosmid clones are arranged in a three dimensional matrix, pooled and compared in threes according to intersecting planes of the three dimensional matrix. Arrangements corresponding to geometries of higher dimensions may also be prepared and used to simultaneously identify overlapping clones in highly complex libraries with relatively few hybridization reactions.
ORF phage display to identify cellular proteins with different functions.
Li, Wei
2012-09-01
Open reading frame (ORF) phage display is a new branch of phage display aimed at improving its efficiency to identify cellular proteins with specific binding or functional activities. Despite the success of phage display with antibody libraries and random peptide libraries, phage display with cDNA libraries of cellular proteins identifies a high percentage of non-ORF clones encoding unnatural short peptides with minimal biological implications. This is mainly because of the uncontrollable reading frames of cellular proteins in conventional cDNA libraries. ORF phage display solves this problem by eliminating non-ORF clones to generate ORF cDNA libraries. Here I summarize the procedures of ORF phage display, discuss the factors influencing its efficiency, present examples of its versatile applications, and highlight evidence of its capability of identifying biologically relevant cellular proteins. ORF phage display coupled with different selection strategies is capable of delineating diverse functions of cellular proteins with unique advantages. Copyright © 2012 Elsevier Inc. All rights reserved.
Algorithms for optimizing cross-overs in DNA shuffling.
He, Lu; Friedman, Alan M; Bailey-Kellogg, Chris
2012-03-21
DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
Milnthorpe, Andrew T; Soloviev, Mikhail
2011-04-15
The Cancer Genome Anatomy Project (CGAP) xProfiler and cDNA Digital Gene Expression Displayer (DGED) have been made available to the scientific community over a decade ago and since then were used widely to find genes which are differentially expressed between cancer and normal tissues. The tissue types are usually chosen according to the ontology hierarchy developed by NCBI. The xProfiler uses an internally available flat file database to determine the presence or absence of genes in the chosen libraries, while cDNA DGED uses the publicly available UniGene Expression and Gene relational databases to count the sequences found for each gene in the presented libraries. We discovered that the CGAP approach often includes libraries from dependent or irrelevant tissues (one third of libraries were incorrect on average, with some tissue searches no correct libraries being selected at all). We also discovered that the CGAP approach reported genes from outside the selected libraries and may omit genes found within the libraries. Other errors include the incorrect estimation of the significance values and inaccurate settings for the library size cut-off values. We advocated a revised approach to finding libraries associated with tissues. In doing so, libraries from dependent or irrelevant tissues do not get included in the final library pool. We also revised the method for determining the presence or absence of a gene by searching the UniGene relational database, revised calculation of statistical significance and sorted the library cut-off filter. Our results justify re-evaluation of all previously reported results where NCBI CGAP expression data and tools were used.
2011-01-01
Background The Cancer Genome Anatomy Project (CGAP) xProfiler and cDNA Digital Gene Expression Displayer (DGED) have been made available to the scientific community over a decade ago and since then were used widely to find genes which are differentially expressed between cancer and normal tissues. The tissue types are usually chosen according to the ontology hierarchy developed by NCBI. The xProfiler uses an internally available flat file database to determine the presence or absence of genes in the chosen libraries, while cDNA DGED uses the publicly available UniGene Expression and Gene relational databases to count the sequences found for each gene in the presented libraries. Results We discovered that the CGAP approach often includes libraries from dependent or irrelevant tissues (one third of libraries were incorrect on average, with some tissue searches no correct libraries being selected at all). We also discovered that the CGAP approach reported genes from outside the selected libraries and may omit genes found within the libraries. Other errors include the incorrect estimation of the significance values and inaccurate settings for the library size cut-off values. We advocated a revised approach to finding libraries associated with tissues. In doing so, libraries from dependent or irrelevant tissues do not get included in the final library pool. We also revised the method for determining the presence or absence of a gene by searching the UniGene relational database, revised calculation of statistical significance and sorted the library cut-off filter. Conclusion Our results justify re-evaluation of all previously reported results where NCBI CGAP expression data and tools were used. PMID:21496233
Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL).
Koestler, Devin C; Jones, Meaghan J; Usset, Joseph; Christensen, Brock C; Butler, Rondi A; Kobor, Michael S; Wiencke, John K; Kelsey, Karl T
2016-03-08
Confounding due to cellular heterogeneity represents one of the foremost challenges currently facing Epigenome-Wide Association Studies (EWAS). Statistical methods leveraging the tissue-specificity of DNA methylation for deconvoluting the cellular mixture of heterogenous biospecimens offer a promising solution, however the performance of such methods depends entirely on the library of methylation markers being used for deconvolution. Here, we introduce a novel algorithm for Identifying Optimal Libraries (IDOL) that dynamically scans a candidate set of cell-specific methylation markers to find libraries that optimize the accuracy of cell fraction estimates obtained from cell mixture deconvolution. Application of IDOL to training set consisting of samples with both whole-blood DNA methylation data (Illumina HumanMethylation450 BeadArray (HM450)) and flow cytometry measurements of cell composition revealed an optimized library comprised of 300 CpG sites. When compared existing libraries, the library identified by IDOL demonstrated significantly better overall discrimination of the entire immune cell landscape (p = 0.038), and resulted in improved discrimination of 14 out of the 15 pairs of leukocyte subtypes. Estimates of cell composition across the samples in the training set using the IDOL library were highly correlated with their respective flow cytometry measurements, with all cell-specific R (2)>0.99 and root mean square errors (RMSEs) ranging from [0.97 % to 1.33 %] across leukocyte subtypes. Independent validation of the optimized IDOL library using two additional HM450 data sets showed similarly strong prediction performance, with all cell-specific R (2)>0.90 and R M S E<4.00 %. In simulation studies, adjustments for cell composition using the IDOL library resulted in uniformly lower false positive rates compared to competing libraries, while also demonstrating an improved capacity to explain epigenome-wide variation in DNA methylation within two large publicly available HM450 data sets. Despite consisting of half as many CpGs compared to existing libraries for whole blood mixture deconvolution, the optimized IDOL library identified herein resulted in outstanding prediction performance across all considered data sets and demonstrated potential to improve the operating characteristics of EWAS involving adjustments for cell distribution. In addition to providing the EWAS community with an optimized library for whole blood mixture deconvolution, our work establishes a systematic and generalizable framework for the assembly of libraries that improve the accuracy of cell mixture deconvolution.
Shi, Liang; Khandurina, Julia; Ronai, Zsolt; Li, Bi-Yu; Kwan, Wai King; Wang, Xun; Guttman, András
2003-01-01
A capillary gel electrophoresis based automated DNA fraction collection technique was developed to support a novel DNA fragment-pooling strategy for expressed sequence tag (EST) library construction. The cDNA population is first cleaved by BsaJ I and EcoR I restriction enzymes, and then subpooled by selective ligation with specific adapters followed by polymerase chain reaction (PCR) amplification and labeling. Combination of this cDNA fingerprinting method with high-resolution capillary gel electrophoresis separation and precise fractionation of individual cDNA transcript representatives avoids redundant fragment selection and concomitant repetitive sequencing of abundant transcripts. Using a computer-controlled capillary electrophoresis device the transcript representatives were separated by their size and fractions were automatically collected in every 30 s into 96-well plates. The high resolving power of the sieving matrix ensured sequencing grade separation of the DNA fragments (i.e., single-base resolution) and successful fraction collection. Performance and precision of the fraction collection procedure was validated by PCR amplification of the collected DNA fragments followed by capillary electrophoresis analysis for size and purity verification. The collected and PCR-amplified transcript representatives, ranging up to several hundred base pairs, were then sequenced to create an EST library.
Virgilio, Massimiliano; Jordaens, Kurt; Breman, Floris C; Backeljau, Thierry; De Meyer, Marc
2012-01-01
We propose a general working strategy to deal with incomplete reference libraries in the DNA barcoding identification of species. Considering that (1) queries with a large genetic distance with their best DNA barcode match are more likely to be misidentified and (2) imposing a distance threshold profitably reduces identification errors, we modelled relationships between identification performances and distance thresholds in four DNA barcode libraries of Diptera (n = 4270), Lepidoptera (n = 7577), Hymenoptera (n = 2067) and Tephritidae (n = 602 DNA barcodes). In all cases, more restrictive distance thresholds produced a gradual increase in the proportion of true negatives, a gradual decrease of false positives and more abrupt variations in the proportions of true positives and false negatives. More restrictive distance thresholds improved precision, yet negatively affected accuracy due to the higher proportions of queries discarded (viz. having a distance query-best match above the threshold). Using a simple linear regression we calculated an ad hoc distance threshold for the tephritid library producing an estimated relative identification error <0.05. According to the expectations, when we used this threshold for the identification of 188 independently collected tephritids, less than 5% of queries with a distance query-best match below the threshold were misidentified. Ad hoc thresholds can be calculated for each particular reference library of DNA barcodes and should be used as cut-off mark defining whether we can proceed identifying the query with a known estimated error probability (e.g. 5%) or whether we should discard the query and consider alternative/complementary identification methods.
Virgilio, Massimiliano; Jordaens, Kurt; Breman, Floris C.; Backeljau, Thierry; De Meyer, Marc
2012-01-01
We propose a general working strategy to deal with incomplete reference libraries in the DNA barcoding identification of species. Considering that (1) queries with a large genetic distance with their best DNA barcode match are more likely to be misidentified and (2) imposing a distance threshold profitably reduces identification errors, we modelled relationships between identification performances and distance thresholds in four DNA barcode libraries of Diptera (n = 4270), Lepidoptera (n = 7577), Hymenoptera (n = 2067) and Tephritidae (n = 602 DNA barcodes). In all cases, more restrictive distance thresholds produced a gradual increase in the proportion of true negatives, a gradual decrease of false positives and more abrupt variations in the proportions of true positives and false negatives. More restrictive distance thresholds improved precision, yet negatively affected accuracy due to the higher proportions of queries discarded (viz. having a distance query-best match above the threshold). Using a simple linear regression we calculated an ad hoc distance threshold for the tephritid library producing an estimated relative identification error <0.05. According to the expectations, when we used this threshold for the identification of 188 independently collected tephritids, less than 5% of queries with a distance query-best match below the threshold were misidentified. Ad hoc thresholds can be calculated for each particular reference library of DNA barcodes and should be used as cut-off mark defining whether we can proceed identifying the query with a known estimated error probability (e.g. 5%) or whether we should discard the query and consider alternative/complementary identification methods. PMID:22359600
Library construction for next-generation sequencing: Overviews and challenges
Head, Steven R.; Komori, H. Kiyomi; LaMere, Sarah A.; Whisenant, Thomas; Van Nieuwerburgh, Filip; Salomon, Daniel R.; Ordoukhanian, Phillip
2014-01-01
High-throughput sequencing, also known as next-generation sequencing (NGS), has revolutionized genomic research. In recent years, NGS technology has steadily improved, with costs dropping and the number and range of sequencing applications increasing exponentially. Here, we examine the critical role of sequencing library quality and consider important challenges when preparing NGS libraries from DNA and RNA sources. Factors such as the quantity and physical characteristics of the RNA or DNA source material as well as the desired application (i.e., genome sequencing, targeted sequencing, RNA-seq, ChIP-seq, RIP-seq, and methylation) are addressed in the context of preparing high quality sequencing libraries. In addition, the current methods for preparing NGS libraries from single cells are also discussed. PMID:24502796
Sproul, John S; Maddison, David R
2017-11-01
Despite advances that allow DNA sequencing of old museum specimens, sequencing small-bodied, historical specimens can be challenging and unreliable as many contain only small amounts of fragmented DNA. Dependable methods to sequence such specimens are especially critical if the specimens are unique. We attempt to sequence small-bodied (3-6 mm) historical specimens (including nomenclatural types) of beetles that have been housed, dried, in museums for 58-159 years, and for which few or no suitable replacement specimens exist. To better understand ideal approaches of sample preparation and produce preparation guidelines, we compared different library preparation protocols using low amounts of input DNA (1-10 ng). We also explored low-cost optimizations designed to improve library preparation efficiency and sequencing success of historical specimens with minimal DNA, such as enzymatic repair of DNA. We report successful sample preparation and sequencing for all historical specimens despite our low-input DNA approach. We provide a list of guidelines related to DNA repair, bead handling, reducing adapter dimers and library amplification. We present these guidelines to facilitate more economical use of valuable DNA and enable more consistent results in projects that aim to sequence challenging, irreplaceable historical specimens. © 2017 John Wiley & Sons Ltd.
Large-Scale Concatenation cDNA Sequencing
Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.
1997-01-01
A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
Primer Extension Mutagenesis Powered by Selective Rolling Circle Amplification
Huovinen, Tuomas; Brockmann, Eeva-Christine; Akter, Sultana; Perez-Gamarra, Susan; Ylä-Pelto, Jani; Liu, Yuan; Lamminmäki, Urpo
2012-01-01
Primer extension mutagenesis is a popular tool to create libraries for in vitro evolution experiments. Here we describe a further improvement of the method described by T.A. Kunkel using uracil-containing single-stranded DNA as the template for the primer extension by additional uracil-DNA glycosylase treatment and rolling circle amplification (RCA) steps. It is shown that removal of uracil bases from the template leads to selective amplification of the nascently synthesized circular DNA strand carrying the desired mutations by phi29 DNA polymerase. Selective RCA (sRCA) of the DNA heteroduplex formed in Kunkel's mutagenesis increases the mutagenesis efficiency from 50% close to 100% and the number of transformants 300-fold without notable diversity bias. We also observed that both the mutated and the wild-type DNA were present in at least one third of the cells transformed directly with Kunkel's heteroduplex. In contrast, the cells transformed with sRCA product contained only mutated DNA. In sRCA, the complex cell-based selection for the mutant strand is replaced with the more controllable enzyme-based selection and less DNA is needed for library creation. Construction of a gene library of ten billion members is demonstrated with the described method with 240 nanograms of DNA as starting material. PMID:22355397
Wu, Jian; Dai, Wei; Wu, Lin; Wang, Jinke
2018-02-13
Next-generation sequencing (NGS) is fundamental to the current biological and biomedical research. Construction of sequencing library is a key step of NGS. Therefore, various library construction methods have been explored. However, the current methods are still limited by some shortcomings. This study developed a new NGS library construction method, Single strand Adaptor Library Preparation (SALP), by using a novel single strand adaptor (SSA). SSA is a double-stranded oligonucleotide with a 3' overhang of 3 random nucleotides, which can be efficiently ligated to the 3' end of single strand DNA by T4 DNA ligase. SALP can be started with any denatured DNA fragments such as those sheared by Tn5 tagmentation, enzyme digestion and sonication. When started with Tn5-tagmented chromatin, SALP can overcome a key limitation of ATAC-seq and become a high-throughput NGS library construction method, SALP-seq, which can be used to comparatively characterize the chromatin openness state of multiple cells unbiasly. In this way, this study successfully characterized the comparative chromatin openness states of four different cell lines, including GM12878, HepG2, HeLa and 293T, with SALP-seq. Similarly, this study also successfully characterized the chromatin openness states of HepG2 cells with SALP-seq by using 10 5 to 500 cells. This study developed a new NGS library construction method, SALP, by using a novel kind of single strand adaptor (SSA), which should has wide applications in the future due to its unique performance.
USDA-ARS?s Scientific Manuscript database
Genic microsatellites or simple sequence repeat (genic-SSR) markers were developed in boxwood (Buxus taxa) for genetic diversity analysis, identification of taxa, and to facilitate breeding. cDNA libraries were developed from mRNA extracted from leaves of Buxus sempervirens ‘Vardar Valley’ and seque...
Design, synthesis and selection of DNA-encoded small-molecule libraries.
Clark, Matthew A; Acharya, Raksha A; Arico-Muendel, Christopher C; Belyanskaya, Svetlana L; Benjamin, Dennis R; Carlson, Neil R; Centrella, Paolo A; Chiu, Cynthia H; Creaser, Steffen P; Cuozzo, John W; Davie, Christopher P; Ding, Yun; Franklin, G Joseph; Franzen, Kurt D; Gefter, Malcolm L; Hale, Steven P; Hansen, Nils J V; Israel, David I; Jiang, Jinwei; Kavarana, Malcolm J; Kelley, Michael S; Kollmann, Christopher S; Li, Fan; Lind, Kenneth; Mataruse, Sibongile; Medeiros, Patricia F; Messer, Jeffrey A; Myers, Paul; O'Keefe, Heather; Oliff, Matthew C; Rise, Cecil E; Satz, Alexander L; Skinner, Steven R; Svendsen, Jennifer L; Tang, Lujia; van Vloten, Kurt; Wagner, Richard W; Yao, Gang; Zhao, Baoguang; Morgan, Barry A
2009-09-01
Biochemical combinatorial techniques such as phage display, RNA display and oligonucleotide aptamers have proven to be reliable methods for generation of ligands to protein targets. Adapting these techniques to small synthetic molecules has been a long-sought goal. We report the synthesis and interrogation of an 800-million-member DNA-encoded library in which small molecules are covalently attached to an encoding oligonucleotide. The library was assembled by a combination of chemical and enzymatic synthesis, and interrogated by affinity selection. We describe methods for the selection and deconvolution of the chemical display library, and the discovery of inhibitors for two enzymes: Aurora A kinase and p38 MAP kinase.
Vassou, Sophie Lorraine; Nithaniyal, Stalin; Raju, Balaji; Parani, Madasamy
2016-07-18
Ayurveda is a system of traditional medicine that originated in ancient India, and it is still in practice. Medicinal plants are the backbone of Ayurveda, which heavily relies on the plant-derived therapeutics. While Ayurveda is becoming more popular in several countries throughout the World, lack of authenticated medicinal plant raw drugs is a growing concern. Our aim was to DNA barcode the medicinal plants that are listed in the Ayurvedic Pharmacopoeia of India (API) to create a reference DNA barcode library, and to use the same to authenticate the raw drugs that are sold in markets. We have DNA barcoded 347 medicinal plants using rbcL marker, and curated rbcL DNA barcodes for 27 medicinal plants from public databases. These sequences were used to create Ayurvedic Pharmacopoeia of India - Reference DNA Barcode Library (API-RDBL). This library was used to authenticate 100 medicinal plant raw drugs, which were in the form of powders (82) and seeds (18). Ayurvedic Pharmacopoeia of India - Reference DNA Barcode Library (API-RDBL) was created with high quality and authentic rbcL barcodes for 374 out of the 395 medicinal plants that are included in the API. The rbcL DNA barcode differentiated 319 species (85 %) with the pairwise divergence ranging between 0.2 and 29.9 %. PCR amplification and DNA sequencing success rate of rbcL marker was 100 % even for the poorly preserved medicinal plant raw drugs that were collected from local markets. DNA barcoding revealed that only 79 % raw drugs were authentic, and the remaining 21 % samples were adulterated. Further, adulteration was found to be much higher with powders (ca. 25 %) when compared to seeds (ca. 5 %). The present study demonstrated the utility of DNA barcoding in authenticating medicinal plant raw drugs, and found that approximately one fifth of the market samples were adulterated. Powdered raw drugs, which are very difficult to be identified by taxonomists as well as common people, seem to be the easy target for adulteration. Developing a quality control protocol for medicinal plant raw drugs by incorporating DNA barcoding as a component is essential to ensure safety to the consumers.
A mix-and-read drop-based in vitro two-hybrid method for screening high-affinity peptide binders
Cui, Naiwen; Zhang, Huidan; Schneider, Nils; Tao, Ye; Asahara, Haruichi; Sun, Zhiyi; Cai, Yamei; Koehler, Stephan A.; de Greef, Tom F. A.; Abbaspourrad, Alireza; Weitz, David A.; Chong, Shaorong
2016-01-01
Drop-based microfluidics have recently become a novel tool by providing a stable linkage between phenotype and genotype for high throughput screening. However, use of drop-based microfluidics for screening high-affinity peptide binders has not been demonstrated due to the lack of a sensitive functional assay that can detect single DNA molecules in drops. To address this sensitivity issue, we introduced in vitro two-hybrid system (IVT2H) into microfluidic drops and developed a streamlined mix-and-read drop-IVT2H method to screen a random DNA library. Drop-IVT2H was based on the correlation between the binding affinity of two interacting protein domains and transcriptional activation of a fluorescent reporter. A DNA library encoding potential peptide binders was encapsulated with IVT2H such that single DNA molecules were distributed in individual drops. We validated drop-IVT2H by screening a three-random-residue library derived from a high-affinity MDM2 inhibitor PMI. The current drop-IVT2H platform is ideally suited for affinity screening of small-to-medium-sized libraries (103–106). It can obtain hits within a single day while consuming minimal amounts of reagents. Drop-IVT2H simplifies and accelerates the drop-based microfluidics workflow for screening random DNA libraries, and represents a novel alternative method for protein engineering and in vitro directed protein evolution. PMID:26940078
The Hemiptera (Insecta) of Canada: Constructing a Reference Library of DNA Barcodes
Gwiazdowski, Rodger A.; Foottit, Robert G.; Maw, H. Eric L.; Hebert, Paul D. N.
2015-01-01
DNA barcode reference libraries linked to voucher specimens create new opportunities for high-throughput identification and taxonomic re-evaluations. This study provides a DNA barcode library for about 45% of the recognized species of Canadian Hemiptera, and the publically available R workflow used for its generation. The current library is based on the analysis of 20,851 specimens including 1849 species belonging to 628 genera and 64 families. These individuals were assigned to 1867 Barcode Index Numbers (BINs), sequence clusters that often coincide with species recognized through prior taxonomy. Museum collections were a key source for identified specimens, but we also employed high-throughput collection methods that generated large numbers of unidentified specimens. Many of these specimens represented novel BINs that were subsequently identified by taxonomists, adding barcode coverage for additional species. Our analyses based on both approaches includes 94 species not listed in the most recent Canadian checklist, representing a potential 3% increase in the fauna. We discuss the development of our workflow in the context of prior DNA barcode library construction projects, emphasizing the importance of delineating a set of reference specimens to aid investigations in cases of nomenclatural and DNA barcode discordance. The identification for each specimen in the reference set can be annotated on the Barcode of Life Data System (BOLD), allowing experts to highlight questionable identifications; annotations can be added by any registered user of BOLD, and instructions for this are provided. PMID:25923328
Arculeo, Marco; Bonello, Juan J.; Bonnici, Leanne; Cannas, Rita; Carbonara, Pierluigi; Cau, Alessandro; Charilaou, Charis; El Ouamari, Najib; Fiorentino, Fabio; Follesa, Maria Cristina; Garofalo, Germana; Golani, Daniel; Guarniero, Ilaria; Hanner, Robert; Hemida, Farid; Kada, Omar; Lo Brutto, Sabrina; Mancusi, Cecilia; Morey, Gabriel; Schembri, Patrick J.; Serena, Fabrizio; Sion, Letizia; Stagioni, Marco; Tursi, Angelo; Vrgoc, Nedo; Steinke, Dirk; Tinti, Fausto
2017-01-01
Cartilaginous fish are particularly vulnerable to anthropogenic stressors and environmental change because of their K-selected reproductive strategy. Accurate data from scientific surveys and landings are essential to assess conservation status and to develop robust protection and management plans. Currently available data are often incomplete or incorrect as a result of inaccurate species identifications, due to a high level of morphological stasis, especially among closely related taxa. Moreover, several diagnostic characters clearly visible in adult specimens are less evident in juveniles. Here we present results generated by the ELASMOMED Consortium, a regional network aiming to sample and DNA-barcode the Mediterranean Chondrichthyans with the ultimate goal to provide a comprehensive DNA barcode reference library. This library will support and improve the molecular taxonomy of this group and the effectiveness of management and conservation measures. We successfully barcoded 882 individuals belonging to 42 species (17 sharks, 24 batoids and one chimaera), including four endemic and several threatened ones. Morphological misidentifications were found across most orders, further confirming the need for a comprehensive DNA barcoding library as a valuable tool for the reliable identification of specimens in support of taxonomist who are reviewing current identification keys. Despite low intraspecific variation among their barcode sequences and reduced samples size, five species showed preliminary evidence of phylogeographic structure. Overall, the ELASMOMED initiative further emphasizes the key role accurate DNA barcoding libraries play in establishing reliable diagnostic species specific features in otherwise taxonomically problematic groups for biodiversity management and conservation actions. PMID:28107413
Preparation of Low-Input and Ligation-Free ChIP-seq Libraries Using Template-Switching Technology.
Bolduc, Nathalie; Lehman, Alisa P; Farmer, Andrew
2016-10-10
Chromatin immunoprecipitation (ChIP) followed by high-throughput sequencing (ChIP-seq) has become the gold standard for mapping of transcription factors and histone modifications throughout the genome. However, for ChIP experiments involving few cells or targeting low-abundance transcription factors, the small amount of DNA recovered makes ligation of adapters very challenging. In this unit, we describe a ChIP-seq workflow that can be applied to small cell numbers, including a robust single-tube and ligation-free method for preparation of sequencing libraries from sub-nanogram amounts of ChIP DNA. An example ChIP protocol is first presented, resulting in selective enrichment of DNA-binding proteins and cross-linked DNA fragments immobilized on beads via an antibody bridge. This is followed by a protocol for fast and easy cross-linking reversal and DNA recovery. Finally, we describe a fast, ligation-free library preparation protocol, featuring DNA SMART technology, resulting in samples ready for Illumina sequencing. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
Methods for transforming and expression screening of filamentous fungal cells with a DNA library
Teter, Sarah; Lamsa, Michael; Cherry, Joel; Ward, Connie
2015-06-02
The present invention relates to methods for expression screening of filamentous fungal transformants, comprising: (a) isolating single colony transformants of a DNA library introduced into E. coli; (b) preparing DNA from each of the single colony E. coli transformants; (c) introducing a sample of each of the DNA preparations of step (b) into separate suspensions of protoplasts of a filamentous fungus to obtain transformants thereof, wherein each transformant contains one or more copies of an individual polynucleotide from the DNA library; (d) growing the individual filamentous fungal transformants of step (c) on selective growth medium, thereby permitting growth of the filamentous fungal transformants, while suppressing growth of untransformed filamentous fungi; and (e) measuring activity or a property of each polypeptide encoded by the individual polynucleotides. The present invention also relates to isolated polynucleotides encoding polypeptides of interest obtained by such methods, to nucleic acid constructs, expression vectors, and recombinant host cells comprising the isolated polynucleotides, and to methods of producing the polypeptides encoded by the isolated polynucleotides.
Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine
2002-06-15
To explore the expression profile of the human lens and to provide a resource for microarray studies, expressed sequence tag (EST) analysis has been performed on cDNA libraries from adult lenses. A cDNA library was constructed from two adult (40 year old) human lenses. Over two thousand clones were sequenced from the unamplified, un-normalized library. The library was then normalized and a further 2200 sequences were obtained. All the data were analyzed using GRIST (GRouping and Identification of Sequence Tags), a procedure for gene identification and clustering. The lens library (by) contains a low percentage of non-mRNA contaminants and a high fraction (over 75%) of apparently full length cDNA clones. Approximately 2000 reads from the unamplified library yields 810 clusters, potentially representing individual genes expressed in the lens. After normalization, the content of crystallins and other abundant cDNAs is markedly reduced and a similar number of reads from this library (fs) yields 1455 unique groups of which only two thirds correspond to named genes in GenBank. Among the most abundant cDNAs is one for a novel gene related to glutamine synthetase, which was designated "lengsin" (LGS). Analyses of ESTs also reveal examples of alternative transcripts, including a major alternative splice form for the lens specific membrane protein MP19. Variant forms for other transcripts, including those encoding the apoptosis inhibitor Livin and the armadillo repeat protein ARVCF, are also described. The lens cDNA libraries are a resource for gene discovery, full length cDNAs for functional studies and microarrays. The discovery of an abundant, novel transcript, lengsin, and a major novel splice form of MP19 reflect the utility of unamplified libraries constructed from dissected tissue. Many novel transcripts and splice forms are represented, some of which may be candidates for genetic diseases.
Verma, Digvijay; Satyanarayana, T
2011-09-01
An improved single-step protocol has been developed for extracting pure community humic substance-free DNA from alkaline soils and sediments. The method is based on direct cell lysis in the presence of powdered activated charcoal and polyvinylpolypyrrolidone followed by precipitation with polyethyleneglycol and isopropanol. The strategy allows simultaneous isolation and purification of DNA while minimizing the loss of DNA with respect to other available protocols for metagenomic DNA extraction. Moreover, the purity levels are significant, which are difficult to attain with any of the methods reported in the literature for DNA extraction from soils. The DNA thus extracted was free from humic substances and, therefore, could be processed for restriction digestion, PCR amplification as well as for the construction of metagenomic libraries.
Maggi, Elaine C; Gravina, Silvia; Cheng, Haiying; Piperdi, Bilal; Yuan, Ziqiang; Dong, Xiao; Libutti, Steven K; Vijg, Jan; Montagna, Cristina
2018-01-01
The goal of this study was to develop a method for whole genome cell-free DNA (cfDNA) methylation analysis in humans and mice with the ultimate goal to facilitate the identification of tumor derived DNA methylation changes in the blood. Plasma or serum from patients with pancreatic neuroendocrine tumors or lung cancer, and plasma from a murine model of pancreatic adenocarcinoma was used to develop a protocol for cfDNA isolation, library preparation and whole-genome bisulfite sequencing of ultra low quantities of cfDNA, including tumor-specific DNA. The protocol developed produced high quality libraries consistently generating a conversion rate >98% that will be applicable for the analysis of human and mouse plasma or serum to detect tumor-derived changes in DNA methylation.
Differential cDNA cloning by enzymatic degrading subtraction (EDS).
Zeng, J; Gorski, R A; Hamer, D
1994-01-01
We describe a new method, called enzymatic degrading subtraction (EDS), for the construction of subtractive libraries from PCR amplified cDNA. The novel features of this method are that i) the tester DNA is blocked by thionucleotide incorporation; ii) the rate of hybridization is accelerated by phenol-emulsion reassociation; and iii) the driver cDNA and hybrid molecules are enzymatically removed by digestion with exonucleases III and VII rather than by physical partitioning. We demonstrate the utility of EDS by constructing a subtractive library enriched for cDNAs expressed in adult but not in embryonic rat brains. Images PMID:7971268
Holland, Erika G; Buhr, Diane L; Acca, Felicity E; Alderman, Dawn; Bovat, Kristin; Busygina, Valeria; Kay, Brian K; Weiner, Michael P; Kiss, Margaret M
2013-08-30
Affinity maturation is an important part of the recombinant antibody development process. There are several well-established approaches for generating libraries of mutated antibody genes for affinity maturation, but these approaches are generally too laborious or expensive to allow high-throughput, parallel processing of multiple antibodies. Here, we describe a scalable approach that enables the generation of libraries with greater than 10(8) clones from a single Escherichia coli transformation. In our method, a mutated DNA fragment is produced using PCR conditions that promote nucleotide misincorporation into newly synthesized DNA. In the PCR reaction, one of the primers contains at least three phosphorothioate linkages at its 5' end, and treatment of the PCR product with a 5' to 3' exonuclease is used to preferentially remove the strand synthesized with the non-modified primer, resulting in a single-stranded DNA fragment. This fragment then serves as a megaprimer to prime DNA synthesis on a uracilated, circular, single-stranded template in a Kunkel-like mutagenesis reaction that biases nucleotide base-changes between the megaprimer and uracilated DNA sequence in favor of the in vitro synthesized megaprimer. This method eliminates the inefficient subcloning steps that are normally required for the construction of affinity maturation libraries from randomly mutagenized antibody genes. Copyright © 2013. Published by Elsevier B.V.
Borsu, Laetitia; Intrieri, Julie; Thampi, Linta; Yu, Helena; Riely, Gregory; Nafa, Khedoudja; Chandramohan, Raghu; Ladanyi, Marc; Arcila, Maria E
2016-11-01
Although next-generation sequencing (NGS) is a robust technology for comprehensive assessment of EGFR-mutant lung adenocarcinomas with acquired resistance to tyrosine kinase inhibitors, it may not provide sufficiently rapid and sensitive detection of the EGFR T790M mutation, the most clinically relevant resistance biomarker. Here, we describe a digital PCR (dPCR) assay for rapid T790M detection on aliquots of NGS libraries prepared for comprehensive profiling, fully maximizing broad genomic analysis on limited samples. Tumor DNAs from patients with EGFR-mutant lung adenocarcinomas and acquired resistance to epidermal growth factor receptor inhibitors were prepared for Memorial Sloan-Kettering-Integrated Mutation Profiling of Actionable Cancer Targets sequencing, a hybrid capture-based assay interrogating 410 cancer-related genes. Precapture library aliquots were used for rapid EGFR T790M testing by dPCR, and results were compared with NGS and locked nucleic acid-PCR Sanger sequencing (reference high sensitivity method). Seventy resistance samples showed 99% concordance with the reference high sensitivity method in accuracy studies. Input as low as 2.5 ng provided a sensitivity of 1% and improved further with increasing DNA input. dPCR on libraries required less DNA and showed better performance than direct genomic DNA. dPCR on NGS libraries is a robust and rapid approach to EGFR T790M testing, allowing most economical utilization of limited material for comprehensive assessment. The same assay can also be performed directly on any limited DNA source and cell-free DNA. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
DNA-Encoded Dynamic Combinatorial Chemical Libraries.
Reddavide, Francesco V; Lin, Weilin; Lehnert, Sarah; Zhang, Yixin
2015-06-26
Dynamic combinatorial chemistry (DCC) explores the thermodynamic equilibrium of reversible reactions. Its application in the discovery of protein binders is largely limited by difficulties in the analysis of complex reaction mixtures. DNA-encoded chemical library (DECL) technology allows the selection of binders from a mixture of up to billions of different compounds; however, experimental results often show low a signal-to-noise ratio and poor correlation between enrichment factor and binding affinity. Herein we describe the design and application of DNA-encoded dynamic combinatorial chemical libraries (EDCCLs). Our experiments have shown that the EDCCL approach can be used not only to convert monovalent binders into high-affinity bivalent binders, but also to cause remarkably enhanced enrichment of potent bivalent binders by driving their in situ synthesis. We also demonstrate the application of EDCCLs in DNA-templated chemical reactions. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genomic clones for human cholinesterase
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kott, M.; Venta, P.J.; Larsen, J.
1987-05-01
A human genomic library was prepared from peripheral white blood cells from a single donor by inserting an MboI partial digest into BamHI poly-linker sites of EMBL3. This library was screened using an oligolabeled human cholinesterase cDNA probe over 700 bp long. The latter probe was obtained from a human basal ganglia cDNA library. Of approximately 2 million clones screened with high stringency conditions several positive clones were identified; two have been plaque purified. One of these clones has been partially mapped using restriction enzymes known to cut within the coded region of the cDNA for human serum cholinesterase. Hybridizationmore » of the fragments and their sizes are as expected if the genomic clone is cholinesterase. Sequencing of the DNA fragments in M13 is in progress to verify the identify of the clone and the location of introns.« less
Rapid and Easy Protocol for Quantification of Next-Generation Sequencing Libraries.
Hawkins, Steve F C; Guest, Paul C
2018-01-01
The emergence of next-generation sequencing (NGS) over the last 10 years has increased the efficiency of DNA sequencing in terms of speed, ease, and price. However, the exact quantification of a NGS library is crucial in order to obtain good data on sequencing platforms developed by the current market leader Illumina. Different approaches for DNA quantification are available currently and the most commonly used are based on analysis of the physical properties of the DNA through spectrophotometric or fluorometric methods. Although these methods are technically simple, they do not allow exact quantification as can be achieved using a real-time quantitative PCR (qPCR) approach. A qPCR protocol for DNA quantification with applications in NGS library preparation studies is presented here. This can be applied in various fields of study such as medical disorders resulting from nutritional programming disturbances.
Baxter, Laura L; Hsu, Benjamin J; Umayam, Lowell; Wolfsberg, Tyra G; Larson, Denise M; Frith, Martin C; Kawai, Jun; Hayashizaki, Yoshihide; Carninci, Piero; Pavan, William J
2007-06-01
As part of the RIKEN mouse encyclopedia project, two cDNA libraries were prepared from melanocyte-derived cell lines, using techniques of full-length clone selection and subtraction/normalization to enrich for rare transcripts. End sequencing showed that these libraries display over 83% complete coding sequence at the 5' end and 96-97% complete coding sequence at the 3' end. Evaluation of the libraries, derived from B16F10Y tumor cells and melan-c cells, revealed that they contain clones for a majority of the genes previously demonstrated to function in melanocyte biology. Analysis of genomic locations for transcripts revealed that the distribution of melanocyte genes is non-random throughout the genome. Three genomic regions identified that showed significant clustering of melanocyte-expressed genes contain one or more genes previously shown to regulate melanocyte development or function. A catalog of genes expressed in these libraries is presented, providing a valuable resource of cDNA clones and sequence information that can be used for identification of new genes important for melanocyte development, function, and disease.
Kim, Sungmin; Song, Kyo-Hong; Ree, Han-Il; Kim, Won
2012-01-01
Non-biting midges (Diptera: Chironomidae) are a diverse population that commonly causes respiratory allergies in humans. Chironomid larvae can be used to indicate freshwater pollution, but accurate identification on the basis of morphological characteristics is difficult. In this study, we constructed a mitochondrial cytochrome c oxidase subunit I (COI)-based DNA barcode library for Korean chironomids. This library consists of 211 specimens from 49 species, including adults and unidentified larvae. The interspecies and intraspecies COI sequence variations were analyzed. Sophisticated indexes were developed in order to properly evaluate indistinct barcode gaps that are created by insufficient sampling on both the interspecies and intraspecies levels and by variable mutation rates across taxa. In a variety of insect datasets, these indexes were useful for re-evaluating large barcode datasets and for defining COI barcode gaps. The COI-based DNA barcode library will provide a rapid and reliable tool for the molecular identification of Korean chironomid species. Furthermore, this reverse-taxonomic approach will be improved by the continuous addition of other speceis’ sequences to the library. PMID:22138764
We examined the bacterial composition of chlorinated drinking water using 16S rRNA gene clone libraries derived from RNA and DNA extracted from twelve water samples collected in three different months (June, August, and September of 2007). Phylogenetic analysis of 1234 and 1117 ...
We examined the bacterial composition of chlorinated drinking water using 16S rRNA gene clone libraries derived from RNA and DNA extracted from twelve water samples collected in three different months (June, August, and September of 2007). Phylogenetic analysis of 1234 and 1117 ...
2011-01-01
Background Common bean is an important legume crop with only a moderate number of short expressed sequence tags (ESTs) made with traditional methods. The goal of this research was to use full-length cDNA technology to develop ESTs that would overlap with the beginning of open reading frames and therefore be useful for gene annotation of genomic sequences. The library was also constructed to represent genes expressed under drought, low soil phosphorus and high soil aluminum toxicity. We also undertook comparisons of the full-length cDNA library to two previous non-full clone EST sets for common bean. Results Two full-length cDNA libraries were constructed: one for the drought tolerant Mesoamerican genotype BAT477 and the other one for the acid-soil tolerant Andean genotype G19833 which has been selected for genome sequencing. Plants were grown in three soil types using deep rooting cylinders subjected to drought and non-drought stress and tissues were collected from both roots and above ground parts. A total of 20,000 clones were selected robotically, half from each library. Then, nearly 10,000 clones from the G19833 library were sequenced with an average read length of 850 nucleotides. A total of 4,219 unigenes were identified consisting of 2,981 contigs and 1,238 singletons. These were functionally annotated with gene ontology terms and placed into KEGG pathways. Compared to other EST sequencing efforts in common bean, about half of the sequences were novel or represented the 5' ends of known genes. Conclusions The present full-length cDNA libraries add to the technological toolbox available for common bean and our sequencing of these clones substantially increases the number of unique EST sequences available for the common bean genome. All of this should be useful for both functional gene annotation, analysis of splice site variants and intron/exon boundary determination by comparison to soybean genes or with common bean whole-genome sequences. In addition the library has a large number of transcription factors and will be interesting for discovery and validation of drought or abiotic stress related genes in common bean. PMID:22118559
Development of High Throughput Process for Constructing 454 Titanium and Illumina Libraries
DOE Office of Scientific and Technical Information (OSTI.GOV)
Deshpande, Shweta; Hack, Christopher; Tang, Eric
2010-05-28
We have developed two processes with the Biomek FX robot to construct 454 titanium and Illumina libraries in order to meet the increasing library demands. All modifications in the library construction steps were made to enable the adaptation of the entire processes to work with the 96-well plate format. The key modifications include the shearing of DNA with Covaris E210 and the enzymatic reaction cleaning and fragment size selection with SPRI beads and magnetic plate holders. The construction of 96 Titanium libraries takes about 8 hours from sheared DNA to ssDNA recovery. The processing of 96 Illumina libraries takes lessmore » time than that of the Titanium library process. Although both processes still require manual transfer of plates from robot to other work stations such as thermocyclers, these robotic processes represent about 12- to 24-folds increase of library capacity comparing to the manual processes. To enable the sequencing of many libraries in parallel, we have also developed sets of molecular barcodes for both library types. The requirements for the 454 library barcodes include 10 bases, 40-60percent GC, no consecutive same base, and no less than 3 bases difference between barcodes. We have used 96 of the resulted 270 barcodes to construct libraries and pool to test the ability of accurately assigning reads to the right samples. When allowing 1 base error occurred in the 10 base barcodes, we could assign 99.6percent of the total reads and 100percent of them were uniquely assigned. As for the Illumina barcodes, the requirements include 4 bases, balanced GC, and at least 2 bases difference between barcodes. We have begun to assess the ability to assign reads after pooling different number of libraries. We will discuss the progress and the challenges of these scale-up processes.« less
Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng
2012-01-01
To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing. PMID:23202944
Tian, Yang; Li, Yan Hong
2017-01-01
To understand the differences of the bacteria associated with different mosses, a phylogenetic study of bacterial communities in three mosses was carried out based on 16S rDNA and 16S rRNA sequencing. The mosses used were Hygroamblystegium noterophilum, Entodon compressus and Grimmia montana, representing hygrophyte, shady plant and xerophyte, respectively. In total, the operational taxonomic units (OTUs), richness and diversity were different regardless of the moss species and the library level. All the examined 1183 clones were assigned to 248 OTUs, 56 genera were assigned in rDNA libraries and 23 genera were determined at the rRNA level. Proteobacteria and Bacteroidetes were considered as the most dominant phyla in all the libraries, whereas abundant Actinobacteria and Acidobacteria were detected in the rDNA library of Entodon compressus and approximately 24.7% clones were assigned to Candidate division TM7 in Grimmia montana at rRNA level. The heatmap showed the bacterial profiles derived from rRNA and rDNA were partly overlapping. However, the principle component analysis of all the profiles derived from rDNA showed sharper differences between the different mosses than that of rRNA-based profiles. This suggests that the metabolically active bacterial compositions in different mosses were more phylogenetically similar and the differences of the bacteria associated with different mosses were mainly detected at the rDNA level. Obtained results clearly demonstrate that combination of 16S rDNA and 16S rRNA sequencing is preferred approach to have a good understanding on the constitution of the microbial communities in mosses. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Next-generation sequencing library construction on a surface.
Feng, Kuan; Costa, Justin; Edwards, Jeremy S
2018-05-30
Next-generation sequencing (NGS) has revolutionized almost all fields of biology, agriculture and medicine, and is widely utilized to analyse genetic variation. Over the past decade, the NGS pipeline has been steadily improved, and the entire process is currently relatively straightforward. However, NGS instrumentation still requires upfront library preparation, which can be a laborious process, requiring significant hands-on time. Herein, we present a simple but robust approach to streamline library preparation by utilizing surface bound transposases to construct DNA libraries directly on a flowcell surface. The surface bound transposases directly fragment genomic DNA while simultaneously attaching the library molecules to the flowcell. We sequenced and analysed a Drosophila genome library generated by this surface tagmentation approach, and we showed that our surface bound library quality was comparable to the quality of the library from a commercial kit. In addition to the time and cost savings, our approach does not require PCR amplification of the library, which eliminates potential problems associated with PCR duplicates. We described the first study to construct libraries directly on a flowcell. We believe our technique could be incorporated into the existing Illumina sequencing pipeline to simplify the workflow, reduce costs, and improve data quality.
A comparative study of ChIP-seq sequencing library preparation methods.
Sundaram, Arvind Y M; Hughes, Timothy; Biondi, Shea; Bolduc, Nathalie; Bowman, Sarah K; Camilli, Andrew; Chew, Yap C; Couture, Catherine; Farmer, Andrew; Jerome, John P; Lazinski, David W; McUsic, Andrew; Peng, Xu; Shazand, Kamran; Xu, Feng; Lyle, Robert; Gilfillan, Gregor D
2016-10-21
ChIP-seq is the primary technique used to investigate genome-wide protein-DNA interactions. As part of this procedure, immunoprecipitated DNA must undergo "library preparation" to enable subsequent high-throughput sequencing. To facilitate the analysis of biopsy samples and rare cell populations, there has been a recent proliferation of methods allowing sequencing library preparation from low-input DNA amounts. However, little information exists on the relative merits, performance, comparability and biases inherent to these procedures. Notably, recently developed single-cell ChIP procedures employing microfluidics must also employ library preparation reagents to allow downstream sequencing. In this study, seven methods designed for low-input DNA/ChIP-seq sample preparation (Accel-NGS® 2S, Bowman-method, HTML-PCR, SeqPlex™, DNA SMART™, TELP and ThruPLEX®) were performed on five replicates of 1 ng and 0.1 ng input H3K4me3 ChIP material, and compared to a "gold standard" reference PCR-free dataset. The performance of each method was examined for the prevalence of unmappable reads, amplification-derived duplicate reads, reproducibility, and for the sensitivity and specificity of peak calling. We identified consistent high performance in a subset of the tested reagents, which should aid researchers in choosing the most appropriate reagents for their studies. Furthermore, we expect this work to drive future advances by identifying and encouraging use of the most promising methods and reagents. The results may also aid judgements on how comparable are existing datasets that have been prepared with different sample library preparation reagents.
Yang, Bing-Yan; Huo, Xiu-Ai; Li, Peng-Fei; Wang, Cui-Xia; Duan, Hui-Jun
2014-08-01
Full-length cDNAs are very important for genome annotation and functional analysis of genes. The number of full-length cDNAs from watermelon remains limited. Here we report first the construction of a full-length enriched cDNA library from Fusarium wilt stressed watermelon (Citrullus lanatus Thunb.) cultivar PI296341 root tissues using the SMART method. The titer of primary cDNA library and amplified library was 2.21 x 10(6) and 2.0 x 10(10) pfu/ml, respectively and the rate of recombinant was above 85%. The size of insert fragment ranged from 0.3 to 2.0 kb. In this study, we first cloned a gene named ClWRKY1, which was 1981 bp long and encoded a protein consisting of 394 amino acids. It contained two characteristic WRKY domains and two zinc finger motifs. Quantitative real-time PCR showed that ClWRKY1 expression levels reached maximum level at 12 h after inoculation with Fusarium oxysporum f. sp. niveum. The full-length cDNA library of watermelon root tissues is not only essential for the cloning of genes which are known, but also an initial key for the screening and cloning of new genes that might be involved in resistance to Fusarium wilt.
Current and future resources for functional metagenomics.
Lam, Kathy N; Cheng, Jiujun; Engel, Katja; Neufeld, Josh D; Charles, Trevor C
2015-01-01
Functional metagenomics is a powerful experimental approach for studying gene function, starting from the extracted DNA of mixed microbial populations. A functional approach relies on the construction and screening of metagenomic libraries-physical libraries that contain DNA cloned from environmental metagenomes. The information obtained from functional metagenomics can help in future annotations of gene function and serve as a complement to sequence-based metagenomics. In this Perspective, we begin by summarizing the technical challenges of constructing metagenomic libraries and emphasize their value as resources. We then discuss libraries constructed using the popular cloning vector, pCC1FOS, and highlight the strengths and shortcomings of this system, alongside possible strategies to maximize existing pCC1FOS-based libraries by screening in diverse hosts. Finally, we discuss the known bias of libraries constructed from human gut and marine water samples, present results that suggest bias may also occur for soil libraries, and consider factors that bias metagenomic libraries in general. We anticipate that discussion of current resources and limitations will advance tools and technologies for functional metagenomics research.
Townsley, Brad T; Covington, Michael F; Ichihashi, Yasunori; Zumstein, Kristina; Sinha, Neelima R
2015-01-01
Next Generation Sequencing (NGS) is driving rapid advancement in biological understanding and RNA-sequencing (RNA-seq) has become an indispensable tool for biology and medicine. There is a growing need for access to these technologies although preparation of NGS libraries remains a bottleneck to wider adoption. Here we report a novel method for the production of strand specific RNA-seq libraries utilizing the terminal breathing of double-stranded cDNA to capture and incorporate a sequencing adapter. Breath Adapter Directional sequencing (BrAD-seq) reduces sample handling and requires far fewer enzymatic steps than most available methods to produce high quality strand-specific RNA-seq libraries. The method we present is optimized for 3-prime Digital Gene Expression (DGE) libraries and can easily extend to full transcript coverage shotgun (SHO) type strand-specific libraries and is modularized to accommodate a diversity of RNA and DNA input materials. BrAD-seq offers a highly streamlined and inexpensive option for RNA-seq libraries.
Brouilette, Scott; Kuersten, Scott; Mein, Charles; Bozek, Monika; Terry, Anna; Dias, Kerith-Rae; Bhaw-Rosun, Leena; Shintani, Yasunori; Coppen, Steven; Ikebe, Chiho; Sawhney, Vinit; Campbell, Niall; Kaneko, Masahiro; Tano, Nobuko; Ishida, Hidekazu; Suzuki, Ken; Yashiro, Kenta
2012-10-01
Deep sequencing of single cell-derived cDNAs offers novel insights into oncogenesis and embryogenesis. However, traditional library preparation for RNA-seq analysis requires multiple steps with consequent sample loss and stochastic variation at each step significantly affecting output. Thus, a simpler and better protocol is desirable. The recently developed hyperactive Tn5-mediated library preparation, which brings high quality libraries, is likely one of the solutions. Here, we tested the applicability of hyperactive Tn5-mediated library preparation to deep sequencing of single cell cDNA, optimized the protocol, and compared it with the conventional method based on sonication. This new technique does not require any expensive or special equipment, which secures wider availability. A library was constructed from only 100 ng of cDNA, which enables the saving of precious specimens. Only a few steps of robust enzymatic reaction resulted in saved time, enabling more specimens to be prepared at once, and with a more reproducible size distribution among the different specimens. The obtained RNA-seq results were comparable to the conventional method. Thus, this Tn5-mediated preparation is applicable for anyone who aims to carry out deep sequencing for single cell cDNAs. Copyright © 2012 Wiley Periodicals, Inc.
Sequence evaluation of four specific cDNA libraries for developmental genomics of sunflower.
Tamborindeguy, C; Ben, C; Liboz, T; Gentzbittel, L
2004-04-01
Four different cDNA libraries were constructed from sunflower protoplasts growing under embryogenic and non-embryogenic conditions: one standard library from each condition and two subtractive libraries in opposite sense. A total of 22,876 cDNA clones were obtained and 4800 ESTs were sequenced, giving rise to 2479 high quality ESTs representing an unigene set of 1502 sequences. This set was compared with ESTs represented in public databases using the programs BLASTN and BLASTX, and its members were classified according to putative function using the catalog in the Kyoto Encyclopedia of Genes and Genomes (KEGG). Some 33% of sequences failed to align with existing plant ESTs and therefore represent putative novel genes. The libraries show a low level of redundancy and, on average, 50% of the present ESTs have not been previously reported for sunflower. Several potentially interesting genes were identified, based on their homology with genes involved in animal zygotic division or plant embryogenesis. We also identified two ESTs that show significantly different levels of expression under embryogenic and non-embryogenic conditions. The libraries described here represent an original and valuable resource for the discovery of yet unknown genes putatively involved in dicot embryogenesis and improving our knowledge of the mechanisms involved in polarity acquisition by plant embryos.
Mora-Castilla, Sergio; To, Cuong; Vaezeslami, Soheila; Morey, Robert; Srinivasan, Srimeenakshi; Dumdie, Jennifer N; Cook-Andersen, Heidi; Jenkins, Joby; Laurent, Louise C
2016-08-01
As the cost of next-generation sequencing has decreased, library preparation costs have become a more significant proportion of the total cost, especially for high-throughput applications such as single-cell RNA profiling. Here, we have applied novel technologies to scale down reaction volumes for library preparation. Our system consisted of in vitro differentiated human embryonic stem cells representing two stages of pancreatic differentiation, for which we prepared multiple biological and technical replicates. We used the Fluidigm (San Francisco, CA) C1 single-cell Autoprep System for single-cell complementary DNA (cDNA) generation and an enzyme-based tagmentation system (Nextera XT; Illumina, San Diego, CA) with a nanoliter liquid handler (mosquito HTS; TTP Labtech, Royston, UK) for library preparation, reducing the reaction volume down to 2 µL and using as little as 20 pg of input cDNA. The resulting sequencing data were bioinformatically analyzed and correlated among the different library reaction volumes. Our results showed that decreasing the reaction volume did not interfere with the quality or the reproducibility of the sequencing data, and the transcriptional data from the scaled-down libraries allowed us to distinguish between single cells. Thus, we have developed a process to enable efficient and cost-effective high-throughput single-cell transcriptome sequencing. © 2016 Society for Laboratory Automation and Screening.
Belyanskaya, Svetlana L; Ding, Yun; Callahan, James F; Lazaar, Aili L; Israel, David I
2017-05-04
DNA-encoded chemical library technology was developed with the vision of its becoming a transformational platform for drug discovery. The hope was that a new paradigm for the discovery of low-molecular-weight drugs would be enabled by combining the vast molecular diversity achievable with combinatorial chemistry, the information-encoding attributes of DNA, the power of molecular biology, and a streamlined selection-based discovery process. Here, we describe the discovery and early clinical development of GSK2256294, an inhibitor of soluble epoxide hydrolase (sEH, EPHX2), by using encoded-library technology (ELT). GSK2256294 is an orally bioavailable, potent and selective inhibitor of sEH that has a long half life and produced no serious adverse events in a first-time-in-human clinical study. To our knowledge, GSK2256294 is the first molecule discovered from this technology to enter human clinical testing and represents a realization of the vision that DNA-encoded chemical library technology can efficiently yield molecules with favorable properties that can be readily progressed into high-quality drugs. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genomic sequencing of Pleistocene cave bears
DOE Office of Scientific and Technical Information (OSTI.GOV)
Noonan, James P.; Hofreiter, Michael; Smith, Doug
2005-04-01
Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of {approx}1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome,more » the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.« less
Informatic selection of a neural crest-melanocyte cDNA set for microarray analysis
Loftus, S. K.; Chen, Y.; Gooden, G.; Ryan, J. F.; Birznieks, G.; Hilliard, M.; Baxevanis, A. D.; Bittner, M.; Meltzer, P.; Trent, J.; Pavan, W.
1999-01-01
With cDNA microarrays, it is now possible to compare the expression of many genes simultaneously. To maximize the likelihood of finding genes whose expression is altered under the experimental conditions, it would be advantageous to be able to select clones for tissue-appropriate cDNA sets. We have taken advantage of the extensive sequence information in the dbEST expressed sequence tag (EST) database to identify a neural crest-derived melanocyte cDNA set for microarray analysis. Analysis of characterized genes with dbEST identified one library that contained ESTs representing 21 neural crest-expressed genes (library 198). The distribution of the ESTs corresponding to these genes was biased toward being derived from library 198. This is in contrast to the EST distribution profile for a set of control genes, characterized to be more ubiquitously expressed in multiple tissues (P < 1 × 10−9). From library 198, a subset of 852 clustered ESTs were selected that have a library distribution profile similar to that of the 21 neural crest-expressed genes. Microarray analysis demonstrated the majority of the neural crest-selected 852 ESTs (Mel1 array) were differentially expressed in melanoma cell lines compared with a non-neural crest kidney epithelial cell line (P < 1 × 10−8). This was not observed with an array of 1,238 ESTs that was selected without library origin bias (P = 0.204). This study presents an approach for selecting tissue-appropriate cDNAs that can be used to examine the expression profiles of developmental processes and diseases. PMID:10430933
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stapleton, Mark; Liao, Guochun; Brokstein, Peter
2002-08-12
Collections of full-length nonredundant cDNA clones are critical reagents for functional genomics. The first step toward these resources is the generation and single-pass sequencing of cDNA libraries that contain a high proportion of full-length clones. The first release of the Drosophila Gene Collection Release 1 (DGCr1) was produced from six libraries representing various tissues, developmental stages, and the cultured S2 cell line. Nearly 80,000 random 5prime expressed sequence tags (EST) from these libraries were collapsed into a nonredundant set of 5849 cDNAs, corresponding to {approx}40 percent of the 13,474 predicted genes in Drosophila. To obtain cDNA clones representing the remainingmore » genes, we have generated an additional 157,835 5prime ESTs from two previously existing and three new libraries. One new library is derived from adult testis, a tissue we previously did not exploit for gene discovery; two new cap-trapped normalized libraries are derived from 0-22hr embryos and adult heads. Taking advantage of the annotated D. melanogaster genome sequence, we clustered the ESTs by aligning them to the genome. Clusters that overlap genes not already represented by cDNA clones in the DGCr1 were analyzed further, and putative full-length clones were selected for inclusion in the new DGC. This second release of the DGC (DGCr2) contains 5061 additional clones, extending the collection to 10,910 cDNAs representing >70 percent of the predicted genes in Drosophila.« less
Ding, Yun; O'Keefe, Heather; DeLorey, Jennifer L; Israel, David I; Messer, Jeffrey A; Chiu, Cynthia H; Skinner, Steven R; Matico, Rosalie E; Murray-Thompson, Monique F; Li, Fan; Clark, Matthew A; Cuozzo, John W; Arico-Muendel, Christopher; Morgan, Barry A
2015-08-13
The aggrecan degrading metalloprotease ADAMTS-4 has been identified as a novel therapeutic target for osteoarthritis. Here, we use DNA-encoded Library Technology (ELT) to identify novel ADAMTS-4 inhibitors from a DNA-encoded triazine library by affinity selection. Structure-activity relationship studies based on the selection information led to the identification of potent and highly selective inhibitors. For example, 4-(((4-(6,7-dimethoxy-3,4-dihydroisoquinolin-2(1H)-yl)-6-(((4-methylpiperazin-1-yl)methyl)amino)-1,3,5-triazin-2-yl)amino)methyl)-N-ethyl-N-(m-tolyl)benzamide has IC50 of 10 nM against ADAMTS-4, with >1000-fold selectivity over ADAMT-5, MMP-13, TACE, and ADAMTS-13. These inhibitors have no obvious zinc ligand functionality.
2015-01-01
The aggrecan degrading metalloprotease ADAMTS-4 has been identified as a novel therapeutic target for osteoarthritis. Here, we use DNA-encoded Library Technology (ELT) to identify novel ADAMTS-4 inhibitors from a DNA-encoded triazine library by affinity selection. Structure–activity relationship studies based on the selection information led to the identification of potent and highly selective inhibitors. For example, 4-(((4-(6,7-dimethoxy-3,4-dihydroisoquinolin-2(1H)-yl)-6-(((4-methylpiperazin-1-yl)methyl)amino)-1,3,5-triazin-2-yl)amino)methyl)-N-ethyl-N-(m-tolyl)benzamide has IC50 of 10 nM against ADAMTS-4, with >1000-fold selectivity over ADAMT-5, MMP-13, TACE, and ADAMTS-13. These inhibitors have no obvious zinc ligand functionality. PMID:26288689
Construction of a general human chromosome jumping library, with application to cystic fibrosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Collins, F.S.; Drumm, M.L.; Cole, J.L.
1987-02-27
In many genetic disorders, the responsible gene and its protein product are unknown. The technique known as reverse genetics, in which chromosomal map positions and genetically linked DNA markers are used to identify and clone such genes, is complicated by the fact that the molecular distances from the closest DNA markers to the gene itself are often too large to traverse by standard cloning techniques. To address this situation, a general human chromosome jumping library was constructed that allows the cloning of DNA sequences approximately 100 kilobases away from any starting point in genomic DNA. As an illustration of itsmore » usefulness, this library was searched for a jumping clone, starting at the met oncogene, which is a marker tightly linked to the cystic fibrosis gene that is located on human chromosome 7. Mapping of the new genomic fragment by pulsed field gel electrophoresis confirmed that it resides on chromosome 7 within 240 kilobases downstream of the met gene. The use of chromosome jumping should be applicable to any genetic locus for which a closely linked DNA marker is available.« less
Inkinen, J; Jayaprakash, B; Santo Domingo, J W; Keinänen-Toivola, M M; Ryu, H; Pitkänen, T
2016-06-01
Next-generation sequencing of 16S ribosomal RNA genes (rDNA) and ribosomal RNA (rRNA) was used to characterize water and biofilm microbiome collected from a drinking water distribution system of an office building after its first year of operation. The total bacterial community (rDNA) and active bacterial members (rRNA) sequencing databases were generated by Illumina MiSeq PE250 platform. As estimated by Chao1 index, species richness in cold water system was lower (180-260) in biofilms (Sphingomonas spp., Methylobacterium spp., Limnohabitans spp., Rhizobiales order) than in waters (250-580), (also Methylotenera spp.) (P = 0·005, n = 20). Similarly species richness (Chao1) was slightly higher (210-580) in rDNA libraries compared to rRNA libraries (150-400; P = 0·054, n = 24). Active Mycobacterium spp. was found in cross-linked polyethylene (PEX), but not in corresponding copper pipeline biofilm. Nonpathogenic Legionella spp. was found in rDNA libraries but not in rRNA libraries. Microbial communities differed between water and biofilms, between cold and hot water systems, locations in the building and between water rRNA and rDNA libraries, as shown by clear clusters in principal component analysis (PcoA). By using the rRNA method, we found that not all bacterial community members were active (e.g. Legionella spp.), whereas other members showed increased activity in some locations; for example, Pseudomonas spp. in hot water circulations' biofilm and order Rhizobiales and Limnohabitans spp. in stagnated locations' water and biofilm. rRNA-based methods may be better than rDNA-based methods for evaluating human health implications as rRNA methods can be used to describe the active bacterial fraction. This study indicates that copper as a pipeline material might have an adverse impact on the occurrence of Mycobacterium spp. The activity of Legionella spp. maybe questionable when detected solely by using DNA-based methods. © 2016 The Society for Applied Microbiology.
Ogunremi, Oladele; Benjamin, Jane; MacDonald, Lily; Schimpf, Robert
2008-12-01
Newly developed serological tests for diagnosing parelaphostrongylosis in cervids, using the excretory-secretory products (ES) of the infective larvae of Parelaphostrongylus tenuis in enzyme-linked immunosorbent assays (ELISAs), have demonstrable superiority over the traditional method of larval recovery and microscopic identification. To generate a source of ELISA antigen by genetic engineering, we created a complementary DNA (cDNA) expression library by the reverse transcription of mRNA of P. tenuis adult worms, and ligation with the vector lambda-ZAP II. The library was screened using antisera produced in mice by immunization with a somatic antigen preparation of adult worms. Seventeen clones were isolated, sequenced, and checked for similarity to other DNA sequences in GenBank. A previously identified parasite gene encoding an aspartyl protease inhibitor (API) was isolated from the cDNA library, subcloned and expressed using the pET expression vector to produce a glutathione S transferase (GST)-His-S.Tag-P. tenuis API fusion protein (molecular weight = 63 kDa). An enzyme-linked immunosorbent assay utilizing the API fusion protein as the coating antigen was used to serologically diagnose all white-tailed deer (WTD, 10 out of 10) that had been inoculated with 6 - 150 L3 P. tenuis, indicating that the antigen may be a useful serodiagnostic antigen for P. tenuis infection in this cervid species.
USDA-ARS?s Scientific Manuscript database
For map-based cloning of genes conferring important traits in the hexaploid wheat line 92R137, a bacterial artificial chromosome (BAC) library, including two sub libraries, was constructed using the genomic DNA of 92R137 digested with restriction enzymes HindIII and BamHI. The BAC library was compos...
Shinozuka, Hiroshi; Forster, John W
2016-01-01
Background. Multiplexed sequencing is commonly performed on massively parallel short-read sequencing platforms such as Illumina, and the efficiency of library normalisation can affect the quality of the output dataset. Although several library normalisation approaches have been established, none are ideal for highly multiplexed sequencing due to issues of cost and/or processing time. Methods. An inexpensive and high-throughput library quantification method has been developed, based on an adaptation of the melting curve assay. Sequencing libraries were subjected to the assay using the Bio-Rad Laboratories CFX Connect(TM) Real-Time PCR Detection System. The library quantity was calculated through summation of reduction of relative fluorescence units between 86 and 95 °C. Results.PCR-enriched sequencing libraries are suitable for this quantification without pre-purification of DNA. Short DNA molecules, which ideally should be eliminated from the library for subsequent processing, were differentiated from the target DNA in a mixture on the basis of differences in melting temperature. Quantification results for long sequences targeted using the melting curve assay were correlated with those from existing methods (R (2) > 0.77), and that observed from MiSeq sequencing (R (2) = 0.82). Discussion.The results of multiplexed sequencing suggested that the normalisation performance of the described method is equivalent to that of another recently reported high-throughput bead-based method, BeNUS. However, costs for the melting curve assay are considerably lower and processing times shorter than those of other existing methods, suggesting greater suitability for highly multiplexed sequencing applications.
A Novel Approach to Assay DNA Methylation in Prostate Cancer
2016-10-01
prepared into libraries according to standard protocols using Bioo Scientific’s DNA Sample Kit (cat. no. 514101, Austin , TX , USA). Libraries were...Medical Research and Materiel Command Fort Detrick, Maryland 21702-5012 DISTRIBUTION STATEMENT: Approved for Public Release; Distribution Unlimited...ADDRESS(ES) 10. SPONSOR/MONITOR’S ACRONYM(S) U.S. Army Medical Research and Materiel Command Fort Detrick, Maryland 21702-5012 11. SPONSOR/MONITOR’S
Shi, Xue; Zeng, Haiyang; Xue, Yadong; Luo, Meizhong
2011-10-11
Large-insert BAC and BIBAC libraries are important tools for structural and functional genomics studies of eukaryotic genomes. To facilitate the construction of BAC and BIBAC libraries and the transfer of complete large BAC inserts into BIBAC vectors, which is desired in positional cloning, we developed a pair of new BAC and BIBAC vectors. The new BAC vector pIndigoBAC536-S and the new BIBAC vector BIBAC-S have the following features: 1) both contain two 18-bp non-palindromic I-SceI sites in an inverted orientation at positions that flank an identical DNA fragment containing the lacZ selection marker and the cloning site. Large DNA inserts can be excised from the vectors as single fragments by cutting with I-SceI, allowing the inserts to be easily sized. More importantly, because the two vectors contain different antibiotic resistance genes for transformant selection and produce the same non-complementary 3' protruding ATAA ends by I-SceI that suppress self- and inter-ligations, the exchange of intact large genomic DNA inserts between the BAC and BIBAC vectors is straightforward; 2) both were constructed as high-copy composite vectors. Reliable linearized and dephosphorylated original low-copy pIndigoBAC536-S and BIBAC-S vectors that are ready for library construction can be prepared from the high-copy composite vectors pHZAUBAC1 and pHZAUBIBAC1, respectively, without the need for additional preparation steps or special reagents, thus simplifying the construction of BAC and BIBAC libraries. BIBAC clones constructed with the new BIBAC-S vector are stable in both E. coli and Agrobacterium. The vectors can be accessed through our website http://GResource.hzau.edu.cn. The two new vectors and their respective high-copy composite vectors can largely facilitate the construction and characterization of BAC and BIBAC libraries. The transfer of complete large genomic DNA inserts from one vector to the other is made straightforward.
Covalent antibody display—an in vitro antibody-DNA library selection system
Reiersen, Herald; Løbersli, Inger; Løset, Geir Å.; Hvattum, Else; Simonsen, Bjørg; Stacy, John E.; McGregor, Duncan; FitzGerald, Kevin; Welschof, Martin; Brekke, Ole H.; Marvik, Ole J.
2005-01-01
The endonuclease P2A initiates the DNA replication of the bacteriophage P2 by making a covalent bond with its own phosphate backbone. This enzyme has now been exploited as a new in vitro display tool for antibody fragments. We have constructed genetic fusions of P2A with single-chain antibodies (scFvs). Linear DNA of these fusion proteins were processed in an in vitro coupled transcription–translation mixture of Escherichia coli S30 lysate. Complexes of scFv–P2A fusion proteins covalently bound to their own DNA were isolated after panning on immobilized antigen, and the enriched DNAs were recovered by PCR and prepared for the subsequent cycles of panning. We have demonstrated the enrichment of scFvs from spiked libraries and the specific selection of different anti-tetanus toxoid scFvs from a V-gene library with 50 million different members prepared from human lymphocytes. This covalent antibody display technology offers a complete in vitro selection system based exclusively on DNA–protein complexes. PMID:15653626
Czar, Michael J; Cai, Yizhi; Peccoud, Jean
2009-07-01
Chemical synthesis of custom DNA made to order calls for software streamlining the design of synthetic DNA sequences. GenoCAD (www.genocad.org) is a free web-based application to design protein expression vectors, artificial gene networks and other genetic constructs composed of multiple functional blocks called genetic parts. By capturing design strategies in grammatical models of DNA sequences, GenoCAD guides the user through the design process. By successively clicking on icons representing structural features or actual genetic parts, complex constructs composed of dozens of functional blocks can be designed in a matter of minutes. GenoCAD automatically derives the construct sequence from its comprehensive libraries of genetic parts. Upon completion of the design process, users can download the sequence for synthesis or further analysis. Users who elect to create a personal account on the system can customize their workspace by creating their own parts libraries, adding new parts to the libraries, or reusing designs to quickly generate sets of related constructs.
Oishi, M; Gohma, H; Lejukole, H Y; Taniguchi, Y; Yamada, T; Suzuki, K; Shinkai, H; Uenishi, H; Yasue, H; Sasaki, Y
2004-05-01
Expressed sequence tags (ESTs) generated based on characterization of clones isolated randomly from cDNA libraries are used to study gene expression profiles in specific tissues and to provide useful information for characterizing tissue physiology. In this study, two directionally cloned cDNA libraries were constructed from 60 day-old bovine whole fetus and fetal placenta. We have characterized 5357 and 1126 clones, and then identified 3464 and 795 unique sequences for the fetus and placenta cDNA libraries: 1851 and 504 showed homology to already identified genes, and 1613 and 291 showed no significant matches to any of the sequences in DNA databases, respectively. Further, we found 94 unique sequences overlapping in both the fetus and the placenta, leading to a catalog of 4165 genes expressed in 60 day-old fetus and placenta. The catalog is used to examine expression profile of genes in 60 day-old bovine fetus and placenta.
Kollmann, Christopher S; Bai, Xiaopeng; Tsai, Ching-Hsuan; Yang, Hongfang; Lind, Kenneth E; Skinner, Steven R; Zhu, Zhengrong; Israel, David I; Cuozzo, John W; Morgan, Barry A; Yuki, Koichi; Xie, Can; Springer, Timothy A; Shimaoka, Motomu; Evindar, Ghotas
2014-04-01
The inhibition of protein-protein interactions remains a challenge for traditional small molecule drug discovery. Here we describe the use of DNA-encoded library technology for the discovery of small molecules that are potent inhibitors of the interaction between lymphocyte function-associated antigen 1 and its ligand intercellular adhesion molecule 1. A DNA-encoded library with a potential complexity of 4.1 billion compounds was exposed to the I-domain of the target protein and the bound ligands were affinity selected, yielding an enriched small-molecule hit family. Compounds representing this family were synthesized without their DNA encoding moiety and found to inhibit the lymphocyte function-associated antigen 1/intercellular adhesion molecule-1 interaction with submicromolar potency in both ELISA and cell adhesion assays. Re-synthesized compounds conjugated to DNA or a fluorophore were demonstrated to bind to cells expressing the target protein. Copyright © 2014 Elsevier Ltd. All rights reserved.
Construction and application of EST library from Setaria italica in response to dehydration stress.
Zhang, Jinpeng; Liu, Tingsong; Fu, Junjie; Zhu, Yun; Jia, Jinping; Zheng, Jun; Zhao, Yinhe; Zhang, Ying; Wang, Guoying
2007-07-01
Foxtail millet is a gramineous crop with low water requirement. Despite its high water use efficiency, less attention has been paid to the molecular genetics of foxtail millet. This article reports the construction of subtracted cDNA libraries from foxtail millet seedlings under dehydration stress and the expression profile analysis of 1947 UniESTs from the subtracted cDNA libraries by a cDNA microarray. The results showed that 95 and 57 ESTs were upregulated by dehydration stress, respectively, in roots and shoots of seedlings and that 10 and 27 ESTs were downregulated, respectively, in roots and shoots. The expression profile analysis showed that genes induced in foxtail millet roots were different from those in shoots during dehydration stress and that the early response to dehydration stress in foxtail millet roots was the activation of the glycolysis metabolism. Moreover, protein degradation pathway may also play a pivotal role in drought-tolerant responses of foxtail millet. Finally, Northern blot analysis validated well the cDNA microarray data.
The Evolution of DNA-Templated Synthesis as a Tool for Materials Discovery.
O'Reilly, Rachel K; Turberfield, Andrew J; Wilks, Thomas R
2017-10-17
Precise control over reactivity and molecular structure is a fundamental goal of the chemical sciences. Billions of years of evolution by natural selection have resulted in chemical systems capable of information storage, self-replication, catalysis, capture and production of light, and even cognition. In all these cases, control over molecular structure is required to achieve a particular function: without structural control, function may be impaired, unpredictable, or impossible. The search for molecules with a desired function is often achieved by synthesizing a combinatorial library, which contains many or all possible combinations of a set of chemical building blocks (BBs), and then screening this library to identify "successful" structures. The largest libraries made by conventional synthesis are currently of the order of 10 8 distinct molecules. To put this in context, there are 10 13 ways of arranging the 21 proteinogenic amino acids in chains up to 10 units long. Given that we know that a number of these compounds have potent biological activity, it would be highly desirable to be able to search them all to identify leads for new drug molecules. Large libraries of oligonucleotides can be synthesized combinatorially and translated into peptides using systems based on biological replication such as mRNA display, with selected molecules identified by DNA sequencing; but these methods are limited to BBs that are compatible with cellular machinery. In order to search the vast tracts of chemical space beyond nucleic acids and natural peptides, an alternative approach is required. DNA-templated synthesis (DTS) could enable us to meet this challenge. DTS controls chemical product formation by using the specificity of DNA hybridization to bring selected reactants into close proximity, and is capable of the programmed synthesis of many distinct products in the same reaction vessel. By making use of dynamic, programmable DNA processes, it is possible to engineer a system that can translate instructions coded as a sequence of DNA bases into a chemical structure-a process analogous to the action of the ribosome in living organisms but with the potential to create a much more chemically diverse set of products. It is also possible to ensure that each product molecule is tagged with its identifying DNA sequence. Compound libraries synthesized in this way can be exposed to selection against suitable targets, enriching successful molecules. The encoding DNA can then be amplified using the polymerase chain reaction and decoded by DNA sequencing. More importantly, the DNA instruction sequences can be mutated and reused during multiple rounds of amplification, translation, and selection. In other words, DTS could be used as the foundation for a system of synthetic molecular evolution, which could allow us to efficiently search a vast chemical space. This has huge potential to revolutionize materials discovery-imagine being able to evolve molecules for light harvesting, or catalysts for CO 2 fixation. The field of DTS has developed to the point where a wide variety of reactions can be performed on a DNA template. Complex architectures and autonomous "DNA robots" have been implemented for the controlled assembly of BBs, and these mechanisms have in turn enabled the one-pot synthesis of large combinatorial libraries. Indeed, DTS libraries are being exploited by pharmaceutical companies and have already found their way into drug lead discovery programs. This Account explores the processes involved in DTS and highlights the challenges that remain in creating a general system for molecular discovery by evolution.
The Evolution of DNA-Templated Synthesis as a Tool for Materials Discovery
2017-01-01
Conspectus Precise control over reactivity and molecular structure is a fundamental goal of the chemical sciences. Billions of years of evolution by natural selection have resulted in chemical systems capable of information storage, self-replication, catalysis, capture and production of light, and even cognition. In all these cases, control over molecular structure is required to achieve a particular function: without structural control, function may be impaired, unpredictable, or impossible. The search for molecules with a desired function is often achieved by synthesizing a combinatorial library, which contains many or all possible combinations of a set of chemical building blocks (BBs), and then screening this library to identify “successful” structures. The largest libraries made by conventional synthesis are currently of the order of 108 distinct molecules. To put this in context, there are 1013 ways of arranging the 21 proteinogenic amino acids in chains up to 10 units long. Given that we know that a number of these compounds have potent biological activity, it would be highly desirable to be able to search them all to identify leads for new drug molecules. Large libraries of oligonucleotides can be synthesized combinatorially and translated into peptides using systems based on biological replication such as mRNA display, with selected molecules identified by DNA sequencing; but these methods are limited to BBs that are compatible with cellular machinery. In order to search the vast tracts of chemical space beyond nucleic acids and natural peptides, an alternative approach is required. DNA-templated synthesis (DTS) could enable us to meet this challenge. DTS controls chemical product formation by using the specificity of DNA hybridization to bring selected reactants into close proximity, and is capable of the programmed synthesis of many distinct products in the same reaction vessel. By making use of dynamic, programmable DNA processes, it is possible to engineer a system that can translate instructions coded as a sequence of DNA bases into a chemical structure—a process analogous to the action of the ribosome in living organisms but with the potential to create a much more chemically diverse set of products. It is also possible to ensure that each product molecule is tagged with its identifying DNA sequence. Compound libraries synthesized in this way can be exposed to selection against suitable targets, enriching successful molecules. The encoding DNA can then be amplified using the polymerase chain reaction and decoded by DNA sequencing. More importantly, the DNA instruction sequences can be mutated and reused during multiple rounds of amplification, translation, and selection. In other words, DTS could be used as the foundation for a system of synthetic molecular evolution, which could allow us to efficiently search a vast chemical space. This has huge potential to revolutionize materials discovery—imagine being able to evolve molecules for light harvesting, or catalysts for CO2 fixation. The field of DTS has developed to the point where a wide variety of reactions can be performed on a DNA template. Complex architectures and autonomous “DNA robots” have been implemented for the controlled assembly of BBs, and these mechanisms have in turn enabled the one-pot synthesis of large combinatorial libraries. Indeed, DTS libraries are being exploited by pharmaceutical companies and have already found their way into drug lead discovery programs. This Account explores the processes involved in DTS and highlights the challenges that remain in creating a general system for molecular discovery by evolution. PMID:28915003
Encoded libraries of chemically modified peptides.
Heinis, Christian; Winter, Greg
2015-06-01
The use of powerful technologies for generating and screening DNA-encoded protein libraries has helped drive the development of proteins as pharmaceutical ligands. However the development of peptides as pharmaceutical ligands has been more limited. Although encoded peptide libraries are typically several orders of magnitude larger than classical chemical libraries, can be more readily screened, and can give rise to higher affinity ligands, their use as pharmaceutical ligands is limited by their intrinsic properties. Two of the intrinsic limitations include the rotational flexibility of the peptide backbone and the limited number (20) of natural amino acids. However these limitations can be overcome by use of chemical modification. For example, the libraries can be modified to introduce topological constraints such as cyclization linkers, or to introduce new chemical entities such as small molecule ligands, fluorophores and photo-switchable compounds. This article reviews the chemistry involved, the properties of the peptide ligands, and the new opportunities offered by chemical modification of DNA-encoded peptide libraries. Copyright © 2015. Published by Elsevier Ltd.
Primary culture of cat intestinal epithelial cells in vitro and the cDNA library construction.
Zhao, Gui Hua; Liu, Ye; Cheng, Yun Tang; Zhao, Qing Song; Qiu, Xiao; Xu, Chao; Xiao, Ting; Zhu, Song; Liu, Gong Zhen; Yin, Kun
2018-06-26
Felids are the only definitive hosts of Toxoplasma gondii. To lay a foundation for screening the T. gondii-felids interaction factors, we have developed a reproducible primary culture method for cat intestinal epithelial cells (IECs). The primary IECs were isolated from a new born cat's small intestine jejunum region without food ingress, and respectively in vitro cultured by tissue cultivation and combined digestion method with collagenase XI and dispase I, then purified by trypsinization. After identification, the ds cDNA of cat IECs was synthesized for constructing pGADT7 homogenization three-frame plasmid, and transformed into the yeast Y187 for generating the cDNA library. Our results indicated that cultivation of primary cat IECs relays on combined digestion to form polarized and confluent monolayers within 3 days with typical features of normal epithelial cells. The purified cells cultured by digestion method were identified to be nature intestinal epithelial cells using immunohistochemical analysis and were able to maintain viability for at least 15 passages. The homogenizable ds cDNA, which is synthesized from the total RNA extracted from our cultured IECs, distributed among 0.5-2.0 kb, and generated satisfying three-frame cDNA library with the capacity of 1.2 × 106 and the titer of 5.2 × 107 pfu/mL. Our results established an optimal method for the culturing and passage of cat IECs model in vitro, and laid a cDNA library foundation for the subsequent interaction factors screening by yeast two-hybrid.
Establishing a community-wide DNA barcode library as a new tool for arctic research.
Wirta, H; Várkonyi, G; Rasmussen, C; Kaartinen, R; Schmidt, N M; Hebert, P D N; Barták, M; Blagoev, G; Disney, H; Ertl, S; Gjelstrup, P; Gwiazdowicz, D J; Huldén, L; Ilmonen, J; Jakovlev, J; Jaschhof, M; Kahanpää, J; Kankaanpää, T; Krogh, P H; Labbee, R; Lettner, C; Michelsen, V; Nielsen, S A; Nielsen, T R; Paasivirta, L; Pedersen, S; Pohjoismäki, J; Salmela, J; Vilkamaa, P; Väre, H; von Tschirnhaus, M; Roslin, T
2016-05-01
DNA sequences offer powerful tools for describing the members and interactions of natural communities. In this study, we establish the to-date most comprehensive library of DNA barcodes for a terrestrial site, including all known macroscopic animals and vascular plants of an intensively studied area of the High Arctic, the Zackenberg Valley in Northeast Greenland. To demonstrate its utility, we apply the library to identify nearly 20 000 arthropod individuals from two Malaise traps, each operated for two summers. Drawing on this material, we estimate the coverage of previous morphology-based species inventories, derive a snapshot of faunal turnover in space and time and describe the abundance and phenology of species in the rapidly changing arctic environment. Overall, 403 terrestrial animal and 160 vascular plant species were recorded by morphology-based techniques. DNA barcodes (CO1) offered high resolution in discriminating among the local animal taxa, with 92% of morphologically distinguishable taxa assigned to unique Barcode Index Numbers (BINs) and 93% to monophyletic clusters. For vascular plants, resolution was lower, with 54% of species forming monophyletic clusters based on barcode regions rbcLa and ITS2. Malaise catches revealed 122 BINs not detected by previous sampling and DNA barcoding. The insect community was dominated by a few highly abundant taxa. Even closely related taxa differed in phenology, emphasizing the need for species-level resolution when describing ongoing shifts in arctic communities and ecosystems. The DNA barcode library now established for Zackenberg offers new scope for such explorations, and for the detailed dissection of interspecific interactions throughout the community. © 2015 John Wiley & Sons Ltd.
Current and future resources for functional metagenomics
Lam, Kathy N.; Cheng, Jiujun; Engel, Katja; Neufeld, Josh D.; Charles, Trevor C.
2015-01-01
Functional metagenomics is a powerful experimental approach for studying gene function, starting from the extracted DNA of mixed microbial populations. A functional approach relies on the construction and screening of metagenomic libraries—physical libraries that contain DNA cloned from environmental metagenomes. The information obtained from functional metagenomics can help in future annotations of gene function and serve as a complement to sequence-based metagenomics. In this Perspective, we begin by summarizing the technical challenges of constructing metagenomic libraries and emphasize their value as resources. We then discuss libraries constructed using the popular cloning vector, pCC1FOS, and highlight the strengths and shortcomings of this system, alongside possible strategies to maximize existing pCC1FOS-based libraries by screening in diverse hosts. Finally, we discuss the known bias of libraries constructed from human gut and marine water samples, present results that suggest bias may also occur for soil libraries, and consider factors that bias metagenomic libraries in general. We anticipate that discussion of current resources and limitations will advance tools and technologies for functional metagenomics research. PMID:26579102
Use of RecA protein to enrich for homologous genes in a genomic library
DOE Office of Scientific and Technical Information (OSTI.GOV)
Taidi-Laskowski, B.; Grumet, F.C.; Tyan, D.
1988-08-25
RecA protein-coated probe has been utilized to enrich genomic digests for desired genes in order to facilitate cloning from genomic libraries. Using a previously cloned HLA-B27 gene as the recA-coated enrichment probe, the authors obtained a mean 108x increase in the ratio of specific to nonspecific plaques in lambda libraries screened for B27 variant alleles of estimated 99% homology to the probe. Class I genes of lesser homology were less enriched. Loss of genomic DNA during the enrichment procedure can, however, restrict application of this technique whenever starting genomic DNA is very limited. Nevertheless, the impressive reduction in cloning effortmore » and material makes recA enrichment a useful new tool for cloning homologous genes from genomic DNA.« less
Metagenomic Analysis of Viral Communities in (Hado)Pelagic Sediments
Yoshida, Mitsuhiro; Takaki, Yoshihiro; Eitoku, Masamitsu; Nunoura, Takuro; Takai, Ken
2013-01-01
In this study, we analyzed viral metagenomes (viromes) in the sedimentary habitats of three geographically and geologically distinct (hado)pelagic environments in the northwest Pacific; the Izu-Ogasawara Trench (water depth = 9,760 m) (OG), the Challenger Deep in the Mariana Trench (10,325 m) (MA), and the forearc basin off the Shimokita Peninsula (1,181 m) (SH). Virus abundance ranged from 106 to 1011 viruses/cm3 of sediments (down to 30 cm below the seafloor [cmbsf]). We recovered viral DNA assemblages (viromes) from the (hado)pelagic sediment samples and obtained a total of 37,458, 39,882, and 70,882 sequence reads by 454 GS FLX Titanium pyrosequencing from the virome libraries of the OG, MA, and SH (hado)pelagic sediments, respectively. Only 24−30% of the sequence reads from each virome library exhibited significant similarities to the sequences deposited in the public nr protein database (E-value <10−3 in BLAST). Among the sequences identified as potential viral genes based on the BLAST search, 95−99% of the sequence reads in each library were related to genes from single-stranded DNA (ssDNA) viral families, including Microviridae, Circoviridae, and Geminiviridae. A relatively high abundance of sequences related to the genetic markers (major capsid protein [VP1] and replication protein [Rep]) of two ssDNA viral groups were also detected in these libraries, thereby revealing a high genotypic diversity of their viruses (833 genotypes for VP1 and 2,551 genotypes for Rep). A majority of the viral genes predicted from each library were classified into three ssDNA viral protein categories: Rep, VP1, and minor capsid protein. The deep-sea sedimentary viromes were distinct from the viromes obtained from the oceanic and fresh waters and marine eukaryotes, and thus, deep-sea sediments harbor novel viromes, including previously unidentified ssDNA viruses. PMID:23468952
Metagenomic analysis of viral communities in (hado)pelagic sediments.
Yoshida, Mitsuhiro; Takaki, Yoshihiro; Eitoku, Masamitsu; Nunoura, Takuro; Takai, Ken
2013-01-01
In this study, we analyzed viral metagenomes (viromes) in the sedimentary habitats of three geographically and geologically distinct (hado)pelagic environments in the northwest Pacific; the Izu-Ogasawara Trench (water depth = 9,760 m) (OG), the Challenger Deep in the Mariana Trench (10,325 m) (MA), and the forearc basin off the Shimokita Peninsula (1,181 m) (SH). Virus abundance ranged from 10(6) to 10(11) viruses/cm(3) of sediments (down to 30 cm below the seafloor [cmbsf]). We recovered viral DNA assemblages (viromes) from the (hado)pelagic sediment samples and obtained a total of 37,458, 39,882, and 70,882 sequence reads by 454 GS FLX Titanium pyrosequencing from the virome libraries of the OG, MA, and SH (hado)pelagic sediments, respectively. Only 24-30% of the sequence reads from each virome library exhibited significant similarities to the sequences deposited in the public nr protein database (E-value <10(-3) in BLAST). Among the sequences identified as potential viral genes based on the BLAST search, 95-99% of the sequence reads in each library were related to genes from single-stranded DNA (ssDNA) viral families, including Microviridae, Circoviridae, and Geminiviridae. A relatively high abundance of sequences related to the genetic markers (major capsid protein [VP1] and replication protein [Rep]) of two ssDNA viral groups were also detected in these libraries, thereby revealing a high genotypic diversity of their viruses (833 genotypes for VP1 and 2,551 genotypes for Rep). A majority of the viral genes predicted from each library were classified into three ssDNA viral protein categories: Rep, VP1, and minor capsid protein. The deep-sea sedimentary viromes were distinct from the viromes obtained from the oceanic and fresh waters and marine eukaryotes, and thus, deep-sea sediments harbor novel viromes, including previously unidentified ssDNA viruses.
[Cosmid libraries containing DNA from human chromosome 13].
Kapanadze, B I; Brodianskiĭ, V M; Baranova, A V; Sevat'ianov, S Iu; Fedorova, N D; Kurskov, M M; Kostina, M A; Mironov, A A; Sineokiĭ, S P; Zakhar'ev, V M; Grafodatskiĭ, A S; Modianov, N N; Iankovskiĭ, N K
1996-03-01
We characterized two cosmid libraries constructed from flow-sorted chromosome 13 at the Imperial Cancer Research Fund (ICRF), UK (13,000 clones) and Los Alamos National Laboratory (LANL), USA (17,000 clones). After storage for two years, clones showed high viability (95%) and structural stability. EcoR I and Hind III restriction patterns were studied in more than 500 ICRF and 200 LANL cosmids. The average size of inserts was shown to be 35-37 kb in both the libraries. Most cosmids (83% and 93% of ICRF and LANL libraries, respectively) exceed the lower size limit of DNA fragments that can be packaged and represent a good source for physical mapping of chromosome 13. Total length of inserts is four and five genome equivalents in the ICRF and LANL libraries, respectively. ICRF cosmids showed hybridization to 22 of 24 unique probes tested, which corresponds to a 90% probability of having any DNA fragment represented in the library. More than 1 Mb of chromosome 13 is overlapped by 90 cosmids of 22 groups revealed. A chromosomal region of more than 150 kb, containing the ATP1AL1 gene for alpha-1 peptide of Na+, K(+)-ATPase, is covered by 12 cosmids forming a contig. The results of restriction and hybridization analyses are stored in a CLONE database. These data and all the cosmids described are publicly available.
Howland, Shanshan W; Poh, Chek-Meng; Rénia, Laurent
2011-09-01
Directional cloning of complementary DNA (cDNA) primed by oligo(dT) is commonly achieved by appending a restriction site to the primer, whereas the second strand is synthesized through the combined action of RNase H and Escherichia coli DNA polymerase I (PolI). Although random primers provide more uniform and complete coverage, directional cloning with the same strategy is highly inefficient. We report that phosphorothioate linkages protect the tail sequence appended to random primers from the 5'→3' exonuclease activity of PolI. We present a simple strategy for constructing a random-primed cDNA library using the efficient, size-independent, and seamless In-Fusion cloning method instead of restriction enzymes. Copyright © 2011 Elsevier Inc. All rights reserved.
Carbohydrate active enzymes revealed in Coptotermes formosanus transcriptome
USDA-ARS?s Scientific Manuscript database
A normalized cDNA library of Coptotermes formosanus was constructed using mixed RNA isolated from workers, soldiers, nymphs and alates of both sexes. Sequencing of this library generated 131,637 EST and 25,939 unigenes were assembled. Carbohydrate active enzymes (CAZymes) revealed in this library we...
Cao, Shuanghe; Siriwardana, Chamindika L; Kumimoto, Roderick W; Holt, Ben F
2011-05-19
Monocots, especially the temperate grasses, represent some of the most agriculturally important crops for both current food needs and future biofuel development. Because most of the agriculturally important grass species are difficult to study (e.g., they often have large, repetitive genomes and can be difficult to grow in laboratory settings), developing genetically tractable model systems is essential. Brachypodium distachyon (hereafter Brachypodium) is an emerging model system for the temperate grasses. To fully realize the potential of this model system, publicly accessible discovery tools are essential. High quality cDNA libraries that can be readily adapted for multiple downstream purposes are a needed resource. Additionally, yeast two-hybrid (Y2H) libraries are an important discovery tool for protein-protein interactions and are not currently available for Brachypodium. We describe the creation of two high quality, publicly available Gateway™ cDNA entry libraries and their derived Y2H libraries for Brachypodium. The first entry library represents cloned cDNA populations from both short day (SD, 8/16-h light/dark) and long day (LD, 20/4-h light/dark) grown plants, while the second library was generated from hormone treated tissues. Both libraries have extensive genome coverage (~5 × 107 primary clones each) and average clone lengths of ~1.5 Kb. These entry libraries were then used to create two recombination-derived Y2H libraries. Initial proof-of-concept screens demonstrated that a protein with known interaction partners could readily re-isolate those partners, as well as novel interactors. Accessible community resources are a hallmark of successful biological model systems. Brachypodium has the potential to be a broadly useful model system for the grasses, but still requires many of these resources. The Gateway™ compatible entry libraries created here will facilitate studies for multiple user-defined purposes and the derived Y2H libraries can be immediately applied to large scale screening and discovery of novel protein-protein interactions. All libraries are freely available for distribution to the research community.
2011-01-01
Background When a specimen belongs to a species not yet represented in DNA barcode reference libraries there is disagreement over the effectiveness of using sequence comparisons to assign the query accurately to a higher taxon. Library completeness and the assignment criteria used have been proposed as critical factors affecting the accuracy of such assignments but have not been thoroughly investigated. We explored the accuracy of assignments to genus, tribe and subfamily in the Sphingidae, using the almost complete global DNA barcode reference library (1095 species) available for this family. Costa Rican sphingids (118 species), a well-documented, diverse subset of the family, with each of the tribes and subfamilies represented were used as queries. We simulated libraries with different levels of completeness (10-100% of the available species), and recorded assignments (positive or ambiguous) and their accuracy (true or false) under six criteria. Results A liberal tree-based criterion assigned 83% of queries accurately to genus, 74% to tribe and 90% to subfamily, compared to a strict tree-based criterion, which assigned 75% of queries accurately to genus, 66% to tribe and 84% to subfamily, with a library containing 100% of available species (but excluding the species of the query). The greater number of true positives delivered by more relaxed criteria was negatively balanced by the occurrence of more false positives. This effect was most sharply observed with libraries of the lowest completeness where, for example at the genus level, 32% of assignments were false positives with the liberal criterion versus < 1% when using the strict. We observed little difference (< 8% using the liberal criterion) however, in the overall accuracy of the assignments between the lowest and highest levels of library completeness at the tribe and subfamily level. Conclusions Our results suggest that when using a strict tree-based criterion for higher taxon assignment with DNA barcodes, the likelihood of assigning a query a genus name incorrectly is very low, if a genus name is provided it has a high likelihood of being accurate, and if no genus match is available the query can nevertheless be assigned to a subfamily with high accuracy regardless of library completeness. DNA barcoding often correctly assigned sphingid moths to higher taxa when species matches were unavailable, suggesting that barcode reference libraries can be useful for higher taxon assignments long before they achieve complete species coverage. PMID:21806794
Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias
2013-09-24
Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousand years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp.
Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias
2013-01-01
Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousand years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp. PMID:24019490
Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun
2013-01-01
In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105
Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun
2013-05-24
In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.
Construction of C35 gene bait recombinants and T47D cell cDNA library.
Yin, Kun; Xu, Chao; Zhao, Gui-Hua; Liu, Ye; Xiao, Ting; Zhu, Song; Yan, Ge
2017-11-20
C35 is a novel tumor biomarker associated with metastasis progression. To investigate the interaction factors of C35 in its high expressed breast cancer cell lines, we constructed bait recombinant plasmids of C35 gene and T47D cell cDNA library for yeast two-hybrid screening. Full length C35 sequences were subcloned using RT-PCR from cDNA template extracted from T47D cells. Based on functional domain analysis, the full-length C35 1-348bp was also truncated into two fragments C351-153bp and C35154-348bp to avoid auto-activation. The three kinds of C35 genes were successfully amplified and inserted into pGBKT7 to construct bait recombinant plasmids pGBKT7-C351-348bp, pGBKT7-C351-153bp and pGBKT7-C35154-348bp, then transformed into Y187 yeast cells by the lithium acetate method. Auto-activation and toxicity of C35 baits were detected using nutritional deficient medium and X-α-Gal assays. The T47D cell ds cDNA was generated by SMART TM technology and the library was constructed using in vivo recombination-mediated cloning in the AH109 yeast strain using a pGADT7-Rec plasmid. The transformed Y187/pGBKT7-C351-348bp line was intensively inhibited while the truncated Y187/pGBKT7-C35 lines had no auto-activation and toxicity in yeast cells. The titer of established cDNA library was 2 × 10 7 pfu/mL with high transformation efficiency of 1.4 × 10 6 , and the insert size of ds cDNA was distributed homogeneously between 0.5-2.0 kb. Our research generated a T47D cell cDNA library with high titer, and the constructed two C35 "baits" contained a respective functional immunoreceptor tyrosine based activation motif (ITAM) and the conserved last four amino acids Cys-Ile-Leu-Val (CILV) motif, and therefore laid a foundation for screening the C35 interaction factors in a BC cell line.
Li, XiaoChing; Wang, Xiu-Jie; Tannenhauser, Jonathan; Podell, Sheila; Mukherjee, Piali; Hertel, Moritz; Biane, Jeremy; Masuda, Shoko; Nottebohm, Fernando; Gaasterland, Terry
2007-01-01
Vocal learning and neuronal replacement have been studied extensively in songbirds, but until recently, few molecular and genomic tools for songbird research existed. Here we describe new molecular/genomic resources developed in our laboratory. We made cDNA libraries from zebra finch (Taeniopygia guttata) brains at different developmental stages. A total of 11,000 cDNA clones from these libraries, representing 5,866 unique gene transcripts, were randomly picked and sequenced from the 3′ ends. A web-based database was established for clone tracking, sequence analysis, and functional annotations. Our cDNA libraries were not normalized. Sequencing ESTs without normalization produced many developmental stage-specific sequences, yielding insights into patterns of gene expression at different stages of brain development. In particular, the cDNA library made from brains at posthatching day 30–50, corresponding to the period of rapid song system development and song learning, has the most diverse and richest set of genes expressed. We also identified five microRNAs whose sequences are highly conserved between zebra finch and other species. We printed cDNA microarrays and profiled gene expression in the high vocal center of both adult male zebra finches and canaries (Serinus canaria). Genes differentially expressed in the high vocal center were identified from the microarray hybridization results. Selected genes were validated by in situ hybridization. Networks among the regulated genes were also identified. These resources provide songbird biologists with tools for genome annotation, comparative genomics, and microarray gene expression analysis. PMID:17426146
Sanders, Ashley D; Falconer, Ester; Hills, Mark; Spierings, Diana C J; Lansdorp, Peter M
2017-06-01
The ability to distinguish between genome sequences of homologous chromosomes in single cells is important for studies of copy-neutral genomic rearrangements (such as inversions and translocations), building chromosome-length haplotypes, refining genome assemblies, mapping sister chromatid exchange events and exploring cellular heterogeneity. Strand-seq is a single-cell sequencing technology that resolves the individual homologs within a cell by restricting sequence analysis to the DNA template strands used during DNA replication. This protocol, which takes up to 4 d to complete, relies on the directionality of DNA, in which each single strand of a DNA molecule is distinguished based on its 5'-3' orientation. Culturing cells in a thymidine analog for one round of cell division labels nascent DNA strands, allowing for their selective removal during genomic library construction. To preserve directionality of template strands, genomic preamplification is bypassed and labeled nascent strands are nicked and not amplified during library preparation. Each single-cell library is multiplexed for pooling and sequencing, and the resulting sequence data are aligned, mapping to either the minus or plus strand of the reference genome, to assign template strand states for each chromosome in the cell. The major adaptations to conventional single-cell sequencing protocols include harvesting of daughter cells after a single round of BrdU incorporation, bypassing of whole-genome amplification, and removal of the BrdU + strand during Strand-seq library preparation. By sequencing just template strands, the structure and identity of each homolog are preserved.
Determination of a Screening Metric for High Diversity DNA Libraries.
Guido, Nicholas J; Handerson, Steven; Joseph, Elaine M; Leake, Devin; Kung, Li A
2016-01-01
The fields of antibody engineering, enzyme optimization and pathway construction rely increasingly on screening complex variant DNA libraries. These highly diverse libraries allow researchers to sample a maximized sequence space; and therefore, more rapidly identify proteins with significantly improved activity. The current state of the art in synthetic biology allows for libraries with billions of variants, pushing the limits of researchers' ability to qualify libraries for screening by measuring the traditional quality metrics of fidelity and diversity of variants. Instead, when screening variant libraries, researchers typically use a generic, and often insufficient, oversampling rate based on a common rule-of-thumb. We have developed methods to calculate a library-specific oversampling metric, based on fidelity, diversity, and representation of variants, which informs researchers, prior to screening the library, of the amount of oversampling required to ensure that the desired fraction of variant molecules will be sampled. To derive this oversampling metric, we developed a novel alignment tool to efficiently measure frequency counts of individual nucleotide variant positions using next-generation sequencing data. Next, we apply a method based on the "coupon collector" probability theory to construct a curve of upper bound estimates of the sampling size required for any desired variant coverage. The calculated oversampling metric will guide researchers to maximize their efficiency in using highly variant libraries.
Vartanian, Jean-Pierre; Wain-Hobson, Simon
2002-05-28
Nuclear mtDNA sequences (numts) are a widespread family of paralogs evolving as pseudogenes in chromosomal DNA [Zhang, D. E. & Hewitt, G. M. (1996) TREE 11, 247-251 and Bensasson, D., Zhang, D., Hartl, D. L. & Hewitt, G. M. (2001) TREE 16, 314-321]. When trying to identify the species origin of an unknown DNA sample by way of an mtDNA locus, PCR may amplify both mtDNA and numts. Indeed, occasionally numts dominate confounding attempts at species identification [Bensasson, D., Zhang, D. X. & Hewitt, G. M. (2000) Mol. Biol. Evol. 17, 406-415; Wallace, D. C., et al. (1997) Proc. Natl. Acad. Sci. USA 94, 14900-14905]. Rhesus and cynomolgus macaque mtDNA haplotypes were identified in a study of oral polio vaccine samples dating from the late 1950s [Blancou, P., et al. (2001) Nature (London) 410, 1045-1046]. They were accompanied by a number of putative numts. To confirm that these putative numts were of macaque origin, a library of numts corresponding to a small segment of 12S rDNA locus has been made by using DNA from a Chinese rhesus macaque. A broad distribution was found with up to 30% sequence variation. Phylogenetic analysis showed that the evolutionary trajectories of numts and bona fide mtDNA haplotypes do not overlap with the signal exception of the host species; mtDNA fragments are continually crossing over into the germ line. In the case of divergent mtDNA sequences from old oral polio vaccine samples [Blancou, P., et al. (2001) Nature (London) 410, 1045-1046], all were closely related to numts in the Chinese macaque library.
Lloyd, Sonja J; LaPatra, Scott E; Snekvik, Kevin R; St-Hilaire, Sophie; Cain, Kenneth D; Call, Douglas R
2008-11-20
Strawberry disease (SD) in the USA is a skin disorder of unknown etiology that occurs in rainbow trout Oncorhynchus mykiss and is characterized by bright red inflammatory lesions. To identify a candidate bacterial agent responsible for SD, we constructed 16S rDNA libraries from 7 SD lesion samples and 2 apparently healthy skin samples from SD-affected fish. A 16S rDNA sequence highly similar to members of the order Rickettsiales was present in 3 lesion libraries at 1%, 32% and 54% prevalence, but this sequence was not found in either healthy tissue library. Based on phylogenetic analysis, this Rickettsia-like organism (RLO) sequence is most closely related to 16S rDNA sequences of bacteria that may form a novel lineage within the Rickettsiales. We used nested PCR assays to screen 25 SD-affected fish for RLO or Flavobacterium psychrophilum DNA. Sixteen lesion samples were positive for the RLO sequence and 4 of the matched healthy samples were positive resulting in a significant association between SD lesions and presence of RLO DNA. While F. psychrophilum is reportedly associated with 'cold water strawberry disease' in the UK, we found no significant association between SD lesions and the presence of F. psychrophilum DNA. The statistical association between SD lesions and presence of RLO DNA is not proof of etiology, but these data suggest that RLO may play a role in SD in southern Idaho, USA.
2009-01-01
Background This study reports progress in assembling a DNA barcode reference library for Ephemeroptera, Plecoptera, and Trichoptera ("EPTs") from a Canadian subarctic site, which is the focus of a comprehensive biodiversity inventory using DNA barcoding. These three groups of aquatic insects exhibit a moderate level of species diversity, making them ideal for testing the feasibility of DNA barcoding for routine biotic surveys. We explore the correlation between the morphological species delineations, DNA barcode-based haplotype clusters delimited by a sequence threshold (2%), and a threshold-free approach to biodiversity quantification--phylogenetic diversity. Results A DNA barcode reference library is built for 112 EPT species for the focal region, consisting of 2277 COI sequences. Close correspondence was found between EPT morphospecies and haplotype clusters as designated using a standard threshold value. Similarly, the shapes of taxon accumulation curves based upon haplotype clusters were very similar to those generated using phylogenetic diversity accumulation curves, but were much more computationally efficient. Conclusion The results of this study will facilitate other lines of research on northern EPTs and also bode well for rapidly conducting initial biodiversity assessments in unknown EPT faunas. PMID:20003245
NASA Astrophysics Data System (ADS)
Yu, Jianzhong; Ma, Xiaolei; Pan, Kehou; Yang, Guanpin; Yu, Wengong
2010-07-01
We constructed and characterized a normalized cDNA library of Nannochloropsis oculata CS-179, and obtained 905 nonredundant sequences (NRSs) ranging from 431-1 756 bp in length. Among them, 496 were very similar to nonredundant ones in the GenBank ( E ≤1.0e-05), and 349 ESTs had significant hits with the clusters of eukaryotic orthologous groups (KOG). Bases G and/or C at the third position of codons of 14 amino acid residues suggested a strong bias in the conserved domain of 362 NRSs (>60%). We also identified the unigenes encoding phosphorus and nitrogen transporters, suggesting that N. oculata could efficiently transport and metabolize phosphorus and nitrogen, and recognized the unigenes that involved in biosynthesis and storage of both fatty acids and polyunsaturated fatty acids (PUFAs), which will facilitate the demonstration of eicosapentaenoic acid (EPA) biosynthesis pathway of N. oculata. In comparison with the original cDNA library, the normalized library significantly increased the efficiencies of random sequencing and rarely expressed genes discovering, and decreased the frequency of abundant gene sequences.
Capture-SELEX: Selection of DNA Aptamers for Aminoglycoside Antibiotics
2012-01-01
Small organic molecules are challenging targets for an aptamer selection using the SELEX technology (SELEX—Systematic Evolution of Ligans by EXponential enrichment). Often they are not suitable for immobilization on solid surfaces, which is a common procedure in known aptamer selection methods. The Capture-SELEX procedure allows the selection of DNA aptamers for solute targets. A special SELEX library was constructed with the aim to immobilize this library on magnetic beads or other surfaces. For this purpose a docking sequence was incorporated into the random region of the library enabling hybridization to a complementary oligo fixed on magnetic beads. Oligonucleotides of the library which exhibit high affinity to the target and a secondary structure fitting to the target are released from the beads for binding to the target during the aptamer selection process. The oligonucleotides of these binding complexes were amplified, purified, and immobilized via the docking sequence to the magnetic beads as the starting point of the following selection round. Based on this Capture-SELEX procedure, the successful DNA aptamer selection for the aminoglycoside antibiotic kanamycin A as a small molecule target is described. PMID:23326761
Castañón, Jesús; Román, José Pablo; Jessop, Theodore C; de Blas, Jesús; Haro, Rubén
2018-06-01
DNA-encoded libraries (DELs) have emerged as an efficient and cost-effective drug discovery tool for the exploration and screening of very large chemical space using small-molecule collections of unprecedented size. Herein, we report an integrated automation and informatics system designed to enhance the quality, efficiency, and throughput of the production and affinity selection of these libraries. The platform is governed by software developed according to a database-centric architecture to ensure data consistency, integrity, and availability. Through its versatile protocol management functionalities, this application captures the wide diversity of experimental processes involved with DEL technology, keeps track of working protocols in the database, and uses them to command robotic liquid handlers for the synthesis of libraries. This approach provides full traceability of building-blocks and DNA tags in each split-and-pool cycle. Affinity selection experiments and high-throughput sequencing reads are also captured in the database, and the results are automatically deconvoluted and visualized in customizable representations. Researchers can compare results of different experiments and use machine learning methods to discover patterns in data. As of this writing, the platform has been validated through the generation and affinity selection of various libraries, and it has become the cornerstone of the DEL production effort at Lilly.
The construction of an EST database for Bombyx mori and its application
Mita, Kazuei; Morimyo, Mitsuoki; Okano, Kazuhiro; Koike, Yoshiko; Nohata, Junko; Kawasaki, Hideki; Kadono-Okuda, Keiko; Yamamoto, Kimiko; Suzuki, Masataka G.; Shimada, Toru; Goldsmith, Marian R.; Maeda, Susumu
2003-01-01
To build a foundation for the complete genome analysis of Bombyx mori, we have constructed an EST database. Because gene expression patterns deeply depend on tissues as well as developmental stages, we analyzed many cDNA libraries prepared from various tissues and different developmental stages to cover the entire set of Bombyx genes. So far, the Bombyx EST database contains 35,000 ESTs from 36 cDNA libraries, which are grouped into ≈11,000 nonredundant ESTs with the average length of 1.25 kb. The comparison with FlyBase suggests that the present EST database, SilkBase, covers >55% of all genes of Bombyx. The fraction of library-specific ESTs in each cDNA library indicates that we have not yet reached saturation, showing the validity of our strategy for constructing an EST database to cover all genes. To tackle the coming saturation problem, we have checked two methods, subtraction and normalization, to increase coverage and decrease the number of housekeeping genes, resulting in a 5–11% increase of library-specific ESTs. The identification of a number of genes and comprehensive cloning of gene families have already emerged from the SilkBase search. Direct links of SilkBase with FlyBase and WormBase provide ready identification of candidate Lepidoptera-specific genes. PMID:14614147
Thaitrong, Numrin; Kim, Hanyoup; Renzi, Ronald F; Bartsch, Michael S; Meagher, Robert J; Patel, Kamlesh D
2012-12-01
We have developed an automated quality control (QC) platform for next-generation sequencing (NGS) library characterization by integrating a droplet-based digital microfluidic (DMF) system with a capillary-based reagent delivery unit and a quantitative CE module. Using an in-plane capillary-DMF interface, a prepared sample droplet was actuated into position between the ground electrode and the inlet of the separation capillary to complete the circuit for an electrokinetic injection. Using a DNA ladder as an internal standard, the CE module with a compact LIF detector was capable of detecting dsDNA in the range of 5-100 pg/μL, suitable for the amount of DNA required by the Illumina Genome Analyzer sequencing platform. This DMF-CE platform consumes tenfold less sample volume than the current Agilent BioAnalyzer QC technique, preserving precious sample while providing necessary sensitivity and accuracy for optimal sequencing performance. The ability of this microfluidic system to validate NGS library preparation was demonstrated by examining the effects of limited-cycle PCR amplification on the size distribution and the yield of Illumina-compatible libraries, demonstrating that as few as ten cycles of PCR bias the size distribution of the library toward undesirable larger fragments. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Problem-Solving Test: Expression Cloning of the Erythropoietin Receptor
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2008-01-01
Terms to be familiar with before you start to solve the test: cytokines, cytokine receptors, cDNA library, cDNA synthesis, poly(A)[superscript +] RNA, primer, template, reverse transcriptase, restriction endonucleases, cohesive ends, expression vector, promoter, Shine-Dalgarno sequence, poly(A) signal, DNA helicase, DNA ligase, topoisomerases,…
Travis, G H; Sutcliffe, J G
1988-01-01
To isolate cDNA clones of low-abundance mRNAs expressed in monkey cerebral cortex but absent from cerebellum, we developed an improved subtractive cDNA cloning procedure that requires only modest quantities of mRNA. Plasmid DNA from a monkey cerebellum cDNA library was hybridized in large excess to radiolabeled monkey cortex cDNA in a phenol emulsion-enhanced reaction. The unhybridized cortex cDNA was isolated by chromatography on hydroxyapatite and used to probe colonies from a monkey cortex cDNA library. Of 60,000 colonies screened, 163 clones were isolated and confirmed by colony hybridization or RNA blotting to represent mRNAs, ranging from 0.001% to 0.1% abundance, specific to or highly enriched in cerebral cortex relative to cerebellum. Clones of one medium-abundance mRNA were recovered almost quantitatively. Two of the lower-abundance mRNAs were expressed at levels reduced by a factor of 10 in Alzheimer disease relative to normal human cortex. One of these was identified as the monkey preprosomatostatin I mRNA. Images PMID:2894033
Comparison of the canine and human acid {beta}-galactosidase gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahern-Rindell, A.J.; Kretz, K.A.; O`Brien, J.S.
Several canine cDNA libraries were screened with human {beta}-galactosidase cDNA as probe. Seven positive clones were isolated and sequenced yielding a partial (2060 bp) canine {beta}-galactosidase cDNA with 86% identity to the human {beta}-galactosidase cDNA. Preliminary analysis of a canine genomic library indicated conservation of exon number and size. Analysis by Northern blotting disclosed a single mRNA of 2.4 kb in fibroblasts and liver from normal dogs and dogs affected with GM1 gangliosidosis. Although incomplete, these results indicate canine GM1 gangliosidosis is a suitable animal model of the human disease and should further efforts to devise a gene therapy strategymore » for its treatment. 20 refs., 2 figs., 1 tab.« less
Genetic Constructor: An Online DNA Design Platform.
Bates, Maxwell; Lachoff, Joe; Meech, Duncan; Zulkower, Valentin; Moisy, Anaïs; Luo, Yisha; Tekotte, Hille; Franziska Scheitz, Cornelia Johanna; Khilari, Rupal; Mazzoldi, Florencio; Chandran, Deepak; Groban, Eli
2017-12-15
Genetic Constructor is a cloud Computer Aided Design (CAD) application developed to support synthetic biologists from design intent through DNA fabrication and experiment iteration. The platform allows users to design, manage, and navigate complex DNA constructs and libraries, using a new visual language that focuses on functional parts abstracted from sequence. Features like combinatorial libraries and automated primer design allow the user to separate design from construction by focusing on functional intent, and design constraints aid iterative refinement of designs. A plugin architecture enables contributions from scientists and coders to leverage existing powerful software and connect to DNA foundries. The software is easily accessible and platform agnostic, free for academics, and available in an open-source community edition. Genetic Constructor seeks to democratize DNA design, manufacture, and access to tools and services from the synthetic biology community.
Surveying the repair of ancient DNA from bones via high-throughput sequencing.
Mouttham, Nathalie; Klunk, Jennifer; Kuch, Melanie; Fourney, Ron; Poinar, Hendrik
2015-07-01
DNA damage in the form of abasic sites, chemically altered nucleotides, and strand fragmentation is the foremost limitation in obtaining genetic information from many ancient samples. Upon cell death, DNA continues to endure various chemical attacks such as hydrolysis and oxidation, but repair pathways found in vivo no longer operate. By incubating degraded DNA with specific enzyme combinations adopted from these pathways, it is possible to reverse some of the post-mortem nucleic acid damage prior to downstream analyses such as library preparation, targeted enrichment, and high-throughput sequencing. Here, we evaluate the performance of two available repair protocols on previously characterized DNA extracts from four mammoths. Both methods use endonucleases and glycosylases along with a DNA polymerase-ligase combination. PreCR Repair Mix increases the number of molecules converted to sequencing libraries, leading to an increase in endogenous content and a decrease in cytosine-to-thymine transitions due to cytosine deamination. However, the effects of Nelson Repair Mix on repair of DNA damage remain inconclusive.
He, Bifang; Tjhung, Katrina F; Bennett, Nicholas J; Chou, Ying; Rau, Andrea; Huang, Jian; Derda, Ratmir
2018-01-19
Understanding the composition of a genetically-encoded (GE) library is instrumental to the success of ligand discovery. In this manuscript, we investigate the bias in GE-libraries of linear, macrocyclic and chemically post-translationally modified (cPTM) tetrapeptides displayed on the M13KE platform, which are produced via trinucleotide cassette synthesis (19 codons) and NNK-randomized codon. Differential enrichment of synthetic DNA {S}, ligated vector {L} (extension and ligation of synthetic DNA into the vector), naïve libraries {N} (transformation of the ligated vector into the bacteria followed by expression of the library for 4.5 hours to yield a "naïve" library), and libraries chemically modified by aldehyde ligation and cysteine macrocyclization {M} characterized by paired-end deep sequencing, detected a significant drop in diversity in {L} → {N}, but only a minor compositional difference in {S} → {L} and {N} → {M}. Libraries expressed at the N-terminus of phage protein pIII censored positively charged amino acids Arg and Lys; libraries expressed between pIII domains N1 and N2 overcame Arg/Lys-censorship but introduced new bias towards Gly and Ser. Interrogation of biases arising from cPTM by aldehyde ligation and cysteine macrocyclization unveiled censorship of sequences with Ser/Phe. Analogous analysis can be used to explore library diversity in new display platforms and optimize cPTM of these libraries.
An improved yeast transformation method for the generation of very large human antibody libraries.
Benatuil, Lorenzo; Perez, Jennifer M; Belk, Jonathan; Hsieh, Chung-Ming
2010-04-01
Antibody library selection by yeast display technology is an efficient and highly sensitive method to identify binders to target antigens. This powerful selection tool, however, is often hampered by the typically modest size of yeast libraries (approximately 10(7)) due to the limited yeast transformation efficiency, and the full potential of the yeast display technology for antibody discovery and engineering can only be realized if it can be coupled with a mean to generate very large yeast libraries. We describe here a yeast transformation method by electroporation that allows for the efficient generation of large antibody libraries up to 10(10) in size. Multiple components and conditions including CaCl(2), MgCl(2), sucrose, sorbitol, lithium acetate, dithiothreitol, electroporation voltage, DNA input and cell volume have been tested to identify the best combination. By applying this developed protocol, we have constructed a 1.4 x 10(10) human spleen antibody library essentially in 1 day with a transformation efficiency of 1-1.5 x 10(8) transformants/microg vector DNA. Taken together, we have developed a highly efficient yeast transformation method that enables the generation of very large and productive human antibody libraries for antibody discovery, and we are now routinely making 10(9) libraries in a day for antibody engineering purposes.
NASA Astrophysics Data System (ADS)
Liang, R.; Lau, M.; Vishnivetskaya, T. A.; Lloyd, K. G.; Pfiffner, S. M.; Rivkina, E.; Onstott, T. C.
2017-12-01
The prevalence of microorganisms in frozen permafrost has been well documented in ancient sediment up to several million years old. However, the long term survivability and metabolic activity of microbes over geological timespans remain underexplored. Siberian permafrost sediment was collected at various depths (1.4m, 11.8 m and 24.8m) to represent a wide range of geological time from thousands to millions of years. Extracellular (eDNA) and intracellular DNA (iDNA) was simultaneously recovered for sequencing to characterize the potentially extinct and extant microbial community. Additionally, aspartic acid racemization assay (D/L Asp) was used to infer the metabolic activity of microbes in ancient permafrost. As compared with the young sample (1.4m), DNA yield and content of aspartic acid dramatically decreased in old samples (11.8m and 24.8m). However, D/L Asp and eDNA/iDNA significantly increased with the geological age. Such findings suggested that ancient microbiomes might be subjected to racemization or even DNA/proteins degradation at subzero temperature over the wide geological time scale. Preliminary characterization of microbial community indicated that the majority of sequences in old samples were identified as bacteria and only a small fraction was identified as archaea from the iDNA pool. While the eDNA and iDNA fractions shared similar dominant taxa at phylum level, the relative abundance of Proteobacteria in eDNA library was much higher than iDNA. By contrast, the phylum affiliated with Firmicutes was more numerically abundant in the iDNA fraction. More dramatic differences were observed between eDNA and iDNA library at lower taxonomic levels. Particularly, the microbial lineages affiliated with the genera Methanoregula, Desulfosporosinus and Syntrophomonas were only detected in the iDNA library. Such taxonomic difference between the relic eDNA and iDNA suggested that numerous species become locally "extinct" whereas many other taxa might survive in ancient sediment. Ultimately, when coupling our current findings to the D/L Asp in cellular proteins and metaproteomics, a better understanding will be achieved about the microbial activity of the extant microbial community and their roles in biogeochemical cycling in ancient permafrost.
Helm, Jared R.; Hertz-Fowler, Christiane; Aslett, Martin; Berriman, Matthew; Sanders, Mandy; Quail, Michael A.; Soares, Marcelo B.; Bonaldo, Maria F.; Sakurai, Tatsuya; Inoue, Noboru; Donelson, John E.
2009-01-01
Trypanosoma congolense is one of the most economically important pathogens of livestock in Africa. Culture-derived parasites of each of the three main insect stages of the T. congolense life cycle, i.e., the procyclic, epimastigote and metacyclic stages, and bloodstream stage parasites isolated from infected mice, were used to construct stage-specific cDNA libraries and expressed sequence tags (ESTs or cDNA clones) in each library were sequenced. Thirteen EST clusters encoding different variant surface glycoproteins (VSGs) were detected in the metacyclic library and twenty-six VSG EST clusters were found in the bloodstream library, six of which are shared by the metacyclic library. Rare VSG ESTs are present in the epimastigote library, and none were detected in the procyclic library. ESTs encoding enzymes that catalyze oxidative phosphorylation and amino acid metabolism are about twice as abundant in the procyclic and epimastigote stages as in the metacyclic and bloodstream stages. In contrast, ESTs encoding enzymes involved in glycolysis, the citric acid cycle and nucleotide metabolism are about the same in all four developmental stages. Cysteine proteases, kinases and phosphatases are the most abundant enzyme groups represented by the ESTs. All four libraries contain T. congolense-specific expressed sequences not present in the T. brucei and T. cruzi genomes. Normalized cDNA libraries were constructed from the metacyclic and bloodstream stages, and found to be further enriched for T. congolense-specific ESTs. Given that cultured T. congolense offers an experimental advantage over other African trypanosome species, these ESTs provide a basis for further investigation of the molecular properties of these four developmental stages, especially the epimastigote and metacyclic stages for which it is difficult to obtain large quantities of organisms. The T. congolense EST databases are available at: http://www.sanger.ac.uk/Projects/T_congolense/EST_index.shtml. PMID:19559733
Darwin Assembly: fast, efficient, multi-site bespoke mutagenesis
Cozens, Christopher
2018-01-01
Abstract Engineering proteins for designer functions and biotechnological applications almost invariably requires (or at least benefits from) multiple mutations to non-contiguous residues. Several methods for multiple site-directed mutagenesis exist, but there remains a need for fast and simple methods to efficiently introduce such mutations – particularly for generating large, high quality libraries for directed evolution. Here, we present Darwin Assembly, which can deliver high quality libraries of >108 transformants, targeting multiple (>10) distal sites with minimal wild-type contamination (<0.25% of total population) and which takes a single working day from purified plasmid to library transformation. We demonstrate its efficacy with whole gene codon reassignment of chloramphenicol acetyl transferase, mutating 19 codons in a single reaction in KOD DNA polymerase and generating high quality, multiple-site libraries in T7 RNA polymerase and Tgo DNA polymerase. Darwin Assembly uses commercially available enzymes, can be readily automated, and offers a cost-effective route to highly complex and customizable library generation. PMID:29409059
Horse cDNA clones encoding two MHC class I genes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barbis, D.P.; Maher, J.K.; Stanek, J.
1994-12-31
Two full-length clones encoding MHC class I genes were isolated by screening a horse cDNA library, using a probe encoding in human HLA-A2.2Y allele. The library was made in the pcDNA1 vector (Invitrogen, San Diego, CA), using mRNA from peripheral blood lymphocytes obtained from a Thoroughbred stallion (No. 0834) homozygous for a common horse MHC haplotype (ELA-A2, -B2, -D2; Antczak et al. 1984; Donaldson et al. 1988). The clones were sequenced, using SP6 and T7 universal primers and horse-specific oligonucleotides designed to extend previously determined sequences.
An Integrated Microfluidic Processor for DNA-Encoded Combinatorial Library Functional Screening
2017-01-01
DNA-encoded synthesis is rekindling interest in combinatorial compound libraries for drug discovery and in technology for automated and quantitative library screening. Here, we disclose a microfluidic circuit that enables functional screens of DNA-encoded compound beads. The device carries out library bead distribution into picoliter-scale assay reagent droplets, photochemical cleavage of compound from the bead, assay incubation, laser-induced fluorescence-based assay detection, and fluorescence-activated droplet sorting to isolate hits. DNA-encoded compound beads (10-μm diameter) displaying a photocleavable positive control inhibitor pepstatin A were mixed (1920 beads, 729 encoding sequences) with negative control beads (58 000 beads, 1728 encoding sequences) and screened for cathepsin D inhibition using a biochemical enzyme activity assay. The circuit sorted 1518 hit droplets for collection following 18 min incubation over a 240 min analysis. Visual inspection of a subset of droplets (1188 droplets) yielded a 24% false discovery rate (1166 pepstatin A beads; 366 negative control beads). Using template barcoding strategies, it was possible to count hit collection beads (1863) using next-generation sequencing data. Bead-specific barcodes enabled replicate counting, and the false discovery rate was reduced to 2.6% by only considering hit-encoding sequences that were observed on >2 beads. This work represents a complete distributable small molecule discovery platform, from microfluidic miniaturized automation to ultrahigh-throughput hit deconvolution by sequencing. PMID:28199790
An Integrated Microfluidic Processor for DNA-Encoded Combinatorial Library Functional Screening.
MacConnell, Andrew B; Price, Alexander K; Paegel, Brian M
2017-03-13
DNA-encoded synthesis is rekindling interest in combinatorial compound libraries for drug discovery and in technology for automated and quantitative library screening. Here, we disclose a microfluidic circuit that enables functional screens of DNA-encoded compound beads. The device carries out library bead distribution into picoliter-scale assay reagent droplets, photochemical cleavage of compound from the bead, assay incubation, laser-induced fluorescence-based assay detection, and fluorescence-activated droplet sorting to isolate hits. DNA-encoded compound beads (10-μm diameter) displaying a photocleavable positive control inhibitor pepstatin A were mixed (1920 beads, 729 encoding sequences) with negative control beads (58 000 beads, 1728 encoding sequences) and screened for cathepsin D inhibition using a biochemical enzyme activity assay. The circuit sorted 1518 hit droplets for collection following 18 min incubation over a 240 min analysis. Visual inspection of a subset of droplets (1188 droplets) yielded a 24% false discovery rate (1166 pepstatin A beads; 366 negative control beads). Using template barcoding strategies, it was possible to count hit collection beads (1863) using next-generation sequencing data. Bead-specific barcodes enabled replicate counting, and the false discovery rate was reduced to 2.6% by only considering hit-encoding sequences that were observed on >2 beads. This work represents a complete distributable small molecule discovery platform, from microfluidic miniaturized automation to ultrahigh-throughput hit deconvolution by sequencing.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tomkinson, B.; Jonsson, A-K
1991-01-01
Tripeptidyl peptidase II is a high molecular weight serine exopeptidase, which has been purified from rat liver and human erythrocytes. Four clones, representing 4453 bp, or 90{percent} of the mRNA of the human enzyme, have been isolated from two different cDNA libraries. One clone, designated A2, was obtained after screening a human B-lymphocyte cDNA library with a degenerated oligonucleotide mixture. The B-lymphocyte cDNA library, obtained from human fibroblasts, were rescreened with a 147 bp fragment from the 5{prime} part of the A2 clone, whereby three different overlapping cDNA clones could be isolated. The deduced amino acid sequence, 1196 amino acidmore » residues, corresponding to the longest open rading frame of the assembled nucleotide sequence, was compared to sequences of current databases. This revealed a 56{percent} similarity between the bacterial enzyme subtilisin and the N-terminal part of tripeptidyl peptidase II. The enzyme was found to be represented by two different mRNAs of 4.2 and 5.0 kilobases, respectively, which probably result from the utilziation of two different polyadenylation sites. Futhermore, cDNA corresponding to both the N-terminal and C-terminal part of tripeptidyl peptidase II hybridized with genomic DNA from mouse, horse, calf, and hen, even under fairly high stringency conditions, indicating that tripeptidyl peptidase II is highly conserved.« less
Morinière, Jérôme; Hendrich, Lars; Balke, Michael; Beermann, Arne J; König, Tobias; Hess, Monika; Koch, Stefan; Müller, Reinhard; Leese, Florian; Hebert, Paul D N; Hausmann, Axel; Schubart, Christoph D; Haszprunar, Gerhard
2017-11-01
Mayflies, stoneflies and caddisflies (Ephemeroptera, Plecoptera and Trichoptera) are prominent representatives of aquatic macroinvertebrates, commonly used as indicator organisms for water quality and ecosystem assessments. However, unambiguous morphological identification of EPT species, especially their immature life stages, is a challenging, yet fundamental task. A comprehensive DNA barcode library based upon taxonomically well-curated specimens is needed to overcome the problematic identification. Once available, this library will support the implementation of fast, cost-efficient and reliable DNA-based identifications and assessments of ecological status. This study represents a major step towards a DNA barcode reference library as it covers for two-thirds of Germany's EPT species including 2,613 individuals belonging to 363 identified species. As such, it provides coverage for 38 of 44 families (86%) and practically all major bioindicator species. DNA barcode compliant sequences (≥500 bp) were recovered from 98.74% of the analysed specimens. Whereas most species (325, i.e., 89.53%) were unambiguously assigned to a single Barcode Index Number (BIN) by its COI sequence, 38 species (18 Ephemeroptera, nine Plecoptera and 11 Trichoptera) were assigned to a total of 89 BINs. Most of these additional BINs formed nearest neighbour clusters, reflecting the discrimination of geographical subclades of a currently recognized species. BIN sharing was uncommon, involving only two species pairs of Ephemeroptera. Interestingly, both maximum pairwise and nearest neighbour distances were substantially higher for Ephemeroptera compared to Plecoptera and Trichoptera, possibly indicating older speciation events, stronger positive selection or faster rate of molecular evolution. © 2017 John Wiley & Sons Ltd.
Jensen, Sigmund; Fortunato, Sofia A V; Hoffmann, Friederike; Hoem, Solveig; Rapp, Hans Tore; Øvreås, Lise; Torsvik, Vigdis L
2017-04-01
During the last decades, our knowledge about the activity of sponge-associated microorganisms and their contribution to biogeochemical cycling has gradually increased. Functional groups involved in carbon and nitrogen metabolism are well documented, whereas knowledge about microorganisms involved in the sulfur cycle is still limited. Both sulfate reduction and sulfide oxidation has been detected in the cold water sponge Geodia barretti from Korsfjord in Norway, and with specimens from this site, the present study aims to identify extant versus active sponge-associated microbiota with focus on sulfur metabolism. Comparative analysis of small subunit ribosomal RNA (16S rRNA) gene (DNA) and transcript (complementary DNA (cDNA)) libraries revealed profound differences. The transcript library was predominated by Chloroflexi despite their low abundance in the gene library. An opposite result was found for Acidobacteria. Proteobacteria were detected in both libraries with representatives of the Alpha- and Gammaproteobacteria related to clades with presumably thiotrophic bacteria from sponges and other marine invertebrates. Sequences that clustered with sponge-associated Deltaproteobacteria were remotely related to cultivated sulfate-reducing bacteria. The microbes involved in sulfur cycling were identified by the functional gene aprA (adenosine-5'-phosphosulfate reductase) and its transcript. Of the aprA sequences (DNA and cDNA), 87 % affiliated with sulfur-oxidizing bacteria. They clustered with Alphaproteobacteria and with clades of deep-branching Gammaproteobacteria. The remaining sequences clustered with sulfate-reducing Archaea of the phylum Euryarchaeota. These results indicate an active role of yet uncharacterized Bacteria and Archaea in the sponge's sulfur cycle.
Amplification of chromosomal DNA in situ
Christian, Allen T.; Coleman, Matthew A.; Tucker, James D.
2002-01-01
Amplification of chromosomal DNA in situ to increase the amount of DNA associated with a chromosome or chromosome region is described. The amplification of chromosomal DNA in situ provides for the synthesis of Fluorescence in situ Hybridization (FISH) painting probes from single dissected chromosome fragments, the production of cDNA libraries from low copy mRNAs and improved in Comparative Genomic Hybridization (CGH) procedures.
Extending the spectrum of DNA sequences retrieved from ancient bones and teeth
Glocke, Isabelle; Meyer, Matthias
2017-01-01
The number of DNA fragments surviving in ancient bones and teeth is known to decrease with fragment length. Recent genetic analyses of Middle Pleistocene remains have shown that the recovery of extremely short fragments can prove critical for successful retrieval of sequence information from particularly degraded ancient biological material. Current sample preparation techniques, however, are not optimized to recover DNA sequences from fragments shorter than ∼35 base pairs (bp). Here, we show that much shorter DNA fragments are present in ancient skeletal remains but lost during DNA extraction. We present a refined silica-based DNA extraction method that not only enables efficient recovery of molecules as short as 25 bp but also doubles the yield of sequences from longer fragments due to improved recovery of molecules with single-strand breaks. Furthermore, we present strategies for monitoring inefficiencies in library preparation that may result from co-extraction of inhibitory substances during DNA extraction. The combination of DNA extraction and library preparation techniques described here substantially increases the yield of DNA sequences from ancient remains and provides access to a yet unexploited source of highly degraded DNA fragments. Our work may thus open the door for genetic analyses on even older material. PMID:28408382
Shinozuka, Hiroshi; Cogan, Noel O I; Shinozuka, Maiko; Marshall, Alexis; Kay, Pippa; Lin, Yi-Han; Spangenberg, German C; Forster, John W
2015-04-11
Fragmentation at random nucleotide locations is an essential process for preparation of DNA libraries to be used on massively parallel short-read DNA sequencing platforms. Although instruments for physical shearing, such as the Covaris S2 focused-ultrasonicator system, and products for enzymatic shearing, such as the Nextera technology and NEBNext dsDNA Fragmentase kit, are commercially available, a simple and inexpensive method is desirable for high-throughput sequencing library preparation. MspJI is a recently characterised restriction enzyme which recognises the sequence motif CNNR (where R = G or A) when the first base is modified to 5-methylcytosine or 5-hydroxymethylcytosine. A semi-random enzymatic DNA amplicon fragmentation method was developed based on the unique cleavage properties of MspJI. In this method, random incorporation of 5-methyl-2'-deoxycytidine-5'-triphosphate is achieved through DNA amplification with DNA polymerase, followed by DNA digestion with MspJI. Due to the recognition sequence of the enzyme, DNA amplicons are fragmented in a relatively sequence-independent manner. The size range of the resulting fragments was capable of control through optimisation of 5-methyl-2'-deoxycytidine-5'-triphosphate concentration in the reaction mixture. A library suitable for sequencing using the Illumina MiSeq platform was prepared and processed using the proposed method. Alignment of generated short reads to a reference sequence demonstrated a relatively high level of random fragmentation. The proposed method may be performed with standard laboratory equipment. Although the uniformity of coverage was slightly inferior to the Covaris physical shearing procedure, due to efficiencies of cost and labour, the method may be more suitable than existing approaches for implementation in large-scale sequencing activities, such as bacterial artificial chromosome (BAC)-based genome sequence assembly, pan-genomic studies and locus-targeted genotyping-by-sequencing.
Zhang, Wei Yun; Zhang, Wenhua; Liu, Zhiyuan; Li, Cong; Zhu, Zhi; Yang, Chaoyong James
2012-01-03
We have developed a novel method for efficiently screening affinity ligands (aptamers) from a complex single-stranded DNA (ssDNA) library by employing single-molecule emulsion polymerase chain reaction (PCR) based on the agarose droplet microfluidic technology. In a typical systematic evolution of ligands by exponential enrichment (SELEX) process, the enriched library is sequenced first, and tens to hundreds of aptamer candidates are analyzed via a bioinformatic approach. Possible candidates are then chemically synthesized, and their binding affinities are measured individually. Such a process is time-consuming, labor-intensive, inefficient, and expensive. To address these problems, we have developed a highly efficient single-molecule approach for aptamer screening using our agarose droplet microfluidic technology. Statistically diluted ssDNA of the pre-enriched library evolved through conventional SELEX against cancer biomarker Shp2 protein was encapsulated into individual uniform agarose droplets for droplet PCR to generate clonal agarose beads. The binding capacity of amplified ssDNA from each clonal bead was then screened via high-throughput fluorescence cytometry. DNA clones with high binding capacity and low K(d) were chosen as the aptamer and can be directly used for downstream biomedical applications. We have identified an ssDNA aptamer that selectively recognizes Shp2 with a K(d) of 24.9 nM. Compared to a conventional sequencing-chemical synthesis-screening work flow, our approach avoids large-scale DNA sequencing and expensive, time-consuming DNA synthesis of large populations of DNA candidates. The agarose droplet microfluidic approach is thus highly efficient and cost-effective for molecular evolution approaches and will find wide application in molecular evolution technologies, including mRNA display, phage display, and so on. © 2011 American Chemical Society
From the Cover: A polymer library approach to suicide gene therapy for cancer
NASA Astrophysics Data System (ADS)
Anderson, Daniel G.; Peng, Weidan; Akinc, Akin; Hossain, Naushad; Kohn, Anat; Padera, Robert; Langer, Robert; Sawicki, Janet A.
2004-11-01
Optimal gene therapy for cancer must (i) deliver DNA to tumor cells with high efficiency, (ii) induce minimal toxicity, and (iii) avoid gene expression in healthy tissues. To this end, we generated a library of >500 degradable, poly(-amino esters) for potential use as nonviral DNA vectors. Using high-throughput methods, we screened this library in vitro for transfection efficiency and cytotoxicity. We tested the best performing polymer, C32, in mice for toxicity and DNA delivery after intratumor and i.m. injection. C32 delivered DNA intratumorally 4-fold better than one of the best commercially available reagents, jetPEI (polyethyleneimine), and 26-fold better than naked DNA. Conversely, the highest transfection levels after i.m. administration were achieved with naked DNA, followed by polyethyleneimine; transfection was rarely observed with C32. Additionally, polyethyleneimine induced significant local toxicity after i.m. injection, whereas C32 demonstrated no toxicity. Finally, we used C32 to deliver a DNA construct encoding the A chain of diphtheria toxin (DT-A) to xenografts derived from LNCaP human prostate cancer cells. This construct regulates toxin expression both at the transcriptional level by the use of a chimeric-modified enhancer/promoter sequence of the human prostate-specific antigen gene and by DNA recombination mediated by Flp recombinase. C32 delivery of the A chain of diphtheria toxin DNA to LNCaP xenografts suppressed tumor growth and even caused 40% of tumors to regress in size. Because C32 transfects tumors locally at high levels, transfects healthy muscle poorly, and displays no toxicity, it may provide a vehicle for the local treatment of cancer. prostate | cationic polymers
BASIC: A Simple and Accurate Modular DNA Assembly Method.
Storch, Marko; Casini, Arturo; Mackrow, Ben; Ellis, Tom; Baldwin, Geoff S
2017-01-01
Biopart Assembly Standard for Idempotent Cloning (BASIC) is a simple, accurate, and robust DNA assembly method. The method is based on linker-mediated DNA assembly and provides highly accurate DNA assembly with 99 % correct assemblies for four parts and 90 % correct assemblies for seven parts [1]. The BASIC standard defines a single entry vector for all parts flanked by the same prefix and suffix sequences and its idempotent nature means that the assembled construct is returned in the same format. Once a part has been adapted into the BASIC format it can be placed at any position within a BASIC assembly without the need for reformatting. This allows laboratories to grow comprehensive and universal part libraries and to share them efficiently. The modularity within the BASIC framework is further extended by the possibility of encoding ribosomal binding sites (RBS) and peptide linker sequences directly on the linkers used for assembly. This makes BASIC a highly versatile library construction method for combinatorial part assembly including the construction of promoter, RBS, gene variant, and protein-tag libraries. In comparison with other DNA assembly standards and methods, BASIC offers a simple robust protocol; it relies on a single entry vector, provides for easy hierarchical assembly, and is highly accurate for up to seven parts per assembly round [2].
Dorraj, Ghamar Soltan; Rassaee, Mohammad Javad; Latifi, Ali Mohammad; Pishgoo, Bahram; Tavallaei, Mahmood
2015-08-20
Troponin T and I are ideal markers which are highly sensitive and specific for myocardial injury and have shown better efficacy than earlier markers. Since aptamers are ssDNA or RNA that bind to a wide variety of target molecules, the purpose of this research was to select an aptamer from a 79bp single-stranded DNA (ssDNA) random library that was used to bind the Human Cardiac Troponin I from a synthetic nucleic acids library by systematic evolution of ligands exponential enrichment (Selex) based on several selection and amplification steps. Human Cardiac Troponin I protein was coated onto the surface of streptavidin magnetic beads to extract specific aptamer from a large and diverse random ssDNA initial oligonucleotide library. As a result, several aptamers were selected and further examined for binding affinity and specificity. Finally TnIApt 23 showed beast affinity in nanomolar range (2.69nM) toward the target protein. A simple and rapid colorimetric detection assay for Human Cardiac Troponin I using the novel and specific aptamer-AuNPs conjugates based on dot blot assay was developed. The detection limit for this protein using aptamer-AuNPs-based assay was found to be 5ng/ml. Copyright © 2015 Elsevier B.V. All rights reserved.
Nicosia, Aldo; Maggio, Teresa; Mazzola, Salvatore; Cuttitta, Angela
2013-10-30
Anemonia viridis is a widespread and extensively studied Mediterranean species of sea anemone from which a large number of polypeptide toxins, such as blood depressing substances (BDS) peptides, have been isolated. The first members of this class, BDS-1 and BDS-2, are polypeptides belonging to the β-defensin fold family and were initially described for their antihypertensive and antiviral activities. BDS-1 and BDS-2 are 43 amino acid peptides characterised by three disulfide bonds that act as neurotoxins affecting Kv3.1, Kv3.2 and Kv3.4 channel gating kinetics. In addition, BDS-1 inactivates the Nav1.7 and Nav1.3 channels. The development of a large dataset of A. viridis expressed sequence tags (ESTs) and the identification of 13 putative BDS-like cDNA sequences has attracted interest, especially as scientific and diagnostic tools. A comparison of BDS cDNA sequences showed that the untranslated regions are more conserved than the protein-coding regions. Moreover, the KA/KS ratios calculated for all pairwise comparisons showed values greater than 1, suggesting mechanisms of accelerated evolution. The structures of the BDS homologs were predicted by molecular modelling. All toxins possess similar 3D structures that consist of a triple-stranded antiparallel β-sheet and an additional small antiparallel β-sheet located downstream of the cleavage/maturation site; however, the orientation of the triple-stranded β-sheet appears to differ among the toxins. To characterise the spatial expression profile of the putative BDS cDNA sequences, tissue-specific cDNA libraries, enriched for BDS transcripts, were constructed. In addition, the proper amplification of ectodermal or endodermal markers ensured the tissue specificity of each library. Sequencing randomly selected clones from each library revealed ectodermal-specific expression of ten BDS transcripts, while transcripts of BDS-8, BDS-13, BDS-14 and BDS-15 failed to be retrieved, likely due to under-representation in our cDNA libraries. The calculation of the relative abundance of BDS transcripts in the cDNA libraries revealed that BDS-1, BDS-3, BDS-4, BDS-5 and BDS-6 are the most represented transcripts.
Cuozzo, John W; Centrella, Paolo A; Gikunju, Diana; Habeshian, Sevan; Hupp, Christopher D; Keefe, Anthony D; Sigel, Eric A; Soutter, Holly H; Thomson, Heather A; Zhang, Ying; Clark, Matthew A
2017-05-04
We have identified and characterized novel potent inhibitors of Bruton's tyrosine kinase (BTK) from a single DNA-encoded library of over 110 million compounds by using multiple parallel selection conditions, including variation in target concentration and addition of known binders to provide competition information. Distinct binding profiles were observed by comparing enrichments of library building block combinations under these conditions; one enriched only at high concentrations of BTK and was competitive with ATP, and another enriched at both high and low concentrations of BTK and was not competitive with ATP. A compound representing the latter profile showed low nanomolar potency in biochemical and cellular BTK assays. Results from kinetic mechanism of action studies were consistent with the selection profiles. Analysis of the co-crystal structure of the most potent compound demonstrated a novel binding mode that revealed a new pocket in BTK. Our results demonstrate that profile-based selection strategies using DNA-encoded libraries form the basis of a new methodology to rapidly identify small molecule inhibitors with novel binding modes to clinically relevant targets. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Mapping of Drug-like Chemical Universe with Reduced Complexity Molecular Frameworks.
Kontijevskis, Aleksejs
2017-04-24
The emergence of the DNA-encoded chemical libraries (DEL) field in the past decade has attracted the attention of the pharmaceutical industry as a powerful mechanism for the discovery of novel drug-like hits for various biological targets. Nuevolution Chemetics technology enables DNA-encoded synthesis of billions of chemically diverse drug-like small molecule compounds, and the efficient screening and optimization of these, facilitating effective identification of drug candidates at an unprecedented speed and scale. Although many approaches have been developed by the cheminformatics community for the analysis and visualization of drug-like chemical space, most of them are restricted to the analysis of a maximum of a few millions of compounds and cannot handle collections of 10 8 -10 12 compounds typical for DELs. To address this big chemical data challenge, we developed the Reduced Complexity Molecular Frameworks (RCMF) methodology as an abstract and very general way of representing chemical structures. By further introducing RCMF descriptors, we constructed a global framework map of drug-like chemical space and demonstrated how chemical space occupied by multi-million-member drug-like Chemetics DNA-encoded libraries and virtual combinatorial libraries with >10 12 members could be analyzed and mapped without a need for library enumeration. We further validate the approach by performing RCMF-based searches in a drug-like chemical universe and mapping Chemetics library selection outputs for LSD1 targets on a global framework chemical space map.
Process of labeling specific chromosomes using recombinant repetitive DNA
Moyzis, R.K.; Meyne, J.
1988-02-12
Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
Genotype Specification Language.
Wilson, Erin H; Sagawa, Shiori; Weis, James W; Schubert, Max G; Bissell, Michael; Hawthorne, Brian; Reeves, Christopher D; Dean, Jed; Platt, Darren
2016-06-17
We describe here the Genotype Specification Language (GSL), a language that facilitates the rapid design of large and complex DNA constructs used to engineer genomes. The GSL compiler implements a high-level language based on traditional genetic notation, as well as a set of low-level DNA manipulation primitives. The language allows facile incorporation of parts from a library of cloned DNA constructs and from the "natural" library of parts in fully sequenced and annotated genomes. GSL was designed to engage genetic engineers in their native language while providing a framework for higher level abstract tooling. To this end we define four language levels, Level 0 (literal DNA sequence) through Level 3, with increasing abstraction of part selection and construction paths. GSL targets an intermediate language based on DNA slices that translates efficiently into a wide range of final output formats, such as FASTA and GenBank, and includes formats that specify instructions and materials such as oligonucleotide primers to allow the physical construction of the GSL designs by individual strain engineers or an automated DNA assembly core facility.
Liew, Pauline Woanying; Jong, Bor Chyan
2008-05-01
Two culture-independent methods, namely ribosomal DNA libraries and denaturing gradient gel electrophoresis (DGGE), were adopted to examine the microbial community of a Malaysian light crude oil. In this study, both 16S and 18S rDNAs were PCR-amplified from bulk DNA of crude oil samples, cloned, and sequenced. Analyses of restriction fragment length polymorphism (RFLP) and phylogenetics clustered the 16S and 18S rDNA sequences into seven and six groups, respectively. The ribosomal DNA sequences obtained showed sequence similarity between 90 to 100% to those available in the GenBank database. The closest relatives documented for the 16S rDNAs include member species of Thermoincola and Rhodopseudomonas, whereas the closest fungal relatives include Acremonium, Ceriporiopsis, Xeromyces, Lecythophora, and Candida. Others were affiliated to uncultured bacteria and uncultured ascomycete. The 16S rDNA library demonstrated predomination by a single uncultured bacterial type by >80% relative abundance. The predomination was confirmed by DGGE analysis.
Sequence of Spider Aciniform and Piriform Silks
2001-09-19
7/98nd subtan-6/01 4. TITLE AND SUBTITLE Sequence of Spider Aciniform and Piriform Silks 5. FUNDING NUMBERS DAAD19-01-1-0569 6...aciniform glands from Argiope trifasciata were used to construct a cDNA library. The library was probed with various DNA probes based on known spider silk ...sequence in a number of other spider silks . The 5’end of the clone still appears to be repetitive sequence and thus it is unlikely to be a full-length
2017-06-01
Milestone Achieved: HRPO/ACURO Approval 6 Finished Major Task 2 CRISPR knockout/RNAseq Viral infection/prep 3-6 CRISPR KO virus library prep...finished; RNA-Seq: ~75% Cell manipulation 3-6 CRISPR KO virus infection: 50%; Single cDNA infections: finished Bioinformatics 1 CRISPR KO library...characterization 1-3 Finished Update: production of iPSC clones harboring DC mutations generated by CRISPR : Design 1 Finished Update: production of
Mills, Heath J.; Martinez, Robert J.; Story, Sandra; Sobecky, Patricia A.
2005-01-01
The characterization of microbial assemblages within solid gas hydrate, especially those that may be physiologically active under in situ hydrate conditions, is essential to gain a better understanding of the effects and contributions of microbial activities in Gulf of Mexico (GoM) hydrate ecosystems. In this study, the composition of the Bacteria and Archaea communities was determined by 16S rRNA phylogenetic analyses of clone libraries derived from RNA and DNA extracted from sediment-entrained hydrate (SEH) and interior hydrate (IH). The hydrate was recovered from an exposed mound located in the northern GoM continental slope with a hydrate chipper designed for use on the manned-submersible Johnson Sea Link (water depth, 550 m). Previous geochemical analyses indicated that there was increased metabolic activity in the SEH compared to the IH layer (B. N. Orcutt, A. Boetius, S. K. Lugo, I. R. Macdonald, V. A. Samarkin, and S. Joye, Chem. Geol. 205:239-251). Phylogenetic analysis of RNA- and DNA-derived clones indicated that there was greater diversity in the SEH libraries than in the IH libraries. A majority of the clones obtained from the metabolically active fraction of the microbial community were most closely related to putative sulfate-reducing bacteria and anaerobic methane-oxidizing archaea. Several novel bacterial and archaeal phylotypes for which there were no previously identified closely related cultured isolates were detected in the RNA- and DNA-derived clone libraries. This study was the first phylogenetic analysis of the metabolically active fraction of the microbial community extant in the distinct SEH and IH layers of GoM gas hydrate. PMID:15933026
Liu, Jie; Milne, Richard I; Möller, Michael; Zhu, Guang-Fu; Ye, Lin-Jiang; Luo, Ya-Huang; Yang, Jun-Bo; Wambulwa, Moses C; Wang, Chun-Neng; Li, De-Zhu; Gao, Lian-Ming
2018-05-22
Rapid and accurate identification of endangered species is a critical component of biosurveillance and conservation management, and potentially policing illegal trades. However, this is often not possible using traditional taxonomy, especially where only small or preprocessed parts of plants are available. Reliable identification can be achieved via a comprehensive DNA barcode reference library, accompanied by precise distribution data. However, these require extensive sampling at spatial and taxonomic scales, which has rarely been achieved for cosmopolitan taxa. Here, we construct a comprehensive DNA barcode reference library and generate distribution maps using species distribution modelling (SDM), for all 15 Taxus species worldwide. We find that trnL-trnF is the ideal barcode for Taxus: It can distinguish all Taxus species and in combination with ITS identify hybrids. Among five analysis methods tested, NJ was the most effective. Among 4,151 individuals screened for trnL-trnF, 73 haplotypes were detected, all species-specific and some population private. Taxonomical, geographical and genetic dimensions of sampling strategy were all found to affect the comprehensiveness of the resulting DNA barcode library. Maps from SDM showed that most species had allopatric distributions, except T. mairei in the Sino-Himalayan region. Using the barcode library and distribution map data, two unknown forensic samples were identified to species (and in one case, population) level and another was determined as a putative interspecific hybrid. This integrated species identification system for Taxus can be used for biosurveillance, conservation management and to monitor and prosecute illegal trade. Similar identification systems are recommended for other IUCN- and CITES-listed taxa. © 2018 John Wiley & Sons Ltd.
Ramond, J-B; Makhalanyane, T P; Tuffin, M I; Cowan, D A
2015-04-01
Normalization is a procedure classically employed to detect rare sequences in cellular expression profiles (i.e. cDNA libraries). Here, we present a normalization protocol involving the direct treatment of extracted environmental metagenomic DNA with S1 nuclease, referred to as normalization of metagenomic DNA: NmDNA. We demonstrate that NmDNA, prior to post hoc PCR-based experiments (16S rRNA gene T-RFLP fingerprinting and clone library), increased the diversity of sequences retrieved from environmental microbial communities by detection of rarer sequences. This approach could be used to enhance the resolution of detection of ecologically relevant rare members in environmental microbial assemblages and therefore is promising in enabling a better understanding of ecosystem functioning. This study is the first testing 'normalization' on environmental metagenomic DNA (mDNA). The aim of this procedure was to improve the identification of rare phylotypes in environmental communities. Using hypoliths as model systems, we present evidence that this post-mDNA extraction molecular procedure substantially enhances the detection of less common phylotypes and could even lead to the discovery of novel microbial genotypes within a given environment. © 2014 The Society for Applied Microbiology.
PNA-encoded chemical libraries.
Zambaldo, Claudio; Barluenga, Sofia; Winssinger, Nicolas
2015-06-01
Peptide nucleic acid (PNA)-encoded chemical libraries along with DNA-encoded libraries have provided a powerful new paradigm for library synthesis and ligand discovery. PNA-encoding stands out for its compatibility with standard solid phase synthesis and the technology has been used to prepare libraries of peptides, heterocycles and glycoconjugates. Different screening formats have now been reported including selection-based and microarray-based methods that have yielded specific ligands against diverse target classes including membrane receptors, lectins and challenging targets such as Hsp70. Copyright © 2015 Elsevier Ltd. All rights reserved.
Digitally encoded DNA nanostructures for multiplexed, single-molecule protein sensing with nanopores
NASA Astrophysics Data System (ADS)
Bell, Nicholas A. W.; Keyser, Ulrich F.
2016-07-01
The simultaneous detection of a large number of different analytes is important in bionanotechnology research and in diagnostic applications. Nanopore sensing is an attractive method in this regard as the approach can be integrated into small, portable device architectures, and there is significant potential for detecting multiple sub-populations in a sample. Here, we show that highly multiplexed sensing of single molecules can be achieved with solid-state nanopores by using digitally encoded DNA nanostructures. Based on the principles of DNA origami, we designed a library of DNA nanostructures in which each member contains a unique barcode; each bit in the barcode is signalled by the presence or absence of multiple DNA dumbbell hairpins. We show that a 3-bit barcode can be assigned with 94% accuracy by electrophoretically driving the DNA structures through a solid-state nanopore. Select members of the library were then functionalized to detect a single, specific antibody through antigen presentation at designed positions on the DNA. This allows us to simultaneously detect four different antibodies of the same isotype at nanomolar concentration levels.
Bell, Nicholas A W; Keyser, Ulrich F
2016-07-01
The simultaneous detection of a large number of different analytes is important in bionanotechnology research and in diagnostic applications. Nanopore sensing is an attractive method in this regard as the approach can be integrated into small, portable device architectures, and there is significant potential for detecting multiple sub-populations in a sample. Here, we show that highly multiplexed sensing of single molecules can be achieved with solid-state nanopores by using digitally encoded DNA nanostructures. Based on the principles of DNA origami, we designed a library of DNA nanostructures in which each member contains a unique barcode; each bit in the barcode is signalled by the presence or absence of multiple DNA dumbbell hairpins. We show that a 3-bit barcode can be assigned with 94% accuracy by electrophoretically driving the DNA structures through a solid-state nanopore. Select members of the library were then functionalized to detect a single, specific antibody through antigen presentation at designed positions on the DNA. This allows us to simultaneously detect four different antibodies of the same isotype at nanomolar concentration levels.
Design of 240,000 orthogonal 25mer DNA barcode probes.
Xu, Qikai; Schlabach, Michael R; Hannon, Gregory J; Elledge, Stephen J
2009-02-17
DNA barcodes linked to genetic features greatly facilitate screening these features in pooled formats using microarray hybridization, and new tools are needed to design large sets of barcodes to allow construction of large barcoded mammalian libraries such as shRNA libraries. Here we report a framework for designing large sets of orthogonal barcode probes. We demonstrate the utility of this framework by designing 240,000 barcode probes and testing their performance by hybridization. From the test hybridizations, we also discovered new probe design rules that significantly reduce cross-hybridization after their introduction into the framework of the algorithm. These rules should improve the performance of DNA microarray probe designs for many applications.
Design of 240,000 orthogonal 25mer DNA barcode probes
Xu, Qikai; Schlabach, Michael R.; Hannon, Gregory J.; Elledge, Stephen J.
2009-01-01
DNA barcodes linked to genetic features greatly facilitate screening these features in pooled formats using microarray hybridization, and new tools are needed to design large sets of barcodes to allow construction of large barcoded mammalian libraries such as shRNA libraries. Here we report a framework for designing large sets of orthogonal barcode probes. We demonstrate the utility of this framework by designing 240,000 barcode probes and testing their performance by hybridization. From the test hybridizations, we also discovered new probe design rules that significantly reduce cross-hybridization after their introduction into the framework of the algorithm. These rules should improve the performance of DNA microarray probe designs for many applications. PMID:19171886
Hussey, Richard S; Huang, Guozhong; Allen, Rex
2011-01-01
Identifying parasitism genes encoding proteins secreted from a plant-parasitic nematode's esophageal gland cells and injected through its stylet into plant tissue is the key to understanding the molecular basis of nematode parasitism of plants. Parasitism genes have been cloned by directly microaspirating the cytoplasm from the esophageal gland cells of different parasitic stages of cyst or root-knot nematodes to provide mRNA to create a gland cell-specific cDNA library by long-distance reverse-transcriptase polymerase chain reaction. cDNA clones are sequenced and deduced protein sequences with a signal peptide for secretion are identified for high-throughput in situ hybridization to confirm gland-specific expression.
Cloning of a Gene Whose Expression is Increased in Scrapie and in Senile Plaques in Human Brain
NASA Astrophysics Data System (ADS)
Wietgrefe, S.; Zupancic, M.; Haase, A.; Chesebro, B.; Race, R.; Frey, W.; Rustan, T.; Friedman, R. L.
1985-12-01
A complementary DNA library was constructed from messenger RNA's extracted from the brains of mice infected with the scrapie agent. The library was differentially screened with the objectives of finding clones that might be used as markers of infection and finding clones of genes whose increased expression might be correlated with the pathological changes common to scrapie and Alzheimer's disease. A gene was identified whose expression is increased in scrapie. The complementary DNA corresponding to this gene hybridized preferentially and focally to cells in the brains of scrapie-infected animals. The cloned DNA also hybridized to the neuritic plaques found with increased frequency in brains of patients with Alzheimer's disease.
Activity and bacterial diversity of snow around Russian Antarctic stations.
Lopatina, Anna; Krylenkov, Vjacheslav; Severinov, Konstantin
2013-11-01
The diversity and temporal dynamics of bacterial communities in pristine snow around two Russian Antarctic stations was investigated. Taxonomic analysis of rDNA libraries revealed that snow communities were dominated by bacteria from a small number of operational taxonomic units (OTUs) that underwent dramatic swings in abundance between the 54th (2008-2009) and 55th (2009-2010) Russian Antarctic expeditions. Moreover, analysis of the 55th expedition samples indicated that there was very little, if any, correspondence in abundance of clones belonging to the same OTU present in rDNA and rRNA libraries. The latter result suggests that most rDNA clones originate from bacteria that are not alive and/or active and may have been deposited on the snow surface from the atmosphere. In contrast, clones most abundant in rRNA libraries (mostly belonging to Variovorax, Janthinobacterium, Pseudomonas, and Sphingomonas genera) may be considered as endogenous Antarctic snow inhabitants. Copyright © 2013 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Brown, Dean G; Brown, Giles A; Centrella, Paolo; Certel, Kaan; Cooke, Robert M; Cuozzo, John W; Dekker, Niek; Dumelin, Christoph E; Ferguson, Andrew; Fiez-Vandal, Cédric; Geschwindner, Stefan; Guié, Marie-Aude; Habeshian, Sevan; Keefe, Anthony D; Schlenker, Oliver; Sigel, Eric A; Snijder, Arjan; Soutter, Holly T; Sundström, Linda; Troast, Dawn M; Wiggin, Giselle; Zhang, Jing; Zhang, Ying; Clark, Matthew A
2018-06-01
The discovery of ligands via affinity-mediated selection of DNA-encoded chemical libraries is driven by the quality and concentration of the protein target. G-protein-coupled receptors (GPCRs) and other membrane-bound targets can be difficult to isolate in their functional state and at high concentrations, and therefore have been challenging for affinity-mediated selection. Here, we report a successful selection campaign against protease-activated receptor 2 (PAR2). Using a thermo-stabilized mutant of PAR2, we conducted affinity selection using our >100-billion-compound DNA-encoded library. We observed a number of putative ligands enriched upon selection, and subsequent cellular profiling revealed these ligands to comprise both agonists and antagonists. The agonist series shared structural similarity with known agonists. The antagonists were shown to bind in a novel allosteric binding site on the PAR2 protein. This report serves to demonstrate that cell-free affinity selection against GPCRs can be achieved with mutant stabilized protein targets.
Calibrating genomic and allelic coverage bias in single-cell sequencing.
Zhang, Cheng-Zhong; Adalsteinsson, Viktor A; Francis, Joshua; Cornils, Hauke; Jung, Joonil; Maire, Cecile; Ligon, Keith L; Meyerson, Matthew; Love, J Christopher
2015-04-16
Artifacts introduced in whole-genome amplification (WGA) make it difficult to derive accurate genomic information from single-cell genomes and require different analytical strategies from bulk genome analysis. Here, we describe statistical methods to quantitatively assess the amplification bias resulting from whole-genome amplification of single-cell genomic DNA. Analysis of single-cell DNA libraries generated by different technologies revealed universal features of the genome coverage bias predominantly generated at the amplicon level (1-10 kb). The magnitude of coverage bias can be accurately calibrated from low-pass sequencing (∼0.1 × ) to predict the depth-of-coverage yield of single-cell DNA libraries sequenced at arbitrary depths. We further provide a benchmark comparison of single-cell libraries generated by multi-strand displacement amplification (MDA) and multiple annealing and looping-based amplification cycles (MALBAC). Finally, we develop statistical models to calibrate allelic bias in single-cell whole-genome amplification and demonstrate a census-based strategy for efficient and accurate variant detection from low-input biopsy samples.
Using single nuclei for RNA-seq to capture the transcriptome of postmortem neurons
Krishnaswami, Suguna Rani; Grindberg, Rashel V; Novotny, Mark; Venepally, Pratap; Lacar, Benjamin; Bhutani, Kunal; Linker, Sara B; Pham, Son; Erwin, Jennifer A; Miller, Jeremy A; Hodge, Rebecca; McCarthy, James K; Kelder, Martin; McCorrison, Jamison; Aevermann, Brian D; Fuertes, Francisco Diez; Scheuermann, Richard H; Lee, Jun; Lein, Ed S; Schork, Nicholas; McConnell, Michael J; Gage, Fred H; Lasken, Roger S
2016-01-01
A protocol is described for sequencing the transcriptome of a cell nucleus. Nuclei are isolated from specimens and sorted by FACS, cDNA libraries are constructed and RNA-seq is performed, followed by data analysis. Some steps follow published methods (Smart-seq2 for cDNA synthesis and Nextera XT barcoded library preparation) and are not described in detail here. Previous single-cell approaches for RNA-seq from tissues include cell dissociation using protease treatment at 30 °C, which is known to alter the transcriptome. We isolate nuclei at 4 °C from tissue homogenates, which cause minimal damage. Nuclear transcriptomes can be obtained from postmortem human brain tissue stored at −80 °C, making brain archives accessible for RNA-seq from individual neurons. The method also allows investigation of biological features unique to nuclei, such as enrichment of certain transcripts and precursors of some noncoding RNAs. By following this procedure, it takes about 4 d to construct cDNA libraries that are ready for sequencing. PMID:26890679
Calibrating genomic and allelic coverage bias in single-cell sequencing
Francis, Joshua; Cornils, Hauke; Jung, Joonil; Maire, Cecile; Ligon, Keith L.; Meyerson, Matthew; Love, J. Christopher
2016-01-01
Artifacts introduced in whole-genome amplification (WGA) make it difficult to derive accurate genomic information from single-cell genomes and require different analytical strategies from bulk genome analysis. Here, we describe statistical methods to quantitatively assess the amplification bias resulting from whole-genome amplification of single-cell genomic DNA. Analysis of single-cell DNA libraries generated by different technologies revealed universal features of the genome coverage bias predominantly generated at the amplicon level (1–10 kb). The magnitude of coverage bias can be accurately calibrated from low-pass sequencing (~0.1 ×) to predict the depth-of-coverage yield of single-cell DNA libraries sequenced at arbitrary depths. We further provide a benchmark comparison of single-cell libraries generated by multi-strand displacement amplification (MDA) and multiple annealing and looping-based amplification cycles (MALBAC). Finally, we develop statistical models to calibrate allelic bias in single-cell whole-genome amplification and demonstrate a census-based strategy for efficient and accurate variant detection from low-input biopsy samples. PMID:25879913
Liu, Guan-Jun; Liu, Ming-Kun; Xu, Zhi-Ru; Yan, Xiu-Feng; Wei, Zhi-Gang; Yang, Chuan-Ping
2009-04-01
Using cDNAs prepared from the leaves and stems of Polygonum sibiricum Laxm. treated with NaHCO3 stress for 48 h as testers and cDNAs from unstressed P. sibiricum leaves and stems as drivers library, suppression subtractive hybridization (SSH) was employed to construct a cDNA subtracted library, which contained 2 282 valid sequences including 598 ESTs in the stems forward SSH library and 490 ESTs in the stem reverse SSH library, 627 ESTs in the leaf forward SSH library and 567 in the leaf reverse SSH library. According to the functional catalogue of MIPs and the comparison of the reverse and forward SSH libraries of the stem and leaf, the responses to NaHCO3 stress were different between leaf and stem, except for the same trend in cell rescue defense and transport facilitation. The trend in the metabolism, energy, photosynthesis, protein synthesis, transcription, and signal transduction was opposite. RT-PCR analysis demonstrated that the expression of 12 putative stress related genes in the NaHCO3-treated leaves and stems was different from that in the untreated leaves and stems. This indicated that different mechanisms might be responsible for reactions of leaf and stem in P. sibiricum. The results from this study are useful in understanding the molecular mechanism of saline-alkali tolerance in P. sibiricum.
Pratt, Lee H.; Liang, Chun; Shah, Manish; Sun, Feng; Wang, Haiming; Reid, St. Patrick; Gingle, Alan R.; Paterson, Andrew H.; Wing, Rod; Dean, Ralph; Klein, Robert; Nguyen, Henry T.; Ma, Hong-mei; Zhao, Xin; Morishige, Daryl T.; Mullet, John E.; Cordonnier-Pratt, Marie-Michèle
2005-01-01
Improved knowledge of the sorghum transcriptome will enhance basic understanding of how plants respond to stresses and serve as a source of genes of value to agriculture. Toward this goal, Sorghum bicolor L. Moench cDNA libraries were prepared from light- and dark-grown seedlings, drought-stressed plants, Colletotrichum-infected seedlings and plants, ovaries, embryos, and immature panicles. Other libraries were prepared with meristems from Sorghum propinquum (Kunth) Hitchc. that had been photoperiodically induced to flower, and with rhizomes from S. propinquum and johnsongrass (Sorghum halepense L. Pers.). A total of 117,682 expressed sequence tags (ESTs) were obtained representing both 3′ and 5′ sequences from about half that number of cDNA clones. A total of 16,801 unique transcripts, representing tentative UniScripts (TUs), were identified from 55,783 3′ ESTs. Of these TUs, 9,032 are represented by two or more ESTs. Collectively, these libraries were predicted to contain a total of approximately 31,000 TUs. Individual libraries, however, were predicted to contain no more than about 6,000 to 9,000, with the exception of light-grown seedlings, which yielded an estimate of close to 13,000. In addition, each library exhibits about the same level of complexity with respect to both the number of TUs preferentially expressed in that library and the frequency with which two or more ESTs is found in only that library. These results indicate that the sorghum genome is expressed in highly selective fashion in the individual organs and in response to the environmental conditions surveyed here. Close to 2,000 differentially expressed TUs were identified among the cDNA libraries examined, of which 775 were differentially expressed at a confidence level of 98%. From these 775 TUs, signature genes were identified defining drought, Colletotrichum infection, skotomorphogenesis (etiolation), ovary, immature panicle, and embryo. PMID:16169961
Technical Considerations for Reduced Representation Bisulfite Sequencing with Multiplexed Libraries
Chatterjee, Aniruddha; Rodger, Euan J.; Stockwell, Peter A.; Weeks, Robert J.; Morison, Ian M.
2012-01-01
Reduced representation bisulfite sequencing (RRBS), which couples bisulfite conversion and next generation sequencing, is an innovative method that specifically enriches genomic regions with a high density of potential methylation sites and enables investigation of DNA methylation at single-nucleotide resolution. Recent advances in the Illumina DNA sample preparation protocol and sequencing technology have vastly improved sequencing throughput capacity. Although the new Illumina technology is now widely used, the unique challenges associated with multiplexed RRBS libraries on this platform have not been previously described. We have made modifications to the RRBS library preparation protocol to sequence multiplexed libraries on a single flow cell lane of the Illumina HiSeq 2000. Furthermore, our analysis incorporates a bioinformatics pipeline specifically designed to process bisulfite-converted sequencing reads and evaluate the output and quality of the sequencing data generated from the multiplexed libraries. We obtained an average of 42 million paired-end reads per sample for each flow-cell lane, with a high unique mapping efficiency to the reference human genome. Here we provide a roadmap of modifications, strategies, and trouble shooting approaches we implemented to optimize sequencing of multiplexed libraries on an a RRBS background. PMID:23193365
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leana-Cox, J.; Wulfsberg, E.; Raffel, L.J.
Fluorescence in situ hybridization (FISH) with chromosome-specific DNA libraries was performed on samples from eight patients with de novo chromosomal duplications. In five cases, the clinical phenotype and/or cytogenetic evaluations suggested a likely origin of the duplicated material. In the remaining three cases, careful examination of the GTG-banding pattern indicated multiple possible origins; hybridization with more than one chromosome-specific library was performed on two of these cases. In all cases, FISH conclusively identified the chromosomal origin of the duplicated material. In addition, the hybridization pattern was useful in quantitatively delineating the duplication in two cases. 21 refs., 2 figs., 1more » tab.« less
Assembling and auditing a comprehensive DNA barcode reference library for European marine fishes.
Oliveira, L M; Knebelsberger, T; Landi, M; Soares, P; Raupach, M J; Costa, F O
2016-12-01
A large-scale comprehensive reference library of DNA barcodes for European marine fishes was assembled, allowing the evaluation of taxonomic uncertainties and species genetic diversity that were otherwise hidden in geographically restricted studies. A total of 4118 DNA barcodes were assigned to 358 species generating 366 Barcode Index Numbers (BIN). Initial examination revealed as much as 141 BIN discordances (more than one species in each BIN). After implementing an auditing and five-grade (A-E) annotation protocol, the number of discordant species BINs was reduced to 44 (13% grade E), while concordant species BINs amounted to 271 (78% grades A and B) and 14 other had insufficient data (grade D). Fifteen species displayed comparatively high intraspecific divergences ranging from 2·6 to 18·5% (grade C), which is biologically paramount information to be considered in fish species monitoring and stock assessment. On balance, this compilation contributed to the detection of 59 European fish species probably in need of taxonomic clarification or re-evaluation. The generalized implementation of an auditing and annotation protocol for reference libraries of DNA barcodes is recommended. © 2016 The Fisheries Society of the British Isles.
Separation and parallel sequencing of the genomes and transcriptomes of single cells using G&T-seq.
Macaulay, Iain C; Teng, Mabel J; Haerty, Wilfried; Kumar, Parveen; Ponting, Chris P; Voet, Thierry
2016-11-01
Parallel sequencing of a single cell's genome and transcriptome provides a powerful tool for dissecting genetic variation and its relationship with gene expression. Here we present a detailed protocol for G&T-seq, a method for separation and parallel sequencing of genomic DNA and full-length polyA(+) mRNA from single cells. We provide step-by-step instructions for the isolation and lysis of single cells; the physical separation of polyA(+) mRNA from genomic DNA using a modified oligo-dT bead capture and the respective whole-transcriptome and whole-genome amplifications; and library preparation and sequence analyses of these amplification products. The method allows the detection of thousands of transcripts in parallel with the genetic variants captured by the DNA-seq data from the same single cell. G&T-seq differs from other currently available methods for parallel DNA and RNA sequencing from single cells, as it involves physical separation of the DNA and RNA and does not require bespoke microfluidics platforms. The process can be implemented manually or through automation. When performed manually, paired genome and transcriptome sequencing libraries from eight single cells can be produced in ∼3 d by researchers experienced in molecular laboratory work. For users with experience in the programming and operation of liquid-handling robots, paired DNA and RNA libraries from 96 single cells can be produced in the same time frame. Sequence analysis and integration of single-cell G&T-seq DNA and RNA data requires a high level of bioinformatics expertise and familiarity with a wide range of informatics tools.
Boeneman, Kelly; Fossum, Solveig; Yang, Yanhua; Fingland, Nicholas; Skarstad, Kirsten; Crooke, Elliott
2009-05-01
DnaA initiates chromosomal replication in Escherichia coli at a well-regulated time in the cell cycle. To determine how the spatial distribution of DnaA is related to the location of chromosomal replication and other cell cycle events, the localization of DnaA in living cells was visualized by confocal fluorescence microscopy. The gfp gene was randomly inserted into a dnaA-bearing plasmid via in vitro transposition to create a library that included internally GFP-tagged DnaA proteins. The library was screened for the ability to rescue dnaA(ts) mutants, and a candidate gfp-dnaA was used to replace the dnaA gene of wild-type cells. The resulting cells produce close to physiological levels of GFP-DnaA from the endogenous promoter as their only source of DnaA and somewhat under-initiate replication with moderate asynchrony. Visualization of GFP-tagged DnaA in living cells revealed that DnaA adopts a helical pattern that spirals along the long axis of the cell, a pattern also seen in wild-type cells by immunofluorescence with affinity purified anti-DnaA antibody. Although the DnaA helices closely resemble the helices of the actin analogue MreB, co-visualization of GFP-tagged DnaA and RFP-tagged MreB demonstrates that DnaA and MreB adopt discrete helical structures along the length of the longitudinal cell axis.
Wolbachia and DNA barcoding insects: patterns, potential, and problems.
Smith, M Alex; Bertrand, Claudia; Crosby, Kate; Eveleigh, Eldon S; Fernandez-Triana, Jose; Fisher, Brian L; Gibbs, Jason; Hajibabaei, Mehrdad; Hallwachs, Winnie; Hind, Katharine; Hrcek, Jan; Huang, Da-Wei; Janda, Milan; Janzen, Daniel H; Li, Yanwei; Miller, Scott E; Packer, Laurence; Quicke, Donald; Ratnasingham, Sujeevan; Rodriguez, Josephine; Rougerie, Rodolphe; Shaw, Mark R; Sheffield, Cory; Stahlhut, Julie K; Steinke, Dirk; Whitfield, James; Wood, Monty; Zhou, Xin
2012-01-01
Wolbachia is a genus of bacterial endosymbionts that impacts the breeding systems of their hosts. Wolbachia can confuse the patterns of mitochondrial variation, including DNA barcodes, because it influences the pathways through which mitochondria are inherited. We examined the extent to which these endosymbionts are detected in routine DNA barcoding, assessed their impact upon the insect sequence divergence and identification accuracy, and considered the variation present in Wolbachia COI. Using both standard PCR assays (Wolbachia surface coding protein--wsp), and bacterial COI fragments we found evidence of Wolbachia in insect total genomic extracts created for DNA barcoding library construction. When >2 million insect COI trace files were examined on the Barcode of Life Datasystem (BOLD) Wolbachia COI was present in 0.16% of the cases. It is possible to generate Wolbachia COI using standard insect primers; however, that amplicon was never confused with the COI of the host. Wolbachia alleles recovered were predominantly Supergroup A and were broadly distributed geographically and phylogenetically. We conclude that the presence of the Wolbachia DNA in total genomic extracts made from insects is unlikely to compromise the accuracy of the DNA barcode library; in fact, the ability to query this DNA library (the database and the extracts) for endosymbionts is one of the ancillary benefits of such a large scale endeavor--which we provide several examples. It is our conclusion that regular assays for Wolbachia presence and type can, and should, be adopted by large scale insect barcoding initiatives. While COI is one of the five multi-locus sequence typing (MLST) genes used for categorizing Wolbachia, there is limited overlap with the eukaryotic DNA barcode region.
Loudig, Olivier; Wang, Tao; Ye, Kenny; Lin, Juan; Wang, Yihong; Ramnauth, Andrew; Liu, Christina; Stark, Azadeh; Chitale, Dhananjay; Greenlee, Robert; Multerer, Deborah; Honda, Stacey; Daida, Yihe; Spencer Feigelson, Heather; Glass, Andrew; Couch, Fergus J.; Rohan, Thomas; Ben-Dov, Iddo Z.
2017-01-01
Formalin-fixed paraffin-embedded (FFPE) specimens, when used in conjunction with patient clinical data history, represent an invaluable resource for molecular studies of cancer. Even though nucleic acids extracted from archived FFPE tissues are degraded, their molecular analysis has become possible. In this study, we optimized a laboratory-based next-generation sequencing barcoded cDNA library preparation protocol for analysis of small RNAs recovered from archived FFPE tissues. Using matched fresh and FFPE specimens, we evaluated the robustness and reproducibility of our optimized approach, as well as its applicability to archived clinical specimens stored for up to 35 years. We then evaluated this cDNA library preparation protocol by performing a miRNA expression analysis of archived breast ductal carcinoma in situ (DCIS) specimens, selected for their relation to the risk of subsequent breast cancer development and obtained from six different institutions. Our analyses identified six miRNAs (miR-29a, miR-221, miR-375, miR-184, miR-363, miR-455-5p) differentially expressed between DCIS lesions from women who subsequently developed an invasive breast cancer (cases) and women who did not develop invasive breast cancer within the same time interval (control). Our thorough evaluation and application of this laboratory-based miRNA sequencing analysis indicates that the preparation of small RNA cDNA libraries can reliably be performed on older, archived, clinically-classified specimens. PMID:28335433
Zaidi, Shane; Blanchard, Miran; Shim, Kevin; Ilett, Elizabeth; Rajani, Karishma; Parrish, Christopher; Boisgerault, Nicolas; Kottke, Tim; Thompson, Jill; Celis, Esteban; Pulido, Jose; Selby, Peter; Pandha, Hardev; Melcher, Alan; Harrington, Kevin; Vile, Richard
2015-05-01
We used a VSV-cDNA library to treat recurrent melanoma, identifying immunogenic antigens, allowing us to target recurrences with immunotherapy or chemotherapy. Primary B16 melanoma tumors were induced to regress by frontline therapy. Mice with recurrent tumors were treated with VSV-cDNA immunotherapy. A Th17 recall response was used to screen the VSV-cDNA library for individual viruses encoding rejection antigens, subsequently targeted using immunotherapy or chemotherapy. Recurrent tumors were effectively treated with a VSV-cDNA library using cDNA from recurrent B16 tumors. Recurrence-associated rejection antigens identified included Topoisomerase-IIα, YB-1, cdc7 kinase, and BRAF. Fourteen out of 16 recurrent tumors carried BRAF mutations (595-605 region) following frontline therapy, even though the parental B16 tumors were BRAF wild type. The emergence of mutated BRAF-containing recurrences served as an excellent target for BRAF-specific immune-(VSV-BRAF), or chemo-(PLX-4720) therapies. Successful PLX-4720 therapy of recurrent tumors was associated with the development of a broad spectrum of T-cell responses. VSV-cDNA technology can be used to identify recurrence specific antigens. Emergence of mutated BRAF may be a major effector of melanoma recurrence which could serve as a target for chemo or immune therapy. This study suggests a rationale for offering patients with initially wild-type BRAF melanomas an additional biopsy to screen for mutant BRAF upon recurrence.
Zaidi, Shane; Blanchard, Miran; Shim, Kevin; Ilett, Elizabeth; Rajani, Karishma; Parrish, Christopher; Boisgerault, Nicolas; Kottke, Tim; Thompson, Jill; Celis, Esteban; Pulido, Jose; Selby, Peter; Pandha, Hardev; Melcher, Alan; Harrington, Kevin; Vile, Richard
2015-01-01
We used a VSV-cDNA library to treat recurrent melanoma, identifying immunogenic antigens, allowing us to target recurrences with immunotherapy or chemotherapy. Primary B16 melanoma tumors were induced to regress by frontline therapy. Mice with recurrent tumors were treated with VSV-cDNA immunotherapy. A Th17 recall response was used to screen the VSV-cDNA library for individual viruses encoding rejection antigens, subsequently targeted using immunotherapy or chemotherapy. Recurrent tumors were effectively treated with a VSV-cDNA library using cDNA from recurrent B16 tumors. Recurrence-associated rejection antigens identified included Topoisomerase-IIα, YB-1, cdc7 kinase, and BRAF. Fourteen out of 16 recurrent tumors carried BRAF mutations (595–605 region) following frontline therapy, even though the parental B16 tumors were BRAF wild type. The emergence of mutated BRAF-containing recurrences served as an excellent target for BRAF-specific immune-(VSV-BRAF), or chemo-(PLX-4720) therapies. Successful PLX-4720 therapy of recurrent tumors was associated with the development of a broad spectrum of T-cell responses. VSV-cDNA technology can be used to identify recurrence specific antigens. Emergence of mutated BRAF may be a major effector of melanoma recurrence which could serve as a target for chemo or immune therapy. This study suggests a rationale for offering patients with initially wild-type BRAF melanomas an additional biopsy to screen for mutant BRAF upon recurrence. PMID:25544599
DNA-based identification of mixed-organism samples offers the potential to greatly reduce the need for resource-intensive morphological identification, which would be of value both to biotic condition assessment and non-native species early-detection monitoring. However, the abi...
Seal, S N; Hoet, R M; Raats, J M; Radic, M Z
2000-09-01
To examine anti-double-stranded DNA (anti-dsDNA) IgG autoantibodies from the bone marrow of individuals with systemic lupus erythematosus (SLE). A library of single-chain variable fragments (scFv) was constructed from SLE bone marrow complementary DNA of gamma, kappa, and lambda isotype by cloning into the pHENIX phagemid vector. The library was screened with dsDNA in solution, and 2 anti-DNA phage, DNA1 and DNA4, were isolated and their Ig V genes sequenced. Soluble scFv corresponding to DNA1 and DNA4, and their heavy (H)- and light (L)-chain recombinants, were prepared, purified, and analyzed for binding to DNA by enzyme-linked immunosorbent assay. DNA1 and DNA4 used different Ig H-chain (3-30 and 5-51, respectively) and L-chain (DPK15 and DPK22, respectively) V genes. The ratios of replacement mutations to silent mutations in DNA1 and DNA4 suggest that their V genes were selected for improved antigen binding in vivo. The recombinant between DNA4VH and DNA1VL showed the highest relative affinity for both single-stranded DNA and dsDNA. These 2 Ig subunits contained third complementarity-determining region arginines and had acquired the majority of replacement mutations. Anti-dsDNA IgG autoantibodies from the bone marrow of SLE patients exploit diverse V genes and cationic V-D-J and V-J junctions for DNA binding, and accumulate replacement mutations that enhance binding.
CORALINA: a universal method for the generation of gRNA libraries for CRISPR-based screening.
Köferle, Anna; Worf, Karolina; Breunig, Christopher; Baumann, Valentin; Herrero, Javier; Wiesbeck, Maximilian; Hutter, Lukas H; Götz, Magdalena; Fuchs, Christiane; Beck, Stephan; Stricker, Stefan H
2016-11-14
The bacterial CRISPR system is fast becoming the most popular genetic and epigenetic engineering tool due to its universal applicability and adaptability. The desire to deploy CRISPR-based methods in a large variety of species and contexts has created an urgent need for the development of easy, time- and cost-effective methods enabling large-scale screening approaches. Here we describe CORALINA (comprehensive gRNA library generation through controlled nuclease activity), a method for the generation of comprehensive gRNA libraries for CRISPR-based screens. CORALINA gRNA libraries can be derived from any source of DNA without the need of complex oligonucleotide synthesis. We show the utility of CORALINA for human and mouse genomic DNA, its reproducibility in covering the most relevant genomic features including regulatory, coding and non-coding sequences and confirm the functionality of CORALINA generated gRNAs. The simplicity and cost-effectiveness make CORALINA suitable for any experimental system. The unprecedented sequence complexities obtainable with CORALINA libraries are a necessary pre-requisite for less biased large scale genomic and epigenomic screens.
Star, Bastiaan; Nederbragt, Alexander J.; Hansen, Marianne H. S.; Skage, Morten; Gilfillan, Gregor D.; Bradbury, Ian R.; Pampoulie, Christophe; Stenseth, Nils Chr; Jakobsen, Kjetill S.; Jentoft, Sissel
2014-01-01
Degradation-specific processes and variation in laboratory protocols can bias the DNA sequence composition from samples of ancient or historic origin. Here, we identify a novel artifact in sequences from historic samples of Atlantic cod (Gadus morhua), which forms interrupted palindromes consisting of reverse complementary sequence at the 5′ and 3′-ends of sequencing reads. The palindromic sequences themselves have specific properties – the bases at the 5′-end align well to the reference genome, whereas extensive misalignments exists among the bases at the terminal 3′-end. The terminal 3′ bases are artificial extensions likely caused by the occurrence of hairpin loops in single stranded DNA (ssDNA), which can be ligated and amplified in particular library creation protocols. We propose that such hairpin loops allow the inclusion of erroneous nucleotides, specifically at the 3′-end of DNA strands, with the 5′-end of the same strand providing the template. We also find these palindromes in previously published ancient DNA (aDNA) datasets, albeit at varying and substantially lower frequencies. This artifact can negatively affect the yield of endogenous DNA in these types of samples and introduces sequence bias. PMID:24608104
Enzymatically Generated CRISPR Libraries for Genome Labeling and Screening
Lane, Andrew B.; Strzelecka, Magdalena; Ettinger, Andreas; Grenfell, Andrew W.; Wittmann, Torsten; Heald, Rebecca
2015-01-01
Summary CRISPR-based technologies have emerged as powerful tools to alter genomes and mark chromosomal loci, but an inexpensive method for generating large numbers of RNA guides for whole genome screening and labeling is lacking. Using a method that permits library construction from any source of DNA, we generated guide libraries that label repetitive loci or a single chromosomal locus in Xenopus egg extracts and show that a complex library can target the E. coli genome at high frequency. PMID:26212133
Pietras, D F; Bennett, K L; Siracusa, L D; Woodworth-Gutai, M; Chapman, V M; Gross, K W; Kane-Haas, C; Hastie, N D
1983-01-01
We report the construction of a small library of recombinant plasmids containing Mus musculus repetitive DNA inserts. The repetitive cloned fraction was derived from denatured genomic DNA by reassociation to a Cot value at which repetitive, but not unique, sequences have reannealed followed by exhaustive S1 nuclease treatment to degrade single stranded DNA. Initial characterizations of this library by colony filter hybridizations have led to the identification of a previously undetected M. musculus minor satellite as well as to clones containing M. musculus major satellite sequences. This new satellite is repeated 10-20 times less than the major satellite in the M. musculus genome. It has a repeat length of 130 nucleotides compared with the M. musculus major satellite with a repeat length of 234 nucleotides. Sequence analysis of the minor satellite has shown that it has a 29 base pair region with extensive homology to one of the major satellite repeating subunits. We also show by in situ hybridization that this minor satellite sequence is located at the centromeres and possibly the arms of at least half the M musculus chromosomes. Sequences related to the minor satellite have been found in the DNA of a related Mus species, Mus spretus, and may represent the major satellite of that species. Images PMID:6314268
Role of messenger RNA-ribosome complex in complementary DNA display.
Naimuddin, Mohammed; Ohtsuka, Isao; Kitamura, Koichiro; Kudou, Motonori; Kimura, Shinnosuke
2013-07-15
In vitro display technologies such as ribosome display and messenger RNA (mRNA)/complementary DNA (cDNA) display are powerful methods that can generate library diversities on the order of 10(10-14). However, in mRNA and cDNA display methods, the end use diversity is two orders of magnitude lower than initial diversity and is dependent on the downstream processes that act as limiting factors. We found that in our previous cDNA display protocol, the purification of protein fusions by the use of streptavidin matrices from cell-free translation mixtures had poor efficiency (∼10-15%) that seriously affected the diversity of the purified library. Here, we have investigated and optimized the protocols that provided remarkable purification efficiencies. The stalled ribosome in the mRNA-ribosome complex was found to impede this purification efficiency. Among the various conditions tested, destabilization of ribosomes by appropriate concentration of metal chelating agents in combination with an optimal temperature of 30°C were found to be crucial and effective for nearly complete isolation of protein fusions from the cell-free translation system. Thus, this protocol provided 8- to 10-fold increased efficiency of purification over the previous method and results in retaining the diversity of the library by approximately an order of magnitude-important for directed evolution. We also discuss the possible effects in the fabrication of protein chips. Copyright © 2013 Elsevier Inc. All rights reserved.
Wei, Hong-Ying; Huang, Sheng; Wang, Jiang-Yong; Gao, Fang; Jiang, Jing-Zhe
2018-03-01
The emergence and widespread use of high-throughput sequencing technologies have promoted metagenomic studies on environmental or animal samples. Library construction for metagenome sequencing and annotation of the produced sequence reads are important steps in such studies and influence the quality of metagenomic data. In this study, we collected some marine mollusk samples, such as Crassostrea hongkongensis, Chlamys farreri, and Ruditapes philippinarum, from coastal areas in South China. These samples were divided into two batches to compare two library construction methods for shellfish viral metagenome. Our analysis showed that reverse-transcribing RNA into cDNA and then amplifying it simultaneously with DNA by whole genome amplification (WGA) yielded a larger amount of DNA compared to using only WGA or WTA (whole transcriptome amplification). Moreover, higher quality libraries were obtained by agarose gel extraction rather than with AMPure bead size selection. However, the latter can also provide good results if combined with the adjustment of the filter parameters. This, together with its simplicity, makes it a viable alternative. Finally, we compared three annotation tools (BLAST, DIAMOND, and Taxonomer) and two reference databases (NCBI's NR and Uniprot's Uniref). Considering the limitations of computing resources and data transfer speed, we propose the use of DIAMOND with Uniref for annotating metagenomic short reads as its running speed can guarantee a good annotation rate. This study may serve as a useful reference for selecting methods for Shellfish viral metagenome library construction and read annotation.
Wang, Chao; Shi, Xue; Liu, Lin; Li, Haiyan; Ammiraju, Jetty S S; Kudrna, David A; Xiong, Wentao; Wang, Hao; Dai, Zhaozhao; Zheng, Yonglian; Lai, Jinsheng; Jin, Weiwei; Messing, Joachim; Bennetzen, Jeffrey L; Wing, Rod A; Luo, Meizhong
2013-11-01
Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics.
Preparation and screening of an arrayed human genomic library generated with the P1 cloning system.
Shepherd, N S; Pfrogner, B D; Coulby, J N; Ackerman, S L; Vaidyanathan, G; Sauer, R H; Balkenhol, T C; Sternberg, N
1994-01-01
We describe here the construction and initial characterization of a 3-fold coverage genomic library of the human haploid genome that was prepared using the bacteriophage P1 cloning system. The cloned DNA inserts were produced by size fractionation of a Sau3AI partial digest of high molecular weight genomic DNA isolated from primary cells of human foreskin fibroblasts. The inserts were cloned into the pAd10sacBII vector and packaged in vitro into P1 phage. These were used to generate recombinant bacterial clones, each of which was picked robotically from an agar plate into a well of a 96-well microtiter dish, grown overnight, and stored at -70 degrees C. The resulting library, designated DMPC-HFF#1 series A, consists of approximately 130,000-140,000 recombinant clones that were stored in 1500 microtiter dishes. To screen the library, clones were combined in a pooling strategy and specific loci were identified by PCR analysis. On average, the library contains two or three different clones for each locus screened. To date we have identified a total of 17 clones containing the hypoxanthine-guanine phosphoribosyltransferase, human serum albumin-human alpha-fetoprotein, p53, cyclooxygenase I, human apurinic endonuclease, beta-polymerase, and DNA ligase I genes. The cloned inserts average 80 kb in size and range from 70 to 95 kb, with one 49-kb insert and one 62-kb insert. Images PMID:8146166
Overview of hybridization and detection techniques.
Hilario, Elena
2007-01-01
A misconception regarding the sensitivity of nonradioactive methods for screening genomic DNA libraries often hinders the establishment of these environmentally friendly techniques in molecular biology laboratories. Nonradioactive probes, properly prepared and quantified, can detect DNA target molecules to the femtomole range. However, appropriate hybridization techniques and detection methods should also be adopted for an efficient use of nonradioactive techniques. Detailed descriptions of genomic library handling before and during the nonradioactive hybridization and detection are often omitted from publications. This chapter aims to fill this void by providing a collection of technical tips on hybridization and detection techniques.
Microvariation Artifacts Introduced by PCR and Cloning of Closely Related 16S rRNA Gene Sequences†
Speksnijder, Arjen G. C. L.; Kowalchuk, George A.; De Jong, Sander; Kline, Elizabeth; Stephen, John R.; Laanbroek, Hendrikus J.
2001-01-01
A defined template mixture of seven closely related 16S-rDNA clones was used in a PCR-cloning experiment to assess and track sources of artifactual sequence variation in 16S rDNA clone libraries. At least 14% of the recovered clones contained aberrations. Artifact sources were polymerase errors, a mutational hot spot, and cloning of heteroduplexes and chimeras. These data may partially explain the high degree of microheterogeneity typical of sequence clusters detected in environmental clone libraries. PMID:11133483
NASA Astrophysics Data System (ADS)
Tsao, Shih-Ming; Lai, Ji-Ching; Horng, Horng-Er; Liu, Tu-Chen; Hong, Chin-Yih
2017-04-01
Aptamers are oligonucleotides that can bind to specific target molecules. Most aptamers are generated using random libraries in the standard systematic evolution of ligands by exponential enrichment (SELEX). Each random library contains oligonucleotides with a randomized central region and two fixed primer regions at both ends. The fixed primer regions are necessary for amplifying target-bound sequences by PCR. However, these extra-sequences may cause non-specific bindings, which potentially interfere with good binding for random sequences. The Magnetic-Assisted Rapid Aptamer Selection (MARAS) is a newly developed protocol for generating single-strand DNA aptamers. No repeat selection cycle is required in the protocol. This study proposes and demonstrates a method to isolate aptamers for C-reactive proteins (CRP) from a randomized ssDNA library containing no fixed sequences at 5‧ and 3‧ termini using the MARAS platform. Furthermore, the isolated primer-free aptamer was sequenced and binding affinity for CRP was analyzed. The specificity of the obtained aptamer was validated using blind serum samples. The result was consistent with monoclonal antibody-based nephelometry analysis, which indicated that a primer-free aptamer has high specificity toward targets. MARAS is a feasible platform for efficiently generating primer-free aptamers for clinical diagnoses.
Randrianjatovo-Gbalou, Irina; Rosario, Sandrine; Sismeiro, Odile; Varet, Hugo; Legendre, Rachel; Coppée, Jean-Yves; Huteau, Valérie; Pochet, Sylvie; Delarue, Marc
2018-05-21
Nucleic acid aptamers, especially RNA, exhibit valuable advantages compared to protein therapeutics in terms of size, affinity and specificity. However, the synthesis of libraries of large random RNAs is still difficult and expensive. The engineering of polymerases able to directly generate these libraries has the potential to replace the chemical synthesis approach. Here, we start with a DNA polymerase that already displays a significant template-free nucleotidyltransferase activity, human DNA polymerase theta, and we mutate it based on the knowledge of its three-dimensional structure as well as previous mutational studies on members of the same polA family. One mutant exhibited a high tolerance towards ribonucleotides (NTPs) and displayed an efficient ribonucleotidyltransferase activity that resulted in the assembly of long RNA polymers. HPLC analysis and RNA sequencing of the products were used to quantify the incorporation of the four NTPs as a function of initial NTP concentrations and established the randomness of each generated nucleic acid sequence. The same mutant revealed a propensity to accept other modified nucleotides and to extend them in long fragments. Hence, this mutant can deliver random natural and modified RNA polymers libraries ready to use for SELEX, with custom lengths and balanced or unbalanced ratios.
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries
Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P
2008-01-01
Background Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. Results We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. Conclusion EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects. PMID:18402700
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries.
Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P
2008-04-10
Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.
The effects of variable sample biomass on comparative metagenomics.
Chafee, Meghan; Maignien, Loïs; Simmons, Sheri L
2015-07-01
Longitudinal studies that integrate samples with variable biomass are essential to understand microbial community dynamics across space or time. Shotgun metagenomics is widely used to investigate these communities at the functional level, but little is known about the effects of combining low and high biomass samples on downstream analysis. We investigated the interacting effects of DNA input and library amplification by polymerase chain reaction on comparative metagenomic analysis using dilutions of a single complex template from an Arabidopsis thaliana-associated microbial community. We modified the Illumina Nextera kit to generate high-quality large-insert (680 bp) paired-end libraries using a range of 50 pg to 50 ng of input DNA. Using assembly-based metagenomic analysis, we demonstrate that DNA input level has a significant impact on community structure due to overrepresentation of low-GC genomic regions following library amplification. In our system, these differences were largely superseded by variations between biological replicates, but our results advocate verifying the influence of library amplification on a case-by-case basis. Overall, this study provides recommendations for quality filtering and de-replication prior to analysis, as well as a practical framework to address the issue of low biomass or biomass heterogeneity in longitudinal metagenomic surveys. © 2014 Society for Applied Microbiology and John Wiley & Sons Ltd.
Raupach, Michael J.; Hannig, Karsten; Moriniére, Jérôme; Hendrich, Lars
2018-01-01
Abstract The genus Amara Bonelli, 1810 is a very speciose and taxonomically difficult genus of the Carabidae. The identification of many of the species is accomplished with considerable difficulty, in particular for females and immature stages. In this study the effectiveness of DNA barcoding, the most popular method for molecular species identification, was examined to discriminate various species of this genus from Central Europe. DNA barcodes from 690 individuals and 47 species were analysed, including sequences from previous studies and more than 350 newly generated DNA barcodes. Our analysis revealed unique BINs for 38 species (81%). Interspecific K2P distances below 2.2% were found for three species pairs and one species trio, including haplotype sharing between Amara alpina/Amara torrida and Amara communis/Amara convexior/Amara makolskii. This study represents another step in generating an extensive reference library of DNA barcodes for carabids, highly valuable bioindicators for characterizing disturbances in various habitats. PMID:29853775
Bentley, L; Fehrsen, J; Jordaan, F; Huismans, H; du Plessis, D H
2000-04-01
VP2 is an outer capsid protein of African horsesickness virus (AHSV) and is recognized by serotype-discriminatory neutralizing antibodies. With the objective of locating its antigenic regions, a filamentous phage library was constructed that displayed peptides derived from the fragmentation of a cDNA copy of the gene encoding VP2. Peptides ranging in size from approximately 30 to 100 amino acids were fused with pIII, the attachment protein of the display vector, fUSE2. To ensure maximum diversity, the final library consisted of three sub-libraries. The first utilized enzymatically fragmented DNA encoding only the VP2 gene, the second included plasmid sequences, while the third included a PCR step designed to allow different peptide-encoding sequences to recombine before ligation into the vector. The resulting composite library was subjected to immunoaffinity selection with AHSV-specific polyclonal chicken IgY, polyclonal horse immunoglobulins and a monoclonal antibody (MAb) known to neutralize AHSV. Antigenic peptides were located by sequencing the DNA of phages bound by the antibodies. Most antigenic determinants capable of being mapped by this method were located in the N-terminal half of VP2. Important binding areas were mapped with high resolution by identifying the minimum overlapping areas of the selected peptides. The MAb was also used to screen a random 17-mer epitope library. Sequences that may be part of a discontinuous neutralization epitope were identified. The amino acid sequences of the antigenic regions on VP2 of serotype 3 were compared with corresponding regions on three other serotypes, revealing regions with the potential to discriminate AHSV serotypes serologically.
Yuen, Lik Hang; Franzini, Raphael M
2017-05-04
DNA-encoded chemical libraries (DECLs) are pools of DNA-tagged small molecules that enable facile screening and identification of bio-macromolecule binders. The successful development of DECLs has led to their increasingly important role in drug development, and screening hits have entered clinical trials. In this review, we summarize the development and currently active research areas of DECLs with a focus on contributions from groups at academic institutes. We further look at opportunities and future directions of DECL research in medicinal chemistry and chemical biology based on the symbiotic relationship between academia and industry. Challenges associated with the application of DECLs in academic drug discovery are further discussed. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Réfega, Susana; Girard-Misguich, Fabienne; Bourdieu, Christiane; Péry, Pierre; Labbé, Marie
2003-04-02
Specific antibodies were produced ex vivo from intestinal culture of Eimeria tenella infected chickens. The specificity of these intestinal antibodies was tested against different parasite stages. These antibodies were used to immunoscreen first generation schizont and sporozoite cDNA libraries permitting the identification of new E. tenella antigens. We obtained a total of 119 cDNA clones which were subjected to sequence analysis. The sequences coding for the proteins inducing local immune responses were compared with nucleotide or protein databases and with expressed sequence tags (ESTs) databases. We identified new Eimeria genes coding for heat shock proteins, a ribosomal protein, a pyruvate kinase and a pyridoxine kinase. Specific features of other sequences are discussed.
Lin, Xiaodong; Liu, Yaqing; Deng, Jiankang; Lyu, Yanlong; Qian, Pengcheng; Li, Yunfei; Wang, Shuo
2018-02-21
The integration of multiple DNA logic gates on a universal platform to implement advance logic functions is a critical challenge for DNA computing. Herein, a straightforward and powerful strategy in which a guanine-rich DNA sequence lighting up a silver nanocluster and fluorophore was developed to construct a library of logic gates on a simple DNA-templated silver nanoclusters (DNA-AgNCs) platform. This library included basic logic gates, YES, AND, OR, INHIBIT, and XOR, which were further integrated into complex logic circuits to implement diverse advanced arithmetic/non-arithmetic functions including half-adder, half-subtractor, multiplexer, and demultiplexer. Under UV irradiation, all the logic functions could be instantly visualized, confirming an excellent repeatability. The logic operations were entirely based on DNA hybridization in an enzyme-free and label-free condition, avoiding waste accumulation and reducing cost consumption. Interestingly, a DNA-AgNCs-based multiplexer was, for the first time, used as an intelligent biosensor to identify pathogenic genes, E. coli and S. aureus genes, with a high sensitivity. The investigation provides a prototype for the wireless integration of multiple devices on even the simplest single-strand DNA platform to perform diverse complex functions in a straightforward and cost-effective way.
Isolation and characterization of cDNA clones for carrot extensin and a proline-rich 33-kDa protein
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, J.; Varner, J.E.
1985-07-01
Extensins are hydroxyproline-rich glycoproteins associated with most dicotyledonous plant cell walls. To isolate cDNA clones encoding extensin, the authors started by isolating poly(A) RNA from carrot root tissue, and then translating the RNA in vitro, in the presence of tritiated leucine or proline. A 33-kDa peptide was identified in the translation products as a putative extensin precursor. From a cDNA library constructed with poly(A) RNA from wounded carrots, one cDNA clone (pDC5) was identified that specifically hybridized to poly(A) RNA encoding this 33-kDa peptide. They isolated three cDNA clones (pDC11, pDC12, and pDC16) from another cDNA library using pCD5 asmore » a probe. DNA sequence data, RNA hybridization analysis, and hybrid released in vitro translation indicate that the cDNA clones pDC11 encodes extensin and that cDNA clones pDC12 and pDC16 encode the 33-kDa peptide, which as yet has an unknown identity and function. The assumption that the 33-kDa peptide was an extensin precursor was invalid. RNA hybridization analysis showed that RNA encoded by both clone types is accumulated upon wounding.« less
cDNA encoding a polypeptide including a hevein sequence
Raikhel, N.V.; Broekaert, W.F.; Namhai Chua; Kush, A.
1993-02-16
A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids.
Discovery, SAR, and X-ray Binding Mode Study of BCATm Inhibitors from a Novel DNA-Encoded Library
2015-01-01
As a potential target for obesity, human BCATm was screened against more than 14 billion DNA encoded compounds of distinct scaffolds followed by off-DNA synthesis and activity confirmation. As a consequence, several series of BCATm inhibitors were discovered. One representative compound (R)-3-((1-(5-bromothiophene-2-carbonyl)pyrrolidin-3-yl)oxy)-N-methyl-2′-(methylsulfonamido)-[1,1′-biphenyl]-4-carboxamide (15e) from a novel compound library synthesized via on-DNA Suzuki–Miyaura cross-coupling showed BCATm inhibitory activity with IC50 = 2.0 μM. A protein crystal structure of 15e revealed that it binds to BCATm within the catalytic site adjacent to the PLP cofactor. The identification of this novel inhibitor series plus the establishment of a BCATm protein structure provided a good starting point for future structure-based discovery of BCATm inhibitors. PMID:26288694
Hit-Validation Methodologies for Ligands Isolated from DNA-Encoded Chemical Libraries.
Zimmermann, Gunther; Li, Yizhou; Rieder, Ulrike; Mattarella, Martin; Neri, Dario; Scheuermann, Jörg
2017-05-04
DNA-encoded chemical libraries (DECLs) are large collections of compounds linked to DNA fragments, serving as amplifiable barcodes, which can be screened on target proteins of interest. In typical DECL selections, preferential binders are identified by high-throughput DNA sequencing, by comparing their frequency before and after the affinity capture step. Hits identified in this procedure need to be confirmed, by resynthesis and by performing affinity measurements. In this article we present new methods based on hybridization of oligonucleotide conjugates with fluorescently labeled complementary oligonucleotides; these facilitate the determination of affinity constants and kinetic dissociation constants. The experimental procedures were demonstrated with acetazolamide, a binder to carbonic anhydrase IX with a dissociation constant in the nanomolar range. The detection of binding events was compatible not only with fluorescence polarization methodologies, but also with Alphascreen technology and with microscale thermophoresis. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Compilation of DNA sequences of Escherichia coli (update 1991)
Kröger, Manfred; Wahl, Ralf; Rice, Peter
1991-01-01
We have compiled the DNA sequence data for E.coli available from the GENBANK and EMBL data libraries and over a period of several years independently from the literature. This is the third listing replacing and increasing the former listing roughly by one fifth. However, in order to save space this printed version contains DNA sequence information only. The complete compilation is now available in machine readable form from the EMBL data library (ECD release 6). After deletion of all detected overlaps a total of 1 492 282 individual bp is found to be determined till the beginning of 1991. This corresponds to a total of 31.62% of the entire E.coli chromosome consisting of about 4,720 kbp. This number may actually be higher by some extra 2,5% derived from lysogenic bacteriophage lambda and various DNA sequences already received for statistical purposes only. PMID:2041799
Single Day Construction of Multigene Circuits with 3G Assembly.
Halleran, Andrew D; Swaminathan, Anandh; Murray, Richard M
2018-05-18
The ability to rapidly design, build, and test prototypes is of key importance to every engineering discipline. DNA assembly often serves as a rate limiting step of the prototyping cycle for synthetic biology. Recently developed DNA assembly methods such as isothermal assembly and type IIS restriction enzyme systems take different approaches to accelerate DNA construction. We introduce a hybrid method, Golden Gate-Gibson (3G), that takes advantage of modular part libraries introduced by type IIS restriction enzyme systems and isothermal assembly's ability to build large DNA constructs in single pot reactions. Our method is highly efficient and rapid, facilitating construction of entire multigene circuits in a single day. Additionally, 3G allows generation of variant libraries enabling efficient screening of different possible circuit constructions. We characterize the efficiency and accuracy of 3G assembly for various construct sizes, and demonstrate 3G by characterizing variants of an inducible cell-lysis circuit.
Shahsavarian, Melody A; Le Minoux, Damien; Matti, Kalyankumar M; Kaveri, Srini; Lacroix-Desmazes, Sébastien; Boquet, Didier; Friboulet, Alain; Avalle, Bérangère; Padiolleau-Lefèvre, Séverine
2014-05-01
Phage display antibody libraries have proven to have a significant role in the discovery of therapeutic antibodies and polypeptides with desired biological and physicochemical properties. Obtaining a large and diverse phage display antibody library, however, is always a challenging task. Various steps of this technique can still undergo optimization in order to obtain an efficient library. In the construction of a single chain fragment variable (scFv) phage display library, the cloning of the scFv fragments into a phagemid vector is of crucial importance. An efficient restriction enzyme digestion of the scFv DNA leads to its proper ligation with the phagemid followed by its successful cloning and expression. Here, we are reporting a different approach to enhance the efficiency of the restriction enzyme digestion step. We have exploited rolling circle amplification (RCA) to produce a long strand of DNA with tandem repeats of scFv sequences, which is found to be highly susceptible to restriction digestion. With this important modification, we are able to construct a large phage display antibody library of naive SJL/J mice. The size of the library is estimated as ~10(8) clones. The number of clones containing a scFv fragment is estimated at 90%. Hence, the present results could considerably aid the utilization of the phage-display technique in order to get an efficiently large antibody library. Copyright © 2014 Elsevier B.V. All rights reserved.
Gong, Qian; Li, Chang-ying; Chang, Ji-wu; Zhu, Tie-hong
2012-06-01
To screen monoclonal antibodies to amylin from a constructed human phage antibody library and identify their antigenic specificity and combining activities. The heavy chain Fd fragment and light chain of human immunoglobulin genes were amplified from peripheral blood lymphocytes of healthy donors using RT-PCR, and then inserted into phagemid pComb3XSS to generate a human phage antibody library. The insertion of light chain or heavy chain Fd genes were identified by PCR after the digestion of Sac I, Xba I, Xho Iand Spe I. One of positive clones was analyzed by DNA sequencing. The specific anti-amylin clones were screened from antibody library against human amylin antigens and then the positive clones were determined by Phage-ELISA analysis. A Fab phage antibody library with 0.8×10(8); members was constructed with the efficacy of about 70%. DNA sequence analysis indicated V(H); gene belonged to V(H);3 gene family and V(λ); gene belonged to the V(λ); gene family. Using human amylin as panning antigen, specific anti-amylin Fab antibodies were enriched by screening the library for three times. Phage-ELISA assay showed the positive clones had very good specificity to amylin antigen. The successful construction of a phage antibody library and the identification of anti-amylin Fab antibodies provide a basis for further study and preparation of human anti-amylin antibodies.
Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening
2017-01-01
Microfluidic droplet-based screening of DNA-encoded one-bead-one-compound combinatorial libraries is a miniaturized, potentially widely distributable approach to small molecule discovery. In these screens, a microfluidic circuit distributes library beads into droplets of activity assay reagent, photochemically cleaves the compound from the bead, then incubates and sorts the droplets based on assay result for subsequent DNA sequencing-based hit compound structure elucidation. Pilot experimental studies revealed that Poisson statistics describe nearly all aspects of such screens, prompting the development of simulations to understand system behavior. Monte Carlo screening simulation data showed that increasing mean library sampling (ε), mean droplet occupancy, or library hit rate all increase the false discovery rate (FDR). Compounds identified as hits on k > 1 beads (the replicate k class) were much more likely to be authentic hits than singletons (k = 1), in agreement with previous findings. Here, we explain this observation by deriving an equation for authenticity, which reduces to the product of a library sampling bias term (exponential in k) and a sampling saturation term (exponential in ε) setting a threshold that the k-dependent bias must overcome. The equation thus quantitatively describes why each hit structure’s FDR is based on its k class, and further predicts the feasibility of intentionally populating droplets with multiple library beads, assaying the micromixtures for function, and identifying the active members by statistical deconvolution. PMID:28682059
Singleton, David R.; Powell, Sabrina N.; Sangaiah, Ramiah; Gold, Avram; Ball, Louise M.; Aitken, Michael D.
2005-01-01
[13C6]salicylate, [U-13C]naphthalene, and [U-13C]phenanthrene were synthesized and separately added to slurry from a bench-scale, aerobic bioreactor used to treat soil contaminated with polycyclic aromatic hydrocarbons. Incubations were performed for either 2 days (salicylate, naphthalene) or 7 days (naphthalene, phenanthrene). Total DNA was extracted from the incubations, the “heavy” and “light” DNA were separated, and the bacterial populations associated with the heavy fractions were examined by denaturing gradient gel electrophoresis (DGGE) and 16S rRNA gene clone libraries. Unlabeled DNA from Escherichia coli K-12 was added to each sample as an internal indicator of separation efficiency. While E. coli was not detected in most analyses of heavy DNA, a low number of E. coli sequences was recovered in the clone libraries associated with the heavy DNA fraction of [13C]phenanthrene incubations. The number of E. coli clones recovered proved useful in determining the relative amount of light DNA contamination of the heavy fraction in that sample. Salicylate- and naphthalene-degrading communities displayed similar DGGE profiles and their clone libraries were composed primarily of sequences belonging to the Pseudomonas and Ralstonia genera. In contrast, heavy DNA from the phenanthrene incubations displayed a markedly different DGGE profile and was composed primarily of sequences related to the Acidovorax genus. There was little difference in the DGGE profiles and types of sequences recovered from 2- and 7-day incubations with naphthalene, so secondary utilization of the 13C during the incubation did not appear to be an issue in this experiment. PMID:15746319
16S rDNA clone libraries were evaluated for detection of fecal source-identifying bacteria from a collapsed equine manure pile. Libraries were constructed using universal eubacterial primers and Bacteroides-Prevotella group-specific primers. Eubacterial sequences indicat...
A BIOINFORMATIC STRATEGY TO RAPIDLY CHARACTERIZE CDNA LIBRARIES
A Bioinformatic Strategy to Rapidly Characterize cDNA Libraries
G. Charles Ostermeier1, David J. Dix2 and Stephen A. Krawetz1.
1Departments of Obstetrics and Gynecology, Center for Molecular Medicine and Genetics, & Institute for Scientific Computing, Wayne State Univer...
Johnson, LeeAnn K; Brown, Mary B; Carruthers, Ethan A; Ferguson, John A; Dombek, Priscilla E; Sadowsky, Michael J
2004-08-01
A horizontal, fluorophore-enhanced, repetitive extragenic palindromic-PCR (rep-PCR) DNA fingerprinting technique (HFERP) was developed and evaluated as a means to differentiate human from animal sources of Escherichia coli. Box A1R primers and PCR were used to generate 2,466 rep-PCR and 1,531 HFERP DNA fingerprints from E. coli strains isolated from fecal material from known human and 12 animal sources: dogs, cats, horses, deer, geese, ducks, chickens, turkeys, cows, pigs, goats, and sheep. HFERP DNA fingerprinting reduced within-gel grouping of DNA fingerprints and improved alignment of DNA fingerprints between gels, relative to that achieved using rep-PCR DNA fingerprinting. Jackknife analysis of the complete rep-PCR DNA fingerprint library, done using Pearson's product-moment correlation coefficient, indicated that animal and human isolates were assigned to the correct source groups with an 82.2% average rate of correct classification. However, when only unique isolates were examined, isolates from a single animal having a unique DNA fingerprint, Jackknife analysis showed that isolates were assigned to the correct source groups with a 60.5% average rate of correct classification. The percentages of correctly classified isolates were about 15 and 17% greater for rep-PCR and HFERP, respectively, when analyses were done using the curve-based Pearson's product-moment correlation coefficient, rather than the band-based Jaccard algorithm. Rarefaction analysis indicated that, despite the relatively large size of the known-source database, genetic diversity in E. coli was very great and is most likely accounting for our inability to correctly classify many environmental E. coli isolates. Our data indicate that removal of duplicate genotypes within DNA fingerprint libraries, increased database size, proper methods of statistical analysis, and correct alignment of band data within and between gels improve the accuracy of microbial source tracking methods.
2013-10-09
have desirable traits. We aim to enlarge the E. coli genome using Lactobacillusplantarum genes to build cells tolerant to EtOH and BT. L. plantarum is...chemicals III. Approach Objective 1 & la: Integrated heterologous (L. plantarum ) DNA into the E. coli chromosome and selected for insertions that...developed in combination with genes identified from screening L. plantarum libraries. Additionally, we have screened heterologous libraries for
Dokarry, Melissa; Laurendon, Caroline; O'Maille, Paul E
2012-01-01
Structure-based combinatorial protein engineering (SCOPE) is a homology-independent recombination method to create multiple crossover gene libraries by assembling defined combinations of structural elements ranging from single mutations to domains of protein structure. SCOPE was originally inspired by DNA shuffling, which mimics recombination during meiosis, where mutations from parental genes are "shuffled" to create novel combinations in the resulting progeny. DNA shuffling utilizes sequence identity between parental genes to mediate template-switching events (the annealing and extension of one parental gene fragment on another) in PCR reassembly reactions to generate crossovers and hence recombination between parental genes. In light of the conservation of protein structure and degeneracy of sequence, SCOPE was developed to enable the "shuffling" of distantly related genes with no requirement for sequence identity. The central principle involves the use of oligonucleotides to encode for crossover regions to choreograph template-switching events during PCR assembly of gene fragments to create chimeric genes. This approach was initially developed to create libraries of hybrid DNA polymerases from distantly related parents, and later developed to create a combinatorial mutant library of sesquiterpene synthases to explore the catalytic landscapes underlying the functional divergence of related enzymes. This chapter presents a simplified protocol of SCOPE that can be integrated with different mutagenesis techniques and is suitable for automation by liquid-handling robots. Two examples are presented to illustrate the application of SCOPE to create gene libraries using plant sesquiterpene synthases as the model system. In the first example, we outline how to create an active-site library as a series of complex mixtures of diverse mutants. In the second example, we outline how to create a focused library as an array of individual clones to distil minimal combinations of functionally important mutations. Through these examples, the principles of the technique are illustrated and the suitability of automating various aspects of the procedure for given applications are discussed. Copyright © 2012 Elsevier Inc. All rights reserved.
Analysis of cDNA libraries from developing seeds of guar (Cyamopsis tetragonoloba (L.) Taub)
Naoumkina, Marina; Torres-Jerez, Ivone; Allen, Stacy; He, Ji; Zhao, Patrick X; Dixon, Richard A; May, Gregory D
2007-01-01
Background Guar, Cyamopsis tetragonoloba (L.) Taub, is a member of the Leguminosae (Fabaceae) family and is economically the most important of the four species in the genus. The endosperm of guar seed is a rich source of mucilage or gum, which forms a viscous gel in cold water, and is used as an emulsifier, thickener and stabilizer in a wide range of foods and industrial applications. Guar gum is a galactomannan, consisting of a linear (1→4)-β-linked D-mannan backbone with single-unit, (1→6)-linked, α-D-galactopyranosyl side chains. To better understand regulation of guar seed development and galactomannan metabolism we created cDNA libraries and a resulting EST dataset from different developmental stages of guar seeds. Results A database of 16,476 guar seed ESTs was constructed, with 8,163 and 8,313 ESTs derived from cDNA libraries I and II, respectively. Library I was constructed from seeds at an early developmental stage (15–25 days after flowering, DAF), and library II from seeds at 30–40 DAF. Quite different sets of genes were represented in these two libraries. Approximately 27% of the clones were not similar to known sequences, suggesting that these ESTs represent novel genes or may represent non-coding RNA. The high flux of energy into carbohydrate and storage protein synthesis in guar seeds was reflected by a high representation of genes annotated as involved in signal transduction, carbohydrate metabolism, chaperone and proteolytic processes, and translation and ribosome structure. Guar unigenes involved in galactomannan metabolism were identified. Among the seed storage proteins, the most abundant contig represented a conglutin accounting for 3.7% of the total ESTs from both libraries. Conclusion The present EST collection and its annotation provide a resource for understanding guar seed biology and galactomannan metabolism. PMID:18034910
Zhu, Ziguo; Shi, Jiangli; Cao, Jiangling; He, Mingyang; Wang, Yuejin
2012-11-01
Chinese wild grapevine Vitis pseudoreticulata accession 'Baihe-35-1' is identified as the precious resource with multiple resistances to pathogens. A directional cDNA library was constructed from the young leaves inoculated with Erysiphe necator. A total of 3,500 clones were sequenced, yielding 1,727 unigenes. Among them, 762 unigenes were annotated and classified into three classes, respectively, using Gene Ontology, including 22 ESTs related to transcription regulator activity. A novel WRKY transcription factor was isolated from the library, and designated as VpWRKY3 (GenBank Accession No. JF500755). The full-length cDNA is 1,280 bp, encoding a WRKY protein of 320 amino acids. VpWRKY3 is localized to nucleus and functions as a transcriptional activator. QRT-PCR analysis showed that the VpWRKY3 specifically accumulated in response to pathogen, salicylic acid, ethylene and drought stress. Overexpression of VpWRKY3 in tobacco increased the resistance to Ralstonia solanacearum, indicating that VpWRKY3 participates in defense response. Furthermore, VpWRKY3 is also involved in abscisic acid signal pathway and salt stress. This experiment provided an important basis for understanding the defense mechanisms mediated by WRKY genes in China wild grapevine. Generation of the EST collection from the cDNA library provided valuable information for the grapevine breeding. Key message We constructed a cDNA library from Chinese wild grapevine leaves inoculated with powdery mildew. VpWRKY3 was isolated and demonstrated that it was involved in biotic and abiotic stress responses.
Preparation of metagenomic libraries from naturally occurring marine viruses.
Solonenko, Sergei A; Sullivan, Matthew B
2013-01-01
Microbes are now well recognized as major drivers of the biogeochemical cycling that fuels the Earth, and their viruses (phages) are known to be abundant and important in microbial mortality, horizontal gene transfer, and modulating microbial metabolic output. Investigation of environmental phages has been frustrated by an inability to culture the vast majority of naturally occurring diversity coupled with the lack of robust, quantitative, culture-independent methods for studying this uncultured majority. However, for double-stranded DNA phages, a quantitative viral metagenomic sample-to-sequence workflow now exists. Here, we review these advances with special emphasis on the technical details of preparing DNA sequencing libraries for metagenomic sequencing from environmentally relevant low-input DNA samples. Library preparation steps broadly involve manipulating the sample DNA by fragmentation, end repair and adaptor ligation, size fractionation, and amplification. One critical area of future research and development is parallel advances for alternate nucleic acid types such as single-stranded DNA and RNA viruses that are also abundant in nature. Combinations of recent advances in fragmentation (e.g., acoustic shearing and tagmentation), ligation reactions (adaptor-to-template ratio reference table availability), size fractionation (non-gel-sizing), and amplification (linear amplification for deep sequencing and linker amplification protocols) enhance our ability to generate quantitatively representative metagenomic datasets from low-input DNA samples. Such datasets are already providing new insights into the role of viruses in marine systems and will continue to do so as new environments are explored and synergies and paradigms emerge from large-scale comparative analyses. © 2013 Elsevier Inc. All rights reserved.
Capturing the 'ome': the expanding molecular toolbox for RNA and DNA library construction.
Boone, Morgane; De Koker, Andries; Callewaert, Nico
2018-04-06
All sequencing experiments and most functional genomics screens rely on the generation of libraries to comprehensively capture pools of targeted sequences. In the past decade especially, driven by the progress in the field of massively parallel sequencing, numerous studies have comprehensively assessed the impact of particular manipulations on library complexity and quality, and characterized the activities and specificities of several key enzymes used in library construction. Fortunately, careful protocol design and reagent choice can substantially mitigate many of these biases, and enable reliable representation of sequences in libraries. This review aims to guide the reader through the vast expanse of literature on the subject to promote informed library generation, independent of the application.
Identification of species adulteration in traded medicinal plant raw drugs using DNA barcoding.
Nithaniyal, Stalin; Vassou, Sophie Lorraine; Poovitha, Sundar; Raju, Balaji; Parani, Madasamy
2017-02-01
Plants are the major source of therapeutic ingredients in complementary and alternative medicine (CAM). However, species adulteration in traded medicinal plant raw drugs threatens the reliability and safety of CAM. Since morphological features of medicinal plants are often not intact in the raw drugs, DNA barcoding was employed for species identification. Adulteration in 112 traded raw drugs was tested after creating a reference DNA barcode library consisting of 1452 rbcL and matK barcodes from 521 medicinal plant species. Species resolution of this library was 74.4%, 90.2%, and 93.0% for rbcL, matK, and rbcL + matK, respectively. DNA barcoding revealed adulteration in about 20% of the raw drugs, and at least 6% of them were derived from plants with completely different medicinal or toxic properties. Raw drugs in the form of dried roots, powders, and whole plants were found to be more prone to adulteration than rhizomes, fruits, and seeds. Morphological resemblance, co-occurrence, mislabeling, confusing vernacular names, and unauthorized or fraudulent substitutions might have contributed to species adulteration in the raw drugs. Therefore, this library can be routinely used to authenticate traded raw drugs for the benefit of all stakeholders: traders, consumers, and regulatory agencies.
Malc, Ewa P.; Jayakody, Chatura N.; Tsuruta, James K.; Mieczkowski, Piotr A.; Janzen, William P.; Dayton, Paul A.
2015-01-01
A perfluorocarbon nanodroplet formulation is shown to be an effective cavitation enhancement agent, enabling rapid and consistent fragmentation of genomic DNA in a standard ultrasonic water bath. This nanodroplet-enhanced method produces genomic DNA libraries and next-generation sequencing results indistinguishable from DNA samples fragmented in dedicated commercial acoustic sonication equipment, and with higher throughput. This technique thus enables widespread access to fast bench-top genomic DNA fragmentation. PMID:26186461
Chen, He; Yao, Jiacheng; Fu, Yusi; Pang, Yuhong; Wang, Jianbin; Huang, Yanyi
2018-04-11
The next generation sequencing (NGS) technologies have been rapidly evolved and applied to various research fields, but they often suffer from losing long-range information due to short library size and read length. Here, we develop a simple, cost-efficient, and versatile NGS library preparation method, called tagmentation on microbeads (TOM). This method is capable of recovering long-range information through tagmentation mediated by microbead-immobilized transposomes. Using transposomes with DNA barcodes to identically label adjacent sequences during tagmentation, we can restore inter-read connection of each fragment from original DNA molecule by fragment-barcode linkage after sequencing. In our proof-of-principle experiment, more than 4.5% of the reads are linked with their adjacent reads, and the longest linkage is over 1112 bp. We demonstrate TOM with eight barcodes, but the number of barcodes can be scaled up by an ultrahigh complexity construction. We also show this method has low amplification bias and effectively fits the applications to identify copy number variations.
USDA-ARS?s Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Enzymatically Generated CRISPR Libraries for Genome Labeling and Screening.
Lane, Andrew B; Strzelecka, Magdalena; Ettinger, Andreas; Grenfell, Andrew W; Wittmann, Torsten; Heald, Rebecca
2015-08-10
CRISPR-based technologies have emerged as powerful tools to alter genomes and mark chromosomal loci, but an inexpensive method for generating large numbers of RNA guides for whole genome screening and labeling is lacking. Using a method that permits library construction from any source of DNA, we generated guide libraries that label repetitive loci or a single chromosomal locus in Xenopus egg extracts and show that a complex library can target the E. coli genome at high frequency. Copyright © 2015 Elsevier Inc. All rights reserved.
Gene Expression Differences in Infected and Noninfected Middle Ear Complementary DNA Libraries
Kerschner, Joseph E.; Horsey, Edward; Ahmed, Azad; Erbe, Christy; Khampang, Pawjai; Cioffi, Joseph; Hu, Fen Ze; Post, James Christopher; Ehrlich, Garth D.
2010-01-01
Objectives To investigate genetic differences in middle ear mucosa (MEM) with nontypeable Haemophilus influenzae (NTHi) infection. Genetic upregulation and downregulation occurs in MEM during otitis media (OM) pathogenesis. A comprehensive assessment of these genetic differences using the techniques of complementary DNA (cDNA) library creation has not been performed. Design The cDNA libraries were constructed from NTHi-infected and noninfected chinchilla MEM. Random clones were picked, sequenced bidirectionally, and submitted to the National Center for Biotechnology Information (NCBI) Expressed Sequence Tags database, where they were assigned accession numbers. These numbers were used with the basic local alignment search tool (BLAST) to align clones against the nonredundant nucleotide database at NCBI. Results Analysis with the Web-based statistical program FatiGO identified several biological processes with significant differences in numbers of represented genes. Processes involved in immune, stress, and wound responses were more prevalent in the NTHi-infected library. S100 calcium-binding protein A9 (S100A9); secretory leukoprotease inhibitor (SLPI); β2-microglobulin (B2M); ferritin, heavy-chain polypeptide 1 (FTH1); and S100 calcium-binding protein A8 (S100A8) were expressed at significantly higher levels in the NTHi-infected library. Calcium-binding proteins S100A9 and S100A8 serve as markers for inflammation and have antibacterial effects. Secretory leukoprotease inhibitor is an antibacterial protein that inhibits stimuli-induced MUC1, MUC2, and MUC5AC production. Conclusions A number of genes demonstrate changes during the pathogenesis of OM, including SLPI, which has an impact on mucin gene expression; this expression is known to be an important regulator in OM. The techniques described herein provide a framework for future investigations to more thoroughly understand molecular changes in the middle ear, which will likely be important in developing new therapeutic and intervention strategies. PMID:19153305
Boltaña, Sebastian; Castellana, Barbara; Goetz, Giles; Tort, Lluis; Teles, Mariana; Mulero, Victor; Novoa, Beatriz; Figueras, Antonio; Goetz, Frederick W; Gallardo-Escarate, Cristian; Planas, Josep V; Mackenzie, Simon
2017-02-03
This study describes the development and validation of an enriched oligonucleotide-microarray platform for Sparus aurata (SAQ) to provide a platform for transcriptomic studies in this species. A transcriptome database was constructed by assembly of gilthead sea bream sequences derived from public repositories of mRNA together with reads from a large collection of expressed sequence tags (EST) from two extensive targeted cDNA libraries characterizing mRNA transcripts regulated by both bacterial and viral challenge. The developed microarray was further validated by analysing monocyte/macrophage activation profiles after challenge with two Gram-negative bacterial pathogen-associated molecular patterns (PAMPs; lipopolysaccharide (LPS) and peptidoglycan (PGN)). Of the approximately 10,000 EST sequenced, we obtained a total of 6837 EST longer than 100 nt, with 3778 and 3059 EST obtained from the bacterial-primed and from the viral-primed cDNA libraries, respectively. Functional classification of contigs from the bacterial- and viral-primed cDNA libraries by Gene Ontology (GO) showed that the top five represented categories were equally represented in the two libraries: metabolism (approximately 24% of the total number of contigs), carrier proteins/membrane transport (approximately 15%), effectors/modulators and cell communication (approximately 11%), nucleoside, nucleotide and nucleic acid metabolism (approximately 7.5%) and intracellular transducers/signal transduction (approximately 5%). Transcriptome analyses using this enriched oligonucleotide platform identified differential shifts in the response to PGN and LPS in macrophage-like cells, highlighting responsive gene-cassettes tightly related to PAMP host recognition. As observed in other fish species, PGN is a powerful activator of the inflammatory response in S. aurata macrophage-like cells. We have developed and validated an oligonucleotide microarray (SAQ) that provides a platform enriched for the study of gene expression in S. aurata with an emphasis upon immunity and the immune response.
Yang, Hongmei; Yao, Wenbin; Wang, Yihan; Shi, Lei; Su, Rui; Wan, Debin; Xu, Niusheng; Lian, Wenhui; Chen, Changbao; Liu, Shuying
2017-02-14
Conventional strategies for the screening of DNA triplex binders cannot be used for complicated samples, such as ligand libraries created by combinatorial chemistry or from natural product extracts. In the current study, an ultra-high-performance liquid chromatography coupled with an Orbitrap mass spectrometry (UHPLC-Orbitrap-MS)-based approach, which we call peak area-fading (PAF) UHPLC-Orbitrap-MS and was designed for just such a purpose, is reported. The triplex DNA modified 96-well plate and the single stranded oligonucleotide modified 96-well plate (as control) were incubated with ligand libraries, and the unbound ligands were directly determined via UHPLC-ESI-MS. The binders were detected through the decrease (fading) in the peak areas compared to those of the control group. Several factors, such as incubation time, incubation temperature, and buffer, which might affect the binding affinity and reproducibility, were optimized. The potential of the approach was examined using the extracts of Rhizoma Coptidis and Phellodendron chinense Schneid cortexe. The triplex DNA-binding capabilities of the five components (epiberberine, coptisine, jatrorrhizine, berberrubine, and columbamine) were found for the first time, indicating their efficiency for the analysis of complicated samples. In contrast to our previous study, which suffered from a serious drawback of poor reproducibility, this method is more robust and more suitable for high-throughput measurements, opening a new experimental strategy in assessing large libraries of potential drug candidates that work by forming a drug/DNA complex.
Undermethylated DNA as a source of microsatellites from a conifer genome.
Zhou, Y; Bui, T; Auckland, L D; Williams, C G
2002-02-01
Developing microsatellites from the large, highly duplicated conifer genome requires special tools. To improve the efficiency of developing Pinus taeda L. microsatellites, undermethylated (UM) DNA fragments were used to construct a microsatellite-enriched copy library. A methylation-sensitive restriction enzyme, McrBC, was used to enrich for UM DNA before library construction. Digested DNA fragments larger than 9 kb were then excised and digested with RsaI and used to construct nine dinucleotide and trinucleotide libraries. A total of 1016 microsatellite-positive clones were detected among 11 904 clones and 620 of these were unique. Of 245 primer sets that produced a PCR product, 113 could be developed as UM microsatellite markers and 70 were polymorphic. Inheritance and marker informativeness were tested for a random sample of 36 polymorphic markers using a three-generation outbred pedigree. Thirty-one microsatellites (86%) had single-locus inheritance despite the highly duplicated nature of the P. taeda genome. Nineteen UM microsatellites had highly informative intercross mating type configurations. Allele number and frequency were estimated for eleven UM microsatellites using a population survey. Allele numbers for these UM microsatellites ranged from 3 to 12 with an average of 5.7 alleles/locus. Frequencies for the 63 alleles were mostly in the low-common range; only 14 of the 63 were in the rare allele (q < 0.05) class. Enriching for UM DNA was an efficient method for developing polymorphic microsatellites from a large plant genome.
Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jakob; Gilchrist, Michael J; Panitz, Frank; Jørgensen, Claus; Scheibye-Knudsen, Karsten; Arvin, Troels; Lumholdt, Steen; Sawera, Milena; Green, Trine; Nielsen, Bente J; Havgaard, Jakob H; Rosenkilde, Carina; Wang, Jun; Li, Heng; Li, Ruiqiang; Liu, Bin; Hu, Songnian; Dong, Wei; Li, Wei; Yu, Jun; Wang, Jian; Stærfeldt, Hans-Henrik; Wernersson, Rasmus; Madsen, Lone B; Thomsen, Bo; Hornshøj, Henrik; Bujie, Zhan; Wang, Xuegang; Wang, Xuefei; Bolund, Lars; Brunak, Søren; Yang, Huanming; Bendixen, Christian; Fredholm, Merete
2007-01-01
Background Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. Results Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. Conclusion This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies. PMID:17407547
Li, Hong-Mei; Guo, Kang; Yu, Zhuang; Feng, Rui; Xu, Ping
2015-07-01
Traditional diagnostic technology with tumor biomarkers is inefficient, expensive and requires a large number of serum samples. The purpose of this study was to construct human lung cancer protein chips with new lung cancer biomarkers screened by the T7-phage display library, and improve the early diagnosis rate of lung cancer. A T7-phage cDNA display library was constructed of fresh samples from 30 lung cancer patients. With biopanning and high-throughput screening, we gained the immunogenic phage clones from the cDNA library. The insert of selected phage was blasted at GeneBank for alignment to find the exact or the most similar known genes. Protein chips were then constructed and used to assay their expression level in lung cancer serum from 217 cases of lung cancer groups:80 cases of benign lung disease and 220 healthy controls. After four rounds of Biopanning and two rounds of enzyme-linked immunosorbent assay, 12 phage monoclonal samples were selected from 2880 phage monoclonal samples. After blasting at GeneBank, six similar genes were used to construct diagnostic protein chips. The protein chips were then used to assay expression level in lung cancer serum. The expression level of six genes in lung cancer groups was significantly higher than those in the other two groups (P < 0.05). In this study, we successfully constructed diagnostic protein chips with biomarkers selected from the lung cancer T7-phage cDNA library, which can be used for the early screening of lung cancer patients.
Development of DNA-Free Sediment for Ecological Assays with Genomic Endpoints
Recent advances in genomics are currently being exploited to discern ecological changes that have conventionally been measured using laborious counting techniques. For example, next generation sequencing technologies can be used to create DNA libraries from benthic community ass...
Protein–DNA Interactions: The Story so Far and a New Method for Prediction
Jones, Susan; Thornton, Janet M.
2003-01-01
This review describes methods for the prediction of DNA binding function, and specifically summarizes a new method using 3D structural templates. The new method features the HTH motif that is found in approximately one-third of DNAbinding protein families. A library of 3D structural templates of HTH motifs was derived from proteins in the PDB. Templates were scanned against complete protein structures and the optimal superposition of a template on a structure calculated. Significance thresholds in terms of a minimum root mean squared deviation (rmsd) of an optimal superposition, and a minimum motif accessible surface area (ASA), have been calculated. Inmore » this way, it is possible to scan the template library against proteins of unknown function to make predictions about DNA-binding functionality.« less
USDA-ARS?s Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Cooper, James; Ding, Yi; Song, Jiuzhou; Zhao, Keji
2017-11-01
Increased chromatin accessibility is a feature of cell-type-specific cis-regulatory elements; therefore, mapping of DNase I hypersensitive sites (DHSs) enables the detection of active regulatory elements of transcription, including promoters, enhancers, insulators and locus-control regions. Single-cell DNase sequencing (scDNase-seq) is a method of detecting genome-wide DHSs when starting with either single cells or <1,000 cells from primary cell sources. This technique enables genome-wide mapping of hypersensitive sites in a wide range of cell populations that cannot be analyzed using conventional DNase I sequencing because of the requirement for millions of starting cells. Fresh cells, formaldehyde-cross-linked cells or cells recovered from formalin-fixed paraffin-embedded (FFPE) tissue slides are suitable for scDNase-seq assays. To generate scDNase-seq libraries, cells are lysed and then digested with DNase I. Circular carrier plasmid DNA is included during subsequent DNA purification and library preparation steps to prevent loss of the small quantity of DHS DNA. Libraries are generated for high-throughput sequencing on the Illumina platform using standard methods. Preparation of scDNase-seq libraries requires only 2 d. The materials and molecular biology techniques described in this protocol should be accessible to any general molecular biology laboratory. Processing of high-throughput sequencing data requires basic bioinformatics skills and uses publicly available bioinformatics software.
Preferential cleavage sites for Sau3A restriction endonuclease in human ribosomal DNA.
Kupriyanova, N S; Kirilenko, P M; Netchvolodov, K K; Ryskov, A P
2000-07-21
Previous studies of cloned ribosomal DNA (rDNA) variants isolated from the cosmid library of human chromosome 13 have revealed some disproportion in representativity of different rDNA regions (N. S. Kupriyanova, K. K. Netchvolodov, P. M. Kirilenko, B. I. Kapanadze, N. K. Yankovsky, and A. P. Ryskov, Mol. Biol. 30, 51-60, 1996). Here we show nonrandom cleavage of human rDNA with Sau3A or its isoshizomer MboI under mild hydrolysis conditions. The hypersensitive cleavage sites were found to be located in the ribosomal intergenic spacer (rIGS), especially in the regions of about 5-5.5 and 11 kb upstream of the rRNA transcription start point. This finding is based on sequencing mapping of the rDNA insert ends in randomly selected cosmid clones of human chromosome 13 and on the data of digestion kinetics of cloned and noncloned human genomic rDNA with Sau3A and MboI. The results show that a methylation status and superhelicity state of the rIGS have no effect on cleavage site sensitivity. It is interesting that all primary cleavage sites are adjacent to or entering into Alu or Psi cdc 27 retroposons of the rIGS suggesting a possible role of neighboring sequences in nuclease accessibility. The results explain nonequal representation of rDNA sequences in the human genomic DNA library used for this study. Copyright 2000 Academic Press.
Effects of field-grown genetically modified Zoysia grass on bacterial community structure.
Lee, Yong-Eok; Yang, Sang-Hwan; Bae, Tae-Woong; Kang, Hong-Gyu; Lim, Pyung-Ok; Lee, Hyo-Yeon
2011-04-01
Herbicide-tolerant Zoysia grass has been previously developed through Agrobacterium-mediated transformation. We investigated the effects of genetically modified (GM) Zoysia grass and the associated herbicide application on bacterial community structure by using culture-independent approaches. To assess the possible horizontal gene transfer (HGT) of transgenic DNA to soil microorganisms, total soil DNAs were amplified by PCR with two primer sets for the bar and hpt genes, which were introduced into the GM Zoysia grass by a callus-type transformation. The transgenic genes were not detected from the total genomic DNAs extracted from 1.5 g of each rhizosphere soils of GM and non-GM Zoysia grasses. The structures and diversities of the bacterial communities in rhizosphere soils of GM and non-GM Zoysia grasses were investigated by constructing 16S rDNA clone libraries. Classifier, provided in the RDP II, assigned 100 clones in the 16S rRNA gene sequences library into 11 bacterial phyla. The most abundant phyla in both clone libraries were Acidobacteria and Proteobacteria. The bacterial diversity of the GM clone library was lower than that of the non- GM library. The former contained four phyla, whereas the latter had seven phyla. Phylogenetic trees were constructed to confirm these results. Phylogenetic analyses of the two clone libraries revealed considerable difference from each other. The significance of difference between clone libraries was examined with LIBSHUFF statistics. LIBSHUFF analysis revealed that the two clone libraries differed significantly (P〈0.025), suggesting alterations in the composition of the microbial community associated with GM Zoysia grass.
Bowers, Robert M.; Clum, Alicia; Tice, Hope; ...
2015-10-24
Background: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composi tion and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for largemore » initial DNA template quantities during library preparation. Results: Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions: This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bowers, Robert M.; Clum, Alicia; Tice, Hope
Background: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composi tion and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for largemore » initial DNA template quantities during library preparation. Results: Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions: This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities.« less
Genomics approach to the environmental community of microorganisms
NASA Astrophysics Data System (ADS)
Kawarabayasi, Y.; Maruyama, A.
2004-12-01
It was indicated by microscopic observation or comparison of 16S rDNA sequence that many extremophiles were surviving in many hydrothermal environments. But it is generally said that over 99% of total microbes are now uncultivable. Thus, we planned to identify uncultivable microbes through direct sequencing of environmental DNA. At first, shotgun plasmid libraries were directly constructed with the DNA molecules prepared from mixed microbes collected from low-temperature hydrothermal water at RM24 in the Southern East Pacific Rise (S-EPR). It was shown that the sequences of some number of clones indicated the similar feature to the intron in eukaryote or tandem repetitive sequence identified in some human familiar diseases. The results indicated that many microorganisms with eukaryotic feature were dominant in low temperature water of S-EPR. Secondly, shotgun plasmid libraries were constructed from the environmental DNA prepared from Beppu hot springs. The ORFs were easily identified all clones determined entire sequence. Thus it can be said that hot springs is good resources for searching novel genes. At last, the mixed microbes isolated from Suiyo seamount were used for construction of shotgun library. The clones in this library contained the ORFs. From some clones in hot spring and Suiyo sample, aminoacyl-tRNA synthatase, which is generally present in all organisms, was isolated by similarity. The phylogenetic analysis of aminoacyl-tRNA synthetase identified indicated that novel and unidentified microorganisms should be present in hot spring or Suiyo seamount. The novel genes identified from Suiyo seamount were also utilized for expression in E. coli. Some gene products were successfully obtained from the E. coli cells as soluble proteins. Some protein indicated the thermostability up to 70_E#8249;C, meaning that the original host cell of this gene should be stable up to the same temperature. Our work indicates that environmental genomics, including the direct cloning, sequencing of environmental DNA and expression of gene identified, is powerful approach to collect novel uncultivable microbes or novel active genes.
Diversity of Metabolically Active Bacteria in Water-Flooded High-Temperature Heavy Oil Reservoir
Nazina, Tamara N.; Shestakova, Natalya M.; Semenova, Ekaterina M.; Korshunova, Alena V.; Kostrukova, Nadezda K.; Tourova, Tatiana P.; Min, Liu; Feng, Qingxian; Poltaraus, Andrey B.
2017-01-01
The goal of this work was to study the overall genomic diversity of microorganisms of the Dagang high-temperature oilfield (PRC) and to characterize the metabolically active fraction of these populations. At this water-flooded oilfield, the microbial community of formation water from the near-bottom zone of an injection well where the most active microbial processes of oil degradation occur was investigated using molecular, cultural, radiotracer, and physicochemical techniques. The samples of microbial DNA and RNA from back-flushed water were used to obtain the clone libraries for the 16S rRNA gene and cDNA of 16S rRNA, respectively. The DNA-derived clone libraries were found to contain bacterial and archaeal 16S rRNA genes and the alkB genes encoding alkane monooxygenases similar to those encoded by alkB-geo1 and alkB-geo6 of geobacilli. The 16S rRNA genes of methanogens (Methanomethylovorans, Methanoculleus, Methanolinea, Methanothrix, and Methanocalculus) were predominant in the DNA-derived library of Archaea cloned sequences; among the bacterial sequences, the 16S rRNA genes of members of the genus Geobacillus were the most numerous. The RNA-derived library contained only bacterial cDNA of the 16S rRNA sequences belonging to metabolically active aerobic organotrophic bacteria (Tepidimonas, Pseudomonas, Acinetobacter), as well as of denitrifying (Azoarcus, Tepidiphilus, Calditerrivibrio), fermenting (Bellilinea), iron-reducing (Geobacter), and sulfate- and sulfur-reducing bacteria (Desulfomicrobium, Desulfuromonas). The presence of the microorganisms of the main functional groups revealed by molecular techniques was confirmed by the results of cultural, radioisotope, and geochemical research. Functioning of the mesophilic and thermophilic branches was shown for the microbial food chain of the near-bottom zone of the injection well, which included the microorganisms of the carbon, sulfur, iron, and nitrogen cycles. PMID:28487680
Genomic resources for Myzus persicae: EST sequencing, SNP identification, and microarray design
Ramsey, John S; Wilson, Alex CC; de Vos, Martin; Sun, Qi; Tamborindeguy, Cecilia; Winfield, Agnese; Malloch, Gaynor; Smith, Dawn M; Fenton, Brian; Gray, Stewart M; Jander, Georg
2007-01-01
Background The green peach aphid, Myzus persicae (Sulzer), is a world-wide insect pest capable of infesting more than 40 plant families, including many crop species. However, despite the significant damage inflicted by M. persicae in agricultural systems through direct feeding damage and by its ability to transmit plant viruses, limited genomic information is available for this species. Results Sequencing of 16 M. persicae cDNA libraries generated 26,669 expressed sequence tags (ESTs). Aphids for library construction were raised on Arabidopsis thaliana, Nicotiana benthamiana, Brassica oleracea, B. napus, and Physalis floridana (with and without Potato leafroll virus infection). The M. persicae cDNA libraries include ones made from sexual and asexual whole aphids, guts, heads, and salivary glands. In silico comparison of cDNA libraries identified aphid genes with tissue-specific expression patterns, and gene expression that is induced by feeding on Nicotiana benthamiana. Furthermore, 2423 genes that are novel to science and potentially aphid-specific were identified. Comparison of cDNA data from three aphid lineages identified single nucleotide polymorphisms that can be used as genetic markers and, in some cases, may represent functional differences in the protein products. In particular, non-conservative amino acid substitutions in a highly expressed gut protease may be of adaptive significance for M. persicae feeding on different host plants. The Agilent eArray platform was used to design an M. persicae oligonucleotide microarray representing over 10,000 unique genes. Conclusion New genomic resources have been developed for M. persicae, an agriculturally important insect pest. These include previously unknown sequence data, a collection of expressed genes, molecular markers, and a DNA microarray that can be used to study aphid gene expression. These resources will help elucidate the adaptations that allow M. persicae to develop compatible interactions with its host plants, complementing ongoing work illuminating plant molecular responses to phloem-feeding insects. PMID:18021414
Dominant genetics using a yeast genomic library under the control of a strong inducible promoter.
Ramer, S W; Elledge, S J; Davis, R W
1992-12-01
In Saccharomyces cerevisiae, numerous genes have been identified by selection from high-copy-number libraries based on "multicopy suppression" or other phenotypic consequences of overexpression. Although fruitful, this approach suffers from two major drawbacks. First, high copy number alone may not permit high-level expression of tightly regulated genes. Conversely, other genes expressed in proportion to dosage cannot be identified if their products are toxic at elevated levels. This work reports construction of a genomic DNA expression library for S. cerevisiae that circumvents both limitations by fusing randomly sheared genomic DNA to the strong, inducible yeast GAL1 promoter, which can be regulated by carbon source. The library obtained contains 5 x 10(7) independent recombinants, representing a breakpoint at every base in the yeast genome. This library was used to examine aberrant gene expression in S. cerevisiae. A screen for dominant activators of yeast mating response identified eight genes that activate the pathway in the absence of exogenous mating pheromone, including one previously unidentified gene. One activator was a truncated STE11 gene lacking approximately 1000 base pairs of amino-terminal coding sequence. In two different clones, the same GAL1 promoter-proximal ATG is in-frame with the coding sequence of STE11, suggesting that internal initiation of translation there results in production of a biologically active, truncated STE11 protein. Thus this library allows isolation based on dominant phenotypes of genes that might have been difficult or impossible to isolate from high-copy-number libraries.
2017-05-01
TERMS Ovarian cancer, drug resistance, rucaparib, phase 2, DNA repair, homologous recombination, nonhomologous end-joining (NHEJ), poly(ADP-ribose...tissues from AA patients with OC. This should add 50 AA OC patients. We are also requesting anonymized DNA from AA OC patients who participated on...extracts DNA and creates library pretps for DNA sequencing. He performs Sanger sequencing validations. Funding Support: Has there been a change
Development of DNA-Free Sediment for Ecological Assays with Genomic Endpoints (NAC SETAC)
Recent advances in genomics are currently being exploited to discern ecological changes that have conventionally been measured using laborious counting techniques. For example, next generation sequencing technologies can be used to create DNA libraries from benthic community ass...
Chromosome specific repetitive DNA sequences
Moyzis, Robert K.; Meyne, Julianne
1991-01-01
A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family me This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
Nucleotide exchange and excision technology DNA shuffling and directed evolution.
Speck, Janina; Stebel, Sabine C; Arndt, Katja M; Müller, Kristian M
2011-01-01
Remarkable success in optimizing complex properties within DNA and proteins has been achieved by directed evolution. In contrast to various random mutagenesis methods and high-throughput selection methods, the number of available DNA shuffling procedures is limited, and protocols are often difficult to adjust. The strength of the nucleotide exchange and excision technology (NExT) DNA shuffling described here is the robust, efficient, and easily controllable DNA fragmentation step based on random incorporation of the so-called 'exchange nucleotides' by PCR. The exchange nucleotides are removed enzymatically, followed by chemical cleavage of the DNA backbone. The oligonucleotide pool is reassembled into full-length genes by internal primer extension, and the recombined gene library is amplified by standard PCR. The technique has been demonstrated by shuffling a defined gene library of chloramphenicol acetyltransferase variants using uridine as fragmentation defining exchange nucleotide. Substituting 33% of the dTTP with dUTP in the incorporation PCR resulted in shuffled clones with an average parental fragment size of 86 bases and revealed a mutation rate of only 0.1%. Additionally, a computer program (NExTProg) has been developed that predicts the fragment size distribution depending on the relative amount of the exchange nucleotide.
Chung, Jongsuk; Son, Dae-Soon; Jeon, Hyo-Jeong; Kim, Kyoung-Mee; Park, Gahee; Ryu, Gyu Ha; Park, Woong-Yang; Park, Donghyun
2016-01-01
Targeted capture massively parallel sequencing is increasingly being used in clinical settings, and as costs continue to decline, use of this technology may become routine in health care. However, a limited amount of tissue has often been a challenge in meeting quality requirements. To offer a practical guideline for the minimum amount of input DNA for targeted sequencing, we optimized and evaluated the performance of targeted sequencing depending on the input DNA amount. First, using various amounts of input DNA, we compared commercially available library construction kits and selected Agilent’s SureSelect-XT and KAPA Biosystems’ Hyper Prep kits as the kits most compatible with targeted deep sequencing using Agilent’s SureSelect custom capture. Then, we optimized the adapter ligation conditions of the Hyper Prep kit to improve library construction efficiency and adapted multiplexed hybrid selection to reduce the cost of sequencing. In this study, we systematically evaluated the performance of the optimized protocol depending on the amount of input DNA, ranging from 6.25 to 200 ng, suggesting the minimal input DNA amounts based on coverage depths required for specific applications. PMID:27220682
Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain
2011-01-01
cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.
ChIP-chip versus ChIP-seq: Lessons for experimental design and data analysis
2011-01-01
Background Chromatin immunoprecipitation (ChIP) followed by microarray hybridization (ChIP-chip) or high-throughput sequencing (ChIP-seq) allows genome-wide discovery of protein-DNA interactions such as transcription factor bindings and histone modifications. Previous reports only compared a small number of profiles, and little has been done to compare histone modification profiles generated by the two technologies or to assess the impact of input DNA libraries in ChIP-seq analysis. Here, we performed a systematic analysis of a modENCODE dataset consisting of 31 pairs of ChIP-chip/ChIP-seq profiles of the coactivator CBP, RNA polymerase II (RNA PolII), and six histone modifications across four developmental stages of Drosophila melanogaster. Results Both technologies produce highly reproducible profiles within each platform, ChIP-seq generally produces profiles with a better signal-to-noise ratio, and allows detection of more peaks and narrower peaks. The set of peaks identified by the two technologies can be significantly different, but the extent to which they differ varies depending on the factor and the analysis algorithm. Importantly, we found that there is a significant variation among multiple sequencing profiles of input DNA libraries and that this variation most likely arises from both differences in experimental condition and sequencing depth. We further show that using an inappropriate input DNA profile can impact the average signal profiles around genomic features and peak calling results, highlighting the importance of having high quality input DNA data for normalization in ChIP-seq analysis. Conclusions Our findings highlight the biases present in each of the platforms, show the variability that can arise from both technology and analysis methods, and emphasize the importance of obtaining high quality and deeply sequenced input DNA libraries for ChIP-seq analysis. PMID:21356108
Shiue, Yow-Ling; Chen, Lih-Ren; Chen, Chih-Feng; Chen, Yi-Ling; Ju, Jhy-Phen; Chao, Ching-Hsien; Lin, Yuan-Ping; Kuo, Yu-Ming; Tang, Pin-Chi; Lee, Yen-Pai
2006-09-15
To identify transcripts related to high egg production expressed specifically in the hypothalamus and pituitary gland of the chicken, two subtracted cDNA libraries were constructed. Two divergently selected strains of Taiwan Country Chickens (TCCs), B (sire line) and L2 (dam line) were used; they had originated from a single population and were further subjected (since 1982) to selection for egg production to 40 wk of age and body weight/comb size, respectively. A total of 324 and 370 clones were identified from the L2-B (L2-subtract-B) and the B-L2 subtracted cDNA libraries, respectively. After sequencing and annotation, 175 and 136 transcripts that represented 53 known and 65 unknown non-redundant sequences were characterized in the L2-B subtracted cDNA library. Quantitative reverse-transcription (RT)-PCR was used to screen the mRNA expression levels of 32 randomly selected transcripts in another 78 laying hens from five different strains. These strains included the two original strains (B and L2) used to construct the subtracted cDNA libraries and an additional three commercial strains, i.e., Black- and Red-feather TCCs and Single-Comb White Leghorn (WL) layer. The mRNA expression levels of 16 transcripts were significantly higher in the L2 than in the B strain, whereas the mRNA expression levels of nine transcripts, BDH, NCAM1, PCDHA@, PGDS, PLAG1, PRL, SAR1A, SCG2 and STMN2, were significantly higher in two high egg production strains, L2 and Single-Comb WL; this indicated their usefulness as molecular markers of high egg production.
Zhou, Peilan; Jiang, Jiebing; Dong, Zhaoqi; Yan, Hui; You, Zhendong; Su, Ruibin; Gong, Zehui
2015-12-15
Opioid addiction is associated with long-term adaptive changes in the brain that involve protein expression. The carboxyl-terminal of the μ opioid receptor (MOR-C) is important for receptor signal transduction under opioid treatment. However, the proteins that interact with MOR-C after chronic morphine exposure remain unknown. The brain cDNA library of chronic morphine treatment rats was screened using rat MOR-C to investigate the regulator of opioids dependence in the present study. The brain cDNA library from chronic morphine-dependent rats was constructed using the SMART (Switching Mechanism At 5' end of RNA Transcript) technique. Bacterial two-hybrid system was used to screening the rat MOR-C interacting proteins from the cDNA library. RT-qPCR and immunoblotting were used to determine the variation of MOR-C interacting proteins in rat brain after chronic morphine treatment. Column overlay assays, immunocytochemistry and coimmunoprecipitation were used to demonstrate the interaction of MOR-C and p75NTR-associated cell death executor (NADE). 21 positive proteins, including 19 known proteins were screened to interact with rat MOR-C. Expression of several of these proteins was altered in specific rat brain regions after chronic morphine treatment. Among these proteins, NADE was confirmed to interact with rat MOR-C by in vitro protein-protein binding and coimmunoprecipitation in Chinese hamster ovary (CHO) cells and rat brain with or without chronic morphine treatment. Understanding the rat MOR-C interacting proteins and the proteins variation under chronic morphine treatment may be critical for determining the pathophysiological basis of opioid tolerance and addiction. Copyright © 2015. Published by Elsevier Inc.
Loudig, Olivier; Liu, Christina; Rohan, Thomas; Ben-Dov, Iddo Z
2018-05-05
-Archived, clinically classified formalin-fixed paraffin-embedded (FFPE) tissues can provide nucleic acids for retrospective molecular studies of cancer development. By using non-invasive or pre-malignant lesions from patients who later develop invasive disease, gene expression analyses may help identify early molecular alterations that predispose to cancer risk. It has been well described that nucleic acids recovered from FFPE tissues have undergone severe physical damage and chemical modifications, which make their analysis difficult and generally requires adapted assays. MicroRNAs (miRNAs), however, which represent a small class of RNA molecules spanning only up to ~18-24 nucleotides, have been shown to withstand long-term storage and have been successfully analyzed in FFPE samples. Here we present a 3' barcoded complementary DNA (cDNA) library preparation protocol specifically optimized for the analysis of small RNAs extracted from archived tissues, which was recently demonstrated to be robust and highly reproducible when using archived clinical specimens stored for up to 35 years. This library preparation is well adapted to the multiplex analysis of compromised/degraded material where RNA samples (up to 18) are ligated with individual 3' barcoded adapters and then pooled together for subsequent enzymatic and biochemical preparations prior to analysis. All purifications are performed by polyacrylamide gel electrophoresis (PAGE), which allows size-specific selections and enrichments of barcoded small RNA species. This cDNA library preparation is well adapted to minute RNA inputs, as a pilot polymerase chain reaction (PCR) allows determination of a specific amplification cycle to produce optimal amounts of material for next-generation sequencing (NGS). This approach was optimized for the use of degraded FFPE RNA from specimens archived for up to 35 years and provides highly reproducible NGS data.
Huang, Renhua; Fang, Pete; Kay, Brian K
2012-09-01
Site-directed mutagenesis is routinely performed in protein engineering experiments. One method, termed Kunkel mutagenesis, is frequently used for constructing libraries of peptide or protein variants in M13 bacteriophage, followed by affinity selection of phage particles. To make this method more efficient, the following two modifications were introduced: culture was incubated at 25°C for phage replication, which yielded two- to sevenfold more single-stranded DNA template compared to growth at 37°C, and restriction endonuclease recognition sites were used to remove non-recombinants. With both of the improvements, we could construct primary libraries of high complexity and that were 99-100% recombinant. Finally, with a third modification to the standard protocol of Kunkel mutagenesis, two secondary (mutagenic) libraries of a fibronectin type III (FN3) monobody were constructed with DNA segments that were amplified by error-prone and asymmetric PCR. Two advantages of this modification are that it bypasses the lengthy steps of restriction enzyme digestion and ligation, and that the pool of phage clones, recovered after affinity selection, can be used directly to generate a secondary library. Screening one of the two mutagenic libraries yielded variants that bound two- to fourfold tighter to human Pak1 kinase than the starting clone. The protocols described in this study should accelerate the discovery of phage-displayed recombinant affinity reagents. Copyright © 2012 Elsevier Inc. All rights reserved.
Lin, Xiaodong; Deng, Jiankang; Lyu, Yanlong; Qian, Pengcheng; Li, Yunfei
2018-01-01
The integration of multiple DNA logic gates on a universal platform to implement advance logic functions is a critical challenge for DNA computing. Herein, a straightforward and powerful strategy in which a guanine-rich DNA sequence lighting up a silver nanocluster and fluorophore was developed to construct a library of logic gates on a simple DNA-templated silver nanoclusters (DNA-AgNCs) platform. This library included basic logic gates, YES, AND, OR, INHIBIT, and XOR, which were further integrated into complex logic circuits to implement diverse advanced arithmetic/non-arithmetic functions including half-adder, half-subtractor, multiplexer, and demultiplexer. Under UV irradiation, all the logic functions could be instantly visualized, confirming an excellent repeatability. The logic operations were entirely based on DNA hybridization in an enzyme-free and label-free condition, avoiding waste accumulation and reducing cost consumption. Interestingly, a DNA-AgNCs-based multiplexer was, for the first time, used as an intelligent biosensor to identify pathogenic genes, E. coli and S. aureus genes, with a high sensitivity. The investigation provides a prototype for the wireless integration of multiple devices on even the simplest single-strand DNA platform to perform diverse complex functions in a straightforward and cost-effective way. PMID:29675221
PTools: an opensource molecular docking library
Saladin, Adrien; Fiorucci, Sébastien; Poulain, Pierre; Prévost, Chantal; Zacharias, Martin
2009-01-01
Background Macromolecular docking is a challenging field of bioinformatics. Developing new algorithms is a slow process generally involving routine tasks that should be found in a robust library and not programmed from scratch for every new software application. Results We present an object-oriented Python/C++ library to help the development of new docking methods. This library contains low-level routines like PDB-format manipulation functions as well as high-level tools for docking and analyzing results. We also illustrate the ease of use of this library with the detailed implementation of a 3-body docking procedure. Conclusion The PTools library can handle molecules at coarse-grained or atomic resolution and allows users to rapidly develop new software. The library is already in use for protein-protein and protein-DNA docking with the ATTRACT program and for simulation analysis. This library is freely available under the GNU GPL license, together with detailed documentation. PMID:19409097
PTools: an opensource molecular docking library.
Saladin, Adrien; Fiorucci, Sébastien; Poulain, Pierre; Prévost, Chantal; Zacharias, Martin
2009-05-01
Macromolecular docking is a challenging field of bioinformatics. Developing new algorithms is a slow process generally involving routine tasks that should be found in a robust library and not programmed from scratch for every new software application. We present an object-oriented Python/C++ library to help the development of new docking methods. This library contains low-level routines like PDB-format manipulation functions as well as high-level tools for docking and analyzing results. We also illustrate the ease of use of this library with the detailed implementation of a 3-body docking procedure. The PTools library can handle molecules at coarse-grained or atomic resolution and allows users to rapidly develop new software. The library is already in use for protein-protein and protein-DNA docking with the ATTRACT program and for simulation analysis. This library is freely available under the GNU GPL license, together with detailed documentation.
Novel transcripts of the estrogen receptor α gene in channel catfish
Patino, Reynaldo; Xia, Zhenfang; Gale, William L.; Wu, Chunfa; Maule, Alec G.; Chang, Xiaotian
2000-01-01
Complementary DNA libraries from liver and ovary of an immature female channel catfish were screened with a homologous ERα cDNA probe. The hepatic library yielded two new channel catfish ER cDNAs that encode N-terminal ERα variants of different sizes. Relative to the catfish ERα (medium size; 581 residues) previously reported, these new cDNAs encode Long-ERα (36 residues longer) and Short-ERα (389 residues shorter). The 5′-end of Long-ERα cDNA is identical to that of Medium-ERα but has an additional 503-bp segment with an upstream, in-frame translation-start codon. Recombinant Long-ERα binds estrogen with high affinity (Kd = 3.4 nM), similar to that previously reported for Medium-ERα but lower than reported for catfish ERβ. Short-ERα cDNA encodes a protein that lacks most of the receptor protein and does not bind estrogen. Northern hybridization confirmed the existence of multiple hepatic ERα RNAs that include the size range of the ERα cDNAs obtained from the libraries as well as additional sizes. Using primers for RT-PCR that target locations internal to the protein-coding sequence, we also established the presence of several ERα cDNA variants with in-frame insertions in the ligand-binding and DNA-binding domains and in-frame or out-of-frame deletions in the ligand-binding domain. These internal variants showed patterns of expression that differed between the ovary and liver. Further, the ovarian library yielded a full-length, ERα antisense cDNA containing a poly(A) signal and tail. A limited survey of histological preparations from juvenile catfish by in situ hybridization using directionally synthesized cRNA probes also suggested the expression of ERα antisense RNA in a tissue-specific manner. In conclusion, channel catfish seemingly have three broad classes of ERα mRNA variants: those encoding N-terminal truncated variants, those encoding internal variants (including C-terminal truncated variants), and antisense mRNA. The sense variants may encode functional ERα or related proteins that modulate ERα or ERβ activity. The existence of ER antisense mRNA is reported in this study for the first time. Its role may be to participate in the regulation of ER gene expression.
ERIC Educational Resources Information Center
Galewsky, Samuel
2000-01-01
Introduces a series of molecular genetics laboratories where students pick a single colony from a Drosophila melanogester embryo cDNA library and purify the plasmid, then analyze the insert through restriction digests and gel electrophoresis. (Author/YDS)
DNA barcodes for bio-surveillance: regulated and economically important arthropod plant pests.
Ashfaq, Muhammad; Hebert, Paul D N
2016-11-01
Many of the arthropod species that are important pests of agriculture and forestry are impossible to discriminate morphologically throughout all of their life stages. Some cannot be differentiated at any life stage. Over the past decade, DNA barcoding has gained increasing adoption as a tool to both identify known species and to reveal cryptic taxa. Although there has not been a focused effort to develop a barcode library for them, reference sequences are now available for 77% of the 409 species of arthropods documented on major pest databases. Aside from developing the reference library needed to guide specimen identifications, past barcode studies have revealed that a significant fraction of arthropod pests are a complex of allied taxa. Because of their importance as pests and disease vectors impacting global agriculture and forestry, DNA barcode results on these arthropods have significant implications for quarantine detection, regulation, and management. The current review discusses these implications in light of the presence of cryptic species in plant pests exposed by DNA barcoding.
Yung, Pui Yi; Burke, Catherine; Lewis, Matt; Egan, Suhelen; Kjelleberg, Staffan; Thomas, Torsten
2009-01-01
Metagenomics provides access to the uncultured majority of the microbial world. The approaches employed in this field have, however, had limited success in linking functional genes to the taxonomic or phylogenetic origin of the organism they belong to. Here we present an efficient strategy to recover environmental DNA fragments that contain phylogenetic marker genes from metagenomic libraries. Our method involves the cleavage of 23S ribsosmal RNA (rRNA) genes within pooled library clones by the homing endonuclease I-CeuI followed by the insertion and selection of an antibiotic resistance cassette. This approach was applied to screen a library of 6500 fosmid clones derived from the microbial community associated with the sponge Cymbastela concentrica. Several fosmid clones were recovered after the screen and detailed phylogenetic and taxonomic assignment based on the rRNA gene showed that they belong to previously unknown organisms. In addition, compositional features of these fosmid clones were used to classify and taxonomically assign a dataset of environmental shotgun sequences. Our approach represents a valuable tool for the analysis of rapidly increasing, environmental DNA sequencing information. PMID:19767618
Dean, Frank B.; Nelson, John R.; Giesler, Theresa L.; Lasken, Roger S.
2001-01-01
We describe a simple method of using rolling circle amplification to amplify vector DNA such as M13 or plasmid DNA from single colonies or plaques. Using random primers and φ29 DNA polymerase, circular DNA templates can be amplified 10,000-fold in a few hours. This procedure removes the need for lengthy growth periods and traditional DNA isolation methods. Reaction products can be used directly for DNA sequencing after phosphatase treatment to inactivate unincorporated nucleotides. Amplified products can also be used for in vitro cloning, library construction, and other molecular biology applications. PMID:11381035
Paramyosin from the parasitic mite Sarcoptes scabiei: cDNA cloning and heterologous expression.
Mattsson, J G; Ljunggren, E L; Bergström, K
2001-05-01
The burrowing mite Sarcoptes scabiei is the causative agent of the highly contagious disease sarcoptic mange or scabies. So far, there is no in vitro propagation system for S. scabiei available, and mites used for various purposes must be isolated from infected hosts. Lack of parasite-derived material has limited the possibilities to study several aspects of scabies, including pathogenesis and immunity. It has also hampered the development of high performance serological assays. We have now constructed an S. scabiei cDNA expression library with mRNA purified from mites isolated from red foxes. Immunoscreening of the library enabled us to clone a full-length cDNA coding for a 102.5 kDa protein. Sequence similarity searches identified the protein as a paramyosin. Recombinant S. scabiei paramyosin expressed in Escherichia coli was recognized by sera from dogs and swine infected with S. scabiei. We also designed a small paramyosin construct of about 17 kDa that included the N-terminal part, an evolutionary variable part of the helical core, and the C-terminal part of the molecule. The miniaturized protein was efficiently expressed in E. coli and was recognized by sera from immunized rabbits. These data demonstrate that the cDNA library can assist in the isolation of important S. scabiei antigens and that recombinant proteins can be useful for the study of scabies.
Bricheux, G; Brugerolle, G
1997-08-01
The parasitic protozoan Trichomonas vaginalis is known to contain the ubiquitous and highly conserved protein actin. A genomic library and a cDNA library have been screened to identify and clone the actin gene(s) of T. vaginalis. The nucleotide sequence of one gene and its flanking regions have been determined. The open reading frame encodes a protein of 376 amino acids. The sequence is not interrupted by any introns and the promoter could be represented by a 10 bp motif close to a consensus motif also found upstream of most sequenced T. vaginalis genes. The five different clones isolated from the cDNA library have similar sequences and encode three actin proteins differing only by one or two amino acids. A phylogenetic analysis of 31 actin sequences by distance matrix and parsimony methods, using centractin as outgroup, gives congruent trees with Parabasala branching above Diplomonadida.
Designing oligo libraries taking alternative splicing into account
NASA Astrophysics Data System (ADS)
Shoshan, Avi; Grebinskiy, Vladimir; Magen, Avner; Scolnicov, Ariel; Fink, Eyal; Lehavi, David; Wasserman, Alon
2001-06-01
We have designed sequences for DNA microarrays and oligo libraries, taking alternative splicing into account. Alternative splicing is a common phenomenon, occurring in more than 25% of the human genes. In many cases, different splice variants have different functions, are expressed in different tissues or may indicate different stages of disease. When designing sequences for DNA microarrays or oligo libraries, it is very important to take into account the sequence information of all the mRNA transcripts. Therefore, when a gene has more than one transcript (as a result of alternative splicing, alternative promoter sites or alternative poly-adenylation sites), it is very important to take all of them into account in the design. We have used the LEADS transcriptome prediction system to cluster and assemble the human sequences in GenBank and design optimal oligonucleotides for all the human genes with a known mRNA sequence based on the LEADS predictions.
ESTminer: a Web interface for mining EST contig and cluster databases.
Huang, Yecheng; Pumphrey, Janie; Gingle, Alan R
2005-03-01
ESTminer is a Web application and database schema for interactive mining of expressed sequence tag (EST) contig and cluster datasets. The Web interface contains a query frame that allows the selection of contigs/clusters with specific cDNA library makeup or a threshold number of members. The results are displayed as color-coded tree nodes, where the color indicates the fractional size of each cDNA library component. The nodes are expandable, revealing library statistics as well as EST or contig members, with links to sequence data, GenBank records or user configurable links. Also, the interface allows 'queries within queries' where the result set of a query is further filtered by the subsequent query. ESTminer is implemented in Java/JSP and the package, including MySQL and Oracle schema creation scripts, is available from http://cggc.agtec.uga.edu/Data/download.asp agingle@uga.edu.
Comparative genome map of human and cattle
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solinas-Toldo, S.; Fries, R.; Lengauer, C.
Chromosomal homologies between individual human chromosomes and the bovine karyotype have been established by using a new approach termed Zoo-FISH. Labeled DNA libraries from flow-sorted human chromosomes were used as probes for fluorescence in situ hybridization on cattle chromosomes. All human DNA libraries, except the Y chromosome library, hybridized to one or more cattle chromosomes, identifying and delineating 50 segments of homology, most of them corresponding to the regions of homology as identified by the previous mapping of individual conserved loci. However, Zoo-FISH refines the comparative maps constructed by molecular gene mapping of individual loci by providing information on themore » boundaries of conserved regions in the absence of obvious cytogenetic homologies of human and bovine chromosomes. It allows study of karyotypic evolution and opens new avenues for genomic analysis by facilitating the extrapolation of results from the human genome initiative. 50 refs., 3 figs., 1 tab.« less
Xiao, Yongli; Sheng, Zong-Mei; Taubenberger, Jeffery K.
2015-01-01
The vast majority of surgical biopsy and post-mortem tissue samples are formalin-fixed and paraffin-embedded (FFPE), but this process leads to RNA degradation that limits gene expression analysis. As an example, the viral RNA genome of the 1918 pandemic influenza A virus was previously determined in a 9-year effort by overlapping RT-PCR from post-mortem samples. Using the protocols described here, the full genome of the 1918 virus at high coverage was determined in one high-throughput sequencing run of a cDNA library derived from total RNA of a 1918 FFPE sample after duplex-specific nuclease treatments. This basic methodological approach should assist in the analysis of FFPE tissue samples isolated over the past century from a variety of infectious diseases. PMID:26344216
DNA Probe for Lactobacillus delbrueckii
Delley, Michèle; Mollet, Beat; Hottinger, Herbert
1990-01-01
From a genomic DNA library of Lactobacillus delbrueckii subsp. bulgaricus, a clone was isolated which complements a leucine auxotrophy of an Escherichia coli strain (GE891). Subsequent analysis of the clone indicated that it could serve as a specific DNA probe. Dot-blot hybridizations with over 40 different Lactobacillus strains showed that this clone specifically recognizes L. delbrueckii subsp. delbrueckii, bulgaricus, and lactis. The sensitivity of the method was tested by using an α-32P-labeled DNA probe. Images PMID:16348233
Selection and Screening of DNA Aptamers for Inorganic Nanomaterials.
Zhou, Yibo; Huang, Zhicheng; Yang, Ronghua; Liu, Juewen
2018-02-21
Searching for DNA sequences that can strongly and selectively bind to inorganic surfaces is a long-standing topic in bionanotechnology, analytical chemistry and biointerface research. This can be achieved either by aptamer selection starting with a very large library of ≈10 14 random DNA sequences, or by careful screening of a much smaller library (usually from a few to a few hundred) with rationally designed sequences. Unlike typical molecular targets, inorganic surfaces often have quite strong DNA adsorption affinities due to polyvalent binding and even chemical interactions. This leads to a very high background binding making aptamer selection difficult. Screening, on the other hand, can be designed to compare relative binding affinities of different DNA sequences and could be more appropriate for inorganic surfaces. The resulting sequences have been used for DNA-directed assembly, sorting of carbon nanotubes, and DNA-controlled growth of inorganic nanomaterials. It was recently discovered that poly-cytosine (C) DNA can strongly bind to a diverse range of nanomaterials including nanocarbons (graphene oxide and carbon nanotubes), various metal oxides and transition-metal dichalcogenides. In this Concept article, we articulate the need for screening and potential artifacts associated with traditional aptamer selection methods for inorganic surfaces. Representative examples of application are discussed, and a few future research opportunities are proposed towards the end of this article. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genome-Wide Profiling of RNA–Protein Interactions Using CLIP-Seq
Stork, Cheryl; Zheng, Sika
2017-01-01
UV crosslinking immunoprecipitation (CLIP) is an increasingly popular technique to study protein–RNA interactions in tissues and cells. Whole cells or tissues are ultraviolet irradiated to generate a covalent bond between RNA and proteins that are in close contact. After partial RNase digestion, antibodies specific to an RNA binding protein (RBP) or a protein–epitope tag is then used to immunoprecipitate the protein–RNA complexes. After stringent washing and gel separation the RBP–RNA complex is excised. The RBP is protease digested to allow purification of the bound RNA. Reverse transcription of the RNA followed by high-throughput sequencing of the cDNA library is now often used to identify protein bound RNA on a genome-wide scale. UV irradiation can result in cDNA truncations and/or mutations at the crosslink sites, which complicates the alignment of the sequencing library to the reference genome and the identification of the crosslinking sites. Meanwhile, one or more amino acids of a crosslinked RBP can remain attached to its bound RNA due to incomplete digestion of the protein. As a result, reverse transcriptase may not read through the crosslink sites, and produce cDNA ending at the crosslinked nucleotide. This is harnessed by one variant of CLIP methods to identify crosslinking sites at a nucleotide resolution. This method, individual nucleotide resolution CLIP (iCLIP) circularizes cDNA to capture the truncated cDNA and also increases the efficiency of ligating sequencing adapters to the library. Here, we describe the detailed procedure of iCLIP. PMID:26965263
One-step random mutagenesis by error-prone rolling circle amplification
Fujii, Ryota; Kitaoka, Motomitsu; Hayashi, Kiyoshi
2004-01-01
In vitro random mutagenesis is a powerful tool for altering properties of enzymes. We describe here a novel random mutagenesis method using rolling circle amplification, named error-prone RCA. This method consists of only one DNA amplification step followed by transformation of the host strain, without treatment with any restriction enzymes or DNA ligases, and results in a randomly mutated plasmid library with 3–4 mutations per kilobase. Specific primers or special equipment, such as a thermal-cycler, are not required. This method permits rapid preparation of randomly mutated plasmid libraries, enabling random mutagenesis to become a more commonly used technique. PMID:15507684
Lu, Emily; Elizondo-Riojas, Miguel-Angel; Chang, Jeffrey T; Volk, David E
2014-06-10
Next-generation sequencing results from bead-based aptamer libraries have demonstrated that traditional DNA/RNA alignment software is insufficient. This is particularly true for X-aptamers containing specialty bases (W, X, Y, Z, ...) that are identified by special encoding. Thus, we sought an automated program that uses the inherent design scheme of bead-based X-aptamers to create a hypothetical reference library and Markov modeling techniques to provide improved alignments. Aptaligner provides this feature as well as length error and noise level cutoff features, is parallelized to run on multiple central processing units (cores), and sorts sequences from a single chip into projects and subprojects.
Blakskjaer, Peter; Heitner, Tara; Hansen, Nils Jakob Vest
2015-06-01
DNA-encoded small-molecule library (DEL) technology allows vast drug-like small molecule libraries to be efficiently synthesized in a combinatorial fashion and screened in a single tube method for binding, with an assay readout empowered by advances in next generation sequencing technology. This approach has increasingly been applied as a viable technology for the identification of small-molecule modulators to protein targets and as precursors to drugs in the past decade. Several strategies for producing and for screening DELs have been devised by both academic and industrial institutions. This review highlights some of the most significant and recent strategies along with important results. A special focus on the production of high fidelity DEL technologies with the ability to eliminate screening noise and false positives is included: using a DNA junction called the Yoctoreactor, building blocks (BBs) are spatially confined at the center of the junction facilitating both the chemical reaction between BBs and encoding of the synthetic route. A screening method, known as binder trap enrichment, permits DELs to be screened robustly in a homogeneous manner delivering clean data sets and potent hits for even the most challenging targets. Copyright © 2015 Elsevier Ltd. All rights reserved.
Huemer, Peter; Mutanen, Marko; Sefc, Kristina M; Hebert, Paul D N
2014-01-01
This study examines the performance of DNA barcodes (mt cytochrome c oxidase 1 gene) in the identification of 1004 species of Lepidoptera shared by two localities (Finland, Austria) that are 1600 km apart. Maximum intraspecific distances for the pooled data were less than 2% for 880 species (87.6%), while deeper divergence was detected in 124 species. Despite such variation, the overall DNA barcode library possessed diagnostic COI sequences for 98.8% of the taxa. Because a reference library based on Finnish specimens was highly effective in identifying specimens from Austria, we conclude that barcode libraries based on regional sampling can often be effective for a much larger area. Moreover, dispersal ability (poor, good) and distribution patterns (disjunct, fragmented, continuous, migratory) had little impact on levels of intraspecific geographic divergence. Furthermore, the present study revealed that, despite the intensity of past taxonomic work on European Lepidoptera, nearly 20% of the species shared by Austria and Finland require further work to clarify their status. Particularly discordant BIN (Barcode Index Number) cases should be checked to ascertain possible explanatory factors such as incorrect taxonomy, hybridization, introgression, and Wolbachia infections.
Developing a Bacteroides System for Function-Based Screening of DNA from the Human Gut Microbiome.
Lam, Kathy N; Martens, Eric C; Charles, Trevor C
2018-01-01
Functional metagenomics is a powerful method that allows the isolation of genes whose role may not have been predicted from DNA sequence. In this approach, first, environmental DNA is cloned to generate metagenomic libraries that are maintained in Escherichia coli, and second, the cloned DNA is screened for activities of interest. Typically, functional screens are carried out using E. coli as a surrogate host, although there likely exist barriers to gene expression, such as lack of recognition of native promoters. Here, we describe efforts to develop Bacteroides thetaiotaomicron as a surrogate host for screening metagenomic DNA from the human gut. We construct a B. thetaiotaomicron-compatible fosmid cloning vector, generate a fosmid clone library using DNA from the human gut, and show successful functional complementation of a B. thetaiotaomicron glycan utilization mutant. Though we were unable to retrieve the physical fosmid after complementation, we used genome sequencing to identify the complementing genes derived from the human gut microbiome. Our results demonstrate that the use of B. thetaiotaomicron to express metagenomic DNA is promising, but they also exemplify the challenges that can be encountered in the development of new surrogate hosts for functional screening. IMPORTANCE Human gut microbiome research has been supported by advances in DNA sequencing that make it possible to obtain gigabases of sequence data from metagenomes but is limited by a lack of knowledge of gene function that leads to incomplete annotation of these data sets. There is a need for the development of methods that can provide experimental data regarding microbial gene function. Functional metagenomics is one such method, but functional screens are often carried out using hosts that may not be able to express the bulk of the environmental DNA being screened. We expand the range of current screening hosts and demonstrate that human gut-derived metagenomic libraries can be introduced into the gut microbe Bacteroides thetaiotaomicron to identify genes based on activity screening. Our results support the continuing development of genetically tractable systems to obtain information about gene function.
Potential for DNA-based ID of Great Lakes fauna: Species inventories vs. barcode libraries
DNA-based identification of mixed-organism samples offers the potential to greatly reduce the need for resource-intensive morphological identification, which would be of value both to biotic condition assessment and non-native species early-detection monitoring. However the abil...
Great Lakes DNA barcode reference library: Mollusca, annelida, and minor phyla
In recent years, the research and development of DNA-based tools has improved both their sensitivity and costs. This technology has the potential to be useful in the early detection of aquatic invasive species, and can increase the scope of surveillance compared with traditional ...
Genetic Regulation in the Aiptasia pallida Symbiosis - Performance Report, Year 1.
1997-02-01
and symbiotic zooxanthellae is one developed for serial analysis of gene expression (SAGE). We initially tested the SAGE protocol with cDNA generated...technically difficult. We are now focusing on constructing representative cDNA libraries from cultured and symbiotic zooxanthellae and will sequence
Videvall, Elin; Strandh, Maria; Engelbrecht, Anel; Cloete, Schalk; Cornwallis, Charlie K
2017-01-01
The gut microbiome of animals is emerging as an important factor influencing ecological and evolutionary processes. A major bottleneck in obtaining microbiome data from large numbers of samples is the time-consuming laboratory procedures required, specifically the isolation of DNA and generation of amplicon libraries. Recently, direct PCR kits have been developed that circumvent conventional DNA extraction steps, thereby streamlining the laboratory process by reducing preparation time and costs. However, the reliability and efficacy of direct PCR for measuring host microbiomes have not yet been investigated other than in humans with 454 sequencing. Here, we conduct a comprehensive evaluation of the microbial communities obtained with direct PCR and the widely used Mo Bio PowerSoil DNA extraction kit in five distinct gut sample types (ileum, cecum, colon, feces, and cloaca) from 20 juvenile ostriches, using 16S rRNA Illumina MiSeq sequencing. We found that direct PCR was highly comparable over a range of measures to the DNA extraction method in cecal, colon, and fecal samples. However, the two methods significantly differed in samples with comparably low bacterial biomass: cloacal and especially ileal samples. We also sequenced 100 replicate sample pairs to evaluate repeatability during both extraction and PCR stages and found that both methods were highly consistent for cecal, colon, and fecal samples ( r s > 0.7) but had low repeatability for cloacal ( r s = 0.39) and ileal ( r s = -0.24) samples. This study indicates that direct PCR provides a fast, cheap, and reliable alternative to conventional DNA extraction methods for retrieving 16S rRNA data, which can aid future gut microbiome studies. IMPORTANCE The microbial communities of animals can have large impacts on their hosts, and the number of studies using high-throughput sequencing to measure gut microbiomes is rapidly increasing. However, the library preparation procedure in microbiome research is both costly and time-consuming, especially for large numbers of samples. We investigated a cheaper and faster direct PCR method designed to bypass the DNA isolation steps during 16S rRNA library preparation and compared it with a standard DNA extraction method. We used both techniques on five different gut sample types collected from 20 juvenile ostriches and sequenced samples with Illumina MiSeq. The methods were highly comparable and highly repeatable in three sample types with high microbial biomass (cecum, colon, and feces), but larger differences and low repeatability were found in the microbiomes obtained from the ileum and cloaca. These results will help microbiome researchers assess library preparation procedures and plan their studies accordingly.
Olova, Nelly; Krueger, Felix; Andrews, Simon; Oxley, David; Berrens, Rebecca V; Branco, Miguel R; Reik, Wolf
2018-03-15
Whole-genome bisulfite sequencing (WGBS) is becoming an increasingly accessible technique, used widely for both fundamental and disease-oriented research. Library preparation methods benefit from a variety of available kits, polymerases and bisulfite conversion protocols. Although some steps in the procedure, such as PCR amplification, are known to introduce biases, a systematic evaluation of biases in WGBS strategies is missing. We perform a comparative analysis of several commonly used pre- and post-bisulfite WGBS library preparation protocols for their performance and quality of sequencing outputs. Our results show that bisulfite conversion per se is the main trigger of pronounced sequencing biases, and PCR amplification builds on these underlying artefacts. The majority of standard library preparation methods yield a significantly biased sequence output and overestimate global methylation. Importantly, both absolute and relative methylation levels at specific genomic regions vary substantially between methods, with clear implications for DNA methylation studies. We show that amplification-free library preparation is the least biased approach for WGBS. In protocols with amplification, the choice of bisulfite conversion protocol or polymerase can significantly minimize artefacts. To aid with the quality assessment of existing WGBS datasets, we have integrated a bias diagnostic tool in the Bismark package and offer several approaches for consideration during the preparation and analysis of WGBS datasets.
A strategy for rapid production and screening of yeast artificial chromosome libraries.
Strauss, W M; Jaenisch, E; Jaenisch, R
1992-01-01
We describe methods for rapid production and screening of yeast artificial chromosome (YAC) libraries. Utilizing complete restriction digests of mouse genomic DNA for ligations in agarose, a 32,000-clone library was produced and screened in seven weeks. Screening was accomplished by subdividing primary transformation plates into pools of approximately 100 clones which were transferred into a master glycerol stock. These master stocks were used to inoculate liquid cultures to produce culture "pools," and ten pools of 100 clones were then combined to yield superpools of 1,000 clones. Both pool and superpool DNA was screened by polymerase chain reaction (PCR) and positive pools representing 100 clones were then plated on selective medium and screened by in situ hybridization. Screening by the two tiered PCR assay and by in situ hybridization was completed in 4-5 days. Utilizing this methodology we have isolated a 150 kb clone spanning the alpha 1(I) collagen (Col1a1) gene as well as 40 kb clones from the Hox-2 locus. To characterize the representation of the YAC library, the size distribution of genomic Sal I fragments was compared to that of clones picked at random from the library. The results demonstrate significant biasing of the cloned fragment distribution, resulting in a loss of representation for larger fragments.
Preparation of BAC libraries from marine microbial populations.
Sabehi, Gazalah; Béjà, Oded
2013-01-01
A protocol is presented here for the construction of BAC (bacterial artificial chromosome) libraries from planktonic microbial communities collected in marine environments. The protocol describes the collection and preparation of the planktonic microbial cells, high molecular weight DNA purification from those cells, the preparation of the BAC vector, and the special ligation and electrotransformation procedures required for successful library preparation. With small modifications, this protocol can be applied to microbes collected from other environments. © 2013 Elsevier Inc. All rights reserved.
Brightwell, Gale; Boerema, Jackie; Mills, John; Mowat, Eilidh; Pulford, David
2006-05-25
We examined the bacterial community present on an Intralox conveyor belt system in an operating lamb boning room by sequencing the 16S ribosomal DNA (rDNA) of bacteria extracted in the presence or absence of cultivation. RFLP patterns for 16S rDNA clone library and cultures were generated using HaeIII and MspI restriction endonucleases. 16S rDNA amplicons produced 8 distinct RFLP pattern groups. RFLP groups I-IV were represented in the clone library and RFLP groups I and V-VIII were represented amongst the cultured isolates. Partial DNA sequences from each RFLP group revealed that all group I, II and VIII representatives were Pseudomonas spp., group III were Sphingomonas spp., group IV clones were most similar to an uncultured alpha proteobacterium, group V was similar to a Serratia spp., group VI with an Alcaligenes spp., and group VII with Microbacterium spp. Sphingomonads were numerically dominant in the culture-independent clone library and along with the group IV alpha proteobacterium were not represented amongst the cultured isolates. Serratia, Alcaligenes and Microbacterium spp. were only represented with cultured isolates. Pseudomonads were detected by both culture-dependent (84% of isolates) and culture-independent (12.5% of clones) methods and their presence at high frequency does pose the risk of product spoilage if transferred onto meat stored under aerobic conditions. The detection of sphingomonads in large numbers by the culture-independent method demands further analysis because sphingomonads may represent a new source of meat spoilage that has not been previously recognised in the meat processing environment. The 16S rDNA collections generated by both methods were important at representing the diversity of the bacterial population associated with an Intralox conveyor belt system.
Si, Zengzhi; Du, Bing; Huo, Jinxi; He, Shaozhen; Liu, Qingchang; Zhai, Hong
2016-11-21
Sweetpotato, Ipomoea batatas (L.) Lam., is an important food crop widely grown in the world. However, little is known about the genome of this species because it is a highly heterozygous hexaploid. Gaining a more in-depth knowledge of sweetpotato genome is therefore necessary and imperative. In this study, the first bacterial artificial chromosome (BAC) library of sweetpotato was constructed. Clones from the BAC library were end-sequenced and analyzed to provide genome-wide information about this species. The BAC library contained 240,384 clones with an average insert size of 101 kb and had a 7.93-10.82 × coverage of the genome, and the probability of isolating any single-copy DNA sequence from the library was more than 99%. Both ends of 8310 BAC clones randomly selected from the library were sequenced to generate 11,542 high-quality BAC-end sequences (BESs), with an accumulative length of 7,595,261 bp and an average length of 658 bp. Analysis of the BESs revealed that 12.17% of the sweetpotato genome were known repetitive DNA, including 7.37% long terminal repeat (LTR) retrotransposons, 1.15% Non-LTR retrotransposons and 1.42% Class II DNA transposons etc., 18.31% of the genome were identified as sweetpotato-unique repetitive DNA and 10.00% of the genome were predicted to be coding regions. In total, 3,846 simple sequences repeats (SSRs) were identified, with a density of one SSR per 1.93 kb, from which 288 SSRs primers were designed and tested for length polymorphism using 20 sweetpotato accessions, 173 (60.07%) of them produced polymorphic bands. Sweetpotato BESs had significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum than those of Vitis vinifera, Theobroma cacao and Arabidopsis thaliana. The first BAC library for sweetpotato has been successfully constructed. The high quality BESs provide first insights into sweetpotato genome composition, and have significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum. These resources as a robust platform will be used in high-resolution mapping, gene cloning, assembly of genome sequences, comparative genomics and evolution for sweetpotato.
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].
Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y
2017-08-01
To analyze and detect the whole genome sequence of human mitochondrial DNA (mtDNA) by Ion Torrent PGM™ platform and to study the differences of mtDNA sequence in different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costicartilage, nail, skeletal muscle and oral epithelium. Amplification of whole genome sequence of mtDNA was performed by 4 pairs of primer. Libraries were constructed with Ion Shear™ Plus Reagents kit and Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions on HVⅠ region. The whole genome sequence of mtDNA from all samples were amplified successfully. Six unrelated individuals belonged to 6 different haplotypes. Different tissues in one individual had heteroplasmy difference. The heteroplasmy positions and the mutation positions on HVⅠ region were verified by Sanger sequencing. After a consistency check by the Kappa method, it was found that the results of mtDNA sequence had a high consistency in different tissues. The testing method used in present study for sequencing the whole genome sequence of human mtDNA can detect the heteroplasmy difference in different tissues, which have good consistency. The results provide guidance for the further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine
Hurt, Jr., Richard A.; Robeson II, Michael S.; Shakya, Migun; ...
2014-07-14
Despite more than three decades of progress, efficient nucleic acid extraction from microbial communities has remained difficult, particularly from clay environments. Lysis with concentrated guanidine followed by concentrated sodium phosphate extraction supported DNA and RNA recovery from high iron, low humus content clay. Alterating the extraction pH or using other ionic solutions (Na 2SO 4 and NH 4H 2PO 4) yielded no detectable nucleic acid. DNA recovered using a lysis solution with 500 mM phosphate buffer (PB) followed by a 1 M PB wash was 15.22±2.33 g DNA/g clay, with most DNA consisting of >20 Kb fragments, compared to 2.46±0.25more » g DNA/g clay with the Powerlyzer soil DNA system (MoBio). Increasing [PB] in the lysis reagent coincided with increasing DNA fragment length. Rarefaction plots based on16S rRNA (V1/V3 region) pyrosequencing libraries from A-horizon and clay soils showed an ~80% and ~400% larger accessed diversity compared to a previous grinding protocol or the Powerlyzer soil DNA system, respectively. The observed diversity from the Firmicutes showed the strongest increase with >3-fold more bacterial species recovered using this system. Additionally, some OTU's having more than 100 sequences in these libraries were absent in samples extracted using the PowerLyzer reagents or the previous lysis method.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hurt, Jr., Richard A.; Robeson II, Michael S.; Shakya, Migun
Despite more than three decades of progress, efficient nucleic acid extraction from microbial communities has remained difficult, particularly from clay environments. Lysis with concentrated guanidine followed by concentrated sodium phosphate extraction supported DNA and RNA recovery from high iron, low humus content clay. Alterating the extraction pH or using other ionic solutions (Na 2SO 4 and NH 4H 2PO 4) yielded no detectable nucleic acid. DNA recovered using a lysis solution with 500 mM phosphate buffer (PB) followed by a 1 M PB wash was 15.22±2.33 g DNA/g clay, with most DNA consisting of >20 Kb fragments, compared to 2.46±0.25more » g DNA/g clay with the Powerlyzer soil DNA system (MoBio). Increasing [PB] in the lysis reagent coincided with increasing DNA fragment length. Rarefaction plots based on16S rRNA (V1/V3 region) pyrosequencing libraries from A-horizon and clay soils showed an ~80% and ~400% larger accessed diversity compared to a previous grinding protocol or the Powerlyzer soil DNA system, respectively. The observed diversity from the Firmicutes showed the strongest increase with >3-fold more bacterial species recovered using this system. Additionally, some OTU's having more than 100 sequences in these libraries were absent in samples extracted using the PowerLyzer reagents or the previous lysis method.« less
Shore, Sabrina; Henderson, Jordana M; Lebedev, Alexandre; Salcedo, Michelle P; Zon, Gerald; McCaffrey, Anton P; Paul, Natasha; Hogrefe, Richard I
2016-01-01
For most sample types, the automation of RNA and DNA sample preparation workflows enables high throughput next-generation sequencing (NGS) library preparation. Greater adoption of small RNA (sRNA) sequencing has been hindered by high sample input requirements and inherent ligation side products formed during library preparation. These side products, known as adapter dimer, are very similar in size to the tagged library. Most sRNA library preparation strategies thus employ a gel purification step to isolate tagged library from adapter dimer contaminants. At very low sample inputs, adapter dimer side products dominate the reaction and limit the sensitivity of this technique. Here we address the need for improved specificity of sRNA library preparation workflows with a novel library preparation approach that uses modified adapters to suppress adapter dimer formation. This workflow allows for lower sample inputs and elimination of the gel purification step, which in turn allows for an automatable sRNA library preparation protocol.
Reiterative Recombination for the in vivo assembly of libraries of multigene pathways.
Wingler, Laura M; Cornish, Virginia W
2011-09-13
The increasing sophistication of synthetic biology is creating a demand for robust, broadly accessible methodology for constructing multigene pathways inside of the cell. Due to the difficulty of rationally designing pathways that function as desired in vivo, there is a further need to assemble libraries of pathways in parallel, in order to facilitate the combinatorial optimization of performance. While some in vitro DNA assembly methods can theoretically make libraries of pathways, these techniques are resource intensive and inherently require additional techniques to move the DNA back into cells. All previously reported in vivo assembly techniques have been low yielding, generating only tens to hundreds of constructs at a time. Here, we develop "Reiterative Recombination," a robust method for building multigene pathways directly in the yeast chromosome. Due to its use of endonuclease-induced homologous recombination in conjunction with recyclable markers, Reiterative Recombination provides a highly efficient, technically simple strategy for sequentially assembling an indefinite number of DNA constructs at a defined locus. In this work, we describe the design and construction of the first Reiterative Recombination system in Saccharomyces cerevisiae, and we show that it can be used to assemble multigene constructs. We further demonstrate that Reiterative Recombination can construct large mock libraries of at least 10(4) biosynthetic pathways. We anticipate that our system's simplicity and high efficiency will make it a broadly accessible technology for pathway construction and render it a valuable tool for optimizing pathways in vivo.
Reiterative Recombination for the in vivo assembly of libraries of multigene pathways
Wingler, Laura M.; Cornish, Virginia W.
2011-01-01
The increasing sophistication of synthetic biology is creating a demand for robust, broadly accessible methodology for constructing multigene pathways inside of the cell. Due to the difficulty of rationally designing pathways that function as desired in vivo, there is a further need to assemble libraries of pathways in parallel, in order to facilitate the combinatorial optimization of performance. While some in vitro DNA assembly methods can theoretically make libraries of pathways, these techniques are resource intensive and inherently require additional techniques to move the DNA back into cells. All previously reported in vivo assembly techniques have been low yielding, generating only tens to hundreds of constructs at a time. Here, we develop “Reiterative Recombination,” a robust method for building multigene pathways directly in the yeast chromosome. Due to its use of endonuclease-induced homologous recombination in conjunction with recyclable markers, Reiterative Recombination provides a highly efficient, technically simple strategy for sequentially assembling an indefinite number of DNA constructs at a defined locus. In this work, we describe the design and construction of the first Reiterative Recombination system in Saccharomyces cerevisiae, and we show that it can be used to assemble multigene constructs. We further demonstrate that Reiterative Recombination can construct large mock libraries of at least 104 biosynthetic pathways. We anticipate that our system’s simplicity and high efficiency will make it a broadly accessible technology for pathway construction and render it a valuable tool for optimizing pathways in vivo. PMID:21876185
Orphan, V J; Taylor, L T; Hafenbradl, D; Delong, E F
2000-02-01
Recent investigations of oil reservoirs in a variety of locales have indicated that these habitats may harbor active thermophilic prokaryotic assemblages. In this study, we used both molecular and culture-based methods to characterize prokaryotic consortia associated with high-temperature, sulfur-rich oil reservoirs in California. Enrichment cultures designed for anaerobic thermophiles, both autotrophic and heterotrophic, were successful at temperatures ranging from 60 to 90 degrees C. Heterotrophic enrichments from all sites yielded sheathed rods (Thermotogales), pleomorphic rods resembling Thermoanaerobacter, and Thermococcus-like isolates. The predominant autotrophic microorganisms recovered from inorganic enrichments using H(2), acetate, and CO(2) as energy and carbon sources were methanogens, including isolates closely related to Methanobacterium, Methanococcus, and Methanoculleus species. Two 16S rRNA gene (rDNA) libraries were generated from total community DNA collected from production wellheads, using either archaeal or universal oligonucleotide primer sets. Sequence analysis of the universal library indicated that a large percentage of clones were highly similar to known bacterial and archaeal isolates recovered from similar habitats. Represented genera in rDNA clone libraries included Thermoanaerobacter, Thermococcus, Desulfothiovibrio, Aminobacterium, Acidaminococcus, Pseudomonas, Halomonas, Acinetobacter, Sphingomonas, Methylobacterium, and Desulfomicrobium. The archaeal library was dominated by methanogen-like rDNAs, with a lower percentage of clones belonging to the Thermococcales. Our results strongly support the hypothesis that sulfur-utilizing and methane-producing thermophilic microorganisms have a widespread distribution in oil reservoirs and the potential to actively participate in the biogeochemical transformation of carbon, hydrogen, and sulfur in situ.
2013-01-01
Background Olive cDNA libraries to isolate candidate genes that can help enlightening the molecular mechanism of periodicity and / or fruit production were constructed and analyzed. For this purpose, cDNA libraries from the leaves of trees in “on year” and in “off year” in July (when fruits start to appear) and in November (harvest time) were constructed. Randomly selected 100 positive clones from each library were analyzed with respect to sequence and size. A fruit-flesh cDNA library was also constructed and characterized to confirm the reliability of each library’s temporal and spatial properties. Results Quantitative real-time RT-PCR (qRT-PCR) analyses of the cDNA libraries confirmed cDNA molecules that are associated with different developmental stages (e. g. “on year” leaves in July, “off year” leaves in July, leaves in November) and fruits. Hence, a number of candidate cDNAs associated with “on year” and “off year” were isolated. Comparison of the detected cDNAs to the current EST database of GenBank along with other non - redundant databases of NCBI revealed homologs of previously described genes along with several unknown cDNAs. Of around 500 screened cDNAs, 48 cDNA elements were obtained after eliminating ribosomal RNA sequences. These independent transcripts were analyzed using BLAST searches (cutoff E-value of 1.0E-5) against the KEGG and GenBank nucleotide databases and 37 putative transcripts corresponding to known gene functions were annotated with gene names and Gene Ontology (GO) terms. Transcripts in the biological process were found to be related with metabolic process (27%), cellular process (23%), response to stimulus (17%), localization process (8.5%), multicellular organismal process (6.25%), developmental process (6.25%) and reproduction (4.2%). Conclusions A putative P450 monooxigenase expressed fivefold more in the “on year” than that of “off year” leaves in July. Two putative dehydrins expressed significantly more in “on year” leaves than that of “off year” leaves in November. Homologs of UDP – glucose epimerase, acyl - CoA binding protein, triose phosphate isomerase and a putative nuclear core anchor protein were significant in fruits only, while a homolog of an embryo binding protein / small GTPase regulator was detected in “on year” leaves only. One of the two unknown cDNAs was specific to leaves in July while the other was detected in all of the libraries except fruits. KEGG pathway analyses for the obtained sequences correlated with essential metabolisms such as galactose metabolism, amino sugar and nucleotide sugar metabolisms and photosynthesis. Detailed analysis of the results presents candidate cDNAs that can be used to dissect further the genetic basis of fruit production and / or alternate bearing which causes significant economical loss for olive growers. PMID:23552171
Grant, Susan; Grant, William D; Cowan, Don A; Jones, Brian E; Ma, Yanhe; Ventosa, Antonio; Heaphy, Shaun
2006-01-01
Here we describe the application of metagenomic technologies to construct cDNA libraries from RNA isolated from environmental samples. RNAlater (Ambion) was shown to stabilize RNA in environmental samples for periods of at least 3 months at -20 degrees C. Protocols for library construction were established on total RNA extracted from Acanthamoeba polyphaga trophozoites. The methodology was then used on algal mats from geothermal hot springs in Tengchong county, Yunnan Province, People's Republic of China, and activated sludge from a sewage treatment plant in Leicestershire, United Kingdom. The Tenchong libraries were dominated by RNA from prokaryotes, reflecting the mainly prokaryote microbial composition. The majority of these clones resulted from rRNA; only a few appeared to be derived from mRNA. In contrast, many clones from the activated sludge library had significant similarity to eukaryote mRNA-encoded protein sequences. A library was also made using polyadenylated RNA isolated from total RNA from activated sludge; many more clones in this library were related to eukaryotic mRNA sequences and proteins. Open reading frames (ORFs) up to 378 amino acids in size could be identified. Some resembled known proteins over their full length, e.g., 36% match to cystatin, 49% match to ribosomal protein L32, 63% match to ribosomal protein S16, 70% to CPC2 protein. The methodology described here permits the polyadenylated transcriptome to be isolated from environmental samples with no knowledge of the identity of the microorganisms in the sample or the necessity to culture them. It has many uses, including the identification of novel eukaryotic ORFs encoding proteins and enzymes.
USDA-ARS?s Scientific Manuscript database
Next generation sequencing (NGS) technology was used to analyze the occurrence of viruses in Sorghum almum plants in Florida exhibiting mosaic symptoms. Total RNA was extracted from symptomatic leaves and used as a template for cDNA library preparation. The resulting library was sequenced on an Illu...
USDA-ARS?s Scientific Manuscript database
Oocyte-specific genes play critical roles in oogenesis, folliculogenesis and early embryonic development. Through analysis of expressed sequence tags (ESTs) from a rainbow trout oocyte cDNA library, we identified a novel transcript which is represented by ESTs only from the oocyte library. The novel...
JICST Factual Database JICST DNA Database
NASA Astrophysics Data System (ADS)
Shirokizawa, Yoshiko; Abe, Atsushi
Japan Information Center of Science and Technology (JICST) has started the on-line service of DNA database in October 1988. This database is composed of EMBL Nucleotide Sequence Library and Genetic Sequence Data Bank. The authors outline the database system, data items and search commands. Examples of retrieval session are presented.
The spermatogenic cell-specific variant of glyceraldehyde 3-phosphate dehydrogenase (GAPDS) has been cloned from a rat testis cDNA library and its pattern of expression determined. A 1417 nucleotide cDNA has been found to encode an enzyme with substantial homology to mouse GAPDS...
Towards the construction of high-quality mutagenesis libraries.
Li, Heng; Li, Jing; Jin, Ruinan; Chen, Wei; Liang, Chaoning; Wu, Jieyuan; Jin, Jian-Ming; Tang, Shuang-Yan
2018-07-01
To improve the quality of mutagenesis libraries in directed evolution strategy. In the process of library transformation, transformants which have been shown to take up more than one plasmid might constitute more than 20% of the constructed library, thereby extensively impairing the quality of the library. We propose a practical transformation method to prevent the occurrence of multiple-plasmid transformants while maintaining high transformation efficiency. A visual library model containing plasmids expressing different fluorescent proteins was used. Multiple-plasmid transformants can be reduced through optimizing plasmid DNA amount used for transformation based on the positive correlation between the occurrence frequency of multiple-plasmid transformants and the logarithmic ratio of plasmid molecules to competent cells. This method provides a simple solution for a seemingly common but often neglected problem, and should be valuable for improving the quality of mutagenesis libraries to enhance the efficiency of directed evolution strategies.
T7 lytic phage-displayed peptide libraries: construction and diversity characterization.
Krumpe, Lauren R H; Mori, Toshiyuki
2014-01-01
In this chapter, we describe the construction of T7 bacteriophage (phage)-displayed peptide libraries and the diversity analyses of random amino acid sequences obtained from the libraries. We used commercially available reagents, Novagen's T7Select system, to construct the libraries. Using a combination of biotinylated extension primer and streptavidin-coupled magnetic beads, we were able to prepare library DNA without applying gel purification, resulting in extremely high ligation efficiencies. Further, we describe the use of bioinformatics tools to characterize library diversity. Amino acid frequency and positional amino acid diversity and hydropathy are estimated using the REceptor LIgand Contacts website http://relic.bio.anl.gov. Peptide net charge analysis and peptide hydropathy analysis are conducted using the Genetics Computer Group Wisconsin Package computational tools. A comprehensive collection of the estimated number of recombinants and titers of T7 phage-displayed peptide libraries constructed in our lab is included.
Bernstein, Steven L; Guo, Yan; Peterson, Katherine; Wistow, Graeme
2009-01-01
Background The optic nerve is a pure white matter central nervous system (CNS) tract with an isolated blood supply, and is widely used in physiological studies of white matter response to various insults. We examined the gene expression profile of human optic nerve (ON) and, through the NEIBANK online resource, to provide a resource of sequenced verified cDNA clones. An un-normalized cDNA library was constructed from pooled human ON tissues and was used in expressed sequence tag (EST) analysis. Location of an abundant oligodendrocyte marker was examined by immunofluorescence. Quantitative real time polymerase chain reaction (qRT-PCR) and Western analysis were used to compare levels of expression for key calcium channel protein genes and protein product in primate and rodent ON. Results Our analyses revealed a profile similar in many respects to other white matter related tissues, but significantly different from previously available ON cDNA libraries. The previous libraries were found to include specific markers for other eye tissues, suggesting contamination. Immune/inflammatory markers were abundant in the new ON library. The oligodendrocyte marker QKI was abundant at the EST level. Immunofluorescence revealed that this protein is a useful oligodendrocyte cell-type marker in rodent and primate ONs. L-type calcium channel EST abundance was found to be particularly low. A qRT-PCR-based comparative mammalian species analysis reveals that L-type calcium channel expression levels are significantly lower in primate than in rodent ON, which may help account for the class-specific difference in responsiveness to calcium channel blocking agents. Several known eye disease genes are abundantly expressed in ON. Many genes associated with normal axonal function, mRNAs associated with axonal transport, inflammation and neuroprotection are observed. Conclusion We conclude that the new cDNA library is a faithful representation of human ON and EST data provide an initial overview of gene expression patterns in this tissue. The data provide clues for tissue-specific and species-specific properties of human ON that will help in design of therapeutic models. PMID:19778450
Michlewski, Gracjan; Finnegan, David J.; Elfick, Alistair; Rosser, Susan J.
2017-01-01
Abstract Delivery of DNA to cells and its subsequent integration into the host genome is a fundamental task in molecular biology, biotechnology and gene therapy. Here we describe an IP-free one-step method that enables stable genome integration into either prokaryotic or eukaryotic cells. A synthetic mariner transposon is generated by flanking a DNA sequence with short inverted repeats. When purified recombinant Mos1 or Mboumar-9 transposase is co-transfected with transposon-containing plasmid DNA, it penetrates prokaryotic or eukaryotic cells and integrates the target DNA into the genome. In vivo integrations by purified transposase can be achieved by electroporation, chemical transfection or Lipofection of the transposase:DNA mixture, in contrast to other published transposon-based protocols which require electroporation or microinjection. As in other transposome systems, no helper plasmids are required since transposases are not expressed inside the host cells, thus leading to generation of stable cell lines. Since it does not require electroporation or microinjection, this tool has the potential to be applied for automated high-throughput creation of libraries of random integrants for purposes including gene knock-out libraries, screening for optimal integration positions or safe genome locations in different organisms, selection of the highest production of valuable compounds for biotechnology, and sequencing. PMID:28204586
Raupach, Michael J.; Hannig, Karsten; Morinière, Jérome; Hendrich, Lars
2016-01-01
Abstract As molecular identification method, DNA barcoding based on partial cytochrome c oxidase subunit 1 (COI) sequences has been proven to be a useful tool for species determination in many insect taxa including ground beetles. In this study we tested the effectiveness of DNA barcodes to discriminate species of the ground beetle genus Bembidion and some closely related taxa of Germany. DNA barcodes were obtained from 819 individuals and 78 species, including sequences from previous studies as well as more than 300 new generated DNA barcodes. We found a 1:1 correspondence between BIN and traditionally recognized species for 69 species (89%). Low interspecific distances with maximum pairwise K2P values below 2.2% were found for three species pairs, including two species pairs with haplotype sharing (Bembidion atrocaeruleum/Bembidion varicolor and Bembidion guttula/Bembidion mannerheimii). In contrast to this, deep intraspecific sequence divergences with distinct lineages were revealed for two species (Bembidion geniculatum/Ocys harpaloides). Our study emphasizes the use of DNA barcodes for the identification of the analyzed ground beetles species and represents an important step in building-up a comprehensive barcode library for the Carabidae in Germany and Central Europe as well. PMID:27408547
Googling DNA sequences on the World Wide Web.
Hajibabaei, Mehrdad; Singer, Gregory A C
2009-11-10
New web-based technologies provide an excellent opportunity for sharing and accessing information and using web as a platform for interaction and collaboration. Although several specialized tools are available for analyzing DNA sequence information, conventional web-based tools have not been utilized for bioinformatics applications. We have developed a novel algorithm and implemented it for searching species-specific genomic sequences, DNA barcodes, by using popular web-based methods such as Google. We developed an alignment independent character based algorithm based on dividing a sequence library (DNA barcodes) and query sequence to words. The actual search is conducted by conventional search tools such as freely available Google Desktop Search. We implemented our algorithm in two exemplar packages. We developed pre and post-processing software to provide customized input and output services, respectively. Our analysis of all publicly available DNA barcode sequences shows a high accuracy as well as rapid results. Our method makes use of conventional web-based technologies for specialized genetic data. It provides a robust and efficient solution for sequence search on the web. The integration of our search method for large-scale sequence libraries such as DNA barcodes provides an excellent web-based tool for accessing this information and linking it to other available categories of information on the web.
PCR cycles above routine numbers do not compromise high-throughput DNA barcoding results.
Vierna, J; Doña, J; Vizcaíno, A; Serrano, D; Jovani, R
2017-10-01
High-throughput DNA barcoding has become essential in ecology and evolution, but some technical questions still remain. Increasing the number of PCR cycles above the routine 20-30 cycles is a common practice when working with old-type specimens, which provide little amounts of DNA, or when facing annealing issues with the primers. However, increasing the number of cycles can raise the number of artificial mutations due to polymerase errors. In this work, we sequenced 20 COI libraries in the Illumina MiSeq platform. Libraries were prepared with 40, 45, 50, 55, and 60 PCR cycles from four individuals belonging to four species of four genera of cephalopods. We found no relationship between the number of PCR cycles and the number of mutations despite using a nonproofreading polymerase. Moreover, even when using a high number of PCR cycles, the resulting number of mutations was low enough not to be an issue in the context of high-throughput DNA barcoding (but may still remain an issue in DNA metabarcoding due to chimera formation). We conclude that the common practice of increasing the number of PCR cycles should not negatively impact the outcome of a high-throughput DNA barcoding study in terms of the occurrence of point mutations.
pH-programmable DNA logic arrays powered by modular DNAzyme libraries.
Elbaz, Johann; Wang, Fuan; Remacle, Francoise; Willner, Itamar
2012-12-12
Nature performs complex information processing circuits, such the programmed transformations of versatile stem cells into targeted functional cells. Man-made molecular circuits are, however, unable to mimic such sophisticated biomachineries. To reach these goals, it is essential to construct programmable modular components that can be triggered by environmental stimuli to perform different logic circuits. We report on the unprecedented design of artificial pH-programmable DNA logic arrays, constructed by modular libraries of Mg(2+)- and UO(2)(2+)-dependent DNAzyme subunits and their substrates. By the appropriate modular design of the DNA computation units, pH-programmable logic arrays of various complexities are realized, and the arrays can be erased, reused, and/or reprogrammed. Such systems may be implemented in the near future for nanomedical applications by pH-controlled regulation of cellular functions or may be used to control biotransformations stimulated by bacteria.
Protein Science by DNA Sequencing: How Advances in Molecular Biology Are Accelerating Biochemistry.
Higgins, Sean A; Savage, David F
2018-01-09
A fundamental goal of protein biochemistry is to determine the sequence-function relationship, but the vastness of sequence space makes comprehensive evaluation of this landscape difficult. However, advances in DNA synthesis and sequencing now allow researchers to assess the functional impact of every single mutation in many proteins, but challenges remain in library construction and the development of general assays applicable to a diverse range of protein functions. This Perspective briefly outlines the technical innovations in DNA manipulation that allow massively parallel protein biochemistry and then summarizes the methods currently available for library construction and the functional assays of protein variants. Areas in need of future innovation are highlighted with a particular focus on assay development and the use of computational analysis with machine learning to effectively traverse the sequence-function landscape. Finally, applications in the fundamentals of protein biochemistry, disease prediction, and protein engineering are presented.
Structure solution of DNA-binding proteins and complexes with ARCIMBOLDO libraries
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pröpper, Kevin; Instituto de Biologia Molecular de Barcelona; Meindl, Kathrin
2014-06-01
The structure solution of DNA-binding protein structures and complexes based on the combination of location of DNA-binding protein motif fragments with density modification in a multi-solution frame is described. Protein–DNA interactions play a major role in all aspects of genetic activity within an organism, such as transcription, packaging, rearrangement, replication and repair. The molecular detail of protein–DNA interactions can be best visualized through crystallography, and structures emphasizing insight into the principles of binding and base-sequence recognition are essential to understanding the subtleties of the underlying mechanisms. An increasing number of high-quality DNA-binding protein structure determinations have been witnessed despite themore » fact that the crystallographic particularities of nucleic acids tend to pose specific challenges to methods primarily developed for proteins. Crystallographic structure solution of protein–DNA complexes therefore remains a challenging area that is in need of optimized experimental and computational methods. The potential of the structure-solution program ARCIMBOLDO for the solution of protein–DNA complexes has therefore been assessed. The method is based on the combination of locating small, very accurate fragments using the program Phaser and density modification with the program SHELXE. Whereas for typical proteins main-chain α-helices provide the ideal, almost ubiquitous, small fragments to start searches, in the case of DNA complexes the binding motifs and DNA double helix constitute suitable search fragments. The aim of this work is to provide an effective library of search fragments as well as to determine the optimal ARCIMBOLDO strategy for the solution of this class of structures.« less
Yim, Young-Sun; Davis, Georgia L.; Duru, Ngozi A.; Musket, Theresa A.; Linton, Eric W.; Messing, Joachim W.; McMullen, Michael D.; Soderlund, Carol A.; Polacco, Mary L.; Gardiner, Jack M.; Coe, Edward H.
2002-01-01
Three maize (Zea mays) bacterial artificial chromosome (BAC) libraries were constructed from inbred line B73. High-density filter sets from all three libraries, made using different restriction enzymes (HindIII, EcoRI, and MboI, respectively), were evaluated with a set of complex probes including the185-bp knob repeat, ribosomal DNA, two telomere-associated repeat sequences, four centromere repeats, the mitochondrial genome, a multifragment chloroplast DNA probe, and bacteriophage λ. The results indicate that the libraries are of high quality with low contamination by organellar and λ-sequences. The use of libraries from multiple enzymes increased the chance of recovering each region of the genome. Ninety maize restriction fragment-length polymorphism core markers were hybridized to filters of the HindIII library, representing 6× coverage of the genome, to initiate development of a framework for anchoring BAC contigs to the intermated B73 × Mo17 genetic map and to mark the bin boundaries on the physical map. All of the clones used as hybridization probes detected at least three BACs. Twenty-two single-copy number core markers identified an average of 7.4 ± 3.3 positive clones, consistent with the expectation of six clones. This information is integrated into fingerprinting data generated by the Arizona Genomics Institute to assemble the BAC contigs using fingerprint contig and contributed to the process of physical map construction. PMID:12481051
Non-biased and efficient global amplification of a single-cell cDNA library
Huang, Huan; Goto, Mari; Tsunoda, Hiroyuki; Sun, Lizhou; Taniguchi, Kiyomi; Matsunaga, Hiroko; Kambara, Hideki
2014-01-01
Analysis of single-cell gene expression promises a more precise understanding of molecular mechanisms of a living system. Most techniques only allow studies of the expressions for limited numbers of gene species. When amplification of cDNA was carried out for analysing more genes, amplification biases were frequently reported. A non-biased and efficient global-amplification method, which uses a single-cell cDNA library immobilized on beads, was developed for analysing entire gene expressions for single cells. Every step in this analysis from reverse transcription to cDNA amplification was optimized. By removing degrading excess primers, the bias due to the digestion of cDNA was prevented. Since the residual reagents, which affect the efficiency of each subsequent reaction, could be removed by washing beads, the conditions for uniform and maximized amplification of cDNAs were achieved. The differences in the amplification rates for randomly selected eight genes were within 1.5-folds, which could be negligible for most of the applications of single-cell analysis. The global amplification gives a large amount of amplified cDNA (>100 μg) from a single cell (2-pg mRNA), and that amount is enough for downstream analysis. The proposed global-amplification method was used to analyse transcript ratios of multiple cDNA targets (from several copies to several thousand copies) quantitatively. PMID:24141095
Huang, D; Wu, W; Zhou, Y; Hu, Z; Lu, L
2004-05-01
Construction of single chromosomal DNA libraries by means of chromosome microdissection and microcloning will be useful for genomic research, especially for those species that have not been extensively studied genetically. Application of the technology of microdissection and microcloning to woody fruit plants has not been reported hitherto, largely due to the generally small sizes of metaphase chromosomes and the difficulty of chromosome preparation. The present study was performed to establish a method for single chromosome microdissection and microcloning in woody fruit species using pomelo as a model. The standard karyotype of a pomelo cultivar ( Citrus grandis cv. Guanxi) was established based on 20 prometaphase photomicrographs. According to the standard karyotype, chromosome 1 was identified and isolated with fine glass microneedles controlled by a micromanipulator. DNA fragments ranging from 0.3 kb to 2 kb were acquired from the isolated single chromosome 1 via two rounds of PCR mediated by Sau3A linker adaptors and then cloned into T-easy vectors to generate a DNA library of chromosome 1. Approximately 30,000 recombinant clones were obtained. Evaluation based on 108 randomly selected clones showed that the sizes of the cloned inserts varied from 0.5 kb to 1.5 kb with an average of 860 bp. Our research suggests that microdissection and microcloning of single small chromosomes in woody plants is feasible.
Kresse, Stine H; Namløs, Heidi M; Lorenz, Susanne; Berner, Jeanne-Marie; Myklebost, Ola; Bjerkehagen, Bodil; Meza-Zepeda, Leonardo A
2018-01-01
Nucleic acid material of adequate quality is crucial for successful high-throughput sequencing (HTS) analysis. DNA and RNA isolated from archival FFPE material are frequently degraded and not readily amplifiable due to chemical damage introduced during fixation. To identify optimal nucleic acid extraction kits, DNA and RNA quantity, quality and performance in HTS applications were evaluated. DNA and RNA were isolated from five sarcoma archival FFPE blocks, using eight extraction protocols from seven kits from three different commercial vendors. For DNA extraction, the truXTRAC FFPE DNA kit from Covaris gave higher yields and better amplifiable DNA, but all protocols gave comparable HTS library yields using Agilent SureSelect XT and performed well in downstream variant calling. For RNA extraction, all protocols gave comparable yields and amplifiable RNA. However, for fusion gene detection using the Archer FusionPlex Sarcoma Assay, the truXTRAC FFPE RNA kit from Covaris and Agencourt FormaPure kit from Beckman Coulter showed the highest percentage of unique read-pairs, providing higher complexity of HTS data and more frequent detection of recurrent fusion genes. truXTRAC simultaneous DNA and RNA extraction gave similar outputs as individual protocols. These findings show that although successful HTS libraries could be generated in most cases, the different protocols gave variable quantity and quality for FFPE nucleic acid extraction. Selecting the optimal procedure is highly valuable and may generate results in borderline quality specimens.
Sharma, Nandita; Tanksale, Himgouri; Kapley, Atya; Purohit, Hemant J
2012-12-01
Metagenomic libraries herald the era of magnifying the microbial world, tapping into the vast metabolic potential of uncultivated microbes, and enhancing the rate of discovery of novel genes and pathways. In this paper, we describe a method that facilitates the extraction of metagenomic DNA from activated sludge of an industrial wastewater treatment plant and its use in mining the metagenome via library construction. The efficiency of this method was demonstrated by the large representation of the bacterial genome in the constructed metagenomic libraries and by the functional clones obtained. The BAC library represented 95.6 times the bacterial genome, while, the pUC library represented 41.7 times the bacterial genome. Twelve clones in the BAC library demonstrated lipolytic activity, while four clones demonstrated dioxygenase activity. Four clones in pUC library tested positive for cellulase activity. This method, using FTA cards, not only can be used for library construction, but can also store the metagenome at room temperature.
Key, Katherine C; Sublette, Kerry L; Duncan, Kathleen; Mackay, Douglas M; Scow, Kate M; Ogles, Dora
2013-01-01
Although the anaerobic biodegradation of methyl tert -butyl ether (MTBE) and tert -butyl alcohol (TBA) has been documented in the laboratory and the field, knowledge of the microorganisms and mechanisms involved is still lacking. In this study, DNA-stable isotope probing (SIP) was used to identify microorganisms involved in anaerobic fuel oxygenate biodegradation in a sulfate-reducing MTBE and TBA plume. Microorganisms were collected in the field using Bio-Sep® beads amended with 13 C 5 -MTBE, 13 C 1 -MTBE (only methoxy carbon labeled), or 13 C 4 -TBA. 13 C-DNA and 12 C-DNA extracted from the Bio-Sep beads were cloned and 16S rRNA gene sequences were used to identify the indigenous microorganisms involved in degrading the methoxy group of MTBE and the tert -butyl group of MTBE and TBA. Results indicated that microorganisms were actively degrading 13 C-labeled MTBE and TBA in situ and the 13 C was incorporated into their DNA. Several sequences related to known MTBE- and TBA-degraders in the Burkholderiales and the Sphingomonadales orders were detected in all three 13 C clone libraries and were likely to be primary degraders at the site. Sequences related to sulfate-reducing bacteria and iron-reducers, such as Geobacter and Geothrix , were only detected in the clone libraries where MTBE and TBA were fully labeled with 13 C, suggesting that they were involved in processing carbon from the tert -butyl group. Sequences similar to the Pseudomonas genus predominated in the clone library where only the methoxy carbon of MTBE was labeled with 13 C. It is likely that members of this genus were secondary degraders cross-feeding on 13 C-labeled metabolites such as acetate.
Key, Katherine C.; Sublette, Kerry L.; Duncan, Kathleen; Mackay, Douglas M.; Scow, Kate M.; Ogles, Dora
2014-01-01
Although the anaerobic biodegradation of methyl tert-butyl ether (MTBE) and tert-butyl alcohol (TBA) has been documented in the laboratory and the field, knowledge of the microorganisms and mechanisms involved is still lacking. In this study, DNA-stable isotope probing (SIP) was used to identify microorganisms involved in anaerobic fuel oxygenate biodegradation in a sulfate-reducing MTBE and TBA plume. Microorganisms were collected in the field using Bio-Sep® beads amended with 13C5-MTBE, 13C1-MTBE (only methoxy carbon labeled), or13C4-TBA. 13C-DNA and 12C-DNA extracted from the Bio-Sep beads were cloned and 16S rRNA gene sequences were used to identify the indigenous microorganisms involved in degrading the methoxy group of MTBE and the tert-butyl group of MTBE and TBA. Results indicated that microorganisms were actively degrading 13C-labeled MTBE and TBA in situ and the 13C was incorporated into their DNA. Several sequences related to known MTBE- and TBA-degraders in the Burkholderiales and the Sphingomonadales orders were detected in all three13C clone libraries and were likely to be primary degraders at the site. Sequences related to sulfate-reducing bacteria and iron-reducers, such as Geobacter and Geothrix, were only detected in the clone libraries where MTBE and TBA were fully labeled with 13C, suggesting that they were involved in processing carbon from the tert-butyl group. Sequences similar to the Pseudomonas genus predominated in the clone library where only the methoxy carbon of MTBE was labeled with 13C. It is likely that members of this genus were secondary degraders cross-feeding on 13C-labeled metabolites such as acetate. PMID:25525320
Hering, Daniel; Borja, Angel; Jones, J Iwan; Pont, Didier; Boets, Pieter; Bouchez, Agnes; Bruce, Kat; Drakare, Stina; Hänfling, Bernd; Kahlert, Maria; Leese, Florian; Meissner, Kristian; Mergen, Patricia; Reyjol, Yorick; Segurado, Pedro; Vogler, Alfried; Kelly, Martyn
2018-07-01
Assessment of ecological status for the European Water Framework Directive (WFD) is based on "Biological Quality Elements" (BQEs), namely phytoplankton, benthic flora, benthic invertebrates and fish. Morphological identification of these organisms is a time-consuming and expensive procedure. Here, we assess the options for complementing and, perhaps, replacing morphological identification with procedures using eDNA, metabarcoding or similar approaches. We rate the applicability of DNA-based identification for the individual BQEs and water categories (rivers, lakes, transitional and coastal waters) against eleven criteria, summarised under the headlines representativeness (for example suitability of current sampling methods for DNA-based identification, errors from DNA-based species detection), sensitivity (for example capability to detect sensitive taxa, unassigned reads), precision of DNA-based identification (knowledge about uncertainty), comparability with conventional approaches (for example sensitivity of metrics to differences in DNA-based identification), cost effectiveness and environmental impact. Overall, suitability of DNA-based identification is particularly high for fish, as eDNA is a well-suited sampling approach which can replace expensive and potentially harmful methods such as gill-netting, trawling or electrofishing. Furthermore, there are attempts to replace absolute by relative abundance in metric calculations. For invertebrates and phytobenthos, the main challenges include the modification of indices and completing barcode libraries. For phytoplankton, the barcode libraries are even more problematic, due to the high taxonomic diversity in plankton samples. If current assessment concepts are kept, DNA-based identification is least appropriate for macrophytes (rivers, lakes) and angiosperms/macroalgae (transitional and coastal waters), which are surveyed rather than sampled. We discuss general implications of implementing DNA-based identification into standard ecological assessment, in particular considering any adaptations to the WFD that may be required to facilitate the transition to molecular data. Copyright © 2018 Elsevier Ltd. All rights reserved.
F.N. Martin; M. Coffey; R. Hamelin; P. Tooley; M. Garbelotto; K. Hughes; T. Kubisiak
2008-01-01
A number of molecular diagnostic procedures for detection of Phytophthora ramorum have been reported in the literature. In an effort to evaluate the specificity of 10 of these techniques a standardized DNA library for 317 isolates was assembled that included 60 described species as well as 22 taxonomically unclassified isolates. These were sent blind...
F.N. Martin; M.D. Coffey; K. Zeller; R.C. Hamelin; P. Tooley; M. Garbelotto; K.J.D. Hughes; T. Kubisiak; G.J. Bilodeau; L. Levy; C. Blomquist; P.H. Berger
2009-01-01
Given the importance of Phytophthora ramorum from a regulatory standpoint, it is imperative that molecular markers for pathogen detection are fully tested to evaluate their specificity in detection of the pathogen. In an effort to evaluate 11 reported diagnostic techniques, we assembled a standardized DNA library using accessions from the World...
Favalli, Nicholas; Biendl, Stefan; Hartmann, Marco; Piazzi, Jacopo; Sladojevich, Filippo; Gräslund, Susanne; Brown, Peter J; Näreoja, Katja; Schüler, Herwig; Scheuermann, Jörg; Franzini, Raphael; Neri, Dario
2018-06-01
A DNA-encoded chemical library (DECL) with 1.2 million compounds was synthesized by combinatorial reaction of seven central scaffolds with two sets of 343×492 building blocks. Library screening by affinity capture revealed that for some target proteins, the chemical nature of building blocks dominated the selection results, whereas for other proteins, the central scaffold also crucially contributed to ligand affinity. Molecules based on a 3,5-bis(aminomethyl)benzoic acid core structure were found to bind human serum albumin with a K d value of 6 nm, while compounds with the same substituents on an equidistant but flexible l-lysine scaffold showed 140-fold lower affinity. A 18 nm tankyrase-1 binder featured l-lysine as linking moiety, while molecules based on d-Lysine or (2S,4S)-amino-l-proline showed no detectable binding to the target. This work suggests that central scaffolds which predispose the orientation of chemical building blocks toward the protein target may enhance the screening productivity of encoded libraries. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stark, M.; Naiman, T.; Canaani, D.
1989-08-15
In a previous work, an immortal xeroderma pigmentosum cell line belonging to complementation group C was complemented to a UV-resistant phenotype by transfection with a human cDNA clone library. We now report that the primary transformants selected for UV-resistance also acquired normal levels of DNA repair. This was assessed both by measurement of UV-induced ({sup 3}H)thymidine incorporation and by equilibrium sedimentation analysis of repair-DNA synthesis. Therefore, the transduced DNA element which confers normal UV-resistance also corrects the excision repair defect of the xeroderma pigmentosum group C cell line.
Molecular Cloning and Sequencing of Channel Catfish, Ictalurus punctatus, Cathepsin H and L cDNA
USDA-ARS?s Scientific Manuscript database
Cathepsin H and L, a lysosomal cysteine endopeptidase of the papain family, are ubiquitously expressed and involve in antigen processing. In this communication, the channel catfish cathepsin H and L transcripts were sequenced and analyzed. Total RNA from tissues was extracted and cDNA libraries we...
USDA-ARS?s Scientific Manuscript database
As an initial step to explore the transcriptome genetic diversity and to discover single nucleotide polymorphic (SNP)-biomarkers for marker assisted breeding within Pima (Gossypium barbadense L.) cotton, leaves from 25 day plants of three diverse genotypes were used to develop cDNA libraries. Using ...
High-throughput microtitre plate-based assay for DNA topoisomerases.
Taylor, James A; Burton, Nicolas P; Maxwell, Anthony
2012-01-01
We have developed a rapid, high-throughput assay for measuring the catalytic activity (DNA supercoiling or relaxation) of DNA topoisomerases. The assay utilizes intermolecular triplex formation between an immobilized triplex-forming oligo (TFO) and a triplex-forming region inserted into the plasmid substrate (pNO1), and capitalizes on the observation that supercoiled DNA forms triplexes more readily than relaxed DNA. Thus, supercoiled DNA is preferentially retained by the TFO under triplex-forming conditions while relaxed DNA can be washed away. Due to its high speed of sample analysis and reduced sample handling over conventional gel-based techniques, this assay can be used to screen chemical libraries for novel inhibitors of topoisomerases.
High Bacterial Diversity in Permanently Cold Marine Sediments
Ravenschlag, Katrin; Sahm, Kerstin; Pernthaler, Jakob; Amann, Rudolf
1999-01-01
A 16S ribosomal DNA (rDNA) clone library from permanently cold marine sediments was established. Screening 353 clones by dot blot hybridization with group-specific oligonucleotide probes suggested a predominance of sequences related to bacteria of the sulfur cycle (43.4% potential sulfate reducers). Within this fraction, the major cluster (19.0%) was affiliated with Desulfotalea sp. and other closely related psychrophilic sulfate reducers isolated from the same habitat. The cloned sequences showed between 93 and 100% similarity to these bacteria. Two additional groups were frequently encountered: 13% of the clones were related to Desulfuromonas palmitatis, and a second group was affiliated with Myxobacteria spp. and Bdellovibrio spp. Many clones (18.1%) belonged to the γ subclass of the class Proteobacteria and were closest to symbiotic or free-living sulfur oxidizers. Probe target groups were further characterized by amplified rDNA restriction analysis to determine diversity within the groups and within the clone library. Rarefaction analysis suggested that the total diversity assessed by 16S rDNA analysis was very high in these permanently cold sediments and was only partially revealed by screening of 353 clones. PMID:10473405
Gardenia jasminoides Encodes an Inhibitor-2 Protein for Protein Phosphatase Type 1
NASA Astrophysics Data System (ADS)
Gao, Lan; Li, Hao-Ming
2017-08-01
Protein phosphatase-1 (PP1) regulates diverse, essential cellular processes such as cell cycle progression, protein synthesis, muscle contraction, carbohydrate metabolism, transcription and neuronal signaling. Inhibitor-2 (I-2) can inhibit the activity of PP1 and has been found in diverse organisms. In this work, a Gardenia jasminoides fruit cDNA library was constructed, and the GjI-2 cDNA was isolated from the cDNA library by sequencing method. The GjI-2 cDNA contains a predicted 543 bp open reading frame that encodes 180 amino acids. The bioinformatics analysis suggested that the GjI-2 has conserved PP1c binding motif, and contains a conserved phosphorylation site, which is important in regulation of its activity. The three-dimensional model structure of GjI-2 was buite, its similar with the structure of I-2 from mouse. The results suggest that GjI-2 has relatively conserved RVxF, FxxR/KxR/K and HYNE motif, and these motifs are involved in interaction with PP1.
The ChIP-exo Method: Identifying Protein-DNA Interactions with Near Base Pair Precision.
Perreault, Andrea A; Venters, Bryan J
2016-12-23
Chromatin immunoprecipitation (ChIP) is an indispensable tool in the fields of epigenetics and gene regulation that isolates specific protein-DNA interactions. ChIP coupled to high throughput sequencing (ChIP-seq) is commonly used to determine the genomic location of proteins that interact with chromatin. However, ChIP-seq is hampered by relatively low mapping resolution of several hundred base pairs and high background signal. The ChIP-exo method is a refined version of ChIP-seq that substantially improves upon both resolution and noise. The key distinction of the ChIP-exo methodology is the incorporation of lambda exonuclease digestion in the library preparation workflow to effectively footprint the left and right 5' DNA borders of the protein-DNA crosslink site. The ChIP-exo libraries are then subjected to high throughput sequencing. The resulting data can be leveraged to provide unique and ultra-high resolution insights into the functional organization of the genome. Here, we describe the ChIP-exo method that we have optimized and streamlined for mammalian systems and next-generation sequencing-by-synthesis platform.
Tissue Gene Expression Analysis Using Arrayed Normalized cDNA Libraries
Eickhoff, Holger; Schuchhardt, Johannes; Ivanov, Igor; Meier-Ewert, Sebastian; O'Brien, John; Malik, Arif; Tandon, Neeraj; Wolski, Eryk-Witold; Rohlfs, Elke; Nyarsik, Lajos; Reinhardt, Richard; Nietfeld, Wilfried; Lehrach, Hans
2000-01-01
We have used oligonucleotide-fingerprinting data on 60,000 cDNA clones from two different mouse embryonic stages to establish a normalized cDNA clone set. The normalized set of 5,376 clones represents different clusters and therefore, in almost all cases, different genes. The inserts of the cDNA clones were amplified by PCR and spotted on glass slides. The resulting arrays were hybridized with mRNA probes prepared from six different adult mouse tissues. Expression profiles were analyzed by hierarchical clustering techniques. We have chosen radioactive detection because it combines robustness with sensitivity and allows the comparison of multiple normalized experiments. Sensitive detection combined with highly effective clustering algorithms allowed the identification of tissue-specific expression profiles and the detection of genes specifically expressed in the tissues investigated. The obtained results are publicly available (http://www.rzpd.de) and can be used by other researchers as a digital expression reference. [The sequence data described in this paper have been submitted to the EMBL data library under accession nos. AL360374–AL36537.] PMID:10958641
Critical factors for assembling a high volume of DNA barcodes
Hajibabaei, Mehrdad; deWaard, Jeremy R; Ivanova, Natalia V; Ratnasingham, Sujeevan; Dooh, Robert T; Kirk, Stephanie L; Mackie, Paula M; Hebert, Paul D.N
2005-01-01
Large-scale DNA barcoding projects are now moving toward activation while the creation of a comprehensive barcode library for eukaryotes will ultimately require the acquisition of some 100 million barcodes. To satisfy this need, analytical facilities must adopt protocols that can support the rapid, cost-effective assembly of barcodes. In this paper we discuss the prospects for establishing high volume DNA barcoding facilities by evaluating key steps in the analytical chain from specimens to barcodes. Alliances with members of the taxonomic community represent the most effective strategy for provisioning the analytical chain with specimens. The optimal protocols for DNA extraction and subsequent PCR amplification of the barcode region depend strongly on their condition, but production targets of 100K barcode records per year are now feasible for facilities working with compliant specimens. The analysis of museum collections is currently challenging, but PCR cocktails that combine polymerases with repair enzyme(s) promise future success. Barcode analysis is already a cost-effective option for species identification in some situations and this will increasingly be the case as reference libraries are assembled and analytical protocols are simplified. PMID:16214753
Yang, Hongfang; Medeiros, Patricia F; Raha, Kaushik; Elkins, Patricia; Lind, Kenneth E; Lehr, Ruth; Adams, Nicholas D; Burgess, Joelle L; Schmidt, Stanley J; Knight, Steven D; Auger, Kurt R; Schaber, Michael D; Franklin, G Joseph; Ding, Yun; DeLorey, Jennifer L; Centrella, Paolo A; Mataruse, Sibongile; Skinner, Steven R; Clark, Matthew A; Cuozzo, John W; Evindar, Ghotas
2015-05-14
In the search of PI3K p110α wild type and H1047R mutant selective small molecule leads, an encoded library technology (ELT) campaign against the desired target proteins was performed which led to the discovery of a selective chemotype for PI3K isoforms from a three-cycle DNA encoded library. An X-ray crystal structure of a representative inhibitor from this chemotype demonstrated a unique binding mode in the p110α protein.
2015-01-01
In the search of PI3K p110α wild type and H1047R mutant selective small molecule leads, an encoded library technology (ELT) campaign against the desired target proteins was performed which led to the discovery of a selective chemotype for PI3K isoforms from a three-cycle DNA encoded library. An X-ray crystal structure of a representative inhibitor from this chemotype demonstrated a unique binding mode in the p110α protein. PMID:26005528
Tomko, Timothy A; Dunlop, Mary J
2015-01-01
Recent metabolic engineering efforts have generated microorganisms that can produce biofuels, including bio-jet fuels, however these fuels are often toxic to cells, limiting production yields. There are natural examples of microorganisms that have evolved mechanisms for tolerating hydrocarbon-rich environments, such as those that thrive near natural oil seeps and in oil-polluted waters. Using genomic DNA from the hydrocarbon-degrading microbe Marinobacter aquaeolei, we constructed a transgenic library that we expressed in Escherichia coli. We exposed cells to inhibitory levels of pinene, a monoterpene that can serve as a jet fuel precursor with chemical properties similar to existing tactical fuels. Using a sequential strategy with a fosmid library followed by a plasmid library, we were able to isolate a region of DNA from the M. aquaeolei genome that conferred pinene tolerance when expressed in E. coli. We determined that a single gene, yceI, was responsible for the tolerance improvements. Overexpression of this gene placed no additional burden on the host. We also tested tolerance to other monoterpenes and showed that yceI selectively improves tolerance. The genomes of hydrocarbon-tolerant microbes represent a rich resource for tolerance engineering. Using a transgenic library, we were able to identify a single gene that improves E. coli's tolerance to the bio-jet fuel precursor pinene.
Improved Modeling of Side-Chain–Base Interactions and Plasticity in Protein–DNA Interface Design
Thyme, Summer B.; Baker, David; Bradley, Philip
2012-01-01
Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed “motifs”) was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein–DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent. PMID:22426128
Plans and progress for building a Great Lakes fauna DNA ...
DNA reference libraries provide researchers with an important tool for assessing regional biodiversity by allowing unknown genetic sequences to be assigned identities, while also providing a means for taxonomists to validate identifications. Expanding the representation of Great Lakes species in such reference libraries is an explicit component of research at EPA’s Mid-Continent Ecology Division. Our DNA reference library building efforts began in 2012 with the goal of providing barcodes for at least 5 specimens of each native and nonindigenous fish and aquatic invertebrate species currently present in the Great Lakes. The approach is to pull taxonomically validated specimen for sequencing from EPA led sampling efforts of adult/juvenile fish, larval fish, benthic macroinvertebrates, and zooplankton; while also soliciting aid from state and federal agencies for tissue from “shopping list” organisms. The barcodes we generate are made available through the publicly accessible BOLD (Barcode of Life) database, and help inform a planned Great Lakes biodiversity inventory. To date, our submissions to BOLD are limited to fishes; of the 88 fish species listed as being present within Lake Superior, roughly half were successfully barcoded, while only 22 species met the desired quota of 5 barcoded specimens per species. As we continue to generate genomic information from our collections and the taxonomic representations become more complete, we will continue to
Quambusch, Mona; Pirttilä, Anna Maria; Tejesvi, Mysore V; Winkelmann, Traud; Bartsch, Melanie
2014-05-01
The endophytic bacterial communities of six Prunus avium L. genotypes differing in their growth patterns during in vitro propagation were identified by culture-dependent and culture-independent methods. Five morphologically distinct isolates from tissue culture material were identified by 16S rDNA sequence analysis. To detect and analyze the uncultivable fraction of endophytic bacteria, a clone library was established from the amplified 16S rDNA of total plant extract. Bacterial diversity within the clone libraries was analyzed by amplified ribosomal rDNA restriction analysis and by sequencing a clone for each identified operational taxonomic unit. The most abundant bacterial group was Mycobacterium sp., which was identified in the clone libraries of all analyzed Prunus genotypes. Other dominant bacterial genera identified in the easy-to-propagate genotypes were Rhodopseudomonas sp. and Microbacterium sp. Thus, the community structures in the easy- and difficult-to-propagate cherry genotypes differed significantly. The bacterial genera, which were previously reported to have plant growth-promoting effects, were detected only in genotypes with high propagation success, indicating a possible positive impact of these bacteria on in vitro propagation of P. avium, which was proven in an inoculation experiment. © The Author 2014. Published by Oxford University Press. All rights reserved.
Improved modeling of side-chain--base interactions and plasticity in protein--DNA interface design.
Thyme, Summer B; Baker, David; Bradley, Philip
2012-06-08
Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed "motifs") was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein-DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent. Published by Elsevier Ltd.
Henry, Kevin A
2018-01-01
Immunogenetic analyses of expressed antibody repertoires are becoming increasingly common experimental investigations and are critical to furthering our understanding of autoimmunity, infectious disease, and cancer. Next-generation DNA sequencing (NGS) technologies have now made it possible to interrogate antibody repertoires to unprecedented depths, typically by sequencing of cDNAs encoding immunoglobulin variable domains. In this chapter, we describe simple, fast, and reliable methods for producing and sequencing multiplex PCR amplicons derived from the variable regions (V H , V H H or V L ) of rearranged immunoglobulin heavy and light chain genes using the Illumina MiSeq platform. We include complete protocols and primer sets for amplicon sequencing of V H /V H H/V L repertoires directly from human, mouse, and llama lymphocytes as well as from phage-displayed V H /V H H/V L libraries; these can be easily be adapted to other types of amplicons with little modification. The resulting amplicons are diverse and representative, even using as few as 10 3 input B cells, and their generation is relatively inexpensive, requiring no special equipment and only a limited set of primers. In the absence of heavy-light chain pairing, single-domain antibodies are uniquely amenable to NGS analyses. We present a number of applications of NGS technology useful in discovery of single-domain antibodies from phage display libraries, including: (i) assessment of library functionality; (ii) confirmation of desired library randomization; (iii) estimation of library diversity; and (iv) monitoring the progress of panning experiments. While the case studies presented here are of phage-displayed single-domain antibody libraries, the principles extend to other types of in vitro display libraries.
Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes.
Oyola, Samuel O; Otto, Thomas D; Gu, Yong; Maslen, Gareth; Manske, Magnus; Campino, Susana; Turner, Daniel J; Macinnis, Bronwyn; Kwiatkowski, Dominic P; Swerdlow, Harold P; Quail, Michael A
2012-01-03
Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences. We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates. We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material.
Tan, Swee Jin; Phan, Huan; Gerry, Benjamin Michael; Kuhn, Alexandre; Hong, Lewis Zuocheng; Min Ong, Yao; Poon, Polly Suk Yean; Unger, Marc Alexander; Jones, Robert C; Quake, Stephen R; Burkholder, William F
2013-01-01
Library preparation for next-generation DNA sequencing (NGS) remains a key bottleneck in the sequencing process which can be relieved through improved automation and miniaturization. We describe a microfluidic device for automating laboratory protocols that require one or more column chromatography steps and demonstrate its utility for preparing Next Generation sequencing libraries for the Illumina and Ion Torrent platforms. Sixteen different libraries can be generated simultaneously with significantly reduced reagent cost and hands-on time compared to manual library preparation. Using an appropriate column matrix and buffers, size selection can be performed on-chip following end-repair, dA tailing, and linker ligation, so that the libraries eluted from the chip are ready for sequencing. The core architecture of the device ensures uniform, reproducible column packing without user supervision and accommodates multiple routine protocol steps in any sequence, such as reagent mixing and incubation; column packing, loading, washing, elution, and regeneration; capture of eluted material for use as a substrate in a later step of the protocol; and removal of one column matrix so that two or more column matrices with different functional properties can be used in the same protocol. The microfluidic device is mounted on a plastic carrier so that reagents and products can be aliquoted and recovered using standard pipettors and liquid handling robots. The carrier-mounted device is operated using a benchtop controller that seals and operates the device with programmable temperature control, eliminating any requirement for the user to manually attach tubing or connectors. In addition to NGS library preparation, the device and controller are suitable for automating other time-consuming and error-prone laboratory protocols requiring column chromatography steps, such as chromatin immunoprecipitation.
Grant, Susan; Grant, William D.; Cowan, Don A.; Jones, Brian E.; Ma, Yanhe; Ventosa, Antonio; Heaphy, Shaun
2006-01-01
Here we describe the application of metagenomic technologies to construct cDNA libraries from RNA isolated from environmental samples. RNAlater (Ambion) was shown to stabilize RNA in environmental samples for periods of at least 3 months at −20°C. Protocols for library construction were established on total RNA extracted from Acanthamoeba polyphaga trophozoites. The methodology was then used on algal mats from geothermal hot springs in Tengchong county, Yunnan Province, People's Republic of China, and activated sludge from a sewage treatment plant in Leicestershire, United Kingdom. The Tenchong libraries were dominated by RNA from prokaryotes, reflecting the mainly prokaryote microbial composition. The majority of these clones resulted from rRNA; only a few appeared to be derived from mRNA. In contrast, many clones from the activated sludge library had significant similarity to eukaryote mRNA-encoded protein sequences. A library was also made using polyadenylated RNA isolated from total RNA from activated sludge; many more clones in this library were related to eukaryotic mRNA sequences and proteins. Open reading frames (ORFs) up to 378 amino acids in size could be identified. Some resembled known proteins over their full length, e.g., 36% match to cystatin, 49% match to ribosomal protein L32, 63% match to ribosomal protein S16, 70% to CPC2 protein. The methodology described here permits the polyadenylated transcriptome to be isolated from environmental samples with no knowledge of the identity of the microorganisms in the sample or the necessity to culture them. It has many uses, including the identification of novel eukaryotic ORFs encoding proteins and enzymes. PMID:16391035
Tan, Swee Jin; Phan, Huan; Gerry, Benjamin Michael; Kuhn, Alexandre; Hong, Lewis Zuocheng; Min Ong, Yao; Poon, Polly Suk Yean; Unger, Marc Alexander; Jones, Robert C.; Quake, Stephen R.; Burkholder, William F.
2013-01-01
Library preparation for next-generation DNA sequencing (NGS) remains a key bottleneck in the sequencing process which can be relieved through improved automation and miniaturization. We describe a microfluidic device for automating laboratory protocols that require one or more column chromatography steps and demonstrate its utility for preparing Next Generation sequencing libraries for the Illumina and Ion Torrent platforms. Sixteen different libraries can be generated simultaneously with significantly reduced reagent cost and hands-on time compared to manual library preparation. Using an appropriate column matrix and buffers, size selection can be performed on-chip following end-repair, dA tailing, and linker ligation, so that the libraries eluted from the chip are ready for sequencing. The core architecture of the device ensures uniform, reproducible column packing without user supervision and accommodates multiple routine protocol steps in any sequence, such as reagent mixing and incubation; column packing, loading, washing, elution, and regeneration; capture of eluted material for use as a substrate in a later step of the protocol; and removal of one column matrix so that two or more column matrices with different functional properties can be used in the same protocol. The microfluidic device is mounted on a plastic carrier so that reagents and products can be aliquoted and recovered using standard pipettors and liquid handling robots. The carrier-mounted device is operated using a benchtop controller that seals and operates the device with programmable temperature control, eliminating any requirement for the user to manually attach tubing or connectors. In addition to NGS library preparation, the device and controller are suitable for automating other time-consuming and error-prone laboratory protocols requiring column chromatography steps, such as chromatin immunoprecipitation. PMID:23894273
Double Dutch: A Tool for Designing Combinatorial Libraries of Biological Systems.
Roehner, Nicholas; Young, Eric M; Voigt, Christopher A; Gordon, D Benjamin; Densmore, Douglas
2016-06-17
Recently, semirational approaches that rely on combinatorial assembly of characterized DNA components have been used to engineer biosynthetic pathways. In practice, however, it is not practical to assemble and test millions of pathway variants in order to elucidate how different DNA components affect the behavior of a pathway. To address this challenge, we apply a rigorous mathematical approach known as design of experiments (DOE) that can be used to construct empirical models of system behavior without testing all variants. To support this approach, we have developed a tool named Double Dutch, which uses a formal grammar and heuristic algorithms to automate the process of DOE library design. Compared to designing by hand, Double Dutch enables users to more efficiently and scalably design libraries of pathway variants that can be used in a DOE framework and uniquely provides a means to flexibly balance design considerations of statistical analysis, construction cost, and risk of homologous recombination, thereby demonstrating the utility of automating decision making when faced with complex design trade-offs.
RNA-Seq analysis to capture the transcriptome landscape of a single cell
Tang, Fuchou; Barbacioru, Catalin; Nordman, Ellen; Xu, Nanlan; Bashkirov, Vladimir I; Lao, Kaiqin; Surani, M. Azim
2013-01-01
We describe here a protocol for digital transcriptome analysis in a single mouse blastomere using a deep sequencing approach. An individual blastomere was first isolated and put into lysate buffer by mouth pipette. Reverse transcription was then performed directly on the whole cell lysate. After this, the free primers were removed by Exonuclease I and a poly(A) tail was added to the 3′ end of the first-strand cDNA by Terminal Deoxynucleotidyl Transferase. Then the single cell cDNAs were amplified by 20 plus 9 cycles of PCR. Then 100-200 ng of these amplified cDNAs were used to construct a sequencing library. The sequencing library can be used for deep sequencing using the SOLiD system. Compared with the cDNA microarray technique, our assay can capture up to 75% more genes expressed in early embryos. The protocol can generate deep sequencing libraries within 6 days for 16 single cell samples. PMID:20203668
Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard
2013-01-01
Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce. PMID:23409088
Interaction Analysis through Proteomic Phage Display
2014-01-01
Phage display is a powerful technique for profiling specificities of peptide binding domains. The method is suited for the identification of high-affinity ligands with inhibitor potential when using highly diverse combinatorial peptide phage libraries. Such experiments further provide consensus motifs for genome-wide scanning of ligands of potential biological relevance. A complementary but considerably less explored approach is to display expression products of genomic DNA, cDNA, open reading frames (ORFs), or oligonucleotide libraries designed to encode defined regions of a target proteome on phage particles. One of the main applications of such proteomic libraries has been the elucidation of antibody epitopes. This review is focused on the use of proteomic phage display to uncover protein-protein interactions of potential relevance for cellular function. The method is particularly suited for the discovery of interactions between peptide binding domains and their targets. We discuss the largely unexplored potential of this method in the discovery of domain-motif interactions of potential biological relevance. PMID:25295249
Matvienko, Marta; Kozik, Alexander; Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard
2013-01-01
Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce.
Upregulated Genes In Sporadic, Idiopathic Pulmonary Arterial Hypertension
Edgar, Alasdair J; Chacón, Matilde R; Bishop, Anne E; Yacoub, Magdi H; Polak, Julia M
2006-01-01
Background To elucidate further the pathogenesis of sporadic, idiopathic pulmonary arterial hypertension (IPAH) and identify potential therapeutic avenues, differential gene expression in IPAH was examined by suppression subtractive hybridisation (SSH). Methods Peripheral lung samples were obtained immediately after removal from patients undergoing lung transplant for IPAH without familial disease, and control tissues consisted of similarly sampled pieces of donor lungs not utilised during transplantation. Pools of lung mRNA from IPAH cases containing plexiform lesions and normal donor lungs were used to generate the tester and driver cDNA libraries, respectively. A subtracted IPAH cDNA library was made by SSH. Clones isolated from this subtracted library were examined for up regulated expression in IPAH using dot blot arrays of positive colony PCR products using both pooled cDNA libraries as probes. Clones verified as being upregulated were sequenced. For two genes the increase in expression was verified by northern blotting and data analysed using Student's unpaired two-tailed t-test. Results We present preliminary findings concerning candidate genes upregulated in IPAH. Twenty-seven upregulated genes were identified out of 192 clones examined. Upregulation in individual cases of IPAH was shown by northern blot for tissue inhibitor of metalloproteinase-3 and decorin (P < 0.01) compared with the housekeeping gene glyceraldehydes-3-phosphate dehydrogenase. Conclusion Four of the up regulated genes, magic roundabout, hevin, thrombomodulin and sucrose non-fermenting protein-related kinase-1 are expressed specifically by endothelial cells and one, muscleblind-1, by muscle cells, suggesting that they may be associated with plexiform lesions and hypertrophic arterial wall remodelling, respectively. PMID:16390543
Sakurai, Tetsuya; Plata, Germán; Rodríguez-Zapata, Fausto; Seki, Motoaki; Salcedo, Andrés; Toyoda, Atsushi; Ishiwata, Atsushi; Tohme, Joe; Sakaki, Yoshiyuki; Shinozaki, Kazuo; Ishitani, Manabu
2007-01-01
Background Cassava, an allotetraploid known for its remarkable tolerance to abiotic stresses is an important source of energy for humans and animals and a raw material for many industrial processes. A full-length cDNA library of cassava plants under normal, heat, drought, aluminum and post harvest physiological deterioration conditions was built; 19968 clones were sequence-characterized using expressed sequence tags (ESTs). Results The ESTs were assembled into 6355 contigs and 9026 singletons that were further grouped into 10577 scaffolds; we found 4621 new cassava sequences and 1521 sequences with no significant similarity to plant protein databases. Transcripts of 7796 distinct genes were captured and we were able to assign a functional classification to 78% of them while finding more than half of the enzymes annotated in metabolic pathways in Arabidopsis. The annotation of sequences that were not paired to transcripts of other species included many stress-related functional categories showing that our library is enriched with stress-induced genes. Finally, we detected 230 putative gene duplications that include key enzymes in reactive oxygen species signaling pathways and could play a role in cassava stress response features. Conclusion The cassava full-length cDNA library here presented contains transcripts of genes involved in stress response as well as genes important for different areas of cassava research. This library will be an important resource for gene discovery, characterization and cloning; in the near future it will aid the annotation of the cassava genome. PMID:18096061
NASA Astrophysics Data System (ADS)
Tomko, Timothy
Microorganisms are capable of producing advanced biofuels that can be used as 'drop-in' alternatives to conventional liquid fuels. However, vital physiological processes and membrane properties are often disrupted by the presence of biofuel and limit the production yields. In order to make microbial biofuels a competitive fuel source, finding mechanisms for improving resistance to the toxic effects of biofuel production is vital. This investigation aims to identify resistance mechanisms from microorganisms that have evolved to withstand hydrocarbon-rich environments, such as those that thrive near natural oil seeps and in oil-polluted waters. First, using genomic DNA from Marinobacter aquaeolei, we constructed a transgenic library that we expressed in Escherichia coli. We exposed cells to inhibitory levels of pinene, a monoterpene that can serve as a jet fuel precursor with chemical properties similar to existing tactical fuels. Using a sequential strategy of a fosmid library followed by a plasmid library, we were able to isolate a region of DNA from the M. aquaeolei genome that conferred pinene tolerance when expressed in E. coli. We determined that a single gene, yceI, was responsible for the tolerance improvements. Overexpression of this gene placed no additional burden on the host. We also tested tolerance to other monoterpenes and showed that yceI selectively improves tolerance. Additionally, we used genomic DNA from Pseudomonas putida KT2440, which has innate solvent-tolerance properties, to create transgenic libraries in an E. coli host. We exposed cells containing the library to pinene, selecting for genes that improved tolerance. Importantly, we found that expressing the sigma factor RpoD from P. putida greatly expanded the diversity of tolerance genes recovered. With low expression of rpoDP. putida, we isolated a single pinene tolerance gene; with increased expression of the sigma factor our selection experiments returned multiple distinct tolerance mechanisms, including some that have been previously documented and also new mechanisms. Interestingly, high levels of rpoDP. putida, induction resulted in decreased diversity. We found that the tolerance levels provided by some genes are highly sensitive to the level of induction of rpoD P. putida,, while others provide tolerance across a wide range of rpoDP. putida, levels. This method for unlocking diversity in tolerance screening using heterologous sigma factor expression was applicable to both plasmid and fosmid-based transgenic libraries. These results suggest that by controlling the expression of appropriate heterologous sigma factors, we can greatly increase the searchable genomic space within transgenic libraries. This dissertation describes a method of effectively screening genomic DNA from multiple organisms for genes to mitigate biofuel stress and shows how tolerance genes can improve bacterial growth in the presence of toxic biofuel compounds. These identified genes can be targeted in future studies as candidates for use in biofuel production strains to increase biofuel yields.
High yield of functional metagenomic library from mangroves constructed in fosmid vector.
Gonçalves, A C S; dos Santos, A C F; dos Santos, T F; Pessoa, T B A; Dias, J C T; Rezende, R P
2015-10-02
In the present study, metagenomic technique and fosmid vectors were used to construct a library of clones for exploring the biotechnological potential of mangrove soils by isolation of functional genes encoding hydrolytic enzymes. The library was built with genomic DNA from the soil samples of mangrove sediments and the functional screening of 1824 clones (~64 Mbp) was performed to detect the hydrolytic activity specific for cellulases, amylases (at acidic, neutral and basic pH), lipases/esterases, proteases, and nitrilases. Significant numbers of clones, positive for the tested enzyme activities were obtained. Our results indicate the importance and biotechnological potential of mangrove soils especially when compared to those obtained using other soil metagenomic libraries.
Langevin, Stanley A.; Bent, Zachary W.; Solberg, Owen D.; Curtis, Deanna J.; Lane, Pamela D.; Williams, Kelly P.; Schoeniger, Joseph S.; Sinha, Anupama; Lane, Todd W.; Branda, Steven S.
2013-01-01
Use of second generation sequencing (SGS) technologies for transcriptional profiling (RNA-Seq) has revolutionized transcriptomics, enabling measurement of RNA abundances with unprecedented specificity and sensitivity and the discovery of novel RNA species. Preparation of RNA-Seq libraries requires conversion of the RNA starting material into cDNA flanked by platform-specific adaptor sequences. Each of the published methods and commercial kits currently available for RNA-Seq library preparation suffers from at least one major drawback, including long processing times, large starting material requirements, uneven coverage, loss of strand information and high cost. We report the development of a new RNA-Seq library preparation technique that produces representative, strand-specific RNA-Seq libraries from small amounts of starting material in a fast, simple and cost-effective manner. Additionally, we have developed a new quantitative PCR-based assay for precisely determining the number of PCR cycles to perform for optimal enrichment of the final library, a key step in all SGS library preparation workflows. PMID:23558773
Laurie, Matthew T; Bertout, Jessica A; Taylor, Sean D; Burton, Joshua N; Shendure, Jay A; Bielas, Jason H
2013-08-01
Due to the high cost of failed runs and suboptimal data yields, quantification and determination of fragment size range are crucial steps in the library preparation process for massively parallel sequencing (or next-generation sequencing). Current library quality control methods commonly involve quantification using real-time quantitative PCR and size determination using gel or capillary electrophoresis. These methods are laborious and subject to a number of significant limitations that can make library calibration unreliable. Herein, we propose and test an alternative method for quality control of sequencing libraries using droplet digital PCR (ddPCR). By exploiting a correlation we have discovered between droplet fluorescence and amplicon size, we achieve the joint quantification and size determination of target DNA with a single ddPCR assay. We demonstrate the accuracy and precision of applying this method to the preparation of sequencing libraries.
Janku, Filip; Zhang, Shile; Waters, Jill; Liu, Li; Huang, Helen J; Subbiah, Vivek; Hong, David S; Karp, Daniel D; Fu, Siqing; Cai, Xuyu; Ramzanali, Nishma M; Madwani, Kiran; Cabrilo, Goran; Andrews, Debra L; Zhao, Yue; Javle, Milind; Kopetz, E Scott; Luthra, Rajyalakshmi; Kim, Hyunsung J; Gnerre, Sante; Satya, Ravi Vijaya; Chuang, Han-Yu; Kruglyak, Kristina M; Toung, Jonathan; Zhao, Chen; Shen, Richard; Heymach, John V; Meric-Bernstam, Funda; Mills, Gordon B; Fan, Jian-Bing; Salathia, Neeraj S
2017-09-15
Purpose: Tumor-derived cell-free DNA (cfDNA) in plasma can be used for molecular testing and provide an attractive alternative to tumor tissue. Commonly used PCR-based technologies can test for limited number of alterations at the time. Therefore, novel ultrasensitive technologies capable of testing for a broad spectrum of molecular alterations are needed to further personalized cancer therapy. Experimental Design: We developed a highly sensitive ultradeep next-generation sequencing (NGS) assay using reagents from TruSeqNano library preparation and NexteraRapid Capture target enrichment kits to generate plasma cfDNA sequencing libraries for mutational analysis in 61 cancer-related genes using common bioinformatics tools. The results were retrospectively compared with molecular testing of archival primary or metastatic tumor tissue obtained at different points of clinical care. Results: In a study of 55 patients with advanced cancer, the ultradeep NGS assay detected 82% (complete detection) to 87% (complete and partial detection) of the aberrations identified in discordantly collected corresponding archival tumor tissue. Patients with a low variant allele frequency (VAF) of mutant cfDNA survived longer than those with a high VAF did ( P = 0.018). In patients undergoing systemic therapy, radiological response was positively associated with changes in cfDNA VAF ( P = 0.02), and compared with unchanged/increased mutant cfDNA VAF, decreased cfDNA VAF was associated with longer time to treatment failure (TTF; P = 0.03). Conclusions: Ultradeep NGS assay has good sensitivity compared with conventional clinical mutation testing of archival specimens. A high VAF in mutant cfDNA corresponded with shorter survival. Changes in VAF of mutated cfDNA were associated with TTF. Clin Cancer Res; 23(18); 5648-56. ©2017 AACR . ©2017 American Association for Cancer Research.
Ottoni, Júlia Ronzella; Cabral, Lucélia; de Sousa, Sanderson Tarciso Pereira; Júnior, Gileno Vieira Lacerda; Domingos, Daniela Ferreira; Soares Junior, Fábio Lino; da Silva, Mylenne Calciolari Pinheiro; Marcon, Joelma; Dias, Armando Cavalcante Franco; de Melo, Itamar Soares; de Souza, Anete Pereira; Andreote, Fernando Dini; de Oliveira, Valéria Maia
2017-07-01
Mangroves are located in coastal wetlands and are susceptible to the consequences of oil spills, what may threaten the diversity of microorganisms responsible for the nutrient cycling and the consequent ecosystem functioning. Previous reports show that high concentration of oil favors the incidence of epoxide hydrolases and haloalkane dehalogenases in mangroves. This finding has guided the goals of this study in an attempt to broaden the analysis to other hydrolases and thereby verify whether oil contamination interferes with the prevalence of particular hydrolases and their assigned microorganisms. For this, an in-depth survey of the taxonomic and functional microbial diversity recovered in a fosmid library (Library_Oil Mgv) constructed from oil-impacted Brazilian mangrove sediment was carried out. Fosmid DNA of the whole library was extracted and submitted to Illumina HiSeq sequencing. The resulting Library Oil_Mgv dataset was further compared with those obtained by direct sequencing of environmental DNA from Brazilian mangroves (from distinct regions and affected by distinct sources of contamination), focusing on hydrolases with potential use in biotechnological processes. The most abundant hydrolases found were proteases, esterases and amylases, with similar occurrence profile in all datasets. The main microbial groups harboring such hydrolase-encoding genes were distinct in each mangrove, and in the fosmid library these enzymes were mainly assigned to Chloroflexaceae (for amylases), Planctomycetaceae (for esterases) and Bradyrhizobiaceae (for proteases). Assembly and analysis of Library_Oil Mgv reads revealed three potentially novel enzymes, one epoxide hydrolase, one xylanase and one amylase, to be further investigated via heterologous expression assays.
USDA-ARS?s Scientific Manuscript database
Trichinella spiralis infection confers effective resistance to tumor cell expansion. In this study, a T7 phage cDNA display library was constructed to express genes encoded by T. spiralis. Organic phase multi-cell screening was used to sort through candidate proteins in a transfected human chronic m...
Comparative Analysis of Expressed Genes from Cacao Meristems Infected by Moniliophthora perniciosa
Gesteira, Abelmon S.; Micheli, Fabienne; Carels, Nicolas; Da Silva, Aline C.; Gramacho, Karina P.; Schuster, Ivan; Macêdo, Joci N.; Pereira, Gonçalo A. G.; Cascardo, Júlio C. M.
2007-01-01
Background and Aims Witches' broom disease is caused by the hemibiotrophic basidiomycete Moniliophthora perniciosa, and is one of the most important diseases of cacao in the western hemisphere. Because very little is known about the global process of such disease development, expressed sequence tags (ESTs) were used to identify genes expressed during the Theobroma cacao–Moniliophthora perniciosa interaction. Methods Two cDNA libraries corresponding to the resistant (RT) and susceptible (SP) cacao–M. perniciosa interactions were constructed from total RNA, using the DB SMART Creator cDNA library kit (Clontech). Clones were randomly selected, sequenced from the 5′ end and analysed using bioinformatics tools including in silico analysis of the differential gene expression. Key Results A total of 6884 ESTs were generated from the RT and SP cDNA libraries. These ESTs were composed of 2585 singlets and 341 contigs for a total of 2926 non-redundant sequences. The redundancy of the libraries was low and their specificity high when compared with the few other cacao libraries already published. Sequence analysis allowed the assignment of a putative functional category for 54 % of sequences, whereas approx. 22 % of sequences corresponded to unknown function and approx. 24 % of sequences did not show any significant similarity with other proteins present in the database. Despite the similar overall distribution of the sequences in functional categories between the two libraries, qualitative differences were observed. Genes involved during the defence response to pathogen infection or in programmed cell death were identified, such as pathogenesis related-proteins, trypsin inhibitor or oxalate oxidase, and some of them showed an in silico differential expression between the resistant and the susceptible interactions. Conclusions As far as is known this is the first EST resource from the cacao–M. perniciosa interaction and it is believed that it will provide a significant contribution to the understanding of the molecular mechanisms of the resistance and susceptibility of cacao to M. perniciosa, to develop strategies to control witches broom, and as a source of polymorphism for molecular marker development and marker-assisted selection. PMID:17557832
Camanocha, Anuj; Dewhirst, Floyd E.
2014-01-01
Background and objective In addition to the well-known phyla Firmicutes, Proteobacteria, Bacteroidetes, Actinobacteria, Spirochaetes, Fusobacteria, Tenericutes, and Chylamydiae, the oral microbiomes of mammals contain species from the lesser-known phyla or candidate divisions, including Synergistetes, TM7, Chlorobi, Chloroflexi, GN02, SR1, and WPS-2. The objectives of this study were to create phyla-selective 16S rDNA PCR primer pairs, create selective 16S rDNA clone libraries, identify novel oral taxa, and update canine and human oral microbiome databases. Design 16S rRNA gene sequences for members of the lesser-known phyla were downloaded from GenBank and Greengenes databases and aligned with sequences in our RNA databases. Primers with potential phylum level selectivity were designed heuristically with the goal of producing nearly full-length 16S rDNA amplicons. The specificity of primer pairs was examined by making clone libraries from PCR amplicons and determining phyla identity by BLASTN analysis. Results Phylum-selective primer pairs were identified that allowed construction of clone libraries with 96–100% specificity for each of the lesser-known phyla. From these clone libraries, seven human and two canine novel oral taxa were identified and added to their respective taxonomic databases. For each phylum, genome sequences closest to human oral taxa were identified and added to the Human Oral Microbiome Database to facilitate metagenomic, transcriptomic, and proteomic studies that involve tiling sequences to the most closely related taxon. While examining ribosomal operons in lesser-known phyla from single-cell genomes and metagenomes, we identified a novel rRNA operon order (23S-5S-16S) in three SR1 genomes and the splitting of the 23S rRNA gene by an I-CeuI-like homing endonuclease in a WPS-2 genome. Conclusions This study developed useful primer pairs for making phylum-selective 16S rRNA clone libraries. Phylum-specific libraries were shown to be useful for identifying previously unrecognized taxa in lesser-known phyla and would be useful for future environmental and host-associated studies. PMID:25317252
2013-01-01
Background To understand the carcinogenesis caused by accumulated genetic and epigenetic alterations and seek novel biomarkers for various cancers, studying differentially expressed genes between cancerous and normal tissues is crucial. In the study, two cDNA libraries of lung cancer were constructed and screened for identification of differentially expressed genes. Methods Two cDNA libraries of differentially expressed genes were constructed using lung adenocarcinoma tissue and adjacent nonmalignant lung tissue by suppression subtractive hybridization. The data of the cDNA libraries were then analyzed and compared using bioinformatics analysis. Levels of mRNA and protein were measured by quantitative real-time polymerase chain reaction (q-RT-PCR) and western blot respectively, as well as expression and localization of proteins were determined by immunostaining. Gene functions were investigated using proliferation and migration assays after gene silencing and gene over-expression. Results Two libraries of differentially expressed genes were obtained. The forward-subtracted library (FSL) and the reverse-subtracted library (RSL) contained 177 and 59 genes, respectively. Bioinformatic analysis demonstrated that these genes were involved in a wide range of cellular functions. The vast majority of these genes were newly identified to be abnormally expressed in lung cancer. In the first stage of the screening for 16 genes, we compared lung cancer tissues with their adjacent non-malignant tissues at the mRNA level, and found six genes (ERGIC3, DDR1, HSP90B1, SDC1, RPSA, and LPCAT1) from the FSL were significantly up-regulated while two genes (GPX3 and TIMP3) from the RSL were significantly down-regulated (P < 0.05). The ERGIC3 protein was also over-expressed in lung cancer tissues and cultured cells, and expression of ERGIC3 was correlated with the differentiated degree and histological type of lung cancer. The up-regulation of ERGIC3 could promote cellular migration and proliferation in vitro. Conclusions The two libraries of differentially expressed genes may provide the basis for new insights or clues for finding novel lung cancer-related genes; several genes were newly found in lung cancer with ERGIC3 seeming a novel lung cancer-related gene. ERGIC3 may play an active role in the development and progression of lung cancer. PMID:23374247
DNA probe for lactobacillus delbrueckii
DOE Office of Scientific and Technical Information (OSTI.GOV)
Delley, M.; Mollet, B.; Hottinger, H.
1990-06-01
From a genomic DNA library of Lactobacillus delbrueckii subsp. bulgaricus, a clone was isolated which complements a leucine auxotrophy of an Escherichia coli strain (GE891). Subsequent analysis of the clone indicated that it could serve as a specific DNA probe. Dot-blot hybridizations with over 40 different Lactobacillus strains showed that this clone specifically recognized L. delbrueckii subsp. delbrueckii, bulgaricus, and lactis. The sensitivity of the method was tested by using an {alpha}-{sup 32}P-labeled probe.
Langevin, Stanley A; Bent, Zachary W; Solberg, Owen D; Curtis, Deanna J; Lane, Pamela D; Williams, Kelly P; Schoeniger, Joseph S; Sinha, Anupama; Lane, Todd W; Branda, Steven S
2013-04-01
Use of second generation sequencing (SGS) technologies for transcriptional profiling (RNA-Seq) has revolutionized transcriptomics, enabling measurement of RNA abundances with unprecedented specificity and sensitivity and the discovery of novel RNA species. Preparation of RNA-Seq libraries requires conversion of the RNA starting material into cDNA flanked by platform-specific adaptor sequences. Each of the published methods and commercial kits currently available for RNA-Seq library preparation suffers from at least one major drawback, including long processing times, large starting material requirements, uneven coverage, loss of strand information and high cost. We report the development of a new RNA-Seq library preparation technique that produces representative, strand-specific RNA-Seq libraries from small amounts of starting material in a fast, simple and cost-effective manner. Additionally, we have developed a new quantitative PCR-based assay for precisely determining the number of PCR cycles to perform for optimal enrichment of the final library, a key step in all SGS library preparation workflows.
Schäffer, Sylvia; Zachos, Frank E.
2017-01-01
DNA-barcoding is a rapidly developing method for efficiently identifying samples to species level by means of short standard DNA sequences. However, reliable species assignment requires the availability of a comprehensive DNA barcode reference library, and hence numerous initiatives aim at generating such barcode databases for particular taxa or geographic regions. Historical museum collections represent a potentially invaluable source for the DNA-barcoding of many taxa. This is particularly true for birds and mammals, for which collecting fresh (voucher) material is often very difficult to (nearly) impossible due to the special animal welfare and conservation regulations that apply to vertebrates in general, and birds and mammals in particular. Moreover, even great efforts might not guarantee sufficiently complete sampling of fresh material in a short period of time. DNA extracted from historical samples is usually degraded, such that only short fragments can be amplified, rendering the recovery of the barcoding region as a single fragment impossible. Here, we present a new set of primers that allows the efficient amplification and sequencing of the entire barcoding region in most higher taxa of Central European birds and mammals in six overlapping fragments, thus greatly increasing the value of historical museum collections for generating DNA barcode reference libraries. Applying our new primer set in recently established NGS protocols promises to further increase the efficiency of barcoding old bird and mammal specimens. PMID:28358863
Schäffer, Sylvia; Zachos, Frank E; Koblmüller, Stephan
2017-01-01
DNA-barcoding is a rapidly developing method for efficiently identifying samples to species level by means of short standard DNA sequences. However, reliable species assignment requires the availability of a comprehensive DNA barcode reference library, and hence numerous initiatives aim at generating such barcode databases for particular taxa or geographic regions. Historical museum collections represent a potentially invaluable source for the DNA-barcoding of many taxa. This is particularly true for birds and mammals, for which collecting fresh (voucher) material is often very difficult to (nearly) impossible due to the special animal welfare and conservation regulations that apply to vertebrates in general, and birds and mammals in particular. Moreover, even great efforts might not guarantee sufficiently complete sampling of fresh material in a short period of time. DNA extracted from historical samples is usually degraded, such that only short fragments can be amplified, rendering the recovery of the barcoding region as a single fragment impossible. Here, we present a new set of primers that allows the efficient amplification and sequencing of the entire barcoding region in most higher taxa of Central European birds and mammals in six overlapping fragments, thus greatly increasing the value of historical museum collections for generating DNA barcode reference libraries. Applying our new primer set in recently established NGS protocols promises to further increase the efficiency of barcoding old bird and mammal specimens.
Magic Pools: Parallel Assessment of Transposon Delivery Vectors in Bacteria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Hualan; Price, Morgan N.; Waters, Robert Jordan
Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term “magic pools.” Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. Tomore » identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylumBacteroidetes. IMPORTANCEMolecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a “magic pool.” The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA “parts,” we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.« less
Magic Pools: Parallel Assessment of Transposon Delivery Vectors in Bacteria
Liu, Hualan; Price, Morgan N.; Waters, Robert Jordan; ...
2018-01-16
Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term “magic pools.” Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. Tomore » identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylumBacteroidetes. IMPORTANCEMolecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a “magic pool.” The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA “parts,” we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.« less
Single-cell genomic sequencing using Multiple Displacement Amplification.
Lasken, Roger S
2007-10-01
Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
Sugai, Kyoko; Setsuko, Suzuki; Uchiyama, Kentaro; Murakami, Noriaki; Kato, Hidetoshi; Yoshimaru, Hiroshi
2012-02-01
Expressed sequence tag (EST)-derived microsatellite markers were developed for Elaeocarpus photiniifolia, an endemic taxon of the Bonin Islands. Initially, a complementary DNA (cDNA) library was constructed by de novo pyrosequencing of total RNA extracted from a seedling. A total of 267 primer pairs were designed from the library. Of the 48 tested loci, 25 loci were polymorphic among 41 individuals representing the entire geographical range of the species, with the number of alleles per locus and expected heterozygosity ranging from two to 14 and 0.09 to 0.86, respectively. Most loci were transferable to a related species, E. sylvestris. The developed markers will be useful for evaluating the genetic structure of E. photiniifolia.
Screening and analyzing genes associated with Amur tiger placental development.
Li, Q; Lu, T F; Liu, D; Hu, P F; Sun, B; Ma, J Z; Wang, W J; Wang, K F; Zhang, W X; Chen, J; Guan, W J; Ma, Y H; Zhang, M H
2014-09-26
The Amur tiger is a unique endangered species in the world, and thus, protection of its genetic resources is extremely important. In this study, an Amur tiger placenta cDNA library was constructed using the SMART cDNA Library Construction kit. A total of 508 colonies were sequenced, in which 205 (76%) genes were annotated and mapped to 74 KEGG pathways, including 29 metabolism, 29 genetic information processing, 4 environmental information processing, 7 cell motility, and 5 organismal system pathways. Additionally, PLAC8, PEG10 and IGF-II were identified after screening genes from the expressed sequence tags, and they were associated with placental development. These findings could lay the foundation for future functional genomic studies of the Amur tiger.
Goetz, Frederick W; Norberg, Birgitta; McCauley, Linda A R; Iliev, Dimitar B
2004-03-01
The full-length cDNA for the cod (Gadus morhua) StAR was cloned by RT-PCR and library screening using ovarian RNA. From the library screening, 2 size classes of cDNA were obtained; a 1577 bp cDNA (cStAR1) and a 2851 bp cDNA (cStAR2). The cStAR1 cDNA presumably encodes a protein of 286 amino acids. The cStAR2 cDNA was composed of 6 separated sequences that contained all of the coding regions of cStAR1 when added together, but also contained 5 noncoding regions not observed in cStAR1. Polymerase chain reactions of cod genomic DNA produced products slightly larger than cStAR2. The sequence of these products were the same as cStAR2 but revealed one additional noncoding region (intron). Thus, the fish StAR gene contains the same number of exons (7) and introns (6) as observed in mammals, but is approximately half the size of the mammalian gene. Using Northern analysis and RT-PCR, cStAR1 expression was observed only in testes, ovaries and head kidneys. Polymerase chain reaction products were also observed using cDNA from steroidogenic tissues and primers designed to regions specific for cStAR2, indicating that cStAR2 is expressed in tissues and may account for the presence of larger transcripts observed on Northern blots.
Roehner, Nicholas; Myers, Chris J
2014-02-21
Recently, we have begun to witness the potential of synthetic biology, noted here in the form of bacteria and yeast that have been genetically engineered to produce biofuels, manufacture drug precursors, and even invade tumor cells. The success of these projects, however, has often failed in translation and application to new projects, a problem exacerbated by a lack of engineering standards that combine descriptions of the structure and function of DNA. To address this need, this paper describes a methodology to connect the systems biology markup language (SBML) to the synthetic biology open language (SBOL), existing standards that describe biochemical models and DNA components, respectively. Our methodology involves first annotating SBML model elements such as species and reactions with SBOL DNA components. A graph is then constructed from the model, with vertices corresponding to elements within the model and edges corresponding to the cause-and-effect relationships between these elements. Lastly, the graph is traversed to assemble the annotating DNA components into a composite DNA component, which is used to annotate the model itself and can be referenced by other composite models and DNA components. In this way, our methodology can be used to build up a hierarchical library of models annotated with DNA components. Such a library is a useful input to any future genetic technology mapping algorithm that would automate the process of composing DNA components to satisfy a behavioral specification. Our methodology for SBML-to-SBOL annotation is implemented in the latest version of our genetic design automation (GDA) software tool, iBioSim.
Blakney, Anna K; Yilmaz, Gokhan; McKay, Paul F; Becer, C Remzi; Shattock, Robin J
2018-05-03
Nucleic acid delivery systems are commonly translated between different modalities, such as DNA and RNA of varying length and structure, despite physical differences in these molecules that yield disparate delivery efficiency with the same system. Here, we synthesized a library of poly(2-ethyl-2-oxazoline)/poly(ethylene imine) copolymers with varying molar mass and charge densities in order to probe how pDNA, mRNA, and RepRNA polyplex characteristics affect transfection efficiency. The library was utilized in a full factorial design of experiment (DoE) screening, with outputs of luciferase expression, particle size, surface charge, and particle concentration. The optimal copolymer molar mass and charge density was found as 83 kDa/100%, 72 kDa/100%, and 45 kDa/80% for pDNA, RepRNA, and mRNA, respectively. While 10 of the synthesized copolymers enhanced the transfection efficiency of pDNA and mRNA, only 2 copolymers enhanced RepRNA transfection efficiency, indicating a narrow and more stringent design space for RepRNA. These findings suggest that there is not a "one size fits all" polymer for different nucleic acid species.
Sun, Wei; Dai, Shikun; Jiang, Shumei; Wang, Guanghua; Liu, Guohui; Wu, Houbo; Li, Xiang
2010-06-01
In this report, the diversity of Actinobacteria associated with the marine sponge Hymeniacidon perleve collected from a remote island of the South China Sea was investigated employing classical cultivation and characterization, 16S rDNA library construction, 16S rDNA-restriction fragment length polymorphism (rDNA-RFLP) and phylogenetic analysis. A total of 184 strains were isolated using seven different media and 24 isolates were selected according to their morphological characteristics for phylogenetic analysis on the basis of their 16S rRNA gene sequences. Results showed that the 24 isolates were assigned to six genera including Salinispora, Gordonia, Mycobacterium, Nocardia, Rhodococcus and Streptomyces. This is the first report that Salinispora is present in a marine sponge from the South China Sea. Subsequently, 26 rDNA clones were selected from 191 clones in an Actinobacteria-specific 16S rDNA library of the H. perleve sample, using the RFLP technique for sequencing and phylogenetic analysis. In total, 26 phylotypes were clustered in eight known genera of Actinobacteria including Mycobacterium, Amycolatopsis, Arthrobacter, Brevibacterium, Microlunatus, Nocardioides, Pseudonocardia and Streptomyces. This study contributes to our understanding of actinobacterial diversity in the marine sponge H. perleve from the South China Sea.
A Versatile Microfluidic Device for Automating Synthetic Biology.
Shih, Steve C C; Goyal, Garima; Kim, Peter W; Koutsoubelis, Nicolas; Keasling, Jay D; Adams, Paul D; Hillson, Nathan J; Singh, Anup K
2015-10-16
New microbes are being engineered that contain the genetic circuitry, metabolic pathways, and other cellular functions required for a wide range of applications such as producing biofuels, biobased chemicals, and pharmaceuticals. Although currently available tools are useful in improving the synthetic biology process, further improvements in physical automation would help to lower the barrier of entry into this field. We present an innovative microfluidic platform for assembling DNA fragments with 10× lower volumes (compared to that of current microfluidic platforms) and with integrated region-specific temperature control and on-chip transformation. Integration of these steps minimizes the loss of reagents and products compared to that with conventional methods, which require multiple pipetting steps. For assembling DNA fragments, we implemented three commonly used DNA assembly protocols on our microfluidic device: Golden Gate assembly, Gibson assembly, and yeast assembly (i.e., TAR cloning, DNA Assembler). We demonstrate the utility of these methods by assembling two combinatorial libraries of 16 plasmids each. Each DNA plasmid is transformed into Escherichia coli or Saccharomyces cerevisiae using on-chip electroporation and further sequenced to verify the assembly. We anticipate that this platform will enable new research that can integrate this automated microfluidic platform to generate large combinatorial libraries of plasmids and will help to expedite the overall synthetic biology process.
NASA Astrophysics Data System (ADS)
Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.
1984-08-01
A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.
Kim, Stephanie; Eliot, Melissa; Koestler, Devin C; Houseman, Eugene A; Wetmur, James G; Wiencke, John K; Kelsey, Karl T
2016-09-01
We examined whether variation in blood-based epigenome-wide association studies could be more completely explained by augmenting existing reference DNA methylation libraries. We compared existing and enhanced libraries in predicting variability in three publicly available 450K methylation datasets that collected whole-blood samples. Models were fit separately to each CpG site and used to estimate the additional variability when adjustments for cell composition were made with each library. Calculation of the mean difference in the CpG-specific residual sums of squares error between models for an arthritis, aging and metabolic syndrome dataset, indicated that an enhanced library explained significantly more variation across all three datasets (p < 10(-3)). Pathologically important immune cell subtypes can explain important variability in epigenome-wide association studies done in blood.
Molecular cloning and characterization of Hymenolepis diminuta alpha-tubulin gene.
Mohajer-Maghari, Behrokh; Amini-Bavil-Olyaee, Samad; Webb, Rodney A; Coe, Imogen R
2007-02-01
To isolate a full-length alpha-tubulin cDNA from an eucestode, Hymenolepis diminuta, a lambda phage cDNA library was constructed. The alpha-tubulin gene was cloned, sequenced and characterized. The H. diminuta alpha-tubulin consisted of 450 amino acids. This protein contained putative sites for all posttranslational modifications as detyrosination/tyrosination at the carboxyl-terminal of protien, phosphorylation at residues R79 and K336, glycylation/glutamylation at residue G445 and acetylation at residue K40. Comparisons of H. diminuta alpha-tubulin with all full-length alpha-tubulin proteins revealed that H. diminuta alpha-tubulin possesses 10 distinctive residues, which are not found in any other alpha-tubulins. Phylogenetic analysis showed that H. diminuta alpha-tubulin has grouped in a separated branch adjacent eucestode and trematodes branch with 92% bootstrap value (1000 replicates). In conclusion, this is the first report of H. diminuta cDNA library construction, cloning and characterization of H. diminuta alpha-tubulin gene.
Reducing DNA context dependence in bacterial promoters
Carr, Swati B.; Densmore, Douglas M.
2017-01-01
Variation in the DNA sequence upstream of bacterial promoters is known to affect the expression levels of the products they regulate, sometimes dramatically. While neutral synthetic insulator sequences have been found to buffer promoters from upstream DNA context, there are no established methods for designing effective insulator sequences with predictable effects on expression levels. We address this problem with Degenerate Insulation Screening (DIS), a novel method based on a randomized 36-nucleotide insulator library and a simple, high-throughput, flow-cytometry-based screen that randomly samples from a library of 436 potential insulated promoters. The results of this screen can then be compared against a reference uninsulated device to select a set of insulated promoters providing a precise level of expression. We verify this method by insulating the constitutive, inducible, and repressible promotors of a four transcriptional-unit inverter (NOT-gate) circuit, finding both that order dependence is largely eliminated by insulation and that circuit performance is also significantly improved, with a 5.8-fold mean improvement in on/off ratio. PMID:28422998
Guo, Fei; Yu, Jiao; Zhang, Lu; Li, Jun
2017-11-01
The ForenSeq™ DNA Signature Prep Kit (ForenSeq Kit) is designed to detect more than 200 forensically relevant markers in a single reaction on the MiSeq FGx™ Forensic Genomics System (MiSeq FGx System), including Amelogenin, 27 autosomal short tandem repeats (A-STRs), 7 X chromosomal STRs (X-STRs), 24 Y chromosomal STRs (Y-STRs) and 94 identity-informative single nucleotide polymorphisms (iSNPs) with the option to contain 22 phenotypic-informative SNPs (pSNPs) and 56 ancestry-informative SNPs (aSNPs). In this study, we evaluated the MiSeq FGx System on three major parts: methodological optimization (DNA extraction, sample quantification, library normalization, diluted libraries concentration, and sample-to-cell arrangement), massively parallel sequencing (MPS) performance (depth of coverage, sequence coverage ratio, and allele coverage ratio), and ForenSeq Kit characteristics (repeatability and concordance, sensitivity, mixture, stability and case-type samples). Results showed that quantitative polymerase chain reaction (qPCR)-based sample quantification and library normalization and the appropriate number of pooled libraries and concentration of diluted libraries provided a greater level of MPS performance and repeatability. Repeatable and concordant genotypes were obtained by the ForenSeq Kit. Full profiles were obtained from ≥100pg input DNA for STRs and ≥200pg for SNPs. A sample with ≥5% minor contributors was considered as a mixture by imbalanced allele coverage ratio distribution, and full profiles from minor contributors were easily detected between 9:1 and 1:9 mixtures with known reference profiles. The ForenSeq Kit tolerated considerable concentrations of inhibitors like ≤200μM hematin and ≤50μg/ml humic acid, and >56% STR profiles and >88% SNP profiles were obtained from ≥200-bp degraded samples. Also, it was adapted to case-type samples. As a whole, the ForenSeq Kit is a well-performed, robust, reliable, reproducible and highly informative assay, and it can fully meet requirements for human identification. Further, sensitive QC indicator and automated sample comparison function in the ForenSeq™ Universal Analysis Software are quite helpful, so that we can concentrate on questionable genotypes and avoid tedious and time-consuming labor to maximum the time spent in data analysis. Copyright © 2017 Elsevier B.V. All rights reserved.
Krefft, Daria; Papkov, Aliaksei; Prusinowski, Maciej; Zylicz-Stachula, Agnieszka; Skowron, Piotr M
2018-05-11
Acoustic or hydrodynamic shearing, sonication and enzymatic digestion are used to fragment DNA. However, these methods have several disadvantages, such as DNA damage, difficulties in fragmentation control, irreproducibility and under-representation of some DNA segments. The DNA fragmentation tool would be a gentle enzymatic method, offering cleavage frequency high enough to eliminate DNA fragments distribution bias and allow for easy control of partial digests. Only three such frequently cleaving natural restriction endonucleases (REases) were discovered: CviJI, SetI and FaiI. Therefore, we have previously developed two artificial enzymatic specificities, cleaving DNA approximately every ~ 3-bp: TspGWI/sinefungin (SIN) and TaqII/SIN. In this paper we present the third developed specificity: TthHB27I/SIN(SAM) - a new genomic tool, based on Type IIS/IIC/IIG Thermus-family REases-methyltransferases (MTases). In the presence of dimethyl sulfoxide (DMSO) and S-adenosyl-L-methionine (SAM) or its analogue SIN, the 6-bp cognate TthHB27I recognition sequence 5'-CAARCA-3' is converted into a combined 3.2-3.0-bp 'site' or its statistical equivalent, while a cleavage distance of 11/9 nt is retained. Protocols for various modes of limited DNA digestions were developed. In the presence of DMSO and SAM or SIN, TthHB27I is transformed from rare 6-bp cutter to a very frequent one, approximately 3-bp. Thus, TthHB27I/SIN(SAM) comprises a new tool in the very low-represented segment of such prototype REases specificities. Moreover, this modified TthHB27I enzyme is uniquely suited for controlled DNA fragmentation, due to partial DNA cleavage, which is an inherent feature of the Thermus-family enzymes. Such tool can be used for quasi-random libraries generation as well as for other DNA manipulations, requiring high frequency cleavage and uniform distribution of cuts along DNA.
Determining the Location of DNA Modification and Mutation Caused by UVB Light in Skin Cancer
2015-09-01
Award Number: W81XWH-12-1-0333 TITLE: Determining the Location of DNA Modification and Mutation Caused by UVB Light in Skin Cancer PRINCIPAL...COVERED 15 Aug 2012 – 14 Aug 2015 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER W81XWH-12-1-0333 Determining the Location of DNA Modification and Mutation ...sequencing libraries generated for both yeast and human cells show pyrimidine bias on the 5’ end, indicating that we are sequencing the dimers
Takahara, Hiroyuki; Dolf, Andreas; Endl, Elmar; O'Connell, Richard
2009-08-01
Generation of stage-specific cDNA libraries is a powerful approach to identify pathogen genes that are differentially expressed during plant infection. Biotrophic pathogens develop specialized infection structures inside living plant cells, but sampling the transcriptome of these structures is problematic due to the low ratio of fungal to plant RNA, and the lack of efficient methods to isolate them from infected plants. Here we established a method, based on fluorescence-activated cell sorting (FACS), to purify the intracellular biotrophic hyphae of Colletotrichum higginsianum from homogenates of infected Arabidopsis leaves. Specific selection of viable hyphae using a fluorescent vital marker provided intact RNA for cDNA library construction. Pilot-scale sequencing showed that the library was enriched with plant-induced and pathogenicity-related fungal genes, including some encoding small, soluble secreted proteins that represent candidate fungal effectors. The high purity of the hyphae (94%) prevented contamination of the library by sequences derived from host cells or other fungal cell types. RT-PCR confirmed that genes identified in the FACS-purified hyphae were also expressed in planta. The method has wide applicability for isolating the infection structures of other plant pathogens, and will facilitate cell-specific transcriptome analysis via deep sequencing and microarray hybridization, as well as proteomic analyses.
2010-01-01
Background Little genomic or trancriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was preformed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644
Popp, Nicole; Schlömann, Michael; Mau, Margit
2006-11-01
Soils contaminated with mineral oil hydrocarbons are often cleaned in off-site bioremediation systems. In order to find out which bacteria are active during the degradation phase in such systems, the diversity of the active microflora in a degrading soil remediation system was investigated by small-subunit (SSU) rRNA analysis. Two sequential RNA extracts from one soil sample were generated by a procedure incorporating bead beating. Both extracts were analysed separately by generating individual SSU rDNA clone libraries from cDNA of the two extracts. The sequencing results showed moderate diversity. The two clone libraries were dominated by Gammaproteobacteria, especially Pseudomonas spp. Alphaproteobacteria and Betaproteobacteria were two other large groups in the clone libraries. Actinobacteria, Firmicutes, Bacteroidetes and Epsilonproteobacteria were detected in lower numbers. The obtained sequences were predominantly related to genera for which cultivated representatives have been described, but were often clustered together in the phylogenetic tree, and the sequences that were most similar were originally obtained from soils and not from pure cultures. Most of the dominant genera in the clone libraries, e.g. Pseudomonas, Acinetobacter, Sphingomonas, Acidovorax and Thiobacillus, had already been detected in (mineral oil hydrocarbon) contaminated environmental samples. The occurrence of the genera Zymomonas and Rhodoferax was novel in mineral oil hydrocarbon-contaminated soil.
Devirgiliis, Chiara; Barile, Simona; Perozzi, Giuditta
2014-01-01
Lactic acid bacteria (LAB) represent the predominant microbiota in fermented foods. Foodborne LAB have received increasing attention as potential reservoir of antibiotic resistance (AR) determinants, which may be horizontally transferred to opportunistic pathogens. We have previously reported isolation of AR LAB from the raw ingredients of a fermented cheese, while AR genes could be detected in the final, marketed product only by PCR amplification, thus pointing at the need for more sensitive microbial isolation techniques. We turned therefore to construction of a metagenomic library containing microbial DNA extracted directly from the food matrix. To maximize yield and purity and to ensure that genomic complexity of the library was representative of the original bacterial population, we defined a suitable protocol for total DNA extraction from cheese which can also be applied to other lipid-rich foods. Functional library screening on different antibiotics allowed recovery of ampicillin and kanamycin resistant clones originating from Streptococcus salivarius subsp. thermophilus and Lactobacillus helveticus genomes. We report molecular characterization of the cloned inserts, which were fully sequenced and shown to confer AR phenotype to recipient bacteria. We also show that metagenomics can be applied to food microbiota to identify underrepresented species carrying specific genes of interest. PMID:25243126
Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library.
Krumpe, Lauren R H; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki
2007-10-05
Amino acid sequence diversity is introduced into a phage-displayed peptide library by randomizing library oligonucleotide DNA. We recently evaluated the diversity of peptide libraries displayed on T7 lytic phage and M13 filamentous phage and showed that T7 phage can display a more diverse amino acid sequence repertoire due to differing processes of viral morphogenesis. In this study, we evaluated and compared the diversity of a 12-mer T7 phage-displayed peptide library randomized using codon-corrected trinucleotide cassettes with a T7 and an M13 12-mer phage-displayed peptide library constructed using the degenerate codon randomization method. We herein demonstrate that the combination of trinucleotide cassette amino acid codon randomization and T7 phage display construction methods resulted in a significant enhancement to the functional diversity of a 12-mer peptide library. This novel library exhibited superior amino acid uniformity and order-of-magnitude increases in amino acid sequence diversity as compared to degenerate codon randomized peptide libraries. Comparative analyses of the biophysical characteristics of the 12-mer peptide libraries revealed the trinucleotide cassette-randomized library to be a unique resource. The combination of T7 phage display and trinucleotide cassette randomization resulted in a novel resource for the potential isolation of binding peptides for new and previously studied molecular targets.
Chun, Carlene K; Scheetz, Todd E; Bonaldo, Maria de Fatima; Brown, Bartley; Clemens, Anik; Crookes-Goodson, Wendy J; Crouch, Keith; DeMartini, Tad; Eyestone, Mari; Goodson, Michael S; Janssens, Bernadette; Kimbell, Jennifer L; Koropatnick, Tanya A; Kucaba, Tamara; Smith, Christina; Stewart, Jennifer J; Tong, Deyan; Troll, Joshua V; Webster, Sarahrose; Winhall-Rice, Jane; Yap, Cory; Casavant, Thomas L; McFall-Ngai, Margaret J; Soares, M Bento
2006-01-01
Background Biologists are becoming increasingly aware that the interaction of animals, including humans, with their coevolved bacterial partners is essential for health. This growing awareness has been a driving force for the development of models for the study of beneficial animal-bacterial interactions. In the squid-vibrio model, symbiotic Vibrio fischeri induce dramatic developmental changes in the light organ of host Euprymna scolopes over the first hours to days of their partnership. We report here the creation of a juvenile light-organ specific EST database. Results We generated eleven cDNA libraries from the light organ of E. scolopes at developmentally significant time points with and without colonization by V. fischeri. Single pass 3' sequencing efforts generated 42,564 expressed sequence tags (ESTs) of which 35,421 passed our quality criteria and were then clustered via the UIcluster program into 13,962 nonredundant sequences. The cDNA clones representing these nonredundant sequences were sequenced from the 5' end of the vector and 58% of these resulting sequences overlapped significantly with the associated 3' sequence to generate 8,067 contigs with an average sequence length of 1,065 bp. All sequences were annotated with BLASTX (E-value < -03) and Gene Ontology (GO). Conclusion Both the number of ESTs generated from each library and GO categorizations are reflective of the activity state of the light organ during these early stages of symbiosis. Future analyses of the sequences identified in these libraries promise to provide valuable information not only about pathways involved in colonization and early development of the squid light organ, but also about pathways conserved in response to bacterial colonization across the animal kingdom. PMID:16780587
Rojas-Cartagena, Carmencita; Ortíz-Pineda, Pablo; Ramírez-Gómez, Francisco; Suárez-Castillo, Edna C.; Matos-Cruz, Vanessa; Rodríguez, Carlos; Ortíz-Zuazaga, Humberto; García-Arrarás, José E.
2010-01-01
Repair and regeneration are key processes for tissue maintenance, and their disruption may lead to disease states. Little is known about the molecular mechanisms that underline the repair and regeneration of the digestive tract. The sea cucumber Holothuria glaberrima represents an excellent model to dissect and characterize the molecular events during intestinal regeneration. To study the gene expression profile, cDNA libraries were constructed from normal, 3-day, and 7-day regenerating intestines of H. glaberrima. Clones were randomly sequenced and queried against the nonredundant protein database at the National Center for Biotechnology Information. RT-PCR analyses were made of several genes to determine their expression profile during intestinal regeneration. A total of 5,173 sequences from three cDNA libraries were obtained. About 46.2, 35.6, and 26.2% of the sequences for the normal, 3-days, and 7-days cDNA libraries, respectively, shared significant similarity with known sequences in the protein database of GenBank but only present 10% of similarity among them. Analysis of the libraries in terms of functional processes, protein domains, and most common sequences suggests that a differential expression profile is taking place during the regeneration process. Further examination of the expressed sequence tag dataset revealed that 12 putative genes are differentially expressed at significant level (R > 6). Experimental validation by RT-PCR analysis reveals that at least three genes (unknown C-4677-1, melanotransferrin, and centaurin) present a differential expression during regeneration. These findings strongly suggest that the gene expression profile varies among regeneration stages and provide evidence for the existence of differential gene expression. PMID:17579180
Yu, Bing; Ni, Ming; Li, Wen-Han; Lei, Ping; Xing, Wei; Xiao, Dai-Wen; Huang, Yu; Tang, Zhen-Jie; Zhu, Hui-Fen; Shen, Guan-Xin
2005-07-14
To identify the scFv antibody fragments specific for hepatocellular carcinoma by biopanning from a large human naive scFv phage display library. A large human naive scFv phage library was used to search for the specific targets by biopanning with the hepatocellular carcinoma cell line HepG2 for the positive-selecting and the normal liver cell line L02 for the counter-selecting. After three rounds of biopanning, individual scFv phages binding selectively to HepG2 cells were picked out. PCR was carried out for identification of the clones containing scFv gene sequence. The specific scFv phages were selected by ELISA and flow cytometry. DNA sequences of positive clones were analyzed by using Applied Biosystem Automated DNA sequencers 3 730. The expression proteins of the specific scFv antibody fragments in E.coli HB2151 were purified by the affinity chromatography and detected by SDS-PAGE, Western blot and ELISA. The biological effect of the soluble antibody fragments on the HepG2 cells was investigated by observing the cell proliferation. Two different positive clones were obtained and the functional variable sequences were identified. Their DNA sequences of the scFv antibody fragments were submitted to GenBank (accession nos: AY686498 and AY686499). The soluble scFv antibody fragments were successfully expressed in E.coli HB2151. The relative molecular mass of the expression products was about 36 ku, according to its predicted M(r) value. The two soluble scFv antibody fragments also had specific binding activity and obvious growth inhibition properties to HepG2 cells. The phage library biopanning permits identification of specific antibody fragments for hepatocellular carcinoma and affords experiment evidence for its immunotherapy study.
Ndungu, John Maina; Suponitsky-Kroyter, Irena; Cavett, Valerie J.; McEnaney, Patrick J.; MacConnell, Andrew B.; Doran, Todd. M.; Ronacher, Katharina; Stanley, Kim; Utset, Ofelia; Walzl, Gerhard; Paegel, Brian M.; Kodadek, Thomas
2017-01-01
The circulating antibody repertoire encodes a patient's health status and pathogen exposure history, but identifying antibodies with diagnostic potential usually requires knowledge of the antigen(s). We previously circumvented this problem by screening libraries of bead-displayed small molecules against case and control serum samples to discover “epitope surrogates” (ligands of IgGs enriched in the case sample). Here, we describe an improved version of this technology that employs DNA-encoded libraries and high-throughput FACS-based screening to discover epitope surrogates that differentiate noninfectious/latent (LTB) patients from infectious/active TB (ATB) patients, which is imperative for proper treatment selection and antibiotic stewardship. Normal control/LTB (10 patients each, NCL) and ATB (10 patients) serum pools were screened against a library (5 × 106 beads, 448k unique compounds) using fluorescent anti-human IgG to label hit compound beads for FACS. Deep sequencing decoded all hit structures and each hit's occurrence frequencies. ATB hits were pruned of NCL hits and prioritized for resynthesis based on occurrence and homology. Several structurally homologous families were identified and 16/21 resynthesized representative hits validated as selective ligands of ATB serum IgGs (p < 0.005). The native secreted TB protein Ag85B (though not the E. coli recombinant form) competed with one of the validated ligands for binding to antibodies, suggesting that it mimics a native Ag85B epitope. The use of DNA-encoded libraries and FACS-based screening in epitope surrogate discovery reveals thousands of potential hit structures. Distilling this list down to several consensus chemical structures yielded a diagnostic panel for ATB composed of thermally stable and economically produced small molecule ligands in place of protein antigens. PMID:27957856
NASA Astrophysics Data System (ADS)
Liu, Hongzhan; Zheng, Fengrong; Sun, Xiuqin; Cai, Yimei
2012-06-01
The aquaculture of sea cucumber Apostichopus japonicus (Echinodermata, Holothuroidea) has grown rapidly during recent years and has become an important sector of the marine industry in Northern China. However, with the rapid growth of the industry and the use of non-standard culture techniques, epidemic diseases of A. japonicus now pose increasing problems to the industry. To screen the genes with stress response to bacterial infection in sea cucumber at a genome wide level, we constructed a cDNA library from A. japonicus Selenka (Aspidochirotida: Stichopodidae) after infecting them with Vibrio sp. for 48 h. Total RNA was extracted from the intestine, mesentery and coelomocyte of infected sea cucumber using Trizol and mRNA was isolated by Oligotex mRNA Kits. The ligated cDNAs were transformed into DH5α, and a library of 3.24×105 clones (3.24×105 cfu mL-1) was obtained with the sizes of inserted fragments ranging from 0.8 to 2.5 kb. Sequencing the cDNA clones resulted in a total of 1106 ESTs that passed the quality control. BlastX and BlastN searches have identified 168 (31.5%) ESTs sharing significant homology with known sequences in NCBI protein or nucleotide databases. Among a panel of 25 putative immunity-related genes, serum lectin isoform, complement component 3, complement component 3-like genes were further studied by real-time PCR and they all increased more than 5 fold in response to Vibrio sp. challenge. Our library provides a valuable molecular tool for future study of invertebrate immunity against bacterial infection and our gene expression data indicates the importance of the immune system in the evolution and development of sea cucumber.